Productivity boost & MultiModal LLM interaction: txt + image with Gemini Pro Vision
Gemini

Productivity boost & MultiModal LLM interaction: txt + image with Gemini Pro Vision

Today I'm working on a prompt to get insights from a financial data table.

I'm trying out Gemini Pro LLM on Google AI Studio: https://ai.google.dev/

(Click the Get an API key to start)

Once in Google AI Studio, I work tirelessly on getting the best prompt for my LLM!

Prompt:

You are a financial analyst and you are required to summarize the key insights of given numerical tables.

CONSOLIDATED STATEMENTS OF INCOME (In millions, except per share amounts)

|Year Ended December 31 | 2020 | 2021 | 2022 |

|--- | --- | --- | --- |

|Revenues | $ 182,527| $ 257,637| $ 282,836|

|Costs and expenses:|

|Cost of revenues | 84,732 | 110,939 | 126,203|

|Research and development | 27,573 | 31,562 | 39,500|

|Sales and marketing | 17,946 | 22,912 | 26,567|

|General and administrative | 11,052 | 13,510 | 15,724|

|Total costs and expenses | 141,303| 178,923| 207,994|

|Income from operations | 41,224 | 78,714 | 74,842|

|Other income (expense), net | 6,858 | 12,020 | (3,514)|

|Income before income taxes | 48,082 | 90,734 | 71,328|

|Provision for income taxes | 7,813 | 14,701 | 11,356|

|Net income | $40,269| $76,033 | $59,972|

|Basic net income per share of Class A, Class B, and Class C stock | $2.96| $5.69| $4.59|

|Diluted net income per share of Class A, Class B, and Class C stock| $2.93| $5.61| $4.56|

Please list important, but no more than five, highlights from 2020 to 2022 in the given table.

Please write in a professional and business-neutral tone.

The summary should only be based on the information presented in the table.        
 Gemini Pro Answer:

 1. The company saw a steady increase in revenues from 2020 to 2022, with revenues reaching $282,836 million in 2022, marking a 55% increase from $182,527 million in 2020.

2. The total costs and expenses experienced a noticeable 47% surge from $141,303 million in 2020 to $207,994 million in 2022, primarily driven by increases in the cost of revenues, research and development, sales and marketing, and general and administrative expenses.

3. Consequently, the net income experienced a steep decline of 21% from $76,033 million in 2021 to $59,972 million in 2022.

4. Both basic and diluted net income per share witnessed a similar pattern, reaching $5.69 and $5.61 in 2021, respectively, before dropping to $4.59 and $4.56 in 2022.

5. In 2022, the company faced a significant downturn, reporting a net loss of $3,514 million in other income, compared to gains of $6,858 million and $12,020 million in 2020 and 2021, respectively.        

I'm really satisfied with the results and want to share this with my coworkers

Here is where the Multimodality of LLMs could help. Let's say I'm new to using Google AI Studio, and I want to share my prompt but I don't see the option directly.

I take a screenshot of my current UI and ask the Multimodal LLM for help

Here is the screenshot:

Google AI Studio - Gemini Pro - prompt to get Financial table summarization

I send this screenshot along with my prompt question for the Multimodal LLM Gemini Pro Vision

Prompt :

[Screenshot] + How can I share this result with my coworker?        
Google AI Studio - Gemini Pro Vision - Get help on how to share a prompt sending the screenshot along the question to the Multimodal LLMs

Answer:

Click the Share button at the top of the Page and enter the email addresses of your coworkers        

The LLMs understand the Screenshot I sent and the question so is able to guide me through the next steps


Here is a simplistic use case to help illustrate how MultiModal interaction could be used to help someone get a productivity boost.


Thanks for reading and feel free to share uses cases that you find useful in comments

Megha Sharma

Founder @GrowthFueler | Inbound & Outbound Growth | AI-Powered LinkedIn Personal Branding & Cold Outreach | Building Sales Systems That Sell for You

11 个月

Really insightful take on team dynamics at Elecbits! Focusing on building stronger bonds across all levels is key for growth – love how you guys are tackling this challenge head-on.

回复
Mathieu Trachino

Legal AI - Building Neomagus

1 年

I couldn’t get a Gemini API key. Not available in EU I believe. You might need a VPN

回复
Johan Verroust

Directeur Conseil chez Dékuple B2B (ex AWE)

1 年

Merci pour ce partage Frederic ! ??

Thulfakar Hammodi

Client integration Engineer chez Conversant LLC

1 年

Frederic Molina brilliant demo !

回复
Louis Janin

Expert technique chez Eulerian Technologies en solutions marketing numérique

1 年

Nice demo!!

回复

要查看或添加评论,请登录

Frederic Molina的更多文章

社区洞察