Getting Started with Gemini 1.5 Pro and Google AI Studio:
chamindu lakshan
Out of the box thinker/YouTubepreneuer/programmer/Wordpress and Wix Designer
What is Google AI Studio?
Google AI Studio is a web-based environment where developers can write, run, and test prompts using Google’s Gemini models. Additionally, if you want to use the Gemini API, you can get your API key from inside Google AI Studio. Broadly, it is designed to be a simple entry point for developers to not just use models but also get started building with the Gemini API. If you don’t want to use the Gemini API, you can skip the API key step all together and just test the models.
The Google AI Studio Basics
If you’re familiar with OpenAI’s Playground, then much of what is about to be discussed will be familiar. Let’s walk through the basic UI as is shown below:
Regardless of what mode you select, the “Run Settings” will be the same.
Google AI Studio’s different Modes
Currently, Google AI Studio is offering three distinct modes when creating with the Gemini API. These options can be selected by clicking “Create new” in the top left corner, as seen below.
领英推荐
Each of these modes is meant to address a specific use case.
How to use the multimodal features available in Google AI Studio
One of the most unique features of the Google AI Studio is that various file types can be used in the environment. These include images, videos, audio, and files from Google Drive. This means that developers can easily test out if their idea works, and how to work through any bugs. For example, if we use the Chat Prompt from above and add a video we want to be summarized, the model will access the inserted video and accomplish its task. Let’s take a look.
In this case, we used a five-minute video that showcased various dinosaur fossils. When prompted, the model can interact with the video and produce a summary of its contents. These types of use cases can’t be done in other AI Playgrounds (OpenAI for example).
When to use Google AI Studio vs Gemini
While the Google AI Studio is a powerful tool, it’s important to understand when you should use it vs Google’s Gemini chatbot. The Gemini chatbot is Google’s equivalent to OpenAI’s ChatGPT. Users can expect to engage with the model through conversation and with limited control of the reasoning and response. Alternatively, if users intend to make changes to the way the Gemini model responds or need an API key, Google AI Studio is the tool to achieve this. After testing and creating a project in the Studio, users can export their work directly to code by clicking “Get Code” in the top right corner. Once outside the Studio and connected to the Gemini API, users can connect to other APIs like the Keymate.AI API. By doing so, they can utilize features like Keymate Memory and Keymate’s Confidence Scoring to help identify hallucinations. Overall, it was impressive to see what the Gemini models were capable of, especially around multi-modal use cases.
Thank you ????????