Google I/O 2024: Here’s everything Google just announced

Google I/O 2024: Here’s everything Google just announced

Thank you for reading the article. Here at Linkedin, I regularly write about latest topics on Artificial Intelligence, democratizing #AI knowledge that is relevant to you.

It’s that time of the year we’ve been waiting for: Google’s I/O Keynote Day! As it does every year, #Google kicks-off its developer conference with a rapid-fire stream of announcements, unveiling things it’s been working on for the past year.

Since you might not have the time to see the full 2-hour presentation, here is an easy-to-digest list of all things useful to you! Let’s dive right in…

Generative AI for Learning

Google unveiled LearnLM, a new family of #generativeAI models “fine-tuned” for learning. LearnLM models are designed to “conversationally” tutor students on a range of subjects.

Source: Google Blog

It is also working with educators to see how #LearnLM might simplify and improve the process of lesson planning. LearnLM could help teachers discover new ideas, content and activities, or find materials tailored to the needs of specific student cohorts. ???

Quiz Master

Google launched AI-generated quizzes into #YouTube, a new #Conversational-AI tool that allows users to figuratively “raise their” hand when watching longer educational videos, such as lectures and seminars. Viewers can ask clarifying questions, get helpful explanations or take a quiz on the subject matter.

Source: Google Blog

New LLM: Gemma 2 updates

Google is adding a new 27Billion parameter model to #Gemma2. This size is optimized by Nvidia to run on next-generation #GPU and can run efficiently on a single TPU host and vertex AI.

AI-based Scam calls detection

Google previewed a feature that will alert users to potential scams during calls. The feature utilizes Gemini Nano, the smallest version of Google’s generative AI offering, which can be run entirely on-device.

Source: Techcrunch

Common scammer tactics like password requests and gift cards will also trigger the system.

Ask Photos: Use AI to search photos

Google Photos is getting an AI infusion with the launch Ask Photos, powered by Google’s #Gemini AI model. The new addition will allow users to search across their Google Photos collection using natural language queries that leverage an AI’s understanding of their photo’s content and other metadata.

Sourcs: Google Blog

While before users could search for specific people, places, or things in their photos, thanks to natural language processing, the AI upgrade will make finding the right content more intuitive and less of a manual search process.

Google has decided to integrate AI into the common Google applications that we use on daily basis.

AI in your Gmail

Gmail users will be able to search, summarize, and draft their emails using its Gemini AI technology. It will also be able to take action on emails for more complex tasks, like helping you process an e-commerce return by searching your inbox, finding the receipt and filling out an online form.

Gemini 1.5Pro: 2Million context window

Google previewed a new version of #Gemini1.5Pro, its current flagship model, with a 2Million context window which is largest in any commercially available model

Source: Google Blog

This means you can now analyse longer documents, codebases, videos and audio recordings than before.

Gemini Live Feature

Google previewed a new experience in Gemini called Gemini Live, which lets users have “in-depth” voice chats with Gemini on their smartphones. Users can interrupt Gemini while the chatbot’s speaking to ask clarifying questions, and it’ll adapt to their speech patterns in real time. And Gemini can see and respond to users’ surroundings, either via photos or video captured by their smartphones’ cameras.

Gemini Nano: Small Language Model

Google is also building #GeminiNano, a Small Language Model, directly into the Chrome desktop client, starting with Chrome 126. This, the company says, will enable developers to use the on-device model to power their own AI features.

Gemini on Android

Google is taking advantage of its ability to deeply integrate with Android’s mobile OS and Google’s apps. It is replacing #GoogleAssistant with AI-based Google Gemini on #Android.

Users will be able to drag and drop AI-generated images directly into their Gmail, Google Messages and other apps. Meanwhile, YouTube users will be able to tap “Ask this video” to find specific information from within that YouTube video.

Google Maps get an AI upgrade

Gemini model capabilities are coming to the Google Maps platform for developers, starting with the Places API. Developers can show generative AI summaries of places and areas in their own apps and websites.

Source: Google Blog

The summaries are created based on Gemini’s analysis of insights from Google Maps’ community of more than 300 million contributors. What’s better? Developers will no longer have to write their own custom descriptions of places

New AI chip

Google unveiled its 6th generation Tensor Processing Units (TPU) AI chips, dubbed Trillium. These new TPUs will feature a 4.7x performance boost in compute performance per chip when compared to the fifth generation.

AI in search

Google is adding more AI to its search, assuaging doubts that the company is losing market share to competitors like ChatGPT and Perplexity.

Google plans to use generative AI to organize the entire search results page for some search results. That’s in addition to the existing AI Overview feature, which creates a short snippet with aggregate information about a topic you were searching for.

Generative AI upgrades

Imagen 3: Google’s highest quality text-to-image model

Imagen 3?is our highest quality text-to-image model. It generates an incredible level of detail, producing photorealistic, lifelike images, with far fewer distracting visual artifacts than our prior models.

Source: Google Blog

Imagen 3 better understands natural language, the intent behind your prompt and incorporates small details from longer prompts. The model’s advanced understanding helps it master a range of styles.

Veo: text-to-video model

Google’s gunning for OpenAI’s Sora with Veo, an AI model that can create 1080p video clips around a minute long given a text prompt. Veo can capture different visual and cinematic styles, including shots of landscapes and time lapses, and make edits and adjustments to already-generated footage.

Pixel 8a and Pixel Slate

Google launched the latest addition to the #Pixel line by announcing the New Pixel 8a phone and a tablet, Pixel Slate.

Source: Techcrunch

Google is following a strategy of fully integrating AI into its mainstream applications such as #Gmail, #GoogleMaps, search etc. used by the massive number of users worldwide. With this integration, the way we use our phones and the workflow of general purpose apps will be changed completely. Bringing AI more closer to the users.

?? If you found this article insightful and informative, please like, comment, repost!

?? Which announcement bring the biggest change in your workflow? Comment below...

?? Stay ahead of the curve with the latest developments in AI by subscribing to my newsletter, “All Things AI.” Be the first to receive cutting-edge insights, news, and trends straight to your inbox!"









要查看或添加评论,请登录

Siddharth Asthana的更多文章

社区洞察

其他会员也浏览了