登录查看更多内容

Google is reportedly developing a ‘computer-using agent’ AI system

Rajat Kapoor

MCA-AIML | Chandigarh University | Data Science |?Python ?C++ ?VBA|SQL|?ML?DL?AI |?Open CV ?NLP ?Transformer ?MLOOP |?Android Development ?Firebase| AWS |?Flask ?Django ?Docker | ?Figma ?Canvas Designs ?Adobe | Badminton

发布日期: 2024年10月28日

Google’s Project Jarvis: Automating Web-Based Tasks with AI Innovation

In the coming months, Google may introduce “Project Jarvis,” an AI assistant designed to streamline online tasks directly within Chrome. This project, reportedly powered by the next generation of Google’s Gemini model, aims to make repetitive digital actions—like gathering research, shopping, or booking flights—faster and more intuitive. Jarvis captures and interprets screen details, performing clicks, and text entries to handle steps users would normally complete manually.

The AI ecosystem is racing forward with similar innovations. Microsoft’s Copilot Vision, Apple’s Intelligence, and Anthropic’s Claude all explore ways to enhance productivity and digital interactions. With Jarvis’s December debut on the horizon, Google is planning a limited launch for testing, ensuring a refined user experience.

The Jarvis project highlights Google’s commitment to making AI an integral, everyday tool, capable of automating complex online tasks while seamlessly integrating with Chrome’s browsing experience. For professionals, this tech could redefine productivity, particularly for routine and research-heavy roles. As we move closer to launch, the impact of Jarvis—and other AI automation solutions—will undoubtedly change how we interact with digital environments.

领英推荐

Microsoft's Copilot Upgrade, OpenAI's DevDay…

Dr. Joerg Storm 5 个月前

This week's AI industry updates: March 12, 2025

SymphonyAI 1 周前

?? Microsoft launching AI Agents ?? Glean ?? US Dept…

Steven Wolfe Pereira ?? 5 个月前

Google could preview its own take on Rabbit’s large action model concept as soon as December, reports The Information. “Project Jarvis,” as it’s reportedly codenamed, would carry tasks out for users, including “gathering research, purchasing a product, or booking a flight,” according to three people the outlet spoke with who have direct knowledge of the project.

Powered by a future version of Google’s Gemini, Jarvis reportedly only works with a web browser (it’s tuned specifically for Chrome). The tool is aimed at helping people “automate everyday, web-based tasks” by taking and interpreting screenshots and then clicking buttons or entering text, The Information writes. In its current state, it apparently takes “a few seconds” between actions.

The biggest AI companies are all working on models that do things like what The Information is describing. Microsoft’s Copilot Vision will let you talk with it about webpages you’re viewing. Apple Intelligence is expected to be aware of what’s on your screen and do things for you across multiple apps at some point in the next year. Anthropic debuted a “cumbersome and error-prone” Claude beta update that can use a computer for you, and OpenAI is reportedly working on a version of that, too.

The Information cautions that Google’s plan to show Jarvis off in December is subject to change. The company is reportedly considering releasing it to some small number of testers to find and help the company work out bugs.

要查看或添加评论，请登录

Rajat Kapoor的更多文章

Transformer Model Architecture: Encoder-Decoder Structure with Attention Mechanisms

2024年11月12日

Transformer Model Architecture: Encoder-Decoder Structure with Attention Mechanisms

This image depicts a visual representation of a Transformer model architecture, commonly used in natural language…
Heart Disease Prediction Using Machine Learning

2024年11月6日

Heart Disease Prediction Using Machine Learning

Heart Disease Prediction Using Machine Learning: A Mini AI Project Introduction: In today’s fast-paced world…
An AI collar That make a dog talk !

2024年10月23日

An AI collar That make a dog talk !

All of us talk to our pets, but what if our pets could talk back? That’s the premise of Personifi AI’s Shazam Band, a…
GPT-4 vs. GPT-3.5: how much difference is there ?

2024年10月21日

GPT-4 vs. GPT-3.5: how much difference is there ?

follow: https://medium.com/@rajat01kapoor Rajatkapoor The ChatGPT chatbot is an innovative AI tool developed by OpenAI.
Apple’s internal tests show Siri isn’t quite ready to beat ChatGPT

2024年10月21日

Apple’s internal tests show Siri isn’t quite ready to beat ChatGPT

With the introduction of the new iPad Mini, Apple made it clear that a software experience brimming with AI is the way…
Marketing firm finally admits that smartphones overhear your conversations

2024年9月9日

Marketing firm finally admits that smartphones overhear your conversations

Have you ever felt like your smartphone was listening to your conversations? Doesn’t it feel like they show us the same…

See all articles

Google is reportedly developing a ‘computer-using agent’ AI system

Rajat Kapoor

MCA-AIML | Chandigarh University | Data Science |?Python ?C++ ?VBA|SQL|?ML?DL?AI |?Open CV ?NLP ?Transformer ?MLOOP |?Android Development ?Firebase| AWS |?Flask ?Django ?Docker | ?Figma ?Canvas Designs ?Adobe | Badminton

领英推荐

Rajat Kapoor的更多文章

社区洞察

其他会员也浏览了

How to Help Ensure Quality and Reliability with GenAI Evaluation

AI Hype: Bad Data Is Bad Data.

OpenAI’s O3 Breakthrough, Google’s AI Mode & More: Today’s Top AI Headlines

Morgan Stanley’s GenAI Assistant ?? Exceeding Human Intelligence with AI ?? Midjourney’s Image Editor ??

Edition 37 – How to Build Smarter AI Agents

Gemini's Latest Leap: How Google's New Updates Are Set to Dominate the AI Chatbot Arena

OpenAI's Operator: A New Step Towards the Concept of RaaS

?? Welcome to AI Insights Unleashed! ?? - Vol. 41

????#13: Action! How AI Agents Execute Tasks with UI and API Tools

Artificial Intelligence (AI) and Data Analytics- My Weekend Reflections – Nov 10, 2024!

领英推荐

Rajat Kapoor的更多文章

Transformer Model Architecture: Encoder-Decoder Structure with Attention Mechanisms

Heart Disease Prediction Using Machine Learning

An AI collar That make a dog talk !

GPT-4 vs. GPT-3.5: how much difference is there ?

Apple’s internal tests show Siri isn’t quite ready to beat ChatGPT

Marketing firm finally admits that smartphones overhear your conversations

社区洞察

其他会员也浏览了

How to Help Ensure Quality and Reliability with GenAI Evaluation

AI Hype: Bad Data Is Bad Data.

OpenAI’s O3 Breakthrough, Google’s AI Mode & More: Today’s Top AI Headlines

Morgan Stanley’s GenAI Assistant ?? Exceeding Human Intelligence with AI ?? Midjourney’s Image Editor ??

Edition 37 – How to Build Smarter AI Agents

Gemini's Latest Leap: How Google's New Updates Are Set to Dominate the AI Chatbot Arena

OpenAI's Operator: A New Step Towards the Concept of RaaS

?? Welcome to AI Insights Unleashed! ?? - Vol. 41

????#13: Action! How AI Agents Execute Tasks with UI and API Tools

Artificial Intelligence (AI) and Data Analytics- My Weekend Reflections – Nov 10, 2024!