登录查看更多内容

How Google’s Robots Can Learn from the Web Using AI ??

Junaid Awan

Digital Marketer

发布日期: 2023年7月31日

Google is one of the leading companies in the field of artificial intelligence (AI), developing cutting-edge technologies that can perform various tasks, from understanding natural language to playing complex games. But can AI also help robots learn from the web?

That is the question that Google researchers are trying to answer with their new approach to using large language models (LLMs), which are AI systems that can generate natural language based on massive amounts of text data.?The researchers have shown how LLMs can enable robots to write and execute their own code in Python, one of the most popular programming languages, based on instructions from humans.

The new approach builds on Google’s previous work on PaLM-SayCan, a model that allows robots to understand open-ended prompts from humans and respond reasonably and safely in a physical space. For example, if a human asks a robot to “pick up the red ball and put it in the blue box”, the robot can use PaLM-SayCan to parse the request, plan the actions, and execute them.

However, PaLM-SayCan has some limitations. It can only handle simple commands that involve predefined actions and objects. It cannot deal with complex scenarios that require logic, reasoning, or creativity. It also cannot learn from its own experience or improve its performance over time.

To overcome these challenges, the researchers have integrated PaLM-SayCan with another LLM called Codex, which was developed by OpenAI and can generate Python code based on natural language queries. By combining these two models, the researchers have created a system that can translate human instructions into Python code, and then execute the code using a robot.

The system works as follows: First, the human provides a high-level description of what they want the robot to do, such as “sort the balls by colour”. Then, the system uses Codex to generate a Python script that implements the task. Next, the system uses PaLM-SayCan to execute the Python script using a robot arm. The robot arm can interact with the environment and manipulate objects such as balls and boxes. The system also monitors the robot’s actions and provides feedback and corrections if needed.

领英推荐

Importance of Frameworks in AI

Analytics Insight? 2 个月前

Importance of Frameworks in AI

Analytics Insight? 3 个月前

AI Prompt Mastery: Learn Science-backed Techniques for…

TEAM International 3 个月前

The researchers have tested their system on various tasks, such as sorting objects by shape or size, stacking blocks in a specific order, or drawing shapes on a paper. They have found that their system can generate accurate and efficient code for most of these tasks, and that the robot can execute them successfully.

The researchers claim that their system is the first of its kind to enable robots to learn from the web using LLMs. They believe that this approach can open up new possibilities for robotic applications, such as education, entertainment, or assistance. They also hope that their system can inspire more research on how LLMs can be used for other domains and tasks.

One of the key features of their system is that it can “see” the world around it, “understand” the task, and instruct the robot what to do.?This is possible because their system uses a vision-language-action (VLA) model called RT-2, which is based on Transformers4. Transformers are neural network architectures that can process different types of data, such as text, images, or audio. RT-2 is trained on text and images from the web, which allows it to learn general concepts and skills that can be applied to different situations.?For example, RT-2 can recognize trash and know how to dispose of it, even if it has never seen those objects before5.

RT-2 is also able to communicate with humans using natural language. It can understand queries and commands from humans, and generate responses or actions accordingly. It can also ask questions or provide feedback to humans if needed. For example, if a human asks RT-2 to “pick up the extinct animal”, RT-2 can locate and pick out a dinosaur figurine from a table. If RT-2 is unsure about something, it can ask for clarification or confirmation from humans.

RT-2 is not only a powerful tool for robot learning, but also a potential companion for humans. It can perform useful tasks for humans, such as cleaning or organizing. It can also entertain humans with games or jokes. It can even learn from humans and improve its skills over time.

#AI #Robotics #Coding #Google #LLM #Codex #PaLM-SayCan #RT-2 #VLA #Transformers

How Google’s Robots Can Learn from the Web Using AI ??

Junaid Awan

Digital Marketer

领英推荐

更多精彩文章

社区洞察

其他会员也浏览了

Artificial Intelligence #185

Artificial Intelligence #185

OpenAI Hype Cycle

Recognize, Detect, Segment, and Moderate Your Images with a Single API! ??

Introducing Claude 3.5 Sonnet: Anthropic's Fastest and Smartest Model that Outperforms Claude 3 Opus. ??

Geometric Learning in Python: Basics

Introducing CodeLlama 70B: A 70 billion-parameter model achieving SOTA performance in code generation.

How to Use ChatGPT API in Python?

Modular GANs with Neural Blocks in Python

Langchain

领英推荐

How TikTok Failed to Detect a Deepfake Ad of MrBeast and What It Means for the Future of Online Trust

2023年10月4日

How to Create Stunning Visuals with Canva and ChatGPT

2023年9月27日

???? How to make your podcast global and interactive with AI ?????

2023年9月26日

WhatsApp Adverts: What You Need to Know About the Possible Changes in the App

2023年9月15日

How Twitter’s New Terms of Service Affect User Privacy and AI Ethics

2023年9月5日

How AI Chatbots Can Help Businesses Protect Their Data from Cyberattacks ???

2023年8月21日

How McKinsey’s Chatbot Leverages Its Knowledge Base to Provide Insights and Advice

2023年8月17日

How Hackers Exposed the Flaws and Biases of AI Chatbots ??

2023年8月14日

Zoom Under Fire for Collecting and Sharing User Data! ??

2023年8月8日

How Barbie’s Marketing Campaign Became a Cultural Phenomenon

2023年7月28日

社区洞察

其他会员也浏览了

Artificial Intelligence #185

Artificial Intelligence #185

OpenAI Hype Cycle

Recognize, Detect, Segment, and Moderate Your Images with a Single API! ??

Introducing Claude 3.5 Sonnet: Anthropic's Fastest and Smartest Model that Outperforms Claude 3 Opus. ??

Geometric Learning in Python: Basics

Introducing CodeLlama 70B: A 70 billion-parameter model achieving SOTA performance in code generation.

How to Use ChatGPT API in Python?

Modular GANs with Neural Blocks in Python

Langchain