#E1I68: Breathing Binary
Chakra Champions, welcome to our AI Ashram on this Yoga Day! Today, we're bending our minds around the latest innovations. In the LLM race, Anthropic has launched Claude 3.5 Sonnet, following the industry's shift toward smaller, faster models. Moving into the next āsana, we introduce LLARVA, a cutting-edge AI system that uses vision-action instruction tuning to teach robots new skills across varied environments, outperforming existing methods with just 2D image inputs.
LLARVA: Robo-School Revolution
Ever wished you could teach a robot new skills as easily as explaining a recipe to a friend? That's the goal of LLARVA, a clever new AI system cooked up by researchers at UC Berkeley. It's like a universal translator for robot instructions, allowing it to understand tasks across all sorts of mechanical helpers and environments. At its heart, LLARVA pairs a beefy 7-billion-parameter language model with a sharp-eyed vision system that makes sense of what the robot sees.
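For the architecture-curious, here's a minimal sketch of what such a pairing could look like: a vision encoder whose patch features are projected into the token space of a large language model, which then decodes the output as text. All class names, dimensions, and the HuggingFace-style `inputs_embeds` interface are our own illustrative assumptions, not the authors' actual code.

```python
# Hypothetical sketch of a LLARVA-style vision-language-action model.
# Names and dimensions are assumptions for illustration only.
import torch
import torch.nn as nn

class VisionActionModel(nn.Module):
    def __init__(self, vision_encoder: nn.Module, language_model: nn.Module,
                 vision_dim: int = 1024, lm_dim: int = 4096):
        super().__init__()
        self.vision_encoder = vision_encoder            # e.g. a ViT backbone
        self.projector = nn.Linear(vision_dim, lm_dim)  # map image features into LM token space
        self.language_model = language_model            # a 7B-class causal LM

    def forward(self, image: torch.Tensor, instruction_embeds: torch.Tensor):
        patches = self.vision_encoder(image)        # (B, N, vision_dim) patch features
        visual_tokens = self.projector(patches)     # (B, N, lm_dim)
        # Prepend the visual tokens to the embedded instruction and let the
        # language model decode the next action (and visual trace) as text.
        inputs = torch.cat([visual_tokens, instruction_embeds], dim=1)
        return self.language_model(inputs_embeds=inputs)
```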
Action Anticipation: Here's the secret sauce: LLARVA learns from a massive buffet of 8.5 million image and movement pairs, showing all kinds of robots doing all sorts of tasks. It uses a special recipe of instructions that includes the robot type, how it's controlled, what it needs to do, and what it can feel (like joint positions); see the sketch below. The clever bit is that LLARVA doesn't just predict what the robot should do next; it also imagines a visual "trace" of where the robot's arm or tool will move, like a GPS route for robot parts. This helps it plan and tackle tricky, multi-step tasks.
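To make that "recipe of instructions" concrete, here's a toy sketch of how such a structured prompt could be assembled from the four ingredients above. The exact template wording and field names are our assumptions; the paper defines its own format.

```python
# Hypothetical builder for a LLARVA-style structured instruction prompt.
# The template and field names are illustrative assumptions.
def build_instruction(robot: str, control_mode: str, task: str,
                      joint_state: list[float]) -> str:
    joints = ", ".join(f"{q:.3f}" for q in joint_state)
    return (
        f"Robot: {robot}. Control: {control_mode}. Task: {task}. "
        f"Joint state: [{joints}]. "
        "Predict the next action and the 2D visual trace of the end effector."
    )

# Example: a tabletop manipulation prompt for a (hypothetical) 7-joint arm
print(build_instruction(
    "Franka Panda", "joint position", "stack the red block on the blue block",
    [0.12, -0.45, 0.33, -1.57, 0.0, 1.2, 0.8],
))
```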
Performance Powerhouse: So how well does it work? Pretty darn well, actually. In a virtual robot Olympics spanning 12 different tasks, LLARVA left other 2D-based systems in the dust, scoring an average success rate of 43.3% compared to their measly 1.3%. It even gave fancier 3D systems a run for their money. But the real test came when the team unleashed LLARVA on a physical robot arm, where it outperformed top-notch AI systems at tasks like picking up, stacking, and unstacking blocks. The kicker? LLARVA does all this with just 2D images, no fancy 3D data required. While fully adaptable home robots are still a way off, LLARVA is a big leap in the right direction, paving the way for mechanical helpers that can easily pick up new skills and adapt to our needs.
Researchers: Dantong Niu, Yuvan Sharma, Giscard Biamby, Jerome Quenum, Yutong Bai, Baifeng Shi, Trevor Darrell, and Roei Herzig
Research Paper | Project
True or False: LLARVA's pre-training dataset consists of 8.5K image-visual trace pairs. Let me know in the comments.
Time to conclude our tech meditation, Chakra Champions! We hope today's insights have aligned your mind with the latest and highest in AI. Have a wonderful weekend, and we'll reconnect on Monday!