GPT-4 Built this new multimodal model!!

Good morning, fellow AI enthusiasts! This week's iteration focuses on a very popular GitHub repository and research paper: LLaVA, an end-to-end large multimodal model that connects a vision encoder and an LLM for general-purpose visual and language understanding.
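For the more technical readers, here is a minimal sketch (in PyTorch) of what "connecting a vision encoder and an LLM" can look like in practice. The class name, dimensions, and interfaces below are illustrative assumptions for this newsletter, not the official LLaVA code; the real model uses a pre-trained CLIP vision encoder, a learned projection, and a LLaMA-based language model.

```python
# Illustrative sketch of a LLaVA-style architecture (not the official code):
# a frozen vision encoder, a learned projection into the LLM's embedding
# space, and a language model that attends to image and text tokens together.
import torch
import torch.nn as nn

class MultimodalSketch(nn.Module):
    def __init__(self, vision_encoder, llm, vision_dim=1024, llm_dim=4096):
        super().__init__()
        self.vision_encoder = vision_encoder              # e.g. a CLIP ViT, kept frozen
        self.projector = nn.Linear(vision_dim, llm_dim)   # vision-to-language bridge (trained)
        self.llm = llm                                    # e.g. a LLaMA/Vicuna-style decoder

    def forward(self, pixel_values, text_embeddings):
        # 1) Turn the image into a sequence of patch features.
        with torch.no_grad():
            image_features = self.vision_encoder(pixel_values)  # (batch, patches, vision_dim)
        # 2) Project those features into the LLM's token-embedding space.
        image_tokens = self.projector(image_features)           # (batch, patches, llm_dim)
        # 3) Prepend the visual tokens to the text tokens; the LLM then answers
        #    language instructions while "seeing" the image.
        inputs = torch.cat([image_tokens, text_embeddings], dim=1)
        return self.llm(inputs_embeds=inputs)
```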


Receive the weekly digest right in your inbox!

GPT-4 is powerful, but did you know that some AI models are built entirely thanks to it? Yes, GPT-4 is so good that it can be used to generate data of high enough quality to train other AI models. And not just any models, but models that can do things GPT-4 itself can't, like understanding images!

Liu et al. just used GPT-4 to create a general-purpose language-vision model called LLaVA, the first general-purpose model that understands and follows both visual and language-based instructions. They didn't use GPT-4 as the base model; they used it to train their model! As we will see in the video, GPT-4 was used to generate a large, high-quality dataset for training a new model that understands images. And it doesn't just understand images, it understands text too (there's the multimodality), which means it can answer a wide variety of questions about them! Learn more in the full article or in the video...
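To make that data-generation step concrete, here is a hedged sketch of the idea. Because GPT-4 only saw text at the time, the LLaVA authors fed it the textual descriptions of an image (captions and object locations) and asked it to write instruction-following conversations about that image. The helper function and prompt wording below are my own illustration using the OpenAI Python client, not the paper's exact prompts.

```python
# Hedged sketch of GPT-4-based visual instruction data generation.
# GPT-4 never sees pixels here; it only sees captions and object boxes,
# and writes a Q&A conversation as if it could see the image.
from openai import OpenAI

client = OpenAI()  # assumes OPENAI_API_KEY is set in the environment

def generate_visual_instruction(captions: list[str], boxes: list[str]) -> str:
    context = "Captions:\n" + "\n".join(captions) + "\nObjects:\n" + "\n".join(boxes)
    response = client.chat.completions.create(
        model="gpt-4",
        messages=[
            {"role": "system",
             "content": "You can 'see' an image only through the captions and "
                        "object locations provided. Write a multi-turn Q&A "
                        "conversation about the image, as if you could see it."},
            {"role": "user", "content": context},
        ],
    )
    return response.choices[0].message.content
```

Each generated conversation is then paired with the corresponding real image, giving one instruction-following training example of the kind LLaVA is fine-tuned on.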

We are incredibly grateful that the newsletter is now read by over 12,000 incredible human beings, counting our email list and LinkedIn subscribers. Reach out to [email protected] with any questions or for details on sponsorships, or visit my Passionfroot profile. Follow our newsletter at Towards AI, where we share the most exciting news, learning resources, articles, and memes from our Discord community every week.

If you need more content to go through your week, check out the podcast!

Thank you for reading, and we wish you a fantastic week! Be sure to have enough rest and sleep!

Louis


Elliott A.

Senior System Reliability Engineer / Platform Engineer

1 yr

Yup, this is the way. Big time.

Janvi V.

Data Scientist/Data Analyst/Machine Learning Engineer/ Data Storyteller/SQL/Tech./"Unleashing the Power of Data: Data Scientist and AI Alchemist, Aiming to Revolutionize the Future of Analytics"

1 yr

Sounds promising.

Tyler Suard

Senior AI Researcher & Developer at Parker-Hannifin. Ex-Apple, Ex-Meta. Contributor to Autogen, Tensorflow, PyTorch, Huggingface Transformers. Stanford affiliate. Interested in longevity, AI +Bio.

1 yr

Excellent explanation, thank you

