登录查看更多内容

ChatGPT-4o: The Next Evolution in Interactive AI with Advanced Vision and Listening Capabilities

Duncan Eadie

Leveraging Technology for Law Firm Success

发布日期: 2024年5月14日

Exciting News from OpenAI: Yesterday, I had the pleasure of watching OpenAI's CTO, Mira Murati, unveil the revolutionary ChatGPT-4o ('four-oh') model, boasting ground breaking AI interaction capabilities—it can now see, hear, and speak. Stay tuned as it rolls out to everyone in the upcoming weeks, completely free of charge, accessible via desktop and smartphone, complete with a sleek new user interface.

GPT-4o marks a significant leap forward in AI capabilities, seamlessly integrating text, vision, and audio interactions into its reasoning. The demo’s were very impressive, lightning fast, conveyed with a voice that Alan Turing couldn't tell from a human (albeit a Californian). It understands emotion, and facial expressions and can talk back translating 50 different languages “as spoken by 97% of the worlds internet population”. So, yes, making AI available to all (restated as OpenAI’s mission).

The demo's showcased GPT-4o's ability to recognise emotions, objects via your camera, interpret handwritten messages (including a badly drawn ‘I love ChatGPT'), as well as answering questions about what a graph was describing. From explaining complex code in lay terms to providing personalised assistance, GPT-4o was shown to make interacting with AI very simple. More than that, enjoyable. OpenAI is working with governments and organisations to ensure safety is a key element brought into its persona.

The effect of the new GPT-4o? It will swell the 100 million people who regularly interact with ChatGPT and you’ll probably see GPT-4o integrated into cars and other appliances given its wide abilities. Its human interaction voice is so compelling that even my aged mum would hold, and enjoy, great conversations with 'her'. You'll use it to practice for interviews, sing the kids a lullaby, watch your golf swing and offer improvements.

You’ll use it as your assistant providing an educated second opinion on many aspects of your life, in the way you want to interact – talk to it, draw a diagram, turn on your camera and let it ‘see’ the scenario you are in.

Hacking HR 5 个月前

ChatGPT+ Will Soon Be Able To See, Hear, Speak, And…

Artificial Inspiration 1 年前

ChatGPT's Multisensory Journey: How "Vision" is…

Alejandro De La Parra Solomon 1 年前

There are chargeable options which will be faster, and have 5x greater capacity. Its believed that GPT-4o will revert to GPT3.5 if its runs out of steam. ?

GPT-4o will also increase pressure for Nvidia’s newest AI semiconductors and its undoubted popularity make even greater demands on data centres daunting water and power supplies. But yes, you are going to love it. Mira admitted at the end that the demo was "only possible due to the availability of the most advanced GPU's" (computing power) - so will the rest of us have a lesser experience ? Find out in a few short weeks.

So, no AGI but this is, after all, “just” a ‘Spring Update’. Mira hinted at “progress towards the next frontier and the next big thing”…… aka GPT5, and AGI ?

#GTP-4o #AI

Martyn Wells

Looking to return to work after a career break

6 个月

Do you think this has been ushed out as there are further delays with training and safety guiderails on GPT-5? OpenAI have been losing out as other models, including “free” models were being launched that were outperforming GPT-4. SORA was the press darling but has yet to see the true light of day. I do appreciate that this is a “new” model, but it’s still fundamentally built on the GPT-4 architecture. Also, OpenAI were crafty releasing this the day before Google’s big developer conference ?? I have read that this new model now leads the standard AI benchmark tests, and the mysterious GPT2 chatbot that appeared on an AI review website recently to much hype was actually GPT-4o in disguise, getting some real world testing incognito. Proof of the pudding is in the eating as they say… but at least even “free” users will be able to have some sort of limited interaction.

查看更多评论

要查看或添加评论，请登录

Duncan Eadie的更多文章

Where Were You ?

2021年9月11日

Where Were You ?

Twenty years ago today was one of those “Where were you moments”. Few of those happen in a lifetime, thank fully, as we…

5 条评论
16 Core Values behind the success of one of the world’s leading Companies

2021年7月9日

16 Core Values behind the success of one of the world’s leading Companies

As you may know Jeff Bezos, the CEO of Amazon, has stepped down after 27 years at the company he founded. Like most…

2 条评论
Is this a Signal to reappraise WhatsApp?

2021年1月11日

Is this a Signal to reappraise WhatsApp?

Privacy is a key concern for everyone, especially for personal communications. This is why, with its ease of use and…
Is Teams about to Zoom into first place?

2020年7月9日

Is Teams about to Zoom into first place?

Whilst Zoom is still grabbing headlines, for positive reasons now that their 90-day security improvement programme…

4 条评论
Zoom or Boom - What is the issue with Zoom’s Security?

2020年4月8日

Zoom or Boom - What is the issue with Zoom’s Security?

Many companies have suffered from the current global lockdown, but Californian based Zoom is not one of them. Trading…

4 条评论
Happy 40th Tech Birthday!

2020年1月28日

Happy 40th Tech Birthday!

In the first month of this new year it’s tempting, and recommended of course, to look forward to the possibilities that…

3 条评论
What would you Like To Be ?

2019年11月5日

What would you Like To Be ?

"You will never amount to anything” were the chilling words from one of the most influential people in his life, his…

1 条评论
Law - Back to the Future

2017年11月21日

Law - Back to the Future

It was titled "Man versus Machine". This could only mean one thing, as a James Cameron character appears in your mind –…

3 条评论

See all articles

ChatGPT-4o: The Next Evolution in Interactive AI with Advanced Vision and Listening Capabilities

Duncan Eadie

Leveraging Technology for Law Firm Success

领英推荐

Duncan Eadie的更多文章

社区洞察

其他会员也浏览了

ChatGPT's Multisensory Journey: How "Vision" is Changing Conversations ????

Navigating the new frontier with Brad Lightcap

The ChatGPT Observer EP24

ChatGPT-4 Reveals Its Secrets: Unbelievable Upgrades That'll Change AI Forever

Unleashing the Future: The Transformative Capabilities of ChatGPT-5

ChatGPT can see, speak and hear / The Key to OpenAI's Models & APIs / The Next Generation of AI Artistry: DALL-E 3

The ChatGPT Observer #12

ChatGPT-4 vs. ChatGPT-4o: The Marketing Showdown You Need to Know About

Insider's Edit: ChatGPT Performance Drift - a New Risk for Business

Creating Conversational AI with User-Centric Mindset: A Step-by-Step Guide with ChatGPT-4 ??????(??)

领英推荐

Duncan Eadie的更多文章

Where Were You ?

16 Core Values behind the success of one of the world’s leading Companies

Is this a Signal to reappraise WhatsApp?

Is Teams about to Zoom into first place?

Zoom or Boom - What is the issue with Zoom’s Security?

Happy 40th Tech Birthday!

What would you Like To Be ?

Law - Back to the Future

社区洞察

其他会员也浏览了

ChatGPT's Multisensory Journey: How "Vision" is Changing Conversations ????

Navigating the new frontier with Brad Lightcap

The ChatGPT Observer EP24

ChatGPT-4 Reveals Its Secrets: Unbelievable Upgrades That'll Change AI Forever

Unleashing the Future: The Transformative Capabilities of ChatGPT-5

ChatGPT can see, speak and hear / The Key to OpenAI's Models & APIs / The Next Generation of AI Artistry: DALL-E 3

The ChatGPT Observer #12

ChatGPT-4 vs. ChatGPT-4o: The Marketing Showdown You Need to Know About

Insider's Edit: ChatGPT Performance Drift - a New Risk for Business

Creating Conversational AI with User-Centric Mindset: A Step-by-Step Guide with ChatGPT-4 ??????(??)