Sensing: The Next Frontier for ChatGPTs of the World?

ChatGPT and similar tools built on large language models (LLMs) have revolutionized how we interact with AI. Yet they still face notable limitations. For instance, I once asked ChatGPT to write a children's book titled Bunny Diaries, inspired by Owl Diaries, for my 9-year-old daughter. Her verdict? “It made me cringe!”

Similarly, the infamous "human hand problem" in LLM-generated images and videos highlights subtle yet critical visual inaccuracies. While advancements in training methods and cost functions are likely to mitigate such issues, they won't address a deeper, more fundamental challenge: the ability to sense.

Beyond Pattern Matching: The Power of Sensing

What is “sensing”? While the term is often associated with physical perception, here it means the dynamic process of interpreting and making sense of data streams in real time. The key word here is “dynamic”.

LLMs are trained to predict the next element in a sequence—text, speech, or video—based on patterns detected in prior data. This method is powerful but passive. What’s missing is an essential human capability: dynamic sensing.
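To make “passive” concrete, here is a toy sketch of my own (real LLMs are learned neural networks, not lookup tables; only the shape of the loop matters): a model that always emits the statistically most likely next token and has no mechanism for going back and re-perceiving its input.

```python
# Toy illustration of passive next-token prediction. Real LLMs use learned
# neural networks, not a bigram lookup table; only the loop shape matters here.
from collections import defaultdict

corpus = "the cat sat on the mat the cat ran".split()
counts = defaultdict(lambda: defaultdict(int))
for prev, nxt in zip(corpus, corpus[1:]):
    counts[prev][nxt] += 1  # count which token follows which

def predict_next(token: str) -> str:
    """Return the most frequent continuation seen in the training data."""
    followers = counts[token]
    return max(followers, key=followers.get) if followers else "<eos>"

# Generation only ever looks backward at prior context; nothing in this loop
# can ask, "did I interpret the earlier input correctly?"
token, output = "the", ["the"]
for _ in range(4):
    token = predict_next(token)
    output.append(token)
print(" ".join(output))  # -> "the cat sat on the"
```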

Let’s consider sensing in familiar contexts like hearing and vision. Humans excel at interpreting sensory input through a feedback loop between perception and cognition. Three scenarios illustrate this:

  • Noisy Speech: If you hear garbled speech and replay it, the second attempt often sounds clearer—not because the audio changed but because your brain processes it differently, using context to adjust perception dynamically.
  • Seeing What You Desire: When you decide to buy a black car, you suddenly notice more black cars on the street than before. This isn't because more black cars appeared, but because your perception has shifted.
  • Finding Waldo: Once you find Waldo in an image, you can’t “unsee” him. Your brain locks onto that visual cue, and it becomes a fixed point of recognition.

This interaction between thought and perception helps humans refine their understanding, interpreting the world through context and adaptability.

Bridging the Gap: Teaching LLMs to Sense

Sure, we can—and likely will—train LLMs differently in the future to be more inquisitive and improve their knowledge bases dynamically. But this is not where LLMs are today.

However, by leveraging creative prompting alongside computer vision and AI-based audio processing stacks, we can develop LLM-based systems that emulate sensory capabilities. This involves using well-designed prompts that enable LLMs to provide iterative feedback to the AI subsystems they rely on to interact with the real world. For instance (two illustrative sketches follow this list):

  • Vision Systems: Much like how our eyes shift focus to a nearby object or adapt to bright sunlight, a camera could fine-tune its focus and saturation in real time based on the LLM's analysis of what is most relevant in the scene. For example, if the system detects an important object in motion, it could sharpen its focus to capture that detail.
  • Audio Systems: Similar to how our brain filters out background chatter to concentrate on a single conversation at a noisy party, a microphone pipeline could adjust to suppress unwanted sounds. For instance, it might zero in on the voice of a specific speaker in a crowded room, enhancing clarity and focus for the LLM.
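For the vision bullet, here is a minimal sketch of what that feedback loop could look like. Everything in it (capture_frame, ask_llm, the JSON contract) is a hypothetical placeholder rather than a real camera or model API; the point is the capture, critique, adjust, re-capture cycle.

```python
# Hypothetical sketch of an LLM-in-the-loop camera pipeline.
# capture_frame() and ask_llm() are stand-ins, not a real API.
from dataclasses import dataclass

@dataclass
class CameraSettings:
    focus: float = 0.5       # 0 = near, 1 = far
    saturation: float = 0.5  # 0 = muted, 1 = vivid

def capture_frame(settings: CameraSettings) -> bytes:
    """Stand-in for a real camera capture with the given settings."""
    raise NotImplementedError  # hardware-specific

def ask_llm(prompt: str, image: bytes) -> dict:
    """Stand-in for a multimodal LLM call returning structured JSON."""
    raise NotImplementedError

PROMPT = (
    "You are steering a camera. Identify the most relevant object in this "
    "frame. If it is blurry or washed out, reply with JSON such as "
    '{"sharp": false, "focus_delta": -0.1, "saturation_delta": 0.0}.'
)

def sense(settings: CameraSettings, max_iters: int = 3) -> bytes:
    """Iteratively re-capture until the LLM judges the frame usable."""
    frame = capture_frame(settings)
    for _ in range(max_iters):
        feedback = ask_llm(PROMPT, frame)
        if feedback.get("sharp", True):
            break  # the LLM is satisfied; stop adjusting
        settings.focus += feedback.get("focus_delta", 0.0)
        settings.saturation += feedback.get("saturation_delta", 0.0)
        frame = capture_frame(settings)  # perceive again, differently
    return frame
```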
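And a matching sketch for the audio bullet, again with placeholder functions (record, transcribe, ask_llm) standing in for a real DSP and speech-recognition stack:

```python
# Hypothetical sketch of an LLM-steered audio front end.
# record(), transcribe(), and ask_llm() are placeholders, not a real API.
from dataclasses import dataclass

@dataclass
class MicSettings:
    noise_gate_db: float = -40.0  # audio below this level is suppressed

def record(settings: MicSettings) -> bytes:
    raise NotImplementedError  # hardware/DSP-specific

def transcribe(audio: bytes) -> str:
    raise NotImplementedError  # any ASR stack

def ask_llm(prompt: str, transcript: str) -> dict:
    raise NotImplementedError  # LLM call returning structured JSON

def listen_for_speaker(name_hint: str, settings: MicSettings,
                       max_iters: int = 3) -> str:
    """Iteratively tighten the front end until the target speaker is clear."""
    transcript = ""
    for _ in range(max_iters):
        transcript = transcribe(record(settings))
        verdict = ask_llm(
            f"Is the speaker '{name_hint}' clearly audible here? "
            'Reply as JSON: {"clear": true/false, "gate_delta_db": 0.0}',
            transcript,
        )
        if verdict.get("clear", False):
            break
        # The LLM's feedback retunes the front end, like attention at a party.
        settings.noise_gate_db += verdict.get("gate_delta_db", 0.0)
    return transcript
```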

These enhancements would enable LLMs to process sensory data iteratively, bridging the gap between static data interpretation and dynamic sensing.


The Path Forward

The technology to build adaptable subsystems already exists, and fields like prompt engineering are well-developed. However, the challenge lies in integrating these components into cohesive systems that allow LLMs to engage dynamically with the world.

Each subsystem—be it audio, visual, or contextual—must be carefully designed and built. These components must then be harmonized to create a synergistic interaction between LLMs and their environments.
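As one illustration of that harmonization, the subsystems could share a small common contract so the same LLM-driven loop can steer any of them. The sketch below is my own illustration of the idea, not a proposal for a specific framework:

```python
# Illustrative sketch of a shared contract for adjustable subsystems,
# so one generic loop can drive cameras, microphones, or other sensors.
from typing import Callable, Protocol

class SensingSubsystem(Protocol):
    """Anything the LLM can observe through and steer, audio or visual."""
    def observe(self) -> bytes: ...             # capture raw sensory data
    def adjust(self, feedback: dict) -> None: ...  # apply LLM feedback

def sensing_loop(subsystem: SensingSubsystem,
                 interpret: Callable[[bytes], dict],
                 max_iters: int = 3) -> bytes:
    """Generic perceive -> interpret -> adjust loop, reused across modalities.

    `interpret` is any callable that asks the LLM for a structured verdict,
    e.g. {"done": bool, "feedback": {...}} -- a placeholder contract.
    """
    data = subsystem.observe()
    for _ in range(max_iters):
        verdict = interpret(data)
        if verdict.get("done", True):
            break
        subsystem.adjust(verdict.get("feedback", {}))
        data = subsystem.observe()
    return data
```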

By moving beyond passive pattern matching, we can transform LLMs into tools capable of sensing and engaging dynamically with their surroundings. This advancement would significantly expand their utility, enabling them to achieve far more than just searching for cat videos on the internet.


How Far is This Future?

All the components of this future are within reach for AI experts. The team at https://AZcare.AI has already taken a bold step forward by developing the industry's first sensing agent to serve as a personal health agent. This milestone signals the vast potential ahead for AI systems that sense and adapt, transforming industries and redefining how we interact with technology. And many others will soon follow in their footsteps.
