Why can't ChatGPT generate a photo with a correctly written name on it?

Lolita Ndoci

Figuring it out??????

Published: February 18, 2024

I saw the SORA videos, and they were great, each generated from nothing but a prompt. So why the hell can't it generate the simplest photo with text in it? You can see how ChatGPT rendered my name in the photo, or how it handled a simple quote by Leonard Sweet.

While DALL·E (in ChatGPT) is highly advanced at interpreting prompts and generating visuals, it is not without difficulties, especially when it comes to rendering precise text within images.

Let's imagine we have a robot designed to bake cakes using recipes from a big cookbook. This robot is smart and can follow instructions to bake various cakes. However, when it's time to decorate these cakes with special messages written in icing, things don't always go as planned. This situation helps us understand why AI, like our robot, sometimes struggles with writing text correctly in images. Let's dive into the reasons through this simple analogy.

1. Learning from a Broad Range of Examples

Our robot has learned to bake cakes by looking at many pictures and recipes. But it wasn't specifically taught how to decorate cakes with messages. This is similar to how AI models learn to generate images. They see lots of data, but not enough of it is focused on written text. So, while our robot can make icing, writing neatly with it is another story. It knows roughly what icing letters should look like, but it only sometimes gets them right on the cake.

2. Grasping the Meaning

The robot knows that messages on cakes matter but doesn't really get the meaning behind the words. For example, writing "Happy Birthday" needs a different touch than a "Congratulations" message. This shows how AI might recognize words but not fully understand when and how to use them properly in an image. It's as if the robot knows to put words on the cake but misses what makes each message special.

3. Having the Right Tools

Imagine our robot only has basic tools for cake decoration, not the precise ones needed for detailed writing. This is like AI's limitation in creating clear text in images. The AI has tools for making pictures but not for fine text details. So, when our robot tries to write on a cake, the letters might not look very good, just like AI might mess up the text in a picture.

4. Seeing the Details

Let's say the robot's camera, used to check the finished cake, isn't sharp enough to see small mistakes in the icing text. It looks at the overall cake but misses the little errors. This is similar to how AI focuses on making the main part of the image but might not pay enough attention to the text details. So, just like our robot might not notice a smudge in the letters, AI-generated images might have text that's not quite right.
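To make the "blurry camera" idea a bit more concrete, here is a toy sketch in Python. It is only an illustration under stated assumptions: the 100x100 image, the thin strip standing in for text, and the plain mean squared error score are all made up for this example, and none of it is meant to describe how DALL·E is actually trained or evaluated. It simply shows that a checker which averages over every pixel can score a cake with no message at all as better than a cake with perfect lettering and a slightly off background.

```python
def mean_squared_error(image_a, image_b):
    """Average squared pixel difference between two equally sized grayscale images."""
    total = 0.0
    count = 0
    for row_a, row_b in zip(image_a, image_b):
        for pixel_a, pixel_b in zip(row_a, row_b):
            total += (pixel_a - pixel_b) ** 2
            count += 1
    return total / count


SIZE = 100  # hypothetical image size, chosen only for illustration

# Target image: white background (1.0) with a thin dark strip standing in for text.
target = [[1.0] * SIZE for _ in range(SIZE)]
for y in range(48, 50):
    for x in range(40, 60):
        target[y][x] = 0.0

# Attempt 1: the background is perfect, but the "text" is missing entirely.
missing_text = [[1.0] * SIZE for _ in range(SIZE)]

# Attempt 2: the "text" is perfect, but the background is slightly too dark everywhere.
tinted_background = [
    [pixel - 0.1 if pixel == 1.0 else pixel for pixel in row] for row in target
]

text_error = mean_squared_error(target, missing_text)
background_error = mean_squared_error(target, tinted_background)

print(f"error with text missing:      {text_error:.5f}")
print(f"error with background tinted: {background_error:.5f}")
```

In this toy setup the attempt with no text at all gets the lower error, because the letters cover only 40 of the 10,000 pixels. A checker like this would happily wave through a cake with a smudged message.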

Conclusion

Through this cake-baking robot story, we can see why AI has a tough time with text in images. It's about how the AI learns, understands context, uses its tools, and pays attention to details. Just like our robot tries to improve its cake decorating, AI technology is always getting better. We can hope for clearer and more accurate text in AI images as technology advances.

AI with a Human Touch

993 followers

G. L. Pedersen

Author and Independent Scholar

2 months ago

Bull doogy! It misspells things on purpose, so you can't sell graphics, at least easily. For example, you can't generate a graphic for New Year with 2025.

Leonard S.

Specializing in personality assessments for the nursing/medical industry for onboarding, hiring, team building, and reduction of high employee turnovers. Rebuilding workplace culture for your company.

8 months ago

So, how can I fix it? It's a wonderful image...

Greg Prickril

IBM MSFT SAP - B2B product management coach, consultant, trainer, and speaker passionate about increasing business impact with innovative, customized programs for individuals and organizations.

9 months ago

This behavior is obviously intentional as it gets it wrong 100% of the time. The interesting conversation is their motivation. At the top of my list? Probably for future monetization, they don't want people using it for commercial graphics. It's lame AF.

Robb Ryniak

Senior Software Engineer & Architect

9 months ago

I'm sorry but your assessment of the situation is extremely naive. ChatGPT has the capability of generating suitable text. That's a fact. Measuring text in the context of an image is technology that's been around since writing text on a screen has been around. That's also a fact. Assuming they're actually not using pre-existing fonts directly for rendering, I promise you that they are using them as training data to understand how to write the language. So I promise you this is not a *technical* issue for them. That's simply not a rational assessment.

Bledar Aliaj

Energy Transition Expert, Energy Auditing and Management

1 year ago

Because they spent so much time and money enhancing captcha, it would be basically working against themselves.
