登录查看更多内容

Fact-Checking Headlines about ChatGPT and Coding

Barclay R. Brown, Ph.D., ESEP

Senior Fellow, AI Research, Collins Aerospace

发布日期: 2024年6月9日

A post by futurism.com features the headline, "STUDY FINDS THAT 52 PERCENT OF CHATGPT ANSWERS TO PROGRAMMING QUESTIONS ARE WRONG." Today, I'm playing fact checker. When I read the original study, I find the actual conclusions of the study: "Our analysis shows that 52% of ChatGPT answers contain incorrect information and 77% are verbose. Nonetheless, our user study participants still preferred ChatGPT answers 35% of the time due to their comprehensiveness and well-articulated language style. However, they also overlooked the misinformation in the ChatGPT answers 39% of the time." The study's conclusions are not at all captured correctly in the futurism.com headline, which leads me to wonder what percent of HEADLINES contain incorrect information. The fact that some percent of responses CONTAIN incorrect information is not at all the same thing as saying that same percent are "wrong."

If someone would like to make the case that chatGPT (GPT-4o) is always wrong about coding questions, I believe I can create prompts that will generate wrong answers nearly every time. One easy way is for the prompt to be vague--then when the LLM chooses one interpretation, we simply claim that we really meant it the other way.

I use chatGPT for coding nearly every day. Yes, sometimes the code has a bug. I tell GPT4o about the bug and it recodes it in a second, eliminating the bug. If you know enough about how LLMs work, you know why there can be a difference between it coding it one way at the start and then correcting it with specific feedback. Newsflash to no one: human coders often have bugs in first draft code--many more than ChatGPT, since human coders like me also make syntax errors, forget function and class property names,

I find using a powerful LLM to assist me in coding is a much more fun and interesting way to work, and on personal projects enables me to do things I would likely not take the time to code by hand.

领英推荐

Top 10 Budget-Friendly Courses To Learn ChatGPT Quickly

Stealth Startup 2 个月前

Will A.I. be Able to Augment Programmers? DeepMind's…

Michael Spencer 3 年前

AI-Assisted Development: Andrej Karpathy’s “Vibe…

Roberto Moreno 1 个月前

There IS a learning curve. Well-formed and clearly worded prompts are as important for coding tasks as they are for writing tasks.

Next steps: get good information, try it yourself and make your own conclusions. Also, we need some good programming COMPETITIONS between three kinds of teams (echoing Karparov's Advanced Chess tournaments): humans, AIs, and human+AI teams. If chess is any predictor, the human+AI teams are likely to emerge victorious, at least for now. Competitions would measure overall time to complete, working code, not first pass coding accuracy.

For me, I intend to get better at BOTH AI-assisted coding and also coding itself. What might change for me is that I'll focus more on gaining broad knowledge of coding techniques, capabilities of various libraries and subsystems, and architectural approaches to software systems, rather than trying to remember specific syntax, punctuation, and keywords. That way, I can better guide my AI programming assistant to do my bidding.

On Intelligent Systems

1,089 位关注者

Zane Scott

9 个月

Barclay - Right on target! I don't use AI for coding but I do use it for research and analysis in the social science areas. I find that it functions much like I did when I was a law student clerking for trial lawyers. I give ChatGPT instructions that shape its viewpoint and instruct it on my area of inquiry. The better my instructions as to its role and questions to be answered, the better its return. Your hunch that the human + AI team will produce the best quality output is spot on. That works for me now - minimizing the tendency of the AI to hallucinate and maximizing its response to exactly what I need. That's just like a good trial team with litigators supported by well-instructed paralegals. I would also recommend Ethan Mollick's new book, Co-Intelligence. He has lots of good stuff to say about how to relate to AI for the best results.

1 次回应

Noah Schwanke

Sr Mgr, Intelligent Systems | AI Product Manager at Collins Aerospace

9 个月

I like the coding competitions idea!

1 次回应

The AI Bulletin

9 个月

Great insight on AI and ChatGPT! AI is truly transforming the way we interact, and ChatGPT is leading the way in this revolution. Thank you for the share!

1 次回应

William Alex Dryden

Senior Project / Program Manager - M&A and Transformation Projects at WTS Solution

9 个月

Just wanted to say hello ??, it’s been a while - I hope all is well with you and I will review your post. Take care Alex Dryden

1 次回应

Ricardo Reis

Systems Engineering Igniter

9 个月

I wait for the follow-up... about: - approaches for education to take advantage of LMM without falling into complacency or creating professionals without the ability to steer the LLM like you're doing (e.g., my analogy is the "script kid", ample user of available script online but unable to penetrate and create something more deep than that) - your thoughts are pointed to a scenario of augmentation instead of replacement. Which should be the direction we should strive? For curiosity do you consider being a "team" with the AI system or it just being a sophisticated tool? cheers,

1 次回应

查看更多评论

要查看或添加评论，请登录

Barclay R. Brown, Ph.D., ESEP的更多文章

Robot Imagination Surpasses Human

2023年6月3日

Robot Imagination Surpasses Human

By now you've heard of the alleged AI system that decided to kill its operator, judging that the operator was a…

13 条评论
GPT is Here. What's a Teacher to Do?

2023年1月21日

GPT is Here. What's a Teacher to Do?

Teachers and professors (I used to be one) are facing a new "threat" in that large language models like GPT-3, chatGPT,…

23 条评论
Why AIs Won't Be Taking Over Anytime Soon

2022年12月28日

Why AIs Won't Be Taking Over Anytime Soon

Imagine we want to build an AI-based robot that can play baseball as well as a good human player. We start by…

8 条评论
AI Generates Design Concepts for Inspiration

2022年12月26日

AI Generates Design Concepts for Inspiration

A recent post on LinkedIn (https://www.linkedin.

2 条评论
Why AIs Won't Be Taking Over Anytime Soon

2021年11月7日

Why AIs Won't Be Taking Over Anytime Soon

Imagine we want to build an AI-based robot that can play baseball as well as a good human player. We start by…

1 条评论
People Systems: Companies Don't Hire People--People Hire People

2021年10月4日

People Systems: Companies Don't Hire People--People Hire People

Taking the systems approach to human activity can bring new insight and understanding. The more common way of analyzing…

3 条评论
Fooled by the System

2021年8月30日

Fooled by the System

Understanding systems also involves avoiding being fooled. I have an illustration of this I’ve presented to seminars…

10 条评论
We Live in a World of Systems

2021年8月24日

We Live in a World of Systems

When I taught a graduate course in systems thinking at Worcester Polytechnic, I told my students that by the end of the…

10 条评论
Why Model?

2021年7月3日

Why Model?

A model of a system is an abstraction that represents aspects of the system in a way that can be analyzed, but which is…

8 条评论

See all articles

Fact-Checking Headlines about ChatGPT and Coding

Barclay R. Brown, Ph.D., ESEP

Senior Fellow, AI Research, Collins Aerospace

领英推荐

On Intelligent Systems

1,089 位关注者

Barclay R. Brown, Ph.D., ESEP的更多文章

社区洞察

其他会员也浏览了

How AI is Enhancing the Role of Programmers, Not Replacing It

The Agony and the Ecstasy of AI Coding Agents

Code Alphas Clash: DeepSeek Coder R1 vs. ChatGPT (Base Model) for Programming

The Inner Game of CHOP

July 07, 2024

MetaGPT: Important Conceptual Advance in Multi-Agent Systems

Langchain Expression Language—Simplifying Complex Workflows

Run DeepSeek-R1 Like a Pro-No Coding PhD Needed!

Programming is Dead, Long Live the Era of Generative AI Coding

The Challenges of Programming Specific Behaviors in GPTs: A Case Study with Eva, the AI Scheduling Agent for Healthier Plate

领英推荐

On Intelligent Systems

1,089 位关注者

Barclay R. Brown, Ph.D., ESEP的更多文章

Robot Imagination Surpasses Human

GPT is Here. What's a Teacher to Do?

Why AIs Won't Be Taking Over Anytime Soon

AI Generates Design Concepts for Inspiration

Why AIs Won't Be Taking Over Anytime Soon

People Systems: Companies Don't Hire People--People Hire People

Fooled by the System

We Live in a World of Systems

Why Model?

社区洞察

其他会员也浏览了

How AI is Enhancing the Role of Programmers, Not Replacing It

The Agony and the Ecstasy of AI Coding Agents

Code Alphas Clash: DeepSeek Coder R1 vs. ChatGPT (Base Model) for Programming

The Inner Game of CHOP

July 07, 2024

MetaGPT: Important Conceptual Advance in Multi-Agent Systems

Langchain Expression Language—Simplifying Complex Workflows

Run DeepSeek-R1 Like a Pro-No Coding PhD Needed!

Programming is Dead, Long Live the Era of Generative AI Coding

The Challenges of Programming Specific Behaviors in GPTs: A Case Study with Eva, the AI Scheduling Agent for Healthier Plate