Bugs Faster than the Speed of Thought

I got access to OpenAI’s GPT-3 last year, and one of the first things I did was prompt it with a C++ interface struct and have it write the implementation. I was genuinely surprised by the results. Some of the completions were clearly lifted from Github projects, complete with valid Github links. My thought was, “Wow, this would make an impressive auto-complete.” Today Github released Copilot, a GPT-3-powered auto-complete feature, and it’s very impressive.
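
To make that concrete, here is a made-up sketch of the kind of prompt I mean (the KeyValueStore interface and InMemoryStore names are invented for illustration, not what I actually fed the model): you hand the model an abstract interface and it completes a plausible implementation.

#include <string>
#include <utility>
#include <vector>

// The prompt: an abstract interface with no implementation.
struct KeyValueStore {
    virtual ~KeyValueStore() = default;
    virtual void put(const std::string& key, const std::string& value) = 0;
    virtual bool get(const std::string& key, std::string& value_out) const = 0;
};

// The kind of completion the model would be expected to produce:
// a straightforward in-memory implementation of the interface.
struct InMemoryStore : KeyValueStore {
    void put(const std::string& key, const std::string& value) override {
        for (auto& entry : entries_) {
            if (entry.first == key) { entry.second = value; return; }
        }
        entries_.emplace_back(key, value);
    }
    bool get(const std::string& key, std::string& value_out) const override {
        for (const auto& entry : entries_) {
            if (entry.first == key) { value_out = entry.second; return true; }
        }
        return false;
    }
private:
    std::vector<std::pair<std::string, std::string>> entries_;
};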


Anybody who has created a production AI system will know that only about 20% of the work goes into creating the models; the scaffolding around them is the other 80%. I’m sure it took a lot of work to go from playing with the GPT-3 playground to something as well integrated into an IDE as Copilot is.

Being well integrated is key to Copilot’s success, and it’s going to be used by hundreds of thousands, if not a million, programmers very quickly. That is precisely what makes it so dangerous.

In Code Complete, Steve McConnell wrote extensively on defects in production systems. The industry average defect rate is about 15 to 50 bugs per 1000 lines of code. Some techniques used by NASA can get the bug count to almost zero. Open source software likely has MORE bugs per 1000 lines of code, because most open source projects have one developer and no other eyeballs on the code.

Copilot isn’t magic and will, on average, perform worse than a human coder. If it’s trained on the gigantic corpus of 100 million Github projects, it will almost certainly produce more than 50 bugs per 1000 lines of code. And it’s faster than copy-pasting code snippets, because Copilot auto-completes code that will likely compile and needs less human correction. All programmers understand why copy-pasting code is bad: it likely introduces bugs. With Copilot, bugs will be transmitted faster than the speed of thought.
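
To illustrate the failure mode, here is an invented example rather than real Copilot output: the function compiles cleanly and reads like idiomatic code, but the loop bound quietly walks one element past the end of the vector.

#include <cstddef>
#include <vector>

// Invented prompt: the programmer types the signature and the comment,
// and the tool fills in the body.
// Returns the average of the samples, or 0.0 for an empty input.
double average(const std::vector<double>& samples) {
    if (samples.empty()) return 0.0;
    double sum = 0.0;
    // Compiles cleanly and looks routine, but <= reads one element
    // past the end of the vector.
    for (std::size_t i = 0; i <= samples.size(); ++i) {
        sum += samples[i];
    }
    return sum / samples.size();
}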

What could the consequences of buggy software written at a breakneck pace be? The fatal Boeing 737 MAX 8 crash involving Ethiopian Airlines in 2019 was the result of AI gone wrong: Boeing took a safety system that was supposed to engage only in critical situations and expanded it to noncritical ones. Black-box systems kill. Imagine this for a second: building AI systems is the future of software. You will no longer write algorithms but the scaffolding of learning systems. Now imagine that your scaffolding itself is written mostly by Copilot. Bugs will propagate in new ways, via systems that build systems.

Building software is building a small world. It’s about meaning, and we know GPT-3 doesn’t understand meaning. It won’t understand your problem either. When programmers get used to auto-completed code that compiles, how deeply will they dig into it? Will they review it carefully? Designing human-machine interaction is hard, and you don’t want humans writing software asleep at the wheel.
