
The New AI Iterative Development Paradigm (and Why AI == IA)

Darren Broemmer

Professor of Computer Science, Author, Software Engineer

Published: September 8, 2023

There are a number of open-source AI tools that claim to create an entire application from only one prompt. I decided to take them for a test drive.

The results were mixed, which is not surprising given that these tools are still in their infancy. They will almost certainly improve over time along with the rest of AI technology.

But one takeaway was clear ...

You will never be able to remove humans completely from the systems development loop.

Why is this the case? Read on to find out, but first, let's examine the results of our "completely automated AI engineering" experiment.

AI projects that generate apps from a single prompt

Projects including GPTEngineer, MetaGPT, and LangChain-Coder leverage the concept of prompt chaining. The goal of these projects is to perform the work of an entire software development team. Based on only a single prompt, they generate the entire code for an application. Each project has varying levels of interactivity, although each aspires to be largely automated.

MetaGPT states that it "takes a one-line requirement and outputs user stories / competitive analysis/requirements/data structures/APIs/ documents, etc." It does this by using AI prompts to simulate "product managers/architects/project managers/engineers. It provides the entire process of a software company along with carefully orchestrated SOPs."

Initial generative AI prompts are used to define requirements. Subsequent prompts in the chain are used to design and build the application.

So, what application are we going to build? The example given in the MetaGPT docs is "Write a cli snake game." Rather than a video game, I decided to write a utility that I can use in my business. In addition to technical writing, I also create puzzle books including crossword puzzles. This can be a time-consuming task, so I have been looking to make a crossword puzzle editor.

The basic capability could be simple. The app could display the crossword grid and allow the user to edit characters, words, and clues.

It could also be complicated. The editor could include the capability to generate portions of the puzzle, place theme words in primary locations, etc. There is a wide range of possibilities.

Here is the single prompt I used.

The product should be a Python Django application that allows users to create a 13 x 13 crossword puzzle. The application should only accept valid dictionary words from a specified file (dict.txt) and convert these words to uppercase for the puzzle. The application should also provide an easy-to-use interface for users to input words into the puzzle and save and load their crossword puzzles.

NOTE: I had installation issues with LangChain-Coder, and GPTEngineer simply asked me what the requirements were. I thought this was odd since I had already specified them in the initial prompt. Perhaps it is too interactive. Thus, this article will focus on my experience with MetaGPT.

AI-Generated Requirements and Design Artifacts

MetaGPT parsed out the requirements as follows.

[("Develop a Python Django application that allows users to create a 13 x 13 crossword puzzle", "P0"),
 ("Ensure the application only accepts valid dictionary words from a specified file (dict.txt)", "P0"),
 ("Convert all words from the dict.txt file to uppercase for the puzzle", "P0"),
 ("Provide an easy-to-use interface for users to input words into the puzzle", "P1"),
 ("Allow users to save and load their crossword puzzles", "P1")]

This is an accurate restatement of the requirements from my prompt. I expected it would elaborate a bit more on the capabilities, but at least it accurately summarized what I told it.
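To make the two dictionary requirements concrete, here is a minimal sketch of how they might be implemented. It is not MetaGPT's output; the function names `load_dictionary` and `is_valid_word` are hypothetical, and the rule that entries containing non-letters are skipped is an assumption the prompt does not state.

```python
from typing import Iterable, Set


def load_dictionary(lines: Iterable[str]) -> Set[str]:
    """Return the set of uppercase words found in an iterable of lines."""
    words = set()
    for line in lines:
        word = line.strip()
        # Assumption: skip blank lines and entries containing non-letters
        # (digits, hyphens, apostrophes), since they cannot fill grid cells.
        if word.isalpha():
            words.add(word.upper())
    return words


def is_valid_word(word: str, dictionary: Set[str]) -> bool:
    """Case-insensitive membership check against the loaded dictionary."""
    return word.strip().upper() in dictionary
```

Because an open text file is an iterable of lines, the file named in the prompt could be passed in directly, for example `with open("dict.txt") as f: dictionary = load_dictionary(f)`.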

MetaGPT further stated, "There are no unclear points." Great, it seems confident that it knows what to do.

The AI chose some good design patterns. It said, "The main challenge will be the implementation of the crossword puzzle logic. We will use a backtracking algorithm to fill the 13x13 grid with words from the dictionary. The words will be stored in a Trie data structure for efficient lookup."

There are two key design choices here, both of which I endorse.

A backtracking algorithm is a decent choice to try and fill out the crossword grid such that all words going across and down are valid. This could have performance issues as the number of permutations to attempt is enormous, but any implementation will have similar issues.
It chose to use a Trie data structure for the dictionary. This is exactly what I did in earlier crossword-related software. A Trie is a k-ary search tree in which each node represents the next letter in a possible word. Thus, each node has at most 26 children, one for each letter in the alphabet.
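
The two choices work well together: a trie can tell the backtracking search that a partial column is no longer the start of any word, so the search can abandon that branch early. The following is a simplified sketch of that idea, not MetaGPT's generated code. It assumes a small square grid with no black squares, where every row and every column must be a word, and the names `build_trie`, `has_prefix`, and `fill_grid` are hypothetical.

```python
from typing import Dict, List, Optional

END = "$"  # marker key meaning "a complete word ends at this node"


def build_trie(words: List[str]) -> Dict:
    """Build a nested-dict trie: each key is a letter, each value a sub-trie."""
    root: Dict = {}
    for word in words:
        node = root
        for letter in word:
            node = node.setdefault(letter, {})
        node[END] = True
    return root


def has_prefix(trie: Dict, prefix: str) -> bool:
    """Return True if at least one word in the trie starts with prefix."""
    node = trie
    for letter in prefix:
        if letter not in node:
            return False
        node = node[letter]
    return True


def fill_grid(words: List[str], size: int) -> Optional[List[str]]:
    """Fill a size x size grid row by row so every row and column is a word."""
    candidates = sorted({w.upper() for w in words if len(w) == size})
    trie = build_trie(candidates)
    rows: List[str] = []

    def columns_ok() -> bool:
        # Prune: every partial column must still be the start of some word.
        for col in range(size):
            prefix = "".join(row[col] for row in rows)
            if not has_prefix(trie, prefix):
                return False
        return True

    def solve() -> bool:
        if len(rows) == size:
            # The trie holds only words of length `size`, so a full-length
            # column prefix is itself a complete word.
            return True
        for word in candidates:
            rows.append(word)
            if columns_ok() and solve():
                return True
            rows.pop()  # backtrack and try the next candidate
        return False

    return rows if solve() else None
```

A real 13 x 13 puzzle with black squares and theme words has far more slots and constraints, which is where the performance concern mentioned above comes in.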

Data models and sequence diagrams were automatically created using Mermaid. The AI output also included a competitive analysis. This was a nice surprise. I verified that each of the other references was a commercial product or open-source project.

Competitive Analysis produced by MetaGPT

Given our basic requirements, it may have overstated the utility of our product. Nonetheless, it was helpful to get pointers to other similar products out there.

Finally, we get to the Code

MetaGPT proceeded to write what looked like decent code. It summarized basic knowledge of each component fed into the prompts. For example, Django apps have a models.py file. Here is what MetaGPT said about it.

The 'models.py' file defines the Puzzle and TrieNode models. The Puzzle model has a grid attribute for storing the crossword grid and a words attribute for storing the words used in the puzzle. The TrieNode model is used for efficient word lookup.

The code for the Trie data structure seemed quite good. It was actually simpler than what I had written in the past.

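class TrieNode:
  """One letter in the trie, with links to the letters that may follow it."""

  def __init__(self, letter: str):
    self.letter = letter
    self.children = {}  # maps the next letter to its child TrieNode
    self.is_end_of_word = False  # True when a complete word ends here

  def insert(self, word: str):
    # Walk down from this node, creating child nodes as needed.
    node = self
    for letter in word:
      if letter not in node.children:
        node.children[letter] = TrieNode(letter)
      node = node.children[letter]
    node.is_end_of_word = True

  def search(self, word: str) -> bool:
    # Follow the word letter by letter; fail as soon as a link is missing.
    node = self
    for letter in word:
      if letter not in node.children:
        return False
      node = node.children[letter]
    # A prefix of a longer word is not itself a word.
    return node.is_end_of_word

class Trie:
  """Wrapper that owns the root node and delegates to it."""

  def __init__(self):
    self.root = TrieNode("*")  # placeholder letter for the root

  def insert(self, word: str):
    self.root.insert(word)

  def search(self, word: str) -> bool:
    return self.root.search(word)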

It also gave updates along the way, including how much money I was spending on the ChatGPT API calls.

2023-08-28 09:54:30.493 | INFO | metagpt.provider.openai_api:update_cost:81 - Total running cost: $0.417 | Max budget: $3.000 | Current cost: $0.104, prompt_tokens: 2616, completion_tokens: 419
2023-08-28 09:54:30.494 | INFO | metagpt.actions.write_code:run:77 - Writing utils.py ...

Unfortunately, it then crashed. And it was going so well up until this point.

openai.error.RateLimitError: Rate limit reached for 10KTPM-200RPM in organization org-<obfuscated> on tokens per min. Limit: 10000 / min. Please try again in 6ms. Contact us through our help center at help.openai.com if you continue to have issues.

Through no fault of my own, the program hit an OpenAI rate limit. By itself, this would have been okay. However, there is no restart button. On my second attempt, the entire process started all over again.
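
A common way to survive a transient rate limit is to retry the failed call with exponential backoff instead of letting the whole run crash. The sketch below illustrates the general technique only. It is not how MetaGPT or the OpenAI client handles errors, and `RateLimited` and `call_with_backoff` are hypothetical names standing in for whatever the real provider raises and exposes.

```python
import random
import time
from typing import Callable, TypeVar

T = TypeVar("T")


class RateLimited(Exception):
    """Hypothetical stand-in for a provider's rate-limit error."""


def call_with_backoff(
    func: Callable[[], T],
    max_attempts: int = 5,
    base_delay: float = 1.0,
    sleep: Callable[[float], None] = time.sleep,
) -> T:
    """Call func, retrying on RateLimited with exponentially growing waits."""
    if max_attempts < 1:
        raise ValueError("max_attempts must be at least 1")
    for attempt in range(max_attempts):
        try:
            return func()
        except RateLimited:
            if attempt == max_attempts - 1:
                raise  # out of attempts: surface the error to the caller
            # Wait base_delay * 2^attempt seconds, plus up to 10% jitter
            # so that parallel callers do not all retry at the same instant.
            delay = base_delay * (2 ** attempt)
            sleep(delay + random.uniform(0, delay * 0.1))
    raise AssertionError("unreachable")
```

Retrying only addresses the crash. Resuming a long pipeline from where it stopped would also require saving each completed step's output, which this sketch does not attempt.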

And this was my biggest complaint. Software development, by its nature, is an iterative process. This whole exercise was a big reminder of this fact.

Further, iterations within the process will always involve humans, at least to some extent. Coding tasks may indeed shift largely to the machines over time. However, as any technology practitioner understands, that is only one piece of the puzzle.

AI as Intelligent Assistant

After that experiment, I still wanted a crossword editor. So I went to ChatGPT and used a smaller-scope prompt to start by simply displaying a crossword grid. I decided to use Python and PyQt6, a Python wrapper around the Qt library, for the user interface.

This was a perfect use case for engineering with AI, as I have never developed using this GUI library before. AI can lead the way.

ChatGPT wrote some nice code and I gave it a whirl. However, instead of a 13x13 grid, it displayed all the letters/tiles in one big column. I informed it of its error, and it humbly apologized before providing me with working code. I was able to successfully build off of that and have been learning PyQt6 along the way.
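
The article does not say what caused the single-column layout, but laying tiles out in a grid comes down to giving each tile both a row and a column. The sketch below shows that index arithmetic in plain Python. The helper name `grid_positions` is hypothetical, and the PyQt6 wiring is left out so the example runs without a display.

```python
from typing import List, Tuple


def grid_positions(size: int = 13) -> List[Tuple[int, int]]:
    """Return (row, column) pairs for a size x size grid in reading order."""
    # divmod(index, size) yields (index // size, index % size):
    # the row advances once every `size` tiles, the column wraps around.
    return [divmod(index, size) for index in range(size * size)]
```

In PyQt6, each pair could then be passed to `QGridLayout.addWidget(widget, row, column)` when creating the tile widgets.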

AI as an Intelligent Assistant is the sweet spot for the technology at the moment.

Why humans can't be removed from the loop

In the first edition of this newsletter, I put forth the thesis that AI turns software engineering squarely into a requirements problem. If you can clearly define what you want, AI can help you build it. But you need to understand your application in extraordinary detail.

Formal programming languages leave no detail to chance. Every bit, byte, and pixel is specified by the code. Given the vast number of permutations possible, any unspecified requirements or assumptions can cause the as-built application to veer further and further away from the desired target.

Iterative development took hold because so many details eventually need to be defined that it is nearly impossible to identify and specify them all upfront.

The same is true for building applications with AI. Use AI to build components and services iteratively, and constantly review, edit, and refine what is generated. Feed this updated code back to the AI so it stays in sync with the software being built going forward.

Thus, AI == IA (Intelligent Assistant) in engineering use cases. You go back and forth with your AI engineering assistant during development. This is the new AI-assisted iterative paradigm.

Even as AI gets more efficient and effective, there will almost always be gaps in the requirements. Humans will always need to guide that iterative process. Humans still need to prioritize and decide what the business actually needs. These higher-level activities can also benefit from an intelligent assistant. Engineering with AI will certainly get easier, but humans will still remain in the loop.

Additional Resources

For a primer on how to use ChatGPT to write code for you, please see my book Rapid Software Engineering with ChatGPT. It walks you through how to successfully design and build an entire application using AI.

Engineering with AI
