登录查看更多内容

Automated Test Case Generation - LLM-Based Software Engineering

Rick Banerjee

Senior Principal Member of Technical Staff at Oracle

发布日期: 2024年6月1日

+ 关注

First there was Dev. Just Dev.

Some years passed, and this changed to DevOps. Dev + Ops. Two worlds, but one person inhabiting them both.

Now, we've got DevSecOps. Dev + Security + Ops. Three worlds, and one person scurrying from one to the other at all times. With two-pizza teams, the Tech downturn developers are stretched to capacity. It reminds me of one of the characters from Hayao Miyazaki's film Spirited Away - Kamaji. Kamaji is the boiler man serving the bath house where most of the film is set. He is a yōkai[1], half-spider, half man.

If there's a desperate need for help somewhere, it is to come to the aid of the developer. Can GenAI help?

A paper authored by engineers from Meta hopes so. The paper is titled Automated Unit Test Improvement using Large Language Models at Meta[2]. Taking care to remind readers that the aim is augmenting human capacity, not replacing them, the authors lay out a novel use of GenAI in the software development lifecycle. Enhancing unit tests.

Yes, unit tests, that highly necessary but often involved aspect of software development. You'll hear many a developer (including me) say nowadays with mocks, spys and the like sometimes good, thorough unit tests take longer than the actual code.

This paper feeds in the class under test and the existing unit test as prompts to the LLM, requesting that the test class be enhanced. Out comes the enhanced test class. Just as distributed systems designers have trained themselves to design assuming failure, AI application developers are encouraged to design assuming hallucinations.

Here, taking direct aim at hallucinations, the authors propose a three-phase filtration process.

The generated test class must -

1. Compile correctly

领英推荐

The ROI in Testing

LambdaTest 2 个月前

Is Test Generation There Yet?

LambdaTest 1 个月前

Writing Tests That Actually Test

LambdaTest 1 年前

2. Is not flaky (i.e. if run N times, will stably pass all N times)

3. Improve test coverage

If the generated test class meets all three criteria, it is presented as a diff, to be merged to the code base.

This could pair nicely with a code-specific foundational model (such as the new IBM Granite granite-34b-code-instruct[3]).

Kamaji would be happy to have such help!

[1] Yōkai are a class of supernatural entities and spirits in Japanese folklore.

[2] https://arxiv.org/abs/2402.09171

[3] https://www.ibm.com/products/watsonx-ai/foundation-models

要查看或添加评论，请登录

Rick Banerjee的更多文章

Index me this

2025年2月16日

Index me this

Let's play a game. I'll offer a word and you reply with the first word that comes to your mind when you hear my word.

2 条评论
Information Batteries in the age of abundant renewable energy

2025年1月29日

Information Batteries in the age of abundant renewable energy

Where does the term "battery" come from? Long before there were electrical batteries, there were armies using guns and…

1 条评论
Processing enormous streams of stuff

2024年8月31日

Processing enormous streams of stuff

What do you remember from the recently concluded Paris Olympics? Manu Bhaker's medals might come to mind. The Indian…

1 条评论
Deception Environments: Beyond Chaos Testing

2024年7月20日

Deception Environments: Beyond Chaos Testing

One of Malcolm Gladwell's Revisitionist History podcast episodes is titled - "Taxonomy of the Modern Mystery Story"[1].…

1 条评论
Uncertainty Everywhere: From electrons to your search results

2024年7月13日

Uncertainty Everywhere: From electrons to your search results

The model of the atom before 1926 was neat and precise. The nucleus at the center, like the Sun in the solar system.
What does the great train robbery of the AI age look like?

2024年6月22日

What does the great train robbery of the AI age look like?

On the night of June 12, 1924, U.S.
Murphy's Law as a shield against Ransomware

2024年6月8日

Murphy's Law as a shield against Ransomware

On Wednesday February 21, 2024, Change Healthcare (a subsidiary of the UnitedHealth Group) faced massive system outages…

1 条评论

See all articles

Automated Test Case Generation - LLM-Based Software Engineering

Rick Banerjee

Senior Principal Member of Technical Staff at Oracle

领英推荐

Rick Banerjee的更多文章

社区洞察

其他会员也浏览了

Impact of AI on Software Development And Testing – Ethical and Productivity Implications of Intelligent Code Creation (ICC)

TestDevLab's Mid-January Newsletter 2025 ??

Where Does GenAI Fit into the Modern Developer Stack?

The Developer Productivity Engineer - November 2024

Level Up Your Developer Experience (DevEx) with GenAI

Revolutionizing Software Engineering with LLMs

Revolutionizing Software Engineering with LLMs

AI-Native Engineering: The Future of Software Development

AI Cookbook: Intellias Comprehensive Guide

Notes on "A Formal Analysis of Iterated TDD"

领英推荐

Rick Banerjee的更多文章

Index me this

Information Batteries in the age of abundant renewable energy

Processing enormous streams of stuff

Deception Environments: Beyond Chaos Testing

Uncertainty Everywhere: From electrons to your search results

What does the great train robbery of the AI age look like?

Murphy's Law as a shield against Ransomware

社区洞察

其他会员也浏览了

Impact of AI on Software Development And Testing – Ethical and Productivity Implications of Intelligent Code Creation (ICC)

TestDevLab's Mid-January Newsletter 2025 ??

Where Does GenAI Fit into the Modern Developer Stack?

The Developer Productivity Engineer - November 2024

Level Up Your Developer Experience (DevEx) with GenAI

Revolutionizing Software Engineering with LLMs

Revolutionizing Software Engineering with LLMs

AI-Native Engineering: The Future of Software Development

AI Cookbook: Intellias Comprehensive Guide

Notes on "A Formal Analysis of Iterated TDD"