week 51 - why developers implement OS-specific tests, does Treatment Adherence Impact in TDD and A framework for compliance rules for TDD

Marabesi Matheus

MSc, MBA, Software Craftsperson at Codurance

Published: November 23, 2024

Besouro: A framework for exploring compliance rules in automatic TDD behavior assessment

The improvements promoted by Test-Driven Design (TDD) have not been confirmed by quantitative assessment studies. To a great extent, the problem lies in the lack of a rigorous definition for TDD. An emerging approach has been to measure the conformance of TDD practices with the support of automated systems that embed an operational definition, which represent the specific TDD process assumed and the validation tests used to determine its presence and quantity. The empirical construction of TDD understanding and consensus building requires the ability of comparing different definitions, evaluating them with regard to practitioners’ perception, and exploring code information for improvement of automatic assessment.
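
To make the idea of an "operational definition" more concrete, here is a minimal sketch of how two competing definitions of a TDD-conformant episode could be encoded and compared on the same activity log. The event names (`edit_test`, `edit_prod`, `test_fail`, `test_pass`) and both rules are hypothetical and chosen only for illustration; they are not Besouro's actual rule set.

```python
from typing import Callable, List

# An episode is a chronological list of developer events.
# Event names are hypothetical, invented for this illustration.
Episode = List[str]
Rule = Callable[[Episode], bool]


def lenient_rule(episode: Episode) -> bool:
    """Conformant if test code is edited before production code
    and the episode ends with a passing test run."""
    if not episode or episode[-1] != "test_pass":
        return False
    if "edit_test" not in episode or "edit_prod" not in episode:
        return False
    return episode.index("edit_test") < episode.index("edit_prod")


def strict_rule(episode: Episode) -> bool:
    """Lenient rule plus: a failing test run must be observed
    after the first test edit and before the first production edit."""
    if not lenient_rule(episode) or "test_fail" not in episode:
        return False
    return (
        episode.index("edit_test")
        < episode.index("test_fail")
        < episode.index("edit_prod")
    )


def conformance(episodes: List[Episode], rule: Rule) -> float:
    """Fraction of episodes that satisfy the given rule."""
    if not episodes:
        return 0.0
    return sum(rule(e) for e in episodes) / len(episodes)


episodes = [
    ["edit_test", "test_fail", "edit_prod", "test_pass"],  # red, then green
    ["edit_test", "edit_prod", "test_pass"],               # test first, no red run
    ["edit_prod", "edit_test", "test_pass"],               # test last
]

print(f"lenient: {conformance(episodes, lenient_rule):.2f}")
print(f"strict:  {conformance(episodes, strict_rule):.2f}")
```

The same log yields a different conformance score under each rule, which is the kind of disagreement between definitions that the paper argues needs to be compared and evaluated.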

See full paper via ResearchGate (last accessed 15 Dec, 2024)

AgoneTest: Automated creation and assessment of Unit tests leveraging Large Language Models

Software correctness is crucial, with unit testing playing an indispensable role in the software development lifecycle. However, creating unit tests is time-consuming and costly, underlining the need for automation. Leveraging Large Language Models (LLMs) for unit test generation is a promising solution, but existing studies focus on simple, small-scale scenarios, leaving a gap in understanding LLMs' performance in real-world applications, particularly regarding integration and assessment efficacy at scale. Here, we present AgoneTest, a system focused on automatically generating and evaluating complex class-level test suites. Our contributions include a scalable automated system, a newly developed dataset for rigorous evaluation, and a detailed methodology for test quality assessment.

See full paper via ACM (Open Access) (last accessed 14 Dec, 2024)

How and why developers implement OS-specific tests

(1) We find that OS-specific tests are common: 56% of the analyzed Python projects have OS-specific tests and Windows is the most targeted OS. (2) We detect that OS verification happens more frequently in test decorators (65%) than in test code (35%). (3) OS-specific tests target a diversity of code, including file/directory, network, and permission/privilege. (4) Developers may perform multiple operations in OS-specific tests, including calling OS-specific APIs, mocking OS-specific objects, and suspending execution. (5) We find that OS-specific tests are implemented mostly to overcome unavailable external resources, unsupported standard libraries, and flaky tests.
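
As an illustration of finding (2), the sketch below shows the two places an OS check can live in a Python test: in a decorator, or inside the test body. It uses only the standard library `unittest` and `sys.platform`; the class and test names are made up for this example and do not come from the paper's dataset.

```python
import os
import sys
import unittest

IS_WINDOWS = sys.platform.startswith("win")


class OsSpecificTests(unittest.TestCase):
    # OS verification in a decorator: the check is declared up front and
    # the test is reported as skipped on every other platform.
    @unittest.skipUnless(IS_WINDOWS, "requires Windows path semantics")
    def test_windows_path_join(self):
        self.assertEqual(os.path.join("a", "b"), "a\\b")

    # OS verification in test code: the check happens at run time,
    # inside the body of the test itself.
    def test_posix_path_separator(self):
        if IS_WINDOWS:
            self.skipTest("POSIX-only assertion")
        self.assertEqual(os.sep, "/")
```

Saved in a test module, this can be run with `python -m unittest`. On any single platform exactly one of the two tests runs and the other is reported as skipped.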

See full paper via Springer (last accessed 22 Nov, 2024)
See full paper via GitHub (last accessed 22 Nov, 2024)

Does Treatment Adherence Impact Experiment Results in TDD?

Context: In software engineering (SE) experiments, the way in which a treatment is applied could affect results. Different interpretations of how to apply the treatment and decisions on treatment adherence could lead to different results when data are analysed.

Objective: This paper aims to study whether treatment adherence has an impact on the results of an SE experiment.

Method: The experiment used as test case for our research uses Test-Driven Development (TDD) and Incremental Test-Last Development (ITLD) as treatments. We reported elsewhere the design and results of such an experiment where 24 participants were recruited from industry. Here, we compare experiment results depending on the use of data from adherent participants or data from all the participants irrespective of their adherence to treatments.

Results: Only 40% of the participants adhere to both TDD protocol and to the ITLD protocol; 27% never followed TDD; 20% used TDD even in the control group; 13% are defiers (used TDD in ITLD session but not in TDD session). Considering that both TDD and ITLD are less complex than other SE methods, we can hypothesize that more complex SE techniques could get even lower adherence to the treatment.

Conclusion: Both TDD and ITLD are applied differently across participants. Training participants could not be enough to ensure a medium to large adherence of experiment participants. Adherence to treatments impacts results and should not be taken for granted in SE experiments.

See full paper via ResearchGate (last accessed 15 Dec, 2024)
