Abstrakti
Background: Large language models (LLM) suffer from various forms of biases due to the biased datasets used to train the models. At the same time, human cognitive biases have an equal propensity to express themselves when using LLMs for software engineering tasks. Software testing is a critical phase of the software development life cycle. Confirmation bias is reported to have deteriorated software testing by designing more specification-consistent test cases compared to specificationinconsistent test cases. However, there is a lack of debiasing (mitigation) strategies in this regard. Aims: In this paper, first, we investigate whether the LLM model suffers from confirmation bias while performing software testing tasks. Second, we propose a vision of debasing confirmation bias in software testing via LLM. Method: We conducted an empirical study to detect confirmation bias by an LLM (ChatGPT4.0) in the design of functional test cases. Based on empirical findings, we used the analytical paradigm to design a multi-agent system. Results: We present a vision for debiasing confirmation bias in functional software testing by leveraging LLMs via a multi-agent approach. Conclusions: The proposed vision may improve the performance of LLMs in terms of reduced confirmation bias and serve as a debiasing technique for functional software testing.
| Alkuperäiskieli | Englanti |
|---|---|
| Otsikko | 2025 ACM/IEEE International Symposium on Empirical Software Engineering and Measurement, ESEM 2025 |
| Kustantaja | IEEE |
| Sivut | 344-350 |
| ISBN (elektroninen) | 979-8-3315-9147-2 |
| DOI - pysyväislinkit | |
| Tila | Julkaistu - 2025 |
| OKM-julkaisutyyppi | A4 Artikkeli konferenssijulkaisussa |
| Tapahtuma | International symposium on empirical software engineering and measurement - Honolulu, Yhdysvallat Kesto: 2 lokak. 2025 → 3 lokak. 2025 |
Conference
| Conference | International symposium on empirical software engineering and measurement |
|---|---|
| Maa/Alue | Yhdysvallat |
| Kaupunki | Honolulu |
| Ajanjakso | 2/10/25 → 3/10/25 |
Julkaisufoorumi-taso
- Jufo-taso 2
Sormenjälki
Sukella tutkimusaiheisiin 'A Vision for Debiasing Confirmation Bias in Software Testing via LLM'. Ne muodostavat yhdessä ainutlaatuisen sormenjäljen.Siteeraa tätä
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver