PyResBugs: A Dataset of Residual Python Bugs for Natural Language-Driven Fault Injection

PyResBugs is a dataset of residual bugs from Python frameworks, paired with fixed versions and annotated descriptions, enabling natural language-driven fault injection for AI-driven automated testing.

Year: 2025
Venue: arXiv 2025
Authors: 3

Abstract & full text: arxiv.org/abs/2505.05777

Abstract

This paper presents PyResBugs, a curated dataset of residual bugs, i.e., defects that persist undetected during traditional testing but later surface in production, collected from major Python frameworks. Each bug in the dataset is paired with its corresponding fault-free (fixed) version and annotated with multi-level natural language (NL) descriptions. These NL descriptions enable natural language-driven fault injection, offering a novel approach to simulating real-world faults in software systems. By bridging the gap between software fault injection techniques and real-world representativeness, PyResBugs provides researchers with a high-quality resource for advancing AI-driven automated testing in Python systems.
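
To make the pairing described in the abstract concrete, the following is a minimal sketch of how one entry might be represented and turned into a fault-injection prompt. It is not the dataset's actual schema: the class name ResidualBugRecord, its field names, the level keys "high" and "low", the helper build_injection_prompt, and the toy clamp function are all assumptions made for illustration.

```python
from dataclasses import dataclass, field
import difflib


@dataclass
class ResidualBugRecord:
    """Hypothetical record layout; field names are illustrative only."""
    framework: str
    faulty_code: str   # version containing the residual bug
    fixed_code: str    # corresponding fault-free version
    # NL descriptions keyed by level of detail (level names are assumed)
    nl_descriptions: dict[str, str] = field(default_factory=dict)

    def fault_diff(self) -> str:
        """Unified diff from the fixed version to the faulty one."""
        return "".join(difflib.unified_diff(
            self.fixed_code.splitlines(keepends=True),
            self.faulty_code.splitlines(keepends=True),
            fromfile="fixed",
            tofile="faulty",
        ))


def build_injection_prompt(record: ResidualBugRecord, level: str) -> str:
    """Combine fault-free code with one NL description into a prompt.

    Raises KeyError if the record has no description at that level.
    """
    description = record.nl_descriptions[level]
    return (
        "Inject the following fault into the code below.\n"
        f"Fault description: {description}\n"
        "Code:\n"
        f"{record.fixed_code}"
    )


# Toy entry invented for this sketch; it is not taken from PyResBugs.
record = ResidualBugRecord(
    framework="example-framework",
    fixed_code="def clamp(x, lo, hi):\n    return max(lo, min(x, hi))\n",
    faulty_code="def clamp(x, lo, hi):\n    return max(lo, min(x, lo))\n",
    nl_descriptions={
        "high": "The clamp function ignores its upper bound.",
        "low": "Replace the 'hi' argument of min() with 'lo' in clamp.",
    },
)

print(record.fault_diff())
print(build_injection_prompt(record, "low"))
```

In this sketch the faulty/fixed pair serves as ground truth: the diff shows the fault a model would be expected to reproduce when given only the fault-free code and an NL description.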
