Reevaluating the small-scope testing hypothesis of answer set programs

Publikation: Beitrag in Buch/Bericht/KonferenzbandBeitrag in einem KonferenzbandBegutachtung

Abstract

As we increasingly rely on artificial intelligence systems, we must ensure that those systems are reliable and need to know how much we can rely on them. In software quality assurance, testing is a useful method to highlight and fix issues during development to avoid unexpected behavior after the system has been deployed. Artificial intelligence engineers are increasingly becoming aware of quality assurance as a requirement. Previous results in the area of answer set programming suggest that a high proportion of errors can be found when testing a program against a small scope, i.e. by inputs from a small domain. However, these results are based on assumptions that may be impractical for testing. To find out whether small scopes remain sufficient in practice, we evaluate several benchmarks against actual test oracles. Our findings suggest that small scopes can indeed find a high proportion of errors, but results depend on the observed benchmark and appropriate test oracles are required to achieve reliable scores.
Originalspracheenglisch
TitelTesting Software and Systems - 36th IFIP WG 6.1 International Conference, ICTSS 2024, Proceedings
Redakteure/-innenHéctor D. Menéndez, Gema Bello-Orgaz, Pepita Barnard, John Robert Bautista, Arya Farahi, Santanu Dash, DongGyun Han, Sophie Fortz, Victor Rodriguez-Fernandez
Herausgeber (Verlag)Springer, Cham
Seiten79–92
Seitenumfang14
ISBN (elektronisch)978-3-031-80889-0
ISBN (Print)978-3-031-80888-3
DOIs
PublikationsstatusVeröffentlicht - 2025

Publikationsreihe

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Band15383 LNCS
ISSN (Print)0302-9743
ISSN (elektronisch)1611-3349

ASJC Scopus subject areas

  • Theoretische Informatik
  • Allgemeine Computerwissenschaft

Fingerprint

Untersuchen Sie die Forschungsthemen von „Reevaluating the small-scope testing hypothesis of answer set programs“. Zusammen bilden sie einen einzigartigen Fingerprint.

Dieses zitieren