Scenario-Based Testing of Automated Robotic Systems

STARS is a formal framework for analyzing and measuring scenario coverage in testing automated robotic systems. It enables users to formally define environment features and classify recorded data into distinct scenario classes using Tree-based Scenario Classifiers (TSCs).

By expressing scenario features in temporal logic and combining them hierarchically, STARS automatically identifies which scenarios were encountered in recorded data — and which are still missing. Through logically defined monitors, it can also validate requirements and provide insights into the simultaneous occurrence of features and requirement violations.

The framework computes metrics such as scenario coverage, feature occurrence, and scenario distributions, supporting safety assessment in regards to standards like ISO 21448 and UL 4600.

STARS is domain-agnostic, highly customizable, and implemented in Kotlin for seamless interoperability with JVM-based languages.

Repository: github.com/tudo-aqua/stars

Researchers

Till Schallau

Till Schallau

Till Schallau received the M.Sc. degree in Computer Science from TU University, Germany, in 2019. Since then, he has been a research associate (Ph.D.) at the Chair for Software Engineering in Prof. Dr. Falk Howar's group, TU Dortmund University, Germany. His current research interests include the formal specification of scenario-based testing of autonomous systems as well as utilizing domain-specific languages.

Dominik Schmid

Dominik Schmid

Dominik Schmid received the M.Sc. degree in Computer Science from TU University, Germany, in 2023. During his Master’s thesis, he worked as a research associate at the Institute for Transport Logistics in Prof. Dr. Uwe Clausen's group for mathematical optimization, TU Dortmund University, Germany. Since then, he has been a research associate (Ph.D.) at the Chair for Software Engineering in Prof. Dr. Falk Howar's group, TU Dortmund University, Germany. His current research focuses on the formal specification of traffic and the verification of autonomated driving systems using temporal logic to classify driving scenarios in recorded and live data traces from the automotive domain.

Nick Pawlinorz

Nick Pawlinorz

Nick Pawlinorz received his B.Sc. degree in Computer Science from TU Dortmund University, Germany, in 2024. His Bachelor's thesis, titled 'Extraktion von formal Analysierbaren Fahrdaten aus Computerspielen am Beispiel von GTA V', was related to the STARS project. He began working for the Automated Quality Assurance group under Prof. Dr. Falk Howar in 2023. He is currently pursuing his M.Sc. degree at TU Dortmund University and is involved in several research projects within the STARS context.

Dr. Stefan Naujokat

Dr. Stefan Naujokat

Stefan Naujokat received the Dipl.-Inf. and Dr.-Ing. degrees from the Department of Computer Science at TU Dortmund University, Germany, in 2009 and 2017, respectively. He was a Ph.D. student and postdoc at the Chair for Programming Systems until 2019 and currently is senior researcher at the Chair for Software Engineering in Prof. Dr. Falk Howar's group, TU Dortmund University, Germany. His dissertation focused on simplifying language workbenches to make the development of visual domain-specific languages more accessible to a wider audience of language engineers. His research interests furthermore include metamodeling, code generation, and bridging the gap between formal specifications and domain-specific languages.

Prof. Dr. Falk Howar

Prof. Dr. Falk Howar

Falk Howar is a professor for Software Engineering in the Department of Computer Science at TU Dortmund University and Coordinator of Software Engineering Research at Fraunhofer ISST. His research focuses on the safety and security of intelligent and autonomous software systems. He is particularly interested in the use of formal methods to analyze the behavior of such systems. After studying computer science and earning his doctorate at TU Dortmund University, Falk Howar first worked in the USA at Carnegie Mellon University (Silicon Valley) and at NASA Ames Research Center, where he developed methods for testing an autonomous air traffic control system. Subsequently, he was on the management board of the Institute for Applied Software Systems Engineering at TU Clausthal. There, he conducted research on safe autonomous driving functions together with partners from the automotive industry.

Publications

A Qualitative Analysis of the German Road Traffic Regulations (StVO) for Automated Driving

Dominik Schmid, Till Schallau, Nick Pawlinorz, Stefan Naujokat, Robin Philipp, Zhijing Zhu, Falk Howar

In: IEEE International Automated Vehicle Validation Conference (IAVVC 2025)

2025

DOI: https://doi.org/10.1109/IAVVC61942.2025.11219635

Preprint: Download

Abstract

An Automated Driving Systems (ADS) must comply with traffic regulations that are still primarily written for human drivers, such as the German Road Traffic Regulations (StVO). Automatically assessing legal compliance remains a key challenge, as existing formalization approaches typically cover only limited rule subsets. In this paper, we present a comprehensive qualitative analysis of the StVO, focusing on regulations relevant to passenger vehicles. Using inductive coding, we categorize legal concepts, quantify the relevant share of the regulation that apply to ADSs, and identify recurring sentence structures. The analysis highlights structural limitations such as vague formulations, implicit assumptions, and missing quantification that hinder formalization and verification of ADSs. Based on our results, we motivate recommendations for future legislation that is better suited for automated interpretation and monitoring.

Post-hoc Scenario-based Testing of Automated Driving Systems: Classification of Driving Scenarios and Checking of Functional Requirements in Recorded Data

Till Schallau, Dominik Schmid, Nick Pawlinorz, Harun Teper, Stefan Naujokat, Jian-Jia Chen, Falk Howar

In: Intelligent Vehicles (IV 2025)

2025

DOI: https://doi.org/10.1109/IV64158.2025.11097740

Preprint: Download

Abstract

We present a post-hoc approach for scenario-based testing of automated driving systems, enabling the analysis of safety and correctness for (cooperative) automated driving systems in many scenarios without conducting tests for individual scenarios. The system under test is operated in its physical environment’ and data is recorded during operation. Then, driving scenarios are identified in this data and functional requirements are checked, yielding pass or fail verdicts for individual scenarios. We validate the envisioned post-hoc approach in a single-case mechanism experiment by the example of a platooning controller, identifying a previously unknown bug in the tested system, as well as a functional insufficiency concerning the intended operational design domain.

Extended Abstract of Poster: STARS: Tree-Based Classification and Testing of Feature Combinations in the Automated Robotic Domain

Till Schallau, Dominik Schmid, Nick Pawlinorz, Stefan Naujokat, Falk Howar

In: ICST Workshops (IWCT 2025)

2025

DOI: https://doi.org/10.1109/ICSTW64639.2025.10962523

Preprint: Download

Abstract

Testing complex systems is crucial for ensuring safety, especially in automated driving, where diverse data sources and variable environments pose challenges. Here, robust safety validation is critical but exhaustive n-way combinatorial testing is impractical due to the vast number of test cases. The STARS framework uses tree-based scenario classifiers to limit feature combinations in a given domain.

Tree-Based Scenario Classification

Till Schallau, Stefan Naujokat, Fiona Kullmann, Falk Howar

In: NASA Formal Methods (NFM 2024)

2024

DOI: https://doi.org/10.1007/978-3-031-60698-4_15

Preprint: Download

Abstract

Scenario-based testing is envisioned as a key approach for the safety assurance of automated driving systems. In scenario-based testing, relevant (driving) scenarios are the basis of tests. Many recent works focus on specification, variation, generation, and execution of individual scenarios. In this work, we address the open challenges of classifying sets of recorded test drives into such scenarios and measuring scenario coverage in these test drives. Technically, we specify features in logic formulas over complex data streams and construct tree-based classifiers for scenarios from these feature specifications. For such specifications, we introduce CMFTBL, a new logic that extends existing linear-time temporal logics with aspects that are essential for concise specifications that work on field-recorded data. We demonstrate the expressiveness and effectiveness of our approach by defining a family of related scenario classifiers for different aspects of urban driving.

STARS: A Tool for Measuring Scenario Coverage When Testing Autonomous Robotic Systems

Till Schallau, Dominik Mäckel, Stefan Naujokat, Falk Howar

In: Dependable Computing - EDCC 2024 Workshops (EDCC 2024)

2024

DOI: https://doi.org/10.1007/978-3-031-56776-6_6

Preprint: Download

Abstract

Extensive testing and simulation in different environments has been suggested as one piece of evidence for the safety of autonomous systems, e.g., in the automotive domain. To enable statements on the absolute number or fractions of tested scenarios, methods and tools for computing their coverage are needed. In this paper, we present STARS, a tool for specifying semantic environment features and measuring scenario coverage when testing autonomous systems.

Validating Behavioral Requirements, Conditions, and Rules of Autonomous Systems with Scenario-Based Testing

Till Schallau, Stefan Naujokat

In: Electronic Communications of the EASST (ECEASST 2023)

2023

DOI: https://doi.org/10.14279/tuj.eceasst.82.1222

Preprint: Download

Abstract

Assuring the safety of autonomous vehicles is more and more approached by using scenario-based testing. Relevant driving situations are utilized here to fuel the argument that an autonomous vehicle behaves correctly. Many recent works focus on the specification, variation, generation, and execution of individual scenarios. However, it is still an open question if operational design domains, which describe the environmental conditions under which the system under test has to function, can be assessed with scenario-based testing. In this paper, we present open challenges and resulting research questions in the field of assuring the safety of autonomous vehicles. We have developed a toolchain that enables us to conduct scenario-based testing experiments based on scenario classification with temporal logic and driving data obtained from the CARLA simulator. We discuss the toolchain and present first results using analysis metrics like class coverage or distribution.

Theses

We include many students into our research work. Here is a list of current and previous supervised theses in the STARS context.

Current Theses

Scenario Generation based on missing feature combinations using SCENIC

Bachelor's Thesis

Analyse der Diversität des INTERACTION-Datensatzes

Bachelor's Thesis

Erstellung eines Autopiloten in CARLA mit einstellbarer Fahrqualität

Bachelor's Thesis

Implementation of a Traffic Data Acquisition Prototype for Scenario-Based Testing of Automated Driving Systems

Masters's Thesis

Eine Qualitative Analyse der Autonome-Fahrzeuge-Genehmigungs-und-Betriebs-Verordnung - AFGBV für Automatische Fahrsysteme

Bachelor's Thesis

Evaluation einer blockgrenzenübergreifenden Segmentierung zur Analyse von Fahrdaten im STARS-Framework

Bachelor's Thesis

Supervised Theses

Risikobewertung von Szenarien in STARS mittels unfallrelevanter Fahrmetriken

Bachelor's Thesis - 2025

A Proof-Logic-Based Explainable Model Checker for the Temporal Logic CMFTBL

Master's Thesis - 2025

Model Checking von CMFTBL-Formeln mit SMT-Solvern am Beispiel von automatisierten Fahrsystemen

Bachelor's Thesis - 2025

Strategien zur Segmentierung von Simulationsfahrten für formale Analysen mit STARS

Bachelor's Thesis - 2025

Continuous TSC evaluation and visualization for driving in CARLA

Bachelor's Thesis - 2025

Aufbau und Evaluierung einer quellenunabhängigen Schnittstelle für das STARS-Framework durch Nutzung einer ROS2-Umgebung am Beispiel des CARLA-Simulators

Bachelor's Thesis - 2024

Analyse der Szenario-Diversität des KITTI-Datensatzes mit dem STARS Framework

Bachelor's Thesis - 2024

Testfallerstellung für STARS Scenario Features durch effizientes Labeln von Fahrdaten in Videospielen

Bachelor's Thesis - 2024

Strategies for extracting static map data from video games for formal analysis with STARS

Bachelor's Thesis - 2024

Design und Implementierung einer DSL zur vereinfachten Beschreibung von Szenarien zum simulationsbasierten Testen von autonomen Fahrzeugen

Master's Thesis - 2024

Szenario-basierte Auswertung von formalisierten Verkehrsregeln für Autobahnverkehr mit dem STARS Framework

Bachelor's Thesis - 2023

Extraktion von formal Analysierbaren Fahrdaten aus Computerspielen am Beispiel von Videospielen

Bachelor's Thesis - 2023