Formal Specification and Testing for Reinforcement Learning (ICFP 2023 - ICFP Papers and Events)

Mon 4 - Sat 9 September 2023 Seattle, Washington, United States

Who

Mahsa Varshosaz, Mohsen Ghaffari, Einar Broch Johnsen, Andrzej Wąsowski

Track

ICFP 2023 ICFP Papers and Events

Time Zone

The program is currently displayed in (GMT-07:00) Pacific Time (US & Canada).

Use conference time zone: (GMT-07:00) Pacific Time (US & Canada)Select other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

By setting a time band, the program will dim events that are outside this time window. This is useful for (virtual) conferences with a continuous program (with repeated sessions).
The time band will also limit the events that are included in the personal iCalendar subscription service.

Display full programSpecify a time band

Save

When

Wed 6 Sep 2023 11:30 - 12:00 at B - Fifth Avenue - Testing Chair(s): Stephen Dolan

Abstract

The development process for reinforcement learning applications is still exploratory rather than systematic. This exploratory nature reduces reuse of specifications between applications and increases the chances of introducing programming errors. This paper takes a step towards systematizing the development of reinforcement learning applications. We introduce a formal specification of reinforcement learning problems and algorithms, with a particular focus on temporal difference methods and their definitions in backup diagrams. We further develop a test harness for a large class of reinforcement learning applications based on temporal difference learning, including SARSA and Q-learning. The entire development is rooted in functional programming methods; starting with pure specifications and denotational semantics, ending with property-based testing and using compositional interpreters for a domain-specific term language as a test oracle for concrete implementations. We demonstrate the usefulness of this testing method on a number of examples, and evaluate with mutation testing. We show that our test suite is effective in killing mutants (90% mutants killed for 75% of subject agents). More importantly, almost half of all mutants are killed by generic write-once-use-everywhere tests that apply to \emph{any} reinforcement learning problem modeled using our library, without any additional effort from the programmer.

DOI

https://doi.org/10.1145/3607835

Mahsa Varshosaz

IT University of Copenhagen, Denmark

Denmark

Mohsen Ghaffari

IT University of Copenhagen

Denmark

Einar Broch Johnsen

University of Oslo

Andrzej Wąsowski

IT University of Copenhagen, Denmark

Denmark

Time Zone

The program is currently displayed in (GMT-07:00) Pacific Time (US & Canada).

Use conference time zone: (GMT-07:00) Pacific Time (US & Canada)Select other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

Display full programSpecify a time band

Save

Session Program

Wed 6 Sep
Displayed time zone: Pacific Time (US & Canada) change

10:30 - 12:00	TestingICFP Papers and Events at B - Fifth Avenue Chair(s): Stephen Dolan Jane Street

10:30 30m Talk		Reflecting on Random GenerationDistinguished Paper ICFP Papers and Events Harrison Goldstein University of Pennsylvania, Samantha Frohlich University of Bristol, Meng Wang University of Bristol, Benjamin C. Pierce University of Pennsylvania DOI
11:00 30m Talk		Etna: An Evaluation Platform for Property-Based Testing (Experience Report)Experience Report ICFP Papers and Events Jessica Shi University of Pennsylvania, Alperen Keles University of Maryland at College Park, Harrison Goldstein University of Pennsylvania, Benjamin C. Pierce University of Pennsylvania, Leonidas Lampropoulos University of Maryland, College Park DOI
11:30 30m Talk		Formal Specification and Testing for Reinforcement LearningRemote ICFP Papers and Events Mahsa Varshosaz IT University of Copenhagen, Denmark, Mohsen Ghaffari IT University of Copenhagen, Einar Broch Johnsen University of Oslo, Andrzej Wąsowski IT University of Copenhagen, Denmark DOI