home

author: niplav, created: 2023-12-04, modified: 2023-12-04, language: english, status: notes, importance: 7, confidence: certain

.

Too Good to be True: Training an RL Agent to be Suspicious

Code based on this tutorial, trying to implement the experiment detailed in Yudkowsky 2017.