5 Comments
User's avatar
ToxSec's avatar

Incredibly in-depth article on this subject. I feel like i can re-read this a few times to fully get all the useful information here.

Cameron R. Wolfe, Ph.D.'s avatar

Thanks so much for reading! Hope it was helpful!

ToxSec's avatar

it absolutely was!

Hodman Murad's avatar

This is very cool and very needed. It's important that we design agent evaluations that don't accidentally reward cheating

Cameron R. Wolfe, Ph.D.'s avatar

Totally agree, thanks for reading!