Fewshot Corp.

We are working in reward hacking detection and mitigation. We have a concrete research agenda to study reward hacking tasks to complete our empirical study measuring how reward visibility affects hacking behavior, demonstrate whether RL training systematically amplifies reward hacking, and establish actionable guidelines for test design.