Blake Wulfe
Blake Wulfe
Home
Publications
Blog
Projects
Resume
reward learning
Dynamics-Aware Comparison of Learned Reward Functions
We propose a method for quantifying the similarity of learned reward functions without performing policy learning and evaluation.
Cite
×