![]() |
RL via regressing relative values
UT RL Reading group November, 2024; slides RL via regressing relative values |
![]() |
A tutorial on RLHF
SCALAR/MIDI Lab March, 2024; slides TLDR: A tutorial on DPO/cDPO/IPO/RSO. |
![]() |
RLRG presentation on Multi-Agent Diagnostics for Robustness via Illuminated Diversity
RLRG Feb 16, 2024; slides TLDR: How can diversity search methods be used to find adversarial jailbreaks of a policy? |
![]() |
RLRG presentation on Bridging Reinforcement Learning Theory and Practice with the Effective Horizon
RLRG September 1, 2023; slides TLDR: Bridging RL Theory and Practice. |
![]() |
RLRG presentation on Extreme Q-Learning
RLRG Feb 22, 2023; slides TLDR: A novel framework for Q-learning that models the maximal soft-values without needing to sample from a policy. |
![]() |
Research Preparation Exam at UT Austin
Research Preparation Exam April 25, 2023; slides rank-game: A unifying theory for learning from preferences and demonstrations. |
![]() |
RLRG presentation on The Information Geometry of Unsupervised Reinforcement Learning
RLRG September 21, 2022; slides TLDR: An explanation of mutual-information-based representation learning in RL. |
![]() |
Discussion on the utility of parallel and differentiable simulators
SCALAR Lab; slides TLDR: How can parallel and differentiable simulators speed up reinforcement learning? |