Harshit Sikchi

	RL via regressing relative values UT RL Reading group November, 2024; slides RL via regressing relative values
	A tutorial on RLHF SCALAR/MIDI Lab March, 2024; slides TLDR: A tutorial on DPO/cDPO/IPO/RSO.
	RLRG presentation on Multi-Agent Diagnostics for Robustness via Illuminated Diversity RLRG Feb 16, 2024; slides TLDR: How can diversity search methods be used to find adversarial jailbreaks for a policy?
	RLRG presentation on Bridging Reinforcement Learning Theory and Practice with the Effective Horizon RLRG September 1, 2023; slides TLDR: Bridging RL Theory and Practice.
	RLRG presentation on Extreme Q-Learning RLRG Feb 22, 2023; slides TLDR: A novel framework for Q-learning that models the maximal soft-values without needing to sample from a policy.
	Research Preparation Exam at UT Austin Research Preparation Exam April 25, 2023; slides rank-game: A unifying theory for learning from preferences and demonstrations.
	RLRG presentation on The Information Geometry of Unsupervised Reinforcement Learning RLRG September 21, 2022; slides TLDR: An explanation of mutual-information-based representation learning in RL.
	Discussion on the utility of parallel and differentiable simulators SCALAR Lab; slides TLDR: How can parallel and differentiable simulators speed up reinforcement learning?