Improved Regret Bounds for Undiscounted Continuous Reinforcement Learning

Lakshmanan Kailasam, Ronald Ortner, Daniil Ryabko

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Filter
Participation in conference

Search results