Improved Regret Bounds for Undiscounted Continuous Reinforcement Learning

Lakshmanan Kailasam, Ronald Ortner, Daniil Ryabko

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Original languageEnglish
Title of host publicationProceedings of The 32nd International Conference on Machine Learning
Publication statusPublished - 2015

Cite this