Transfer Learning for Reinforcement Learning Domains: A Survey. Matthew E. Taylor and Peter Stone. Journal of Machine Learning Research, 10(1):1633–1685, 2009.
Multiagent Systems: A survey from a machine learning perspective. Peter Stone and Manuela Veloso. Autonomous Robots, 8(3):345–383, July 2000.
TEXPLORE: Real-Time Sample-Efficient Reinforcement Learning for Robots. Todd Hester and Peter Stone. Machine Learning, 90(3):385–429, 2013.
Multiagent Traffic Management: A Reservation-Based Intersection Control Mechanism. Kurt Dresner and Peter Stone. In The Third International Joint Conference on Autonomous Agents and Multiagent Systems ...
Grounded Action Transformation for Robot Learning in Simulation. Josiah Hanna and Peter Stone.
Gaussian processes for sample efficient reinforcement learning with RMAX-like exploration. Tobias Jung and Peter Stone.
Though computers have surpassed humans at many tasks, especially computationally intensive ones, there are many tasks for which human expertise remains necessary and/or useful. For such tasks, it is ...
General Game Learning using Knowledge Transfer. Bikramjit Banerjee and Peter Stone.
UT Austin Villa is a robot soccer team that has competed in the annual RoboCup soccer competitions since 2003. The team has won several championships and has inspired research contributions spanning ...
Mobile Robot Planning using Action Language BC with an Abstraction Hierarchy. Shiqi Zhang, Fangkai Yang, Piyush Khandelwal, and Peter Stone. In Proceedings of the 13th International Conference on ...
In reinforcement learning (RL), a reward function that aligns exactly with a task's true performance metric is often sparse. For example, a true task metric might encode a reward of 1 upon success and ...
Imitation from observation (IfO) is the problem of learning directly from state-only demonstrations without having access to the demonstrator's actions. The lack of action information both ...