site:www.cs.utexas.edu

Transfer Learning for Reinforcement Learning Domains: A Survey

Transfer Learning for Reinforcement Learning Domains: A Survey. Matthew E. Taylor and Peter Stone. Journal of Machine Learning Research, 10(1):1633–1685, 2009.

www.cs.utexas.edu · 6 days ago

Multiagent Systems: A survey from a machine learning perspective

Multiagent Systems: A survey from a machine learning perspective. Peter Stone and Manuela Veloso. Autonomous Robots, 8(3):345–383, July 2000.

www.cs.utexas.edu · 6 days ago

TEXPLORE: Real-Time Sample-Efficient Reinforcement Learning for Robots

TEXPLORE: Real-Time Sample-Efficient Reinforcement Learning for Robots. Todd Hester and Peter Stone. Machine Learning, 90(3):385–429, 2013.

www.cs.utexas.edu · 6 days ago

Multiagent Traffic Management: A Reservation-Based Intersection Control Mechanism

Multiagent Traffic Management: A Reservation-Based Intersection Control Mechanism. Kurt Dresner and Peter Stone. In The Third International Joint Conference on Autonomous Agents and Multiagent Systems ...

www.cs.utexas.edu · 6 days ago

Grounded Action Transformation for Robot Learning in Simulation

Grounded Action Transformation for Robot Learning in Simulation. Josiah Hanna and Peter Stone. @InProceedings{AAAI17-Hanna, author = {Josiah Hanna and Peter Stone}, title = {Grounded Action ...

www.cs.utexas.edu · 6 days ago

Gaussian processes for sample efficient reinforcement learning with RMAX-like exploration

Gaussian processes for sample efficient reinforcement learning with RMAX-like exploration. Tobias Jung and Peter Stone. @InProceedings{ECML10-jung, author = "Tobias Jung and Peter Stone", title = ...

www.cs.utexas.edu · 6 days ago

TAMER: Training an Agent Manually via Evaluative Reinforcement

Though computers have surpassed humans at many tasks, especially computationally intensive ones, there are many tasks for which human expertise remains necessary and/or useful. For such tasks, it is ...

www.cs.utexas.edu · 6 days ago

General Game Learning using Knowledge Transfer

General Game Learning using Knowledge Transfer. Bikramjit Banerjee and Peter Stone.

www.cs.utexas.edu · 5 days ago

UT Austin Villa: Project-Driven Research in AI and Robotics

UT Austin Villa is a robot soccer team that has competed in the annual RoboCup soccer competitions since 2003. The team has won several championships and has inspired research contributions spanning ...

www.cs.utexas.edu · 6 days ago

Mobile Robot Planning using Action Language BC with an Abstraction Hierarchy

Mobile Robot Planning using Action Language BC with an Abstraction Hierarchy. Shiqi Zhang, Fangkai Yang, Piyush Khandelwal, and Peter Stone. In Proceedings of the 13th International Conference on ...

www.cs.utexas.edu · 6 days ago

The Perils of Trial-and-Error Reward Design: Misdesign through Overfitting and Invalid Task ...

In reinforcement learning (RL), a reward function that aligns exactly with a task's true performance metric is often sparse. For example, a true task metric might encode a reward of 1 upon success and ...

www.cs.utexas.edu · 6 days ago

Generative Adversarial Imitation from Observation

Imitation from observation (IfO) is the problem of learning directly from state-only demonstrations without having access to the demonstrator's actions. The lack of action information both ...
