Classically, imitation learning algorithms have been developed for idealized situations, e.g., the demonstrations are often required to be collected in the exact same environment and usually include ...
As president in my sophomore year, CSB tried to do a couple of things. One was social bonding—bringing everyone together through socials and having fun. The other part was sourcing career networking ...
Recent work has shown that deep neural networks are capable ofapproximating both value functions and policies in reinforcementlearning domains featuring continuous state and actionspaces. However, to ...
Transfer learning is a method where an agent reuses knowledge learned in a source task to improve learning on a target task. Recent work has shown that transfer learning can be extended to the idea of ...
Though computers have surpassed humans at many tasks, especially computationally intensive ones, there are many tasks for which human expertise remains necessary and/or useful. For such tasks, it is ...
Transfer Learning for Reinforcement Learning Domains: A Survey. Matthew E. Taylor and Peter Stone. Journal of Machine Learning Research, 10(1):1633–1685, 2009.
Multiagent Traffic Management: A Reservation-Based Intersection Control Mechanism. Kurt Dresner and Peter Stone. In The Third International Joint Conference on Autonomous Agents and Multiagent Systems ...
My research interests are in the area of machine learning for speech, language, and sound processing. I am particularly interested in multimodality and unsupervised ...
Multiagent Systems: A survey from a machine learning perspective. Peter Stone and Manuela Veloso. Autonomous Robots, 8(3):345–383, July 2000. @Article(MASsurvey ...
Gaussian processes for sample efficient reinforcement learning with RMAX-like exploration. Tobias Jung and Peter Stone. @InProceedings{ECML10-jung, author = "Tobias Jung and Peter Stone", title = ...
The goal of this study is to explore which aspects of people’s analytical decision making are affected when ex- posed to music. To this end, we apply a stochastic sequen- tial model of simple ...
一些您可能无法访问的结果已被隐去。
显示无法访问的结果