site:www.cs.utexas.edu

www.cs.utexas.edu1d

CS388C: Combinatorics and Graph Theory (Spring 2023)

Prerequisites: Graduate standing or consent of instructor. Many students find this course difficult, so a first-rate math background is highly recommended. See the Review Sheet for material you're ...

Department of Computer Science - University of Texas at Austin2d

E. Allen Emerson Memorial Seminar

E. Allen Emerson, a beloved and long-serving member of the faculty in Computer Science at UT Austin and a renowned Computer ...

www.cs.utexas.edu3d

Artificial Intelligence and Life in 2030

Artificial Intelligence and Life in 2030. Peter Stone, Rodney Brooks, Erik Brynjolfsson, Ryan Calo, Oren Etzioni, Greg Hager, Julia Hirschberg, Shivaram ...

www.cs.utexas.edu3d

The Perils of Trial-and-Error Reward Design: Misdesign through Overfitting and Invalid Task Specifications

In reinforcement learning (RL), a reward function that aligns exactly with a task's true performance metric is often sparse. For example, a true task metric might encode a reward of 1 upon success and ...

www.cs.utexas.edu3d

Policy Gradient Reinforcement Learning for Fast Quadrupedal Locomotion

Policy Gradient Reinforcement Learning for Fast Quadrupedal Locomotion. Nate Kohl and Peter Stone.

www.cs.utexas.edu3d

Impact of Music on Decision Making in Quantitative Tasks

The goal of this study is to explore which aspects of people’s analytical decision making are affected when ex- posed to music. To this end, we apply a stochastic sequen- tial model of simple ...

www.cs.utexas.edu3d

Multiagent Systems: A survey from a machine learning perspective

Multiagent Systems: A survey from a machine learning perspective. Peter Stone and Manuela Veloso. Autonomous Robots, 8(3):345–383, July 2000. @Article(MASsurvey, Author="Peter Stone and Manuela Veloso ...

www.cs.utexas.edu3d

Dynamic Behaviors on the NAO Robot With Closed-Loop Whole Body Operational Space Control

Exploiting full-body dynamics in feedback control can enhance the balancing capability of a legged system using various techniques such as Whole-Body Control (WBC) or Centroidal Momentum control.

www.cs.utexas.edu4d

Imitation Learning from Video by Leveraging Proprioception

Classically, imitation learning algorithms have been developed for idealized situations, e.g., the demonstrations are often required to be collected in the exact same environment and usually include ...

www.cs.utexas.edu4d

Boosting for Regression Transfer

David Pardoe and Peter Stone.

www.cs.utexas.edu4d

Deep Reinforcement Learning in Parameterized Action Space

Recent work has shown that deep neural networks are capable ofapproximating both value functions and policies in reinforcementlearning domains featuring continuous state and actionspaces. However, to ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results