Researchers at the U.S. Army Research Laboratory and the University of Texas at Austin have developed new techniques for robots or computer programs to learn how to perform tasks by interacting with a human instructor.
The findings of the study will be presented and published at the Association for the Advancement of Artificial Intelligence Conference in New Orleans, Louisiana, Feb. 2-7.
ARL and UT researchers considered a specific case where a human provides real-time feedback in the form of critique. First introduced by collaborator Dr. Peter Stone, a professor at the University of Texas at Austin, along with his former doctoral student, Brad Knox, as TAMER, or Training an Agent Manually via Evaluative Reinforcement, the ARL/UT team developed a new algorithm called Deep TAMER.
It is an extension of TAMER that uses deep learning – a class of machine learning algorithms that are loosely inspired by the brain to provide a robot the ability to learn how to perform tasks by viewing video streams in a short amount of time with a human trainer.
According to Army researcher Dr. Garrett Warnell, the team considered situations where a human teaches an agent how to behave by observing it and providing critique, for example, “good job” or “bad job” -similar to the way a person might train a dog to do a trick. Warnell said the researchers extended earlier work in this field to enable this type of training for robots or computer programs that currently see the world through images, which is an important first step in designing learning agents that can operate in the real world.
Many current techniques in artificial intelligence require robots to interact with their environment for extended periods of time to learn how to optimally perform a task. During this process, the agent might perform actions that may not only be wrong, like a robot running into a wall for example, but catastrophic like a robot running off the side of a cliff. Warnell said help from humans will speed things up for the agents, and help them avoid potential pitfalls.
As a first step, the researchers demonstrated Deep TAMER’s success by using it with 15 minutes of human-provided feedback to train an agent to perform better than humans on the Atari game of bowling – a task that has proven difficult for even state-of-the-art methods in artificial intelligence. Deep-TAMER-trained agents exhibited superhuman performance, besting both their amateur trainers and, on average, an expert human Atari player.
Within the next one to two years, researchers are interested in exploring the applicability of their newest technique in a wider variety of environments: for example, video games other than Atari Bowling and additional simulation environments to better represent the types of agents and environments found when fielding robots in the real world.
Their work will be published in the AAAI 2018 conference proceedings.
“The Army of the future will consist of Soldiers and autonomous teammates working side-by-side,” Warnell said. “While both humans and autonomous agents can be trained in advance, the team will inevitably be asked to perform tasks, for example, search and rescue or surveillance, in new environments they have not seen before. In these situations, humans are remarkably good at generalizing their training, but current artificially-intelligent agents are not.”
Deep TAMER is the first step in a line of research its researchers envision will enable more successful human-autonomy teams in the Army. Ultimately, they want autonomous agents that can quickly and safely learn from their human teammates in a wide variety of styles such as demonstration, natural language instruction and critique.
The Latest on: Deep Learning
- Fuelling automotive GPUs with data to power the next generation of deep learning on September 19, 2018 at 2:08 am
If you can’t attend the session live please register anyway and we’ll send you a link to the slides and a video of the session when it’s finished. The key battleground for automotive stakeholders over ... […]
- Deep breaths, parents, but maybe video games and screen-time are not as bad as you think on September 18, 2018 at 11:07 pm
Deep breaths, parents. You are skeptical. That’s understandable. After all, video games have historically been the enemy of parenting since time began, causing more problems than solving them. But gam... […]
- Art? Music? The five senses? How deep learning is making AI more human on September 18, 2018 at 9:18 am
How do you ethically programme an autonomous car? What is the best way to train a robot surgeon? Can machines be taught to exhibit aesthetic sensibilities? These are just some of the questions that th... […]
- DarwinAI Emerges from Stealth with Powerful Design, Optimization and Explainability Platform for Deep Learning on September 18, 2018 at 8:11 am
WATERLOO, Ontario, Sept. 18, 2018 (GLOBE NEWSWIRE) -- DarwinAI, a Waterloo, Canada startup creating next generation technologies for Artificial Intelligence development, announced today it is ... […]
- MathWorks Expands Deep Learning Capabilities in Release 2018b of the MATLAB and Simulink Product Families on September 18, 2018 at 6:00 am
NATICK, Mass.--(BUSINESS WIRE)--MathWorks today introduced Release 2018b of MATLAB and Simulink. The release contains significant enhancements for deep learning, along with new capabilities and bug fi... […]
- Deep learning courses for nlp market insights shared in detailed report on September 18, 2018 at 4:48 am
Deep learning is part of a broader family of machine learning methods based on learning data representations, as opposed to task-specific algorithms. Research analysis on the global deep learning cour... […]
- Exploring Deep Learning Models for Compression and Acceleration on September 17, 2018 at 9:04 am
EdgeVerve’s Business Applications built on AI platform Infosys Nia™ enables your enterprise to manage specific business areas and make the move from a deterministic to cognitive approach. […]
- Microsoft (MSFT) Adds Deep Learning AI Tools with Lobe Buyout on September 17, 2018 at 7:13 am
Microsoft (MSFT - Free Report) recently acquired Lobe, a startup based in San Francisco, CA. However, the terms of the deal have been kept under wraps. Notably, this buyout adds to Microsoft’s growing ... […]
- Review: Keras sails through deep learning on September 17, 2018 at 3:06 am
As I discussed in my review of PyTorch, the foundational deep neural network (DNN) frameworks such as TensorFlow (Google) and CNTK (Microsoft) tend to be hard to use for model building. However, Tenso... […]
- Microsoft acquires deep learning startup Lobe on September 13, 2018 at 7:38 pm
Microsoft Corp. said Thursday its acquired a small San Francisco-based startup called Lobe Artificial Intelligence Inc. as part of its continuing push to help people create deep learning models ... […]
via Google News and Bing News