Researchers at the U.S. Army Research Laboratory and the University of Texas at Austin have developed new techniques for robots or computer programs to learn how to perform tasks by interacting with a human instructor.
The findings of the study will be presented and published at the Association for the Advancement of Artificial Intelligence Conference in New Orleans, Louisiana, Feb. 2-7.
ARL and UT researchers considered a specific case where a human provides real-time feedback in the form of critique. This approach was first introduced as TAMER, or Training an Agent Manually via Evaluative Reinforcement, by collaborator Dr. Peter Stone, a professor at the University of Texas at Austin, along with his former doctoral student, Brad Knox. Building on it, the ARL/UT team developed a new algorithm called Deep TAMER.
It is an extension of TAMER that uses deep learning, a class of machine learning algorithms loosely inspired by the brain, to give a robot the ability to learn how to perform tasks by viewing video streams, in a short amount of time, with a human trainer.
According to Army researcher Dr. Garrett Warnell, the team considered situations where a human teaches an agent how to behave by observing it and providing critique, for example, “good job” or “bad job,” similar to the way a person might train a dog to do a trick. Warnell said the researchers extended earlier work in this field to enable this type of training for robots or computer programs that currently see the world through images, which is an important first step in designing learning agents that can operate in the real world.
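The critique-based training loop described above can be illustrated with a toy sketch. This is not the researchers' Deep TAMER code: it assumes a made-up five-position corridor, a scripted stand-in for the human trainer, and a small lookup table in place of the deep network over images. The names `simulated_trainer`, `train` and `greedy_policy` are hypothetical and chosen only for illustration.

```python
import random

N_STATES = 5          # positions 0..4 on a line; position 4 is the goal (assumed toy task)
ACTIONS = (-1, +1)    # step left or step right


def simulated_trainer(state, action):
    """Scripted stand-in for the human: +1 ("good job") for a step toward the goal, else -1."""
    return 1.0 if action == +1 else -1.0


def train(steps=200, lr=0.5, seed=0):
    """Learn a table H[state][action] that predicts the trainer's critique."""
    rng = random.Random(seed)
    H = [[0.0, 0.0] for _ in range(N_STATES)]
    state = 0
    for _ in range(steps):
        # Pick the action currently predicted to earn the most approval (random tie-break).
        best = max(H[state])
        a = rng.choice([i for i, h in enumerate(H[state]) if h == best])
        # The trainer watches the action and critiques it.
        feedback = simulated_trainer(state, ACTIONS[a])
        # Move the prediction for this state-action pair toward the critique received.
        H[state][a] += lr * (feedback - H[state][a])
        # Advance along the corridor; restart from position 0 after reaching the goal.
        nxt = min(max(state + ACTIONS[a], 0), N_STATES - 1)
        state = 0 if nxt == N_STATES - 1 else nxt
    return H


def greedy_policy(H):
    """Preferred action in each non-goal position, according to the learned table."""
    return [ACTIONS[max(range(len(ACTIONS)), key=lambda i: H[s][i])]
            for s in range(N_STATES - 1)]


if __name__ == "__main__":
    print(greedy_policy(train()))
```

The agent here never sees a task reward; it only learns to predict the trainer's approval and then acts on that prediction. The sketch leaves out what makes the real problem hard, such as learning from raw images and coping with the delay between an action and a person's reaction to it.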
Many current techniques in artificial intelligence require robots to interact with their environment for extended periods of time to learn how to optimally perform a task. During this process, the agent might perform actions that are not only wrong, such as a robot running into a wall, but catastrophic, such as a robot running off the side of a cliff. Warnell said help from humans will speed things up for the agents and help them avoid potential pitfalls.
As a first step, the researchers demonstrated Deep TAMER’s success by using it with 15 minutes of human-provided feedback to train an agent to perform better than humans on the Atari game of bowling – a task that has proven difficult for even state-of-the-art methods in artificial intelligence. Deep-TAMER-trained agents exhibited superhuman performance, besting both their amateur trainers and, on average, an expert human Atari player.
Within the next one to two years, researchers are interested in exploring the applicability of their newest technique in a wider variety of environments: for example, video games other than Atari Bowling and additional simulation environments to better represent the types of agents and environments found when fielding robots in the real world.
Their work will be published in the AAAI 2018 conference proceedings.
“The Army of the future will consist of Soldiers and autonomous teammates working side-by-side,” Warnell said. “While both humans and autonomous agents can be trained in advance, the team will inevitably be asked to perform tasks, for example, search and rescue or surveillance, in new environments they have not seen before. In these situations, humans are remarkably good at generalizing their training, but current artificially-intelligent agents are not.”
Deep TAMER is the first step in a line of research that the researchers envision will enable more successful human-autonomy teams in the Army. Ultimately, they want autonomous agents that can quickly and safely learn from their human teammates in a wide variety of styles, such as demonstration, natural language instruction and critique.