Join us in returning to NYC on June 5th to collaborate with executive leaders in exploring comprehensive methods for auditing AI models regarding bias, performance, and ethical compliance across diverse organizations. Find out how you can attend here.
Large language models (LLMs) can accelerate the training of robotics systems in super-human ways, according to a new study by scientists at Nvidia, the University of Pennsylvania and the University of Texas, Austin.
The study introduces DrEureka, a technique that can automatically create reward functions and randomization distributions for robotics systems. DrEureka stands for Domain Randomization Eureka. DrEureka only requires a high-level description of the target task and is faster and more efficient than human-designed rewards in transferring learned policies from simulated environments to the real world.
The implications can be great for the fast-moving world of robotics, which has recently gotten a renewed boost from the advances in language and vision models.
When designing robotics models for new tasks, a policy is usually trained in a simulated environment and deployed to the real world. The difference between simulation and real-world environments, referred to as the sim-to-real gap, is one of the big challenges of any robotics system. Configuring and fine-tuning the policy for optimal performance usually requires a bit of back and forth between simulation and real-world environments.
The AI Impact Tour: The AI Audit
Join us as we return to NYC on June 5th to engage with top executive leaders, delving into strategies for auditing AI models to ensure fairness, optimal performance, and ethical compliance across diverse organizations. Secure your attendance for this exclusive invite-only event.
Recent works have shown that LLMs can combine their vast world knowledge and reasoning capabilities with the physics engines of virtual simulators to learn complex low-level skills. For example, LLMs can be used to design reward functions, the components that steer the robotics reinforcement learning (RL) system to find the correct sequences of actions for the desired task.
However, once a policy is learned in simulation, transferring it to the real world requires a lot of manual tweaking of the reward functions and simulation parameters.
The goal of DrEureka is to use LLMs to automate the intensive human efforts required in the sim-to-real transfer process.
DrEureka builds on Eureka, a technique that was introduced in October 2023. Eureka takes a robotic task description and uses an LLM to generate software implementations for a reward function that measures success in that task. These reward functions are then run in simulation and the results are returned to the LLM, which reflects on the outcome and modifies it to the reward function. The advantage of this technique is that it can be run in parallel with hundreds of reward functions, all generated by the LLM. It can then pick the best functions and continue to improve them.
While the reward functions of Eureka are great for training RL policies in simulation, it does not account for the messiness of the real world and therefore requires manual sim-to-real transfer. DrEureka addresses this shortcoming by automatically configuring domain randomization (DR) parameters.
DR techniques randomize the physical parameters of the simulation environment so that the RL policy can generalize to the unpredictable perturbances it meets in the real world. One of the important challenges of DR is choosing the right parameters and range of perturbations. Adjusting parameters requires commonsense physical reasoning and knowledge of the target robot.
These characteristics of designing DR parameters make it an ideal problem for LLMs to tackle because of their strong grasp of physical knowledge and effectiveness in generating hypotheses, providing good initializations to complex search and black-box optimization problems in a zero-shot manner, the researchers wrote.
DrEureka uses a multi-step process to break down the complexity of optimizing reward functions and domain randomization parameters at the same time. First, an LLM generates reward functions based on a task description and safety instructions about the robot and the environment. DrEureka uses these instructions to create an initial reward function and learn a policy as in the original Eureka. The model then runs tests with the policy and reward function to determine the suitable range of physics parameters, such as friction and gravity.
The LLM then uses this information to select the optimal domain randomization configurations. Finally, the policy is retrained with the DR configurations to become robust against the noisiness of the real world.
The researchers described DrEureka as a language-model driven pipeline for sim-to-real transfer with minimal human intervention.
The researchers evaluated DrEureka on quadruped and dexterous manipulator platforms, although the method is general and applicable to diverse robots and tasks. Their findings show that in quadruped locomotion, policies trained with DrEureka outperform the classic human-designed systems by 34% in forward velocity and 20% in distance traveled across various real-world evaluation terrains. They also tested DrEureka on dexterous manipulation with robotic hands. Given a fixed amount of time, the best policy trained by DrEureka performed 300% more cube rotations than human-developed policies.
But the most interesting finding was the application of DrEureka on the novel task of having a robo-dog balancing and walking on a yoga ball. The LLM was able to design a reward function and DR configurations that allowed the trained policy to be transferred to the real world with no extra configurations and perform well enough on diverse indoor and outdoor terrains with minimal safety support.
Interestingly the study found that the safety instruction included in the task description plays an important role in ensuring that the LLM generates logical instructions that transfer to the real world.
We believe that DrEureka demonstrates the potential of accelerating robot learning research by using foundation models to automate the difficult design aspects of low-level skill learning, the researchers wrote.
VB Daily
Stay in the know! Get the latest news in your inbox daily
By subscribing, you agree to VentureBeat's Terms of Service.
Thanks for subscribing. Check out more VB newsletters here.
An error occured.
Read the rest here:
Nvidia's DrEureka outperforms humans in training robotics systems - VentureBeat
- Robotics - Gizmag [Last Updated On: June 12th, 2016] [Originally Added On: June 12th, 2016]
- Download Microsoft Robotics Developer Studio 4 from ... [Last Updated On: June 16th, 2016] [Originally Added On: June 16th, 2016]
- Robotics News & Articles - IEEE Spectrum [Last Updated On: June 19th, 2016] [Originally Added On: June 19th, 2016]
- Robotics - Wikipedia, the free encyclopedia [Last Updated On: June 19th, 2016] [Originally Added On: June 19th, 2016]
- Robotics News & Articles - IEEE Spectrum [Last Updated On: June 19th, 2016] [Originally Added On: June 19th, 2016]
- RIA - Robotics Online - Industrial Robot Automation [Last Updated On: June 19th, 2016] [Originally Added On: June 19th, 2016]
- Three Laws of Robotics - Wikipedia, the free encyclopedia [Last Updated On: June 25th, 2016] [Originally Added On: June 25th, 2016]
- Robotics [Last Updated On: July 18th, 2016] [Originally Added On: July 18th, 2016]
- Houston Robotics [Last Updated On: August 12th, 2016] [Originally Added On: August 12th, 2016]
- Home - Illinois Tech Robotics [Last Updated On: August 27th, 2016] [Originally Added On: August 27th, 2016]
- Robotics - Wikibooks, open books for an open world [Last Updated On: September 8th, 2016] [Originally Added On: September 8th, 2016]
- JPL Robotics: Home Page [Last Updated On: September 8th, 2016] [Originally Added On: September 8th, 2016]
- ABB Robotics [Last Updated On: September 10th, 2016] [Originally Added On: September 10th, 2016]
- Softbank Robotics | Humanoid robotics & programmable robots [Last Updated On: September 29th, 2016] [Originally Added On: September 29th, 2016]
- Robotics - Wikipedia [Last Updated On: October 20th, 2016] [Originally Added On: October 20th, 2016]
- STEM Robotics 101 NXT | STEMRobotics [Last Updated On: October 25th, 2016] [Originally Added On: October 25th, 2016]
- Industrial Electronics and Robotics Technician [Last Updated On: October 25th, 2016] [Originally Added On: October 25th, 2016]
- Future of Robotics|Robots of the Future|Robot Links [Last Updated On: October 25th, 2016] [Originally Added On: October 25th, 2016]
- Industrial Robotics Training | School of Technology [Last Updated On: October 25th, 2016] [Originally Added On: October 25th, 2016]
- Robotics | An Open Access Journal from MDPI [Last Updated On: October 25th, 2016] [Originally Added On: October 25th, 2016]
- Robotics Technician & Automation Training - Online Course in ... [Last Updated On: October 25th, 2016] [Originally Added On: October 25th, 2016]
- Rich Hooper, PhD, PE: Robotics Engineer [Last Updated On: October 25th, 2016] [Originally Added On: October 25th, 2016]
- Robotics Toolbox - PeterCorke.com [Last Updated On: October 25th, 2016] [Originally Added On: October 25th, 2016]
- ROBOTC | Robotics Academy [Last Updated On: October 25th, 2016] [Originally Added On: October 25th, 2016]
- How have robots changed manufacturing? | HowStuffWorks [Last Updated On: October 25th, 2016] [Originally Added On: October 25th, 2016]
- Professional Development | Robotics Academy [Last Updated On: October 25th, 2016] [Originally Added On: October 25th, 2016]
- Robotics Academy [Last Updated On: December 14th, 2016] [Originally Added On: December 14th, 2016]
- KUKA Robotics | KUKA AG [Last Updated On: December 14th, 2016] [Originally Added On: December 14th, 2016]
- Robotics Engineer - Science Buddies [Last Updated On: December 24th, 2016] [Originally Added On: December 24th, 2016]
- Robotics - reddit [Last Updated On: January 10th, 2017] [Originally Added On: January 10th, 2017]
- Preorder Snap Flying Camera | Vantage Robotics [Last Updated On: January 25th, 2017] [Originally Added On: January 25th, 2017]
- Robotics - The Great Courses [Last Updated On: January 25th, 2017] [Originally Added On: January 25th, 2017]
- Belding hosts largest robotics competition in history of event - Greenville Daily News [Last Updated On: February 6th, 2017] [Originally Added On: February 6th, 2017]
- Robohub Digest 01/17: New year, new perspectives and forecasts for the year ahead in AI, machine learning and ... - Robohub [Last Updated On: February 6th, 2017] [Originally Added On: February 6th, 2017]
- Robotics-focused ETFs see big gains, Trump could hasten trend - MarketWatch [Last Updated On: February 6th, 2017] [Originally Added On: February 6th, 2017]
- Robotics competition events help teens express creativity - Monroe Evening News [Last Updated On: February 6th, 2017] [Originally Added On: February 6th, 2017]
- First Tech Challenge: High schools battle in robotics competition - The Journal News | LoHud.com [Last Updated On: February 6th, 2017] [Originally Added On: February 6th, 2017]
- Gemini school robotics team wins in 'FIRST' competition - Chicago Tribune [Last Updated On: February 6th, 2017] [Originally Added On: February 6th, 2017]
- TMRIES to impart robotics education - The Hindu - The Hindu [Last Updated On: February 6th, 2017] [Originally Added On: February 6th, 2017]
- Meridian robotics team gears up for Idaho FRC Regional - KBOI-TV [Last Updated On: February 6th, 2017] [Originally Added On: February 6th, 2017]
- South High wins Volvo's robotics tournament - Herald-Mail Media [Last Updated On: February 6th, 2017] [Originally Added On: February 6th, 2017]
- Local robotics students headed to World Championship tournament - KOLO [Last Updated On: February 7th, 2017] [Originally Added On: February 7th, 2017]
- Oxford Hills robotics club to compete in North American championship - Lewiston Sun Journal [Last Updated On: February 7th, 2017] [Originally Added On: February 7th, 2017]
- Panther engineering and robotics team paves pathways to innovation - Clay County Free Press.com (subscription) [Last Updated On: February 7th, 2017] [Originally Added On: February 7th, 2017]
- Shopping robots on the march in Ocado - BBC News [Last Updated On: February 7th, 2017] [Originally Added On: February 7th, 2017]
- OHS robotics club dazzles McKinley students - Southernminn.com [Last Updated On: February 7th, 2017] [Originally Added On: February 7th, 2017]
- Toyota Industries Acquires Warehouse Robotics Developer - WSJ - Wall Street Journal [Last Updated On: February 7th, 2017] [Originally Added On: February 7th, 2017]
- Rehab robotics field promises to return control, mobility to aging population - Arizona State University [Last Updated On: February 7th, 2017] [Originally Added On: February 7th, 2017]
- Mount Paran Christian School Robotics Team has engineered their way to State Competition - MDJOnline.com [Last Updated On: February 7th, 2017] [Originally Added On: February 7th, 2017]
- 33C3: Edible Soft Robotics - Hackaday [Last Updated On: February 7th, 2017] [Originally Added On: February 7th, 2017]
- Keller students Starstruck on robotics - Fort Worth Star Telegram [Last Updated On: February 7th, 2017] [Originally Added On: February 7th, 2017]
- WPI Robotics Team Has a Mission with Mars | News | WPI - WPI News [Last Updated On: February 7th, 2017] [Originally Added On: February 7th, 2017]
- Rethink's Robots Get Massive Software Upgrade, Rodney Brooks So Excited - IEEE Spectrum [Last Updated On: February 7th, 2017] [Originally Added On: February 7th, 2017]
- Front Porch: Vision Charter robotics teams perform well at state - Idaho Press-Tribune [Last Updated On: February 9th, 2017] [Originally Added On: February 9th, 2017]
- Raspberry Pi-powered arm: This kit aims to make robotics simple ... - TechRepublic [Last Updated On: February 9th, 2017] [Originally Added On: February 9th, 2017]
- Roam Robotics' lightweight, inexpensive exoskeleton for the masses - New Atlas [Last Updated On: February 9th, 2017] [Originally Added On: February 9th, 2017]
- Rehab robotics field promises to return control, mobility to aging population - ScienceBlog.com (blog) [Last Updated On: February 9th, 2017] [Originally Added On: February 9th, 2017]
- Rethink Robotics rethinks its software | ZDNet - ZDNet [Last Updated On: February 9th, 2017] [Originally Added On: February 9th, 2017]
- Home-schooled students studying robotics - Valencia County News Bulletin [Last Updated On: February 9th, 2017] [Originally Added On: February 9th, 2017]
- Botetourt 4-H Phoenix Robotics team sets sights on success - Roanoke Times [Last Updated On: February 10th, 2017] [Originally Added On: February 10th, 2017]
- Hartland robotics teams take on the FIRST Tech challenge at state competition - Lake Country Now [Last Updated On: February 10th, 2017] [Originally Added On: February 10th, 2017]
- iFuture Robotics cuts manpower needs of warehouses with its mobile robots - YourStory.com [Last Updated On: February 10th, 2017] [Originally Added On: February 10th, 2017]
- Crescent Valley robotics focused on developing freshmen - Corvallis Gazette Times [Last Updated On: February 10th, 2017] [Originally Added On: February 10th, 2017]
- CyPhy, Mitsubishi, and Soft Robotics Join Robo Madness on March 28 - Xconomy [Last Updated On: February 10th, 2017] [Originally Added On: February 10th, 2017]
- Agility Robotics Introduces Cassie, a Dynamic and Talented Robot ... - IEEE Spectrum [Last Updated On: February 10th, 2017] [Originally Added On: February 10th, 2017]
- FIRST Robotics Competition - Wikipedia [Last Updated On: February 10th, 2017] [Originally Added On: February 10th, 2017]
- Raptor legs & human hips: Giant leap for walking robots - RT [Last Updated On: February 11th, 2017] [Originally Added On: February 11th, 2017]
- Patriot Robotics Alliance upsets Brentwood Academy Alliances - Clarksville Now [Last Updated On: February 11th, 2017] [Originally Added On: February 11th, 2017]
- South Beach robotics squad advances to super regionals - The Daily World [Last Updated On: February 11th, 2017] [Originally Added On: February 11th, 2017]
- How drones and robotics may shape the future of conflict under President Trump - PRI [Last Updated On: February 11th, 2017] [Originally Added On: February 11th, 2017]
- Personal robotics the next technological revolution: Dr Vivian Balakrishnan - Channel NewsAsia [Last Updated On: February 11th, 2017] [Originally Added On: February 11th, 2017]
- Ford Bets $1B on Startup Founded by Waymo, Uber Vets - ABC News [Last Updated On: February 11th, 2017] [Originally Added On: February 11th, 2017]
- Woodall robotics team competes in Broken Arrow - Muskogee Daily Phoenix [Last Updated On: February 12th, 2017] [Originally Added On: February 12th, 2017]
- Students Compete in LEGO Robotics Design Contest in Brooklyn - NY1 [Last Updated On: February 12th, 2017] [Originally Added On: February 12th, 2017]
- Essex County Freeholders Honor Livingston Robotics Club Teams - TAPinto.net [Last Updated On: February 12th, 2017] [Originally Added On: February 12th, 2017]
- Robotics event offers lesson in teamwork - Lewiston Morning Tribune (subscription) [Last Updated On: February 12th, 2017] [Originally Added On: February 12th, 2017]
- Face off: Marion students compete in robotics competition - The Exponent Telegram (press release) (registration) [Last Updated On: February 12th, 2017] [Originally Added On: February 12th, 2017]
- Freshman Caldwell team takes first place in Regional Robotics Tournament - Idaho Press-Tribune [Last Updated On: February 12th, 2017] [Originally Added On: February 12th, 2017]
- Ford Motor bets $1 billion on robotics startup - Toledo Blade [Last Updated On: February 12th, 2017] [Originally Added On: February 12th, 2017]
- Alice Shaw robotics team building invention to save wolves | Local ... - Lompoc Record [Last Updated On: February 12th, 2017] [Originally Added On: February 12th, 2017]