Walking the 'dog' on the yoga ball! Eureka, selected as one of NVIDIA's top ten projects, has made a new breakthrough


WBOY

May 05, 2024 pm 01:01 PM


In the demonstration videos, the robot dog walks steadily on a yoga ball and keeps its balance well.

It copes with a range of surfaces, from flat sidewalks to more challenging lawns.

Even when a researcher kicks the yoga ball, the robot dog does not tip over.

Even when the ball is deflated, the robot dog maintains its balance.

The above demonstrations are all at 1x speed and have not been accelerated.

Paper address: https://eureka-research.github.io/dr-eureka/assets/dreureka-paper.pdf
Project homepage: https://github.com/eureka-research/DrEureka
Paper title: DrEureka: Language Model Guided Sim-To-Real Transfer

This research is a collaboration between researchers from the University of Pennsylvania, NVIDIA, and the University of Texas at Austin, and it is fully open source. They propose DrEureka (Domain Randomization Eureka), a new algorithm that uses an LLM to produce both the reward design and the domain randomization parameter configuration needed for sim-to-real transfer. The study demonstrates that DrEureka can solve novel robotic tasks, such as a quadruped robot balancing and walking on a yoga ball, without iterative manual design.

DrEureka builds on Eureka, which was named one of NVIDIA's top ten projects in 2023. To learn more about Eureka, please refer to "With GPT-4, the robot has learned how to spin pens and roll walnuts".

In the abstract, the researchers state that transferring policies learned in simulation to the real world is a promising strategy for acquiring robot skills at scale. However, sim-to-real approaches often rely on manual design and tuning of the task reward function and the simulation physics parameters, which makes the process slow and labor-intensive. The paper investigates using large language models (LLMs) to automate and accelerate sim-to-real design.

Jim Fan, a senior scientist at NVIDIA, is one of the paper's authors. NVIDIA previously established an AI lab specializing in embodied intelligence, led by Jim Fan. Jim Fan said:

"We trained a robot dog to balance and walk on a yoga ball. This was done entirely in simulation, and the policy was then transferred zero-shot to the real world, where it runs directly without fine-tuning.

The yoga ball walking task is particularly difficult for the robot dog because we cannot accurately simulate the bouncy ball surface. Yet DrEureka can easily search over a large number of sim-to-real configurations and lets the robot dog steer the ball on various terrains, and even walk sideways!

Generally speaking, sim-to-real transfer is achieved through domain randomization, a tedious process that requires roboticists to stare at every parameter and adjust it by hand. Cutting-edge LLMs like GPT-4 have a lot of built-in physical intuition about friction, damping, stiffness, gravity, etc. With GPT-4, DrEureka can tune these parameters competently and explain its reasoning well."

Paper Introduction

The DrEureka pipeline works as follows. It takes the task and safety instructions together with the environment source code, and runs Eureka to generate a regularized reward function and policy. It then tests the policy under different simulation conditions to construct a reward-aware physics prior, which is fed to an LLM to generate a set of domain randomization (DR) parameters. Finally, a policy is trained with the synthesized reward and DR parameters for real-world deployment.
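The data flow between these stages can be sketched in a few lines of Python. This is only an illustrative outline, not the project's actual code: the function `dr_eureka_pipeline`, its four stage callbacks, and the stand-in lambdas are all hypothetical names introduced here, and a real system would call an LLM and a physics simulator inside them.

```python
from typing import Callable, Dict, Tuple

Range = Tuple[float, float]  # (low, high) bounds for one physics parameter


def dr_eureka_pipeline(
    task: str,
    env_source: str,
    design_reward: Callable[[str, str], Tuple[str, str]],
    build_prior: Callable[[str], Dict[str, Range]],
    propose_dr: Callable[[Dict[str, Range]], Dict[str, Range]],
    train: Callable[[str, Dict[str, Range]], str],
) -> str:
    """Chain the four stages in the order the article describes (hypothetical sketch)."""
    # Stage 1: Eureka-style reward design yields a reward and an initial policy.
    reward, initial_policy = design_reward(task, env_source)
    # Stage 2: probe the initial policy in perturbed simulations to build a prior.
    prior = build_prior(initial_policy)
    # Stage 3: an LLM proposes domain randomization ranges given the prior.
    dr_config = propose_dr(prior)
    # Stage 4: train the deployable policy with the reward and the DR ranges.
    return train(reward, dr_config)


# Stand-in stages (assumptions for illustration only).
final_policy = dr_eureka_pipeline(
    task="balance on a yoga ball",
    env_source="<environment source code>",
    design_reward=lambda task, src: ("reward_v1", "policy_v0"),
    build_prior=lambda policy: {"friction": (0.1, 2.0)},
    propose_dr=lambda prior: {"friction": (0.3, 1.5)},
    train=lambda reward, dr: f"policy trained with {reward} and {sorted(dr)}",
)
print(final_policy)
```

Passing the stages in as callables keeps the sketch runnable without any simulator while still showing which output feeds which stage.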

Eureka reward design. The reward design component is based on Eureka because of its simplicity and expressiveness, but the paper introduces some improvements to make it better suited to sim-to-real settings. The paper gives pseudocode for this procedure.

Reward-aware physics prior (RAPP). Safety-regularized reward functions can regulate policy behavior for a fixed choice of environment, but they are not sufficient by themselves to achieve sim-to-real transfer. The paper therefore introduces a simple RAPP mechanism that bounds the base parameter ranges given to the LLM.
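One way to picture such a prior is as a per-parameter sweep. The sketch below assumes (it is not taken from the paper's code) that each physics parameter is tested at a handful of candidate values and that the prior keeps the span of values where the policy still scores above a threshold. The names `reward_aware_prior` and `toy_evaluate`, the candidate values, and the threshold are all hypothetical, and `toy_evaluate` is a toy scoring rule standing in for a simulator rollout.

```python
from typing import Callable, Dict, Optional, Sequence, Tuple

Range = Tuple[float, float]


def reward_aware_prior(
    evaluate: Callable[[str, float], float],
    candidates: Dict[str, Sequence[float]],
    threshold: float,
) -> Dict[str, Optional[Range]]:
    """For each parameter, keep the (min, max) of tested values scoring >= threshold."""
    prior: Dict[str, Optional[Range]] = {}
    for name, values in candidates.items():
        feasible = [v for v in values if evaluate(name, v) >= threshold]
        # None marks a parameter for which no tested value was acceptable.
        prior[name] = (min(feasible), max(feasible)) if feasible else None
    return prior


# Toy stand-in for a rollout: the score drops as the value leaves its nominal setting.
NOMINAL = {"friction": 1.0, "gravity": -9.8}


def toy_evaluate(name: str, value: float) -> float:
    return 1.0 - abs(value - NOMINAL[name]) / abs(NOMINAL[name])


prior = reward_aware_prior(
    toy_evaluate,
    {
        "friction": [0.01, 0.1, 0.5, 1.0, 2.0, 10.0],
        "gravity": [-20.0, -12.0, -9.8, -6.0, -1.0],
    },
    threshold=0.4,
)
print(prior)
```

With these toy numbers the prior narrows friction to (0.5, 1.0) and gravity to (-12.0, -6.0), which is the kind of bounded range the next step hands to the LLM.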

LLM for domain randomization. Given the RAPP range for each DR parameter, the final step of DrEureka instructs the LLM to generate domain randomization configurations within the limits of those ranges. Figure 3 of the paper illustrates the process.
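As a rough illustration of what "within the limits of the RAPP range" can mean in practice, the sketch below clamps a proposed configuration to the prior and then draws one set of physics parameters per training episode. Both the clamping step and the uniform sampling are assumptions made here for illustration. The `llm_proposal` dictionary is a hand-written stand-in for an LLM response, and `clip_to_prior` and `sample_episode_params` are hypothetical helper names.

```python
import random
from typing import Dict, Tuple

Range = Tuple[float, float]


def clip_to_prior(proposed: Dict[str, Range], prior: Dict[str, Range]) -> Dict[str, Range]:
    """Clamp each proposed (low, high) range so it lies inside the prior range."""
    clipped = {}
    for name, (low, high) in proposed.items():
        p_low, p_high = prior[name]
        new_low, new_high = max(low, p_low), min(high, p_high)
        if new_low > new_high:
            raise ValueError(f"{name}: proposed range does not overlap the prior")
        clipped[name] = (new_low, new_high)
    return clipped


def sample_episode_params(dr_config: Dict[str, Range], rng: random.Random) -> Dict[str, float]:
    """Draw one value per parameter, uniformly within its DR range (an assumption)."""
    return {name: rng.uniform(low, high) for name, (low, high) in dr_config.items()}


prior = {"friction": (0.5, 1.0), "gravity": (-12.0, -6.0)}
llm_proposal = {"friction": (0.3, 0.9), "gravity": (-11.0, -8.0)}  # hypothetical LLM output

dr_config = clip_to_prior(llm_proposal, prior)
rng = random.Random(0)  # seeded so repeated runs draw the same values
params = sample_episode_params(dr_config, rng)
print(dr_config)
print(params)
```

Here the friction proposal is tightened from (0.3, 0.9) to (0.5, 0.9) because its lower bound fell outside the prior, while the gravity proposal passes through unchanged.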

The experiments use the Unitree Go1, a small quadruped robot with 12 degrees of freedom across its four legs. For the quadrupedal locomotion task, the paper also systematically evaluates DrEureka policies on several real-world terrains and finds that they remain robust and outperform policies trained with human-designed reward and DR configurations.

For more information, please refer to the original paper.
