Home Technology peripherals AI DeepMind robot plays table tennis, and its forehand and backhand slip into the air, completely defeating human beginners

DeepMind robot plays table tennis, and its forehand and backhand slip into the air, completely defeating human beginners

Aug 09, 2024 pm 04:01 PM
deepmind industry table tennis robot

But maybe you can’t beat the old man in the park?

The Paris Olympic Games are in full swing, and table tennis has attracted much attention. At the same time, robots have also made new breakthroughs in playing table tennis.

Just now, DeepMind proposed the first learning robot agent that can reach the level of human amateur players in competitive table tennis.

DeepMind robot plays table tennis, and its forehand and backhand slip into the air, completely defeating human beginners

Paper address: https://arxiv.org/pdf/2408.03906

How good is this DeepMind robot at playing table tennis? Probably on par with human amateur players:

DeepMind robot plays table tennis, and its forehand and backhand slip into the air, completely defeating human beginners

Both forehand and backhand:

DeepMind robot plays table tennis, and its forehand and backhand slip into the air, completely defeating human beginners

The opponent uses a variety of playing styles, and the robot can also withstand it:

DeepMind robot plays table tennis, and its forehand and backhand slip into the air, completely defeating human beginners

Receives serves with different spins:

DeepMind robot plays table tennis, and its forehand and backhand slip into the air, completely defeating human beginners

However, the competition seems not as intense as the battle between the old men in the park.

For robots, table tennis requires mastering complex low-level skills and strategic gameplay, and requires long-term training. DeepMind believes that strategies that are suboptimal but can perform low-level skills proficiently may be a better choice. This distinguishes table tennis from purely strategic games such as chess and Go.

Thus, table tennis is a valuable benchmark for improving robot capabilities, including high-speed locomotion, real-time precise and strategic decision-making, system design, and direct competition with human opponents.

For this, the chief scientist of Google DeepMind praised: "The table tennis robot will help us solve high-speed control and perception problems."

DeepMind robot plays table tennis, and its forehand and backhand slip into the air, completely defeating human beginners

The study conducted 29 table tennis games between robots and humans, including Bots won 45% of the time (13/29). All human players were players the robot had never seen before, ranging in ability from beginners to tournament players.

While the bot lost all its matches against the highest-level players, it won 100% of its matches against beginners and 55% of its matches against intermediate players, demonstrating the performance of a human amateur. .

Overall, the contributions of this research include:

  1. Proposing a hierarchical and modular policy architecture that includes:

  2. low-level controllers and their detailed skill descriptors that are useful for The capabilities of the agent are modeled and help bridge the gap between simulation and reality;

  3. Choose high-level controllers with low-level skills.

  4. Technology to implement zero-sample simulation to reality, including defining iterative methods based on real-world task distribution and defining automatic curriculum.

  5. Adapt to unseen opponents in real time.

Method introduction

The agent consists of a low-level skill library and a high-level controller. The low-level skill pool focuses on a specific aspect of table tennis, such as forehand topspin, backhand aim, or forehand serve. In addition to incorporating training strategies, the study also collects and stores information offline and online about the strengths, weaknesses, and limitations of each low-level skill. The high-level controller responsible for coordinating low-level skills will select the best skills based on current game statistics and skill descriptions.

In addition, the study also collected a small amount of human and human sparring game data as a seed for the initial task conditions. The data set includes position, speed and rotation information. The agent is then trained in a simulated environment using reinforcement learning and employs some existing techniques to seamlessly deploy the policy to real hardware.

DeepMind robot plays table tennis, and its forehand and backhand slip into the air, completely defeating human beginners

The agent plays with humans to generate more training data. As the robot continues to learn, the game standards become more and more complex, allowing the agent to learn more and more complex actions. This hybrid “simulation-reality” loop creates an automated teaching that allows the robot’s skills to improve over time.

DeepMind robot plays table tennis, and its forehand and backhand slip into the air, completely defeating human beginners

レイヤードコントロール

レイヤードコントロールには主に次の部分が含まれます:

  • 卓球のプレースタイル: 高レベルコントローラー (HLC、ハイレベルコントローラー) はまず、どのプレースタイル (フォアハンドまたはフォアハンドまたはハイレベルコントローラー) を使用するかを決定します。バックハンド);

  • 調整: 対戦相手との試合の統計に基づいて各 HLC の好み (H 値) をオンラインで維持します。

  • 最も効果的なスキルを選択します: LLC によるサンプリングに基づいて、HLC の候補者をペアにします。

  • 更新: H値と対戦相手の統計はゲームが終了するまで更新されます。

DeepMind robot plays table tennis, and its forehand and backhand slip into the air, completely defeating human beginners

結果

研究者らは、初心者、中級者、上級者、上級 + スキルを含む、さまざまなレベルの 29 人の卓球選手とエージェントを比較しました。人間のプレーヤーは標準的な卓球ルールに従ってロボットと 3 試合を行いましたが、ロボットがサーブを打つことができなかったため、ルールが若干変更されました。

すべての対戦相手と対戦して、ロボットは試合の 45%、試合の 46% で勝利しました。スキル レベル別に見ると、ボットは初心者に対してはすべての試合に勝ち、上級および上級+ プレーヤーに対してはすべての試合に負け、中級者に対しては試合の 55% に勝ちました。これは、エージェントが卓球のラウンドにおいて人間の中級プレーヤーのレベルに達していることを示しています。

ロボットが上級プレイヤーに勝てない理由は、反応速度、カメラのセンシング能力、回転処理などを含む物理的および技術的な制限により、シミュレーション環境で正確にモデル化することが困難です。

DeepMind robot plays table tennis, and its forehand and backhand slip into the air, completely defeating human beginners

ロボットとのスパーリングもとても魅力的です

研究参加者は、ロボットと遊ぶのがとても楽しかったと述べ、ロボットに対して「面白い」「魅力的」という点で高い評価を与えました。彼らはまた、再びロボットと戦うことに「非常に意欲がある」と満場一致で表明した。自由時間には、5分間で平均4分06秒ロボットと遊んだ。

DeepMind robot plays table tennis, and its forehand and backhand slip into the air, completely defeating human beginners
DeepMind robot plays table tennis, and its forehand and backhand slip into the air, completely defeating human beginners

このロボットはバックスピンが苦手です

最もスキルの高い参加者は、ロボットはバックスピンの処理が苦手だと述べました。この観察を検証するために、研究者らはボールのスピンに対するロボットの着地率をプロットしたところ、バックスピンが多いボールに直面するとロボットの着地率が大幅に低下することが結果で示された。この欠陥は、ロボットが低いボールを扱うときにテーブルとの衝突を避けようとすることが部分的に原因であり、第二に、ボールのスピンをリアルタイムで判断することが非常に難しいという事実によって引き起こされます。

DeepMind robot plays table tennis, and its forehand and backhand slip into the air, completely defeating human beginners

参考リンク:

https://sites.google.com/view/competitive-robot-table-tennis/home?utm_source&utm_medium&utm_campaign&utm_content&pli=1

The above is the detailed content of DeepMind robot plays table tennis, and its forehand and backhand slip into the air, completely defeating human beginners. For more information, please follow other related articles on the PHP Chinese website!

Statement of this Website
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn

Hot Article

R.E.P.O. Energy Crystals Explained and What They Do (Yellow Crystal)
2 weeks ago By 尊渡假赌尊渡假赌尊渡假赌
Repo: How To Revive Teammates
4 weeks ago By 尊渡假赌尊渡假赌尊渡假赌
Hello Kitty Island Adventure: How To Get Giant Seeds
3 weeks ago By 尊渡假赌尊渡假赌尊渡假赌

Hot Article

R.E.P.O. Energy Crystals Explained and What They Do (Yellow Crystal)
2 weeks ago By 尊渡假赌尊渡假赌尊渡假赌
Repo: How To Revive Teammates
4 weeks ago By 尊渡假赌尊渡假赌尊渡假赌
Hello Kitty Island Adventure: How To Get Giant Seeds
3 weeks ago By 尊渡假赌尊渡假赌尊渡假赌

Hot Article Tags

Notepad++7.3.1

Notepad++7.3.1

Easy-to-use and free code editor

SublimeText3 Chinese version

SublimeText3 Chinese version

Chinese version, very easy to use

Zend Studio 13.0.1

Zend Studio 13.0.1

Powerful PHP integrated development environment

Dreamweaver CS6

Dreamweaver CS6

Visual web development tools

SublimeText3 Mac version

SublimeText3 Mac version

God-level code editing software (SublimeText3)

DeepMind robot plays table tennis, and its forehand and backhand slip into the air, completely defeating human beginners DeepMind robot plays table tennis, and its forehand and backhand slip into the air, completely defeating human beginners Aug 09, 2024 pm 04:01 PM

DeepMind robot plays table tennis, and its forehand and backhand slip into the air, completely defeating human beginners

Claude has become lazy too! Netizen: Learn to give yourself a holiday Claude has become lazy too! Netizen: Learn to give yourself a holiday Sep 02, 2024 pm 01:56 PM

Claude has become lazy too! Netizen: Learn to give yourself a holiday

The first mechanical claw! Yuanluobao appeared at the 2024 World Robot Conference and released the first chess robot that can enter the home The first mechanical claw! Yuanluobao appeared at the 2024 World Robot Conference and released the first chess robot that can enter the home Aug 21, 2024 pm 07:33 PM

The first mechanical claw! Yuanluobao appeared at the 2024 World Robot Conference and released the first chess robot that can enter the home

At the World Robot Conference, this domestic robot carrying 'the hope of future elderly care' was surrounded At the World Robot Conference, this domestic robot carrying 'the hope of future elderly care' was surrounded Aug 22, 2024 pm 10:35 PM

At the World Robot Conference, this domestic robot carrying 'the hope of future elderly care' was surrounded

Li Feifei's team proposed ReKep to give robots spatial intelligence and integrate GPT-4o Li Feifei's team proposed ReKep to give robots spatial intelligence and integrate GPT-4o Sep 03, 2024 pm 05:18 PM

Li Feifei's team proposed ReKep to give robots spatial intelligence and integrate GPT-4o

ACL 2024 Awards Announced: One of the Best Papers on Oracle Deciphering by HuaTech, GloVe Time Test Award ACL 2024 Awards Announced: One of the Best Papers on Oracle Deciphering by HuaTech, GloVe Time Test Award Aug 15, 2024 pm 04:37 PM

ACL 2024 Awards Announced: One of the Best Papers on Oracle Deciphering by HuaTech, GloVe Time Test Award

Hongmeng Smart Travel S9 and full-scenario new product launch conference, a number of blockbuster new products were released together Hongmeng Smart Travel S9 and full-scenario new product launch conference, a number of blockbuster new products were released together Aug 08, 2024 am 07:02 AM

Hongmeng Smart Travel S9 and full-scenario new product launch conference, a number of blockbuster new products were released together

The first large UI model in China is released! Motiff's large model creates the best assistant for designers and optimizes UI design workflow The first large UI model in China is released! Motiff's large model creates the best assistant for designers and optimizes UI design workflow Aug 19, 2024 pm 04:48 PM

The first large UI model in China is released! Motiff's large model creates the best assistant for designers and optimizes UI design workflow

See all articles