Table of Contents
" >"Evolutionary History" of Tesla Humanoid Robot
Home Technology peripherals AI Tesla's Optimus humanoid robot works in the factory, is skilled in disassembling batteries, self-correcting, and can go even further

Tesla's Optimus humanoid robot works in the factory, is skilled in disassembling batteries, self-correcting, and can go even further

May 06, 2024 pm 02:52 PM
AI humanoid robot optimus

Tesla humanoid robot has unlocked new skills!

Yesterday, Tesla Optimus officially released a new demo video, showing the latest progress of the second-generation Optimus humanoid robot.

Teslas Optimus humanoid robot works in the factory, is skilled in disassembling batteries, self-correcting, and can go even further

#This time, Optimus started working in the factory, learned how to assemble batteries at the Tesla battery factory, and walked faster than before Further and more stable.

Let’s take a first look at Optimus’ latest skills and training details.

Optimus’ end-to-end neural network is now trained to accurately package battery cells at Tesla factories.

Teslas Optimus humanoid robot works in the factory, is skilled in disassembling batteries, self-correcting, and can go even further

Teslas Optimus humanoid robot works in the factory, is skilled in disassembling batteries, self-correcting, and can go even further

Running in real time on the robot's FSD computer, relying solely on 2D cameras, hand touch and force sensors . Optimus uses its legs to maintain balance while the network drives its entire upper body.

Teslas Optimus humanoid robot works in the factory, is skilled in disassembling batteries, self-correcting, and can go even further

The insertion process requires very precise movements, and the fault tolerance rate is very low. The neural network will automatically locate the next free slot. Optimus is also able to autonomously recover from failures.

Teslas Optimus humanoid robot works in the factory, is skilled in disassembling batteries, self-correcting, and can go even further

Optimus’ training data is collected via human teleoperation and scaled for a variety of tasks.

Teslas Optimus humanoid robot works in the factory, is skilled in disassembling batteries, self-correcting, and can go even further

Optimus is being tested at the factory and human intervention rates continue to decrease.

Teslas Optimus humanoid robot works in the factory, is skilled in disassembling batteries, self-correcting, and can go even further

Of course, Optimus can now regularly walk around the office, not only faster but also farther and farther.

Teslas Optimus humanoid robot works in the factory, is skilled in disassembling batteries, self-correcting, and can go even further

In this regard, Tesla Optimus (Tesla Bot) engineer Milan Kovac explained more details of Optimus’ training.

Teslas Optimus humanoid robot works in the factory, is skilled in disassembling batteries, self-correcting, and can go even further

Over the past few months, Tesla’s amazing manufacturing team has built more robots to conduct research and collect labor Smart data.

The team trained and deployed a neural network that allowed Optimus to begin performing useful tasks, such as picking up battery cells from a conveyor belt and inserting them into pallets with precision.

This neural network operates completely end-to-end, meaning that Optimus only consumes video from the robot's 2D camera and data from the onboard proprioceptors and directly produces joint control sequences.

Optimus runs entirely on the robot’s embedded FSD computer, powered by an onboard battery. This design allows a single neural network to perform multiple tasks by adding more diverse data to the training process.

Although Optimus is not perfect yet and is still a bit slow, we are seeing higher and higher success rates and fewer and fewer mistakes. Tesla also trained Optimus to recover from failure cases and saw spontaneous corrections occur.

Tesla has deployed several Optimus humanoid robots in a factory, where they are tested daily on real workstations and continuously improved.

The team is conducting further work to make Optimus move faster and handle rougher terrain without sacrificing its human nature. The team will also focus on repeatability, training neural networks to handle dynamic calibration and small differences between robots. There will be more updates soon.

In addition, Jim Fan, senior research scientist and head of AI Agents at Nvidia, gave high praise to the skills demonstrated in Optimus’s new video.

Teslas Optimus humanoid robot works in the factory, is skilled in disassembling batteries, self-correcting, and can go even further

# He believes that the video gives us a glimpse into the real situation of human data collection, which is Optimus’s biggest advantage. What does it take to build such a pipeline? Optimus does the following:

Optimus's hand is one of the world's top five-finger dexterous robot hands. It has tactile perception and has 11 degrees of freedom (DOF). Many competing products have only 6-7 DOF, and are highly robust and can withstand a large number of object interactions without the need for continuous maintenance.

Remote operation software: We can see human operators wearing VR glasses and gloves. Setting up the software to enable real-time streaming of first-person video and precise control of the output while maintaining extremely low latency is pretty remarkable. Humans are very sensitive to the smallest delay between their own movements and that of a robot. Optimus has a fluid full-body controller that can perform human poses in real time.

Large-scale fleet: You need more than one robot to collect data in parallel, a trained human working multiple shifts per day (preferably around the clock), and an on-call Maintenance team to ensure the robot is always busy. That’s a lot of operational complexity that academic research labs don’t even consider.

Task and Environment: It is equally important to figure out what you want to operate remotely. Currently, most of this work is demo-driven: collect data on tasks you want to put into social media videos. But to solve the problem of general-purpose robots, we need to carefully consider the distribution of tasks and environments. From the 43- to 51-second portion of the video, we can see factory and home settings, such as moving batteries, processing laundry, and sorting everyday items into shelves.

This is an open research question: If you only had the budget to collect training data for 1000 tasks, what would you choose to maximize skill transfer and generalization?

Conclusion: Teleoperation is necessary, but not sufficient, to solve the problem of humanoid robots. It's fundamentally unscalable.

In the comment area, Musk previewed the latest goal for the Optimus humanoid robot hand, which will have 22 degrees of freedom (DoF) later this year. This is even more exciting.

Teslas Optimus humanoid robot works in the factory, is skilled in disassembling batteries, self-correcting, and can go even further

Previously, according to Musk’s revelation on the earnings call, Tesla’s Optimus humanoid robot may have a “useful factory” by the end of this year. "Task" capabilities, and plans to take the lead in using it in its own factory before the end of this year. At the same time, Tesla is likely to sell Optimus before the end of 2025.

"Evolutionary History" of Tesla Humanoid Robot

The Tesla humanoid robot can be traced back to "Tesla" in August 2021. At the "AI Day" event, Musk first revealed his idea of ​​building a humanoid robot (Tesla Bot).

This humanoid robot is about 1.72 meters tall and weighs about 112 pounds. It has a well-proportioned body and a facial screen. It can move at a speed of about 8 kilometers/hour.

Teslas Optimus humanoid robot works in the factory, is skilled in disassembling batteries, self-correcting, and can go even further

In October 2022, at this year’s “Tesla AI Day”, the Optimus humanoid robot made its debut. At that time, it could not perform dance or other tricks. The robot consumes 500W of power when working, weighs 73 kilograms, and has 27 degrees of freedom for hand movement.

Teslas Optimus humanoid robot works in the factory, is skilled in disassembling batteries, self-correcting, and can go even further

However, in the Demo demonstration, Optimus showed pictures of it moving boxes, watering flowers, and working in the Tesla Gigafactory.

Teslas Optimus humanoid robot works in the factory, is skilled in disassembling batteries, self-correcting, and can go even further

#In May 2023, after more than half a year of dormancy, Tesla Optimus finally gained the ability to walk smoothly and grab objects.

Teslas Optimus humanoid robot works in the factory, is skilled in disassembling batteries, self-correcting, and can go even further

# In September 2023, Tesla Optimus evolved again and can classify objects autonomously. The highlight is the complete end-to-end training of the neural network : Input video, output control. In his spare time, he can also do yoga for fun.

Teslas Optimus humanoid robot works in the factory, is skilled in disassembling batteries, self-correcting, and can go even further

Teslas Optimus humanoid robot works in the factory, is skilled in disassembling batteries, self-correcting, and can go even further

In December 2023, the second generation of Optimus arrived, making people really watch the movie "Public Enemy" What it looks like inside.

Teslas Optimus humanoid robot works in the factory, is skilled in disassembling batteries, self-correcting, and can go even further

# With the release of a new video, the speed of Tesla’s humanoid robot progress is once again stunning.

Teslas Optimus humanoid robot works in the factory, is skilled in disassembling batteries, self-correcting, and can go even further

The above is the detailed content of Tesla's Optimus humanoid robot works in the factory, is skilled in disassembling batteries, self-correcting, and can go even further. For more information, please follow other related articles on the PHP Chinese website!

Statement of this Website
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn

Hot AI Tools

Undresser.AI Undress

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress AI Tool

Undress images for free

Clothoff.io

Clothoff.io

AI clothes remover

AI Hentai Generator

AI Hentai Generator

Generate AI Hentai for free.

Hot Article

R.E.P.O. Energy Crystals Explained and What They Do (Yellow Crystal)
2 weeks ago By 尊渡假赌尊渡假赌尊渡假赌
Repo: How To Revive Teammates
1 months ago By 尊渡假赌尊渡假赌尊渡假赌
Hello Kitty Island Adventure: How To Get Giant Seeds
4 weeks ago By 尊渡假赌尊渡假赌尊渡假赌

Hot Tools

Notepad++7.3.1

Notepad++7.3.1

Easy-to-use and free code editor

SublimeText3 Chinese version

SublimeText3 Chinese version

Chinese version, very easy to use

Zend Studio 13.0.1

Zend Studio 13.0.1

Powerful PHP integrated development environment

Dreamweaver CS6

Dreamweaver CS6

Visual web development tools

SublimeText3 Mac version

SublimeText3 Mac version

God-level code editing software (SublimeText3)

Bytedance Cutting launches SVIP super membership: 499 yuan for continuous annual subscription, providing a variety of AI functions Bytedance Cutting launches SVIP super membership: 499 yuan for continuous annual subscription, providing a variety of AI functions Jun 28, 2024 am 03:51 AM

This site reported on June 27 that Jianying is a video editing software developed by FaceMeng Technology, a subsidiary of ByteDance. It relies on the Douyin platform and basically produces short video content for users of the platform. It is compatible with iOS, Android, and Windows. , MacOS and other operating systems. Jianying officially announced the upgrade of its membership system and launched a new SVIP, which includes a variety of AI black technologies, such as intelligent translation, intelligent highlighting, intelligent packaging, digital human synthesis, etc. In terms of price, the monthly fee for clipping SVIP is 79 yuan, the annual fee is 599 yuan (note on this site: equivalent to 49.9 yuan per month), the continuous monthly subscription is 59 yuan per month, and the continuous annual subscription is 499 yuan per year (equivalent to 41.6 yuan per month) . In addition, the cut official also stated that in order to improve the user experience, those who have subscribed to the original VIP

Context-augmented AI coding assistant using Rag and Sem-Rag Context-augmented AI coding assistant using Rag and Sem-Rag Jun 10, 2024 am 11:08 AM

Improve developer productivity, efficiency, and accuracy by incorporating retrieval-enhanced generation and semantic memory into AI coding assistants. Translated from EnhancingAICodingAssistantswithContextUsingRAGandSEM-RAG, author JanakiramMSV. While basic AI programming assistants are naturally helpful, they often fail to provide the most relevant and correct code suggestions because they rely on a general understanding of the software language and the most common patterns of writing software. The code generated by these coding assistants is suitable for solving the problems they are responsible for solving, but often does not conform to the coding standards, conventions and styles of the individual teams. This often results in suggestions that need to be modified or refined in order for the code to be accepted into the application

Seven Cool GenAI & LLM Technical Interview Questions Seven Cool GenAI & LLM Technical Interview Questions Jun 07, 2024 am 10:06 AM

To learn more about AIGC, please visit: 51CTOAI.x Community https://www.51cto.com/aigc/Translator|Jingyan Reviewer|Chonglou is different from the traditional question bank that can be seen everywhere on the Internet. These questions It requires thinking outside the box. Large Language Models (LLMs) are increasingly important in the fields of data science, generative artificial intelligence (GenAI), and artificial intelligence. These complex algorithms enhance human skills and drive efficiency and innovation in many industries, becoming the key for companies to remain competitive. LLM has a wide range of applications. It can be used in fields such as natural language processing, text generation, speech recognition and recommendation systems. By learning from large amounts of data, LLM is able to generate text

Can fine-tuning really allow LLM to learn new things: introducing new knowledge may make the model produce more hallucinations Can fine-tuning really allow LLM to learn new things: introducing new knowledge may make the model produce more hallucinations Jun 11, 2024 pm 03:57 PM

Large Language Models (LLMs) are trained on huge text databases, where they acquire large amounts of real-world knowledge. This knowledge is embedded into their parameters and can then be used when needed. The knowledge of these models is "reified" at the end of training. At the end of pre-training, the model actually stops learning. Align or fine-tune the model to learn how to leverage this knowledge and respond more naturally to user questions. But sometimes model knowledge is not enough, and although the model can access external content through RAG, it is considered beneficial to adapt the model to new domains through fine-tuning. This fine-tuning is performed using input from human annotators or other LLM creations, where the model encounters additional real-world knowledge and integrates it

To provide a new scientific and complex question answering benchmark and evaluation system for large models, UNSW, Argonne, University of Chicago and other institutions jointly launched the SciQAG framework To provide a new scientific and complex question answering benchmark and evaluation system for large models, UNSW, Argonne, University of Chicago and other institutions jointly launched the SciQAG framework Jul 25, 2024 am 06:42 AM

Editor |ScienceAI Question Answering (QA) data set plays a vital role in promoting natural language processing (NLP) research. High-quality QA data sets can not only be used to fine-tune models, but also effectively evaluate the capabilities of large language models (LLM), especially the ability to understand and reason about scientific knowledge. Although there are currently many scientific QA data sets covering medicine, chemistry, biology and other fields, these data sets still have some shortcomings. First, the data form is relatively simple, most of which are multiple-choice questions. They are easy to evaluate, but limit the model's answer selection range and cannot fully test the model's ability to answer scientific questions. In contrast, open-ended Q&A

The full-size universal humanoid robot Qinglong debuts at WAIC, accelerating the entry into the era of embodied intelligence The full-size universal humanoid robot Qinglong debuts at WAIC, accelerating the entry into the era of embodied intelligence Jul 11, 2024 pm 02:03 PM

On the afternoon of July 4, under the guidance of the World Artificial Intelligence Conference Organizing Committee Office, hosted by the World Artificial Intelligence Conference Organizing Committee Office, and hosted by the National and Local Humanoid Robot Innovation Center and the China Institute of Electronics, this site, "Robotics Technology and Applications" The 2024 WAIC World Artificial Intelligence Conference Humanoid Robot and Embodied Intelligence Development Forum co-organized by the magazine will be held at the Shanghai World Expo Exhibition and Convention Center on the afternoon of July 4. This forum invited 12 domestic and foreign scholars, business representatives and developer representatives in the field of humanoid robots and embodied intelligence to give keynote reports, technology sharing and roundtable discussions, and released innovative results of humanoid robots. The forum attracted more than 200 professional audiences in the field of humanoid robots and embodied intelligence, and they were watched online through multiple live broadcast platforms.

SOTA performance, Xiamen multi-modal protein-ligand affinity prediction AI method, combines molecular surface information for the first time SOTA performance, Xiamen multi-modal protein-ligand affinity prediction AI method, combines molecular surface information for the first time Jul 17, 2024 pm 06:37 PM

Editor | KX In the field of drug research and development, accurately and effectively predicting the binding affinity of proteins and ligands is crucial for drug screening and optimization. However, current studies do not take into account the important role of molecular surface information in protein-ligand interactions. Based on this, researchers from Xiamen University proposed a novel multi-modal feature extraction (MFE) framework, which for the first time combines information on protein surface, 3D structure and sequence, and uses a cross-attention mechanism to compare different modalities. feature alignment. Experimental results demonstrate that this method achieves state-of-the-art performance in predicting protein-ligand binding affinities. Furthermore, ablation studies demonstrate the effectiveness and necessity of protein surface information and multimodal feature alignment within this framework. Related research begins with "S

Five schools of machine learning you don't know about Five schools of machine learning you don't know about Jun 05, 2024 pm 08:51 PM

Machine learning is an important branch of artificial intelligence that gives computers the ability to learn from data and improve their capabilities without being explicitly programmed. Machine learning has a wide range of applications in various fields, from image recognition and natural language processing to recommendation systems and fraud detection, and it is changing the way we live. There are many different methods and theories in the field of machine learning, among which the five most influential methods are called the "Five Schools of Machine Learning". The five major schools are the symbolic school, the connectionist school, the evolutionary school, the Bayesian school and the analogy school. 1. Symbolism, also known as symbolism, emphasizes the use of symbols for logical reasoning and expression of knowledge. This school of thought believes that learning is a process of reverse deduction, through existing

See all articles