
Tesla's former AI director Karpathy reveals his departure and pure vision solution

May 16, 2023 am 11:28 AM

Andrej Karpathy, Tesla's former director of artificial intelligence and now a popular AI educator online, recently appeared on the podcast of MIT artificial intelligence expert Lex Fridman. For AI enthusiasts, the interview was a treat for fans of both men.

In the nearly three-and-a-half-hour interview, the two discussed grand topics such as artificial intelligence, the universe, and human society. They also discussed many of Tesla's technologies in detail, such as autonomous driving, the Optimus humanoid robot, and Tesla's vision solution. In addition, they talked about what the audience cared about most: Andrej's resignation and the reason Tesla removed its ultrasonic sensors.

No need for radar: expensive and hard to use!

Tesla removed millimeter-wave radar from its sensor suite last year and has just announced that it will also remove all ultrasonic sensors, keeping only cameras and adopting a pure vision solution. Lex asked: "Does this make it harder or easier for the vehicle to perceive the road?" Karpathy said: "People generally think of these sensors as an asset to the car, but if you consider the product as a whole, these sensors are actually a potential liability."

"These sensors are not free, and they do not appear in the car out of thin air. They need a complete supply chain, and someone has to be responsible for procurement," he said, and all of that costs real money.

Sensors can also fail and need to be replaced. "As part of car manufacturing, sensor production can also hold up the overall schedule. So you not only need procurement and maintenance, you also need a team to write firmware."

Beyond that, radar sensors also complicate the detection system. "Incorporating them into the car leads to bloat across the whole system," Karpathy said. Installing so many sensors also puts pressure on the data engine, and as development continues, sensor functions become more and more specialized. "There are too many radars now, each with a different function, and this has bloated the detection system. In addition, too many radars interfere with one another and degrade performance."

He highly praised his former boss Musk's ability to simplify: "I think Elon is very good at simplifying. He once said: 'The best part is no part.' He always tries to get rid of things that are not essential and keeps subtracting, because he understands how entropy grows in an organization."

In short, radar is costly and failure-prone, needs constant maintenance, and adds complexity to the detection system, so installing it means paying a high price for little development potential.

"As a computer vision engineer, if you want to improve the vehicle's detection network, you consider whether adding a sensor is useful and how useful it is. We ran comparative experiments to determine whether radar really provides car owners with useful traffic information. The results showed that the difference was small, which shows that radar is not useful."

Karpathy not only explained why Tesla abandoned the technology but also predicted that other car companies will make the same choice. "As with lidar, I don't think ultrasonic sensors provide much additional information. I think other companies that are still using lidar will abandon it too."

Pure vision solution: a cut above

Karpathy has high hopes for purely visual solutions. "If we choose a pure vision solution, we can concentrate all resources and build a powerful data engine."

"The bandwidth of this sensor [the camera] is very high, and we have made substantial progress in this regard. By investing heavily in the technology, you can achieve extraordinary results."

Karpathy said a pure vision approach is both necessary and sufficient. In a sense, the world is designed for human visual consumption, because people navigate it by sight.

At the same time, vision can provide all the information needed for driving. "So we have to focus our resources on developing this technology and keep asking ourselves: 'Do I really want to introduce other sensors?' I think the answer in this case is no."

Although Karpathy strongly supports the pure vision solution, when Lex asked how he viewed the difference between lidar and pure vision, as well as point clouds and voxels, Karpathy said frankly that neither is the key to autonomous driving.

He said: "I have never understood this debate, because it is not the core of the problem. I think that when discussing autonomy, everyone should ask whether there is a fleet on the road to back it up. That is the key to whether the artificial intelligence system can provide better service."

Therefore, a sensor must be evaluated comprehensively, not just on its detection capability: whether there is a fleet on the road to collect large amounts of data, whether the sensor and its data can be integrated into the data engine so that different parts of the data can be searched quickly, and whether the models in use can then be continuously improved.

Centimeter-level maps: no need!

When asked what he thought about other companies producing high-definition maps of self-driving cars in their operating areas, Karpathy said: "It's crazy!"

"We have been talking about how autonomous driving will change the world and how this technology can be applied to transportation on a global scale. If you need to continuously provide a centimeter-level accurate map of the world or a city and keep it updated frequently, the cost is too high."

When Lex asked whether this approach could be extended to all regions of the United States, Karpathy used Tesla as an example: "People don't need such a high-precision map. A low-precision map is enough to show key information such as road conditions and the road ahead. Drivers can understand their environment from this key information, just as they would by looking at Google Maps."

"Tesla's driving system uses information at a resolution similar to Google Maps, but it does not pre-build maps with centimeter-level accuracy. That approach is superfluous and thankless, and it dilutes the team's capabilities, keeping technical staff from focusing on what is really necessary, which is the computer vision problem."

Would he come back after leaving? This is love

When talking about why he left Tesla, Karpathy said it was a difficult decision. Although Tesla has not yet fully achieved autonomous driving, the R&D team had become able to carry on by itself. The resignation also gave him an opportunity to re-examine his love for artificial intelligence, open source, and education.

Before that, he had worked at Tesla for five years, reporting directly to Musk, which made him a veteran among Tesla executives. According to reports, Li Feifei's former student had been on leave for several months. He had previously said he would return to Tesla once the leave ended, but he then announced his resignation.

Karpathy said: "I am very happy to have helped Tesla achieve many goals over the past five years, and the decision to leave was a difficult one. In those five years, autonomous driving has 'graduated': it went from stumbling to find its way to driving on city streets. I look forward to seeing an even stronger autonomous driving team continue that success."

Regarding his plans after leaving, he said: "I have no specific plans for the future. I may return to areas I have long been passionate about, such as technical work in AI, open source, and education."

He also mentioned the possibility of returning to Tesla: "Maybe at some point I will come back and work on Optimus or AGI (artificial general intelligence) at Tesla. Tesla will be an amazing company that can create extraordinary things. At this massive robotics company, talented designers are creating things that have never been done before."

Having gone from Tesla executive to popular online teacher, Karpathy could leave Tesla for the sake of artificial intelligence, and he may one day return to work on humanoid robots and AGI. What he pursues is not wealth or status but the continuous advancement of technology. This resembles his mentor Li Feifei, who refused to change fields after graduation and stuck with research on computer image recognition. Perhaps this is a case of "like teacher, like student"!
