


Horizon releases open-source Sparse4D algorithm, pushing one step closer to end-to-end autonomous driving
On January 22, Horizon will open source the Sparse4D series of pure visual autonomous driving algorithms to encourage more developers in the industry to participate in the exploration of cutting-edge technology directions such as end-to-end autonomous driving and sparse perception. Currently, the Sparse4D algorithm has been launched on the GitHub platform. Developers can follow the Horizon GitHub official account "Horizon Robotics" to obtain the source code.
Sparse4D is a series of algorithms towards long-term sparse 3D target detection, which belongs to the category of temporal multi-view fusion sensing technology. Facing the industry development trend of sparse perception, Sparse4D has built a pure sparse fusion perception framework to make the perception algorithm more efficient and precise, and to make the perception system simpler. Compared with the dense BEV algorithm, Sparse4D reduces the computational complexity, breaks the limitation of computing power on the perception range, and surpasses the dense BEV algorithm in terms of perception effect and reasoning speed. In both the nuScenes pure visual 3D detection and 3D tracking lists, Sparse4D ranked first, becoming SOTA, ahead of the latest methods including SOLOFusion, BEVFormer v2 and StreamPETR.
Sparse4D algorithm architecture
##After three versions of iterations, the Horizon Sparse4D team It has successfully overcome technical problems such as improving the performance of sparse algorithms, reducing the computational complexity of timing fusion, and achieving end-to-end target tracking. Recently, they published a paper titled "Sparse4D v3: Advancing End-to-End 3D Detection and Tracking", detailing their research results. By using Horizon business data for performance verification, the Sparse4D team has successfully deployed on the Horizon Journey 5 computing solution. In the future, according to plans, Sparse4D technology will be used in Horizon’s next generation products. The achievement of this result will further promote the development of Horizon.
Dr. Yu Yinan, Vice President of Horizon and President of the Software Platform Product Line, pointed out that the current industry has entered the era of end-to-end sensing, which can be completed with only one network the entire perception task. He believes that the Sparse4D series of algorithms have improved the performance of sparse algorithms to a new level and successfully achieved end-to-end multi-target tracking. This is of great significance for both sparse perception and end-to-end autonomous driving, and can be said to be a milestone breakthrough. Horizon chose to open source Sparse4D to the entire industry, hoping to make progress together with outstanding developers in the industry.
Comparison between traditional perception system and end-to-end perception system
Another example of Horizon actively participating in the ecological construction of open source software for intelligent driving is the open source Sparse4D series of algorithms. This algorithm has huge application potential in the implementation of pure visual, end-to-end autonomous driving. In addition, Horizon has also open sourced leading technologies such as VAD algorithm and MapTR algorithm, which will further promote the development of the industry. It is expected that the Sparse4D algorithm will receive widespread attention and use by industry developers. Horizon's continued efforts will accelerate the industry's development process.
Horizon adheres to the concept of transforming independently innovative technologies, breakthrough products and solutions into the commercial value of ecological partners in the smart car industry, and contributes to the development of the industry. Energize. Through close collaboration, open integration, and cooperative innovation with all parties in the industry, Horizon is committed to becoming the source of living water for the smart automobile industry ecology and providing it with sustainable development momentum. Horizon fully understands that the mass production of autonomous driving is an important breakthrough, so we will continue to embrace open source and accelerate the implementation and mass production of cutting-edge technologies. We firmly believe that the future of win-win cooperation with the industry will be broader, and Horizon will continue to work hard to contribute to the prosperity of the smart car industry.
The above is the detailed content of Horizon releases open-source Sparse4D algorithm, pushing one step closer to end-to-end autonomous driving. For more information, please follow other related articles on the PHP Chinese website!

Hot AI Tools

Undresser.AI Undress
AI-powered app for creating realistic nude photos

AI Clothes Remover
Online AI tool for removing clothes from photos.

Undress AI Tool
Undress images for free

Clothoff.io
AI clothes remover

AI Hentai Generator
Generate AI Hentai for free.

Hot Article

Hot Tools

Notepad++7.3.1
Easy-to-use and free code editor

SublimeText3 Chinese version
Chinese version, very easy to use

Zend Studio 13.0.1
Powerful PHP integrated development environment

Dreamweaver CS6
Visual web development tools

SublimeText3 Mac version
God-level code editing software (SublimeText3)

Hot Topics



Written above & the author’s personal understanding Three-dimensional Gaussiansplatting (3DGS) is a transformative technology that has emerged in the fields of explicit radiation fields and computer graphics in recent years. This innovative method is characterized by the use of millions of 3D Gaussians, which is very different from the neural radiation field (NeRF) method, which mainly uses an implicit coordinate-based model to map spatial coordinates to pixel values. With its explicit scene representation and differentiable rendering algorithms, 3DGS not only guarantees real-time rendering capabilities, but also introduces an unprecedented level of control and scene editing. This positions 3DGS as a potential game-changer for next-generation 3D reconstruction and representation. To this end, we provide a systematic overview of the latest developments and concerns in the field of 3DGS for the first time.

Yesterday during the interview, I was asked whether I had done any long-tail related questions, so I thought I would give a brief summary. The long-tail problem of autonomous driving refers to edge cases in autonomous vehicles, that is, possible scenarios with a low probability of occurrence. The perceived long-tail problem is one of the main reasons currently limiting the operational design domain of single-vehicle intelligent autonomous vehicles. The underlying architecture and most technical issues of autonomous driving have been solved, and the remaining 5% of long-tail problems have gradually become the key to restricting the development of autonomous driving. These problems include a variety of fragmented scenarios, extreme situations, and unpredictable human behavior. The "long tail" of edge scenarios in autonomous driving refers to edge cases in autonomous vehicles (AVs). Edge cases are possible scenarios with a low probability of occurrence. these rare events

0.Written in front&& Personal understanding that autonomous driving systems rely on advanced perception, decision-making and control technologies, by using various sensors (such as cameras, lidar, radar, etc.) to perceive the surrounding environment, and using algorithms and models for real-time analysis and decision-making. This enables vehicles to recognize road signs, detect and track other vehicles, predict pedestrian behavior, etc., thereby safely operating and adapting to complex traffic environments. This technology is currently attracting widespread attention and is considered an important development area in the future of transportation. one. But what makes autonomous driving difficult is figuring out how to make the car understand what's going on around it. This requires that the three-dimensional object detection algorithm in the autonomous driving system can accurately perceive and describe objects in the surrounding environment, including their locations,

The first pilot and key article mainly introduces several commonly used coordinate systems in autonomous driving technology, and how to complete the correlation and conversion between them, and finally build a unified environment model. The focus here is to understand the conversion from vehicle to camera rigid body (external parameters), camera to image conversion (internal parameters), and image to pixel unit conversion. The conversion from 3D to 2D will have corresponding distortion, translation, etc. Key points: The vehicle coordinate system and the camera body coordinate system need to be rewritten: the plane coordinate system and the pixel coordinate system. Difficulty: image distortion must be considered. Both de-distortion and distortion addition are compensated on the image plane. 2. Introduction There are four vision systems in total. Coordinate system: pixel plane coordinate system (u, v), image coordinate system (x, y), camera coordinate system () and world coordinate system (). There is a relationship between each coordinate system,

Trajectory prediction plays an important role in autonomous driving. Autonomous driving trajectory prediction refers to predicting the future driving trajectory of the vehicle by analyzing various data during the vehicle's driving process. As the core module of autonomous driving, the quality of trajectory prediction is crucial to downstream planning control. The trajectory prediction task has a rich technology stack and requires familiarity with autonomous driving dynamic/static perception, high-precision maps, lane lines, neural network architecture (CNN&GNN&Transformer) skills, etc. It is very difficult to get started! Many fans hope to get started with trajectory prediction as soon as possible and avoid pitfalls. Today I will take stock of some common problems and introductory learning methods for trajectory prediction! Introductory related knowledge 1. Are the preview papers in order? A: Look at the survey first, p

Original title: SIMPL: ASimpleandEfficientMulti-agentMotionPredictionBaselineforAutonomousDriving Paper link: https://arxiv.org/pdf/2402.02519.pdf Code link: https://github.com/HKUST-Aerial-Robotics/SIMPL Author unit: Hong Kong University of Science and Technology DJI Paper idea: This paper proposes a simple and efficient motion prediction baseline (SIMPL) for autonomous vehicles. Compared with traditional agent-cent

Written in front & starting point The end-to-end paradigm uses a unified framework to achieve multi-tasking in autonomous driving systems. Despite the simplicity and clarity of this paradigm, the performance of end-to-end autonomous driving methods on subtasks still lags far behind single-task methods. At the same time, the dense bird's-eye view (BEV) features widely used in previous end-to-end methods make it difficult to scale to more modalities or tasks. A sparse search-centric end-to-end autonomous driving paradigm (SparseAD) is proposed here, in which sparse search fully represents the entire driving scenario, including space, time, and tasks, without any dense BEV representation. Specifically, a unified sparse architecture is designed for task awareness including detection, tracking, and online mapping. In addition, heavy

In the past month, due to some well-known reasons, I have had very intensive exchanges with various teachers and classmates in the industry. An inevitable topic in the exchange is naturally end-to-end and the popular Tesla FSDV12. I would like to take this opportunity to sort out some of my thoughts and opinions at this moment for your reference and discussion. How to define an end-to-end autonomous driving system, and what problems should be expected to be solved end-to-end? According to the most traditional definition, an end-to-end system refers to a system that inputs raw information from sensors and directly outputs variables of concern to the task. For example, in image recognition, CNN can be called end-to-end compared to the traditional feature extractor + classifier method. In autonomous driving tasks, input data from various sensors (camera/LiDAR
