Students who have read the BEV obstacle story should know that our group started doing BEV obstacles around October 21 material. At that time, I didn’t dare to think about doing BEV lane markings because there was no manpower. But I remember that around December, we met with a candidate. During the interview, we heard that they had been working on BEV lane markings for more than half a year. The entire technical route was used as a BEV lane marking network through high-precision maps. Train the true value and say that the effect is not bad. Unfortunately, that candidate did not come to us in the end. Combined with the content of lane markings taught at Telsa AI day in 2021, the seed of making BEV lane markings was planted in the group.
Throughout 2022, the manpower in our team was very tight. I remember that in June and July, We just have the manpower to explore the BEV lane lines. But at that time, there was only one classmate in our group (let’s call him Xiaoxuan for now) who had 2 months to do this. Then the seeds of 2021 began to sprout. We were going to start with the data. Student Xiaoxuan was still very good (very imaginative, and Xiaoxuan also made more things that surprised everyone in the future), and it was almost used. In February, we can extract lane line data around the corresponding car through high-speed high-precision maps. When it was made, I remember everyone was still very excited.
Figure 1: The effect of high-precision map lane lines projected onto the image system
As you can see from Figure 1, there are still some fitting problems problem, so Xiaoxuan made a series of optimizations. Two months later, Xiaoxuan went to do other tasks. Looking back now, we have taken the right step in exploring BEV lane lines. Because in 2021 and 22, many excellent BEV lane line papers and codes have been gradually open sourced. Seeing this, you may think that there must be a perfect story about the implementation of BEV lane lines in 2023. However, ideals are often very fulfilling, but the reality is very cruel.
Because our BEV obstacles have proven that BEV can go down this road, and it has also shown good results in road tests. The group began to have more resources to consider lane lines. Note that this is not BEV. why? Because at this time, we were facing a lot of pressure to go online, and we did not have enough experience in BEV lane lines. In other words, there were almost no people in the entire group who had done mass production of 2D lane lines. In the first half of 2023, it can really be described as stumbling. We had many heated discussions internally, and finally decided to form two lines, one of which is the 2D lane line: most of the manpower is on the 2D lane line, and the focus is on the 2D lane line. Post-processing, light model, and accumulation of lane line post-processing mass production experience through 2D lane lines. One line is the BEV lane line: there is only a small number of manpower (actually only 1-2 manpower), focusing on the model design of the BEV lane line and accumulating model experience. There are already many BEV lane marking networks. I will post two papers that have a great impact on us here for your reference. "HDMapNet: An Online HD Map Construction and Evaluation Framework" and "MapTR: Structured Modeling and Learning for Online Vectorized HD Map Construction"
##Figure 2: HDMapNet Figure 3 MapTR Fortunately, in April and May, we accumulated a lot of experience in mass production of lane line post-processing in the 2D lane line. , our BEV lane line network was also designed, and at the end of May, BEV lane lines were successfully put on the bus soon. I have to say here that our classmate Dahai who is responsible for lane line post-processing is still very capable. However, just when you think things are going well, the nightmare is often about to begin. After the BEV lane lines were deployed, the vehicle control effect was not ideal. At this time, everyone fell into a stage of self-doubt. Is it because of the cubic spline fitting problem of the BEV lane lines or the problem of poor adaptation of downstream parameters. Fortunately, we have the supplier's results on our car. We saved the supplier's lane line results during the road test, and then compared them with our results in the visualization tool. When the vehicle control effect is not good, we must first prove that there is no problem with the quality of our own lane lines, so that the downstream drivers can adapt to our BEV lane lines. It took a month, a whole month, for us to have stable control of the car. I remember very clearly that we also ran from Shanghai to Suzhou. It was still a Saturday. Everyone in the group was very excited to see the high-speed car control effect.However, a story often has twists and turns. We can only use high-speed high-precision maps to produce lane line data. What to do about the city? There are still so many bad cases that need to be solved. At this time, the important person will finally appear. Let's call him Classmate Xiaotang (the big steward of our data group). Classmate Xiaotang and the others used point cloud reconstruction to reconstruct the clip for us ( This process was quite painful. I remember those two months were the most stressful time for them, haha. Of course, classmate Xiaotang and we often fell in love and killed each other. , after all, I often say that there is no data again during meetings. ). Then how to label after reconstruction? Looking at the suppliers at the time, none of them had such labeling tools, let alone labeling experience. Together with Xiaotang and others, after a long month, the annotation tool was finally polished with the supplier. (We often joke that we are empowering the entire self-driving annotation industry. This process is really painful, and rebuilding clips is really slow to load). However, the whole labeling is still relatively slow or expensive. At this time, Xiaoxuan made his debut with his large model of lane line pre-labeling (the effect of the large model of lane line pre-labeling is still outstanding), and everyone looked at him with amazement. In sparkling. After this set of combinations, our lane line data production is finally almost ready. In August, our BEV lane line control lane line has been iterated well, which is suitable for simple high-speed piloting functions. Now Xiaoxuan is still bringing us more surprises in the pre-marked direction of the large model. We and Xiaotang are still in love with each other.
However, a story does not end so easily. In September, we started working on multi-modal (Lidar, camera, Radar) multi-task (lane lines, obstacles, Occ) pre-fusion models. It will also subsequently support City Navigation (NCP), a so-called solution that emphasizes perception and ignores maps. Based on the experience of BEV obstacles and BEV lane lines, we will soon deploy the converged network on vehicles, probably by the end of September. Many subtasks have also been added to lane lines, such as road sign recognition, intersection topology, etc. In this process, we upgraded the post-processing of BEV lane lines, abandoned the lane line cubic spline fitting, and adopted a point tracking scheme. The output of the point tracking scheme and our lane line model can be easily Good combination. This process was also painful. We held a special meeting once a week for 2 consecutive months. After all, we have done well based on the fitting plan, but in order to reach a higher limit, we can only suffer from pain and happiness. Finally, we have already put the basic functions into road testing.
Let me briefly explain Figure 4. The left side is the effect of lane line point tracking. Currently, the perception range of our model is only the first 80 meters. You can see that there are some points behind the car, which are left by tracking. On the right is the real-time perceptual map we have established. Of course, it is still in a rapid iteration process, and there are still many problems being solved.
At this moment, standing in 24 years and looking back at our growth and accumulation from 21 years to now, I am very fortunate to be at that point in 21 years , I have the opportunity to do BEV, and I am very fortunate to have a group of like-minded friends who can help each other along the way. In 24 years, there are many things for us to pursue, including the mass production of pre-fusion models, efforts in data direction, exploration of timing models, end-to-end imagination, etc.
The above is the detailed content of A little bit about the implementation of BEV lane lines. For more information, please follow other related articles on the PHP Chinese website!