Home Technology peripherals AI 01Wang's self-developed full navigation map vector database swept 6 first places on the authoritative list

01Wang's self-developed full navigation map vector database swept 6 first places on the authoritative list

Mar 11, 2024 pm 04:01 PM
industry vector database 010,000 things

On March 11, 011Wang announced the launch of a new vector database "Descartes" based on full navigation graphs, which has won the first place in 6 data set evaluations of the authoritative list ANN-Benchmarks.

Vector database, also known as the information retrieval technology in the AI ​​era, is one of the core technologies of Retrieval-Augmented Generation (RAG). For large model application developers, the vector database is a very important infrastructure, which affects the performance of large models to a certain extent.

In the offline test of the international authoritative evaluation platform ANN-Benchmarks, Zero One Everything Descartes vector database ranked first among the 6 data set evaluations It has a significant performance improvement compared to the first place in the industry on the previous list, and the performance improvement on some data sets is even more than 2 times.

Zero One Everything said that the Cartesian vector database will be used in AI products that will be officially launched in the near future, and will also be provided to developers in combination with tools in the future.

Vector database becomes AI 2.0 infrastructure
Won the favor of the capital market

With the advent of the AI ​​2.0 era represented by large models, the amount of multi-modal unstructured data such as pictures, videos, and natural languages ​​has increased sharply, which is different from the traditional methods used to process structured data. database. Vector database is specially used to store, manage, query and retrieve vectorized unstructured data; it is like an external memory disk that can be called by large models at any time to form "long-term memory", also nicknamed large model memory The "hippocampus".

# Large models naturally have four flaws. Vector databases are like tailor-made "special medicine" that can accurately solve every pain point.

  • #Real-time information: Large models take a long time to train, update slowly, and cannot reflect the latest information. There is a "deadline" challenge for their knowledge. The vector database adopts a lightweight update mechanism that can quickly supplement the latest information.
  • Privacy protection: Users’ security and privacy data should not be directly provided to large model training, otherwise there will be a risk of leakage. Vector data cracks privacy by acting as an intermediate carrier for information transmission in the inference stage. Difficulties in protection.
  • Illusion correction: Large models often exhibit inference distortion or hallucination phenomena. Such problems can be effectively corrected and alleviated through the rich knowledge reference provided by the vector database.
  • Inference efficiency: The cost of inference for large models is high. The vector database can be used as a caching mechanism to avoid the need to re-execute complex inference calculations for each query request, greatly saving computing resources.

The technological changes and platform changes initiated by AI 2.0 have further strengthened the role of vector databases. Related products from major manufacturers such as Google, Microsoft, and Meta have come out one after another, and startups such as Zilliz, Pinecone, Weaviate, and Qdrant have also emerged. In 2023, OpenAI's vector database partner Pinecone completed a Series B financing of US$138 million, and domestic start-up Fabarta ArcNeural also completed a Pre-A round of financing of hundreds of millions of yuan.
Challenging the authoritative list
Winning the first place in six evaluations

ANN-Benchmarks is the most authoritative vector database performance testing tool in the industry. It can show the performance of different algorithms under different real data sets.

The following 6 evaluation data sets cover glove-25-angular, glove-100-angular, sift-128-euclidean, nytimes-256-angular, fashion-mnist -784-euclidean, gist-960-euclidean six major data sets, the abscissa represents recall, the ordinate represents QPS (number of requests processed per second), the closer the curve position is to the upper right corner, the better the algorithm performance, zero ten thousand The Cartesian vector database ranks highest in all 6 dataset evaluations.

01Wangs self-developed full navigation map vector database swept 6 first places on the authoritative list

01Wangs self-developed full navigation map vector database swept 6 first places on the authoritative list

01Wangs self-developed full navigation map vector database swept 6 first places on the authoritative list

01Wangs self-developed full navigation map vector database swept 6 first places on the authoritative list

#

01Wangs self-developed full navigation map vector database swept 6 first places on the authoritative list


"Throughput QPS" is an important indicator to measure the query processing capabilities of an information retrieval system (such as a search engine or database). Based on the TOP1 of the original list, the Zero-One Thousand Things Cartesian Vector Database has achieved significant performance improvements. The performance improvement on some data sets is more than 2 times. In the gist-960-euclidean data set dimension, it is significantly ahead of the original TOP1 of the list. 286%.

01Wangs self-developed full navigation map vector database swept 6 first places on the authoritative list

#: Poly Herigid Cartesial Pign Database and the original list TOP1 QPS Performance Comparison

##Technical Reveal

What is curious is how Descartes achieves the above excellent performance?

As we all know, RAG is a technology that combines retrieval and generation. It enhances the generation capabilities of language models by retrieving queried information from massive data. Similar to traditional retrieval methods, in essence, RAG vector retrieval mainly solves two major problems:

1. Reduce the candidate set for retrieval inspection by establishing a certain index structure ;
2. Reduce the complexity of single vector calculation.

Zero One Thousand Cartesian Vector Database has significant comparative advantages over the industry in processing complex queries, improving retrieval efficiency, and optimizing data storage. In response to the first question, the Zero One World team has two major killers:

  • Leading full navigation map technology. The current industry status quo is mainly through hashing, KD-Tree, VP-Tree and other methods. The navigation effect is not accurate enough and the cropping strength is not enough. The global multi-layer thumbnail navigation technology developed by Zero One Wish, and the coordinate system navigation on the map, It can not only ensure accuracy, but also clip a large number of irrelevant vectors.
  • The first adaptive neighbor selection strategy to fill the gap in the industry. The self-developed adaptive neighbor selection strategy of 01Wuxing breaks through the limitations of relying only on real topk or fixed edge selection strategies in the past. The new strategy allows each node to dynamically select the best one based on the distribution characteristics of itself and its neighbors. Neighbor edges converge closer to the target vector faster, thereby improving RAG vector retrieval performance by 15%-30%.

In response to the second question, Zero One Wish adopts a two-level quantization scheme to enhance RAG. Zero One Thousand uses two-level quantization to reduce computational complexity. At the same time, columnar storage fully utilizes the concurrency capabilities of SIMD to further leverage hardware capabilities. Compared with traditional PQ table lookup, the performance is greatly improved to 2-3 times.

In addition, Zero-One Everything also has full-stack vector technology solutions such as index structure optimization and connectivity guarantee to improve the performance of Cartesian vector databases.

Full stack vector technology: higher accuracy and stronger performance

Through the above With the support of full-stack vector technology, Zero-One Cartesian Vector Database not only topped the authoritative list of ANN-Benchmarks in 6 evaluations. It also has core advantages such as higher accuracy and stronger performance in practical application scenarios.

#Zero One Everything Cartesian Vector Database currently focuses on high-performance vector databases. High-performance vector databases usually refer to vector data sets with scales of tens of millions or less (such as 20 million 128-dimensional floating-point vectors). Generally speaking, high-performance vector databases can easily handle 80 to 90 percent of daily scenarios. For example, it helps corporate customers build private domain knowledge bases and intelligent customer service systems; in the field of autonomous driving, the use of high-performance vector databases can accelerate autonomous driving model training, etc.

Zero One All High Performance Vector Database has the following advantages:

  • Ultra-high precision: Based on multi-layer thumbnails and coordinate systems, inter-layer navigation and on-map orientation navigation are realized, as well as graph connectivity guarantee, achieving an accuracy greater than 99%. Under the same performance, the accuracy is significantly ahead of the industry level.
  • Super high performance: Efficient edge selection and cropping technology, tens of millions of database ms response.

Take the e-commerce recommendation scenario as an example. The number of products on the shelves may be tens of millions, and each product can be expressed by a vector. Even if the number of vectors in the library is not very large, if the e-commerce user base is very large and the number of user requests per second at peak times is very large, it may reach hundreds of thousands or even millions of QPS. The use of high-performance vector databases can effectively improve the recommendation effect of search and advertising services in e-commerce scenarios, making everyone unable to help but keep buying.

#Zero Yiwu said that the Cartesian vector database is the team’s initial attempt based on RAG and will be effectively used in AI productivity products released in the near future. In the future, after each major model is optimized to a certain extent, the capabilities of the vector database may determine the ceiling of each major model. Zero One Wish will continue to focus on R&D and sharing in the future to bring better technology and experience to users.

The above is the detailed content of 01Wang's self-developed full navigation map vector database swept 6 first places on the authoritative list. For more information, please follow other related articles on the PHP Chinese website!

Statement of this Website
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn

Hot AI Tools

Undresser.AI Undress

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress AI Tool

Undress images for free

Clothoff.io

Clothoff.io

AI clothes remover

Video Face Swap

Video Face Swap

Swap faces in any video effortlessly with our completely free AI face swap tool!

Hot Tools

Notepad++7.3.1

Notepad++7.3.1

Easy-to-use and free code editor

SublimeText3 Chinese version

SublimeText3 Chinese version

Chinese version, very easy to use

Zend Studio 13.0.1

Zend Studio 13.0.1

Powerful PHP integrated development environment

Dreamweaver CS6

Dreamweaver CS6

Visual web development tools

SublimeText3 Mac version

SublimeText3 Mac version

God-level code editing software (SublimeText3)

DeepMind robot plays table tennis, and its forehand and backhand slip into the air, completely defeating human beginners DeepMind robot plays table tennis, and its forehand and backhand slip into the air, completely defeating human beginners Aug 09, 2024 pm 04:01 PM

But maybe he can’t defeat the old man in the park? The Paris Olympic Games are in full swing, and table tennis has attracted much attention. At the same time, robots have also made new breakthroughs in playing table tennis. Just now, DeepMind proposed the first learning robot agent that can reach the level of human amateur players in competitive table tennis. Paper address: https://arxiv.org/pdf/2408.03906 How good is the DeepMind robot at playing table tennis? Probably on par with human amateur players: both forehand and backhand: the opponent uses a variety of playing styles, and the robot can also withstand: receiving serves with different spins: However, the intensity of the game does not seem to be as intense as the old man in the park. For robots, table tennis

The first mechanical claw! Yuanluobao appeared at the 2024 World Robot Conference and released the first chess robot that can enter the home The first mechanical claw! Yuanluobao appeared at the 2024 World Robot Conference and released the first chess robot that can enter the home Aug 21, 2024 pm 07:33 PM

On August 21, the 2024 World Robot Conference was grandly held in Beijing. SenseTime's home robot brand "Yuanluobot SenseRobot" has unveiled its entire family of products, and recently released the Yuanluobot AI chess-playing robot - Chess Professional Edition (hereinafter referred to as "Yuanluobot SenseRobot"), becoming the world's first A chess robot for the home. As the third chess-playing robot product of Yuanluobo, the new Guoxiang robot has undergone a large number of special technical upgrades and innovations in AI and engineering machinery. For the first time, it has realized the ability to pick up three-dimensional chess pieces through mechanical claws on a home robot, and perform human-machine Functions such as chess playing, everyone playing chess, notation review, etc.

Claude has become lazy too! Netizen: Learn to give yourself a holiday Claude has become lazy too! Netizen: Learn to give yourself a holiday Sep 02, 2024 pm 01:56 PM

The start of school is about to begin, and it’s not just the students who are about to start the new semester who should take care of themselves, but also the large AI models. Some time ago, Reddit was filled with netizens complaining that Claude was getting lazy. "Its level has dropped a lot, it often pauses, and even the output becomes very short. In the first week of release, it could translate a full 4-page document at once, but now it can't even output half a page!" https:// www.reddit.com/r/ClaudeAI/comments/1by8rw8/something_just_feels_wrong_with_claude_in_the/ in a post titled "Totally disappointed with Claude", full of

At the World Robot Conference, this domestic robot carrying 'the hope of future elderly care' was surrounded At the World Robot Conference, this domestic robot carrying 'the hope of future elderly care' was surrounded Aug 22, 2024 pm 10:35 PM

At the World Robot Conference being held in Beijing, the display of humanoid robots has become the absolute focus of the scene. At the Stardust Intelligent booth, the AI ​​robot assistant S1 performed three major performances of dulcimer, martial arts, and calligraphy in one exhibition area, capable of both literary and martial arts. , attracted a large number of professional audiences and media. The elegant playing on the elastic strings allows the S1 to demonstrate fine operation and absolute control with speed, strength and precision. CCTV News conducted a special report on the imitation learning and intelligent control behind "Calligraphy". Company founder Lai Jie explained that behind the silky movements, the hardware side pursues the best force control and the most human-like body indicators (speed, load) etc.), but on the AI ​​side, the real movement data of people is collected, allowing the robot to become stronger when it encounters a strong situation and learn to evolve quickly. And agile

ACL 2024 Awards Announced: One of the Best Papers on Oracle Deciphering by HuaTech, GloVe Time Test Award ACL 2024 Awards Announced: One of the Best Papers on Oracle Deciphering by HuaTech, GloVe Time Test Award Aug 15, 2024 pm 04:37 PM

At this ACL conference, contributors have gained a lot. The six-day ACL2024 is being held in Bangkok, Thailand. ACL is the top international conference in the field of computational linguistics and natural language processing. It is organized by the International Association for Computational Linguistics and is held annually. ACL has always ranked first in academic influence in the field of NLP, and it is also a CCF-A recommended conference. This year's ACL conference is the 62nd and has received more than 400 cutting-edge works in the field of NLP. Yesterday afternoon, the conference announced the best paper and other awards. This time, there are 7 Best Paper Awards (two unpublished), 1 Best Theme Paper Award, and 35 Outstanding Paper Awards. The conference also awarded 3 Resource Paper Awards (ResourceAward) and Social Impact Award (

Hongmeng Smart Travel S9 and full-scenario new product launch conference, a number of blockbuster new products were released together Hongmeng Smart Travel S9 and full-scenario new product launch conference, a number of blockbuster new products were released together Aug 08, 2024 am 07:02 AM

This afternoon, Hongmeng Zhixing officially welcomed new brands and new cars. On August 6, Huawei held the Hongmeng Smart Xingxing S9 and Huawei full-scenario new product launch conference, bringing the panoramic smart flagship sedan Xiangjie S9, the new M7Pro and Huawei novaFlip, MatePad Pro 12.2 inches, the new MatePad Air, Huawei Bisheng With many new all-scenario smart products including the laser printer X1 series, FreeBuds6i, WATCHFIT3 and smart screen S5Pro, from smart travel, smart office to smart wear, Huawei continues to build a full-scenario smart ecosystem to bring consumers a smart experience of the Internet of Everything. Hongmeng Zhixing: In-depth empowerment to promote the upgrading of the smart car industry Huawei joins hands with Chinese automotive industry partners to provide

Distributed Artificial Intelligence Conference DAI 2024 Call for Papers: Agent Day, Richard Sutton, the father of reinforcement learning, will attend! Yan Shuicheng, Sergey Levine and DeepMind scientists will give keynote speeches Distributed Artificial Intelligence Conference DAI 2024 Call for Papers: Agent Day, Richard Sutton, the father of reinforcement learning, will attend! Yan Shuicheng, Sergey Levine and DeepMind scientists will give keynote speeches Aug 22, 2024 pm 08:02 PM

Conference Introduction With the rapid development of science and technology, artificial intelligence has become an important force in promoting social progress. In this era, we are fortunate to witness and participate in the innovation and application of Distributed Artificial Intelligence (DAI). Distributed artificial intelligence is an important branch of the field of artificial intelligence, which has attracted more and more attention in recent years. Agents based on large language models (LLM) have suddenly emerged. By combining the powerful language understanding and generation capabilities of large models, they have shown great potential in natural language interaction, knowledge reasoning, task planning, etc. AIAgent is taking over the big language model and has become a hot topic in the current AI circle. Au

Li Feifei's team proposed ReKep to give robots spatial intelligence and integrate GPT-4o Li Feifei's team proposed ReKep to give robots spatial intelligence and integrate GPT-4o Sep 03, 2024 pm 05:18 PM

Deep integration of vision and robot learning. When two robot hands work together smoothly to fold clothes, pour tea, and pack shoes, coupled with the 1X humanoid robot NEO that has been making headlines recently, you may have a feeling: we seem to be entering the age of robots. In fact, these silky movements are the product of advanced robotic technology + exquisite frame design + multi-modal large models. We know that useful robots often require complex and exquisite interactions with the environment, and the environment can be represented as constraints in the spatial and temporal domains. For example, if you want a robot to pour tea, the robot first needs to grasp the handle of the teapot and keep it upright without spilling the tea, then move it smoothly until the mouth of the pot is aligned with the mouth of the cup, and then tilt the teapot at a certain angle. . this

See all articles