
Ray, the open source AI framework behind ChatGPT, is now worth $1 billion

Apr 12, 2023, 03:07 PM

Text-generating artificial intelligence has taken the internet by storm recently: ChatGPT is popular for its ability to give highly detailed, near-human answers to almost any question one can think of. The emergence of large-model applications has made people confident about breakthroughs in AI technology, but few know that behind the scenes a distributed machine learning framework is powering this generative AI revolution.

Ray, a distributed computing framework from the a16z-backed startup Anyscale, is key to enabling OpenAI to scale up the training of models like ChatGPT. Ray is behind all of OpenAI's recent large language models, and it may also be the framework behind OpenAI's much-anticipated GPT-4. As large-model technology continues to be deployed, industry insiders believe that an industry worth billions of dollars is forming around the generation of human-like content.

In this field, Ray is the most influential framework. Before its advent, OpenAI used a custom collection of tools to develop large models. But OpenAI president Greg Brockman said at the Ray Summit earlier this year that the company turned to Ray as the challenges it faced grew.


Lukas Biewald, CEO of software company Weights & Biases, believes that Ray is already a fast-rising star in the AI world. "Because of new tools, you can run the same code on a laptop and on a large distributed server. That's a huge change, and it's going to increase in importance as the models get bigger," Biewald said.

A billion-dollar bet

As the technology matures, Ray has attracted the attention of the capital market. Anyscale's equity has become a scarce commodity: according to people familiar with the matter cited by Business Insider, its latest funding round, an extension of its Series C that valued the company at more than $1 billion, closed within days.

Some investors have described Anyscale as Horowitz’s hoped-for “next Databricks”, a description that seems reasonable given that the startup’s co-founder Ion Stoica is also a co-founder of Databricks, a data giant valued at $31 billion.

“Artificial intelligence is developing at an incredible pace and people are trying new approaches all the time,” said Robert Nishihara, CEO of Anyscale. "ChatGPT combines a lot of previous work on large language models. On top of this, you need to have an infrastructure that enables flexibility, rapid innovation, and expansion of different algorithms and methods."

With ever-larger models behind hot new tools like ChatGPT, tech companies are having to rethink the way they develop AI from the ground up. Ray was created to make it easier to train these massive models, which can contain hundreds of billions of data points and give each response a quasi-lifelike feel.

How Ray becomes the tool of choice for machine learning

Ray is a distributed computing framework built on shared memory and suited to fine-grained parallel computing and heterogeneous computing. It provides the underlying infrastructure for managing the complex task of distributing the work of training machine learning models.

In 2017, UC Berkeley researchers first published the Ray paper, "Ray: A Distributed Framework for Emerging AI Applications":


  • Paper link: https://arxiv.org/abs/1712.05889
  • GitHub: https://github.com/ray-project/ray

In this work, the researchers predict what the next generation of AI applications will look like: applications that interact continuously with their environment and learn from those interactions. Such applications must increasingly complete tasks in dynamic environments, react to changes in the environment, and carry out sequences of actions to achieve long-term goals. These characteristics impose new and demanding requirements on the performance and flexibility of the underlying system, so the researchers proposed Ray, a distributed framework designed to meet them.

Ray implements a unified interface that can express both task-parallel and actor-based computation, supported by a single dynamic execution engine. To meet performance requirements, Ray uses a distributed scheduler and a distributed, fault-tolerant store to manage the system's control state. It is the first distributed computing framework to unify training, simulation and serving, and it is designed to deliver high scalability, high performance and fault tolerance.

Ray's architecture.
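
To make the task-parallel half of this model concrete, here is a minimal sketch that uses only Python's standard library. It is not Ray's API and does not distribute work across machines; the names `square` and `run_tasks` are hypothetical, and a thread pool stands in for a cluster. It only illustrates the pattern of submitting stateless tasks, receiving futures immediately, and collecting the results later.

```python
from concurrent.futures import ThreadPoolExecutor


def square(x: int) -> int:
    """A stateless task: the output depends only on the input."""
    return x * x


def run_tasks(values: list[int], max_workers: int = 4) -> int:
    """Submit one task per value, then combine the results."""
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # submit() returns a future right away; the work runs in the background.
        futures = [pool.submit(square, v) for v in values]
        # result() blocks until the corresponding task has finished.
        return sum(f.result() for f in futures)


if __name__ == "__main__":
    print(run_tasks([1, 2, 3, 4]))
```

Because each task is independent of the others, a scheduler is free to run them in any order and on any worker, which is what makes this style of computation easy to scale out.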

Based on this work, in December 2019, Robert Nishihara, Philipp Moritz and Ion Stoica of UC Berkeley and Berkeley Professor Michael I. Jordan founded Anyscale. The company has raised $260 million so far.


Machine learning practitioners can often run small models on limited data sets on their laptops, such as simple models that predict which products users will buy. However, laptops are not feasible for very large models like ChatGPT, which require massive servers to train.

Training a model on a large number of devices poses an important challenge: coordinating the training across different hardware. Ray solves exactly this problem. It gives practitioners a mechanism for managing different hardware as a single unit, deciding what data goes where, handling failures, and so on. The hardware can span Google Cloud, AWS and a portfolio of other products that address the same problem. In addition, Ray brings the "actor", a key programming concept from other languages, to Python, which is widely regarded as the language of choice for machine learning programs.
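
The actor concept mentioned above can be sketched in plain Python. This is an illustration of the idea only, not Ray's implementation or API: the `CounterActor` class and its methods are hypothetical, and a background thread with a queue stands in for a remote worker process. An actor owns private state and handles one message at a time, so callers never touch the state directly.

```python
import queue
import threading
from concurrent.futures import Future


class CounterActor:
    """A stateful worker that processes one message at a time."""

    def __init__(self) -> None:
        self._count = 0  # private state, only touched by the actor thread
        self._mailbox: queue.Queue = queue.Queue()
        self._thread = threading.Thread(target=self._run, daemon=True)
        self._thread.start()

    def _run(self) -> None:
        while True:
            message = self._mailbox.get()
            if message is None:  # sentinel: shut the actor down
                break
            amount, reply = message
            self._count += amount
            reply.set_result(self._count)

    def increment(self, amount: int = 1) -> Future:
        """Enqueue a request and return a future for the new count."""
        reply: Future = Future()
        self._mailbox.put((amount, reply))
        return reply

    def stop(self) -> None:
        self._mailbox.put(None)
        self._thread.join()


if __name__ == "__main__":
    actor = CounterActor()
    replies = [actor.increment() for _ in range(10)]
    print(replies[-1].result())
    actor.stop()
```

Since messages are processed in the order they arrive and only the actor thread mutates the counter, no explicit locking is needed around the state.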


As a distributed computing framework, Ray has two key advantages: locality awareness and task placement. Ray can scale out to support high-throughput, fine-grained tasks while maintaining fault tolerance and low-latency task scheduling.
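
As a rough illustration of what locality-aware task placement means, the sketch below picks the node that already holds the most bytes of a task's inputs, so that less data has to move over the network. This is a toy heuristic written for this article, not Ray's scheduler: the function `pick_node`, the node names, and the assumption that the scheduler knows each object's location and size are all hypothetical.

```python
def pick_node(
    task_inputs: list[str],
    object_locations: dict[str, dict[str, int]],
    nodes: list[str],
) -> str:
    """Choose the node that already stores the most bytes of the task's inputs.

    object_locations maps an object name to {node name: size in bytes}.
    Ties are broken in favor of the node listed first in `nodes`.
    """
    local_bytes = {node: 0 for node in nodes}
    for obj in task_inputs:
        for node, size in object_locations.get(obj, {}).items():
            if node in local_bytes:
                local_bytes[node] += size
    return max(nodes, key=lambda node: local_bytes[node])


if __name__ == "__main__":
    locations = {
        "weights": {"node-a": 500},
        "batch": {"node-a": 50, "node-b": 50},
        "labels": {"node-b": 80},
    }
    print(pick_node(["weights", "batch"], locations, ["node-a", "node-b"]))
```

A production scheduler would also weigh factors such as current load and available resources on each node; this sketch considers data locality alone.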


Ray removes significant complexity from training large models for OpenAI, freeing the company to focus on the model’s critical capabilities.

The next generation of AI requires new development tools, and Ray is just one of a fast-emerging set of machine learning tools that are disrupting the way AI is developed. For example, Google's JAX framework has also received huge attention. JAX is expected to become the backbone of Google's core machine learning tools and has been widely adopted at DeepMind and Google Brain.

Similarly, Coiled, a startup backed by FirstMark Capital and Bessemer Venture Partners, is built around a parallel computing framework called Dask.

Large language models have recently been unlocking more and more potential, and these new machine learning tools will help technology giants and startups in the industry build even more powerful ones.
