Read half of 'The Three-Body Problem' in one sitting! The strongest competitor of GPT-4 suddenly upgraded to 100,000 tokens, and the paper code demonstration was completed-AI-php.cn

Table of Contents

Home

Read half of 'The Three-Body Problem' in one sitting! The strongest competitor of GPT-4 suddenly upgraded to 100,000 tokens, and the paper code demonstration was completed

PHPz

May 17, 2023 pm 05:52 PM

ai chatgpt

When GPT-4 32K was still in the internal testing stage, OpenAI’s strong rivals directly increased the context length.

Just today, startup Anthropic announced that Claude is already capable of supporting context token lengths of 100K, which is approximately 75,000 words.

Read half of The Three-Body Problem in one sitting! The strongest competitor of GPT-4 suddenly upgraded to 100,000 tokens, and the paper code demonstration was completed

What is this concept?

After the average person takes about 5 hours to read the same amount of content, he still has to spend more time digesting, memorizing, and analyzing.

For Claude, it was done in less than 1 minute.

Throw the entire book "The Great Gatsby" to it, which has about 72k tokens, and change one sentence:

Mr. Carraway is a software engineer working on machine learning tools at Anthropic.

Can you believe it? It only took Claude 22 seconds to find the changed sentence.

Many netizens said that with Claude 100K, the GPT-4 32K in their hands is no longer good.

Read half of The Three-Body Problem in one sitting! The strongest competitor of GPT-4 suddenly upgraded to 100,000 tokens, and the paper code demonstration was completed

Claude 100k, Bel Xiang!

Some time ago, in the OpenAI developer community, many people discussed that GPT-4 32K was being launched.

Moreover, many GPT-4 users can already see the GPT-4 32k option on their PlayGround.

Read half of The Three-Body Problem in one sitting! The strongest competitor of GPT-4 suddenly upgraded to 100,000 tokens, and the paper code demonstration was completed

Netizens who have unlocked this version have access to hundreds of data points from users who uninstalled HyperWrite. GPT-4 told him exactly what improvements to make next.

He praised that GPT-4 32k is the best product manager in the world.

Read half of The Three-Body Problem in one sitting! The strongest competitor of GPT-4 suddenly upgraded to 100,000 tokens, and the paper code demonstration was completed

32k is so powerful, so wouldn’t it be even stronger with 100K?

Obviously, OpenAI’s powerful rival Anthropic took the advantage first.

The context length of 100K token means that you can upload hundreds of pages of text analysis to Claude. And the duration of conversations has also been greatly extended, extending to hours or even days.

Of course, in addition to long text reading, Claude can also quickly retrieve the information you need from documents.

You can use multiple documents or even the contents of a book as prompts and then ask questions.

When you encounter a paper in the future, even if it is a long one, just ask Claude to summarize it. This is simply good news for the juniors who are reading the paper.

Read half of The Three-Body Problem in one sitting! The strongest competitor of GPT-4 suddenly upgraded to 100,000 tokens, and the paper code demonstration was completed

This kind of comprehensive question usually requires a comprehensive understanding of the content of many parts of the text. In dealing with this kind of problem, Claude can be said to be better than the method based on vector search.

Claude can also be your "code companion" and you can make a demonstration in minutes.

For example, upload a 240-page Langchain API document, let it be based on this document, and use Anthropic's language model to make a simple demonstration of Langchain.

You can also feed Claude the 85-page company annual report (10k).

Then, ask to highlight the items that are most important to potential investors and explain their importance.

In addition, the Claude 100k can handle approximately 6 hours of audio.

For example, AssemblyAI transcribed the content of a Carmack podcast into 58k tokens of text, and then used Claude to summarize and answer questions.

Read half of The Three-Body Problem in one sitting! The strongest competitor of GPT-4 suddenly upgraded to 100,000 tokens, and the paper code demonstration was completed

##Finally, Claude summarized what he was capable of The coverage can be said to be very comprehensive.

- Understand, summarize and interpret dense documents such as financial statements, research papers, etc.

- Analyze the company’s strategic risks and risks based on annual reports Opportunities

- Evaluate the pros and cons of a piece of legislation

- Identify risks, themes and different forms of arguments in legal documents

- Read hundreds of pages of development documentation and answer technical questions

- By putting the entire codebase into context and intelligently building or modifying it To quickly prototype

Of course, for now, Anthropic says that 100K context is still a beta feature and will be charged according to standard API pricing during this period.

Read half of The Three-Body Problem in one sitting! The strongest competitor of GPT-4 suddenly upgraded to 100,000 tokens, and the paper code demonstration was completed

The official website also gives the specific price:

Claude Instant

Prompt: $0.00163 / 1K tokens

Completion: $0.00551 / 1K tokens

Claude-v1

Prompt: $0.01102 / 1K tokens

Completion: $0.03268 / 1K tokens

Compared to OpenAI, this price is already very affordable.

According to the OpenAI official website, GPT-4 32k Prompt costs $0.06 and Completion costs $0.12.

Equivalently, you have to spend 5-6 times the price to prompt the model.

Netizens said that Claude 100k is faster and cheaper than GPT-4 32k.

Netizen Test

Such a blockbuster update must be indispensable for the experience of netizens.

Some netizens said that 100k is simply incredible and can handle multiple complete papers, partially complete code libraries, and even a 250-page novel.

Read half of The Three-Body Problem in one sitting! The strongest competitor of GPT-4 suddenly upgraded to 100,000 tokens, and the paper code demonstration was completed

By the way, many netizens first tested Claude and found that the effect was pretty good.

Initially, 100K is limited to the API, and the default model applied by Claude is still 9K. But soon, the Claude application interface also supports 100K.

Read half of The Three-Body Problem in one sitting! The strongest competitor of GPT-4 suddenly upgraded to 100,000 tokens, and the paper code demonstration was completed

A netizen used the 100-page "GPT-4 Technical Report" to test, and the results can only be described as amazing. .

Read half of The Three-Body Problem in one sitting! The strongest competitor of GPT-4 suddenly upgraded to 100,000 tokens, and the paper code demonstration was completed

Some people directly fed Dazai Osamu’s "disqualification in the world" to Claude and asked about the plot of the story in English. Totally accurate answer given.

Read half of The Three-Body Problem in one sitting! The strongest competitor of GPT-4 suddenly upgraded to 100,000 tokens, and the paper code demonstration was completed

#At the same time, this netizen threw the complete source code of Toolformer Zero he developed to it, and Claude accurately Describe what this is used for.

Furthermore, Claude also praised the modularity of the code and provided suggestions for adding some unit tests.

Read half of The Three-Body Problem in one sitting! The strongest competitor of GPT-4 suddenly upgraded to 100,000 tokens, and the paper code demonstration was completed

#Throw away the "Beowulf" poem Go in and analyze the character of Beowulf, which is also very accurate.

Read half of The Three-Body Problem in one sitting! The strongest competitor of GPT-4 suddenly upgraded to 100,000 tokens, and the paper code demonstration was completed

##Nvidia scientist Jim Fan said that this is the trump card offered by Anthropic. The future arms race in context length is heating up fast.

Read half of The Three-Body Problem in one sitting! The strongest competitor of GPT-4 suddenly upgraded to 100,000 tokens, and the paper code demonstration was completed

Regarding the significance of supporting 100k, netizens said that Thai pants are hot! This is a good demonstration of why long texts are important to LLM.

Read half of The Three-Body Problem in one sitting! The strongest competitor of GPT-4 suddenly upgraded to 100,000 tokens, and the paper code demonstration was completed

Many netizens have also hinted at GPT-4.

The birth of Claude-100K makes AnthropicAI officially a real competitor of OpenAI.

"Many people are still waiting in line for 32k GPT-4. This time, Claude expanded the context window to 100,000 tokens, which was a huge jump.

This also means that companies including OpenAI and Google have to compete in this field, which is a huge victory for users."

Read half of The Three-Body Problem in one sitting! The strongest competitor of GPT-4 suddenly upgraded to 100,000 tokens, and the paper code demonstration was completed

Some netizens lamented that the times have progressed too fast.

It took less than a day for Google to announce that PaLM 2 excels at advanced inference tasks, and Anthropic’s Claude can now digest 100,000 tokens in less than a minute. The progress of artificial intelligence is indeed impressive.

Read half of The Three-Body Problem in one sitting! The strongest competitor of GPT-4 suddenly upgraded to 100,000 tokens, and the paper code demonstration was completed

#However, if you enter less tokens At 9K, Antropic seems to be calling the previous model.

Read half of The Three-Body Problem in one sitting! The strongest competitor of GPT-4 suddenly upgraded to 100,000 tokens, and the paper code demonstration was completed

#Millions of tokens are not a dream

In the past few years, the Hazy Research Laboratory at Stanford University has been engaged in an important work, which is to increase the sequence length of the model.

In their view, this will usher in a new era of basic machine learning models.

The FlashAttention algorithm proposed by researchers in 2022 proved the feasibility of 32k.

Read half of The Three-Body Problem in one sitting! The strongest competitor of GPT-4 suddenly upgraded to 100,000 tokens, and the paper code demonstration was completed

Even Sam Altman said we want 32k tokens.

Read half of The Three-Body Problem in one sitting! The strongest competitor of GPT-4 suddenly upgraded to 100,000 tokens, and the paper code demonstration was completed

In fact, not only 32k, but now 100k has been achieved, and millions of tokens are not far away.

"Absolutely too wild! In a few years, will it be possible to support a token context length of 1 million?"

Read half of The Three-Body Problem in one sitting! The strongest competitor of GPT-4 suddenly upgraded to 100,000 tokens, and the paper code demonstration was completed

Some time ago, researchers from DeepPavlov, AIRI, and the London Institute of Mathematical Sciences released a technical report using the Recurrent Memory Transformer (RMT) to increase the effective context length of BERT. to "an unprecedented 2 million tokens" while maintaining high memory retrieval accuracy.

Read half of The Three-Body Problem in one sitting! The strongest competitor of GPT-4 suddenly upgraded to 100,000 tokens, and the paper code demonstration was completed

##Paper address: https://arxiv.org/abs/2304.11062

This method can store and process local and global information, and let the information flow between segments of the input sequence by using loops.

However, although RMT does not increase memory consumption and can be extended to nearly unlimited sequence lengths, there is still the memory decay problem in RNN and longer inference time is required.

In fact, behind RMT is a brand new memory mechanism.

The specific operation method is to add a special memory token to the input or output sequence without changing the original Transformer model, and then train the model to control the memory operation. and sequence representation processing.

Compared to Transformer-XL, RMT requires less memory and can handle longer sequences of tasks.

Of course, Claude 100k is already a pretty big start before finally achieving one million tokens.

The above is the detailed content of Read half of 'The Three-Body Problem' in one sitting! The strongest competitor of GPT-4 suddenly upgraded to 100,000 tokens, and the paper code demonstration was completed. For more information, please follow other related articles on the PHP Chinese website!

Statement of this Website

The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn

Hot AI Tools

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress images for free

Clothoff.io

AI clothes remover

AI Hentai Generator

Generate AI Hentai for free.

Hot Article

R.E.P.O. Energy Crystals Explained and What They Do (Yellow Crystal)

3 weeks ago By 尊渡假赌尊渡假赌尊渡假赌

R.E.P.O. Best Graphic Settings

3 weeks ago By 尊渡假赌尊渡假赌尊渡假赌

Assassin's Creed Shadows: Seashell Riddle Solution

2 weeks ago By DDD

R.E.P.O. How to Fix Audio if You Can't Hear Anyone

3 weeks ago By 尊渡假赌尊渡假赌尊渡假赌

WWE 2K25: How To Unlock Everything In MyRise

3 weeks ago By 尊渡假赌尊渡假赌尊渡假赌

Hot Tools

Notepad++7.3.1

Easy-to-use and free code editor

SublimeText3 Chinese version

Chinese version, very easy to use

Zend Studio 13.0.1

Powerful PHP integrated development environment

Dreamweaver CS6

Visual web development tools

SublimeText3 Mac version

God-level code editing software (SublimeText3)

Hot Topics

Where is the login entrance for gmail email?

7468

CakePHP Tutorial

1376

What is the format of the account name of steam

win11 activation key permanent

nyt connections hints and answers

Related knowledge

Laravel's geospatial: Optimization of interactive maps and large amounts of data Apr 08, 2025 pm 12:24 PM

Efficiently process 7 million records and create interactive maps with geospatial technology. This article explores how to efficiently process over 7 million records using Laravel and MySQL and convert them into interactive map visualizations. Initial challenge project requirements: Extract valuable insights using 7 million records in MySQL database. Many people first consider programming languages, but ignore the database itself: Can it meet the needs? Is data migration or structural adjustment required? Can MySQL withstand such a large data load? Preliminary analysis: Key filters and properties need to be identified. After analysis, it was found that only a few attributes were related to the solution. We verified the feasibility of the filter and set some restrictions to optimize the search. Map search based on city

How to solve mysql cannot be started Apr 08, 2025 pm 02:21 PM

There are many reasons why MySQL startup fails, and it can be diagnosed by checking the error log. Common causes include port conflicts (check port occupancy and modify configuration), permission issues (check service running user permissions), configuration file errors (check parameter settings), data directory corruption (restore data or rebuild table space), InnoDB table space issues (check ibdata1 files), plug-in loading failure (check error log). When solving problems, you should analyze them based on the error log, find the root cause of the problem, and develop the habit of backing up data regularly to prevent and solve problems.

How to use mysql after installation Apr 08, 2025 am 11:48 AM

The article introduces the operation of MySQL database. First, you need to install a MySQL client, such as MySQLWorkbench or command line client. 1. Use the mysql-uroot-p command to connect to the server and log in with the root account password; 2. Use CREATEDATABASE to create a database, and USE select a database; 3. Use CREATETABLE to create a table, define fields and data types; 4. Use INSERTINTO to insert data, query data, update data by UPDATE, and delete data by DELETE. Only by mastering these steps, learning to deal with common problems and optimizing database performance can you use MySQL efficiently.

Can mysql return json Apr 08, 2025 pm 03:09 PM

MySQL can return JSON data. The JSON_EXTRACT function extracts field values. For complex queries, you can consider using the WHERE clause to filter JSON data, but pay attention to its performance impact. MySQL's support for JSON is constantly increasing, and it is recommended to pay attention to the latest version and features.

Remote senior backend engineers (platforms) need circles Apr 08, 2025 pm 12:27 PM

Remote Senior Backend Engineer Job Vacant Company: Circle Location: Remote Office Job Type: Full-time Salary: $130,000-$140,000 Job Description Participate in the research and development of Circle mobile applications and public API-related features covering the entire software development lifecycle. Main responsibilities independently complete development work based on RubyonRails and collaborate with the React/Redux/Relay front-end team. Build core functionality and improvements for web applications and work closely with designers and leadership throughout the functional design process. Promote positive development processes and prioritize iteration speed. Requires more than 6 years of complex web application backend

Understand ACID properties: The pillars of a reliable database Apr 08, 2025 pm 06:33 PM

Detailed explanation of database ACID attributes ACID attributes are a set of rules to ensure the reliability and consistency of database transactions. They define how database systems handle transactions, and ensure data integrity and accuracy even in case of system crashes, power interruptions, or multiple users concurrent access. ACID Attribute Overview Atomicity: A transaction is regarded as an indivisible unit. Any part fails, the entire transaction is rolled back, and the database does not retain any changes. For example, if a bank transfer is deducted from one account but not increased to another, the entire operation is revoked. begintransaction; updateaccountssetbalance=balance-100wh

MySQL can't be installed after downloading Apr 08, 2025 am 11:24 AM

The main reasons for MySQL installation failure are: 1. Permission issues, you need to run as an administrator or use the sudo command; 2. Dependencies are missing, and you need to install relevant development packages; 3. Port conflicts, you need to close the program that occupies port 3306 or modify the configuration file; 4. The installation package is corrupt, you need to download and verify the integrity; 5. The environment variable is incorrectly configured, and the environment variables must be correctly configured according to the operating system. Solve these problems and carefully check each step to successfully install MySQL.

The primary key of mysql can be null Apr 08, 2025 pm 03:03 PM

The MySQL primary key cannot be empty because the primary key is a key attribute that uniquely identifies each row in the database. If the primary key can be empty, the record cannot be uniquely identifies, which will lead to data confusion. When using self-incremental integer columns or UUIDs as primary keys, you should consider factors such as efficiency and space occupancy and choose an appropriate solution.

See all articles