Home Technology peripherals AI Brief analysis: the underlying principles of ChatGPT application

Brief analysis: the underlying principles of ChatGPT application

Apr 13, 2023 am 08:37 AM
chatgpt application Ground floor

ChatGPT is undoubtedly the most handsome guy on the Internet recently. Through this period of use and the review of some information, Xiao Wang has learned some of the principles behind it and tried to explain the underlying principles of the ChatGPT application. If there are any inaccuracies, please correct me.

Reading this article may answer the following questions for you:

Why do some ChatGPTs charge and some do not?

Why does ChatGPT answer word by word?

Why do the answers to Chinese questions sometimes make people laugh?

Why do you ask it what day it is today and its answer is a time in the past?

Why do you refuse to answer some questions?

"ChatGPT Domestic Version" Operating Principle

With the popularity of ChatGPT, many domestic versions have appeared. This version is free to use The number of times and subsequent charging methods are different. Brother Xiao Wang drew a sketch and tried to help understand.

Brief analysis: the underlying principles of ChatGPT application

#[For Method 1]: After registering an account, you can use it scientifically online. There is currently no limit on the number of times. For registration costs, please refer to my previous article.

[For Method 2]: It is understood that there is no need to access the Internet scientifically. The cost of use is to purchase the service of the "domestic version of ChatGPT" operator, so the cost of use is also different.

ChatGPT, how does it work internally?

First, OpenAI launched a new conversational assistant on November 30, 2022. The chatbot is based on the language model (LLM for Large Language Models) GPT-3, or more precisely, on its version 3.5. ChatGPT is actually an adaptation of InstructGPT, which was launched in January 2022 but didn’t make the same impression at the time.

Compared with its predecessors, what is so great about ChatGPT?

Thanks to its ability to automatically generate human-like text, as well as its ability to take conversational context into account while avoiding the shortcomings of its predecessors, such as Tay from Microsoft or Galactica from Meta . Tay became racist and xenophobic within 24 hours. Galactica is creating nonsense and misinformation and can speak out about racism in a very eloquent way. Tay was shut down within 24 hours, Galactica three days later. OpenAI appears to have learned from the mistakes of Microsoft and Meta. In a short period of time, the system has been pushed to unprecedented levels.

What is GPT-3?

The GPT (Generative Pre-trained Transformer) series of models is composed of language models based on Transformer technology. It was developed by San Francisco-based company OpenAI. OpenAI was founded in December 2015 by Elon Musk (the boss of Tesla electric cars) and American businessman Sam Altman, the predecessor of the incubator Y Combinator (Scribd, Reddit, Airbnb, Dropbox, GitLab, Women Who Code, etc.) President. ), and has served as Chairman of the OpenAI Board of Directors since 2020.

In 2020, GPT-3 is the largest language model ever built, with 175 billion parameters. It is so large that 800 GB of memory is required to train it.

LLMs are typically generated from a large number of example texts in different languages ​​and domains. GPT-3 has been trained on hundreds of billions of English words from Common Crawl, WebText2, Books1/2 and Wikipedia (Xiao Wangge thinks this is why we ask questions in Chinese, and it sometimes answers Reasons that make us laugh and cry). It is also trained with programming examples coded in CSS, JSX, Python, and more. It accepts 2048 tokens as input, which allows it to handle very large sentences of about 1,500 words (OpenAI considers a token to be a part of a word of about four characters, and uses the example of 1,000 tokens representing about 750 words).

GPT-3 is classified as a generative model, which means it is primarily trained to predict the next token at the end of an input sentence, i.e. the next word (This is also why it is a appears on the screen word by word). An autocomplete mechanism now found in search engines or Outlook.

GPT-3 has been cited many times for its ability to generate text that is extremely close to the capabilities of a journalist or author. Just give it the beginning of a sentence and it will complete the rest of the paragraph or article word for word. By extension, the model has demonstrated its ability to handle a wide range of language processing tasks, such as translating, answering questions, and filling in missing words in text.

GPT-3.5 is a variant of the GPT-3 model. It has been trained using a mixture of selected text and code until Q4 2021. This explains why ChatGPT is unable to evoke facts after that date. (This explains why when you ask it what day it is today, its answer is a time in the past).

Do we refuse to answer some questions?

If we ask some unethical questions, it will refuse to answer: as follows:

Brief analysis: the underlying principles of ChatGPT application

It will politely refuse to answer. Unlike Tay and Galactica, ChatGPT's training is moderated at the source using the moderation API, which allows inappropriate requests to be deferred during training. Nonetheless, false positives and false negatives can still occur and lead to over-moderation. The Moderation API is a classification model performed by the GPT model based on the following categories: Violence, Self-Harm, Hate, Harassment, and Sexuality. To do this, OpenAI uses anonymized data and synthetic data (zero samples), especially when there is not enough data.

Finally

ChatGPT’s ability to simulate real conversations is extraordinary. Even if we know it is a machine, an algorithm, we can only get caught up in the game of asking it so many questions that the machine seems sacred by its outsized knowledge.

But when you look at it carefully, it is still a sentence generator without human-like understanding and self-criticism. I'm even more curious about what will happen next and how successful they will be with this type of architecture.

Reference:

Model Index: https://beta.openai.com/docs/model-index-for-researchers

InstructGPT: https://openai. com/blog/instruction-following/

ChatGPT: https://openai.com/blog/chatgpt/

BLOOM: https://bigscience.huggingface.co/blog/bloom

Y Combinator: https://fr.wikipedia.org/wiki/Y_Combinator

The above is the detailed content of Brief analysis: the underlying principles of ChatGPT application. For more information, please follow other related articles on the PHP Chinese website!

Statement of this Website
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn

Hot AI Tools

Undresser.AI Undress

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress AI Tool

Undress images for free

Clothoff.io

Clothoff.io

AI clothes remover

AI Hentai Generator

AI Hentai Generator

Generate AI Hentai for free.

Hot Article

R.E.P.O. Energy Crystals Explained and What They Do (Yellow Crystal)
3 weeks ago By 尊渡假赌尊渡假赌尊渡假赌
R.E.P.O. Best Graphic Settings
3 weeks ago By 尊渡假赌尊渡假赌尊渡假赌
R.E.P.O. How to Fix Audio if You Can't Hear Anyone
3 weeks ago By 尊渡假赌尊渡假赌尊渡假赌

Hot Tools

Notepad++7.3.1

Notepad++7.3.1

Easy-to-use and free code editor

SublimeText3 Chinese version

SublimeText3 Chinese version

Chinese version, very easy to use

Zend Studio 13.0.1

Zend Studio 13.0.1

Powerful PHP integrated development environment

Dreamweaver CS6

Dreamweaver CS6

Visual web development tools

SublimeText3 Mac version

SublimeText3 Mac version

God-level code editing software (SublimeText3)

ChatGPT now allows free users to generate images by using DALL-E 3 with a daily limit ChatGPT now allows free users to generate images by using DALL-E 3 with a daily limit Aug 09, 2024 pm 09:37 PM

DALL-E 3 was officially introduced in September of 2023 as a vastly improved model than its predecessor. It is considered one of the best AI image generators to date, capable of creating images with intricate detail. However, at launch, it was exclus

How to Undo Delete from Home Screen in iPhone How to Undo Delete from Home Screen in iPhone Apr 17, 2024 pm 07:37 PM

Deleted something important from your home screen and trying to get it back? You can put app icons back on the screen in a variety of ways. We have discussed all the methods you can follow and put the app icon back on the home screen. How to Undo Remove from Home Screen in iPhone As we mentioned before, there are several ways to restore this change on iPhone. Method 1 – Replace App Icon in App Library You can place an app icon on your home screen directly from the App Library. Step 1 – Swipe sideways to find all apps in the app library. Step 2 – Find the app icon you deleted earlier. Step 3 – Simply drag the app icon from the main library to the correct location on the home screen. This is the application diagram

The role and practical application of arrow symbols in PHP The role and practical application of arrow symbols in PHP Mar 22, 2024 am 11:30 AM

The role and practical application of arrow symbols in PHP In PHP, the arrow symbol (->) is usually used to access the properties and methods of objects. Objects are one of the basic concepts of object-oriented programming (OOP) in PHP. In actual development, arrow symbols play an important role in operating objects. This article will introduce the role and practical application of arrow symbols, and provide specific code examples to help readers better understand. 1. The role of the arrow symbol to access the properties of an object. The arrow symbol can be used to access the properties of an object. When we instantiate a pair

From beginner to proficient: Explore various application scenarios of Linux tee command From beginner to proficient: Explore various application scenarios of Linux tee command Mar 20, 2024 am 10:00 AM

The Linuxtee command is a very useful command line tool that can write output to a file or send output to another command without affecting existing output. In this article, we will explore in depth the various application scenarios of the Linuxtee command, from entry to proficiency. 1. Basic usage First, let’s take a look at the basic usage of the tee command. The syntax of tee command is as follows: tee[OPTION]...[FILE]...This command will read data from standard input and save the data to

ChatGPT is now available for macOS with the release of a dedicated app ChatGPT is now available for macOS with the release of a dedicated app Jun 27, 2024 am 10:05 AM

Open AI’s ChatGPT Mac application is now available to everyone, having been limited to only those with a ChatGPT Plus subscription for the last few months. The app installs just like any other native Mac app, as long as you have an up to date Apple S

SearchGPT: Open AI takes on Google with its own AI search engine SearchGPT: Open AI takes on Google with its own AI search engine Jul 30, 2024 am 09:58 AM

Open AI is finally making its foray into search. The San Francisco company has recently announced a new AI tool with search capabilities. First reported by The Information in February this year, the new tool is aptly called SearchGPT and features a c

Explore the advantages and application scenarios of Go language Explore the advantages and application scenarios of Go language Mar 27, 2024 pm 03:48 PM

The Go language is an open source programming language developed by Google and first released in 2007. It is designed to be a simple, easy-to-learn, efficient, and highly concurrency language, and is favored by more and more developers. This article will explore the advantages of Go language, introduce some application scenarios suitable for Go language, and give specific code examples. Advantages: Strong concurrency: Go language has built-in support for lightweight threads-goroutine, which can easily implement concurrent programming. Goroutin can be started by using the go keyword

The wide application of Linux in the field of cloud computing The wide application of Linux in the field of cloud computing Mar 20, 2024 pm 04:51 PM

The wide application of Linux in the field of cloud computing With the continuous development and popularization of cloud computing technology, Linux, as an open source operating system, plays an important role in the field of cloud computing. Due to its stability, security and flexibility, Linux systems are widely used in various cloud computing platforms and services, providing a solid foundation for the development of cloud computing technology. This article will introduce the wide range of applications of Linux in the field of cloud computing and give specific code examples. 1. Application virtualization technology of Linux in cloud computing platform Virtualization technology

See all articles