The two biggest competitors in the AIGC industry: ChatGPT vs Google Bard! This article introduces the technical differences between these two artificial intelligence engines.
Translator | Cui Hao
Reviewer | Sun Shujuan
The two biggest competitors in the AIGC industry: ChatGPT vs Google Bard! This article Describe the technical differences between these two artificial intelligence engines.
The biggest difference so far between Google Bard and ChatGPT is: Bard knows about ChatGPT, but ChatGPT is ignorant of Bard. While we can play around with ChatGPT, Bard is still out of reach for most of us.
ChatGPT and Google Bard are both artificial intelligence chatbots. A simplified version of artificial intelligence is already available on mobile phones. When you type "good", the phone can predict that the next word will be "morning".
ChatGPT was originally developed by OpenAI and then funded by Microsoft for an eye-popping $10 billion (in addition to an earlier $1 billion investment). Google, for its part, was slightly panicked that their search monopoly might be ending, so it launched Bard, but this version still had some flaws. During his first live demonstration, Bard made several factual errors that embarrassed Google.
ChatGPT and Google Bard are more complex than the predictive text functions of smartphones. If you want to understand the differences between these two intelligent robots, you can’t miss the following content.
Here we will describe in depth the technical differences between the two artificial intelligence engines.
We can quickly understand the technical differences between them through the following table, through which you can see many details.
#ChatGPT |
Bard |
|
Model | GPT-3.5 | LaMDA, That is, the language model for dialogue applications |
Neural network structure | Transformer | Transformer |
Training data |
Network text, mainly The dataset, called "commoncrawl", is due in mid-2021. |
1.56 million words of public dialogue data and network text |
Purpose |
Become a multipurpose text-generating chatbot |
Dedicated to assisting with searches |
Parameters |
##175 billion parameters | 137 billion parameters |
Creator | OpenAI | Google |
Advantages | - Open to everyone- More flexible and able to handle open text- Training data is as of 2021 |
-Training data as of now - Specifically trained for conversation, so when you talk to it, it sounds more like a human. |
Disadvantages |
- The dialogue is not that convincing - Not that Careful fine-tuning |
-Not yet -May not be suitable for general text creation |
After understanding the differences between the two through the above table, let’s take a deeper look at other indicators.
ChatGPT suddenly appeared on the stage on November 30, 2022. As of December 4, 2022, the service has over one million daily users. In January 2023, this number swelled to more than 100 million users.
The basic reason for its sudden popularity is that it can provide you with reliable answers to many topics in an almost human-sounding way, and it can be used by anyone with an Internet connection. .
ChatGPT was created by OpenAI, an artificial intelligence laboratory located in San Francisco that focuses on creating friendly artificial intelligence solutions. The chatbot is developed based on GPT-3.5, a large language model that can continuously provide responses to the requester when given text.
ChatGPT adds some additional training on this basis-human trainers improve the model through interaction with the model, and give the model the ability to provide high-quality answers through "rewards".
GPT-3.5 is trained on a huge web text dataset, including a popular dataset called Common Crawl. Common Crawl contains petabytes of web data, including raw web page data, metadata extraction, and text extraction. For example, it includes a collection of URLs from StrataScratch. Isn’t it crazy to think that ChatGPT uses training data from netizens’ input on ChatGPT?
Common Crawl is responsible for 60% of the training data, but GPT-3.5 also has other data sources.
Google Bard is an intelligent chat robot launched by Google when ChatGPT became very popular. Unlike ChatGPT, Bard is powered by Google's own model, LaMDA. LaMDA is the abbreviation of Language Model for Conversational Applications. Unlike ChatGPT, it is not that amazing for the simple reason that most people do not have access to it yet. While Google did have an awkward demo of Bard in early February, Bard is currently only available to a select few.
The main advantage of Google Bard is that it is open to the Internet. Ask ChatGPT "Who is the president now?" and it doesn't know. This is because the training data was cut off around mid-2021. Bard, on the other hand, drew on information available on the Internet today. In theory, Bard should be able to pull from data on the Internet today and tell you who is president right now.
It’s easy to see how Bard stands out from ChatGPT in several key aspects.
First of all, LaMDA is trained on conversations, specifically for conversations, rather than just producing text like the GPT-n model . While ChatGPT is unabashed about its training data, we don’t know much about the data Bard was trained on and can infer by looking at LaMDA’s research paper. Google researchers say that 12.5% of training data comes from Common Crawl, such as the GPT-n model. Another 12.5% comes from Wikipedia. According to the research paper, they used 1.56 trillion words of "public conversation data and network text."
Here is the complete breakdown:
From the above information, we can know the data jointly used by the two. Obviously, there are Wikipedia. The rest of the data was clearly hidden intentionally by Google, presumably to protect Bard (and LaMDA) from being imitated.
LaMDA was formed by fine-tuning the neural language model of Transformer, an open source neural network architecture originally developed by Google. (GPT is also based on Transformer).
There are some barriers to ChatGPT to prevent it from being annoying or talking nonsense, but Google emphasizes how to ensure quality to make Bard a better and more secure chat robot. Bard has been fine-tuned to be "high quality, grounded and safe".
Google has a lot to say about this, and I recommend reading their related blog posts, but if you don’t have much time, it can basically be divided into the following aspects:
It is well known that due to a wrong launch, Google has not fully figured out the underlying requirements. But it's worth noting that Google is very clear about its design requirements, while ChatGPT is not as clear - at least for now.
ChatGPT does have more model parameters than Bard - 175 billion versus 137 billion. You can think of parameters as knobs or levers that the model adjusts to fit the data it is trained on. More parameters usually mean the model has more ability to capture complex relationships in language, but there is also a risk of overfitting. Google Bard may be less flexible than ChatGPT, but it may also be more powerful because of new language use cases.
It is worth emphasizing that the models of Bard and ChatGPT (LaMDA and GPT-3.5 respectively) are based on Transformer-based deep learning neural networks.
For example, Transformer can enable a trained model to read a sentence or paragraph, note the relationship between those words, and then predict what words it thinks will come next—similar to the intelligence mentioned earlier The power of predictive text on mobile phones.
I won’t get into the discussion here, but what you need to know is that this means that at their core, Bard and ChatGPT are not very different from each other.
While ownership isn’t exactly a technical difference, it’s worth remembering.
Google Bard is made and fully owned by Google, on top of LaMDA, which was also created by Google.
ChatGPT was developed by OpenAI, an artificial intelligence research laboratory based in San Francisco. OpenAI was originally a non-profit, but it created a for-profit subsidiary in 2019. OpenAI is also behind Dall-E, the artificial intelligence text-to-image generation you may have played with.
Although Microsoft has invested heavily in OpenAI, for now, it is an independent research organization.
It is difficult to give a fair answer to this question because there are many similarities between the two, but there are also differences. First, almost no one has access to Google Bard right now. In addition, ChatGPT’s training data was cut off almost two years ago.
Both are text generators - you provide a prompt and both Google Bard and ChatGPT can answer it. Both have billions of parameters to fine-tune the model. Both have overlapping training data sources and are built on Transformer, the same neural network model.
They are also designed for different purposes, Bard will help you browse Google searches, and it is designed to be conversational. ChatGPT can generate entire blog posts. It is designed to output meaningful text.
Even if we talk about the differences between ChatGPT and Google Bard, it only proves how far artificial intelligence-driven text generation technology has come. While they both have a way to go, and both face copyright and ethical controversies, both generators are strong testaments to the development of modern AI models.
Cui Hao, 51CTO community editor and senior architect, has 18 years of software development and architecture experience and 10 years of distributed architecture experience.
Original title: ChatGPT vs Google Bard: A Comparison of the Technical Differences, author: Nate Rosidi
The above is the detailed content of ChatGPT and Google Bard: Which one is better and which one is worse? A big review of the differences!. For more information, please follow other related articles on the PHP Chinese website!