The LLaMA model was leaked, and the Meta version of ChatGPT was forced to be 'open source'! GitHub gains 8k stars and a large number of reviews are released

Apr 12, 2023 pm 05:49 PM
Shin Jigen

The battle for ChatGPT is intensifying.

A few weeks ago, Meta released its own large language model, LLaMA, in sizes ranging from 7 billion to 65 billion parameters.

According to the paper, LLaMA-13B, with only about one-tenth the parameters, outperforms GPT-3 on most benchmarks.

The 65-billion-parameter LLaMA is comparable to DeepMind's Chinchilla (70 billion parameters) and Google's PaLM (540 billion parameters).

Although Meta describes LLaMA as open source, researchers still have to apply for access and be approved.

What I never expected, however, was that the model files would leak just a few days after the release.

So the question is: was this intentional or accidental?

LLaMA gets "open-sourced"?

Recently, LLaMA's model weights were leaked on the forum 4chan.

Last Thursday, the user llamanon posted on 4chan's technology board, releasing the 7B and 65B LLaMA models via a torrent.

This torrent link has since been merged into LLaMA's GitHub page.

He also submitted a second pull request to the project, which provided a torrent link to another set of weights for the model.

The project currently has 8k stars on GitHub.

However, one of the biggest mistakes a leaker can make is leaving their unique identifier code in the leaked model.

This code is designed specifically to track down leakers, which puts llamanon's personal information at risk.

As the joke goes: if LLaMA will not open-source itself properly, netizens will do it on its behalf.

Additionally, users on 4chan have created a handy resource for those looking to deploy the model on their own workstations. It includes a step-by-step tutorial on how to obtain the model and add modified weights to it for more efficient inference.

What's more, this resource even provides a way to integrate LLaMA into the online writing platform KoboldAI.

Did Meta do this deliberately, or was it an accidental leak? Netizens weighed in one after another.

One netizen laid it out plainly: "Maybe Meta leaked it deliberately to fight OpenAI."

Some customers think this is a better model, and it strikes right at the heart of their business plan of selling access for $250,000 a year. One month of access to their service would buy a machine capable of running this leaked model. Meta undercuts a potential upstart competitor to keep the current big tech cartel stable. Maybe this is a bit of a conspiracy theory, but we live in the age of big tech and big conspiracies.

On Monday, Meta said it would continue to release its artificial intelligence tools to accredited researchers even though LLaMA had been leaked to unauthorized users.

Some netizens said outright that they had downloaded the 7-billion-parameter LLaMA. They do not know how to run it yet, but they are keeping it in case they need it in the future.

The leak and open-sourcing of LLaMA is a big deal:

Stable Diffusion was open-sourced. Eight months later, we can now read other people's minds and decode everything they see.

With LLMs opening up, we're going to get some really crazy stuff.

Preliminary model evaluation

Not long after LLaMA was released, netizens discovered that even the smallest model needs nearly 30GB of GPU memory to run.

However, with floating-point optimization via the bitsandbytes library, they were able to get the model running on a single NVIDIA RTX 3060.

Additionally, a researcher on GitHub was even able to run the 7B version of the model on a Ryzen 7900X CPU, generating a few words per second.
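One way to make sense of these memory figures is a back-of-the-envelope estimate: weight memory is roughly the parameter count times the bytes used per parameter. The sketch below is an illustration only. The precision table and the function name `weight_memory_gb` are assumptions, the article does not say which precision the "nearly 30GB" figure refers to, and the estimate ignores activations and framework overhead.

```python
# Rough weight-memory estimate: parameters x bytes per parameter.
# Assumption: activations, caches, and framework overhead are ignored.
BYTES_PER_PARAM = {"fp32": 4.0, "fp16": 2.0, "int8": 1.0, "int4": 0.5}


def weight_memory_gb(n_params: float, precision: str) -> float:
    """Return the approximate weight size in GB (1 GB = 1e9 bytes)."""
    return n_params * BYTES_PER_PARAM[precision] / 1e9


if __name__ == "__main__":
    # Nominal 7-billion-parameter model at several precisions.
    for precision in BYTES_PER_PARAM:
        print(f"7B @ {precision}: {weight_memory_gb(7e9, precision):.1f} GB")
```

Under these assumptions, 32-bit weights for a nominal 7B model come to about 28 GB, and each halving of precision halves the footprint, which is why reduced-precision loading can bring such a model within reach of a consumer GPU.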

So how good is the LLaMA model, exactly? Reviewers abroad put it to the test.

LLaMA performed well in many tests.

On massive multitask language understanding, even the relatively small 13B model is on par with GPT-3, which is 13 times its size.

The 33B version is far superior to GPT-3, and the 65B version can compete with the most powerful existing LLM, Google's 540B-parameter PaLM.

For text that has to be processed using logic or calculation, LLaMA performs well: it can compete with PaLM in quantitative reasoning, and its code generation is comparable to, or even better than, PaLM's.

Given these results, LLaMA appears to be one of the most advanced models currently available, and it is small enough that it does not require many resources to run. That makes it very tempting to play with LLaMA and see what it can do.

Explaining Jokes

PaLM's original paper shows a very cool use case: given a joke, have the model explain why it is funny. This task requires a combination of experience and logic, which no model before PaLM could manage.

LLaMA and ChatGPT were each asked to explain a few jokes. The language models do get some of them, such as the one about Schmidhuber's long and boring speech.

But overall, neither LLaMA nor ChatGPT has a sense of humor.

However, the two handle jokes they do not understand differently. ChatGPT generates "a wall of text", hoping that at least some of the sentences are correct, much like a student who does not know the answer and hopes the teacher can find it somewhere in the rambling.

Zero-shot classification

This is a very practical capability: it lets people use an LLM in place of human raters to generate training sets, and then train smaller, deployable models on those training sets.
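The workflow described above can be sketched in a few lines. This is only an illustration under stated assumptions: `llm_label` is a hypothetical stand-in (a keyword rule) for a real LLM call, the sentiment labels and sample texts are invented for the example, and the "small model" is a tiny bag-of-words Naive Bayes classifier rather than anything used in the review.

```python
import math
from collections import Counter, defaultdict


def llm_label(text: str) -> str:
    """Hypothetical stand-in for an LLM call; a real pipeline would prompt a model here."""
    return "positive" if "good" in text or "great" in text else "negative"


def train_naive_bayes(examples):
    """Fit a tiny bag-of-words Naive Bayes model on (text, label) pairs."""
    word_counts = defaultdict(Counter)
    label_counts = Counter()
    for text, label in examples:
        label_counts[label] += 1
        word_counts[label].update(text.lower().split())
    vocab = {w for counts in word_counts.values() for w in counts}
    return word_counts, label_counts, vocab


def predict(model, text: str) -> str:
    """Return the most likely label for the text under the fitted model."""
    word_counts, label_counts, vocab = model
    total = sum(label_counts.values())
    best_label, best_score = None, -math.inf
    for label, n in label_counts.items():
        denom = sum(word_counts[label].values()) + len(vocab)
        score = math.log(n / total)
        for w in text.lower().split():
            # Laplace smoothing so unseen words do not zero out the score.
            score += math.log((word_counts[label][w] + 1) / denom)
        if score > best_score:
            best_label, best_score = label, score
    return best_label


unlabeled = ["great movie", "good food", "bad service",
             "terrible plot", "good plot", "bad food"]
training_set = [(t, llm_label(t)) for t in unlabeled]  # step 1: the "LLM" labels the data
small_model = train_naive_bayes(training_set)          # step 2: train the small model
```

Calling `predict(small_model, "great food")` then classifies new text without any further LLM calls, which is the point of the approach: the expensive model is used once to label data, and the cheap model serves requests.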

A more challenging task is to classify clickbait. Since even humans cannot agree on what counts as clickbait, the model is given some examples in the prompt, so this is in fact few-shot rather than zero-shot classification. LLaMA was prompted in this few-shot style.

In the test, only LLaMA-33B managed to follow the required format when answering, and its predictions were reasonable. ChatGPT came second: it gave fairly reasonable answers, but often not in the prescribed format. The smaller 7B and 13B models are not well suited to the task.
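To make the two ideas above concrete (putting a few labeled examples in the prompt, and checking whether the answer follows a required format), here is a minimal sketch. The prompt wording, the `yes`/`no` label set, and the function names are all assumptions for illustration; the review's actual prompt is not reproduced in this article.

```python
def build_few_shot_prompt(examples, query, labels=("yes", "no")):
    """Assemble a few-shot classification prompt from (text, label) examples."""
    lines = [f"Classify each text. Answer with one of: {', '.join(labels)}.", ""]
    for text, label in examples:
        lines += [f"Text: {text}", f"Label: {label}", ""]
    # The prompt ends with an open "Label:" slot for the model to fill in.
    lines += [f"Text: {query}", "Label:"]
    return "\n".join(lines)


def parse_label(completion: str, allowed=("yes", "no")):
    """Return the label if the first line is exactly an allowed label, else None."""
    lines = completion.strip().splitlines()
    if not lines:
        return None
    first = lines[0].strip().lower()
    return first if first in allowed else None


# Hypothetical examples, invented for this sketch.
examples = [
    ("You won't believe what happened next", "yes"),
    ("Central bank raises interest rates by 0.25%", "no"),
]
prompt = build_few_shot_prompt(examples, "10 tricks doctors don't want you to know")
```

A strict parser like `parse_label` is one way to measure the format-following behavior the test describes: a completion such as `"I think this is probably yes"` contains a plausible answer but returns `None`, because it does not match the prescribed format.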

Code generation

LLMs tend to excel at the humanities but not at STEM subjects, so how does LLaMA perform in this area?

The prompt gives the schema of the table to be queried and the goal to be achieved, and asks the model to produce a SQL query.

ChatGPT performs better in this task, but the results given by language models are generally unreliable.
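Because generated queries are unreliable, it helps to check them mechanically before use. The sketch below shows one possible setup, not the reviewer's: it builds a prompt from a table definition and a goal, then uses Python's built-in `sqlite3` module to test whether a returned query at least compiles against that schema. The `orders` table, the prompt wording, and the function names are hypothetical.

```python
import sqlite3

# Hypothetical table definition used only for this example.
SCHEMA = "CREATE TABLE orders (id INTEGER PRIMARY KEY, customer TEXT, amount REAL);"


def build_sql_prompt(schema: str, goal: str) -> str:
    """Combine the table definition and the goal into a text-to-SQL prompt."""
    return f"Table definition:\n{schema}\n\nTask: {goal}\nSQL query:"


def check_query(schema: str, query: str):
    """Return (True, "") if the query compiles against the schema, else (False, error)."""
    conn = sqlite3.connect(":memory:")
    try:
        conn.executescript(schema)
        # EXPLAIN compiles the statement without running it against real data.
        conn.execute(f"EXPLAIN {query}")
        return True, ""
    except sqlite3.Error as exc:
        return False, str(exc)
    finally:
        conn.close()


prompt = build_sql_prompt(SCHEMA, "Total amount spent per customer")
ok, err = check_query(SCHEMA, "SELECT customer, SUM(amount) FROM orders GROUP BY customer")
bad, bad_err = check_query(SCHEMA, "SELECT total FROM orders")
```

This only catches syntax errors and references to tables or columns that do not exist. A query can pass the check and still answer the wrong question, so results still need a human or a test dataset to confirm them.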

Across the various tests against ChatGPT, LLaMA was not as successful as expected. Of course, if the gap is caused only by RLHF (reinforcement learning from human feedback), then the future of small models may be brighter.
