Home Technology peripherals AI Stanford University releases AI basic model transparency index, Llama 2 ranks first but 'fails'

Stanford University releases AI basic model transparency index, Llama 2 ranks first but 'fails'

Oct 21, 2023 am 08:17 AM

IT House News on October 20th, Stanford University recently released the "Transparency Index" of the AI ​​basic model. The highest display index is Meta's Lama 2, but the related "transparency" is only 54%, so the researchers believe , almost all AI models on the market "lack transparency."

It is reported that this research was led by Rishi Bommasani, head of the HAI Center for Basic Model Research (CRFM), and investigated the 10 most popular basic models overseas:

  • Meta’s Llama 2,
  • BloomZ by BigScience,
  • OpenAI’s GPT-4,
  • Stability AI’s Stable Diffusion,
  • Claude,
  • of Anthropic PBC
  • Google’s PaLM 2,
  • Cohere's Command,
  • Jurassic-2,
  • by AI21 Labs
  • Inflection AI’s Inflection、
  • Amazon’s Titan.

Rishi Bommasani believes that "lack of transparency" has always been a problem faced by the AI ​​industry. In terms of specific model "transparency indicators", IT House found that the relevant evaluation content mainly revolves around "model training data set copyright", "training model "Computing resources used", "Credibility of the content generated by the model", "Model's own capabilities", "Risk of the model being induced to generate harmful content", "User privacy of using the model", etc., totaling 100 items.

The final survey showed that Meta’s Lama 2 topped the list with 54% transparency, while OpenAI’s GPT-4 had only 48% transparency, and Google’s PaLM 2 ranked fifth with 40%.

斯坦福大学发布AI基础模型透明度指标,Llama 2居首但“不及格”

▲ Picture source Stanford University

Among the specific indicators, the top ten models with the "best" score performance are "Model Basics". This evaluation content mainly includes "whether the model, scale, and model of the model are accurately introduced during model training." Architecture" with an average transparency of 63%. The worst performer is Impact, which mainly evaluates whether the basic model will "retrieve user information for evaluation", with an average transparency of only 11%.

CRFM Director Percy Liang said that the "transparency" of the business base model is very important for promoting AI legislation, as well as related industries and academia.

Rishi Bommasani said that lower model transparency makes it more difficult for companies to know whether they can safely rely on relevant models, and for researchers to rely on these models to do research.

Rishi Bommasani ultimately believes that the above ten basic models all "fail" in terms of transparency. Although Meta's Llama 2 has the highest score, it cannot meet the needs of the outside world. "The model transparency must reach at least 82% to be recognized by the outside world. ".

The above is the detailed content of Stanford University releases AI basic model transparency index, Llama 2 ranks first but 'fails'. For more information, please follow other related articles on the PHP Chinese website!

Statement of this Website
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn

Hot AI Tools

Undresser.AI Undress

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress AI Tool

Undress images for free

Clothoff.io

Clothoff.io

AI clothes remover

AI Hentai Generator

AI Hentai Generator

Generate AI Hentai for free.

Hot Article

R.E.P.O. Energy Crystals Explained and What They Do (Yellow Crystal)
2 weeks ago By 尊渡假赌尊渡假赌尊渡假赌
Hello Kitty Island Adventure: How To Get Giant Seeds
1 months ago By 尊渡假赌尊渡假赌尊渡假赌
Two Point Museum: All Exhibits And Where To Find Them
1 months ago By 尊渡假赌尊渡假赌尊渡假赌

Hot Tools

Notepad++7.3.1

Notepad++7.3.1

Easy-to-use and free code editor

SublimeText3 Chinese version

SublimeText3 Chinese version

Chinese version, very easy to use

Zend Studio 13.0.1

Zend Studio 13.0.1

Powerful PHP integrated development environment

Dreamweaver CS6

Dreamweaver CS6

Visual web development tools

SublimeText3 Mac version

SublimeText3 Mac version

God-level code editing software (SublimeText3)

I Tried Vibe Coding with Cursor AI and It's Amazing! I Tried Vibe Coding with Cursor AI and It's Amazing! Mar 20, 2025 pm 03:34 PM

Vibe coding is reshaping the world of software development by letting us create applications using natural language instead of endless lines of code. Inspired by visionaries like Andrej Karpathy, this innovative approach lets dev

How to Use DALL-E 3: Tips, Examples, and Features How to Use DALL-E 3: Tips, Examples, and Features Mar 09, 2025 pm 01:00 PM

DALL-E 3: A Generative AI Image Creation Tool Generative AI is revolutionizing content creation, and DALL-E 3, OpenAI's latest image generation model, is at the forefront. Released in October 2023, it builds upon its predecessors, DALL-E and DALL-E 2

Top 5 GenAI Launches of February 2025: GPT-4.5, Grok-3 & More! Top 5 GenAI Launches of February 2025: GPT-4.5, Grok-3 & More! Mar 22, 2025 am 10:58 AM

February 2025 has been yet another game-changing month for generative AI, bringing us some of the most anticipated model upgrades and groundbreaking new features. From xAI’s Grok 3 and Anthropic’s Claude 3.7 Sonnet, to OpenAI’s G

How to Use YOLO v12 for Object Detection? How to Use YOLO v12 for Object Detection? Mar 22, 2025 am 11:07 AM

YOLO (You Only Look Once) has been a leading real-time object detection framework, with each iteration improving upon the previous versions. The latest version YOLO v12 introduces advancements that significantly enhance accuracy

Elon Musk & Sam Altman Clash over $500 Billion Stargate Project Elon Musk & Sam Altman Clash over $500 Billion Stargate Project Mar 08, 2025 am 11:15 AM

The $500 billion Stargate AI project, backed by tech giants like OpenAI, SoftBank, Oracle, and Nvidia, and supported by the U.S. government, aims to solidify American AI leadership. This ambitious undertaking promises a future shaped by AI advanceme

Google's GenCast: Weather Forecasting With GenCast Mini Demo Google's GenCast: Weather Forecasting With GenCast Mini Demo Mar 16, 2025 pm 01:46 PM

Google DeepMind's GenCast: A Revolutionary AI for Weather Forecasting Weather forecasting has undergone a dramatic transformation, moving from rudimentary observations to sophisticated AI-powered predictions. Google DeepMind's GenCast, a groundbreak

Sora vs Veo 2: Which One Creates More Realistic Videos? Sora vs Veo 2: Which One Creates More Realistic Videos? Mar 10, 2025 pm 12:22 PM

Google's Veo 2 and OpenAI's Sora: Which AI video generator reigns supreme? Both platforms generate impressive AI videos, but their strengths lie in different areas. This comparison, using various prompts, reveals which tool best suits your needs. T

Which AI is better than ChatGPT? Which AI is better than ChatGPT? Mar 18, 2025 pm 06:05 PM

The article discusses AI models surpassing ChatGPT, like LaMDA, LLaMA, and Grok, highlighting their advantages in accuracy, understanding, and industry impact.(159 characters)

See all articles