


Stanford University releases AI basic model transparency index, Llama 2 ranks first but 'fails'
IT House News on October 20th, Stanford University recently released the "Transparency Index" of the AI basic model. The highest display index is Meta's Lama 2, but the related "transparency" is only 54%, so the researchers believe , almost all AI models on the market "lack transparency."
It is reported that this research was led by Rishi Bommasani, head of the HAI Center for Basic Model Research (CRFM), and investigated the 10 most popular basic models overseas:
- Meta’s Llama 2,
- BloomZ by BigScience,
- OpenAI’s GPT-4,
- Stability AI’s Stable Diffusion,
- Claude,
of Anthropic PBC- Google’s PaLM 2,
- Cohere's Command,
- Jurassic-2,
by AI21 Labs- Inflection AI’s Inflection、
- Amazon’s Titan.
Rishi Bommasani believes that "lack of transparency" has always been a problem faced by the AI industry. In terms of specific model "transparency indicators", IT House found that the relevant evaluation content mainly revolves around "model training data set copyright", "training model "Computing resources used", "Credibility of the content generated by the model", "Model's own capabilities", "Risk of the model being induced to generate harmful content", "User privacy of using the model", etc., totaling 100 items.
The final survey showed that Meta’s Lama 2 topped the list with 54% transparency, while OpenAI’s GPT-4 had only 48% transparency, and Google’s PaLM 2 ranked fifth with 40%.
▲ Picture source Stanford University
Among the specific indicators, the top ten models with the "best" score performance are "Model Basics". This evaluation content mainly includes "whether the model, scale, and model of the model are accurately introduced during model training." Architecture" with an average transparency of 63%. The worst performer is Impact, which mainly evaluates whether the basic model will "retrieve user information for evaluation", with an average transparency of only 11%.
CRFM Director Percy Liang said that the "transparency" of the business base model is very important for promoting AI legislation, as well as related industries and academia.
Rishi Bommasani said that lower model transparency makes it more difficult for companies to know whether they can safely rely on relevant models, and for researchers to rely on these models to do research.
Rishi Bommasani ultimately believes that the above ten basic models all "fail" in terms of transparency. Although Meta's Llama 2 has the highest score, it cannot meet the needs of the outside world. "The model transparency must reach at least 82% to be recognized by the outside world. ".
The above is the detailed content of Stanford University releases AI basic model transparency index, Llama 2 ranks first but 'fails'. For more information, please follow other related articles on the PHP Chinese website!

Hot AI Tools

Undresser.AI Undress
AI-powered app for creating realistic nude photos

AI Clothes Remover
Online AI tool for removing clothes from photos.

Undress AI Tool
Undress images for free

Clothoff.io
AI clothes remover

AI Hentai Generator
Generate AI Hentai for free.

Hot Article

Hot Tools

Notepad++7.3.1
Easy-to-use and free code editor

SublimeText3 Chinese version
Chinese version, very easy to use

Zend Studio 13.0.1
Powerful PHP integrated development environment

Dreamweaver CS6
Visual web development tools

SublimeText3 Mac version
God-level code editing software (SublimeText3)

Hot Topics

Vibe coding is reshaping the world of software development by letting us create applications using natural language instead of endless lines of code. Inspired by visionaries like Andrej Karpathy, this innovative approach lets dev

DALL-E 3: A Generative AI Image Creation Tool Generative AI is revolutionizing content creation, and DALL-E 3, OpenAI's latest image generation model, is at the forefront. Released in October 2023, it builds upon its predecessors, DALL-E and DALL-E 2

February 2025 has been yet another game-changing month for generative AI, bringing us some of the most anticipated model upgrades and groundbreaking new features. From xAI’s Grok 3 and Anthropic’s Claude 3.7 Sonnet, to OpenAI’s G

YOLO (You Only Look Once) has been a leading real-time object detection framework, with each iteration improving upon the previous versions. The latest version YOLO v12 introduces advancements that significantly enhance accuracy

The $500 billion Stargate AI project, backed by tech giants like OpenAI, SoftBank, Oracle, and Nvidia, and supported by the U.S. government, aims to solidify American AI leadership. This ambitious undertaking promises a future shaped by AI advanceme

Google DeepMind's GenCast: A Revolutionary AI for Weather Forecasting Weather forecasting has undergone a dramatic transformation, moving from rudimentary observations to sophisticated AI-powered predictions. Google DeepMind's GenCast, a groundbreak

Google's Veo 2 and OpenAI's Sora: Which AI video generator reigns supreme? Both platforms generate impressive AI videos, but their strengths lie in different areas. This comparison, using various prompts, reveals which tool best suits your needs. T

The article discusses AI models surpassing ChatGPT, like LaMDA, LLaMA, and Grok, highlighting their advantages in accuracy, understanding, and industry impact.(159 characters)
