Table of Contents
‍Stable Diffusion: A Milestone for Open Source Models
Home Technology peripherals AI This model obsessed former Tesla AI Director Andrej Karpathy for three days!

This model obsessed former Tesla AI Director Andrej Karpathy for three days!

Apr 09, 2023 pm 04:51 PM
ai Model

Recently, an AI researcher and digital artist named Xander Steenbrugge uploaded a very shocking video "Journey Across Time".

The great evolution of life on earth, starting from the primitive ocean, to ancient lizards, dinosaurs, mammals, to monkeys, orangutans, apes, homo sapiens... and finally science fiction appeared in the future world.

And Steenbrugge also commented excitedly: We are crossing a threshold where generative artificial intelligence is no longer just about novel aesthetics, but has evolved into an amazing tool that can build Powerful, human-centered narrative. Steenbrugge said the video used 36 consecutive phrases. To find the best possible order, he tried over a thousand different hints and seeds, and applied many "hint engineering" techniques to the code to figure out what worked and what didn't.

This model obsessed former Tesla AI Director Andrej Karpathy for three days!

Former Tesla AI Senior Director Andrej Karpathy was so shocked after reading it that he couldn't help but give it a try.

"Surreal steampunk neural network machine, in the shape of a brain, placed on a base covered with neurons made of gears", while typing this text Later, his masterpiece was produced.

2 minute video (~1 hour rendered on A100), generated by smooth interpolation between models with random noise input.

This model, called Stable Diffusion, uses a method of "interpolation" between the meanings of two sentences. The insertion is a semantic gap rather than a visual one, so it dramatically changes the way the story is told.

And this is just the beginning of the digital content creation revolution driven by generative artificial intelligence.

‍Stable Diffusion: A Milestone for Open Source Models

Since the beginning of 2021, artificial intelligence that can generate images from text descriptions has been developing rapidly. At that time, OpenAI demonstrated impressive results with DALL-E 1 and CLIP.

In 2022, OpenAI released the impressive DALL-E 2, Google showed off Imagen and Parti, Midjourney launched a public beta, and Craiyon created AI images everywhere kind of social media.

And recently, Stability AI has released a brand new model-Stable Diffusion.

This model obsessed former Tesla AI Director Andrej Karpathy for three days!

Unlike DALL-E 2, however, Stable Diffusion can generate a variety of famous figures banned by OpenAI.

Although systems such as Midjourney, Pixelz.ai, etc. can also do it, the quality of their generation is far from comparable to Stable Diffusion. , and none of them are open source.

Now we invite our old friend Musk to perform a second-second transformation into "Black Widow" (Scarlett Johansson).

This model obsessed former Tesla AI Director Andrej Karpathy for three days!

Apparently, Stability AI, as the lead developer, hopes that more than one company or team will be able to train variants of Stable Diffusion.

For example, you are a researcher who does not have massive GPU computing power. Don't worry, Stable Diffusion will soon be able to run on a graphics card with only 5.1GB VRAM.

For another example, if you are a fan of MacBook equipped with Apple M1 chip, Stable Diffusion can also run. However, the time for image generation will change from seconds to minutes.

It seems that multimodal models are heading down the path that large language models have taken before: moving away from a single vendor and making numerous alternatives widely available through open source.

This model obsessed former Tesla AI Director Andrej Karpathy for three days!

In addition, Runway is already studying how to implement text-to-video editing based on Stable Diffusion.

In a word, let the tennis court become a beach;

This model obsessed former Tesla AI Director Andrej Karpathy for three days!

No matter the lightning and thunder, or the wind and sunshine;

Whether it’s on the moon or on Mars;

This model obsessed former Tesla AI Director Andrej Karpathy for three days!

##can’t stop me from playing tennis...

Using Compute to Accelerate Basic AI Open Source

After the testing phase, Stable Diffusion will be free and the code and trained models will be released as open source. There will also be a hosted version with a web interface for users to test the system.

Stable Diffusion is the result of a collaboration between researchers at Stability AI, RunwayML, LMU Munich, EleutherAI and LAION. EleutherAI is famous for its open source language models GPT-J-6B and GPT-NeoX-20B, etc.

The non-profit organization LAION (Open Network for Large-Scale Artificial Intelligence) provided the open-source LAION 5B dataset for training data, which the team filtered based on human feedback during the initial testing phase to Create the final LAION-Aesthetics training dataset.

Patrick Esser of Runway and Robin Rombach of LMU Munich led the project, whose research in the CompVis group at Heidelberg University laid the foundation for the project. At Heidelberg University they created the widely used VQGAN and Latent Diffusion. These two models, coupled with the results of OpenAI and Google Brain, enable Stable Diffusion.

Stability AI was founded in 2020, and the investor behind it is mathematician and computer scientist Emad Mostaque. He has worked as an analyst at various hedge funds.

With Stability AI and his private wealth, Mostaque hopes to cultivate an open source AI research community. His startup has previously supported the creation of the “LAION 5B” data set. To train Stable Diffusion's models, Stability AI provided servers with 4,000 Nvidia A100 GPUs.

"No one else has decision-making power except our 75 employees - not billionaires, large funds, or governments. We are completely independent." Mostaque said. "We plan to use our computing to accelerate the open source of basic artificial intelligence."

Netizens are going crazy

In addition to the two videos at the beginning, some netizens also use Stable Diffusion generates an aging animation.

He said changing a word in a long prompt during production had a more subtle effect than in a short prompt. Additionally, using descriptions such as old, middle-aged, or young often works better than stating specific ages such as 10, 40, or 80.

This model obsessed former Tesla AI Director Andrej Karpathy for three days!

In addition, there are various dreamy still pictures.

This model obsessed former Tesla AI Director Andrej Karpathy for three days!

「Lover」

This model obsessed former Tesla AI Director Andrej Karpathy for three days!

「Lion」

This model obsessed former Tesla AI Director Andrej Karpathy for three days!

"Dark Soul" written by Leyendecker and Maxence

This model obsessed former Tesla AI Director Andrej Karpathy for three days!

"Cyber ​​Peking Opera"

Easter Egg

Feel the tortured mental journey of Andrej Karpathy, Tesla’s former artificial intelligence and autonomous driving vision director.

This model obsessed former Tesla AI Director Andrej Karpathy for three days!

After watching the video made by the master, Karpathy was very excited.

#The results achieved through a wave of operations will not feel out of place even in a small kindergarten class. Desperate Karpathy said that he wanted to hire a "prompt engineer" to create for him. FinallyKarpathy finally came to his senses and created new works that he was very satisfied with - steampunk artificial neural network machine and blueberry pasta.

The above is the detailed content of This model obsessed former Tesla AI Director Andrej Karpathy for three days!. For more information, please follow other related articles on the PHP Chinese website!

Statement of this Website
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn

Hot AI Tools

Undresser.AI Undress

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress AI Tool

Undress images for free

Clothoff.io

Clothoff.io

AI clothes remover

Video Face Swap

Video Face Swap

Swap faces in any video effortlessly with our completely free AI face swap tool!

Hot Tools

Notepad++7.3.1

Notepad++7.3.1

Easy-to-use and free code editor

SublimeText3 Chinese version

SublimeText3 Chinese version

Chinese version, very easy to use

Zend Studio 13.0.1

Zend Studio 13.0.1

Powerful PHP integrated development environment

Dreamweaver CS6

Dreamweaver CS6

Visual web development tools

SublimeText3 Mac version

SublimeText3 Mac version

God-level code editing software (SublimeText3)

Hot Topics

Java Tutorial
1664
14
PHP Tutorial
1268
29
C# Tutorial
1242
24
How to understand DMA operations in C? How to understand DMA operations in C? Apr 28, 2025 pm 10:09 PM

DMA in C refers to DirectMemoryAccess, a direct memory access technology, allowing hardware devices to directly transmit data to memory without CPU intervention. 1) DMA operation is highly dependent on hardware devices and drivers, and the implementation method varies from system to system. 2) Direct access to memory may bring security risks, and the correctness and security of the code must be ensured. 3) DMA can improve performance, but improper use may lead to degradation of system performance. Through practice and learning, we can master the skills of using DMA and maximize its effectiveness in scenarios such as high-speed data transmission and real-time signal processing.

How to use the chrono library in C? How to use the chrono library in C? Apr 28, 2025 pm 10:18 PM

Using the chrono library in C can allow you to control time and time intervals more accurately. Let's explore the charm of this library. C's chrono library is part of the standard library, which provides a modern way to deal with time and time intervals. For programmers who have suffered from time.h and ctime, chrono is undoubtedly a boon. It not only improves the readability and maintainability of the code, but also provides higher accuracy and flexibility. Let's start with the basics. The chrono library mainly includes the following key components: std::chrono::system_clock: represents the system clock, used to obtain the current time. std::chron

Quantitative Exchange Ranking 2025 Top 10 Recommendations for Digital Currency Quantitative Trading APPs Quantitative Exchange Ranking 2025 Top 10 Recommendations for Digital Currency Quantitative Trading APPs Apr 30, 2025 pm 07:24 PM

The built-in quantization tools on the exchange include: 1. Binance: Provides Binance Futures quantitative module, low handling fees, and supports AI-assisted transactions. 2. OKX (Ouyi): Supports multi-account management and intelligent order routing, and provides institutional-level risk control. The independent quantitative strategy platforms include: 3. 3Commas: drag-and-drop strategy generator, suitable for multi-platform hedging arbitrage. 4. Quadency: Professional-level algorithm strategy library, supporting customized risk thresholds. 5. Pionex: Built-in 16 preset strategy, low transaction fee. Vertical domain tools include: 6. Cryptohopper: cloud-based quantitative platform, supporting 150 technical indicators. 7. Bitsgap:

How to handle high DPI display in C? How to handle high DPI display in C? Apr 28, 2025 pm 09:57 PM

Handling high DPI display in C can be achieved through the following steps: 1) Understand DPI and scaling, use the operating system API to obtain DPI information and adjust the graphics output; 2) Handle cross-platform compatibility, use cross-platform graphics libraries such as SDL or Qt; 3) Perform performance optimization, improve performance through cache, hardware acceleration, and dynamic adjustment of the details level; 4) Solve common problems, such as blurred text and interface elements are too small, and solve by correctly applying DPI scaling.

What is real-time operating system programming in C? What is real-time operating system programming in C? Apr 28, 2025 pm 10:15 PM

C performs well in real-time operating system (RTOS) programming, providing efficient execution efficiency and precise time management. 1) C Meet the needs of RTOS through direct operation of hardware resources and efficient memory management. 2) Using object-oriented features, C can design a flexible task scheduling system. 3) C supports efficient interrupt processing, but dynamic memory allocation and exception processing must be avoided to ensure real-time. 4) Template programming and inline functions help in performance optimization. 5) In practical applications, C can be used to implement an efficient logging system.

How to use string streams in C? How to use string streams in C? Apr 28, 2025 pm 09:12 PM

The main steps and precautions for using string streams in C are as follows: 1. Create an output string stream and convert data, such as converting integers into strings. 2. Apply to serialization of complex data structures, such as converting vector into strings. 3. Pay attention to performance issues and avoid frequent use of string streams when processing large amounts of data. You can consider using the append method of std::string. 4. Pay attention to memory management and avoid frequent creation and destruction of string stream objects. You can reuse or use std::stringstream.

How to measure thread performance in C? How to measure thread performance in C? Apr 28, 2025 pm 10:21 PM

Measuring thread performance in C can use the timing tools, performance analysis tools, and custom timers in the standard library. 1. Use the library to measure execution time. 2. Use gprof for performance analysis. The steps include adding the -pg option during compilation, running the program to generate a gmon.out file, and generating a performance report. 3. Use Valgrind's Callgrind module to perform more detailed analysis. The steps include running the program to generate the callgrind.out file and viewing the results using kcachegrind. 4. Custom timers can flexibly measure the execution time of a specific code segment. These methods help to fully understand thread performance and optimize code.

An efficient way to batch insert data in MySQL An efficient way to batch insert data in MySQL Apr 29, 2025 pm 04:18 PM

Efficient methods for batch inserting data in MySQL include: 1. Using INSERTINTO...VALUES syntax, 2. Using LOADDATAINFILE command, 3. Using transaction processing, 4. Adjust batch size, 5. Disable indexing, 6. Using INSERTIGNORE or INSERT...ONDUPLICATEKEYUPDATE, these methods can significantly improve database operation efficiency.

See all articles