Learning like a baby, DeepMind's new model learns the rules of the physical world in 28 hours

Table of Contents

Using knowledge from developmental psychology

Method Introduction

PLATO model architecture

Experimental results

WBOY

Apr 09, 2023 am 11:41 AM

DeepMind aims to build a model that can learn intuitive physics, and to analyze why the model achieves this ability.

From AlphaFold to mathematical reasoning, DeepMind has been trying to combine AI and basic science. Now, DeepMind has created a new model that can learn simple physical rules.

Developmental psychologists test and analyze how babies follow the motion of objects with their gaze. For example, infants show surprise when they watch a video in which a ball suddenly disappears.

Computer scientist Luis Piloto of DeepMind and colleagues hope to develop similar tests for artificial intelligence (AI). The team trained a neural network on animated videos of simple objects such as cubes and balls, and the model learned by discovering patterns in large amounts of data. The research paper was published on July 11, 2022 in Nature Human Behaviour.

Paper address: https://www.nature.com/articles/s41562-022-01394-8
Dataset address: https://github.com/deepmind/physical_concepts

The model learns physics by automatically encoding and tracking objects, hence the name PLATO (Physics Learning through Auto-encoding and Tracking Objects). PLATO receives the raw image from the video together with a version of the image that highlights each object in the scene. PLATO aims to develop internal representations of the physical properties of objects, such as their position and velocity.

The system was trained on approximately 30 hours of videos showing simple motion mechanisms (such as a ball rolling down a slope) and developed the ability to predict how these objects would behave in different situations. In particular, PLATO learned patterns such as continuity, in which an object follows an uninterrupted trajectory, solidity, and the persistence of an object's shape. As the video plays, the model's predictions become more accurate.

When playing videos with "impossible" events, such as an object suddenly disappearing, PLATO can measure the difference between the video and its own predictions, thus providing a measure of "surprise."
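The surprise measure described above can be sketched as a prediction error accumulated over a video. This is a minimal illustration, not PLATO's implementation: frames are assumed to be flat lists of pixel intensities, the error is assumed to be a sum of squared differences, and the names `frame_error` and `video_surprise` are hypothetical.

```python
def frame_error(predicted, observed):
    """Sum of squared pixel differences between two equally sized frames."""
    if len(predicted) != len(observed):
        raise ValueError("frames must have the same number of pixels")
    return sum((p - o) ** 2 for p, o in zip(predicted, observed))


def video_surprise(predicted_frames, observed_frames):
    """Total prediction error over a video (an assumed surprise measure)."""
    return sum(frame_error(p, o) for p, o in zip(predicted_frames, observed_frames))


# Toy 4-pixel video: the model predicts a bright object in the second pixel.
predicted = [[0.0, 1.0, 0.0, 0.0]] * 3
possible = [[0.0, 1.0, 0.0, 0.0]] * 3   # object stays, as predicted
impossible = [
    [0.0, 1.0, 0.0, 0.0],
    [0.0, 0.0, 0.0, 0.0],               # object vanishes
    [0.0, 0.0, 0.0, 0.0],
]

surprise_possible = video_surprise(predicted, possible)
surprise_impossible = video_surprise(predicted, impossible)
```

In this toy case the video in which the object vanishes accumulates a larger error than the one that matches the prediction, which is the sense in which the difference acts as "surprise."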

Piloto said: "PLATO was not designed as a model of infant behavior, but it can test hypotheses about how human infants learn. We hope that cognitive scientists will eventually use it to simulate infant behavior."

Jeff Clune, a computer scientist at the University of British Columbia, said, "Comparing AI with the learning methods of human infants is an important research direction. PLATO researchers hand-designed much of the prior knowledge that gives the artificial intelligence model advantages." Researchers like Clune are trying to let programs develop their own algorithms to understand the physical world.

Using knowledge from developmental psychology

In order to pursue richer physical intuition in AI systems, DeepMind’s research team draws inspiration from developmental psychology. The research team built a deep learning system that incorporates a core insight from developmental psychology, namely that physics is understood at the level of discrete objects and their interactions.

Intuitive physics rests on a discrete set of concepts (e.g., object persistence, solidity, continuity) that can be distinguished, manipulated, and probed individually. Standard approaches to learning intuitive physics in AI learn about the physical world through video or state prediction, binary outcome prediction, question-answering performance, or reinforcement learning tasks. These approaches appear to require understanding some aspects of intuitive physics, but they do not explicitly operationalize or strategically probe a clear set of concepts.

Developmental psychology, on the other hand, holds that a physical concept corresponds to a set of expectations about how the future will unfold. For example, people expect that objects will not suddenly teleport from one place to another, but will trace a continuous path through time and space, which leads to the concept of continuity. This gives a way to measure knowledge of specific physical concepts: the Violation of Expectations (VoE) paradigm.

When exploring a specific concept using the VoE paradigm, researchers show infants visually similar arrays (called probes) that are either consistent (physically possible) or inconsistent (physically impossible) with the physical concept. In this paradigm, “surprise” is measured by gaze duration.
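The paired structure of a VoE test can be sketched as a difference of surprise scores. The sketch below is an assumption-laden illustration: the scores are invented numbers, the score could stand for gaze duration in infants or a prediction error in a model, and `voe_effect` is a hypothetical helper rather than a function from any published code.

```python
from statistics import mean


def voe_effect(probe_pairs):
    """Mean of (surprise on impossible probe - surprise on possible probe)."""
    if not probe_pairs:
        raise ValueError("need at least one probe pair")
    return mean(impossible - possible for possible, impossible in probe_pairs)


# Hypothetical surprise scores for three matched probe pairs: (possible, impossible).
pairs = [(1.0, 3.0), (2.0, 2.5), (0.5, 2.0)]
effect = voe_effect(pairs)
shows_voe = effect > 0  # a positive mean difference suggests a VoE effect
```

Because each possible probe is compared with a visually similar impossible one, a positive difference is attributable to the violated concept rather than to surface differences between the videos.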

Method Introduction

First, DeepMind introduced a rich video corpus, the Physical Concepts dataset. This dataset contains VoE probe videos targeting five physical concepts regarded as core elements in developmental psychology. The first three are continuity, object persistence, and solidity. The fourth is unchangeableness, which captures the idea that certain object properties (such as shape) do not change. The fifth is directional inertia, which involves the expectation that a moving object changes direction in a way consistent with the principles of inertia.

Most importantly, the Physical Concepts dataset also includes a separate video corpus as training data. These videos show a variety of procedurally generated physical events.

Figure 2: Example of video dataset used to train the model

PLATO model architecture

DeepMind aims to build a model that learns intuitive physics, and to analyze why the model achieves this capability. The PLATO model instantiates several components found in advanced AI systems.

The first is object individuation. Object individuation carves the continuous perceptual input of vision into a set of discrete entities, where each entity has a corresponding set of attributes. In PLATO, each segmented video frame is decomposed by the perception module into a set of object codes (Figure 3a-c), enabling a mapping from visual input to individual objects. PLATO does not learn to segment the scene; rather, given a segmentation mask for each object, it learns a compressed representation.
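The decomposition of a segmented frame into discrete entities can be sketched as follows. This is only an illustration under stated assumptions: PLATO learns a compressed code for each entity, whereas this sketch substitutes a few hand-computed attributes (area, centroid, mean intensity), and the function name `individuate` and the data layout are hypothetical.

```python
def individuate(frame, masks):
    """Split one frame into per-object entities using given segmentation masks.

    frame: 2D list of pixel intensities.
    masks: dict mapping an object id to a 2D list of 0/1 flags, same shape as frame.
    Returns a dict mapping object id to a small attribute dict
    (a hand-coded stand-in for a learned code).
    """
    entities = {}
    for obj_id, mask in masks.items():
        pixels = [(r, c) for r, row in enumerate(mask)
                  for c, flag in enumerate(row) if flag]
        if not pixels:
            continue  # object not visible in this frame
        count = len(pixels)
        entities[obj_id] = {
            "area": count,
            "centroid": (sum(r for r, _ in pixels) / count,
                         sum(c for _, c in pixels) / count),
            "mean_intensity": sum(frame[r][c] for r, c in pixels) / count,
        }
    return entities


frame = [[0.0, 0.9, 0.9],
         [0.0, 0.0, 0.0],
         [0.4, 0.0, 0.0]]
masks = {
    "ball": [[0, 1, 1], [0, 0, 0], [0, 0, 0]],
    "cube": [[0, 0, 0], [0, 0, 0], [1, 0, 0]],
}
entities = individuate(frame, masks)
```

The segmentation is supplied as input rather than computed, mirroring the point that the masks are given and only the per-entity representation is derived from them.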

Second, object tracking (or object indexing) assigns an index to each object, establishing the correspondence between object percepts across time and allowing dynamic attributes to be computed (Figure 3b, c). In PLATO, object codes are accumulated and tracked over frames in the object buffer (Figure 3d).

The last component is relational processing of these tracked objects. This process is inspired by the "physical reasoning system" proposed in developmental psychology, which dynamically processes representations of objects, generating new representations that are influenced by each object's relationships and interactions with other objects.

PLATO learns interactions between object memory and the history of object percepts (Figure 3d) to predict the next video frame for each object and to update the object-based memory.
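The idea of indexing each entity and accumulating its codes over frames can be sketched with a small buffer. The details here are assumptions made for illustration: codes are taken to be plain (x, y) positions rather than learned vectors, indices are assumed to be supplied consistently across frames, and `ObjectBuffer` with its methods is a hypothetical class, not PLATO's code.

```python
from collections import defaultdict


class ObjectBuffer:
    """Keeps a per-object history of codes, keyed by a stable object index."""

    def __init__(self):
        self._history = defaultdict(list)

    def add_frame(self, object_codes):
        """object_codes: dict mapping object index -> code for the current frame."""
        for index, code in object_codes.items():
            self._history[index].append(code)

    def history(self, index):
        """All codes seen so far for one object, oldest first."""
        return list(self._history.get(index, []))

    def velocity(self, index):
        """Displacement between the last two codes, or None with fewer than two."""
        codes = self._history.get(index, [])
        if len(codes) < 2:
            return None
        (x0, y0), (x1, y1) = codes[-2], codes[-1]
        return (x1 - x0, y1 - y0)


buffer = ObjectBuffer()
buffer.add_frame({0: (0.0, 0.0), 1: (5.0, 5.0)})
buffer.add_frame({0: (1.0, 0.5), 1: (5.0, 5.0)})
ball_velocity = buffer.velocity(0)
cube_velocity = buffer.velocity(1)
```

Keeping the same index for the same entity across frames is what makes a dynamic attribute such as velocity computable at all: it is a difference between two codes that are known to belong to one object.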

Figure 3: PLATO includes two components: a perception module (left) and a dynamics predictor (right)

Experimental results

When tested, PLATO, trained with five different random seeds, showed robust VoE effects in all five probe categories.

Figure 5: PLATO shows robust performance on the probes of the Physical Concepts dataset.

The training corpus in the Physical Concepts dataset contains a total of 300,000 videos. By a conservative calculation, that is approximately 52 days of continuous visual experience. From both an AI and a developmental perspective, this raises the question of how much training data is actually needed to produce a VoE effect at test time. To evaluate this, DeepMind trained three random seeds of PLATO's dynamics predictor on datasets of decreasing size (Figure 6), calculating a grand average of the VoE effects across all five probe categories.

The results show robust VoE effects in DeepMind's model after training on as few as 50,000 examples (equivalent to 28 hours of visual experience).

Figure 6: PLATO shows powerful results in just 28 hours of visual experience.

Generalization testing: DeepMind used the ADEPT dataset, which is designed to probe intuitive physical knowledge. As shown in Figure 7, PLATO shows clear VoE effects for all three probe categories.

Figure 7: PLATO demonstrates robust effects on unseen objects and dynamics without any retraining.

For more information, please view the original paper.
