How to improve storage and transmission efficiency? Parameter-Efficient Masking Networks show significant results

王林

Apr 12, 2023 pm 05:10 PM

To handle increasingly complex tasks, neural networks have grown steadily in scale in recent years, so storing and transmitting them efficiently has become very important. At the same time, since the Lottery Ticket Hypothesis (LTH) was proposed, random sparse neural networks have shown strong potential. How to use this potential to improve the storage and transmission efficiency of networks is also worth exploring.

Researchers from Northeastern University and Rochester Institute of Technology proposed Parameter-Efficient Masking Networks (PEMN). The authors first explore the representational capability of random networks generated from a limited number of random numbers. Experiments show that even when a network is generated from a limited number of random numbers, it still has good representational capability if a suitable sub-network structure is selected.

Building on this exploratory experiment, the authors propose using a limited number of random numbers as a prototype, combined with a set of masks, to express a neural network. Because a limited number of random numbers and binary masks occupy very little storage space, the authors use this to propose a new idea for network compression. The paper has been accepted to NeurIPS 2022, and the code has been open sourced.

Paper address: https://arxiv.org/abs/2210.06699
Paper code: https://github.com/yueb17/PEMN

1. Related research

MIT researchers proposed the Lottery Ticket Hypothesis (ICLR'19): a randomly initialized network contains a sub-network (a winning ticket) that achieves good results when trained alone. The Lottery Ticket Hypothesis explores the trainability of random sparse networks. Uber researchers proposed Supermask (NeurIPS'19): a randomly initialized network contains a sub-network that can be used directly for inference without training. Supermask explores the usability of random sparse networks. Researchers at the University of Washington proposed Edge-Popup (CVPR'20), which learns the mask of the sub-network through backpropagation and greatly improves the usability of random sparse networks.
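The idea of selecting a sub-network from frozen random weights can be illustrated with a minimal sketch. It assumes each weight carries a score and that the mask keeps the highest-scoring fraction of weights, in the spirit of Edge-Popup. The names `topk_mask`, `masked_forward` and `keep_ratio` are hypothetical, and the backpropagation step that learns the scores is omitted.

```python
def topk_mask(scores, keep_ratio):
    """Return a 0/1 mask that keeps the highest-scoring fraction of weights."""
    k = max(1, int(len(scores) * keep_ratio))
    # Indices of the k largest scores.
    top = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)[:k]
    mask = [0] * len(scores)
    for i in top:
        mask[i] = 1
    return mask


def masked_forward(weights, mask, inputs):
    """One linear unit: the random weights stay frozen and the mask gates them."""
    return sum(w * m * x for w, m, x in zip(weights, mask, inputs))


# Illustrative values only (not taken from the paper).
scores = [0.1, 0.9, 0.4, 0.7]
weights = [1.0, -2.0, 3.0, 0.5]
mask = topk_mask(scores, keep_ratio=0.5)
output = masked_forward(weights, mask, [1.0, 1.0, 1.0, 1.0])
```

In this sketch only the scores would change during training; the weights themselves are never updated, which is why the choice of mask alone determines what the network represents.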

2. Research motivation/process

The related research above explores the potential of random sparse networks from different angles, such as trainability and usability, where usability can also be understood as representational ability. In this work, the authors are interested in how much a neural network generated from random numbers can represent without training its weights. Exploring this question led the authors to propose Parameter-Efficient Masking Networks (PEMN). The authors then use PEMN to provide a new idea for network compression, which serves as an example of a potential application scenario for PEMN.

3. Explore the representation ability of neural networks composed of random numbers

Given a random network, the authors use the Edge-Popup algorithm to select subnetworks and thereby explore its representation ability. The difference is that instead of randomly initializing the entire network, the authors propose three parameter-efficient network generation strategies that build a random network from a prototype.

One-layer: Select the weights of a repeated structure in the network as the prototype and use them to fill the other network layers with the same structure.
Max-layer padding (MP): Select the network layer with the largest number of parameters as the prototype and truncate it to the required number of parameters to fill the other network layers.
Random vector padding (RP): Select a random vector of a certain length as the prototype and copy it repeatedly to fill the entire network.

The three random network generation strategies progressively reduce the number of unique values in the network. We select subnetworks from the random networks obtained by the different strategies, thereby exploring the representational potential of random networks generated from a limited number of random numbers.
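The MP and RP strategies described above can be sketched in a few lines. This is a minimal illustration and not the authors' implementation: it assumes each layer is flattened into a list of weights, that the prototype is drawn from a standard normal distribution, and that RP continues tiling across layer boundaries. The function names are hypothetical, and the One-layer strategy is not shown.

```python
import random


def make_prototype(length, seed=0):
    """Draw the prototype: a short vector of random values (assumed Gaussian)."""
    rng = random.Random(seed)
    return [rng.gauss(0.0, 1.0) for _ in range(length)]


def random_vector_padding(prototype, layer_sizes):
    """RP: copy the prototype repeatedly until every layer is filled."""
    layers, pos = [], 0
    for size in layer_sizes:
        layers.append([prototype[(pos + i) % len(prototype)] for i in range(size)])
        pos += size  # continue tiling where the previous layer stopped
    return layers


def max_layer_padding(layer_sizes, seed=0):
    """MP: the largest layer is the prototype; other layers take a truncated copy."""
    prototype = make_prototype(max(layer_sizes), seed)
    return [prototype[:size] for size in layer_sizes]


# Illustrative flattened layer sizes (hypothetical).
layer_sizes = [6, 10, 4]
rp_layers = random_vector_padding(make_prototype(4), layer_sizes)
mp_layers = max_layer_padding(layer_sizes)
rp_unique = len({w for layer in rp_layers for w in layer})
mp_unique = len({w for layer in mp_layers for w in layer})
```

With these sizes the MP network holds at most 10 distinct values and the RP network at most 4, which mirrors how the strategies progressively shrink the number of unique values.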

[Figure: CIFAR10 classification accuracy of ConvMixer and ViT under the different random network generation strategies]

The figure above shows the results of CIFAR10 image classification using the ConvMixer and ViT networks. The Y-axis is accuracy, and the X-axis is the random network obtained with each strategy. As the

From the experimental results, we observe that even when the random network contains only a very limited number of non-repeating random numbers (such as RP_1e-3), the selected subnetwork still maintains its representation ability well. At this point, the authors have explored the representation ability of neural networks composed of a limited number of random numbers through different random network generation strategies, and have observed that even when the non-repeating random numbers are very limited, the corresponding random network can still represent the data well.

Based on these random network generation strategies, combined with the obtained subnetwork masks, the authors propose a new type of neural network called Parameter-Efficient Masking Networks (PEMN).

4. A new network compression idea

The paper chooses neural network compression as an example to explore the potential applications of PEMN. Specifically, the random network generation strategies proposed in the paper can efficiently represent a complete random network with a prototype, especially the most fine-grained strategy, random vector padding (RP).

The authors use the random vector prototype from the RP strategy and a corresponding set of subnetwork masks to represent a random network. The prototype must be saved in floating-point format, while the masks only need to be saved in binary format. Because the prototype in RP can be very short (a limited number of non-repeating random numbers still has strong representation ability), the overhead of representing a neural network becomes very small: it consists of storing one random vector of limited length in floating-point format and one set of masks in binary format. In contrast to traditional sparse networks, which store the floating-point values of the subnetwork, this gives a new network compression idea for storing and transmitting neural networks efficiently.
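The storage argument above can be made concrete with a rough back-of-the-envelope sketch. It assumes 32-bit floats, one mask bit per weight, and a sparse baseline that stores its kept values as floats plus a one-bit-per-weight position mask. These encoding choices and all function names are assumptions for illustration, not the accounting used in the paper.

```python
def pack_mask(mask):
    """Pack a list of 0/1 values into bytes, 8 mask bits per byte."""
    out = bytearray((len(mask) + 7) // 8)
    for i, bit in enumerate(mask):
        if bit:
            out[i // 8] |= 1 << (i % 8)
    return bytes(out)


def unpack_mask(packed, n):
    """Recover the first n mask bits from the packed bytes."""
    return [(packed[i // 8] >> (i % 8)) & 1 for i in range(n)]


def pemn_bits(num_weights, prototype_len, float_bits=32):
    """Prototype stored as floats plus one mask bit per weight."""
    return prototype_len * float_bits + num_weights


def sparse_bits(num_weights, keep_ratio, float_bits=32):
    """Kept weights stored as floats plus one position bit per weight (assumed)."""
    return int(num_weights * keep_ratio) * float_bits + num_weights


# Hypothetical network: one million weights, prototype of 1,000 values.
mask = [1, 0, 1, 1, 0, 0, 0, 0, 1]
packed = pack_mask(mask)
pemn_cost = pemn_bits(1_000_000, prototype_len=1_000)
sparse_cost = sparse_bits(1_000_000, keep_ratio=0.25)
```

Under these assumptions the mask dominates the PEMN cost, while the sparse baseline is dominated by the floating-point values it keeps, which is where the saving comes from.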

[Figure: PEMN network compression compared with traditional network pruning, ResNet on CIFAR]

In the figure above, the authors use PEMN to compress the network and compare it with traditional network pruning methods. The experiment uses ResNet networks for image classification on the CIFAR datasets. We observe that the new compression scheme generally performs better than traditional network pruning. In particular, PEMN still maintains good accuracy at very high compression rates.

5. Conclusion

Inspired by the potential recently demonstrated by random networks, this paper proposes different parameter-efficient strategies for constructing random neural networks, explores the representation potential of random neural networks generated from only a limited number of non-repeating random numbers, and proposes Parameter-Efficient Masking Networks (PEMN). The authors apply PEMN to network compression to explore its potential in practical applications, providing a new idea for network compression. Extensive experiments show that even when a random network contains only a very limited number of non-repeating random numbers, it still has good representation capability through the selection of subnetworks. In addition, experiments show that the proposed method achieves better network compression than traditional pruning algorithms, verifying the application potential of PEMN in this scenario.
