How to implement data compression and decompression algorithms in C++?
How to implement data compression and decompression algorithms in C?
Abstract: Data compression and decompression is one of the most important technologies in the computer field. This article will introduce how to use C to implement data compression and decompression algorithms, and provide code examples for readers' reference.
1. Data compression algorithm
The data compression algorithm can encode a large amount of data to reduce the occupation of storage space and transmission bandwidth. In C, we can use Huffman coding and LZ77 algorithm to achieve data compression.
1.1 Huffman coding
Huffman coding is a frequency-based data compression algorithm. It assigns a shorter code to each character based on the frequency of data occurrence to achieve the purpose of compressing data.
The sample code is as follows:
#include<iostream> #include<queue> #include<string> #include<unordered_map> using namespace std; // Huffman树的节点 struct Node { char ch; int freq; Node* left; Node* right; }; // 用于比较树节点的优先队列 class Compare { public: bool operator() (Node* a, Node* b) { return a->freq > b->freq; } }; // 生成Huffman树 Node* generateHuffmanTree(string text) { // 统计每个字符出现的频率 unordered_map<char, int> freqTable; for (char ch : text) { freqTable[ch]++; } // 将频率和字符转换为Huffman树节点 priority_queue<Node*, vector<Node*>, Compare> pq; for (auto it = freqTable.begin(); it != freqTable.end(); it++) { Node* node = new Node(); node->ch = it->first; node->freq = it->second; node->left = nullptr; node->right = nullptr; pq.push(node); } // 构建Huffman树 while (pq.size() > 1) { Node* left = pq.top(); pq.pop(); Node* right = pq.top(); pq.pop(); Node* parent = new Node(); parent->ch = ''; parent->freq = left->freq + right->freq; parent->left = left; parent->right = right; pq.push(parent); } return pq.top(); } // 生成Huffman编码表 void generateHuffmanCodeTable(Node* root, string code, unordered_map<char, string>& codeTable) { if (root == nullptr) { return; } if (root->ch != '') { codeTable[root->ch] = code; } generateHuffmanCodeTable(root->left, code + "0", codeTable); generateHuffmanCodeTable(root->right, code + "1", codeTable); } // 压缩数据 string compressData(string text, unordered_map<char, string>& codeTable) { string compressedData; for (char ch : text) { compressedData += codeTable[ch]; } return compressedData; } int main() { string text = "Hello, World!"; Node* root = generateHuffmanTree(text); unordered_map<char, string> codeTable; generateHuffmanCodeTable(root, "", codeTable); string compressedData = compressData(text, codeTable); cout << "Compressed Data: " << compressedData << endl; return 0; }
1.2 LZ77 algorithm
The LZ77 algorithm is a dictionary-based data compression algorithm. It replaces recurring data fragments with pointers to old data to reduce the storage space of the data.
The sample code is as follows:
#include<iostream> #include<string> #include<vector> using namespace std; // 压缩数据 string compressData(string text) { string compressedData; int i = 0; while (i < text.length()) { int len = 0; int offset = 0; for (int j = 0; j < i; j++) { int k = 0; while (i + k < text.length() && text[j + k] == text[i + k]) { k++; } if (k > len) { len = k; offset = i - j; } } if (len > 0) { compressedData += "(" + to_string(offset) + "," + to_string(len) + ")"; i += len; } else { compressedData += text[i]; i++; } } return compressedData; } int main() { string text = "ababaabababbbb"; string compressedData = compressData(text); cout << "Compressed Data: " << compressedData << endl; return 0; }
2. Data decompression algorithm
The data decompression algorithm is used to restore compressed data. In C, we can use the corresponding decompression algorithm to restore the data.
2.1 Huffman decompression
The sample code is as follows:
#include<iostream> #include<string> #include<unordered_map> using namespace std; // 解压缩数据 string decompressData(string compressedData, unordered_map<string, char>& codeTable) { string decompressedData; string code; for (char ch : compressedData) { code += ch; if (codeTable.count(code) > 0) { decompressedData += codeTable[code]; code = ""; } } return decompressedData; } int main() { string compressedData = "010101001111011001"; unordered_map<string, char> codeTable = { {"0", 'a'}, {"10", 'b'}, {"110", 'c'}, {"1110", 'd'}, {"1111", 'e'} }; string decompressedData = decompressData(compressedData, codeTable); cout << "Decompressed Data: " << decompressedData << endl; return 0; }
2.2 LZ77 decompression
The sample code is as follows:
#include<iostream> #include<string> #include<vector> using namespace std; // 解压缩数据 string decompressData(string compressedData) { string decompressedData; int i = 0; while (i < compressedData.length()) { if (compressedData[i] == '(') { int j = i + 1; while (compressedData[j] != ',') { j++; } int offset = stoi(compressedData.substr(i + 1, j - i - 1)); int k = j + 1; while (compressedData[k] != ')') { k++; } int len = stoi(compressedData.substr(j + 1, k - j - 1)); for (int l = 0; l < len; l++) { decompressedData += decompressedData[decompressedData.length() - offset]; } i = k + 1; } else { decompressedData += compressedData[i]; i++; } } return decompressedData; } int main() { string compressedData = "a(1,1)ab(3,3)b(9,2)"; string decompressedData = decompressData(compressedData); cout << "Decompressed Data: " << decompressedData << endl; return 0; }
Conclusion:
This article introduces how to use C to implement data compression and decompression algorithms. With Huffman coding and LZ77 algorithm, we are able to compress and decompress data efficiently. Readers can choose the algorithm that suits them according to their needs, and practice and optimize based on the sample code.
The above is the detailed content of How to implement data compression and decompression algorithms in C++?. For more information, please follow other related articles on the PHP Chinese website!

Hot AI Tools

Undresser.AI Undress
AI-powered app for creating realistic nude photos

AI Clothes Remover
Online AI tool for removing clothes from photos.

Undress AI Tool
Undress images for free

Clothoff.io
AI clothes remover

Video Face Swap
Swap faces in any video effortlessly with our completely free AI face swap tool!

Hot Article

Hot Tools

Notepad++7.3.1
Easy-to-use and free code editor

SublimeText3 Chinese version
Chinese version, very easy to use

Zend Studio 13.0.1
Powerful PHP integrated development environment

Dreamweaver CS6
Visual web development tools

SublimeText3 Mac version
God-level code editing software (SublimeText3)

Hot Topics



How to implement robot control and robot navigation in C++? Robot control and navigation are very important parts of robotics technology. In the C++ programming language, we can use various libraries and frameworks to implement robot control and navigation. This article will introduce how to use C++ to write code examples for controlling robots and implementing navigation functions. 1. Robot control In C++, we can use serial communication or network communication to realize robot control. The following is a sample code that uses serial communication to control robot movement: inclu

In C++ development, null pointer exception is a common error, which often occurs when the pointer is not initialized or is continued to be used after being released. Null pointer exceptions not only cause program crashes, but may also cause security vulnerabilities, so special attention is required. This article will explain how to avoid null pointer exceptions in C++ code. Initializing pointer variables Pointers in C++ must be initialized before use. If not initialized, the pointer will point to a random memory address, which may cause a Null Pointer Exception. To initialize a pointer, point it to an

How to write a simple file encryption program in C++? Introduction: With the development of the Internet and the popularity of smart devices, the importance of protecting personal data and sensitive information has become increasingly important. In order to ensure the security of files, it is often necessary to encrypt them. This article will introduce how to use C++ to write a simple file encryption program to protect your files from unauthorized access. Requirements analysis: Before starting to write a file encryption program, we need to clarify the basic functions and requirements of the program. In this simple program we will use symmetry

How to write a simple music recommendation system in C++? Introduction: Music recommendation system is a research hotspot in modern information technology. It can recommend songs to users based on their music preferences and behavioral habits. This article will introduce how to use C++ to write a simple music recommendation system. 1. Collect user data First, we need to collect user music preference data. Users' preferences for different types of music can be obtained through online surveys, questionnaires, etc. Save data in a text file or database

How to use the Fibonacci sequence algorithm in C++ The Fibonacci sequence is a very classic sequence, and its definition is that each number is the sum of the previous two numbers. In computer science, using the C++ programming language to implement the Fibonacci sequence algorithm is a basic and important skill. This article will introduce how to use C++ to write the Fibonacci sequence algorithm and provide specific code examples. 1. Recursive method Recursion is a common method of Fibonacci sequence algorithm. In C++, the Fibonacci sequence algorithm can be implemented concisely using recursion. under

Research summary of methods to solve data compression problems encountered in MongoDB technology development: As the amount of data continues to grow and application scenarios continue to expand, the efficiency of data storage and transmission becomes increasingly important. Especially for non-relational databases such as MongoDB, how to effectively compress data to reduce storage and transmission costs has become a challenging task. This article aims to study methods to solve data compression problems encountered in MongoDB technology development and provide specific code examples. Introduction With data storage and processing

Efficiently utilize C++ programming skills to build robust embedded system functions. With the continuous development of technology, embedded systems play an increasingly important role in our lives. As a high-level programming language, C++ is flexible and scalable and is widely used in embedded system development. In this article, we will introduce some C++ programming techniques to help developers efficiently use C++ to build robust embedded system functions. 1. Use object-oriented design Object-oriented design is one of the core features of the C++ language. In the embedded system

How to write a simple image recognition program using C++? In the development of modern science and technology, image recognition technology plays an increasingly important role. Whether it is face recognition, object detection or autonomous driving, image recognition plays a key role. This article will introduce how to use C++ to write a simple image recognition program to help readers understand the basic principles and implementation process of image recognition. First, we need to install and configure OpenCV (open source computer vision library). OpenCV is a widely used computer vision library for
