How to implement Huffman coding algorithm using Python?-Python Tutorial-php.cn

Home

Backend Development

Python Tutorial

How to implement Huffman coding algorithm using Python?

王林

Sep 20, 2023 am 10:49 AM

python implementation Huffman coding algorithm huffman tree

How to implement Huffman coding algorithm using Python?

How to use Python to implement the Huffman coding algorithm?

Abstract:
Huffman coding is a classic data compression algorithm that achieves efficient compression and storage of data by generating a unique code based on the frequency of character occurrences. This article will introduce how to use Python to implement the Huffman coding algorithm and provide specific code examples.

Understanding the idea of Huffman coding
The core idea of Huffman coding is to use slightly shorter codes for characters that appear more frequently, and to use slightly longer codes for characters that appear less frequently. encoding, thereby achieving a higher compression rate of the encoded data. Specifically, Huffman coding maps the frequency of characters and the corresponding character information one by one, and constructs a Huffman tree to represent the encoding of 0 and 1 according to the left and right branches of the tree node.
Building a Huffman tree
Before we start coding, we need to build a Huffman tree first. First, count the frequency of each character in the string and store the character and frequency information in a frequency dictionary. Then, build a Huffman tree based on the frequency dictionary. The specific steps are as follows:
Initialize a priority queue (minimum heap) for storing Huffman tree nodes
Convert each node in the frequency dictionary characters and frequency information are added to the priority queue as leaf nodes
Loop the following operations until there is only one node left in the queue:
- Select two frequencies from the queue The smallest node serves as the left and right child nodes, and generates a new node, the frequency is the sum of the frequencies of the left and right child nodes
- Add the new node to the queue
The remaining nodes in the queue The node below is the root node of the Huffman tree

The following is a code example:

import heapq
from collections import defaultdict


class Node:
    def __init__(self, frequency, value=None):
        self.frequency = frequency
        self.value = value
        self.left_child = None
        self.right_child = None

    def __lt__(self, other):
        return self.frequency < other.frequency


def build_huffman_tree(freq_dict):
    priority_queue = []

    for char, freq in freq_dict.items():
        heapq.heappush(priority_queue, Node(freq, char))

    while len(priority_queue) > 1:
        left_child = heapq.heappop(priority_queue)
        right_child = heapq.heappop(priority_queue)
        new_node = Node(left_child.frequency + right_child.frequency)
        new_node.left_child = left_child
        new_node.right_child = right_child
        heapq.heappush(priority_queue, new_node)

    return heapq.heappop(priority_queue)

Copy after login

Generate Huffman coding table
After constructing the Huffman After the tree, we can generate the corresponding Huffman coding table based on the Huffman tree. The Huffman coding table maps each character to its corresponding code. The specific steps are as follows:
Traverse the Huffman tree, starting from the root node, the left branch on the path is marked as 0, the right branch is marked as 1, record the path and encoding of each leaf node
Store the path and encoding information in the encoding dictionary

The following is a code example:

def generate_huffman_codes(huffman_tree):
    code_dict = {}

    def traverse(node, current_code=''):
        if node.value:
            code_dict[node.value] = current_code
        else:
            traverse(node.left_child, current_code + '0')
            traverse(node.right_child, current_code + '1')

    traverse(huffman_tree)
    return code_dict

Copy after login

Compress and decompress data
After having the Huffman coding table , we can compress the original data, replace each character of the original data with the corresponding Huffman code, and store the encoded binary data in the file. When decompressing the data, we need to restore the encoded binary data to the original data according to the Huffman coding table.

The following is a code example for compressing and decompressing data:

def compress_data(data, code_dict):
    compressed_data = ''
    for char in data:
        compressed_data += code_dict[char]
    return compressed_data


def decompress_data(compressed_data, huffman_tree):
    decompressed_data = ''
    current_node = huffman_tree
    for bit in compressed_data:
        if bit == '0':
            current_node = current_node.left_child
        else:
            current_node = current_node.right_child

        if current_node.value:
            decompressed_data += current_node.value
            current_node = huffman_tree

    return decompressed_data

Copy after login

Summary:
This article introduces how to use Python to implement the Huffman coding algorithm. The main steps include building Huffman trees, generating Huffman coding tables, and compressing and decompressing data. We hope that the introduction and code examples in this article can help readers better understand and apply the Huffman coding algorithm.

The above is the detailed content of How to implement Huffman coding algorithm using Python?. For more information, please follow other related articles on the PHP Chinese website!

Statement of this Website

The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn

Hot AI Tools

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress images for free

Clothoff.io

AI clothes remover

AI Hentai Generator

Generate AI Hentai for free.

Hot Article

R.E.P.O. Energy Crystals Explained and What They Do (Yellow Crystal)

4 weeks ago By 尊渡假赌尊渡假赌尊渡假赌

R.E.P.O. Best Graphic Settings

4 weeks ago By 尊渡假赌尊渡假赌尊渡假赌

Assassin's Creed Shadows: Seashell Riddle Solution

2 weeks ago By DDD

R.E.P.O. How to Fix Audio if You Can't Hear Anyone

4 weeks ago By 尊渡假赌尊渡假赌尊渡假赌

WWE 2K25: How To Unlock Everything In MyRise

1 months ago By 尊渡假赌尊渡假赌尊渡假赌

Hot Tools

Notepad++7.3.1

Easy-to-use and free code editor

SublimeText3 Chinese version

Chinese version, very easy to use

Zend Studio 13.0.1

Powerful PHP integrated development environment

Dreamweaver CS6

Visual web development tools

SublimeText3 Mac version

God-level code editing software (SublimeText3)

Hot Topics

Where is the login entrance for gmail email?

7514

CakePHP Tutorial

1378

What is the format of the account name of steam

win11 activation key permanent

nyt connections hints and answers

Related knowledge

How to implement Huffman coding algorithm using Python? Sep 20, 2023 am 10:49 AM

How to implement Huffman coding algorithm using Python? Abstract: Huffman coding is a classic data compression algorithm that generates unique codes based on the frequency of character occurrences, thereby achieving efficient compression and storage of data. This article will introduce how to use Python to implement the Huffman coding algorithm and provide specific code examples. Understand the idea of Huffman coding. The core idea of Huffman coding is to use slightly shorter codes for characters that appear more frequently, and to use slightly longer codes for characters that appear less frequently, so as to achieve coding.

How to implement the offline map download function in Baidu Map API in Python Jul 29, 2023 pm 02:34 PM

Python method to implement the offline map download function in Baidu Map API With the rapid development of mobile Internet, the demand for offline map download function is becoming more and more urgent. The offline map download function allows users to still use map navigation and other functions without an Internet connection, giving users a better user experience. This article will introduce how to use Python to implement the offline map download function in Baidu Map API. Baidu Map API provides a complete set of open interfaces, including offline map download functions. In use

Use Python to implement Baidu AI interface docking to make your program smarter and more powerful Aug 13, 2023 am 09:29 AM

Use Python to implement Baidu AI interface docking to make your program smarter and more powerful. With the continuous development of artificial intelligence technology, more and more developers are beginning to implement intelligent functions to improve the intelligence of their programs. The Baidu AI interface is a powerful tool that can help us implement multiple intelligent functions such as speech recognition, image recognition, and natural language processing. This article will show you how to use Python to connect to Baidu AI interface to make your program smarter and more powerful. First, we need to go to Baidu AI Open Platform (h

Python implements methods and case sharing for automated testing of web pages using headless browser acquisition applications Aug 08, 2023 am 08:29 AM

Python implements methods and case sharing for automated testing of web pages using headless browser collection applications Overview: In today's Internet era, automated web page testing has become one of the important means to improve software quality and efficiency. As a high-level programming language, Python has a wealth of third-party libraries and tools, making it easy and fast to use Python for automated testing of web pages. This article will introduce how to use a headless browser to collect applications and implement automated testing of web pages, and provide relevant code examples. 1. What is headless browsing?

Python implements page simulation click and scroll function analysis for headless browser collection applications Aug 09, 2023 pm 05:13 PM

Python implements page simulation click and scroll function analysis for headless browser collection applications. When collecting network data, it is often necessary to simulate user operations, such as clicking buttons, drop-down scrolling, etc. A common way to achieve these operations is to use a headless browser. A headless browser is actually a browser without a user interface that simulates user operations through programming. The Python language provides many libraries to implement headless browser operations, the most commonly used of which is the selenium library. selen

Use Python to write a program to draw the appearance of ice cubes Jan 13, 2024 am 08:49 AM

Using Python to realize the drawing effect of Bingdundun Bingdundun, as the mascot of the 2022 Beijing Winter Olympics, is not only active in the competition venues, but also has won the love of many netizens on the Internet. If you want to use code to achieve the drawing effect of ice in Python, let’s take a look at the specific code examples below! First, we need to introduce the turtle library in Python to implement the drawing function. If this library is not installed on your computer, you can install it through pip. The command is as follows: pipin

Optimization strategies for Python implementation of Linux script operations Oct 05, 2023 am 11:57 AM

Summary of optimization strategies for Python implementation of Linux script operations: With the widespread use of Linux operating systems, using scripts to automate operations has become a common way. In this article, we will discuss how to use Python to optimize Linux script operations to improve efficiency and maintainability. Specifically, we will focus on the following aspects: using appropriate modules and libraries, using multi-threading and multi-processing, using databases for data storage and management, etc. 1. Use appropriate modules and libraries Py

How to implement Kruskal's algorithm using Python? Sep 19, 2023 pm 03:30 PM

How to implement Kruskal's algorithm using Python? Introduction: Kruskal's algorithm is a classic algorithm for solving the minimum spanning tree, which can find the spanning tree with the minimum total weight in a given weighted connected graph. This article will introduce how to implement Kruskal's algorithm using Python and provide detailed code examples. Algorithm introduction: The basic idea of Kruskal's algorithm is to sort all the edges in the connected graph according to their weights, and then select the edges from small to large. If the current edge selected does not form a cycle, add it to the minimum spanning tree.

See all articles