Table of Contents
Cause of the problem
Solution
Method 1: GB18030 encoding conversion
Method 2: Use reflect
Method 3: Use the name attribute in fileheader and FileInfo
Home Backend Development Golang golang zip Chinese garbled code

golang zip Chinese garbled code

May 14, 2023 pm 10:35 PM

With the continuous popularity and application of Golang in the field of web development, Zip compressed files have become an essential functional module. However, some developers encountered a common problem when using Golang's Zip package for file compression - Chinese file names appear garbled.

This is a very troublesome problem, because it not only makes the originally beautiful file name look nondescript, but may also cause a series of other errors. Below we will explore the causes and solutions to this problem.

Cause of the problem

Zip format is a binary format that contains the file name, file directory, compression method and compressed data. Among this information, the file name is a very critical part, because it determines the name and storage path of the file after the user decompresses it.

However, different file systems and encoding formats may result in different parsing results for the same file name. For example, on Windows systems, the default encoding used for file names is GBK, while on UNIX/Linux systems, the file names use UTF-8. If we do not handle these different encoding methods when using the Zip package in Golang for compression, the file name may be treated as garbled characters.

Solution

We have a variety of solutions for the problem of garbled Chinese file names. Below we will introduce some of the more feasible methods.

Method 1: GB18030 encoding conversion

Because the Zip format uses CP437 encoding to represent the file name, we can convert the file name from GB18030 encoding to CP437 encoding, so as to ensure that File names can be correctly parsed in any environment. In Go, you can use the golang.org/x/text/encoding/simplifiedchinese package to convert between GB18030 encoding and CP437 encoding.

import (
    "golang.org/x/text/encoding/simplifiedchinese"
    "golang.org/x/text/transform"
)

func GbkToUtf8(data []byte) ([]byte, error) {
    return transform.NewReader(bytes.NewReader(data), simplifiedchinese.GB18030.NewDecoder()).ReadAll()
}

func Utf8ToGbk(data []byte) ([]byte, error) {
    return transform.NewReader(bytes.NewReader(data), simplifiedchinese.GB18030.NewEncoder()).ReadAll()
}
Copy after login

Method 2: Use reflect

We can also use the StructTag option in the reflect package to force the use of the specified encoding method. Specifically, you can add a zip tag to the structure and a chinese-utf8 tag to the Tag. The sample code is as follows:

type File struct {
    Name string `zip:"filename=测试文件,chinese-utf8"`
}

func main() {
    zhName := "测试文件"
    utf8Name, _ := GbkToUtf8([]byte(zhName))
    f := &File{Name: string(utf8Name)}
    // 压缩文件...
}
Copy after login

Method 3: Use the name attribute in fileheader and FileInfo

In the Golang Zip package, we can also call fileheader and ##name attribute of #FileInfo to manually specify the correct encoding for each file.

import "archive/zip"

func zipFiles(filePaths []string, dest string) error {
    // 创建文件
    newZipFile, err := os.Create(dest)
    if err != nil {
        return err
    }
    defer newZipFile.Close()

    // 创建 ZIP writer 对象
    zipWriter := zip.NewWriter(newZipFile)
    defer zipWriter.Close()

    // 遍历 filePaths,为每个文件设置正确的编码方式
    for _, filePath := range filePaths {
        zipFile, err := os.Open(filePath)
        if err != nil {
            return err
        }
        defer zipFile.Close()

        // 解析文件名,并转换编码
        zipFileInfo, _ := zip.FileInfoHeader(zipFile.Stat())
        zipFileInfo.Name, _ = GbkToUtf8([]byte(zipFileInfo.Name))

        // 创建 Zip 文件写入器
        zipWriterNewFile, err := zipWriter.CreateHeader(zipFileInfo)
        if err != nil {
            return err
        }

        // 读取文件并写入 Zip 文件中
        _, err = io.Copy(zipWriterNewFile, zipFile)
        if err != nil {
            return err
        }
    }

    return nil
}
Copy after login
Conclusion

Golang’s Zip package provides a variety of solutions to the problem of garbled Chinese file names. If the encoding format of the file name is not uniform, it is easy for the file name to be garbled. We can easily avoid this problem by simply choosing the right solution according to our needs.

The above is the detailed content of golang zip Chinese garbled code. For more information, please follow other related articles on the PHP Chinese website!

Statement of this Website
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn

Hot AI Tools

Undresser.AI Undress

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress AI Tool

Undress images for free

Clothoff.io

Clothoff.io

AI clothes remover

AI Hentai Generator

AI Hentai Generator

Generate AI Hentai for free.

Hot Article

Repo: How To Revive Teammates
1 months ago By 尊渡假赌尊渡假赌尊渡假赌
R.E.P.O. Energy Crystals Explained and What They Do (Yellow Crystal)
2 weeks ago By 尊渡假赌尊渡假赌尊渡假赌
Hello Kitty Island Adventure: How To Get Giant Seeds
1 months ago By 尊渡假赌尊渡假赌尊渡假赌

Hot Tools

Notepad++7.3.1

Notepad++7.3.1

Easy-to-use and free code editor

SublimeText3 Chinese version

SublimeText3 Chinese version

Chinese version, very easy to use

Zend Studio 13.0.1

Zend Studio 13.0.1

Powerful PHP integrated development environment

Dreamweaver CS6

Dreamweaver CS6

Visual web development tools

SublimeText3 Mac version

SublimeText3 Mac version

God-level code editing software (SublimeText3)

How do I write mock objects and stubs for testing in Go? How do I write mock objects and stubs for testing in Go? Mar 10, 2025 pm 05:38 PM

This article demonstrates creating mocks and stubs in Go for unit testing. It emphasizes using interfaces, provides examples of mock implementations, and discusses best practices like keeping mocks focused and using assertion libraries. The articl

How do you write unit tests in Go? How do you write unit tests in Go? Mar 21, 2025 pm 06:34 PM

The article discusses writing unit tests in Go, covering best practices, mocking techniques, and tools for efficient test management.

How do you use the pprof tool to analyze Go performance? How do you use the pprof tool to analyze Go performance? Mar 21, 2025 pm 06:37 PM

The article explains how to use the pprof tool for analyzing Go performance, including enabling profiling, collecting data, and identifying common bottlenecks like CPU and memory issues.Character count: 159

How can I define custom type constraints for generics in Go? How can I define custom type constraints for generics in Go? Mar 10, 2025 pm 03:20 PM

This article explores Go's custom type constraints for generics. It details how interfaces define minimum type requirements for generic functions, improving type safety and code reusability. The article also discusses limitations and best practices

How can I use tracing tools to understand the execution flow of my Go applications? How can I use tracing tools to understand the execution flow of my Go applications? Mar 10, 2025 pm 05:36 PM

This article explores using tracing tools to analyze Go application execution flow. It discusses manual and automatic instrumentation techniques, comparing tools like Jaeger, Zipkin, and OpenTelemetry, and highlighting effective data visualization

Explain the purpose of Go's reflect package. When would you use reflection? What are the performance implications? Explain the purpose of Go's reflect package. When would you use reflection? What are the performance implications? Mar 25, 2025 am 11:17 AM

The article discusses Go's reflect package, used for runtime manipulation of code, beneficial for serialization, generic programming, and more. It warns of performance costs like slower execution and higher memory use, advising judicious use and best

How do you specify dependencies in your go.mod file? How do you specify dependencies in your go.mod file? Mar 27, 2025 pm 07:14 PM

The article discusses managing Go module dependencies via go.mod, covering specification, updates, and conflict resolution. It emphasizes best practices like semantic versioning and regular updates.

How do you use table-driven tests in Go? How do you use table-driven tests in Go? Mar 21, 2025 pm 06:35 PM

The article discusses using table-driven tests in Go, a method that uses a table of test cases to test functions with multiple inputs and outcomes. It highlights benefits like improved readability, reduced duplication, scalability, consistency, and a

See all articles