How to Remove Invalid UTF-8 Characters from Strings in Go?
Dec 16, 2024 pm 09:02 PMRemoving Invalid UTF-8 Characters from Strings in Go
When attempting to marshal a list of strings using json.Marshal, it's possible to encounter an error indicating the presence of invalid UTF-8 characters. This article addresses this issue and provides solutions for removing or replacing such characters in Go.
In Python, the unicode module offers methods like unicode.replace and unicode.strict to handle invalid characters. However, Go does not have direct equivalents. Instead, it relies on a different approach:
Using strings.ToValidUTF8 in Go 1.13
To remove invalid UTF-8 characters from a string, you can use the strings.ToValidUTF8 function introduced in Go 1.13. It takes two parameters: the input string and a replacement character to use for invalid bytes. If the replacement character is an empty string, invalid bytes will be silently removed:
invalidString := "a\xc5z" validString := strings.ToValidUTF8(invalidString, "") // validString will now be "az"
Using strings.Map and utf8.RuneError in Go 1.11
An alternative solution is to use strings.Map along with utf8.RuneError. strings.Map applies a function to each rune in a string, while utf8.RuneError represents an invalid UTF-8 character. Here's an example:
invalidString := "a\xc5z" fixUtf := func(r rune) rune { if r == utf8.RuneError { return -1 // Replace invalid characters with -1 } return r } validString := strings.Map(fixUtf, invalidString) fmt.Println(validString) // Output: "az"
The above is the detailed content of How to Remove Invalid UTF-8 Characters from Strings in Go?. For more information, please follow other related articles on the PHP Chinese website!

Hot Article

Hot tools Tags

Hot Article

Hot Article Tags

Notepad++7.3.1
Easy-to-use and free code editor

SublimeText3 Chinese version
Chinese version, very easy to use

Zend Studio 13.0.1
Powerful PHP integrated development environment

Dreamweaver CS6
Visual web development tools

SublimeText3 Mac version
God-level code editing software (SublimeText3)

Hot Topics

Go language pack import: What is the difference between underscore and without underscore?

How to implement short-term information transfer between pages in the Beego framework?

How do I write mock objects and stubs for testing in Go?

How to convert MySQL query result List into a custom structure slice in Go language?

How can I define custom type constraints for generics in Go?

How can I use tracing tools to understand the execution flow of my Go applications?

How to write files in Go language conveniently?
