Home Web Front-end JS Tutorial Why Does `atob()` Fail to Decode UTF-8 Strings in JavaScript?

Why Does `atob()` Fail to Decode UTF-8 Strings in JavaScript?

Nov 02, 2024 am 09:35 AM

Why Does `atob()` Fail to Decode UTF-8 Strings in JavaScript?

Using Javascript's atob to decode base64 doesn't properly decode utf-8 strings

The window.atob() function in JavaScript doesn't correctly decode UTF-8 strings when dealing with characters that occupy more than one byte, resulting in ASCII-encoded characters instead.

Unicode Problem

JavaScript strings are encoded in 16-bit units, and btoa() expects binary data as input. Characters that occupy more than one byte, such as special characters or foreign characters, are not considered binary data and will trigger an error when passed to btoa(). This issue is known as "The Unicode Problem."

Solution with Binary Interoperability

The recommended solution by MDN involves encoding to and decoding from a binary string representation. This preserves the binary nature of the data and eliminates the Unicode Problem. The encoding process involves converting the UTF-8 string into a binary string with Uint16Array and Uint8Array. Decoding involves converting the binary string back to a UTF-8 string.

Solution with ASCII Base64 Interoperability

Another solution is to convert the UTF-16 DOMString to an 8-bit integer array of characters using Uint8Array and then encode it using btoa(). This method maintains the UTF-8 functionality and produces plain text base64 strings that can be decoded on platforms that support UTF-8. Decoding involves converting the base64 string back to a UTF-8 string using atob() and decodeURIComponent().

Deprecated Solution

A previously used solution involved using escape() and unescape() functions, which have now been deprecated. While this method still works in modern browsers, it's not recommended for use.

Additionally, it's worth noting that when working with the GitHub API, you may need to strip whitespace from the base64 source before decoding to work correctly on Mobile Safari.

The above is the detailed content of Why Does `atob()` Fail to Decode UTF-8 Strings in JavaScript?. For more information, please follow other related articles on the PHP Chinese website!

Statement of this Website
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn

Hot Article Tags

Notepad++7.3.1

Notepad++7.3.1

Easy-to-use and free code editor

SublimeText3 Chinese version

SublimeText3 Chinese version

Chinese version, very easy to use

Zend Studio 13.0.1

Zend Studio 13.0.1

Powerful PHP integrated development environment

Dreamweaver CS6

Dreamweaver CS6

Visual web development tools

SublimeText3 Mac version

SublimeText3 Mac version

God-level code editing software (SublimeText3)

Replace String Characters in JavaScript Replace String Characters in JavaScript Mar 11, 2025 am 12:07 AM

Replace String Characters in JavaScript

Custom Google Search API Setup Tutorial Custom Google Search API Setup Tutorial Mar 04, 2025 am 01:06 AM

Custom Google Search API Setup Tutorial

Example Colors JSON File Example Colors JSON File Mar 03, 2025 am 12:35 AM

Example Colors JSON File

8 Stunning jQuery Page Layout Plugins 8 Stunning jQuery Page Layout Plugins Mar 06, 2025 am 12:48 AM

8 Stunning jQuery Page Layout Plugins

10 jQuery Syntax Highlighters 10 jQuery Syntax Highlighters Mar 02, 2025 am 12:32 AM

10 jQuery Syntax Highlighters

Build Your Own AJAX Web Applications Build Your Own AJAX Web Applications Mar 09, 2025 am 12:11 AM

Build Your Own AJAX Web Applications

What is 'this' in JavaScript? What is 'this' in JavaScript? Mar 04, 2025 am 01:15 AM

What is 'this' in JavaScript?

10  JavaScript & jQuery MVC Tutorials 10 JavaScript & jQuery MVC Tutorials Mar 02, 2025 am 01:16 AM

10 JavaScript & jQuery MVC Tutorials

See all articles