Home Backend Development PHP Tutorial PHP中实现中文字符进制转换原理分析_php技巧

PHP中实现中文字符进制转换原理分析_php技巧

May 17, 2016 am 09:14 AM
Chinese characters Base conversion

一,中文字符转十进制原理分析

GBK编码中一个汉字由二个字符组成,获取汉字字符串的方法如下
复制代码 代码如下:

$string = "不要迷恋哥";
$length = strlen($string);
for($i=0;$iif(ord($string[$i])>127){
$result[] = ord($string[$i]).' '.ord($string[++$i]);
}
}
var_dump($result);

php-ord-dec-number
由于一个汉字为两个字符组成,通过ord()函数获取字符的ASCII值如果大于127时,就可以确定当前字符为一个汉字的前半部分,还需要获取汉字的后半部分。当然,这种判断的方法要结合具体的开发环境,如果存在ASCII值大于127的单个字符,这种方法判断显然就不正确。

PHP实现中文字符转十进制的原理就是通过for循环的方法获取一个汉字的二个字符,然后使用ord()函数把各字符转换为十进制。如上分别是:不 [178 187] 要 [210 170] 迷 [195 212] 恋 [193 181] 哥 [184 231]

二,中文字符转十六进制原理分析

使用UltraEdit开发工具可以直接查看中文字符的十六进制,如下图

如,查看“不要迷恋哥”这五个字的十六进制
ultraedit-16-hex
从上面的图可以知道各个汉字对应该的十六进制字符分别是:不 B2BB 要 D2AA 迷 C3D4 恋 C1B5 哥 B8E7

PHP 实现中文字符转十六进制的原理就是首先使用ord()函数取出各个中文字符的十进制,具体可查看[PHP函数篇掌握ord()与chr()函数应用],然后使用dechex()函数把各个中文字符转化为十六进制

实例源代码
复制代码 代码如下:

$string = "不要迷恋哥";
$length = strlen($string);
echo $string;
$result = array();
//十进制
for($i=0;$iif(ord($string[$i])>127){
$result[] = ord($string[$i]).' '.ord($string[++$i]);
}
}
var_dump($result);
//十六进制
$strings = array();
foreach($result as $v){
$dec = explode(" ",$v);
$strings[] = dechex($dec[0])." ".dechex($dec[1]);
}
var_dump($strings);

结果如下图

php-dec-hex-string

通过上面的方法实现把中文字符转换为十六进制,输出结果可对比使用UltraEdit开发工具获取的十六进制。

三,中文字符转二进制和八进制原理分析

实现中文字符转二进制和八进制与上面的十六进制转换原理一样,只是转换的函数不同,结合上面的实例代码,实现如下

中文字符转二进制,方法如下
复制代码 代码如下:

$strings = array();
foreach($result as $v){
$dec = explode(" ",$v);
$strings[] = decbin($dec[0])." ".decbin($dec[1]);
}
var_dump($strings);

结果如下:
php-dec-bin-string
中文字符转八进制,方法如下
复制代码 代码如下:

$strings = array();
foreach($result as $v){
$dec = explode(" ",$v);
$strings[] = decoct($dec[0])." ".decoct($dec[1]);
}

结果如下:
php-string-hex
了解PHP实现中文字符进制转换原理,再通过PHP内置函数urldecode()就可以把十六进制的字符串通过组合转换为正常的中文汉字,请关注下一期中文字符编码研究系列之urldecode()与urlencode()函数字符编码原理。
Statement of this Website
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn

Hot AI Tools

Undresser.AI Undress

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress AI Tool

Undress images for free

Clothoff.io

Clothoff.io

AI clothes remover

AI Hentai Generator

AI Hentai Generator

Generate AI Hentai for free.

Hot Article

Repo: How To Revive Teammates
1 months ago By 尊渡假赌尊渡假赌尊渡假赌
R.E.P.O. Energy Crystals Explained and What They Do (Yellow Crystal)
2 weeks ago By 尊渡假赌尊渡假赌尊渡假赌
Hello Kitty Island Adventure: How To Get Giant Seeds
1 months ago By 尊渡假赌尊渡假赌尊渡假赌

Hot Tools

Notepad++7.3.1

Notepad++7.3.1

Easy-to-use and free code editor

SublimeText3 Chinese version

SublimeText3 Chinese version

Chinese version, very easy to use

Zend Studio 13.0.1

Zend Studio 13.0.1

Powerful PHP integrated development environment

Dreamweaver CS6

Dreamweaver CS6

Visual web development tools

SublimeText3 Mac version

SublimeText3 Mac version

God-level code editing software (SublimeText3)

How to use the CONV function in MySQL to convert a numerical value to a different base How to use the CONV function in MySQL to convert a numerical value to a different base Jul 12, 2023 pm 02:33 PM

How to use the CONV function in MySQL to convert a value to a different base Introduction: In the database, it is often necessary to convert a value between different bases. MySQL provides a very convenient function CONV, which can quickly realize hexadecimal conversion of numerical values. This article details how to use the CONV function and provides some code examples. 1. Overview of CONV function The CONV function is a mathematical function provided by MySQL, which is used to convert a value from one base to another. its

How to verify whether the input is all Chinese characters in golang How to verify whether the input is all Chinese characters in golang Jun 24, 2023 am 09:16 AM

With the development of the times, we pay more and more attention to the verification of data, especially the verification of user input. For language verification, how to accurately determine whether the input is all Chinese characters has become an important issue. In golang, we can use the unicode package and regexp package to achieve this requirement. 1. Unicode package The unicode package provides a series of core support for Unicode. We can use the functions in this package to accurately determine whether a character is a Chinese character.

How to sort Chinese characters in C language environment? How to sort Chinese characters in C language environment? Feb 18, 2024 pm 02:10 PM

How to implement Chinese character sorting function in C language programming software? In modern society, the Chinese character sorting function is one of the essential functions in many software. Whether in word processing software, search engines or database systems, Chinese characters need to be sorted to better display and process Chinese text data. In C language programming, how to implement the Chinese character sorting function? One method is briefly introduced below. First of all, in order to implement the Chinese character sorting function in C language, we need to use the string comparison function. Ran

How to deal with the pinyin sorting problem of Chinese characters in PHP? How to deal with the pinyin sorting problem of Chinese characters in PHP? Sep 05, 2023 pm 05:00 PM

How to deal with the pinyin sorting problem of Chinese characters in PHP? When developing Chinese websites or applications, we often face the need to sort Chinese strings according to pinyin. However, due to the complexity of Chinese characters, directly using conventional sorting algorithms will lead to errors in sorting results. Therefore, we need to use a special method to deal with the pinyin sorting problem of Chinese characters. In PHP, a common solution is to use a pinyin library such as "Overtrue/Pinyin". This is a pinyin based on PHP

How to use regular expressions to match Chinese characters in PHP How to use regular expressions to match Chinese characters in PHP Jun 22, 2023 am 09:16 AM

In PHP, regular expression is a commonly used string matching tool. It can be used to determine whether a string conforms to a specific format, thereby verifying the validity of the input value. When processing Chinese characters, since Chinese characters and English characters are encoded differently, the matching rules of the regular expression need to be adjusted accordingly. This article will introduce how to use regular expressions to match Chinese characters in PHP. 1. Understand Chinese character encoding. Commonly used character encodings in PHP are UTF-8 and G.

How to get the first letter of Chinese characters in PHP? How to get the first letter of Chinese characters in PHP? Sep 06, 2023 am 11:18 AM

How to get the first letter of Chinese characters in PHP? When processing Chinese characters, sometimes we need to get the first letter of the Chinese character. PHP provides some built-in functions and extension packages to achieve this functionality. A common way is to use the mb_substr() function in combination with the ord() function. The mb_substr() function is used to obtain the substring of a string, and the ord() function is used to obtain the ASCII code value of a character. We can get the first letter of the Chinese character by getting the first character and its ASCII code value.

PHP regular expression practice: matching Chinese characters PHP regular expression practice: matching Chinese characters Jun 22, 2023 pm 08:34 PM

In the process of developing projects using PHP, we often encounter the need to process Chinese characters. Regular expressions are a powerful text processing tool that can help us match and process Chinese characters quickly and accurately. In this article, I will introduce related techniques and examples on how to use PHP regular expressions to match Chinese characters. Matching Chinese Characters First, we need to understand how Chinese characters are represented in the computer. Normally, Chinese characters are represented using Unicode encoding. In Unicod

How to convert decimal to binary in c language How to convert decimal to binary in c language Nov 24, 2022 am 10:27 AM

How to convert decimal to binary: Keep dividing the decimal number by 2 until the quotient is zero, and then write the remainder from bottom to top; the conversion code "int main(void){int n,len;int a[20] ;scanf("%d",&n);while(n/2){a[len++]=n%2;n=n/2;}a[len++]=n%2;for(i=len-1 ;i>=0;i--){printf("%d",a[i]);}}".

See all articles