How PHP Converts Chinese Characters Between Number Bases: An Analysis (PHP Tips)
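1. How Chinese characters are converted to decimal
In GBK encoding, one Chinese character is made up of two bytes. The bytes of a Chinese string can be read as follows: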
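// Assumes this file is saved as GBK, so that every Chinese character is two bytes
$string = "不要迷恋哥";
$length = strlen($string);
$result = array();
for ($i = 0; $i < $length; $i++) {
    // A byte value above 127 marks the first half of a Chinese character
    if (ord($string[$i]) > 127) {
        // Store the first byte and the byte that follows it as one pair
        $result[] = ord($string[$i]) . ' ' . ord($string[++$i]);
    }
}
var_dump($result);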

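Because a Chinese character consists of two bytes, a byte for which ord() returns a value greater than 127 can be taken as the first half of a Chinese character, and the second half must then be read as well. This test depends on the environment, however: if the data can contain single-byte characters whose value is above 127, the test gives wrong results.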
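The caveat above can be made concrete with a stricter test. This is a minimal sketch rather than the article's method: it assumes the commonly documented GBK ranges (lead byte 0x81 to 0xFE, trail byte 0x40 to 0xFE excluding 0x7F), and the function name `isGbkPair` is hypothetical.

```php
<?php
// Hypothetical helper: do the two bytes at $offset look like one GBK double-byte character?
// Assumed ranges: lead byte 0x81-0xFE, trail byte 0x40-0xFE except 0x7F.
function isGbkPair(string $bytes, int $offset): bool
{
    if ($offset + 1 >= strlen($bytes)) {
        return false; // there is no trail byte to read
    }
    $lead  = ord($bytes[$offset]);
    $trail = ord($bytes[$offset + 1]);
    return $lead >= 0x81 && $lead <= 0xFE
        && $trail >= 0x40 && $trail <= 0xFE && $trail !== 0x7F;
}

var_dump(isGbkPair("\xB2\xBB", 0)); // the GBK bytes of 不
var_dump(isGbkPair("\xE9", 0));     // a lone byte above 127 with nothing after it
```

The first call reports a valid pair and the second does not, whereas a plain `> 127` test would accept both bytes as the start of a Chinese character.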
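The principle behind converting Chinese characters to decimal in PHP is therefore to walk the string with a for loop, pick up the two bytes of each Chinese character, and convert each byte to its decimal value with ord(). For the string above the values are: 不 [178 187] 要 [210 170] 迷 [195 212] 恋 [193 181] 哥 [184 231]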
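The decimal values listed above only appear when the script file itself is saved as GBK. The sketch below avoids that dependency by writing the GBK bytes as escape sequences. It assumes the string contains only double-byte characters, and the variable names are illustrative.

```php
<?php
// GBK bytes of "不要迷恋哥", written as escapes so the file's own encoding does not matter.
$gbk = "\xB2\xBB\xD2\xAA\xC3\xD4\xC1\xB5\xB8\xE7";

// unpack('C*') returns every byte as an unsigned integer (its keys start at 1).
$bytes = array_values(unpack('C*', $gbk));

// Group the bytes two at a time: one pair per GBK Chinese character.
$pairs = array_map(
    function (array $pair): string {
        return implode(' ', $pair);
    },
    array_chunk($bytes, 2)
);

print_r($pairs);
```

The resulting pairs are "178 187", "210 170", "195 212", "193 181" and "184 231", matching the values given in the text.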
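2. How Chinese characters are converted to hexadecimal
The UltraEdit editor can display the hexadecimal bytes of Chinese characters directly, as shown in the figure below.
For example, view the hexadecimal form of the five characters “不要迷恋哥”: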

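The figure above shows that the hexadecimal values of the characters are: 不 B2BB 要 D2AA 迷 C3D4 恋 C1B5 哥 B8E7
To convert Chinese characters to hexadecimal in PHP, first use ord() to obtain the decimal value of each byte of each character (see [PHP functions: mastering ord() and chr()] for details), then use dechex() to convert each decimal value to hexadecimal.
Example source code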
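// Assumes this file is saved as GBK, so that every Chinese character is two bytes
$string = "不要迷恋哥";
$length = strlen($string);
echo $string;
$result = array();
// Decimal: collect the two byte values of each Chinese character
for ($i = 0; $i < $length; $i++) {
    if (ord($string[$i]) > 127) {
        $result[] = ord($string[$i]) . ' ' . ord($string[++$i]);
    }
}
var_dump($result);
// Hexadecimal: convert each decimal byte value with dechex()
$strings = array();
foreach ($result as $v) {
    $dec = explode(" ", $v);
    $strings[] = dechex((int) $dec[0]) . " " . dechex((int) $dec[1]);
}
var_dump($strings);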
The result is shown in the figure below.
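Note that dechex() returns lowercase digits without zero padding, so its output reads "b2 bb" rather than "B2BB". As an alternative sketch, not the article's method, the built-in bin2hex() can produce the UltraEdit-style form in one step. The GBK bytes are written as escapes here so that the example does not depend on the file's encoding.

```php
<?php
// GBK bytes of "不要迷恋哥".
$gbk = "\xB2\xBB\xD2\xAA\xC3\xD4\xC1\xB5\xB8\xE7";

// bin2hex() converts the whole byte string to lowercase hex digits;
// four hex digits correspond to one two-byte GBK character.
$hex = str_split(strtoupper(bin2hex($gbk)), 4);

print_r($hex);
```

This yields B2BB, D2AA, C3D4, C1B5 and B8E7, the same values listed for the five characters above.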
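3. How Chinese characters are converted to binary and octal
Converting Chinese characters to binary or octal works on the same principle as the hexadecimal conversion above; only the conversion function differs. Building on the example code above (which fills $result), the implementations are as follows.
Converting Chinese characters to binary: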
$strings = array();
foreach($result as $v){
$dec = explode(" ",$v);
$strings[] = decbin($dec[0])." ".decbin($dec[1]);
}
var_dump($strings);
The result is as follows:

Converting Chinese characters to octal:
$strings = array();
foreach($result as $v){
$dec = explode(" ",$v);
$strings[] = decoct($dec[0])." ".decoct($dec[1]);
}
var_dump($strings);
The result is as follows:
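For reference, here is a self-contained sketch of both conversions for the first two characters (不 and 要). It uses sprintf() in place of decbin() and decoct() so that each binary value is padded to eight digits. That substitution is an illustrative choice, not the article's code.

```php
<?php
// GBK bytes of "不要", written as escapes so the file's encoding does not matter.
$gbk = "\xB2\xBB\xD2\xAA";

$binary = array();
$octal  = array();
foreach (str_split($gbk, 2) as $char) {
    $hi = ord($char[0]); // first byte of the character
    $lo = ord($char[1]); // second byte of the character
    $binary[] = sprintf('%08b %08b', $hi, $lo); // zero-padded to 8 bits
    $octal[]  = sprintf('%o %o', $hi, $lo);
}

print_r($binary);
print_r($octal);
```

For 不 this gives "10110010 10111011" in binary and "262 273" in octal; for 要 it gives "11010010 10101010" and "322 252".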

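Once the base-conversion principle is understood, PHP's built-in urldecode() function can be used to assemble the hexadecimal strings (as %XX sequences) and turn them back into ordinary Chinese characters. The next installment of this series on Chinese character encoding covers the encoding principles behind urldecode() and urlencode().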
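A minimal sketch of that reverse step, assuming the hexadecimal values from section 2 as input. The variable names are illustrative, and the decoded result is a raw GBK byte string, so it only displays as Chinese where the output is interpreted as GBK.

```php
<?php
// Hex values of 不 要 迷 恋 哥 in GBK.
$hex = array('B2BB', 'D2AA', 'C3D4', 'C1B5', 'B8E7');

// Put "%" in front of every two hex digits to build a URL-encoded string.
$encoded = '';
foreach ($hex as $h) {
    $encoded .= '%' . implode('%', str_split($h, 2));
}

// urldecode() turns each %XX sequence back into the corresponding raw byte.
$gbk = urldecode($encoded);

echo $encoded, "\n";
var_dump($gbk === "\xB2\xBB\xD2\xAA\xC3\xD4\xC1\xB5\xB8\xE7");
```

On a UTF-8 page or terminal the bytes would still need converting, for example with iconv('GBK', 'UTF-8', $gbk) if the iconv extension is available.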

