


How to convert Chinese characters to Pinyin in PHP program_PHP tutorial
In order to do a pinyin keyword search in the database, I thought of generating pinyin of the product name as a keyword and storing it in the database, so I collected information online and came up with something like this
/**
* Chinese Pinyin conversion class
* @author julian.song
* Some code sources are from the Internet, and the copyright belongs to the original author
*/
class Pinyin{
Public function __construct()
{
}
/**
* Convert string to Pinyin
* @param $string ASCII code
* @param $charset encoding
*/
Public function getPinyin($string, $charset='gb2312')
{
$dataKey = "a|ai|an|ang|ao|ba|bai|ban|bang|bao|bei|ben|beng|bi|bian|biao|bie|bin|bing|bo|bu|ca|cai| can|cang|cao|ce|ceng|cha".
"|chai|chan|chang|chao|che|chen|cheng|chi|chong|chou|chu|chuai|chuan|chuang|chui|chun|chuo|ci|cong|cou|cu|".
"cuan|cui|cun|cuo|da|dai|dan|dang|dao|de|deng|di|dian|diao|die|ding|diu|dong|dou|du|duan|dui|dun|duo|e |en|er".
"|fa|fan|fang|fei|fen|feng|fo|fou|fu|ga|gai|gan|gang|gao|ge|gei|gen|geng|gong|gou|gu|gua|guai|guan| guang|gui".
"|gun|guo|ha|hai|han|hang|hao|he|hei|hen|heng|hong|hou|hu|hua|huai|huan|huang|hui|hun|huo|ji|jia|jian| jiang".
"|jiao|jie|jin|jing|jiong|jiu|ju|juan|jue|jun|ka|kai|kan|kang|kao|ke|ken|keng|kong|kou|ku|kua|kuai|kuan| kuang".
"|kui|kun|kuo|la|lai|lan|lang|lao|le|lei|leng|li|lia|lian|liang|liao|lie|lin|ling|liu|long|lou|lu|lv| luan|lue".
"|lun|luo|ma|mai|man|mang|mao|me|mei|men|meng|mi|mian|miao|mie|min|ming|miu|mo|mou|mu|na|nai|nan| nang|nao|ne".
"|nei|nen|neng|ni|nian|niang|niao|nie|nin|ning|niu|nong|nu|nv|nuan|nue|nuo|o|ou|pa|pai|pan|pang|pao| pei|pen".
"|peng|pi|pian|piao|pie|pin|ping|po|pu|qi|qia|qian|qiang|qiao|qie|qin|qing|qiong|qiu|qu|quan|que|qun|ran| rang".
"|rao|re|ren|reng|ri|rong|rou|ru|ruan|rui|run|ruo|sa|sai|san|sang|sao|se|sen|seng|sha|shai|shan|shang| shao|".
"she|shen|sheng|shi|shou|shu|shua|shuai|shuan|shuang|shui|shun|shuo|si|song|sou|su|suan|sui|sun|suo|ta|tai|".
"tan|tang|tao|te|teng|ti|tian|tiao|tie|ting|tong|tou|tu|tuan|tui|tun|tuo|wa|wai|wan|wang|wei|wen|weng|wo |wu".
"|xi|xia|xian|xiang|xiao|xie|xin|xing|xiong|xiu|xu|xuan|xue|xun|ya|yan|yang|yao|ye|yi|yin|ying|yo|yong| you".
"|yu|yuan|yue|yun|za|zai|zan|zang|zao|ze|zei|zen|zeng|zha|zhai|zhan|zhang|zhao|zhe|zhen|zheng|zhi|zhong|".
"zhou|zhu|zhua|zhuai|zhuan|zhuang|zhui|zhun|zhuo|zi|zong|zou|zu|zuan|zui|zun|zuo";
$dataValue = "-20319|-20317|-20304|-20295|-20292|-20283|-20265|-20257|-20242|-20230|-20051|-20036|-20032|-20026|-20002|- 19990".
"|-19986|-19982|-19976|-19805|-19784|-19775|-19774|-19763|-19756|-19751|-19746|-19741|-19739|-19728|-19725".
"|-19715|-19540|-19531|-19525|-19515|-19500|-19484|-19479|-19467|-19289|-19288|-19281|-19275|-19270|-19263".
"|-19261|-19249|-19243|-19242|-19238|-19235|-19227|-19224|-19218|-19212|-19038|-19023|-19018|-19006|-19003".
"|-18996|-18977|-18961|-18952|-18783|-18774|-18773|-18763|-18756|-18741|-18735|-18731|-18722|-18710|-18697".
"|-18696|-18526|-18518|-18501|-18490|-18478|-18463|-18448|-18447|-18446|-18239|-18237|-18231|-18220|-18211".
"|-18201|-18184|-18183|-18181|-18012|-17997|-17988|-17970|-17964|-17961|-17950|-17947|-17931|-17928|-17922".
"|-17759|-17752|-17733|-17730|-17721|-17703|-17701|-17697|-17692|-17683|-17676|-17496|-17487|-17482|-17468".
"|-17454|-17433|-17427|-17417|-17202|-17185|-16983|-16970|-16942|-16915|-16733|-16708|-16706|-16689|-16664".
"|-16657|-16647|-16474|-16470|-16465|-16459|-16452|-16448|-16433|-16429|-16427|-16423|-16419|-16412|-16407".
"|-16403|-16401|-16393|-16220|-16216|-16212|-16205|-16202|-16187|-16180|-16171|-16169|-16158|-16155|-15959".
"|-15958|-15944|-15933|-15920|-15915|-15903|-15889|-15878|-15707|-15701|-15681|-15667|-15661|-15659|-15652".
"|-15640|-15631|-15625|-15454|-15448|-15436|-15435|-15419|-15416|-15408|-15394|-15385|-15377|-15375|-15369".
"|-15363|-15362|-15183|-15180|-15165|-15158|-15153|-15150|-15149|-15144|-15143|-15141|-15140|-15139|-15128".
"|-15121|-15119|-15117|-15110|-15109|-14941|-14937|-14933|-14930|-14929|-14928|-14926|-14922|-14921|-14914".
"|-14908|-14902|-14894|-14889|-14882|-14873|-14871|-14857|-14678|-14674|-14670|-14668|-14663|-14654|-14645".
"|-14630|-14594|-14429|-14407|-14399|-14384|-14379|-14368|-14355|-14353|-14345|-14170|-14159|-14151|-14149".
"|-14145|-14140|-14137|-14135|-14125|-14123|-14122|-14112|-14109|-14099|-14097|-14094|-14092|-14090|-14087".
"|-14083|-13917|-13914|-13910|-13907|-13906|-13905|-13896|-13894|-13878|-13870|-13859|-13847|-13831|-13658".
"|-13611|-13601|-13406|-13404|-13400|-13398|-13395|-13391|-13387|-13383|-13367|-13359|-13356|-13343|-13340".
"|-13329|-13326|-13318|-13147|-13138|-13120|-13107|-13096|-13095|-13091|-13076|-13068|-13063|-13060|-12888".
"|-12875|-12871|-12860|-12858|-12852|-12849|-12838|-12831|-12829|-12812|-12802|-12607|-12597|-12594|-12585".
"|-12556|-12359|-12346|-12320|-12300|-12120|-12099|-12089|-12074|-12067|-12058|-12039|-11867|-11861|-11847".
"|-11831|-11798|-11781|-11604|-11589|-11536|-11358|-11340|-11339|-11324|-11303|-11097|-11077|-11067|-11055".
"|-11052|-11045|-11041|-11038|-11024|-11020|-11019|-11018|-11014|-10838|-10832|-10815|-10800|-10790|-10780".
"|-10764|-10587|-10544|-10533|-10519|-10331|-10329|-10328|-10322|-10315|-10309|-10307|-10296|-10281|-10274".
"|-10270|-10262|-10260|-10256|-10254";
$tDataKey = explode('|', $dataKey);
$tDataValue = explode('|', $dataValue);
$data = (PHP_VERSION>='5.0') ? array_combine($tDataKey, $tDataValue) : $this->arrayCombine($tDataKey, $tDataValue);
arsort($data);
reset($data);
if($charset != 'gb2312') $string = $this->utf82GB2312($string);
$res = '';
for($i=0; $i
{
$asc = ord(substr($string, $i, 1));
if($asc>160) { $q = ord(substr($string, ++$i, 1)); $asc = $asc*256 + $q - 65536; }
$res .= $this->asc2Pinyin($asc, $data);
}
return preg_replace("/[^a-z0-9]*/", '', $res);
}
/**
* Convert ASCII code to corresponding pinyin
* @param $asc ASCII code
* @param $data ASCII pinyin character comparison table
*/
public function asc2Pinyin($asc, $data)
{
if ($asc>0 && $asc<160 ) return chr($asc);
elseif($asc<-20319 || $asc>-10247) return '';
else {
foreach($data as $k=>$v){ if($v<=$asc) break; }
return $k;
}
}
/**
* Convert UFT8 character encoding to GB2312
* @param $char
*/
public function utf82GB2312($char)
{
$string = '';
if($char < 0x80) $string .= $char;
elseif($char < 0x800)
{
$string .= chr(0xC0 | $char>>6);
$string .= chr(0x80 | $char & 0x3F);
}elseif($char < 0x10000){
$string .= chr(0xE0 | $char>>12);
$string .= chr(0x80 | $char>>6 & 0x3F);
$string .= chr(0x80 | $char & 0x3F);
} elseif($char < 0x200000) {
$string .= chr(0xF0 | $char>>18);
$string .= chr(0x80 | $char>>12 & 0x3F);
$string .= chr(0x80 | $char>>6 & 0x3F);
$string .= chr(0x80 | $char & 0x3F);
}
return iconv('UTF-8', 'GB2312//IGNORE', $string);
}
/**
* Comparison array
* @param $arr1 array 1
* @param $arr2 array 2
*/
public function arrayCombine($arr1, $arr2)
{
for($i=0; $i
return $res;
}
/**
*Get the first letter of the pinyin of Chinese characters
* @param $str string
*/
public function getInitial($str,$charset) {
if($charset=="UTF-8")$str=$this->utf82GB2312($str);
$asc = ord(substr($str, 0, 1));
if ($asc < 160) {//非中文
if ($asc >= 48 && $asc <= 57) {
return chr($asc); //数字
} elseif ($asc >= 65 && $asc <= 90) {
return chr($asc); // A--Z
} elseif ($asc >= 97 && $asc <= 122) {
return chr($asc - 32); // a--z
} else {
return '-'; //其他
}
} else {//中文
$asc = $asc * 1000 + ord(substr($str, 1, 1));
//获取拼音首字母A--Z
if ($asc >= 176161 && $asc < 176197) {
return 'A';
} elseif ($asc >= 176197 && $asc < 178193) {
return 'B';
} elseif ($asc >= 178193 && $asc < 180238) {
return 'C';
} elseif ($asc >= 180238 && $asc < 182234) {
return 'D';
} elseif ($asc >= 182234 && $asc < 183162) {
return 'E';
} elseif ($asc >= 183162 && $asc < 184193) {
return 'F';
} elseif ($asc >= 184193 && $asc < 185254) {
return 'G';
} elseif ($asc >= 185254 && $asc < 187247) {
return 'H';
} elseif ($asc >= 187247 && $asc < 191166) {
return 'J';
} elseif ($asc >= 191166 && $asc < 192172) {
return 'K';
} elseif ($asc >= 192172 && $asc < 194232) {
return 'L';
} elseif ($asc >= 194232 && $asc < 196195) {
return 'M';
} elseif ($asc >= 196195 && $asc < 197182) {
return 'N';
} elseif ($asc >= 197182 && $asc < 197190) {
return 'O';
} elseif ($asc >= 197190 && $asc < 198218) {
return 'P';
} elseif ($asc >= 198218 && $asc < 200187) {
return 'Q';
} elseif ($asc >= 200187 && $asc < 200246) {
return 'R';
} elseif ($asc >= 200246 && $asc < 203250) {
return 'S';
} elseif ($asc >= 203250 && $asc < 205218) {
return 'T';
} elseif ($asc >= 205218 && $asc < 206244) {
return 'W';
} elseif ($asc >= 206244 && $asc < 209185) {
return 'X';
} elseif ($asc >= 209185 && $asc < 212209) {
return 'Y';
} elseif ($asc >= 212209) {
return 'Z';
} else {
return '-';
}
}
}
/**
* Split string into array
* @param $str string
* @param $charset encoding
*/
public function mbStringtoArray($str,$charset) {
$strlen=mb_strlen($str);
while($strlen){
$array[]=mb_substr($str,0,1,$charset);
$str=mb_substr($str,1,$strlen,$charset);
$strlen=mb_strlen($str);
}
return $array;
}
public function getInitials($str,$charset){
$initials="";
foreach($this->mbStringtoArray($str,$charset)as $char){
$initials.=$this->getInitial($char,$charset);
}
return $initials;
}
}
本文出自 “大白良” 博客

Hot AI Tools

Undresser.AI Undress
AI-powered app for creating realistic nude photos

AI Clothes Remover
Online AI tool for removing clothes from photos.

Undress AI Tool
Undress images for free

Clothoff.io
AI clothes remover

AI Hentai Generator
Generate AI Hentai for free.

Hot Article

Hot Tools

Notepad++7.3.1
Easy-to-use and free code editor

SublimeText3 Chinese version
Chinese version, very easy to use

Zend Studio 13.0.1
Powerful PHP integrated development environment

Dreamweaver CS6
Visual web development tools

SublimeText3 Mac version
God-level code editing software (SublimeText3)

Hot Topics

In this chapter, we will understand the Environment Variables, General Configuration, Database Configuration and Email Configuration in CakePHP.

PHP 8.4 brings several new features, security improvements, and performance improvements with healthy amounts of feature deprecations and removals. This guide explains how to install PHP 8.4 or upgrade to PHP 8.4 on Ubuntu, Debian, or their derivati

To work with date and time in cakephp4, we are going to make use of the available FrozenTime class.

To work on file upload we are going to use the form helper. Here, is an example for file upload.

In this chapter, we are going to learn the following topics related to routing ?

CakePHP is an open-source framework for PHP. It is intended to make developing, deploying and maintaining applications much easier. CakePHP is based on a MVC-like architecture that is both powerful and easy to grasp. Models, Views, and Controllers gu

Visual Studio Code, also known as VS Code, is a free source code editor — or integrated development environment (IDE) — available for all major operating systems. With a large collection of extensions for many programming languages, VS Code can be c

Validator can be created by adding the following two lines in the controller.
