Character is a general term for various characters and symbols, including characters of various countries, punctuation marks, graphic symbols, numbers, etc. Character set(Character set) is a collection of multiple characters. There are many types of character sets. Each character set contains a different number of characters. Common character set names: ASCII character set, GB2312 character set, BIG5 Character set, GB18030 character set, Unicode character set, etc. In order for a computer to accurately process text in various character sets, character encoding is required so that the computer can recognize and store various text. There are a large number of Chinese characters, and they are divided into two kinds of characters, Simplified Chinese and Traditional Chinese, with different writing rules. Computers were originally designed based on English single-byte characters. Therefore, encoding Chinese characters is the technical basis for Chinese information exchange. .
This article mainly introduces PHP to convert string from GBK to UTF8 character set through iconv. It has a very good reference value. Let’s take a look at it with the editor.
PHP converts strings from GBK to UTF8 character set through iconv.
1. Introduction to iconv()
iconvFunction can convert a known character set file into another known character set Character set file. For example: Convert from GB2312 to UTF-8.
iconv function is built in php5, and the GB character set is turned on by default.
2. iconv() error
iconv will make an error when converting the character "-" to gb2312. The solution is to add "/" after the encoding that needs to be converted. /IGNORE", that is, after the second parameter of the iconv function. As follows:
iconv("UTF-8", "GB2312//IGNORE", $data)
ignore means to ignore errors during conversion. Without the ignore parameter, all strings following this character cannot be saved.
3. iconv() example
<?php echo $str= ‘你好,这里是卖咖啡!'; echo '<br />'; echo iconv('GB2312', 'UTF-8', $str); //将字符串的编码从GB2312转到UTF-8 echo '<br />'; echo iconv_substr($str, 1, 1, 'UTF-8'); //按字符个数截取而非字节 print_r(iconv_get_encoding()); //得到当前页面编码信息 echo iconv_strlen($str, 'UTF-8'); //得到设定编码的字符串长度 ?>
The above is the detailed content of How to convert character set in php. For more information, please follow other related articles on the PHP Chinese website!