1. Download the libiconv function library http://ftp.gnu.org/pub/gnu/libiconv/libiconv-1.9.2.tar.gz;
2. Unzip tar -zxvf libiconv-1.9.2.tar. gz;
3. Install libiconv
Copy code The code is as follows:
#configure --prefix=/usr/ local/iconv
#make
#make install
4. Recompile php and add compilation parameters --with-iconv=/usr/local/iconv
under windows
I am working on a thief program recently, and I need to use the iconv function to convert the captured utf-8 encoded pages into gb2312. I found that only by using the iconv function to transcode the captured data, the data will be reduced for no reason. Some. I was depressed for a while. After checking the information on the Internet, I found out that this was a bug in the iconv function. iconv will make an error when converting the character "—" to gb2312
The solution is very simple, that is, add "//IGNORE" after the encoding that needs to be converted, that is, after the second parameter of the iconv function. As follows:
The following For the quoted content:
iconv("UTF-8","GB2312//IGNORE",$data)
ignore means to ignore errors during conversion. If there is no ignore parameter, all characters after this character None of the strings can be saved.
iconv is not the default function of php, and it is also a module installed by default. It needs to be installed before it can be used.
If it is windows2000+php, you can modify the php.ini file and remove the ";" before extension=php_iconv.dll. At the same time, you need to copy the iconv.dll in your original php installation file to your winnt/ Under system32 (if your dll points to this directory)
In the Linux environment, use static installation and add an additional item --with-iconv when configure. phpinfo can see the iconv item. (Linux7.3+Apache4.06+php4.3.2),
Download: ftp://ftp.gnu.org/pub/gnu/libiconv/libiconv-1.8.tar.gz
Installation:
Copy code The code is as follows:
#cp libiconv-1.8.tar.gz /usr/local/src
#tar zxvf lib*
#./configure --prefix=/usr/local/libiconv
#make
#make install
Compile php
#./configure --prefix=/ usr/local/php4.3.2 --with-iconv=/usr/local/libiconv/
Simple example of usage:
echo iconv( "gb2312","ISO-8859-1","we");
Introduction to the mb_convert_encoding and iconv functions in PHP
The mb_convert_encoding function is used to convert encodings. I used to not understand the concept of program coding, but now I seem to understand a little bit.
However, English generally does not have encoding problems, only Chinese data will have this problem. For example, when you use Zend Studio or Editplus to write a program, you use gbk encoding. If the data needs to be entered into the database, and the database encoding is utf8, then the data must be encoded and converted, otherwise it will become garbled when entering the database. .
See the official usage of mb_convert_encoding:
http://cn.php.net/manual/zh/function.mb-convert-encoding.php
Make a GBK To UTF- 8
Copy code The code is as follows:
< ?php
header("content-Type: text/html ; charset=Utf-8");
echo mb_convert_encoding("You are my friend", "UTF-8", "GBK");
?>
Another GB2312 To Big5
Copy code The code is as follows:
< ?php
header("content-Type : text/html; charset=big5");
echo mb_convert_encoding("You are my friend", "big5", "GB2312");
?>
However, to use the above function, you need to install and enable the mbstring extension library first.
Another function iconv in PHP is also used to convert string encoding, and its function is similar to the function above.
There are some detailed examples below:
iconv — Convert string to requested character encoding
(PHP 4 >= 4.0.5, PHP 5)
mb_convert_encoding — Convert character encoding
(PHP 4 >= 4.0.6, PHP 5)
Usage:
string mb_convert_encoding ( string str, string to_encoding [, mixed from_encoding] )
You need to enable the mbstring extension library first, In php.ini, remove the ; in front of; extension=php_mbstring.dll.
mb_convert_encoding can specify multiple input encodings. It will automatically identify according to the content, but the execution efficiency is much worse than iconv;
string iconv (string in_charset, string out_charset, string str)
Note: In addition to specifying the encoding to be converted to, the second parameter can also add two suffixes: //TRANSLIT and //IGNORE, where //TRANSLIT will automatically convert characters that cannot be converted directly into one or more approximate characters. //IGNORE will ignore characters that cannot be converted, and the default effect is to truncate from the first illegal character.
Returns the converted string or FALSE on failure.
Use:
It is found that iconv will make an error when converting the character "-" to gb2312. If there is no ignore parameter, all this The string following the character cannot be saved. No matter what, this "-" cannot be converted successfully and cannot be output. In addition, mb_convert_encoding does not have this bug.
In general, iconv is used. The mb_convert_encoding function is only used when the original encoding cannot be determined, or the iconv conversion cannot be displayed normally.
from_encoding is specified by character code name before conversion. it can be array or string - comma separated enumerated list. If it is not specified, the internal encoding will be used.
/* Auto detect encoding from JIS, eucjp-win, sjis-win, then convert str to UCS-2LE */
$str = mb_convert_encoding($str, "UCS-2LE", "JIS, eucjp-win, sjis-win");
/* "auto ” is expanded to "ASCII,JIS,UTF-8,EUC-JP,SJIS" */
$str = mb_convert_encoding($str, "EUC-JP", "auto");
Example :
Copy code The code is as follows:
$content = iconv("GBK", "UTF-8", $content );
$content = mb_convert_encoding($content, "UTF-8", "GBK");
http://www.bkjia.com/PHPjc/324396.htmlwww.bkjia.comtruehttp: //www.bkjia.com/PHPjc/324396.htmlTechArticle1. Download the libiconv function library http://ftp.gnu.org/pub/gnu/libiconv/libiconv- 1.9.2.tar.gz; 2. Decompress tar -zxvf libiconv-1.9.2.tar.gz; 3. Install libiconv. Copy the code as follows: #...