PHP中GBK和UTF8编码处理方法
PHP中GBK和UTF8编码处理方法
一、编码范围
1. GBK (GB2312/GB18030)
\x00-\xff GBK双字节编码范围
\x20-\x7f ASCII
\xa1-\xff 中文
\x80-\xff 中文
2. UTF-8 (Unicode)
\u4e00-\u9fa5 (中文)
\x3130-\x318F (韩文
\xAC00-\xD7A3 (韩文)
\u0800-\u4e00 (日文)
ps: 韩文是大于[\u9fa5]的字符
正则例子:
PHP:
preg_replace(""/([\x80-\xff])/"","""",$str); preg_replace(""/([u4e00-u9fa5])/"","""",$str);
二、其他语言代码例子
PHP:
//判断内容里有没有中文-GBK (PHP) function check_is_chinese($s){ return preg_match('/[\x80-\xff]./', $s); } //获取字符串长度-GBK (PHP) function gb_strlen($str){ $count = 0; for($i=0; $i<strlen($str); $i++){ $s = substr($str, $i, 1); if (preg_match(""/[\x80-\xff]/"", $s)) ++$i; ++$count; } return $count; } //截取字符串字串-GBK (PHP) function gb_substr($str, $len){ $count = 0; for($i=0; $i<strlen($str); $i++){ if($count == $len) break; if(preg_match(""/[\x80-\xff]/"", substr($str, $i, 1))) ++$i; ++$count; } return substr($str, 0, $i); } //统计字符串长度-UTF8 (PHP) function utf8_strlen($str) { $count = 0; for($i = 0; $i <strlen($str); $i++){ $value = ord($str[$i]); if($value> 127) { $count++; if($value>= 192 && $value <= 223) $i++; elseif($value>= 224 && $value <= 239) $i = $i + 2; elseif($value>= 240 && $value <= 247) $i = $i + 3; else die('Not a UTF-8 compatible string'); } $count++; } return $count; } //截取字符串-UTF8(PHP) function utf8_substr($str,$position,$length){ $start_position = strlen($str); $start_byte = 0; $end_position = strlen($str); $count = 0; for($i = 0; $i <strlen($str); $i++){ if($count>= $position && $start_position> $i){ $start_position = $i; $start_byte = $count; } if(($count-$start_byte)>=$length) { $end_position = $i; break; } $value = ord($str[$i]); if($value> 127){ $count++; if($value>= 192 && $value <= 223) $i++; elseif($value>= 224 && $value <= 239) $i = $i + 2; elseif($value>= 240 && $value <= 247) $i = $i + 3; else die('Not a UTF-8 compatible string'); } $count++; } return(substr($str,$start_position,$end_position-$start_position)); }
//字符串长度统计-UTF8 [中文3个字节,俄文、韩文占2个字节,字母占1个字节]
(Ruby)
def utf8_string_length(str) temp = CGI::unescape(str) i = 0; j = 0; temp.length.times{|t| if temp[t] <127 i += 1 elseif temp[t]>= 127 and temp[t] <224 j += 1 if 0 == (j % 2) i += 2 j = 0 end else j += 1 if 0 == (j % 3) i +=2 j = 0 end end } return i }
//判断是否是有韩文-UTF-8
(javascript)
function checkKoreaChar(str) { for(i=0; i<str.length; i++) { if(((str.charCodeAt(i)> 0x3130 && str.charCodeAt(i) <0x318F) || (str.charCodeAt(i)>= 0xAC00 && str.charCodeAt(i) <= 0xD7A3))) { return true; } } return false; } <table style="width:97%;" class="t_table" cellspacing="0"> <tbody> <tr> <td> //判断是否有中文字符-GBK (javascript) function check_chinese_char(s){ return (s.length != s.replace(/[^\x00-\xff]/g,""**"").length); } </td> </tr> </tbody> </table>

Hot AI Tools

Undresser.AI Undress
AI-powered app for creating realistic nude photos

AI Clothes Remover
Online AI tool for removing clothes from photos.

Undress AI Tool
Undress images for free

Clothoff.io
AI clothes remover

AI Hentai Generator
Generate AI Hentai for free.

Hot Article

Hot Tools

Notepad++7.3.1
Easy-to-use and free code editor

SublimeText3 Chinese version
Chinese version, very easy to use

Zend Studio 13.0.1
Powerful PHP integrated development environment

Dreamweaver CS6
Visual web development tools

SublimeText3 Mac version
God-level code editing software (SublimeText3)

Hot Topics

In this chapter, we will understand the Environment Variables, General Configuration, Database Configuration and Email Configuration in CakePHP.

PHP 8.4 brings several new features, security improvements, and performance improvements with healthy amounts of feature deprecations and removals. This guide explains how to install PHP 8.4 or upgrade to PHP 8.4 on Ubuntu, Debian, or their derivati

To work with date and time in cakephp4, we are going to make use of the available FrozenTime class.

To work on file upload we are going to use the form helper. Here, is an example for file upload.

In this chapter, we are going to learn the following topics related to routing ?

CakePHP is an open-source framework for PHP. It is intended to make developing, deploying and maintaining applications much easier. CakePHP is based on a MVC-like architecture that is both powerful and easy to grasp. Models, Views, and Controllers gu

Visual Studio Code, also known as VS Code, is a free source code editor — or integrated development environment (IDE) — available for all major operating systems. With a large collection of extensions for many programming languages, VS Code can be c

Validator can be created by adding the following two lines in the controller.
