Home Backend Development PHP Tutorial Example of getting the character length of a utf8 string in php_PHP tutorial

Example of getting the character length of a utf8 string in php_PHP tutorial

Jul 13, 2016 am 10:40 AM
php utf8 Write judgment exist character string Example frame of Obtain form length need verify

When I was writing the form validation class of the framework tonight, I needed to determine whether the length of a certain string was within a specified range. Naturally, I thought of the strlen function in PHP.

The code is as follows


$str = 'Hello world!中';
echo strlen($str); // Output 12

 代码如下  


$str = 'Hello world!中';
echo strlen($str); // 输出12

Test Chinese

The code is as follows

$str = 'Hello, world! ';
echo strlen($str); // Output 12 under GBK or GB2312, output 18 under UTF-8

 代码如下  

$str = '你好,世界!';
echo strlen($str); // GBK或GB2312下输出12,UTF-8下输出18 

PHP’s built-in string length function strlen cannot correctly handle Chinese strings. All it gets is the number of bytes occupied by the string. For the Chinese encoding of GB2312, the value obtained by strlen is twice the number of Chinese characters, while for UTF-8 encoded Chinese, the difference is three times (under UTF-8 encoding, one Chinese character occupies 3 bytes).

The following example is taken from the famous WordPress. It is very accurate. It should also be noted that this function only applies to strings encoded in utf-8.

The code is as follows


function utf8_strlen($string=null){
// Decompose the string into units
Preg_match_all("/./us", $string, $match);
// Return the number of units
Return count($match[0]);
}

 代码如下  


function utf8_strlen($string=null){
    // 将字符串分解为单元
    preg_match_all("/./us", $string, $match);
    // 返回单元个数   
    return count($match[0]);
}

But the above code cannot handle GBK/GB2312 Chinese strings under UTF-8 encoding, because the Chinese characters of GBK/GB2312 will be recognized as two characters and the calculated number of Chinese characters will double, so I I came up with this idea:

The code is as follows

$tmp = @iconv('gbk', 'utf-8', $str);
If(!empty($tmp)){
$str = $tmp;
}
Preg_match_all('/./us', $str, $match);
echo count($match[0]);

 代码如下  

    $tmp = @iconv('gbk', 'utf-8', $str);
    if(!empty($tmp)){
    $str = $tmp;
    }
    preg_match_all('/./us', $str, $match);
    echo count($match[0]);

Compatible with GBK/GB2312 and UTF-8 encoding, passed the test with a small amount of data, but it is not yet confirmed whether it is completely correct

www.bkjia.comtruehttp: //www.bkjia.com/PHPjc/727579.htmlTechArticleWhen writing the form validation class of the framework tonight, I need to determine whether the length of a certain string is within the specified range. Naturally, the strlen function in PHP comes to mind. The code is as follows $str = 'Hello wo...
Statement of this Website
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn

Hot AI Tools

Undresser.AI Undress

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress AI Tool

Undress images for free

Clothoff.io

Clothoff.io

AI clothes remover

AI Hentai Generator

AI Hentai Generator

Generate AI Hentai for free.

Hot Article

R.E.P.O. Energy Crystals Explained and What They Do (Yellow Crystal)
3 weeks ago By 尊渡假赌尊渡假赌尊渡假赌
R.E.P.O. Best Graphic Settings
3 weeks ago By 尊渡假赌尊渡假赌尊渡假赌
R.E.P.O. How to Fix Audio if You Can't Hear Anyone
3 weeks ago By 尊渡假赌尊渡假赌尊渡假赌

Hot Tools

Notepad++7.3.1

Notepad++7.3.1

Easy-to-use and free code editor

SublimeText3 Chinese version

SublimeText3 Chinese version

Chinese version, very easy to use

Zend Studio 13.0.1

Zend Studio 13.0.1

Powerful PHP integrated development environment

Dreamweaver CS6

Dreamweaver CS6

Visual web development tools

SublimeText3 Mac version

SublimeText3 Mac version

God-level code editing software (SublimeText3)

CakePHP Project Configuration CakePHP Project Configuration Sep 10, 2024 pm 05:25 PM

In this chapter, we will understand the Environment Variables, General Configuration, Database Configuration and Email Configuration in CakePHP.

PHP 8.4 Installation and Upgrade guide for Ubuntu and Debian PHP 8.4 Installation and Upgrade guide for Ubuntu and Debian Dec 24, 2024 pm 04:42 PM

PHP 8.4 brings several new features, security improvements, and performance improvements with healthy amounts of feature deprecations and removals. This guide explains how to install PHP 8.4 or upgrade to PHP 8.4 on Ubuntu, Debian, or their derivati

CakePHP Date and Time CakePHP Date and Time Sep 10, 2024 pm 05:27 PM

To work with date and time in cakephp4, we are going to make use of the available FrozenTime class.

CakePHP Working with Database CakePHP Working with Database Sep 10, 2024 pm 05:25 PM

Working with database in CakePHP is very easy. We will understand the CRUD (Create, Read, Update, Delete) operations in this chapter.

CakePHP File upload CakePHP File upload Sep 10, 2024 pm 05:27 PM

To work on file upload we are going to use the form helper. Here, is an example for file upload.

CakePHP Routing CakePHP Routing Sep 10, 2024 pm 05:25 PM

In this chapter, we are going to learn the following topics related to routing ?

Discuss CakePHP Discuss CakePHP Sep 10, 2024 pm 05:28 PM

CakePHP is an open-source framework for PHP. It is intended to make developing, deploying and maintaining applications much easier. CakePHP is based on a MVC-like architecture that is both powerful and easy to grasp. Models, Views, and Controllers gu

CakePHP Creating Validators CakePHP Creating Validators Sep 10, 2024 pm 05:26 PM

Validator can be created by adding the following two lines in the controller.

See all articles