Counting the Chinese Characters in a String with PHP: A Worked Example
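Sometimes we need to count the characters in a string. For a pure-English string the character count equals the string length, which the strlen function returns directly. But what if the string contains Chinese? mb_strlen handles that case, but if the extension happens not to be installed, we have to implement it ourselves.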
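PHP has an extension, mbstring, that is installed almost everywhere, and its mb_strlen function returns the number of characters in a string. Typical usage:
$len = mb_strlen("你是我的小苹果", "utf-8");
As expected, this gives a length of 7.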
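To make the difference concrete, here is a minimal sketch that compares the byte count from strlen with the character count from mb_strlen. It assumes the source file is saved as UTF-8. The function_exists guard is an addition for illustration, not part of the original example.

```php
<?php
// Assumes this file is saved as UTF-8.
$text = "你是我的小苹果";

// strlen counts bytes: each of these 7 characters takes 3 bytes in UTF-8.
$byteCount = strlen($text);

// mb_strlen counts characters, but it only exists when mbstring is loaded.
$charCount = function_exists('mb_strlen') ? mb_strlen($text, 'UTF-8') : null;

echo "bytes: ", $byteCount, PHP_EOL;
echo "characters: ", $charCount === null ? "mbstring not available" : $charCount, PHP_EOL;
```

The gap between the two numbers is exactly what the rest of this article works around.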
What if the mb extension is not installed? Then let's implement it ourselves.
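We first need to be clear about one fact: a string is made of characters, and characters are represented by bytes. Each English character is a single byte holding its ASCII code, and ASCII codes are always below 128 (hexadecimal 0x80). So when a byte's value is greater than 127, that byte is not a complete character on its own; it is one part of a multi-byte character.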
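The rule can be checked byte by byte. The sketch below is only an illustration: the sample string and the labels "ascii" and "multibyte-part" are made up for this example. The bytes are written as explicit escapes so the result does not depend on how the file is saved.

```php
<?php
// "a" followed by 你, written as explicit UTF-8 bytes (E4 BD A0).
$sample = "a\xE4\xBD\xA0";

$kinds = [];
for ($i = 0; $i < strlen($sample); $i++) {
    $value = ord($sample[$i]);
    // Bytes below 0x80 are complete ASCII characters on their own.
    $kinds[] = sprintf('0x%02X %s', $value, $value < 0x80 ? 'ascii' : 'multibyte-part');
}

echo implode(PHP_EOL, $kinds), PHP_EOL;
```

Only the first byte falls below 0x80; the other three all belong to the single character 你.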
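For example, given
$str = "你是我的小苹果";
$str[0] returns the first byte. (Older code writes this as $str{0}, but the curly-brace form was deprecated in PHP 7.4 and removed in PHP 8.0.) Let's see what it is:
php > $str = "你是我的小苹果";
php > echo $str[0];
�
The output is garbled, because it is only one of the bytes of the character 你. In other words, 你 is made up of more than one byte. Let's try this:
php > echo $str[0].$str[1].$str[2];
你
As you can see, printing the three bytes together produces the complete character 你.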
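If the garbled console output is hard to interpret, the same experiment can be done by looking at the bytes in hexadecimal. This is a small sketch using bin2hex and substr, with a shortened two-character sample written as explicit UTF-8 bytes. The variable names are chosen for illustration only.

```php
<?php
// 你是 as explicit UTF-8 bytes: E4 BD A0 and E6 98 AF.
$str = "\xE4\xBD\xA0\xE6\x98\xAF";

// A single byte of a multi-byte character, shown as hex instead of echoed raw.
$firstByte = bin2hex($str[0]);

// The first three bytes together form one complete character.
$firstChar = substr($str, 0, 3);

echo $firstByte, PHP_EOL;
echo bin2hex($firstChar), PHP_EOL;
```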
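Why three bytes here, rather than two or four? That depends on the string's encoding. My console defaults to UTF-8, and in UTF-8 a common Chinese character such as 你 takes three bytes (UTF-8 as a whole uses one to four bytes per character). In GBK the same character takes two bytes. The relationship between encodings and bytes is too big a topic for one post; see the article 字符编码笔记:ascii,unicode和utf8 (Character Encoding Notes: ASCII, Unicode and UTF-8).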
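The byte widths are easy to verify. The sketch below measures one sample character from each UTF-8 length class. The sample characters and array keys are chosen for illustration, and the GBK check is guarded because it assumes the iconv extension is available.

```php
<?php
// One character per UTF-8 length class, written as explicit bytes.
$samples = [
    'U+0041'  => "A",                 // Latin letter A
    'U+00E9'  => "\xC3\xA9",          // e with acute accent
    'U+4F60'  => "\xE4\xBD\xA0",      // 你
    'U+1F600' => "\xF0\x9F\x98\x80",  // an emoji
];

// array_map keeps the string keys when it is given a single array.
$lengths = array_map('strlen', $samples);

foreach ($lengths as $codePoint => $bytes) {
    echo $codePoint, ": ", $bytes, " byte(s) in UTF-8", PHP_EOL;
}

// The same Chinese character converted to GBK, if iconv is loaded.
$gbkLength = function_exists('iconv')
    ? strlen(iconv('UTF-8', 'GBK', $samples['U+4F60']))
    : null;
echo "U+4F60 in GBK: ", $gbkLength === null ? "iconv not available" : $gbkLength, PHP_EOL;
```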
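With that in mind we can write our own character-counting function. The rough flow is:
1. Loop over the bytes with a for loop.
2. If a byte's value is >= 0x80, skip the N bytes that make up that character.
I wrote a simple function that returns the length of a GBK or UTF-8 string. It is for reference only: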
<?php
function mbstrlen($str, $encoding = "utf8")
{
    if (($len = strlen($str)) == 0) {
        return 0;
    }

    $encoding = strtolower($encoding);
    if ($encoding == "utf8" or $encoding == "utf-8") {
        // Treats every non-ASCII character as 3 bytes, which holds for common Chinese characters
        $step = 3;
    } elseif ($encoding == "gbk" or $encoding == "gb2312") {
        $step = 2;
    } else {
        return false; // unsupported encoding
    }

    $count = 0;
    for ($i = 0; $i < $len; $i++) {
        $count++;
        // If the byte value is greater than 127, skip ahead by the character width
        if (ord($str[$i]) >= 0x80) {
            $i = $i + $step - 1; // minus 1 because the for loop itself also does $i++
        }
    }
    return $count;
}

echo mbstrlen(iconv("utf-8", "gbk", "你是我的小苹果"), "gbk"), PHP_EOL; // 7
echo mbstrlen("你是我的小苹果"), PHP_EOL; // 7
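The fixed step of 3 works for common Chinese text, but it miscounts UTF-8 strings that contain 2-byte or 4-byte characters (for example, an accented Latin letter followed by an ASCII letter is counted as one character). A possible alternative, sketched below, swaps in a different technique: instead of skipping ahead, it counts every byte that is not a UTF-8 continuation byte. The helper name utf8_char_count is hypothetical, and the sketch assumes the input is valid UTF-8.

```php
<?php
// Hypothetical helper, not part of the original article.
// Assumes $str is valid UTF-8; malformed input is not detected.
function utf8_char_count(string $str): int
{
    $count = 0;
    $len = strlen($str);
    for ($i = 0; $i < $len; $i++) {
        // Continuation bytes have the bit pattern 10xxxxxx.
        // Every other byte starts a new character, so count only those.
        if ((ord($str[$i]) & 0xC0) !== 0x80) {
            $count++;
        }
    }
    return $count;
}

// "a", a 2-byte letter, 你 (3 bytes) and a 4-byte emoji: 4 characters in 10 bytes.
$mixed = "a\xC3\xA9\xE4\xBD\xA0\xF0\x9F\x98\x80";
echo utf8_char_count($mixed), PHP_EOL;
```

This variant needs no per-encoding step table, but it only applies to UTF-8. GBK has no equivalent marker for trailing bytes, so the skip-ahead approach is still needed there.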
