Home Backend Development PHP Tutorial The difference between strlen, mb_strlen, substr(), mb_substr() and mb_strcut in php_PHP tutorial

The difference between strlen, mb_strlen, substr(), mb_substr() and mb_strcut in php_PHP tutorial

Jul 13, 2016 pm 05:00 PM
php strlen substr about the difference article of

The article introduces in detail the differences and usage of strlen, mb_strlen, substr(), mb_substr() and mb_strcut. Students who need to learn can refer to it.

About the use of the string splitting function of mb_*:
Configuration under win
Need to install php_mbstring.dll extension
You need to open php_mbstring.dll in php.ini
The configuration under Linux can be easily searched online

The code is as follows Copy code
 代码如下 复制代码

//测试时文件的编码方式要是UTF8
$str='中文a字1符';
echo strlen($str).'
';//14
    echo mb_strlen($str,'utf8').'
';//6
    echo mb_strlen($str,'gbk').'
';//8
    echo mb_strlen($str,'gb2312').'
';//10
?>

//The encoding method of the file during testing must be UTF8
$str='Chinese character a';

echo strlen($str).'
';//14

echo mb_strlen($str,'utf8').'
';//6

echo mb_strlen($str,'gbk').'
';//8

echo mb_strlen($str,'gb2312').'
';//10

?>

Result analysis: When calculating strlen, a UTF8 Chinese character is treated as 3 lengths, so the length of "Chinese a character 1 character" is 3*4+2=14. When calculating mb_strlen, the internal code is selected as UTF8, then A Chinese character will be calculated as a length of 1, so the length of "Chinese a character 1 character" is 6

mb_strlen default encoding can be passed

Obtained by mb_internal_encoding().

Using these two functions, you can jointly calculate the occupancy of a mixed Chinese and English string (the occupancy of a Chinese character is 2, and the occupancy of an English character is

1 echo (strlen($str) + mb_strlen($str,'UTF8')) / 2;


PHP’s built-in string length function strlen cannot correctly handle Chinese strings. It only gets the number of bytes occupied by the string. For GB2312 Chinese encoding, the value obtained by strlen is twice the number of Chinese characters, while for UTF-8 encoded Chinese, the difference is three times (under UTF-8 encoding, one Chinese character occupies 3 bytes).

String splitting

The substr() function can split text, but if the text to be split includes Chinese characters, you will often encounter problems. In this case, you can use the mb_substr()/mb_strcut function
 代码如下 复制代码
echo mb_substr('这样一来我的字符串就不会有乱码^_^', 0, 7, 'utf-8');
?>

mb_substr splits characters by words, while mb_strcut splits characters by bytes, but neither will produce half a character.

 代码如下 复制代码
echo mb_strcut('这样一来我的字符串就不会有乱码^_^', 0, 7, 'utf-8');
?>
The substr() function can split text, but if the text to be split includes Chinese characters, you will often encounter problems. In this case, you can use the mb_substr()/mb_strcut function. The usage of mb_substr()/mb_strcut is similar to substr(), except that mb_substr()/mb_strcut needs to add one more parameter at the end to set the encoding of the string. However, most servers do not open php_mbstring.dll. You need to open php_mbstring.dll in php.ini. For example:
The code is as follows Copy code
echo mb_substr('This way my string will not be garbled^_^', 0, 7, 'utf-8');<🎜> ?>
Output: This way my words
The code is as follows Copy code
echo mb_strcut('This way my string will not be garbled^_^', 0, 7, 'utf-8');<🎜> ?>


Output: like this
As can be seen from the above example, mb_substr splits characters by words, while mb_strcut splits characters by bytes, but neither of them will produce half a character...

Description of mbstring function:


PHP's mbstring extension module provides multi-byte character processing capabilities. The most commonly used method is to use mbstring to split multi-byte Chinese characters. This can avoid the occurrence of half characters. Since it is an extension of PHP, its The performance is also better than some custom multi-byte segmentation functions.

The mbstring extension provides several functions with similar functions, mb_substr and mb_strcut. See their explanation in the manual.

mb_substr
mb_substr() returns the portion of str specified by the start and length parameters.

mb_substr() performs multi-byte safe substr() operation based on number of characters. Position is sqlserver/42852.htm target=_blank >counted from the beginning of str. First character's position is 0. Second character position is 1 , and so on.

mb_strcut
mb_strcut() returns the portion of str specified by the start and length parameters.

mb_strcut() performs equivalent operation as mb_substr() with different method. If start position is multi-byte character's second byte or larger, it starts from first byte of multi-byte character.

It subtracts string from str that is shorter than length AND character that is not part of multi-byte string or not being middle of shift sequence.

For another example, there is a piece of text that is segmented using mb_substr and mb_strcut respectively:

PLAIN TEXT
CODE:

The code is as follows
 代码如下 复制代码

$str = '我是一串比较长的中文-www.webjx.com';

echo "mb_substr:" . mb_substr($str, 0, 6, 'utf-8');

echo "
";

echo "mb_strcut:" . mb_strcut($str, 0, 6, 'utf-8');
?>

Copy code

$str = 'I am a relatively long string of Chinese-www.webjx.com';

echo "mb_substr:" . mb_substr($str, 0, 6, 'utf-8');

代码如下 复制代码

/**
* 字符串分割 按字分割
* @param $content string
* @param $length int
* @param $etc string
* @return string
*/
function Truncate($content, $length, $etc = '...') {

if ($length == 0) {
return '';
} elseif (mb_strlen($content,'utf-8') > $length) {
            $length -= min($length, mb_strlen($etc));
            $charset = 'utf-8';
            $content = mb_substr($content, 0, $length, $charset) . $etc;
        }
        return $content;
    }

    $str ='伏尔泰(1694~1778)法国资产阶级启蒙思想家,哲学家,史学家,文学家。伏尔泰原名F.M.阿鲁埃。';

    echo strlen($str);//字符串长度
 echo '


';
    echo mb_strlen($str,'utf-8');//字符串长度
 echo '
';
 echo mb_strcut($str,0,35,'utf-8');//按字节分割
 echo '
';
 echo mb_substr($str,0,35,'utf-8');//按字 分割
 echo '
';
    echo Truncate($str,35);//字符串截取方法

echo "
"; echo "mb_strcut:" . mb_strcut($str, 0, 6, 'utf-8'); ?> The output results are as follows: mb_substr: I am a string of comparisons mb_strcut:I am Test code:
The code is as follows Copy code
/** * String splitting by word * @param $content string * @param $length int * @param $etc string * @return string ​*/ function Truncate($content, $length, $etc = '...') { if ($length == 0) {               return '';              } elseif (mb_strlen($content,'utf-8') > $length) {                $length -= min($length, mb_strlen($etc));                $charset = 'utf-8'; $content = mb_substr($content, 0, $length, $charset) . $etc; }          return $content; } $str ='Voltaire (1694~1778) French bourgeois Enlightenment thinker, philosopher, historian, and writer. Voltaire's original name was F.M. Arrouet. '; echo strlen($str);//String length echo '
'; Echo mb_strlen($str,'utf-8');//String length echo '
'; echo mb_strcut($str,0,35,'utf-8');//Split by bytes echo '
'; echo mb_substr($str,0,35,'utf-8');//Split by word echo '
'; echo Truncate($str,35);//String interception method

www.bkjia.comtruehttp: //www.bkjia.com/PHPjc/631281.htmlTechArticleThe article introduces in detail the differences and usage of strlen, mb_strlen, substr(), mb_substr() and mb_strcut. Students who need to learn can refer to it. Regarding the use of the string splitting function of mb_*:...
Statement of this Website
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn

Hot AI Tools

Undresser.AI Undress

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress AI Tool

Undress images for free

Clothoff.io

Clothoff.io

AI clothes remover

AI Hentai Generator

AI Hentai Generator

Generate AI Hentai for free.

Hot Article

R.E.P.O. Energy Crystals Explained and What They Do (Yellow Crystal)
2 weeks ago By 尊渡假赌尊渡假赌尊渡假赌
Hello Kitty Island Adventure: How To Get Giant Seeds
1 months ago By 尊渡假赌尊渡假赌尊渡假赌
Two Point Museum: All Exhibits And Where To Find Them
1 months ago By 尊渡假赌尊渡假赌尊渡假赌

Hot Tools

Notepad++7.3.1

Notepad++7.3.1

Easy-to-use and free code editor

SublimeText3 Chinese version

SublimeText3 Chinese version

Chinese version, very easy to use

Zend Studio 13.0.1

Zend Studio 13.0.1

Powerful PHP integrated development environment

Dreamweaver CS6

Dreamweaver CS6

Visual web development tools

SublimeText3 Mac version

SublimeText3 Mac version

God-level code editing software (SublimeText3)

What is the difference between pre-market and after-market trading? Detailed explanation of the differences between pre-market and after-market trading What is the difference between pre-market and after-market trading? Detailed explanation of the differences between pre-market and after-market trading Mar 03, 2025 pm 11:54 PM

In traditional financial markets, pre-market and after-market trading refers to trading activities outside the regular trading period. Although the cryptocurrency market is trading around the clock, trading platforms like Bitget also offer similar features, especially some comprehensive platforms that trade stocks and cryptocurrencies at the same time. This article will clarify the differences in pre-market and after-market trading and explore its impact on currency price. Four major differences between pre-market and after-market trading: The main differences between pre-market and after-market trading and regular trading periods are in four aspects: trading time, liquidity, price fluctuations and trading volume: Trading time: Pre-market trading occurs before the official trading starts, and after-market trading is carried out after the regular trading ends. Liquidity: The liquidity of pre- and after-hours trading is low, there are few traders, and the bid and offer price difference is large; while the liquidity is high during the regular trading period, the price is

Why is Bittensor said to be the 'bitcoin' in the AI ​​track? Why is Bittensor said to be the 'bitcoin' in the AI ​​track? Mar 04, 2025 pm 04:06 PM

Original title: Bittensor=AIBitcoin? Original author: S4mmyEth, Decentralized AI Research Original translation: zhouzhou, BlockBeats Editor's note: This article discusses Bittensor, a decentralized AI platform, hoping to break the monopoly of centralized AI companies through blockchain technology and promote an open and collaborative AI ecosystem. Bittensor adopts a subnet model that allows the emergence of different AI solutions and inspires innovation through TAO tokens. Although the AI ​​market is mature, Bittensor faces competitive risks and may be subject to other open source

Is there any difference between South Korean Bitcoin and domestic Bitcoin? Is there any difference between South Korean Bitcoin and domestic Bitcoin? Mar 05, 2025 pm 06:51 PM

The Bitcoin investment boom continues to heat up. As the world's first decentralized digital asset, Bitcoin has attracted much attention on its decentralization and global liquidity. Although China was once the largest market for Bitcoin, policy impacts have led to transaction restrictions. Today, South Korea has become one of the major Bitcoin markets in the world, causing investors to question the differences between it and its domestic Bitcoin. This article will conduct in-depth analysis of the differences between the Bitcoin markets of the two countries. Analysis of the differences between South Korea and China Bitcoin markets. The main differences between South Korea and China’s Bitcoin markets are reflected in prices, market supply and demand, exchange rates, regulatory supervision, market liquidity and trading platforms. Price difference: South Korea’s Bitcoin price is usually higher than China, and this phenomenon is called “Kimchi Premium.” For example, in late October 2024, the price of Bitcoin in South Korea was once

Vertical proxy: Application scenarios and interpretation of disruptive potential of encryption native proxy Vertical proxy: Application scenarios and interpretation of disruptive potential of encryption native proxy Mar 04, 2025 am 10:21 AM

Artificial intelligence agents (AIAgents) are rapidly integrating into daily operations of enterprises, from large companies to small businesses, almost all areas have begun to be used, including sales, marketing, finance, law, IT, project management, logistics, customer service and workflow automation. We are moving from an era of manual processing of data, performing repetitive tasks, and using Excel tables to an era of autonomous operation by AI agents around the clock, which not only improves efficiency but also significantly reduces costs. Application case of AI agents in Web2: YCombinator's Perspective Apten: A sales and marketing optimization tool combining AI and SMS technology. BildAI: A model that can read architectural blueprints,

What exchange is Nexo? Is Nexo exchange safe? What exchange is Nexo? Is Nexo exchange safe? Mar 05, 2025 pm 07:39 PM

Nexo: Not only is it a cryptocurrency exchange, but also your digital financial manager. Nexo is not a traditional cryptocurrency exchange, but a financial platform that focuses more on cryptocurrency lending. It allows users to obtain loans in cryptocurrency as collateral and provides services to earn interest. While Nexo also offers cryptocurrency buying, selling and redemption capabilities, its core business is crypto lending. This article will explore the operating model and security of Nexo in depth to provide investors with a more comprehensive understanding. Nexo's operating model was founded in 2018 and is headquartered in Zug, Switzerland, and is a pioneer in the field of digital finance. It is different from other centralized exchanges and focuses more on providing comprehensive financial services. Users can buy, sell, trade cryptocurrencies without selling assets and

The difference between Ether and Bitcoin What is the difference between Ether and Bitcoin The difference between Ether and Bitcoin What is the difference between Ether and Bitcoin Mar 19, 2025 pm 04:54 PM

The difference between Ethereum and Bitcoin is significant. Technically, Bitcoin uses PoW, and Ether has shifted from PoW to PoS. Trading speed is slow for Bitcoin and Ethereum is fast. In application scenarios, Bitcoin focuses on payment storage, while Ether supports smart contracts and DApps. In terms of issuance, the total amount of Bitcoin is 21 million, and there is no fixed total amount of Ether coins. Each security challenge is available. In terms of market value, Bitcoin ranks first, and the price fluctuations of both are large, but due to different characteristics, the price trend of Ethereum is unique.

What does closing a virtual currency position mean? Same as selling? How to avoid forced closing of positions? What does closing a virtual currency position mean? Same as selling? How to avoid forced closing of positions? Mar 04, 2025 am 06:51 AM

Detailed explanation of closing positions in virtual currency trading: Strategies to avoid the risk of liquidation. This article will deeply explore the concept of "closing positions" in the virtual currency market, and clarify the difference between it and "sell", and how to effectively avoid the risk of forced liquidation (filtering positions). What is virtual currency closing? Close positions refers to investors ending existing positions through reverse trading, thereby locking in profits and losses. For example, investors holding long positions (buy) can close their positions by selling equal amounts of virtual currency; investors holding short positions (sell) need to buy equal amounts of virtual currency to close their positions. A closing operation is essentially closing or releasing an established investment position. Is closing a position equal to selling? Although long closing does involve selling operations, closing and selling are not exactly the same. Close position definition: End opened

How many times is the U standard 2 times equivalent to the U standard? What is the difference between U standard and currency standard? How many times is the U standard 2 times equivalent to the U standard? What is the difference between U standard and currency standard? Mar 04, 2025 am 07:48 AM

Coin Standard and U-Material Perpetual Contract: Conversion and risk analysis of leverage multiples. The pricing methods of perpetual contracts are mainly divided into two types: coin Standard and U-Material. The currency standard contract is settled in the transaction cryptocurrency (such as BTC, ETH), with the goal of obtaining more of the cryptocurrency; the U standard contract is settled in the stablecoin (such as USDT), with the goal of earning more stablecoins, similar to the traditional gold standard system. Many investors are curious: How many times the leverage at the currency standard is equivalent to the U standard? To put it simply, the conversion between the 2x leverage of the currency standard and the leverage of the U standard is roughly equivalent to the 2x leverage of the U standard. However, this equivalence relationship is not absolute, as currency price fluctuations significantly affect the actual leverage effect. The risk of currency standard leverage will fluctuate with the currency price

See all articles