Community

Learn

Tools Library

AI Tools

Leisure

English

Home > Backend Development > PHP Tutorial > GB2312 php smarty intercepts garbled Chinese characters problem? gb2312/utf-8

GB2312 php smarty intercepts garbled Chinese characters problem? gb2312/utf-8

WBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWB

Release： 2016-07-29 08:47:14

Original

1008 people have browsed it

The display of general website pages will inevitably involve the interception of substrings. At this time, truncate comes in handy, but it is only suitable for English users. For Chinese users, using truncate will cause garbled characters, and for Chinese and English For mixed strings, if the same number of strings are intercepted, the actual display lengths will be different, which will appear visually uneven and the image will be beautiful. This is because the length of one Chinese character is roughly equivalent to the length of two English characters. In addition, truncate is not compatible with GB2312, UTF-8 and other encodings at the same time.
Improved smartTruncate: File name: modifier.smartTruncate.php

Copy code The code is as follows:

function smartDetectUTF8($string)
{
static $result = array();
if(! array_key_exists($key = md5($string), $result))
{
$utf8 = "
/^(?:
[x09x0Ax0Dx20-x7E] # ASCII
| [xC2-xDF][x80-xBF ] # non-overlong 2-byte
| x9F][x80-xBF] # excluding surrogates
| 15
| }
return $result[$key];
}
function smartStrlen($string)
{
$result = 0;
$number = smartDetectUTF8($string) ? 3 : 2;
for($i = 0; $ i < strlen($string); $i += $bytes)
{
$bytes = ord(substr($string, $i, 1)) > 127 ? $number : 1;
$result += $ bytes > 1 ? 1.0 : 0.5;
}
return $result;
}
function smartSubstr($string, $start, $length = null)
{
$result = '';
$number = smartDetectUTF8($string ) ? 3 : 2;
if($start < 0)
{
$start = max(smartStrlen($string) + $start, 0);
}
for($i = 0; $i < strlen ($string); $i += $bytes)
{
if($start <= 0)
{
break;
}
$bytes = ord(substr($string, $i, 1)) > 127 ? $number : 1;
$start -= $bytes > 1 ? 1.0 : 0.5;
}
if(is_null($length))
{
$result = substr($string, $i);
}
else
{
for($j = $i; $j < strlen($string); $j += $bytes)
{
if($length <= 0)
{
break;
}
if(($bytes = ord(substr($string, $j, 1)) > 127 ? $number : 1) > 1)
{
if($length < 1.0)
{
break;
}
$result .= substr($string, $j, $bytes);
$length -= 1.0;
}
else
{
$result .= substr($string, $j, 1);
$length - = 0.5;
}
}
}
return $result;
}
function smarty_modifier_smartTruncate($string, $length = 80, $etc = '...',
$break_words = false, $middle = false)
{
if ($length == 0)
return '';
if (smartStrlen($string) > $length) {
$length -= smartStrlen($etc);
if (!$break_words && !$middle) {
$string = preg_replace('/s+?(S+)?$/', '', smartSubstr($string, 0, $length+1));
}
if(!$middle) {
return smartSubstr( $string, 0, $length).$etc;
} else {
return smartSubstr($string, 0, $length/2) . $etc . smartSubstr($string, -$length/2);
}
} else {
return $string;
}
}
?>

The above code fully realizes the original function of truncate, and is compatible with both GB2312 and UTF-8 encoding. When judging the character length, a Chinese character It counts as 1.0, and one English character counts as 0.5, so there will be no unevenness when intercepting substrings.
There is nothing special about how to use the plug-in. Here is a simple test:
{$content|smartTruncate:5: ".."} ($content is equal to "A China B China C People D People E Communist Party F and G Country H")
Display: A China B China C.. (The length of Chinese symbols is counted as 1.0, and the length of English symbols is counted as 0.5, And consider the length of the omitted symbols)
Whether you use GB2312 encoding or UTF-8 encoding, you will find that the results are correct, which is one of the reasons why I added the word smart in the plug-in name.
The above introduces the problem of garbled Chinese characters intercepted by GB2312 php smarty? gb2312/utf-8, including the content of GB2312. I hope it will be helpful to friends who are interested in PHP tutorials.

Related labels：

GB2312

Previous article：The usage of recaptcha when captcha php space does not support socket but supports curl Next article：intentfilter php array_filter removes empty character elements in the array

Statement of this Website

The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn

Latest Articles by Author

What is a NullPointerException, and how do I fix it?

2024-10-22 09:46:29
From Novice to Coder: Your Journey Begins with C Fundamentals

2024-10-13 13:53:41
Unlocking Web Development with PHP: A Beginner's Guide

2024-10-12 12:15:51
Demystifying C: A Clear and Simple Path for New Programmers

2024-10-11 22:47:31
Unlock Your Coding Potential: C Programming for Absolute Beginners

2024-10-11 19:36:51
Unleash Your Inner Programmer: C for Absolute Beginners

2024-10-11 15:50:41
Automate Your Life with C: Scripts and Tools for Beginners

2024-10-11 15:07:41
PHP Made Easy: Your First Steps in Web Development

2024-10-11 14:21:21
Build Anything with Python: A Beginner's Guide to Unleashing Your Creativity

2024-10-11 12:59:11
The Key to Coding: Unlocking the Power of Python for Beginners

2024-10-11 12:17:31

Latest Issues

java - eclipse two projects with different encodings, how to coexist

From 1970-01-01 08:00:00

0

0

0

javascript - dede background column appears garbled problem

From 1970-01-01 08:00:00

0

0

0

Please tell me about the problem of garbled characters when using PDO to connect to MSSQL database and insert it?

From 1970-01-01 08:00:00

0

0

0

PHP failed to connect to database

From 1970-01-01 08:00:00

0

0

0

git - sourcetree comparison view garbled

From 1970-01-01 08:00:00

0

0

0

Related Topics

More>

Popular Recommendations

Popular Tutorials

More>

Related Tutorials

Popular Recommendations

Latest courses

Latest Downloads

More>

Web Effects

Website Source Code

Website Materials

Front End Template