The number of bytes a character occupies in PHP depends on the encoding of the string. In ANSI encoding (here meaning the system code page, such as GBK on Simplified Chinese Windows), a Chinese character occupies 2 bytes and an English character occupies 1 byte; in UTF-8, a commonly used Chinese character occupies 3 bytes and an English character occupies 1 byte.
Byte counts by encoding:
ANSI (GBK): a Chinese character occupies 2 bytes and an English character occupies 1 byte;
UTF-8: a commonly used Chinese character occupies 3 bytes and an English character occupies 1 byte;
Unicode (the Windows label for UTF-16): a commonly used Chinese character occupies 2 bytes and an English character also occupies 2 bytes.
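The byte counts listed above can be checked with a short script. This is only a minimal sketch: it assumes PHP 7.0 or later with the mbstring extension enabled, uses `CP936` (the Windows code page for GBK) to stand in for "ANSI", uses `UTF-16LE` to stand in for "Unicode", and `byteLengthIn` is a hypothetical helper name introduced for illustration.

```php
<?php
// Assumptions: PHP 7.0+ (for the \u{...} escape) and the mbstring extension.

$chinese = "\u{4E2D}"; // the character 中, stored as a UTF-8 string
$english = "A";

// Hypothetical helper: byte length of a UTF-8 string after converting it to $encoding.
function byteLengthIn(string $utf8Text, string $encoding): int
{
    // strlen() counts bytes, not characters.
    return strlen(mb_convert_encoding($utf8Text, $encoding, 'UTF-8'));
}

// CP936 stands in for "ANSI" (GBK); UTF-16LE stands in for "Unicode".
foreach (['CP936', 'UTF-8', 'UTF-16LE'] as $encoding) {
    printf(
        "%-8s Chinese: %d byte(s), English: %d byte(s)\n",
        $encoding,
        byteLengthIn($chinese, $encoding),
        byteLengthIn($english, $encoding)
    );
}
```

The script prints one line per encoding, and the numbers should match the list above. On a system whose ANSI code page is not GBK, the "ANSI" figures would differ.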
The difference between characters and bytes:
(1) Definition of "byte"
A byte is a unit of measurement for the amount of data. In computing it is the basic unit of storage capacity, and one byte consists of 8 bits.
(2) Definition of "character"
Characters are the letters, digits, and symbols used in computers, such as 1, 2, 3, A, B, C, ~ ! # ¥ % * ( ), and so on.
Digits, letters, and other symbols are all characters. An ASCII character such as an English letter or a digit occupies one byte, while a Chinese character occupies two bytes in GBK and three bytes in UTF-8.