Home Backend Development PHP Tutorial Wang Shuai: In-depth PHP kernel (1) - Exploring the principle of weakly typed variables

Wang Shuai: In-depth PHP kernel (1) - Exploring the principle of weakly typed variables

Aug 08, 2016 am 09:21 AM
nbsp php type zval

PHP is a simple and powerful language that provides many language features suitable for the Web, including weak typing of variables. Under the weak typing mechanism, you can assign any type of value to a variable.
PHP is executed through Zend Engine (hereinafter referred to as ZE). ZE is written in C and implements a set of weak type mechanisms at the bottom. ZE's memory management uses optimization strategies such as copy-on-write and reference counting to reduce memory copies when reassigning variables.

The following not only takes you to explore the principles of PHP weak typing, but also writes about PHP extensions and introduces how to operate PHP variables.

1. PHP variable types

There are 8 variable types in PHP:

  • Standard types: boolean, integer, floating point float, string
  • Complex types: array array, object
  • Special type: resource

PHP does not strictly check the variable type. Variables can declare their type without displaying it, and assign values ​​directly during runtime. Variables can also be converted freely. As in the following example, without implementation declaration, $i can be assigned any type of value.

[php] view plaincopy

  1. $i = 1; //int $i = 'show me the money'; //string $i = 0.02; //float $i = array( 1, 2, 3); // array $i = new Exception('test', 123); // object $i = fopen('/tmp/aaa.txt', 'a') // resource ?>

If you don’t have a deep understanding of the principle of weak typing, you will have “exceeding expectations” surprises when comparing variables.

[php] view plaincopy

  1. $str1 = null; $str2 = false; echo$str1 ==$str2 ? 'Equal' : 'Not equal'; $str3 = ''; $str4 = 0; echo $str3= =$str4 ? 'equal' : 'not equal'; $str5 = 0; $str6 = '0'; echo $str5==$str6 ? 'equal' : 'not equal'; ?>

All the above three results They are equal because PHP performs variable conversion internally when comparing variables. If you want the value and type to be determined at the same time, please use three = (for example, $a===0) to determine. Maybe you will find it commonplace, maybe you will find it amazing, then please join me to delve into the PHP kernel and explore the principle of PHP variables.

2. Introduction to variable storage and standard types

All variables in PHP are implemented with the structure zval. In Zend/zend.h we can see the definition of zval:

[php] view plaincopy

  1. typedef union _zvalue_value { long lval; /* long value */ double dval; /* dou ble value */ struct { char *val; int len; /* this will always be set for strings */ } str; /* string (always has length) */ HashTable *ht; /* an array */ zend_object_value obj; /* stores an object store handle, and handlers */}zvalue_value; means reference count
1
is_ref__gc valueThe value of the stored variableThe specific type of the variable
Indicates whether it is a reference 0
type

Among them, refcount__gc and is_ref__gc indicate whether the variable is a reference. The type field identifies the type of the variable. The value of type can be: IS_NULL, IS_BOOL, IS_LONG, IS_FLOAT, IS_STRING, IS_ARRAY, IS_OBJECT, IS_RESOURCE. PHP chooses how to store zvalue_value based on the type.
zvalue_value can realize the core of variable weak type, defined as follows:

[php] view plaincopy

  1. typedef union _zvalue_value { long lval; /* long value */ double dval; /* double value */ struct { char *val; int len; /* this will always be set for strings */ } str; /* string (always has length) */ HashTable *ht; /* an array */ zend_object_value obj; /* stores an object store handle, and handlers */ } zvalue_value;
Boolean type, zval.type=IS_BOOL, will read the zval.value.lval field, the value is 1/0. If it is a string, zval.type=IS_STRING will read zval.value.str, which is a structure that stores the string pointer and length.

In C language, use " plaincopy

typedefstruct_zend_rsrc_list_entry {    void *ptr;   int type;   int refcount; }zend_rsrc_list_entry; The final pointer to the implementation, such as a file handle or a database connection structure . type is a type tag used to distinguish different resource types. refcount is used for reference counting of resources.

In the kernel, the resource type is obtained through the function ZEND_FETCH_RESOURCE.

[php] view plaincopy

  1. ZEND_FETCH_RESOURCE(con, type, zval *, default, resource_name, resource_type);

5. Conversion of variable types

Follow Now what we know about the PHP language, variables The type depends on the zval.type field indication, and the content of the variable is stored in zval.value according to zval.type. When variables are needed in PHP, only two steps are required: change the value or pointer of zval.value, and then change the type of zval.type. However, for some of PHP's advanced variables Array/Object/Resource, variable conversion requires more operations.

Variable conversion principles are divided into 3 types:

5.1 Standard type mutual conversion

is relatively simple, just follow the above steps for conversion.

5.2 Standard type and resource type conversion

The resource type can be understood as int, which is more convenient for converting standard types. After conversion, the resource will be closed or recycled.

[php] view plaincopy

  1. $var = fopen('/tmp/aaa.txt', 'a' );// Resources #1 $var = (int) $var; var_dump($var); // Output 1 ?>

5.3 Standard type and complex type conversion

Array conversion int/floating point type Float will return the number of elements; conversion to bool will return whether there are elements in Array; conversion to string will return 'Array' and throw a warning.
Details depend on experience, please read the PHP manual: http://php.net/manual/en/language.types.type-juggling.php

5.4 Complex type mutual conversion

array and object can be converted to each other. If any other type of value is converted to an object, an instance of the built-in class stdClass will be created.

When we write PHP extensions, the PHP kernel provides a set of functions for type conversion:

void convert_to_long(zval* pzval)
void convert_to_double(zval* pzval)
void convert_to_long_base(zval* pzval, int base)
void convert_to_null(zval* pzval)
void convert_to_boolean(zval* pzval)
void convert_to_array(zval* pzval)
void convert_to_object( zval* pzval)
void convert_object_to_type(zval* pzval, convert_func_t converter)

A set of macros provided by the PHP kernel to conveniently access zval and obtain the value of zval in a more fine-grained manner:

Kernel access to zval container API
macro Access variables
Z_LVAL(zval) (zval).value.lval
Z_DVAL(zval) (zval).value.dval
Z_STRVAL(zval) (zval).value.str.val
Z_STRLEN(zval) (zval).value.str. len
Z_ARRVAL(zval) (zval).value.ht
Z_TYPE(zval) (zval).type
Z_ LVAL_P(zval) ( *zval).value.lval
Z_DVAL_P(zval) (*zval).value.dval
Z_STRVAL_P(zval_p) (*zval).value.str.val
Z_STRLEN_P(zval_p) (*zval).value.str.len
Z_ARRVAL_P(zval_p) (*zval).value.ht
Z_OBJ_HT_P(zval_p) (*zval).value.obj.handlers
Z_LVAL_PP(zval_pp) (**zval).value.lval
Z_DVAL_PP(zval_pp) (**zval ).value.dval
Z_STRVAL_PP(zval_pp) (**zval).value.str.val
Z_STRLEN_PP(zval_pp) (**zval).value.str. len
Z_ARRVAL_PP(zval_pp) (**zval). value.ht

6. Variable symbol table and scope

PHP’s variable symbol table and zval value mapping is through HashTable (hash table, also called hash table, hereinafter referred to as HT). HashTable is widely used in ZE, including Language features such as constants, variables, and functions are organized by HT, and the array type in PHP is also implemented through HashTable.
For example:

[php] view plaincopy The variable name of $var will be stored in the variable symbol table and represents $ The zval structure of the var's type and value is stored in a hash table. The kernel implements access to PHP variables through the hash mapping of the variable symbol table and the zval address.

Why do we need to mention scope? Because the internal variables of the function are protected. According to the scope, PHP variables are divided into global variables and local variables. Each scope PHP maintains a HashTable of symbol tables. When creating a function or class in PHP, ZE will create a new symbol table to indicate that the variables in the function or class are local variables. This achieves the protection of local variables - variables inside the function cannot be accessed from the outside. When creating a PHP variable, ZE will assign a zval, set the corresponding type and initial value, and add the variable to the symbol table of the current scope so that the user can use the variable.
    ZEND_SET_SYMBOL is used in the kernel to set variables:
  1. [php] view plaincopy ZEND_SET_SYMBOL(EG(active_symbol_table), "foo"
  2. , foo);


View _zend_executor_globals structure

[php] view plaincopy

  1. Zend/zend_globals.h struct _zend_executor_globals { ​HashTable *active_symbol_table;//Symbol table of local variables //Omitted };

When writing a PHP extension, you can access PHP's variable symbol table through the EG macro. EG (symbol_table) accesses the variable symbol table of the global scope, and EG (active_symbol_table) accesses the variable symbol table of the current scope. The local variable stores a pointer, which is passed to the corresponding function when operating on the HashTable.

In order to better understand the hash table and scope of variables, let’s take a simple example:
  1. [php] view plaincopy
  2. $temp =
  3. 'global'

;

function

test() {

$temp

=

    'active'
  1. ; } test(); var_dump($temp); ?> Create a variable $temp outside the function, which will be added to the global symbol table and allocated in the HashTable of the global symbol table A character zval with value 'global'. Create the internal variable $temp of the function test, add it to the symbol table belonging to the function test, and assign the character type zval with the value 'active'. 7. Variable operation in PHP extensionCreate PHP variablesWe can call the function MAKE_STD_ZVAL (pzv) in the extension to create a PHP callable variable. The macros applied to MAKE_STD_ZVAL are: [php] view plaincopy
    1. #define MAKE_STD_ZVAL(zv) ALLOC_ZVAL(zv);INIT_PZVAL(zv) #define ALLOC_ZVAL(z) ZEND_FAST_ALLOC (z, zval, ZVAL_CACHE_LIST) #define ZEND_FAST_ALLOC(p, type, fc_type) (p) = (type * ) emalloc(sizeof(type)) #define INIT_PZVAL(z) (z)->refcount__gc = 1;(z)->is_ref__gc = 0;

    MAKE_ After STD_ZVAL(foo) is expanded, we get:

    [php] view plaincopy

    1. (foo) = (zval *) emalloc(sizeof(zval)); (foo)->refcount__gc = 1; (foo)->is_ref__gc = 0;

    Can watch Out, MAKE_STD_ZVAL does three things: allocates memory, initializes refcount and is_ref in the zval structure.

    The kernel provides some macros to simplify our operations. You can set the type and value of zval in just one step.

    ZVAL_BOOL(pzv, 0);Z_TYPE_P(pzv) = IS_LONG;Z_LVAL_P(pzv) = l;Z_TYPE_P(pzv) = IS_DOUBLE;Z_LVAL_P(pzv) = d;Z_TYPE_P(pzv) = IS_STRING;Z_STRLEN_P (pzv) = len; {Z_STRVAL_P(pzv) = str;} Z_TYPE_P(pzv) = IS_RESOURCE;Z_RESVAL_P(pzv) = res;


    The dup parameter in ZVAL_STRINGL(pzv,str,len,dup)

    First explain ZVAL_STRINGL(pzv,str,len,dup); The two parameters str and len are easy to understand, because we know that they are saved in the kernel The address of the string and its length. The meaning of the following dup is actually very simple. It indicates whether the string needs to be copied. If the value is 1, a new memory will be allocated first and the string will be assigned, and then the address of the new memory will be copied to pzv. If it is 0, the address of str will be directly assigned to zval.

    The difference between ZVAL_STRINGL and ZVAL_STRING

    If you want to intercept the string at a certain position or already know the length of the string, you can use the macro ZVAL_STRINGL(zval, string, length, duplicate), which is explicitly specified String length instead of using strlen(). This macro takes the string length as argument. But it is binary safe and faster than ZVAL_STRING because there is one less strlen.
    ZVAL_RESOURCE is approximately equal to ZVAL_LONG

    In Chapter 4, we said that the value of the resource type in PHP is an integer, so ZVAL_RESOURCE works similarly to ZVAL_LONG, except that it sets the type of zval to IS_RESOURCE.

    8. Summary

    PHP’s weak typing is completed through ZE’s zval container conversion. Variable names and zval data are stored through hash tables, which has a certain sacrifice in operating efficiency. In addition, due to the implicit conversion of variable types, insufficient detection of variable types during the development process may cause problems.

    However, PHP’s language features such as weak typing, arrays, memory hosting, and extensions are very suitable for web development scenarios. The development efficiency is very high and it can speed up the product iteration cycle. In massive services, often the bottleneck lies in the data access layer, not the language itself. In actual use, PHP not only serves as the logic layer and presentation layer, we even use the UDPServer/TCPServer developed by PHP as the middle layer of data and cache.

    The above introduces Wang Shuai: In-depth PHP Core (1) - Exploring the Principle of Weakly Typed Variables, including various aspects. I hope it will be helpful to friends who are interested in PHP tutorials.

    API Macros for Accessing zval
    Macro implementation method
    ZVAL_NULL(pvz) Z_TYPE_P(pzv) = IS_NULL
    ZVAL_BOOL(pvz) Z_TYPE_P(pzv) = IS_BOOL;
    Z_BVAL_P(pzv) = b ? 1 : 0; VAL_FALSE(pvz)
    ZVAL_LONG(pvz, l)(l is the value)
    ZVAL_DOU BLE(pvz, d )
    ZVAL_STRINGL(pvz, str, len, dup)
    if (dup) { {Z_STRVAL_P(pzv) =estrndup(str, len + 1);} }else { }
    ZVAL_STRING(pvz, str, len) ZVAL_STRINGL(pzv, str,strlen(str), dup);




    ZVAL_RESOURCE(pvz, res)
Statement of this Website
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn

Hot AI Tools

Undresser.AI Undress

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress AI Tool

Undress images for free

Clothoff.io

Clothoff.io

AI clothes remover

Video Face Swap

Video Face Swap

Swap faces in any video effortlessly with our completely free AI face swap tool!

Hot Tools

Notepad++7.3.1

Notepad++7.3.1

Easy-to-use and free code editor

SublimeText3 Chinese version

SublimeText3 Chinese version

Chinese version, very easy to use

Zend Studio 13.0.1

Zend Studio 13.0.1

Powerful PHP integrated development environment

Dreamweaver CS6

Dreamweaver CS6

Visual web development tools

SublimeText3 Mac version

SublimeText3 Mac version

God-level code editing software (SublimeText3)

PHP 8.4 Installation and Upgrade guide for Ubuntu and Debian PHP 8.4 Installation and Upgrade guide for Ubuntu and Debian Dec 24, 2024 pm 04:42 PM

PHP 8.4 brings several new features, security improvements, and performance improvements with healthy amounts of feature deprecations and removals. This guide explains how to install PHP 8.4 or upgrade to PHP 8.4 on Ubuntu, Debian, or their derivati

How To Set Up Visual Studio Code (VS Code) for PHP Development How To Set Up Visual Studio Code (VS Code) for PHP Development Dec 20, 2024 am 11:31 AM

Visual Studio Code, also known as VS Code, is a free source code editor — or integrated development environment (IDE) — available for all major operating systems. With a large collection of extensions for many programming languages, VS Code can be c

7 PHP Functions I Regret I Didn't Know Before 7 PHP Functions I Regret I Didn't Know Before Nov 13, 2024 am 09:42 AM

If you are an experienced PHP developer, you might have the feeling that you’ve been there and done that already.You have developed a significant number of applications, debugged millions of lines of code, and tweaked a bunch of scripts to achieve op

How do you parse and process HTML/XML in PHP? How do you parse and process HTML/XML in PHP? Feb 07, 2025 am 11:57 AM

This tutorial demonstrates how to efficiently process XML documents using PHP. XML (eXtensible Markup Language) is a versatile text-based markup language designed for both human readability and machine parsing. It's commonly used for data storage an

Explain JSON Web Tokens (JWT) and their use case in PHP APIs. Explain JSON Web Tokens (JWT) and their use case in PHP APIs. Apr 05, 2025 am 12:04 AM

JWT is an open standard based on JSON, used to securely transmit information between parties, mainly for identity authentication and information exchange. 1. JWT consists of three parts: Header, Payload and Signature. 2. The working principle of JWT includes three steps: generating JWT, verifying JWT and parsing Payload. 3. When using JWT for authentication in PHP, JWT can be generated and verified, and user role and permission information can be included in advanced usage. 4. Common errors include signature verification failure, token expiration, and payload oversized. Debugging skills include using debugging tools and logging. 5. Performance optimization and best practices include using appropriate signature algorithms, setting validity periods reasonably,

PHP Program to Count Vowels in a String PHP Program to Count Vowels in a String Feb 07, 2025 pm 12:12 PM

A string is a sequence of characters, including letters, numbers, and symbols. This tutorial will learn how to calculate the number of vowels in a given string in PHP using different methods. The vowels in English are a, e, i, o, u, and they can be uppercase or lowercase. What is a vowel? Vowels are alphabetic characters that represent a specific pronunciation. There are five vowels in English, including uppercase and lowercase: a, e, i, o, u Example 1 Input: String = "Tutorialspoint" Output: 6 explain The vowels in the string "Tutorialspoint" are u, o, i, a, o, i. There are 6 yuan in total

Explain late static binding in PHP (static::). Explain late static binding in PHP (static::). Apr 03, 2025 am 12:04 AM

Static binding (static::) implements late static binding (LSB) in PHP, allowing calling classes to be referenced in static contexts rather than defining classes. 1) The parsing process is performed at runtime, 2) Look up the call class in the inheritance relationship, 3) It may bring performance overhead.

What are PHP magic methods (__construct, __destruct, __call, __get, __set, etc.) and provide use cases? What are PHP magic methods (__construct, __destruct, __call, __get, __set, etc.) and provide use cases? Apr 03, 2025 am 12:03 AM

What are the magic methods of PHP? PHP's magic methods include: 1.\_\_construct, used to initialize objects; 2.\_\_destruct, used to clean up resources; 3.\_\_call, handle non-existent method calls; 4.\_\_get, implement dynamic attribute access; 5.\_\_set, implement dynamic attribute settings. These methods are automatically called in certain situations, improving code flexibility and efficiency.

See all articles