Analysis of the source code of array_keys and array_unique functions in PHP, arraykeys

Table of Contents

Analysis of array_keys and array_unique function source code in PHP, arraykeys

Articles you may be interested in:

Home

Backend Development

PHP Tutorial

Analysis of the source code of array_keys and array_unique functions in PHP, arraykeys_PHP tutorial

WBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWB

Jul 12, 2016 am 08:58 AM

array keys unique

Analysis of array_keys and array_unique function source code in PHP, arraykeys

Performance analysis

From the perspective of running performance, take a look at the following test code:

$test=array();
for($run=0; $run<10000; $run++)
$test[]=rand(0,100);

$time=microtime(true);

$out = array_unique($test);

$time=microtime(true)-$time;
echo 'Array Unique: '.$time."\n";

$time=microtime(true);

$out=array_keys(array_flip($test));

$time=microtime(true)-$time;
echo 'Keys Flip: '.$time."\n";

$time=microtime(true);

$out=array_flip(array_flip($test));

$time=microtime(true)-$time;
echo 'Flip Flip: '.$time."\n";

Copy after login

The running results are as follows:

As you can see from the picture above, using the array_unique function takes 0.069s; using array_flip and then using the array_keys function takes 0.00152s; using the array_flip function twice takes 0.00146s.

The test results show that using array_flip and then calling the array_keys function is faster than the array_unique function. So, what is the specific reason? Let's take a look at how these two functions are implemented at the bottom of PHP.

Source code analysis

/* {{{ proto array array_keys(array input [, mixed search_value[, bool strict]])
  Return just the keys from the input array, optionally only for the specified       search_value */
PHP_FUNCTION(array_keys)
{
  //变量定义
  zval *input,        /* Input array */
     *search_value = NULL,  /* Value to search for */
     **entry,        /* An entry in the input array */
      res,          /* Result of comparison */
     *new_val;        /* New value */
  int  add_key;        /* Flag to indicate whether a key should be added */
  char *string_key;      /* String key */
  uint  string_key_len;
  ulong num_key;        /* Numeric key */
  zend_bool strict = 0;    /* do strict comparison */
  HashPosition pos;
  int (*is_equal_func)(zval *, zval *, zval * TSRMLS_DC) = is_equal_function;

  //程序解析参数
  if (zend_parse_parameters(ZEND_NUM_ARGS() TSRMLS_CC, "a|zb", &input, &search_value, &strict) == FAILURE) {
    return;
  }

  // 如果strict是true，则设置is_equal_func为is_identical_function，即全等比较
  if (strict) {
    is_equal_func = is_identical_function;
  }

  /* 根据search_vale初始化返回的数组大小 */
  if (search_value != NULL) {
    array_init(return_value);
  } else {
    array_init_size(return_value, zend_hash_num_elements(Z_ARRVAL_P(input)));
  }
  add_key = 1;

  /* 遍历输入的数组参数，然后添加键值到返回的数组 */
  zend_hash_internal_pointer_reset_ex(Z_ARRVAL_P(input), &pos);//重置指针
  //循环遍历数组
  while (zend_hash_get_current_data_ex(Z_ARRVAL_P(input), (void **)&entry, &pos) == SUCCESS) {
    // 如果search_value不为空
    if (search_value != NULL) {
      // 判断search_value与当前的值是否相同，并将比较结果保存到add_key变量
      is_equal_func(&res, search_value, *entry TSRMLS_CC);
      add_key = zval_is_true(&res);
    }

    if (add_key) {
      // 创建一个zval结构体
      MAKE_STD_ZVAL(new_val);

      // 根据键值是字符串还是整型数字将值插入到return_value中
      switch (zend_hash_get_current_key_ex(Z_ARRVAL_P(input), &string_key, &string_key_len, &num_key, 1, &pos)) {
        case HASH_KEY_IS_STRING:
          ZVAL_STRINGL(new_val, string_key, string_key_len - 1, 0);
          // 此函数负责将值插入到return_value中，如果键值已存在，则使用新值更新对应的值，否则直接插入
          zend_hash_next_index_insert(Z_ARRVAL_P(return_value), &new_val, sizeof(zval *), NULL);
          break;

        case HASH_KEY_IS_LONG:
          Z_TYPE_P(new_val) = IS_LONG;
          Z_LVAL_P(new_val) = num_key;
          zend_hash_next_index_insert(Z_ARRVAL_P(return_value), &new_val, sizeof(zval *), NULL);
          break;
      }
    }

    // 移动到下一个
    zend_hash_move_forward_ex(Z_ARRVAL_P(input), &pos);
  }
}
/* }}} */

Copy after login

The above is the underlying source code of array_keys function. To facilitate understanding, the author has added some Chinese comments. If you need to view the original code, you can click to view it. The function of this function is to create a temporary array, and then copy the key-value pairs to the new array. If duplicate key values appear during the copying process, replace them with new values. The main step of this function is the zend_hash_next_index_insert function called on lines 57 and 63. This function inserts elements into the array. If a duplicate value appears, the new value is used to update the value pointed to by the original key value. Otherwise, it is inserted directly. The time complexity is O(n).

/* {{{ proto array array_flip(array input)
  Return array with key <-> value flipped */
PHP_FUNCTION(array_flip)
{
  // 定义变量
  zval *array, **entry, *data;
  char *string_key;
  uint str_key_len;
  ulong num_key;
  HashPosition pos;

  // 解析数组参数
  if (zend_parse_parameters(ZEND_NUM_ARGS() TSRMLS_CC, "a", &array) == FAILURE) {
    return;
  }

  // 初始化返回数组
  array_init_size(return_value, zend_hash_num_elements(Z_ARRVAL_P(array)));

  // 重置指针
  zend_hash_internal_pointer_reset_ex(Z_ARRVAL_P(array), &pos);
  // 遍历每个元素，并执行键<->值交换操作
  while (zend_hash_get_current_data_ex(Z_ARRVAL_P(array), (void **)&entry, &pos) == SUCCESS) {
    // 初始化一个结构体
    MAKE_STD_ZVAL(data);
    // 将原数组的值赋值为新数组的键
    switch (zend_hash_get_current_key_ex(Z_ARRVAL_P(array), &string_key, &str_key_len, &num_key, 1, &pos)) {
      case HASH_KEY_IS_STRING:
        ZVAL_STRINGL(data, string_key, str_key_len - 1, 0);
        break;
      case HASH_KEY_IS_LONG:
        Z_TYPE_P(data) = IS_LONG;
        Z_LVAL_P(data) = num_key;
        break;
    }

    // 将原数组的键赋值为新数组的值，如果有重复的，则使用新值覆盖旧值
    if (Z_TYPE_PP(entry) == IS_LONG) {
      zend_hash_index_update(Z_ARRVAL_P(return_value), Z_LVAL_PP(entry), &data, sizeof(data), NULL);
    } else if (Z_TYPE_PP(entry) == IS_STRING) {
      zend_symtable_update(Z_ARRVAL_P(return_value), Z_STRVAL_PP(entry), Z_STRLEN_PP(entry) + 1, &data, sizeof(data), NULL);
    } else {
      zval_ptr_dtor(&data); /* will free also zval structure */
      php_error_docref(NULL TSRMLS_CC, E_WARNING, "Can only flip STRING and INTEGER values!");
    }

    // 下一个
    zend_hash_move_forward_ex(Z_ARRVAL_P(array), &pos);
  }
}
/* }}} */

Copy after login

The above is the source code of array_flip function. Click the link to view the original code. The main thing this function does is to create a new array and traverse the original array. At line 26, the values of the original array are assigned to the keys of the new array, and then at line 37, the keys of the original array are assigned to the values of the new array. If there are duplicates, the new values are used to overwrite the old values. The time complexity of the entire function is also O(n). Therefore, the time complexity of using array_keys after using array_flip is O(n).

Next, let’s take a look at the source code of the array_unique function. Click the link to view the original code.

/* {{{ proto array array_unique(array input [, int sort_flags])
  Removes duplicate values from array */
PHP_FUNCTION(array_unique)
{
  // 定义变量
  zval *array, *tmp;
  Bucket *p;
  struct bucketindex {
    Bucket *b;
    unsigned int i;
  };
  struct bucketindex *arTmp, *cmpdata, *lastkept;
  unsigned int i;
  long sort_type = PHP_SORT_STRING;

  // 解析参数
  if (zend_parse_parameters(ZEND_NUM_ARGS() TSRMLS_CC, "a|l", &array, &sort_type) == FAILURE) {
    return;
  }

  // 设置比较函数
  php_set_compare_func(sort_type TSRMLS_CC);

  // 初始化返回数组
  array_init_size(return_value, zend_hash_num_elements(Z_ARRVAL_P(array)));
  // 将值拷贝到新数组
  zend_hash_copy(Z_ARRVAL_P(return_value), Z_ARRVAL_P(array), (copy_ctor_func_t) zval_add_ref, (void *)&tmp, sizeof(zval*));

  if (Z_ARRVAL_P(array)->nNumOfElements <= 1) {  /* 什么都不做 */
    return;
  }

  /* 根据target_hash buckets的指针创建数组并排序 */
  arTmp = (struct bucketindex *) pemalloc((Z_ARRVAL_P(array)->nNumOfElements + 1) * sizeof(struct bucketindex), Z_ARRVAL_P(array)->persistent);
  if (!arTmp) {
    zval_dtor(return_value);
    RETURN_FALSE;
  }
  for (i = 0, p = Z_ARRVAL_P(array)->pListHead; p; i++, p = p->pListNext) {
    arTmp[i].b = p;
    arTmp[i].i = i;
  }
  arTmp[i].b = NULL;
  // 排序
  zend_qsort((void *) arTmp, i, sizeof(struct bucketindex), php_array_data_compare TSRMLS_CC);

  /* 遍历排序好的数组，然后删除重复的元素 */
  lastkept = arTmp;
  for (cmpdata = arTmp + 1; cmpdata->b; cmpdata++) {
    if (php_array_data_compare(lastkept, cmpdata TSRMLS_CC)) {
      lastkept = cmpdata;
    } else {
      if (lastkept->i > cmpdata->i) {
        p = lastkept->b;
        lastkept = cmpdata;
      } else {
        p = cmpdata->b;
      }
      if (p->nKeyLength == 0) {
        zend_hash_index_del(Z_ARRVAL_P(return_value), p->h);
      } else {
        if (Z_ARRVAL_P(return_value) == &EG(symbol_table)) {
          zend_delete_global_variable(p->arKey, p->nKeyLength - 1 TSRMLS_CC);
        } else {
          zend_hash_quick_del(Z_ARRVAL_P(return_value), p->arKey, p->nKeyLength, p->h);
        }
      }
    }
  }
  pefree(arTmp, Z_ARRVAL_P(array)->persistent);
}
/* }}} */

Copy after login

As you can see, this function initializes a new array, then copies the values to the new array, and then calls the sorting function on line 45 to sort the array. The sorting algorithm is the block tree sorting algorithm of the zend engine. Then iterate through the sorted array and delete duplicate elements. The most expensive part of the entire function is calling the sorting function, and the time complexity of quick sort is O(nlogn). Therefore, the time complexity of this function is O(nlogn).

Conclusion

Because the bottom layer of array_unique calls the quick sort algorithm, which increases the time cost of function running, causing the entire function to run slower. That's why array_keys is faster than array_unique function.

Articles you may be interested in:

Judge whether the same value exists in the array under php array_unique
php json_encode after array_unique needs attention
php array array_unique() of function sequence - remove duplicate element values in the array
array_keys() of php array function sequence - get the array key name
PHP get the position of an element in the array and array_keys function Application

Statement of this Website

The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn

Hot AI Tools

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress images for free

Clothoff.io

AI clothes remover

AI Hentai Generator

Generate AI Hentai for free.

Hot Article

R.E.P.O. Energy Crystals Explained and What They Do (Yellow Crystal)

4 weeks ago By 尊渡假赌尊渡假赌尊渡假赌

R.E.P.O. Best Graphic Settings

4 weeks ago By 尊渡假赌尊渡假赌尊渡假赌

Assassin's Creed Shadows: Seashell Riddle Solution

2 weeks ago By DDD

R.E.P.O. How to Fix Audio if You Can't Hear Anyone

4 weeks ago By 尊渡假赌尊渡假赌尊渡假赌

WWE 2K25: How To Unlock Everything In MyRise

1 months ago By 尊渡假赌尊渡假赌尊渡假赌

Hot Tools

Notepad++7.3.1

Easy-to-use and free code editor

SublimeText3 Chinese version

Chinese version, very easy to use

Zend Studio 13.0.1

Powerful PHP integrated development environment

Dreamweaver CS6

Visual web development tools

SublimeText3 Mac version

God-level code editing software (SublimeText3)

Hot Topics

Where is the login entrance for gmail email?

7509

CakePHP Tutorial

1378

What is the format of the account name of steam

win11 activation key permanent

nyt connections hints and answers

Related knowledge

Sort array using Array.Sort function in C# Nov 18, 2023 am 10:37 AM

Title: Example of using the Array.Sort function to sort an array in C# Text: In C#, array is a commonly used data structure, and it is often necessary to sort the array. C# provides the Array class, which has the Sort method to conveniently sort arrays. This article will demonstrate how to use the Array.Sort function in C# to sort an array and provide specific code examples. First, we need to understand the basic usage of the Array.Sort function. Array.So

How to use the array_combine function in PHP to combine two arrays into an associative array Jun 26, 2023 pm 01:41 PM

In PHP, there are many powerful array functions that can make array operations more convenient and faster. When we need to combine two arrays into an associative array, we can use PHP's array_combine function to achieve this operation. This function is actually used to combine the keys of one array as the values of another array into a new associative array. Next, we will explain how to use the array_combine function in PHP to combine two arrays into an associative array. Learn about array_comb

Simple and clear method to use PHP array_merge_recursive() function Jun 27, 2023 pm 01:48 PM

When programming in PHP, we often need to merge arrays. PHP provides the array_merge() function to complete array merging, but when the same key exists in the array, this function will overwrite the original value. In order to solve this problem, PHP also provides an array_merge_recursive() function in the language, which can merge arrays and retain the values of the same keys, making the program design more flexible. array_merge

Tips and FAQs on using unique indexes in MySQL Mar 15, 2024 pm 03:09 PM

Tips and FAQs for using unique indexes in MySQL MySQL is a popular relational database management system. In practical applications, unique indexes (uniqueindex) play a vital role in data table design. A unique index can ensure that the value of a certain column in the table is unique and avoid duplicate data. This article will introduce the usage skills of unique indexes in MySQL and answers to some common questions, and provide specific code examples to help readers better understand. 1.Create

Detailed explanation of PHP array_fill() function usage Jun 27, 2023 am 08:42 AM

In PHP programming, array is a very important data structure that can handle large amounts of data easily. PHP provides many array-related functions, array_fill() is one of them. This article will introduce in detail the usage of the array_fill() function, as well as some tips in practical applications. 1. Overview of the array_fill() function The function of the array_fill() function is to create an array of a specified length and composed of the same values. Specifically, the syntax of this function is

Introduction to how to use the PHP array_change_key_case() function Jun 27, 2023 am 10:43 AM

In PHP programming, array is a frequently used data type. There are also quite a few array operation functions, including the array_change_key_case() function. This function can convert the case of key names in the array to facilitate our data processing. This article will introduce how to use the array_change_key_case() function in PHP. 1. Function syntax and parameters array_change_ke

How to use the Array module in Python May 01, 2023 am 09:13 AM

The array module in Python is a predefined array, so it takes up much less space in memory than a standard list, and can also perform fast element-level operations such as adding, deleting, indexing, and slicing. In addition, all elements in the array are of the same type, so you can use the efficient numerical operation functions provided by the array, such as calculating the average, maximum, and minimum values. In addition, the array module also supports writing and reading array objects directly into binary files, which makes it more efficient when processing large amounts of numerical data. Therefore, if you need to process a large amount of homogeneous data, you may consider using Python's array module to optimize the execution efficiency of your code. To use the array module, you first need to

Solution to ArrayStoreException exception in Java Jun 25, 2023 am 08:05 AM

In Java development, we often use arrays to store a series of data because of the convenience and performance advantages of arrays. However, in the process of using arrays, some exceptions will occur, and one of the common exceptions is ArrayStoreException. This exception is thrown when we store incompatible data types in the array. This article will introduce what an ArrayStoreException is, why it occurs, and how to solve it. 1. Arr

See all articles