
Share the latest 28 PHP interview questions in 2023 (with answers)

青灯夜游
Release: 2023-04-10 22:10:02

This article compiles and shares 28 PHP interview questions (with answers) to help you review the fundamentals. It should be a useful reference for anyone preparing for interviews; I hope it helps.


Related recommendations: 2023 PHP interview questions summary (collection)

After the New Year I plan to look for new job opportunities, but I found that my grasp of many interview fundamentals was not deep enough. To push myself to keep improving, I have recently been studying and summarizing relevant knowledge from forums and search engines. Some of the questions and answers below were shared by more experienced developers, and some are questions I ran into in recent interviews. I organized them based on my own understanding and those shares, and I am posting them in the hope that they help other friends. I also welcome corrections of anything I have misunderstood, and I will keep updating this list.

1. The underlying implementation principle of PHP arrays

1. The underlying implementation is a hash table combined with a doubly linked list (the linked list resolves hash conflicts)

  • Hash table: the mapping function computes a hash value (Bucket->h) from each key, which indexes directly to the corresponding Bucket

  • The hash table keeps a pointer to the current loop position, which is why foreach is faster than for

  • Bucket: stores the key and value of an array element, as well as the computed hash value h

2. How order is preserved

  • 1. A mapping table the same size as the storage array is added between the hash function and the element array (Bucket).

  • 2. The mapping table stores each element's subscript in the actual storage array.

  • 3. Elements are inserted into the actual storage array in insertion order, and the mapping table points at where they live.

  • 4. The mapping table is only a conceptual model; there is no separate table in practice. Instead, when the Bucket memory is allocated at initialization, the same number of uint32_t-sized slots are allocated in front, and arData is offset to point at where the element array is actually stored.
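A quick way to see this ordering in practice: keys come back in insertion order, not in hash order. A minimal illustration:

    <?php
    $arr = [];
    $arr[100] = 'a';
    $arr[5]   = 'b';
    $arr['x'] = 'c';

    foreach ($arr as $key => $value) {
        echo "$key => $value\n";
    }
    // Output preserves insertion order: 100 => a, 5 => b, x => c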

3. Resolving hash collisions (PHP uses the linked list method):

  • 1. Linked list method: when different keys map to the same slot, a linked list stores the colliding entries (the list is traversed to match the key)

  • 2. Open addressing method: when a key maps to a slot that already holds data, probing continues to other slots until a free one is found (this occupies other keys' slots, makes further collisions more likely, and degrades performance)

4. Basic knowledge

  • Linked lists: used to build queues, stacks, doubly linked lists, etc.

  • Singly linked list: each element holds a pointer to the next element

  • Doubly linked list: each element holds a pointer to the previous element and a pointer to the next element

2. Time complexity and space complexity of bubble sort

1. Code implementation

    <?php
    $arr = [2, 4, 1, 5, 3, 6];
    $n = count($arr);

    for ($i = 0; $i < $n - 1; $i++) {
        $swapped = false;
        // Each pass compares adjacent elements; after a pass the smallest remaining
        // value has "bubbled" to the end, so the array ends up in descending order.
        for ($j = 0; $j < $n - $i - 1; $j++) {
            if ($arr[$j] < $arr[$j + 1]) {
                $temp = $arr[$j];
                $arr[$j] = $arr[$j + 1];
                $arr[$j + 1] = $temp;
                $swapped = true;
            }
        }
        if (!$swapped) {
            break; // no swaps in this pass: already sorted (best case O(n))
        }
    }
    // result: [6, 5, 4, 3, 2, 1]

2. How it works

First pass: compare adjacent elements across the array; whenever the left element is smaller, swap the pair, so the largest element bubbles into first place.

Second pass: repeat over the remaining elements (the largest element is already in place and no longer needs comparing), so the second-largest element settles into place.

  • ... and so on, producing an array sorted from largest to smallest

  • Average time complexity: O(n²)

  • Best time complexity: O(n) — requires the extra check shown above: if a full pass makes no swap, break out of the loop

  • Space complexity: O(1) — only the temporary variable used when swapping elements

  • Best space complexity: O(1) — already sorted, no swaps needed

3. Time complexity and space complexity

Time complexity: the asymptotic time complexity of the whole process; it estimates how the algorithm uses the processor. It describes the growth trend of the algorithm's running time, not the concrete time used (machines differ in performance), so it is a general way to compare efficiency.

Notation: Big O notation

Common complexity classes:

  • Constant O(1)

  • Linear O(n)

  • Quadratic O(n²)

  • Cubic O(n³)

  • K-th power O(n^k)

  • Exponential O(2^n)

  • Logarithmic O(log n)

  • Linearithmic O(n log n)

Kinds of time complexity:

  • Best-case time complexity

  • Worst-case time complexity

  • Average time complexity

  • Amortized time complexity

Space complexity: the asymptotic space complexity of the whole process; it estimates how the algorithm uses memory (again a growth trend, not the actual space occupied, same as above).

    Reference:

    This article talks about the time complexity and space complexity of the algorithm

3. The seven layers of the network protocol model

    Application layer, presentation layer, session layer, transport layer, network layer, (data) link layer, physical layer

Memory aid (based on the Chinese names of the layers, 应用 表示 会话 传输 网络 数据链路 物理):

Take the first character of each layer's Chinese name: 应 表 会 传 网 数 物.

The top four read forward as 应表会传 (application, presentation, session, transport); 应 (application) recurs often and is easy to remember.

The bottom three read backward sound like 物联网 ("Internet of Things"): physical, data link, network.

4. Characteristics and differences of TCP and UDP

    1. They all belong to the transport layer protocol

    2.TCP

    • Connection-oriented, so it can only be one-to-one

    • Oriented to byte stream transmission

    • The data is reliable and will not be lost

    • Full-duplex communication

3. UDP (roughly the opposite of TCP's characteristics)

    • Connectionless; supports one-to-one, one-to-many, and many-to-many

    • Datagram (message)-oriented transmission

    • Small header overhead; delivery is not guaranteed to be reliable, but it is faster

    5. TCP’s three-way handshake and four-way wave

    1. Three-way handshake:

    • 1) First: the client sends SYN = 1, seq = client_isn

      What it establishes:

      Client: nothing yet

      Server: confirms that its own receiving works and that the client's sending works

    • 2) Second: the server sends SYN = 1, seq = server_isn, ack = client_isn + 1

      What it establishes:

      Client: confirms that its own sending and receiving work, and that the server's receiving and sending work

      Server: has confirmed its own receiving and the client's sending (at this point the server still cannot confirm whether the client can receive)

    • 3) Third: the client sends SYN = 0, ack = server_isn + 1, seq = client_isn + 1

      What it establishes: both sides have confirmed that the other can send and receive, and the connection is established

2. Four-way wave (connection teardown)

    • 1) First: the client sends FIN

      Purpose: tells the server it has no more data to send (but can still receive data)

    • 2) Second: the server sends ACK

      Purpose: tells the client the request was received. The server may still have data to send, so on receipt the client enters the FIN_WAIT state and waits for the server to finish sending before the server sends its own FIN

    • 3) Third: the server sends FIN

      Purpose: the server tells the client it has finished sending and the connection can be closed

    • 4) Fourth: the client sends ACK

      Purpose: after receiving the FIN, the client cannot be sure the server knows to close, so it sends an ACK, enters TIME_WAIT, and waits for 2MSL. If no further reply arrives, the server has closed, and the client closes the connection too.

    Note:

    • Receiving the other side's FIN only means the other side will send no more data; it can still receive data

    • The final 2MSL wait is needed because the network is unreliable: if the server never receives the last ACK, it resends the FIN and waits for the client to send the ACK again before closing. (This is why the client cannot close the connection immediately after sending the last ACK.)

    6. HTTP status code

    1. Status code classification

    • 1xx: Informational — the server received the request and the requester should continue

    • 2xx: Success

    • 3xx: Redirection

    • 4xx: Client error

    • 5xx: Server error

    2. Common status codes

    • 200: request succeeded

    • 301: permanent redirect

    • 302: temporary redirect (moved temporarily)

    • 400 Bad Request: the client request has a syntax error

    • 401 Unauthorized: the client is not authorized

    • 403 Forbidden: the server refuses the client's request

    • 404 Not Found: the resource requested by the client does not exist

    • 500 Internal Server Error: internal server error

    • 502 Bad Gateway: a server working as a gateway or proxy received an invalid response from the upstream server while handling the request

    • 503 Service Unavailable: overload or system maintenance

    • 504 Gateway Timeout: gateway timeout

    3. Reasons and solutions for 502

Cause: nginx forwards the request to the gateway (php-fpm), which fails to handle it or returns an invalid response

    1) The fastcgi buffer is set too small

    fastcgi_buffers 8 16k;

    fastcgi_buffer_size 32k;

    2) The number of php-cgi processes is set too few

    Check the number of FastCgi processes:

    netstat -anpo | grep "php-cgi"| wc -l

Parameter for the maximum number of child processes:

    max_children

As a rough rule, size the number of child processes assuming about 20M of memory per process

3) max_requests set too small or too large (memory overflow or frequent restarts)

The parameter specifies the maximum number of requests each child process handles; a child is restarted once it reaches the limit.

Setting it too small can cause children to restart frequently:

php-fpm distributes requests across its children, so under heavy traffic every child reaches the limit at roughly the same time. If the value is too small, many children may be shut down at once, nginx cannot forward requests to php-fpm, CPU utilization drops, and the load climbs.

Setting it too large may allow memory leaks to accumulate

    4) PHP execution time exceeds nginx waiting time

    fastcgi_connect_timeout

    fastcgi_send_timeout

    fastcgi_read_timeout

    5) fastcgi execution time

    max_execution_time
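For orientation, a hedged sketch of the pool settings named above as they appear in a php-fpm pool file (max_children and max_requests are the pm.* directives); the numbers are illustrative, not recommendations:

    ; illustrative php-fpm pool settings (values are examples only)
    pm = dynamic
    pm.max_children = 50          ; max worker processes (size against roughly 20M per process)
    pm.start_servers = 10
    pm.min_spare_servers = 5
    pm.max_spare_servers = 20
    pm.max_requests = 1000        ; restart a worker after this many requests (guards against leaks)
    request_terminate_timeout = 30s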

    Reference:


Learn more about how to optimize php-fpm and nginx configuration parameters

    What should I do if nginx reports error 502? Solution sharing

    7. The difference between http and HTTPS

    1. Port: http 80; https: 443

2. http is stateless; https is http + SSL, a protocol that supports encrypted transmission

    3. http plain text transmission, https encrypted transmission

4. http is faster: establishing a connection needs only the three packets of the TCP handshake, while https needs 12 packets (3 TCP packets plus 9 SSL handshake packets)

    8. Redis distributed locks and issues

    1. Implementation :

    Locking: setnx

    Unlocking: del

    Lock timeout: expire

    2. Possible problems

    • 1) setnx and expire are not atomic (the process may crash after locking but before the timeout is set)

      Solution:

      Redis 2.6.12 and above adds optional parameters to the SET command; in pseudo-code: set(key, 1, 30, NX), which replaces setnx

    • 2) A lock that times out gets deleted by another process by mistake (process A runs past the timeout, so its lock is released; process B then acquires the lock and starts handling a request; when process A finishes, it deletes process B's lock by mistake)

      Solution: only delete your own process's lock (use a Lua script so that a process cannot accidentally delete another process's lock after its own has expired) — a sketch of the lock/unlock flow follows this list

    • 3) Concurrency: process A's execution exceeds the timeout, so the lock is released and process B acquires it while A is still running.

      Solution: start a daemon (watchdog) process that keeps extending the expiration of the current process's lock while it is still working.

    • 4) Single-instance availability

      After a single machine crashes, no client can obtain the lock

      Solution:

      Master-slave replication cannot completely solve this, because replication is asynchronous
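A minimal sketch of the lock/unlock flow described above, assuming the phpredis extension; the key name and timeout are hypothetical:

    <?php
    $redis = new Redis();
    $redis->connect('127.0.0.1', 6379);

    $lockKey   = 'lock:order:1001';          // hypothetical lock key
    $requestId = bin2hex(random_bytes(16));  // unique value identifying this process/request

    // Acquire: SET key value NX EX 30 — atomic "set if not exists" plus a 30s timeout in one call
    $acquired = $redis->set($lockKey, $requestId, ['nx', 'ex' => 30]);

    if ($acquired) {
        try {
            // ... critical section ...
        } finally {
            // Release: only delete the lock if it still holds our requestId
            // (the Lua script keeps the check and the delete atomic)
            $lua = 'if redis.call("get", KEYS[1]) == ARGV[1] then return redis.call("del", KEYS[1]) else return 0 end';
            $redis->eval($lua, [$lockKey, $requestId], 1);
        }
    }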

    Reference:


What should you pay attention to when implementing distributed locks with Redis? [Summary of notes]

    Take you in-depth understanding of distributed locks in Redis

    9. Why is redis single-threaded? Why fast?

    Recommended reading: https://www.php.cn/redis/475918.html

10. Redis data types and application scenarios

1. String:

Ordinary key/value storage

2. Hash:

Hashmap: a collection of key-value pairs; good for storing object information

3. List:

Doubly linked list: message queues

4. Set:

A hashMap whose values are always null: an unordered, non-repeating collection; good for intersections, unions, differences, and de-duplication

5. Zset:

An ordered, non-repeating collection: a hashMap (for de-duplication) plus a skiplist (to keep order); good for leaderboards
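A small leaderboard sketch for the zset case, assuming the phpredis extension; key and member names are hypothetical:

    <?php
    $redis = new Redis();
    $redis->connect('127.0.0.1', 6379);

    $redis->zAdd('leaderboard', 350, 'alice');    // score, member
    $redis->zAdd('leaderboard', 420, 'bob');
    $redis->zIncrBy('leaderboard', 30, 'alice');  // alice is now 380

    // Top 10 players, highest score first, with scores
    $top = $redis->zRevRange('leaderboard', 0, 9, true);
    // e.g. ['bob' => 420.0, 'alice' => 380.0]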

    Reference:

    The 5 data types and application scenarios of Redis

11. How Redis implements persistence: methods, principles, and characteristics

1. RDB persistence (snapshot): writes a point-in-time snapshot of the in-memory data set to disk (dump.rdb) at the configured interval

1) A child process writes the snapshot content, and the new file then replaces the old one

2) The whole Redis database is contained in a single backup file

3) Maximizes performance: only a forked child process does the persistence work, reducing disk IO in the main process

4) A crash before the next snapshot may lose data

2. AOF persistence: records every write and delete operation received by the server, in log form

1) Each write command is appended to the file appendonly.aof using the write function

2) The persistence file keeps growing and contains a lot of redundant log entries (incrementing a value from 0 to 100 produces 100 log records)

3) Different fsync policies can be configured:

      • appendfsync everysec: fsync once per second; at most 1s of data can be lost (default)
      • appendfsync always: fsync on every change
      • appendfsync no: do not fsync explicitly (leave it to the OS)

4) The AOF file is rewritten when it grows too large, to compress its size:

      • A child process is forked and the latest state of the main process's in-memory data is written to a temporary AOF file (similar to an RDB snapshot)
      • Changes received by the main process are written to memory and also appended to the old AOF file (so data stays complete even if the rewrite fails)
      • After the child finishes rewriting, the new changes accumulated in memory are appended to the temporary AOF file
      • The parent process replaces the old AOF file with the temporary one and renames it; subsequent commands are written to the new file
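For orientation, a hedged redis.conf excerpt showing where these behaviours are configured; the values are illustrative only:

    # RDB: snapshot if >= 1 key changed within 900s, or >= 10 keys within 300s (illustrative)
    save 900 1
    save 300 10
    # AOF
    appendonly yes
    appendfsync everysec
    # rewrite the AOF once it doubles in size and is at least 64mb
    auto-aof-rewrite-percentage 100
    auto-aof-rewrite-min-size 64mb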
    Reference:


    Detailed explanation of the persistence principle of Redis in-depth learning

    A brief analysis of RDB and AOF persistence, what are the advantages and disadvantages? How to choose?

12. Flash-sale (seckill) design: process and difficulties

1. Static caching of pages

2. nginx load balancing

Three methods: DNS round robin, IP load balancing, CDN

3. Rate limiting

Methods: per-IP limits, per-interface token limits, per-user limits, dynamic tokens in headers (front-end encrypts, back-end decrypts)

4. Distributed lock

Methods:

      • setnx + expire (not atomic; since Redis 2.6 a single SET call guarantees atomicity)
      • Lock released before the work finishes (start a daemon that automatically renews the expiration)
      • An expired lock accidentally deleted by another thread (verify a requestId, or use a Lua script to make the check-and-delete atomic)

5. Caching the data

Methods:

      • Cache penetration: pre-warm the cache, use a Bloom filter or cache empty results
      • Cache avalanche: give cached keys a random expiration time so they do not all expire at once

6. Inventory and orders

      • Deducting inventory
        • Decrementing stock in Redis can go negative under concurrency and break stock return: use a Lua script to keep the check and decrement atomic (see the sketch after this list)
        • After Redis pre-deducts stock, create the order and update the stock change asynchronously via messages
        • Update the database stock with optimistic locking: where stock_num - sell_num > 0
        • Add a message-send record table and a retry mechanism so asynchronous messages are not lost
      • Creating the order
        • The front end opens a websocket connection, or polls, to watch the order status
        • Keep a consumption/verification record to prevent messages from being consumed twice
      • Returning stock
        • After the order is created, send a delayed message to check the order's payment status and decide whether the stock must be returned
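A minimal sketch of the atomic stock decrement mentioned above, assuming the phpredis extension; the key name is hypothetical:

    <?php
    // Lua keeps the "check then decrement" atomic, so stock can never go negative.
    $lua = 'local stock = tonumber(redis.call("get", KEYS[1]) or "0") '
         . 'if stock <= 0 then return -1 end '
         . 'return redis.call("decr", KEYS[1])';

    $redis = new Redis();
    $redis->connect('127.0.0.1', 6379);

    $remaining = $redis->eval($lua, ['stock:sku:42'], 1); // hypothetical stock key
    if ($remaining < 0) {
        // sold out: reject the request
    }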
13. Preventing SQL injection

1. Filter special characters

2. Filter database keywords

3. Validate data types and formats

4. Use prepared statements (precompiled SQL) and bind variables (example below)
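A minimal sketch of point 4 using PDO prepared statements; the DSN, credentials, and table are hypothetical:

    <?php
    $pdo = new PDO('mysql:host=127.0.0.1;dbname=test;charset=utf8mb4', 'user', 'pass'); // hypothetical DSN/credentials
    $pdo->setAttribute(PDO::ATTR_ERRMODE, PDO::ERRMODE_EXCEPTION);

    $email = $_POST['email'] ?? '';   // untrusted user input

    $stmt = $pdo->prepare('SELECT id, name FROM users WHERE email = :email');
    $stmt->execute([':email' => $email]);   // input is sent as data, never spliced into the SQL text
    $user = $stmt->fetch(PDO::FETCH_ASSOC);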

14. Transaction isolation levels

1. How the standard SQL isolation levels are implemented

    • Read uncommitted: other transactions can read data that has not been committed: dirty reads

      • The transaction does not lock the data it currently reads

      • A row-level shared lock is taken at the moment of an update and released at the end of the transaction

    • Read committed: data read between the start and end of a transaction may be inconsistent, because other transactions modified it in the meantime: non-repeatable reads

      • The transaction takes a row-level shared lock on the data it currently reads (while it is being read) and releases it after reading

      • A row-level exclusive lock is taken at the moment of an update and released at the end of the transaction

    • Repeatable read: the data read at the start and at the end of the transaction is consistent; other transactions cannot modify that data during the transaction

      • The transaction takes a row-level shared lock on the data it reads, from the start of the transaction

      • A row-level exclusive lock is taken at the moment of an update and released at the end of the transaction

      • Other transactions may still insert new rows during the transaction, producing phantom reads

    • Serializable

      • A table-level shared lock is taken when the transaction reads data

      • A table-level exclusive lock is taken when the transaction updates data

2. InnoDB's transaction isolation levels and how they are implemented (note: this differs from the above — one is the standard definition of the isolation levels, the other is InnoDB's actual implementation)

1) Basic concepts

    • MVCC: multi-version concurrency control, relying on the undo log and read views

      • Lets reads proceed without locking the data, improving the database's concurrency

      • Write operations still take locks

      • A row has multiple versions; every transaction that updates it produces a new version, and the old data is kept in the undo log

      • When a transaction starts, it can only see the results of transactions that have already committed

    • Current read: reads the latest version

    • Snapshot read: reads a historical version

    • Gap lock: locks a range of the index

      • update ... where id between 10 and 20

      • The whole range is locked whether or not rows exist inside it: insert id = 15 would be blocked

      • Gap locks exist only at the repeatable read isolation level

    • Next-key lock:

      • A record lock on the index record plus a gap lock (on the gap between that index value and the previous one)

      • Open on the left, closed on the right

      • Prevents phantom reads

2) Transaction isolation levels in InnoDB

    • Read uncommitted

      • The transaction does not lock the data it reads; all reads are current reads

      • A row-level shared lock is taken at the moment of an update and released at the end of the transaction

    • Read committed

      • The transaction does not lock the data it reads; reads are snapshot reads

      • A row-level exclusive lock is taken at the moment of an update and released at the end of the transaction

    • Repeatable read

      • The transaction does not lock the data it reads; reads are snapshot reads

      • When a transaction updates data it must take row-level exclusive locks (record lock, gap lock, next-key lock), released at the end of the transaction

      • Gap locks solve the phantom read problem:

        • Consider master-slave replication without gap locks, with processes A and B on the master

        • Process A runs delete ... where id < 6, but does not commit yet

        • Process B runs insert id = 3 and commits

        • Process A then commits

        • The master ends up with a row id = 3, but the binlog (ordered by commit) records the insert first and then the delete, so the slave replays them in that order and ends up without the row: master and slave are inconsistent. With gap locks, B's insert into the locked range would have been blocked until A committed.

      • MVCC snapshots solve the non-repeatable read problem

    • Serializable

      • Table-level shared locks are taken when the transaction reads data (current reads)

      • Table-level exclusive locks are taken when the transaction updates data
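For reference, a small PHP sketch (assuming PDO with the MySQL driver; DSN and credentials are hypothetical) that inspects and changes the session isolation level:

    <?php
    $pdo = new PDO('mysql:host=127.0.0.1;dbname=test;charset=utf8mb4', 'user', 'pass');

    // InnoDB defaults to REPEATABLE-READ (before MySQL 5.7.20 the variable is @@tx_isolation)
    echo $pdo->query('SELECT @@transaction_isolation')->fetchColumn(), PHP_EOL;

    // Switch this session to READ COMMITTED and run a transaction under it
    $pdo->exec('SET SESSION TRANSACTION ISOLATION LEVEL READ COMMITTED');
    $pdo->beginTransaction();
    // plain SELECTs here are snapshot (MVCC) reads; an UPDATE takes row-level exclusive locks
    $pdo->commit();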

        Reference:

        The implementation principle of transaction isolation level in MySQL

        This article explains the principles of transactions and MVCC in MySQL

        How does snapshot work in MVCC?

        What is MVCC and why should gap locks be designed?

15. Index principles

An index is a storage structure that helps the database find data efficiently. It is stored on disk, so using it costs disk IO.

1. Storage engines

    • MyISAM supports table locks; index and data are stored in separate files, which suits cross-server migration

    • InnoDB supports row locks; index and data are stored in the same file

2. Index types

    • Hash index

      • Suited to exact lookups; very efficient

      • Cannot be used for sorting; not suited to range queries

      • On a hash collision the linked list must be traversed (similar to how PHP arrays and Redis zsets handle collisions)

    • B-tree and B+ tree

      • Differences between a B-tree and a B+ tree:

        • In a B+ tree all data lives in the leaf nodes and internal nodes store only keys, so one disk IO fetches more keys

        • In a B-tree both internal and leaf nodes store keys and data; finding data does not require reaching a leaf, since an internal node can return the data directly

        • The B+ tree adds pointers from each leaf node to its neighbour, which makes range queries and ordered traversal easy

    • Clustered index and non-clustered index

      • Concepts

        • Clustered index: index and data are stored together in the same node

        • Non-clustered index: index and data are stored separately; the index stores the address where the data actually lives

      • Details:

        • InnoDB uses a clustered index, and the primary key index is the clustered index by default (if there is no primary key, a non-null unique index is chosen; failing that, an implicit primary key is created). Secondary indexes point at the clustered index key, which is then used to find the actual row

        • MyISAM uses non-clustered indexes; every index needs only one extra lookup to find the data

      • Advantages and caveats of the clustered index

        1. Index and data live together, and data on the same page is cached in the buffer (memory), so reading more data from the same page only needs a memory fetch

        2. After a data update, only the primary key index needs maintaining; secondary indexes are unaffected

        3. A secondary index stores the value of the primary key, so it takes more space and needs an extra lookup back through the clustered index

        4. With random UUIDs the data distribution is uneven, the clustered index may end up scanning far more pages and efficiency drops, so prefer an auto-incrementing primary key id

16. Strategies for sharding tables (and databases)

1. Process

Evaluate capacity and the number of shards -> choose the sharding key according to the business -> choose the sharding rule (hash, modulo, range) -> execute -> plan for future expansion

2. Horizontal split

    • Split rows across multiple tables
    • Every table has the same structure
    • The union of all the shards is the full data set

3. Vertical split

    • Split by columns
    • The table structures differ; the associated rows across the split tables together form one complete record
    • Extension tables: split hot columns from cold columns (e.g. list fields vs. detail fields)
    • When fetching data, avoid joins where possible; combine the results of two queries instead

4. Problems

    • Cross-database joins
      • Global tables: for system tables that many queries need to join
      • Redundancy: duplicate commonly used columns
      • Assembly: combine the results of multiple queries in the application
    • Cross-node paging, sorting, and aggregate functions
    • Transaction consistency
    • Global primary key id
      • UUIDs reduce the efficiency of the clustered index
      • Use a distributed auto-increment id instead
    • Expansion
      • Promoting slaves
        • A slave is promoted to master; the data is already consistent, so only the redundant portion needs deleting
        • Doubling capacity requires doubling the number of slaves
      • Double-write migration
        • Write new data to both the old and the new database
        • Copy the historical data to the new database
        • Use the old database as the source of truth; after verifying data consistency, delete the redundant data
17. How select and update statements execute

1. MySQL components

    • Server layer: connector -> cache -> analyzer (preprocessor) -> optimizer -> executor

    • Engine layer: queries and stores the data

2. Select execution flow

    • The client sends the request and a connection is established

    • The server layer checks the query cache; on a hit it returns directly, otherwise it continues

    • The analyzer parses the SQL statement and the preprocessor validates it (field legality, types, etc.)

    • The optimizer generates an execution plan

    • The executor calls the engine API to fetch the results

    • The query results are returned

3. Update execution flow

    • Basic concepts

      • Buffer pool (cache pool), in memory: the next read of the same data page can be served straight from the buffer pool (InnoDB clustered index pages)

      • When updating data, the buffer pool is updated first, and the disk later

      • Dirty page: the page in the in-memory buffer pool has been updated, but the disk copy has not

      • Flushing dirty pages: a dedicated InnoDB process writes buffer-pool data back to disk, batching many modifications into one write every so often

      • redo log and binlog

        • redo log: InnoDB-specific, a physical log that records the modifications made

        • The redo log is written circularly: its space is fixed, gets used up, and old entries are overwritten

        • binlog: shared by the server layer, a logical log that records the original logic of the statements

        • The binlog is append-only: when a file reaches a certain size it switches to the next file instead of overwriting older logs

        • The redo log is mainly used for crash recovery; the binlog records archived binary logs

        • The redo log can only recover recent data; the binlog, depending on its configuration, can recover much more

      • WAL (write-ahead logging): write the log first

        • Writing the log is sequential IO

        • Writing data pages to disk directly (flushing) is random IO, because the data is scattered across different sectors

        • Sequential IO is more efficient, so writing the change log first allows the flush to be deferred and improves throughput

      • Redo log flushing mechanism and checkpoint

        • The redo log has a fixed size and is written in a circle

        • Think of it as a ring: the checkpoint marks where old entries start being overwritten, and the write point marks the current write position

        • When the write point catches up with the checkpoint, the redo log is full and redo entries must start being synced to disk

    • Execution steps (two-phase commit — a distributed-transaction pattern that keeps the two logs consistent)

      • Parse the update condition and find the rows that need updating (the buffer pool is used here)

      • The server calls the engine API; InnoDB updates the data in memory, writes the redo log, and enters the prepare state

      • The engine notifies the server layer that the data can be committed

      • The server layer writes the binlog and calls the InnoDB interface to issue the commit request

      • The engine layer commits once it receives the request

    • Crash-recovery rules after a failure

      • If the redo log entry is in the commit state, commit it directly

      • If the redo log entry is in the prepare state, check whether the transaction exists in the binlog: if yes, commit it; otherwise roll it back

    • Failure cases without two-phase commit (update table_x set value = 10 where value = 9)

      • Redo log written first, then binlog

        1. The redo log has been written but the binlog has not when the machine crashes

        2. After restart the redo log is complete, so recovery gives value = 10

        3. The binlog has no record of the change, so restoring from the binlog would give value = 9

      • Binlog written first, then redo log

        1. The binlog is written but the redo log is not when the machine crashes

        2. After restart there is no redo log entry, so the value is still 9

        3. When restoring from the binlog, the binlog is complete and the value is updated to 10 — again inconsistent

    • undo log

      • Records the before-image before the update is written to the buffer pool

      • If an error occurs during the update, the data is rolled back to the undo-log state

18. The functions and three formats of the binlog

Functions:

1. Data recovery

2. Master-slave replication

Formats (binary file):

1) statement

    • 1. Records the original text of each SQL statement

    • 2. Deleting a table only needs one logged statement; there is no need to record every changed row, which saves IO, improves performance, and keeps the log small

    • 3. Master-slave inconsistency can occur (stored procedures, functions, etc.)

    • 4. At the RC (read committed) isolation level, because the binlog records in transaction-commit order, replication can become inconsistent; this is solved at the repeatable read level by introducing gap locks

2) row

    • 1. Records the modification of each row; there is no need to record the SQL statement's context

    • 2. Produces a large amount of binlog

    • 3. Deleting a table logs the deletion of every single row

3) mixed

    • 1. A mix of the two formats above

    • 2. MySQL chooses the format per statement:

      • ordinary SQL modifications use statement format

      • operations such as table structure changes, functions, and stored procedures use row format

      • update and delete still record all row changes

19. Master-slave replication: principles, the problems it solves, and read-write separation

1. Problems it solves

    • Data distribution
    • Load balancing
    • Data backup, high availability, avoiding a single point of failure
    • Read-write separation, relieving database pressure
    • Upgrade testing (run a newer MySQL version as the slave)

2. Supported replication types (the three binlog formats)

    • Statement-based replication
    • Row-based replication
    • Mixed replication

3. Principle

1) Basic concepts

    • The slave creates two threads:
      • an I/O thread
      • a SQL thread
    • The master creates a thread:
      • a log dump thread

2) Process (the master node must have the binlog enabled)

    • 1. After start slave is issued on the slave node, it creates an IO thread and connects to the master node
    • 2. Once connected, the master node creates a log dump thread (the master creates one log dump thread per slave)
    • 3. When the binlog changes, the master's log dump thread reads the binlog content and sends it to the slave node
    • 4. While the dump thread reads the binlog it locks the master's binlog, releasing the lock once reading finishes and the content has been sent to the slave
    • 5. The slave's IO thread receives the binlog content and writes it to the local relay log file
    • 6. Master and slave locate their synchronization point by the binlog file's position offset; the slave stores the offset it has received, so after a crash and restart it automatically resumes synchronization from that position
    • 7. The slave's SQL thread reads the local relay log, parses it into concrete operations, and executes them, keeping master and slave consistent
4. Master-slave replication modes

1) Asynchronous mode (the default)

    • 1. Can lead to master-slave inconsistency (when there is replication lag)
    • 2. After receiving a transaction from a client, the master commits it and replies to the client immediately
    • 3. If the master commits a transaction but crashes before the log dump thread has sent it, master and slave become inconsistent
    • 4. There is no waiting on slaves, so performance is the best

2) Fully synchronous mode

    • 1. More reliable, but it hurts the master's response time
    • 2. After receiving a transaction, the master waits for the binlog to be sent to the slaves and for every slave to finish executing the transaction before replying to the client

3) Semi-synchronous mode

    • 1. Somewhat more reliable, with somewhat slower responses on the master

    • 2. After receiving a transaction, the master waits until the binlog has been sent to at least one slave and successfully saved to its local relay log; only then does the master commit the transaction and reply to the client

4) server-id and server-uuid configuration

    • 1. server-id identifies a database instance and prevents SQL statements from looping endlessly in chained, multi-master, or multi-slave topologies

    • 2. The default server-id is 0; the host still writes binary logs but rejects all slave connections

    • 3. For a slave, server-id = 0 means it refuses to connect to any other instance

    • 4. server-id is a global variable; the service must be restarted for a change to take effect

    • 5. When the master and a slave have the same server-id:

      • with the default replicate-same-server-id = 0, the slave skips all the replicated data, so master and slave drift apart

      • with replicate-same-server-id = 1, SQL statements may execute in an infinite loop

    • 6. When two slaves (B and C) share a server-id, the master-slave connections become abnormal and keep flapping:

      • when the master (A) sees the same server-id again, it drops the previous connection and registers the new one

      • so the connections to slaves B and C reconnect over and over

    • 7. The MySQL service automatically creates and stores a server-uuid

      • if the master and slave instances have the same server-uuid, replication errors out and stops; setting replicate-same-server-id = 1 avoids the error, but this is not recommended

5. Read-write separation

1) Implemented in application code, reducing hardware cost

2) Implemented with a middleware proxy

3) Master-slave lag

    • The slave's hardware is weaker than the master's

    • A flood of queries puts heavy pressure on the slave, consuming CPU and slowing synchronization: use one master with multiple slaves

    • Large transactions: the binlog is not written until the transaction finishes, so the slave reads it late

    • DDL on the master (alter, drop, create)

20. Deadlock

1. The four necessary conditions

    • 1. Mutual exclusion: a resource can only be held by one process at a time

    • 2. Hold and wait: a process holds resources while waiting for others (prevention: allocate all resources at once, or none at all)

    • 3. No preemption: resources a process has acquired cannot be forcibly taken away while it waits for others

    • 4. Circular wait

      Putting it together: a resource can only be held by one process; a process that has acquired resources can request more; acquired resources cannot be taken away; and multiple processes wait in a cycle for resources held by each other.

2. Breaking a deadlock

    • 1. Terminate all the involved processes (kill them all)

    • 2. Kill them one at a time (kill one and check whether the deadlock clears)

21. MySQL optimization of deep pagination: limit 100000 (offset), 10 (page_size)

1. Cause

When MySQL runs a paged query it does not skip straight to the offset (100000); it fetches offset + page_size = 100000 + 10 = 100010 rows and then discards the first 100000, which is why deep pages are slow.

2. Optimizations (a sketch follows)

    • Delayed association: use a covering index in a subquery, then join back for the full rows

    • Primary-key threshold method: when the primary key auto-increments, compute the minimum and maximum qualifying primary keys from the conditions (using the covering index)

    • Record where the previous page ended and avoid using OFFSET at all
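A hedged SQL sketch of the first and third options, against a hypothetical orders table with an auto-increment primary key id:

    -- Slow: fetches and discards the first 100000 rows
    SELECT * FROM orders ORDER BY id LIMIT 100000, 10;

    -- Delayed association: the subquery only touches the id index (covering index),
    -- then joins back to fetch the full rows
    SELECT o.*
    FROM orders o
    JOIN (SELECT id FROM orders ORDER BY id LIMIT 100000, 10) t ON o.id = t.id;

    -- Or record the last id of the previous page and avoid OFFSET entirely
    SELECT * FROM orders WHERE id > 123456 ORDER BY id LIMIT 10;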

22. Consistency between the Redis cache and MySQL

Approaches:

1. Update Redis first, then update the database

Scenario: update set value = 10 where value = 9

1) Redis update succeeds: redis value = 10

2) Database update fails: mysql value = 9

3) The data is inconsistent

2. Update the database first, then update Redis

Scenario: process A runs update set value = 10 where value = 9; process B runs update set value = 11 where value = 9

1) Process A updates the database but has not yet written the cache: mysql value = 10; redis value = 9

2) Process B updates the database, commits, and writes the cache: mysql value = 11; redis value = 11

3) Process A finishes its request, commits, and writes the cache: redis value = 10

4) Final state: mysql value = 11; redis value = 10 — inconsistent

3. Delete the cache first, then update the database

Scenario: process A runs update set value = 10 where value = 9; process B queries value

1) Process A deletes the cache, but has not yet modified the data or committed the transaction

2) Process B queries, misses the cache, reads the database, and writes the cache: redis value = 9

3) Process A finishes updating the database: mysql value = 10

4) Final state: mysql value = 10; redis value = 9 — inconsistent

Solutions:

1. Delayed double delete (a sketch follows)

Scenario: process A runs update set value = 10 where value = 9; process B queries value

1) Process A deletes the cache, but has not yet modified the data or committed the transaction

2) Process B queries, misses the cache, reads the database, and writes the cache: redis value = 9

3) Process A finishes updating the database: mysql value = 10

4) Process A sleeps for an estimated delay and then deletes the cache again

5) Final state: mysql value = 10; the Redis key is empty (the next read goes straight to the database and repopulates the cache)

6) The delay exists so that process B's stale write to the cache happens before process A's second delete
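A minimal sketch of the delayed double delete, assuming PDO plus the phpredis extension; the key name, table, and the 500ms delay estimate are hypothetical:

    <?php
    function updateValue(PDO $pdo, Redis $redis, int $id, int $newValue): void
    {
        $key = "value:$id";          // hypothetical cache key
        $redis->del($key);           // 1) delete the cache first

        $stmt = $pdo->prepare('UPDATE t SET value = :v WHERE id = :id');
        $stmt->execute([':v' => $newValue, ':id' => $id]);   // 2) update the database

        usleep(500 * 1000);          // 3) sleep past the estimated read/write-back window (assumed 500ms)
        $redis->del($key);           // 4) delete again so any stale value cached in between is removed
    }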

2. Request serialization

1) Create two queues: an update queue and a query queue

2) When the cache is missing and the database must be queried, push the key onto the update queue

3) If a new request arrives before the query completes and finds the key still in the update queue, push the key onto the query queue and wait; if the key is not there, repeat step 2

4) If the key is already in the query queue, there is no need to enqueue it again

5) When the data update finishes, rpop the update queue, and rpop the query queue to release the waiting query requests

6) A waiting query can poll the cache with while + sleep, with a maximum wait time; if the data is still missing after that, return empty

23. connect and pconnect in Redis (phpredis)

1. connect: the connection is released when the script ends

    • close: releases the connection

2. pconnect (persistent connection): the connection is not released when the script ends; it lives in the php-fpm process, so its lifetime follows the php-fpm process

    • 1. close does not actually release the connection

      • it only means the current php-cgi process can no longer use this handle within the current request

      • subsequent requests handled by the same php-cgi process still reuse the connection, until the php-fpm process ends

    • 2. Reduces the cost of establishing Redis connections

    • 3. Fewer connections need to be created per php-fpm worker

    • 4. Uses more memory, and the number of connections keeps growing

    • 5. A previous request handled by the same php-fpm worker child process (php-cgi) can affect the next request

3. The connection-reuse problem with pconnect

    • If handle A runs select db 1 and handle B runs select db 2, B's select also changes the db seen by A (they share the connection)

    • Solution: create a separate connection instance per db (see the sketch below)
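A small sketch of both points, assuming the phpredis extension; host, dbs, and the persistent_id are hypothetical:

    <?php
    // pconnect keeps one connection per php-fpm worker, so select() leaks across requests.
    $redis = new Redis();
    $redis->pconnect('127.0.0.1', 6379);
    $redis->select(1);   // the NEXT request served by this worker also starts on db 1

    // One way around it (as noted above): a dedicated instance per db, separated by persistent_id
    $redisDb2 = new Redis();
    $redisDb2->pconnect('127.0.0.1', 6379, 0.0, 'db2'); // 'db2' is a hypothetical persistent_id
    $redisDb2->select(2);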

24. Why the Redis zset (sorted set) uses a skiplist

1. Basic concepts

1. A skiplist is a randomized data structure that stores elements in order across hierarchical linked lists (it only works when the elements can be ordered)

2. A skiplist evolves from an ordered linked list into a multi-level linked list

3. Duplicate scores are allowed, so comparisons must check the member as well as the score

4. Each node has a backward pointer of height 1, used for iterating from the tail toward the head

5. Time complexity O(log n), space complexity O(n)

2. Skiplist vs. balanced tree

1) Range query efficiency

    • Range queries are more efficient in a skiplist: after finding the minimum, just walk the level-1 linked list until the value exceeds the maximum

    • In a balanced tree, after finding the minimum, an in-order traversal is needed to find the other nodes within the range

2) Memory usage

    • A skiplist node has on average 1/(1-p) pointers

    • A balanced-tree node has 2 pointers

3) Insertion and deletion

    • A skiplist only has to adjust the pointers of adjacent nodes

    • A change in a balanced tree can force adjustments to a whole subtree

25. Redis expired-key deletion and eviction mechanisms

1. Conventional expiration-deletion strategies

1) Timed deletion

    • A timer deletes the key the moment it expires

    • Memory is freed promptly, but it costs CPU; under heavy concurrency the CPU spent here slows down request processing

    • Memory friendly, CPU unfriendly

2) Lazy deletion

    • Expired keys are left alone; the next time a key is accessed, its expiry is checked and it is deleted if expired

    • Large numbers of expired but never-accessed keys can pile up and exhaust memory

    • Memory unfriendly, CPU friendly

3) Periodic deletion

    • Check every so often and delete the expired keys found

    • How many to check and delete is decided by the algorithm

2. Redis uses lazy deletion combined with periodic deletion

    • It periodically tests a random sample of keys that have an expiration set, deleting the expired ones

    • Each cleanup run is capped at roughly 25% of CPU time; when the limit is hit, the check stops

    • Keys the periodic pass misses and that are never accessed again still sit in memory, so an eviction policy is needed as well

3. Eviction policies (applied when memory is too low to accept new writes)

    • volatile-lru: among keys with an expiration set, evict the least recently used first

    • volatile-ttl: among keys with an expiration set, evict those expiring soonest first

    • volatile-random: among keys with an expiration set, evict at random

    • allkeys-lru: among all keys, evict the least recently used first

    • allkeys-random: evict any key at random

    • no-eviction: never evict; return an error when memory is insufficient

26. Common Redis caching problems and solutions

1. Cache avalanche: a large number of cached keys expire at the same moment, so requests hit the database directly, driving up its memory and CPU usage and possibly taking it down

Solutions:

    • Never expire hotspot data, or spread it across different instances to reduce the impact of a single-machine failure

    • Add a random amount to each cache TTL so a large batch of keys cannot expire at the same time

    • Use a two-level (double) cache: cache A is the primary with a short TTL, cache B is the backup with a long TTL; write both when updating

2. Cache penetration: the data exists in neither the cache nor the database, so under heavy traffic every request goes straight through to the database and can bring it down

Solutions:

    • Bloom filter: a bit vector (bit list) of length m, containing only 0/1 values

      • Several different hash functions each produce an index, and the bits at those positions are set to 1

      • A Bloom filter can answer "possibly in the set" or "definitely not in the set"

      • False positives are possible, but it filters most invalid keys cheaply

      • In the extreme case where the Bloom filter has no free bits left, every query returns true

    • Cache empty results (with a short TTL)

    • Filter obviously invalid parameters at the business layer

3. Cache breakdown: the data exists in the database, but its cache entry suddenly expires while a large number of requests arrive, so the database load spikes and it may go down

Solutions:

    • Never expire hotspot data

    • Mutex lock: whichever process acquires the lock rebuilds the cache, and the lock must be released afterwards whether the rebuild succeeds or fails (a sketch follows)
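A minimal sketch of the mutex approach, assuming the phpredis extension; key names, TTLs, and the retry budget are hypothetical:

    <?php
    function getWithLock(Redis $redis, string $key, callable $loadFromDb)
    {
        for ($attempt = 0; $attempt < 20; $attempt++) {
            $value = $redis->get($key);
            if ($value !== false) {
                return $value;                                   // cache hit
            }
            // Only the process that wins this lock rebuilds the cache entry
            if ($redis->set('lock:' . $key, 1, ['nx', 'ex' => 10])) {
                try {
                    $value = $loadFromDb();                       // single database read
                    $redis->setEx($key, 300 + random_int(0, 60), $value); // random TTL also guards against avalanche
                    return $value;
                } finally {
                    $redis->del('lock:' . $key);                  // always release the lock, success or failure
                }
            }
            usleep(50 * 1000);    // someone else is rebuilding: wait briefly and retry
        }
        return null;              // give up after the retry budget
    }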
27. php-fpm in detail, and its life cycle

1. Basic knowledge

1) The CGI protocol

    • Dynamic-language code files need a corresponding parser before the server can use them
    • The CGI protocol lets the server and the interpreter communicate with each other
    • To parse PHP files, the server needs the PHP interpreter plus the CGI protocol

2) CGI program = php-cgi

    • php-cgi is a CGI program that implements the CGI protocol
    • It is also a PHP interpreter
    • With standard CGI, every request re-parses php.ini, re-initializes the execution environment, and so on, which hurts performance
    • php-cgi must be restarted for php.ini changes to take effect
    • It cannot schedule workers dynamically; only the number of workers configured at startup are available

3) The FastCGI protocol

    • Like CGI, it is a protocol/standard, but it optimizes CGI and is more efficient

    • It exists to improve the performance of CGI programs

    • It adds management of CGI processes

4) FastCGI program = php-fpm

    • php-fpm is a FastCGI program that implements the FastCGI protocol

    • It manages CGI processes the FastCGI way:

      • start a master process that parses the configuration file and initializes the environment

      • start multiple worker child processes

      • when a request arrives, hand it to a worker process to execute

    • It solves smooth restarts after php.ini is modified:

      • process_control_timeout: how long a child process has to honour the master's reuse signal (finish the current request within this time; if it cannot, it is left alone)

      • i.e. how long php-fpm gives the fastcgi processes to respond to a restart signal

      • process_control_timeout = 0 means the setting does not take effect, so a smooth restart cannot be guaranteed

      • setting process_control_timeout too large can block system requests

      • with process_control_timeout = 10, if the code logic needs 11s, a restart may kill the old process partway through execution

      • recommended value: request_terminate_timeout

    • Restart types

      • Graceful restart

      • Forced restart

2. php-fpm life cycle: to be updated

PHP-FPM life cycle: https://www.abelzhou.com/php/php-fpm-lifespan/

Reference:

Talking about the communication mechanism between PHP-FPM and Nginx

A brief analysis of several timeout configurations in the PHP configuration file

Let's talk about nginx smooth restarts and FPM smooth restarts

28. Communication between Nginx and PHP

1. Communication method: fastcgi_pass

1) TCP socket

    • Works across servers; when nginx and php are not on the same machine, this is the only option

    • A connection-oriented protocol, which better guarantees the correctness and completeness of the communication

2) Unix domain socket

    • No network protocol stack, no packing and unpacking, etc.

    • Less overhead than TCP, so it is more efficient than a TCP socket

    • Can be unstable under high concurrency: a sudden surge in connections produces a large backlog, and big packets may simply fail with an error

A hedged nginx example of both variants follows.
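    # Hypothetical nginx location block — the two fastcgi_pass variants discussed above
    # (paths and ports are illustrative)
    location ~ \.php$ {
        include fastcgi_params;
        fastcgi_param SCRIPT_FILENAME $document_root$fastcgi_script_name;
        # TCP socket: required when nginx and php-fpm run on different hosts
        fastcgi_pass 127.0.0.1:9000;
        # Unix domain socket: lower overhead when they share a machine
        # fastcgi_pass unix:/run/php/php-fpm.sock;
    }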

        Reference:

        Let’s talk about the communication mechanism between PHP-FPM and Nginx

        A brief analysis of the communication mechanism between Nginx and php-fpm

29. Web vulnerabilities and problems

1. SQL injection (see question 13)

2. XSS attacks

Recommended reading: [How to prevent XSS attacks?](https://tech.meituan.com/2018/09/27/fe-security.html)

3. CSRF attacks

Recommended reading: [Front-end security series (2): How to prevent CSRF attacks?](https://tech.meituan.com/2018/10/11/fe-security-csrf.html)

4. File upload vulnerabilities

Recommended reading: [A brief analysis of file upload vulnerabilities](https://xz.aliyun.com/t/7365)

5. Cross-domain issues:

1) jsonp

2) cors

3) nginx reverse proxy

Recommended learning: "PHP Video Tutorial"
