A detailed introduction to the similar_text and similarity levenshtein functions for calculating string similarity in PHP. Below we will introduce in detail the introduction of string similarity.
similar_text — Calculate the similarity of two strings
int similar_text ( string $first , string $second [, float &$percent ] )
$first is required. Specifies the first string to compare.
$second is required. Specifies the second string to be compared.
$percent optional. Specifies the variable name used to store percent similarity.
The similarity between two strings is calculated according to the description of Oliver [1993]. Note that this implementation does not use the stack in Oliver's virtual code, but does make recursive calls, which may make the entire process slower or faster. Also note that the complexity of this algorithm is O(N**3), where N is the length of the longest string.
For example, we want to find the similarity between the string abcdefg and the string aeg:
The code is as follows | Copy code | ||||
$second = "aeg";
echo similar_text($first, $second); The result output is 3. If you want to display it in percentage, you can use its third parameter, as follows: $second = "aeg"; similar_text($first, $second, $percent); echo $percent;
|
Instructions for use
First read the description of levenshtein() function in the manual:
The levenshtein() function returns the Levenshtein distance between two strings.
Levenshtein distance, also known as edit distance, refers to the minimum number of edit operations required between two strings to convert one into the other. Permitted editing operations include replacing one character with another, inserting a character, and deleting a character.
For example, convert kitten to sitting:
sitten (k→s)
sittin (e→i)
The sitting (→g) levenshtein() function gives equal weight to each operation (replacement, insertion, and deletion). However, you can define the cost of each operation by setting the optional insert, replace, and delete parameters.
Syntax:
Parameter Description
•string1 Required. The first string to compare.
代码如下 | 复制代码 |
echo levenshtein("Hello World","ello World"); |
The code is as follows | Copy code |
echo levenshtein("Hello World","ello World"); echo " "; echo levenshtein("Hello World","ello World",10,20,30); ?> |
Output: 1 30