Strings - Ranking algorithm for short-text matching in PHP

Jun 06, 2016 08:41 PM
php string

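For simple short-text search matching in PHP, I use the longest common subsequence (LCS). But when a keyword matches two texts with the same similarity, how do I return the text in which the keyword appears earlier first? For example, the keyword "无" matches both "无双" and "虚无"; how do I put "无双" ahead of "虚无" in the results? And what about "无双" matching "无小明的双" and "小明无的双"?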

<code><?php
$names = array(
  '真三国无双',
  '无双剑姬',
  '虚无',
  '一时无两',
  '南无阿弥陀佛',
  '崖山之后无中国',
);

// Split a string into individual characters
function split_name($name) {
  preg_match_all("/./u", $name, $arr);
  return $arr[0];
}

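// Longest common subsequence (byte-based, since it uses strlen and byte offsets)
function LCS($str_1, $str_2) {
  $len_1 = strlen($str_1);
  $len_2 = strlen($str_2);
  $len = $len_1 > $len_2 ? $len_1 : $len_2;

  // Row 0 and column 0 stand for an empty prefix, so they are all zeros
  $dp = array();
  for ($i = 0; $i <= $len; $i++) {
    $dp[$i][0] = 0;
    $dp[0][$i] = 0;
  }

  for ($i = 1; $i <= $len_1; $i++) {
    for ($j = 1; $j <= $len_2; $j++) {
      if ($str_1[$i - 1] == $str_2[$j - 1]) {
        $dp[$i][$j] = $dp[$i - 1][$j - 1] + 1;
      } else {
        $dp[$i][$j] = $dp[$i - 1][$j] > $dp[$i][$j - 1] ? $dp[$i - 1][$j] : $dp[$i][$j - 1];
      }
    }
  }

  return $dp[$len_1][$len_2];
}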

function search($name) {
  global $names;

  $sort_list = array();
  if (mb_strlen($name, 'utf-8') != strlen($name)) { // true when the keyword contains multibyte (non-ASCII) characters
    $arr_1 = array_unique(split_name($name));
    foreach ($names as $value) {
      $arr_2 = array_unique(split_name($value));
      $similarity = count($arr_2) - count(array_diff($arr_2, $arr_1));
      $sort_list[$value] = $similarity;
    }
  } else {
    foreach ($names as $value) {
      $similarity = LCS($name, $value);
      $sort_list[$value] = $similarity;
    }
  }
  arsort($sort_list);

  return $sort_list;
}

header('content-type:text/html;charset=utf-8');
print_r(search('无'));
</code>

Replies:

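So you just want results where the matched characters appear earlier to rank higher? Then add up the positions of all matched characters: the smaller the sum, the higher the rank. Not sure whether I've understood you correctly... Run the code online: http://3v4l.org/K0X7m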

<code><?php
/** Set up the candidate strings and the search keywords **/
$names = array(
  '真三国无双',
  '无双剑姬',
  '虚无',
  '一时无两',
  '南无阿弥陀佛',
  '崖山之后无中国',
);
$search = array("无","双");

/** Split each string into characters **/
$res = array();
foreach($names as $name) {
    preg_match_all("/./u", $name, $match);
    $res[$name]['single'] = $match[0];
}

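/** Walk each string and record the positions of matched characters; drop strings that do not contain every keyword; for the rest, sum the positions (the smaller the sum, the higher the rank) **/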
foreach($res as $name => $v) {
    $pos = array();
    foreach($v['single'] as $k => $s) {
        if(in_array($s, $search)) $pos[$s][] = $k;
    }
    if(count($pos) != count($search)) {
        unset($res[$name]);
    } else {
        $seq = 0;
        array_walk_recursive($pos, function($i) use(&$seq) {
            $seq += $i;
        });
        $res[$name] = $seq;
    }
}

asort($res); // sort by position sum (the value), ascending; ksort would order by name instead
$res = array_keys($res);

var_dump($res);
</code>
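
To keep the similarity ordering from the question and use position only to break ties, one option is a two-level comparison. The sketch below is not the asker's LCS code: `rank_key` and `ranked_search` are hypothetical helpers, similarity is assumed to be the number of distinct keyword characters found in the text, and position is assumed to be the sum of each character's first occurrence. It needs the mbstring extension, which the question's code already relies on.

```php
<?php
// Hypothetical helper (not from the posts above): returns
// array(number of distinct keyword characters found, sum of their first positions).
function rank_key($keyword, $text) {
    $hits = 0;
    $posSum = 0;
    $chars = array_unique(preg_split('//u', $keyword, -1, PREG_SPLIT_NO_EMPTY));
    foreach ($chars as $ch) {
        $pos = mb_strpos($text, $ch, 0, 'UTF-8');
        if ($pos !== false) {
            $hits++;
            $posSum += $pos;
        }
    }
    return array($hits, $posSum);
}

// Hypothetical helper: more hits first; on equal hits, smaller position sum first.
function ranked_search($keyword, array $names) {
    $keys = array();
    foreach ($names as $name) {
        $keys[$name] = rank_key($keyword, $name);
    }
    usort($names, function ($a, $b) use ($keys) {
        if ($keys[$a][0] != $keys[$b][0]) {
            return $keys[$b][0] - $keys[$a][0]; // similarity, descending
        }
        if ($keys[$a][1] != $keys[$b][1]) {
            return $keys[$a][1] - $keys[$b][1]; // position sum, ascending
        }
        return strcmp($a, $b); // deterministic order for full ties
    });
    return $names;
}

print_r(ranked_search('无', array('虚无', '无双')));             // 无双, then 虚无
print_r(ranked_search('无双', array('小明无的双', '无小明的双'))); // 无小明的双, then 小明无的双
```

The same comparator shape should work with any other similarity score, such as the LCS length from the question, as long as the score and the position sum are computed once per name before sorting.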