In-depth study of keyword matching project (1) - Introduction of filters
When you start reading this article, please first understand the step-by-step instructions on how to do a keyword matching project (search engine)----Day 1~ The step-by-step guide on how to do a keyword matching project (search engine)----Twenty Two days
Foreword
Continuing from the previous chapter, after Xiao Shuai Shuai presented the completed code and test results, Boss Yu was asked to go online without saying anything. The reason is very simple: everyone wants to contribute, and benefits are indispensable. Yes, we will upgrade the things we encounter later. During that meal, Xiao Shuai Shuai was very happy and satisfied. Of course, Xiao Ding Ding, Boss Yu, and Director Wang were also with him.
Cause of the problem
With the operation of the project, the application of collecting keywords is becoming more and more widespread. Suddenly one day, Xiao Ding Ding came to find Xiao Shuai Shuai. Of course, Xiao Shuai Shuai was very happy, but after hearing the conversation that followed, he was greatly disappointed.
"Xiao Shuai Shuai, look, why do these words have &,?,%, there are a lot of messy words, these words are not suitable for use, please help me how to remove them." Xiao Ding Ding's eyes are also enough It was so powerful that it made Xiao Shuai Shuai tremble.
“Sister, okay, I’ll go take a look first. This matter will be taken care of by me. I’ll let you know when it’s resolved.”
了 Xiaoshuai went back to check and found that there were a lot of such weird words in the database. Xiao Shuai was panicked and had to find Yu Boss again.“There are many words like &,?,% in the dictionary keywords now, and according to the matching algorithm, these words are all available, what should we do?”.
Boss Yu listened to Xiao Shuaishuai’s explanation and came up with a plan. We control these keywords. Since these keywords are not used in the business anyway, we don’t want them to be included in the database. This can reduce the number of keywords in the database. quantity, and can ensure high availability of keywords.
Xiao Shuai Shuai is different from the boss, and he is beyond his reach.
Solution
Added a function to control the entry of etymology into the database, the admission that conforms to the rules, and the discard that does not conform to the rules - the function of the filter.
Design plan
Code planThe upgraded version is decisive and colorful.
Class diagram:
1. The codes of InputSource, FileSource and TaobaoHotsSource remain unchanged.
2. New filter code is added to the Source code, as follows:
<span> 1</span> <span>class</span><span> Source { </span><span> 2</span> <span> 3</span> <span>public</span> <span>$keywords</span><span>; </span><span> 4</span> <span> 5</span> <span>public</span> <span>function</span><span> run() { </span><span> 6</span> <span> 7</span> <span>foreach</span> (<span>$this</span>->keywords <span>as</span> <span>$word</span><span>) { </span><span> 8</span> <span>#</span><span> code...</span> <span> 9</span> <span>if</span>(Filter::is(<span>$word</span>)) <span>continue</span><span>; </span><span>10</span> <span>11</span> <span>$keyword</span> = <span>new</span><span> Keyword(); </span><span>12</span> <span>$keyword</span>->word = <span>$word</span><span>; </span><span>13</span> <span>$keyword</span>-><span>save(); </span><span>14</span> <span> } </span><span>15</span> <span> } </span><span>16</span> <span>17</span> }
3. Added Filter code:
<span> 1</span> <?<span>php </span><span> 2</span> <span> 3</span> <span>#</span><span>@Filename: filter/Filter.php</span> <span> 4</span> <span>#</span><span>@Author: oShine</span> <span> 5</span> <span> 6</span> <span>class</span><span> Filter { </span><span> 7</span> <span> 8</span> <span>private</span> <span>static</span> <span>$filterWords</span> = <span>array</span>("*","?","%",".","&"<span>); </span><span> 9</span> <span>10</span> <span>public</span> <span>static</span> <span>function</span> is(<span>$word</span><span>){ </span><span>11</span> <span>$pattern</span> = "/(".<span>implode</span>("|", self::<span>$filterWords</span>).")/"<span>; </span><span>12</span> <span>return</span> <span>preg_match</span>(pattern, <span>$word</span><span>); </span><span>13</span> <span> } </span><span>14</span> <span>15</span> }
Summary
Good code structure, adding and removing functions is very simple, and work efficiency is improved. Xiao Shuai Shuai fell in love with this job deeply.
The above introduces the in-depth study of the keyword matching project - the introduction of filters, including aspects of it. I hope it will be helpful to friends who are interested in PHP tutorials.