Home Backend Development PHP Tutorial Integrate PHP application and Solr search engine

Integrate PHP application and Solr search engine

Mar 24, 2017 pm 01:25 PM
php solr application search engine integrated

Why do you need a search engine? Is a database alone not enough? If you are just creating a small website, a database will suffice. But when you're creating neutral or large-scale applications, search engines are the smarter choice. Of course, small websites can also use Solr to obtain highly relevant search results.

Imagine you are writing a search query program for an e-commerce website. The most straightforward idea is the following database query statement:

SELECT * FROM PRODUCTS
WHERE LOWER(title) like LOWER('%$phrase%')
OR LOWER(description) like LOWER('%$phrase%');
Copy after login

It works fine when querying phrases in titles or descriptions. But the reality is very complicated, for example, Apple iPhone 4G black 16GB (Apple 4G network iPhone black 16GB). When searching for "iPhone 16G", there are no results. You can replace spaces with % to handle this situation.

$phrase = str_replace(' ', '%', $phrase);
Copy after login

What about when querying "iPhone 16GB 4G"? Apparently the word order changed and it doesn't work properly. I'm guessing you'd add another field to hold the word order. So what should I do if I write the wrong word? What about synonyms? Coming up with a good solution for such a search system is challenging.



Designing an exquisite algorithm is not the key to solving this kind of problem. Text search consumes resources. Putting too much pressure on the database side It is never a good idea. The reason is that the database cannot be easily expanded. You cannot simply add an instance like a web server or Memcached. Expanding the database requires some preparation, code modification, configuration, downtime and maintenance time. In short, the cost is Very expensive. The good news is that Solr is designed to solve this type of problem.

Solr is an enterprise-level search platform based on Apache Lucene. It is fast, stable, has good documentation, and of course is easy to expand. Since Solr is a powerful solution, all its features are not listed one by one in this article. This guy is also quite easy to install.


First download the latest version from the official website version. Solr is an application written in the Java language, and you need the Java Runtime environment to run it.

$ cd solr-4.1.0/example/
$ java -jar start.jar
Copy after login

After a few seconds you will see the following information:

2013-03-09 18:47:41.177:INFO:oejs.AbstractConnector:Started SocketConnector@0.0.0.0:8983
Copy after login

Solr has a web interface that works under port 8983. Open the browser to access http: //localhost:8983/solr/.

In the navigation area on the left hand side you will find "collection1". In Solr, Collections are similar to database tables, and you can query data. Click on a collection and select its submenu "query".

The first option is called "Request-Handler (qt)" and has the default value "/select". Request handlers are a set of predefined queries. If you look at the Solr config file, you'll see something like this:

$ vim solr-4.1.0/example/solr/collection1/conf/solrconfig.xml
Copy after login
<requestHandler name="/select" class="solr.SearchHandler">
    <lst name="defaults">
        <str name="echoParams">explicit</str>
       <int name="rows">10</int>
       <str name="df">text</str>
    </lst></requestHandler>
Copy after login

The second parameter is the one we're most interested in. The default value "*:*" means query anything. If you click "execute query", you can get something like the following:

<?xml version="1.0" encoding="UTF-8"?><response>
    <lst name="responseHeader">
        <int name="status">0</int>
        <int name="QTime">1</int>
        <lst name="params">
        <str name="indent">true</str>
        <str name="q">*:*</str>
        <str name="wt">xml</str>
        </lst>
    </lst>
    <result name="response" numFound="0" start="0" /></response>
Copy after login

The index result is empty, but this is not a problem, You need to insert some sample data.

$ cd solr-4.1.0/example/exampledocs/
$ java -jar post.jar monitor.xml
 
SimplePostTool version 1.5
Posting files to base url http://localhost:8983/solr/update using content-type application/xml..
POSTing file monitor.xml1 files indexed.
COMMITting Solr index changes to http://localhost:8983/solr/update..
Copy after login

Now you can return to the query interface and a document will be returned this time.

The data structure of Collection is defined in the schema file.

$ vim solr-4.1.0/example/solr/collection1/conf/schema.xml
Copy after login

This file has a lot of comments, you can easily tell what they do. If you want to modify the scheme file, please do not delete the field named "text" (if there is no good reason), it is associated with other fields and query statements (including select, look, etc.).

$ grep text solr-4.1.0/example/solr/collection1/conf/schema.xml | grep copy
 
<copyField source="cat" dest="text"/>
<copyField source="name" dest="text"/>
<copyField source="manu" dest="text"/>
<copyField source="features" dest="text"/>
<copyField source="includes" dest="text"/>
<copyField source="title" dest="text"/>
<copyField source="author" dest="text"/>
<copyField source="description" dest="text"/>
<copyField source="keywords" dest="text"/>
<copyField source="content" dest="text"/>
<copyField source="content_type" dest="text"/>
<copyField source="resourcename" dest="text"/>
<copyField source="url" dest="text"/>
Copy after login

If you are using a relational database, you don’t want to have duplicate data. Solr is not a database, and most fields are also processed as text fields. This is the default request handler.


Accessing Solr from PHP requires a client. I recommend downloading one from PECL. It's fast, the API is clear, and the documentation is excellent. However, please note that this extension is now version 1.0.2 and does not support Solr 4.x. Solr 3.x and 4.x have slightly different protocols. But don't worry, I have made the modifications and you can download a compatible version from https://github.com/lukaszkujawa/php-pecl-solr. I've been using it for a while and it's reliable. It is slightly different from the official one. There is an additional Solr version parameter in the SolrClient constructor. This patch will be released in the official version, so you don't have to worry about future compatibility.

$ git clone https://github.com/lukaszkujawa/php-pecl-solr.git$ cd php-pecl-solr/
$ phpize
$ whereis php-config
php-config: /usr/bin/php-config /usr/bin/X11/php-config
$ ./configure --with-php-config=/usr/bin/php-config
$ make
$ make install
Copy after login

Add to your php.ini

extension=solr.so
Copy after login

Restart the web server

$ /etc/init.d/apache2 restart
Copy after login

Now you can write php to add content to the index.

<?php
 $options = array (    &#39;hostname&#39; => &#39;127.0.0.1&#39;,
);
 
$client = new SolrClient($options, "4.0"); // 参数4.0针对Solr4.x,其他版本时忽略
 $doc = new SolrInputDocument();
 
$doc->addField(&#39;id&#39;, 100);
$doc->addField(&#39;title&#39;, &#39;Hello Wolrd&#39;);
$doc->addField(&#39;description&#39;, &#39;Example Document&#39;);
$doc->addField(&#39;cat&#39;, &#39;Foo&#39;);
$doc->addField(&#39;cat&#39;, &#39;Bar&#39;);
 
$response = $client->addDocument($doc);
 
$client->commit(); 
/* ------------------------------- */
 $query = new SolrQuery();
 
$query->setQuery(&#39;hello&#39;);
 
$query->addField(&#39;id&#39;)
->addField(&#39;title&#39;)
->addField(&#39;description&#39;)
->addField(&#39;cat&#39;);
 
$queryResponse = $client->query($query);
 
$response = $queryResponse->getResponse();
 
print_r( $response->response->docs );
Copy after login

If you add more than one document, she can handle it well without frequent commits.

Knowing how Solr works is valuable and you can use it in many projects. It has a great feature that allows you to pull all the data you need in one request. Of course, it takes some time to master her, but it's worth the effort. Solr has an active community and complete documentation resources. If you are still worried about using it in your project, please read Solr 3 enterprise search server. It not only allows you to quickly set up search services, but is also the basis for your data mining.

Related articles:

php solr operation classes and demo

Installing php-solr extension

Search solution How to install and configure solr+php?

Statement of this Website
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn

Hot AI Tools

Undresser.AI Undress

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress AI Tool

Undress images for free

Clothoff.io

Clothoff.io

AI clothes remover

AI Hentai Generator

AI Hentai Generator

Generate AI Hentai for free.

Hot Article

R.E.P.O. Energy Crystals Explained and What They Do (Yellow Crystal)
2 weeks ago By 尊渡假赌尊渡假赌尊渡假赌
Hello Kitty Island Adventure: How To Get Giant Seeds
1 months ago By 尊渡假赌尊渡假赌尊渡假赌
Two Point Museum: All Exhibits And Where To Find Them
1 months ago By 尊渡假赌尊渡假赌尊渡假赌

Hot Tools

Notepad++7.3.1

Notepad++7.3.1

Easy-to-use and free code editor

SublimeText3 Chinese version

SublimeText3 Chinese version

Chinese version, very easy to use

Zend Studio 13.0.1

Zend Studio 13.0.1

Powerful PHP integrated development environment

Dreamweaver CS6

Dreamweaver CS6

Visual web development tools

SublimeText3 Mac version

SublimeText3 Mac version

God-level code editing software (SublimeText3)

CakePHP Project Configuration CakePHP Project Configuration Sep 10, 2024 pm 05:25 PM

In this chapter, we will understand the Environment Variables, General Configuration, Database Configuration and Email Configuration in CakePHP.

PHP 8.4 Installation and Upgrade guide for Ubuntu and Debian PHP 8.4 Installation and Upgrade guide for Ubuntu and Debian Dec 24, 2024 pm 04:42 PM

PHP 8.4 brings several new features, security improvements, and performance improvements with healthy amounts of feature deprecations and removals. This guide explains how to install PHP 8.4 or upgrade to PHP 8.4 on Ubuntu, Debian, or their derivati

CakePHP Date and Time CakePHP Date and Time Sep 10, 2024 pm 05:27 PM

To work with date and time in cakephp4, we are going to make use of the available FrozenTime class.

CakePHP File upload CakePHP File upload Sep 10, 2024 pm 05:27 PM

To work on file upload we are going to use the form helper. Here, is an example for file upload.

CakePHP Routing CakePHP Routing Sep 10, 2024 pm 05:25 PM

In this chapter, we are going to learn the following topics related to routing ?

Discuss CakePHP Discuss CakePHP Sep 10, 2024 pm 05:28 PM

CakePHP is an open-source framework for PHP. It is intended to make developing, deploying and maintaining applications much easier. CakePHP is based on a MVC-like architecture that is both powerful and easy to grasp. Models, Views, and Controllers gu

CakePHP Creating Validators CakePHP Creating Validators Sep 10, 2024 pm 05:26 PM

Validator can be created by adding the following two lines in the controller.

How To Set Up Visual Studio Code (VS Code) for PHP Development How To Set Up Visual Studio Code (VS Code) for PHP Development Dec 20, 2024 am 11:31 AM

Visual Studio Code, also known as VS Code, is a free source code editor — or integrated development environment (IDE) — available for all major operating systems. With a large collection of extensions for many programming languages, VS Code can be c

See all articles