Home Backend Development PHP Tutorial Integrate PHP application and Solr search engine

Integrate PHP application and Solr search engine

Mar 24, 2017 pm 01:25 PM
php solr application search engine integrated

Why do you need a search engine? Is a database alone not enough? If you are just creating a small website, a database will suffice. But when you're creating neutral or large-scale applications, search engines are the smarter choice. Of course, small websites can also use Solr to obtain highly relevant search results.

Imagine you are writing a search query program for an e-commerce website. The most straightforward idea is the following database query statement:

SELECT * FROM PRODUCTS
WHERE LOWER(title) like LOWER('%$phrase%')
OR LOWER(description) like LOWER('%$phrase%');
Copy after login

It works fine when querying phrases in titles or descriptions. But the reality is very complicated, for example, Apple iPhone 4G black 16GB (Apple 4G network iPhone black 16GB). When searching for "iPhone 16G", there are no results. You can replace spaces with % to handle this situation.

$phrase = str_replace(' ', '%', $phrase);
Copy after login

What about when querying "iPhone 16GB 4G"? Apparently the word order changed and it doesn't work properly. I'm guessing you'd add another field to hold the word order. So what should I do if I write the wrong word? What about synonyms? Coming up with a good solution for such a search system is challenging.



Designing an exquisite algorithm is not the key to solving this kind of problem. Text search consumes resources. Putting too much pressure on the database side It is never a good idea. The reason is that the database cannot be easily expanded. You cannot simply add an instance like a web server or Memcached. Expanding the database requires some preparation, code modification, configuration, downtime and maintenance time. In short, the cost is Very expensive. The good news is that Solr is designed to solve this type of problem.

Solr is an enterprise-level search platform based on Apache Lucene. It is fast, stable, has good documentation, and of course is easy to expand. Since Solr is a powerful solution, all its features are not listed one by one in this article. This guy is also quite easy to install.


First download the latest version from the official website version. Solr is an application written in the Java language, and you need the Java Runtime environment to run it.

$ cd solr-4.1.0/example/
$ java -jar start.jar
Copy after login

After a few seconds you will see the following information:

2013-03-09 18:47:41.177:INFO:oejs.AbstractConnector:Started SocketConnector@0.0.0.0:8983
Copy after login

Solr has a web interface that works under port 8983. Open the browser to access http: //localhost:8983/solr/.

In the navigation area on the left hand side you will find "collection1". In Solr, Collections are similar to database tables, and you can query data. Click on a collection and select its submenu "query".

The first option is called "Request-Handler (qt)" and has the default value "/select". Request handlers are a set of predefined queries. If you look at the Solr config file, you'll see something like this:

$ vim solr-4.1.0/example/solr/collection1/conf/solrconfig.xml
Copy after login
<requestHandler name="/select" class="solr.SearchHandler">
    <lst name="defaults">
        <str name="echoParams">explicit</str>
       <int name="rows">10</int>
       <str name="df">text</str>
    </lst></requestHandler>
Copy after login

The second parameter is the one we're most interested in. The default value "*:*" means query anything. If you click "execute query", you can get something like the following:

<?xml version="1.0" encoding="UTF-8"?><response>
    <lst name="responseHeader">
        <int name="status">0</int>
        <int name="QTime">1</int>
        <lst name="params">
        <str name="indent">true</str>
        <str name="q">*:*</str>
        <str name="wt">xml</str>
        </lst>
    </lst>
    <result name="response" numFound="0" start="0" /></response>
Copy after login

The index result is empty, but this is not a problem, You need to insert some sample data.

$ cd solr-4.1.0/example/exampledocs/
$ java -jar post.jar monitor.xml
 
SimplePostTool version 1.5
Posting files to base url http://localhost:8983/solr/update using content-type application/xml..
POSTing file monitor.xml1 files indexed.
COMMITting Solr index changes to http://localhost:8983/solr/update..
Copy after login

Now you can return to the query interface and a document will be returned this time.

The data structure of Collection is defined in the schema file.

$ vim solr-4.1.0/example/solr/collection1/conf/schema.xml
Copy after login

This file has a lot of comments, you can easily tell what they do. If you want to modify the scheme file, please do not delete the field named "text" (if there is no good reason), it is associated with other fields and query statements (including select, look, etc.).

$ grep text solr-4.1.0/example/solr/collection1/conf/schema.xml | grep copy
 
<copyField source="cat" dest="text"/>
<copyField source="name" dest="text"/>
<copyField source="manu" dest="text"/>
<copyField source="features" dest="text"/>
<copyField source="includes" dest="text"/>
<copyField source="title" dest="text"/>
<copyField source="author" dest="text"/>
<copyField source="description" dest="text"/>
<copyField source="keywords" dest="text"/>
<copyField source="content" dest="text"/>
<copyField source="content_type" dest="text"/>
<copyField source="resourcename" dest="text"/>
<copyField source="url" dest="text"/>
Copy after login

If you are using a relational database, you don’t want to have duplicate data. Solr is not a database, and most fields are also processed as text fields. This is the default request handler.


Accessing Solr from PHP requires a client. I recommend downloading one from PECL. It's fast, the API is clear, and the documentation is excellent. However, please note that this extension is now version 1.0.2 and does not support Solr 4.x. Solr 3.x and 4.x have slightly different protocols. But don't worry, I have made the modifications and you can download a compatible version from https://github.com/lukaszkujawa/php-pecl-solr. I've been using it for a while and it's reliable. It is slightly different from the official one. There is an additional Solr version parameter in the SolrClient constructor. This patch will be released in the official version, so you don't have to worry about future compatibility.

$ git clone https://github.com/lukaszkujawa/php-pecl-solr.git$ cd php-pecl-solr/
$ phpize
$ whereis php-config
php-config: /usr/bin/php-config /usr/bin/X11/php-config
$ ./configure --with-php-config=/usr/bin/php-config
$ make
$ make install
Copy after login

Add to your php.ini

extension=solr.so
Copy after login

Restart the web server

$ /etc/init.d/apache2 restart
Copy after login

Now you can write php to add content to the index.

<?php
 $options = array (    &#39;hostname&#39; => &#39;127.0.0.1&#39;,
);
 
$client = new SolrClient($options, "4.0"); // 参数4.0针对Solr4.x,其他版本时忽略
 $doc = new SolrInputDocument();
 
$doc->addField(&#39;id&#39;, 100);
$doc->addField(&#39;title&#39;, &#39;Hello Wolrd&#39;);
$doc->addField(&#39;description&#39;, &#39;Example Document&#39;);
$doc->addField(&#39;cat&#39;, &#39;Foo&#39;);
$doc->addField(&#39;cat&#39;, &#39;Bar&#39;);
 
$response = $client->addDocument($doc);
 
$client->commit(); 
/* ------------------------------- */
 $query = new SolrQuery();
 
$query->setQuery(&#39;hello&#39;);
 
$query->addField(&#39;id&#39;)
->addField(&#39;title&#39;)
->addField(&#39;description&#39;)
->addField(&#39;cat&#39;);
 
$queryResponse = $client->query($query);
 
$response = $queryResponse->getResponse();
 
print_r( $response->response->docs );
Copy after login

If you add more than one document, she can handle it well without frequent commits.

Knowing how Solr works is valuable and you can use it in many projects. It has a great feature that allows you to pull all the data you need in one request. Of course, it takes some time to master her, but it's worth the effort. Solr has an active community and complete documentation resources. If you are still worried about using it in your project, please read Solr 3 enterprise search server. It not only allows you to quickly set up search services, but is also the basis for your data mining.

Related articles:

php solr operation classes and demo

Installing php-solr extension

Search solution How to install and configure solr+php?

Statement of this Website
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn

Hot AI Tools

Undresser.AI Undress

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress AI Tool

Undress images for free

Clothoff.io

Clothoff.io

AI clothes remover

Video Face Swap

Video Face Swap

Swap faces in any video effortlessly with our completely free AI face swap tool!

Hot Article

Roblox: Bubble Gum Simulator Infinity - How To Get And Use Royal Keys
3 weeks ago By 尊渡假赌尊渡假赌尊渡假赌
Nordhold: Fusion System, Explained
3 weeks ago By 尊渡假赌尊渡假赌尊渡假赌
Mandragora: Whispers Of The Witch Tree - How To Unlock The Grappling Hook
3 weeks ago By 尊渡假赌尊渡假赌尊渡假赌

Hot Tools

Notepad++7.3.1

Notepad++7.3.1

Easy-to-use and free code editor

SublimeText3 Chinese version

SublimeText3 Chinese version

Chinese version, very easy to use

Zend Studio 13.0.1

Zend Studio 13.0.1

Powerful PHP integrated development environment

Dreamweaver CS6

Dreamweaver CS6

Visual web development tools

SublimeText3 Mac version

SublimeText3 Mac version

God-level code editing software (SublimeText3)

Hot Topics

Java Tutorial
1667
14
PHP Tutorial
1273
29
C# Tutorial
1255
24
PHP: A Key Language for Web Development PHP: A Key Language for Web Development Apr 13, 2025 am 12:08 AM

PHP is a scripting language widely used on the server side, especially suitable for web development. 1.PHP can embed HTML, process HTTP requests and responses, and supports a variety of databases. 2.PHP is used to generate dynamic web content, process form data, access databases, etc., with strong community support and open source resources. 3. PHP is an interpreted language, and the execution process includes lexical analysis, grammatical analysis, compilation and execution. 4.PHP can be combined with MySQL for advanced applications such as user registration systems. 5. When debugging PHP, you can use functions such as error_reporting() and var_dump(). 6. Optimize PHP code to use caching mechanisms, optimize database queries and use built-in functions. 7

PHP and Python: Comparing Two Popular Programming Languages PHP and Python: Comparing Two Popular Programming Languages Apr 14, 2025 am 12:13 AM

PHP and Python each have their own advantages, and choose according to project requirements. 1.PHP is suitable for web development, especially for rapid development and maintenance of websites. 2. Python is suitable for data science, machine learning and artificial intelligence, with concise syntax and suitable for beginners.

PHP vs. Python: Understanding the Differences PHP vs. Python: Understanding the Differences Apr 11, 2025 am 12:15 AM

PHP and Python each have their own advantages, and the choice should be based on project requirements. 1.PHP is suitable for web development, with simple syntax and high execution efficiency. 2. Python is suitable for data science and machine learning, with concise syntax and rich libraries.

PHP in Action: Real-World Examples and Applications PHP in Action: Real-World Examples and Applications Apr 14, 2025 am 12:19 AM

PHP is widely used in e-commerce, content management systems and API development. 1) E-commerce: used for shopping cart function and payment processing. 2) Content management system: used for dynamic content generation and user management. 3) API development: used for RESTful API development and API security. Through performance optimization and best practices, the efficiency and maintainability of PHP applications are improved.

The Enduring Relevance of PHP: Is It Still Alive? The Enduring Relevance of PHP: Is It Still Alive? Apr 14, 2025 am 12:12 AM

PHP is still dynamic and still occupies an important position in the field of modern programming. 1) PHP's simplicity and powerful community support make it widely used in web development; 2) Its flexibility and stability make it outstanding in handling web forms, database operations and file processing; 3) PHP is constantly evolving and optimizing, suitable for beginners and experienced developers.

PHP vs. Other Languages: A Comparison PHP vs. Other Languages: A Comparison Apr 13, 2025 am 12:19 AM

PHP is suitable for web development, especially in rapid development and processing dynamic content, but is not good at data science and enterprise-level applications. Compared with Python, PHP has more advantages in web development, but is not as good as Python in the field of data science; compared with Java, PHP performs worse in enterprise-level applications, but is more flexible in web development; compared with JavaScript, PHP is more concise in back-end development, but is not as good as JavaScript in front-end development.

PHP and Python: Different Paradigms Explained PHP and Python: Different Paradigms Explained Apr 18, 2025 am 12:26 AM

PHP is mainly procedural programming, but also supports object-oriented programming (OOP); Python supports a variety of paradigms, including OOP, functional and procedural programming. PHP is suitable for web development, and Python is suitable for a variety of applications such as data analysis and machine learning.

PHP and Python: Code Examples and Comparison PHP and Python: Code Examples and Comparison Apr 15, 2025 am 12:07 AM

PHP and Python have their own advantages and disadvantages, and the choice depends on project needs and personal preferences. 1.PHP is suitable for rapid development and maintenance of large-scale web applications. 2. Python dominates the field of data science and machine learning.

See all articles