Home Database Mysql Tutorial oracle 全文索引详细做法

oracle 全文索引详细做法

Jun 07, 2016 pm 05:46 PM
index nbsp oracle

oracle 全文索引详细做法


Oracle从7.3开始支持全文检索,即用户可以使用Oracle服务器的上下文(ConText)选项完成基于文本的查询。具体可以采用通配符查找、模糊匹配、相关分类、近似查找、条件加权和词意扩充等方法。在Oracle8.0.x中称为ConText ;在Oracle8i中称为interMedia Text ; Oracle9i中称为Oracle Text。

Oracle Text是9i标准版和企业版的一部分。Oracle9i将全文检索功能做为内置功能提供给用户,使得用户在创建实例时自动安装全文检索。Oracle Text的应用领域有很多:

l         搜索文本 :需要快捷有效搜索文本数据的应用程序。

l         管理多种文档:允许搜索各种混和文档格式的应用程序,包括ord,excel,lotus等。

l         从多种数据源中检索文本:不仅来自Oracle数据库中的文本数据,而且可以来自Internet和文件系统的文本数据。

l         搜索XML应用程序。

 

GRANT CTXAPP TO XYADMIN;
 Begin
    -- 定义一个词法分析器
    --ctx_ddl.drop_preference('cnlex');
    --ctx_ddl.create_preference('cnlex','CHINESE_LEXER');  --针对中文
    ctx_ddl.create_preference('cnlex','chinese_vgram_lexer');  --针对中文
    -- 定义一个相关词表
    --ctx_ddl.drop_preference('mywordlist');
    --ctx_ddl.create_preference('mywordlist', 'BASIC_WORDLIST');
    --ctx_ddl.set_attribute('mywordlist','PREFIX_INDEX','TRUE');
    --ctx_ddl.set_attribute('mywordlist','PREFIX_MIN_LENGTH',1);
    --ctx_ddl.set_attribute('mywordlist','PREFIX_MAX_LENGTH', 5);
    --ctx_ddl.set_attribute('mywordlist','SUBSTRING_INDEX', 'YES');
  end;
 
COMMIT;
drop index CorporationName_index force
drop index BusinessScope_index force

--CREATE INDEX CorporationName_index ON CorporationMainTable(CorporationName) indextype is ctxsys.context
--parameters ('DATASTORE CTXSYS.DIRECT_DATASTORE FILTER
--CTXSYS.NULL_FILTER LEXER cnlex WORDLIST mywordlist');

CREATE INDEX CorporationName_index ON CorporationMainTable(CorporationName) indextype is ctxsys.context PARAMETERS ('lexer cnlex');
CREATE INDEX BusinessScope_index ON CorporationMainTable(BusinessScope) INDEXTYPE IS CTXSYS.CONTEXT PARAMETERS ('lexer cnlex');

 

exec ctx_ddl.sync_index('CorporationName_index');

-- sync:
VARIABLE jobno number;
BEGIN
DBMS_JOB.SUBMIT(:jobno,'ctx_ddl.sync_index(''CorporationName_index'');
ctx_ddl.sync_index(''BusinessScope_index'');
ctx_ddl.sync_index(''LegalRepresentative_index'');',
SYSDATE, 'SYSDATE + (1/24/4)');
commit;
END;

-- optimizer
VARIABLE jobno number;
BEGIN
DBMS_JOB.SUBMIT(:jobno,'ctx_ddl.optimize_index(''CorporationName_index'',''FULL'');
ctx_ddl.optimize_index(''BusinessScope_index'',''FULL'');
ctx_ddl.optimize_index(''LegalRepresentative_index'',''FULL'');',
SYSDATE, 'SYSDATE + 1');
commit;
END;
grant execute any procedure to CREDIT_CORP;
exec dbms_job.run(73);
exec dbms_job.remove(69);
exec dbms_job.remove(72);
* from user_jobs;

--其中,第一个job的SYSDATE + (1/24/4)是指每隔15分钟同步一次,第二个job的SYSDATE + 1是每隔1天做一次全优化。具体的时间间隔,你可以根据自己的应用的需要而定。至此,你的全文检索功能已设置完成。

搜索文本

       不使用Oracle text功能,也有很多方法可以在Oracle数据库中搜索文本.可以使用标准的INSTR函数和LIKE操作符实现.

SELECT *

  FROM  mytext

 WHERE  INSTR (thetext, 'Oracle') > 0;

SELECT *

  FROM mytext

 WHERE thetext LIKE '%Oracle%';

有很多时候,使用instr和like是很理想的,特别是搜索仅跨越很小的表的时候。然而通过这些文本定位的方法将导致全表扫描,对资源来说消耗比较昂贵,而且实现的搜索功能也非常有限。

利用Oracle Text,你可以回答如“在存在单词’Oracle’的行同时存在单词’Corporation’而且两单词间距不超过10个单词的文本,查询含有单词’Oracle’或者单词’california’的文本,并且将结果按准确度进行排序,含有词根train的文本”,以下的sql代码实现了如上功能,我们且不管这些语法是如何使用的:

DROP INDEX index mytext_idx;  --丢弃索引mytext_idx

/

CREATE INDEX mytext_idx

ON mytext( thetext )

INDEXTYPE is CTXSYS.CONTEXT; --创建CONTEXT类型索引mytext_idx

/

SELECT id

  FROM mytext

 WHERE contains (thetext, 'near((Oracle,Corporation),10)') > 0; --发出contains查询

SELECT score (1), id

    FROM mytext

   WHERE contains (thetext, 'Oracle or california', 1) > 0

ORDER BY score (1) DESC

/

SELECT id

  FROM mytext

 WHERE contains (thetext, '$train') > 0;

 

--其中,第一个job的SYSDATE + (1/24/4)是指每隔15分钟同步一次,第二个job的SYSDATE + 1是每隔1天做一次全优化。具体的时间间隔,你可以根据自己的应用的需要而定。至此,你的全文检索功能已设置完成。

Statement of this Website
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn

Hot AI Tools

Undresser.AI Undress

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress AI Tool

Undress images for free

Clothoff.io

Clothoff.io

AI clothes remover

AI Hentai Generator

AI Hentai Generator

Generate AI Hentai for free.

Hot Article

R.E.P.O. Energy Crystals Explained and What They Do (Yellow Crystal)
2 weeks ago By 尊渡假赌尊渡假赌尊渡假赌
Repo: How To Revive Teammates
4 weeks ago By 尊渡假赌尊渡假赌尊渡假赌
Hello Kitty Island Adventure: How To Get Giant Seeds
3 weeks ago By 尊渡假赌尊渡假赌尊渡假赌

Hot Tools

Notepad++7.3.1

Notepad++7.3.1

Easy-to-use and free code editor

SublimeText3 Chinese version

SublimeText3 Chinese version

Chinese version, very easy to use

Zend Studio 13.0.1

Zend Studio 13.0.1

Powerful PHP integrated development environment

Dreamweaver CS6

Dreamweaver CS6

Visual web development tools

SublimeText3 Mac version

SublimeText3 Mac version

God-level code editing software (SublimeText3)

Function to calculate the number of days between two dates in oracle Function to calculate the number of days between two dates in oracle May 08, 2024 pm 07:45 PM

The function in Oracle to calculate the number of days between two dates is DATEDIFF(). The specific usage is as follows: Specify the time interval unit: interval (such as day, month, year) Specify two date values: date1 and date2DATEDIFF(interval, date1, date2) Return the difference in days

How long will Oracle database logs be kept? How long will Oracle database logs be kept? May 10, 2024 am 03:27 AM

The retention period of Oracle database logs depends on the log type and configuration, including: Redo logs: determined by the maximum size configured with the "LOG_ARCHIVE_DEST" parameter. Archived redo logs: Determined by the maximum size configured by the "DB_RECOVERY_FILE_DEST_SIZE" parameter. Online redo logs: not archived, lost when the database is restarted, and the retention period is consistent with the instance running time. Audit log: Configured by the "AUDIT_TRAIL" parameter, retained for 30 days by default.

The order of the oracle database startup steps is The order of the oracle database startup steps is May 10, 2024 am 01:48 AM

The Oracle database startup sequence is: 1. Check the preconditions; 2. Start the listener; 3. Start the database instance; 4. Wait for the database to open; 5. Connect to the database; 6. Verify the database status; 7. Enable the service (if necessary ); 8. Test the connection.

How to use interval in oracle How to use interval in oracle May 08, 2024 pm 07:54 PM

The INTERVAL data type in Oracle is used to represent time intervals. The syntax is INTERVAL <precision> <unit>. You can use addition, subtraction, multiplication and division operations to operate INTERVAL, which is suitable for scenarios such as storing time data and calculating date differences.

How to determine whether two strings are contained in oracle How to determine whether two strings are contained in oracle May 08, 2024 pm 07:00 PM

In Oracle, you can use the nested INSTR function to determine whether a string contains two substrings at the same time: when INSTR(string1, string2a) is greater than 0 and INSTR(string1, string2b) is greater than 0, it is included; otherwise, it is not included.

How to see the number of occurrences of a certain character in Oracle How to see the number of occurrences of a certain character in Oracle May 09, 2024 pm 09:33 PM

To find the number of occurrences of a character in Oracle, perform the following steps: Get the total length of a string; Get the length of the substring in which a character occurs; Count the number of occurrences of a character by subtracting the substring length from the total length.

Oracle database server hardware configuration requirements Oracle database server hardware configuration requirements May 10, 2024 am 04:00 AM

Oracle database server hardware configuration requirements: Processor: multi-core, with a main frequency of at least 2.5 GHz. For large databases, 32 cores or more are recommended. Memory: At least 8GB for small databases, 16-64GB for medium sizes, up to 512GB or more for large databases or heavy workloads. Storage: SSD or NVMe disks, RAID arrays for redundancy and performance. Network: High-speed network (10GbE or higher), dedicated network card, low-latency network. Others: Stable power supply, redundant components, compatible operating system and software, heat dissipation and cooling system.

How much memory does oracle require? How much memory does oracle require? May 10, 2024 am 04:12 AM

The amount of memory required by Oracle depends on database size, activity level, and required performance level: for storing data buffers, index buffers, executing SQL statements, and managing the data dictionary cache. The exact amount is affected by database size, activity level, and required performance level. Best practices include setting the appropriate SGA size, sizing SGA components, using AMM, and monitoring memory usage.

See all articles