word2vector principle

Jan 10, 2020 am 10:54 AM

Mapping a word into a new space and representing it as a multi-dimensional vector of continuous real numbers is called "word representation" or "word embedding".

Since the start of the 21st century, practice has gradually moved from the original sparse representations of words to dense representations in a low-dimensional space.

Sparse representations often run into the curse of dimensionality in practical problems. They also cannot encode semantic information or reveal latent connections between words.

A low-dimensional dense representation not only avoids the curse of dimensionality but also captures relationships between words, which makes the vectors more semantically accurate.
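The contrast between the two representations can be illustrated with a small sketch. The three-word vocabulary and the two-dimensional dense values below are hand-picked assumptions for illustration only; a trained model would learn its own values.

```python
import math

def one_hot(index, size):
    """Sparse representation: a vector of zeros with a single 1."""
    vec = [0.0] * size
    vec[index] = 1.0
    return vec

def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

vocab = ["search", "engine", "meeting"]  # toy vocabulary (assumed)

# One-hot vectors: the dimension equals the vocabulary size, and any two
# distinct words are orthogonal, so their similarity is always 0.
sparse = {w: one_hot(i, len(vocab)) for i, w in enumerate(vocab)}
sparse_sim = cosine(sparse["search"], sparse["engine"])

# Dense vectors: illustrative values only, not learned from data.
dense = {
    "search": [0.9, 0.1],
    "engine": [0.8, 0.2],
    "meeting": [0.1, 0.9],
}
dense_sim_related = cosine(dense["search"], dense["engine"])
dense_sim_unrelated = cosine(dense["search"], dense["meeting"])
```

With one-hot vectors every pair of distinct words looks equally unrelated, while dense vectors can place related words closer together than unrelated ones.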

word2vec learning tasks

Suppose we have the following sentence: The search engine group will hold a group meeting today at 2 o'clock in the afternoon.

Task 1: For each word, use the words around it to predict the probability of generating that word. For example, use "today, afternoon, search, engine, group" to generate "2 o'clock".

Task 2: For each word, use the word itself to predict the probability of generating each of the other words. For example, use "2 o'clock" to generate each word in "today, afternoon, search, engine, group".
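As a rough sketch of how training examples for the two tasks might be built from a sentence: the tokenization of the example sentence and the window size of 2 are assumptions, and the helper names `cbow_pairs` and `skipgram_pairs` are hypothetical.

```python
def cbow_pairs(tokens, window):
    """Task 1: one (context words, target word) pair per position."""
    pairs = []
    for i, target in enumerate(tokens):
        left = tokens[max(0, i - window):i]
        right = tokens[i + 1:i + 1 + window]
        pairs.append((left + right, target))
    return pairs

def skipgram_pairs(tokens, window):
    """Task 2: one (center word, context word) pair per neighbor."""
    return [
        (target, ctx)
        for context, target in cbow_pairs(tokens, window)
        for ctx in context
    ]

# Assumed tokenization of the example sentence.
tokens = ["search", "engine", "group", "today", "afternoon",
          "2 o'clock", "hold", "meeting"]

task1 = cbow_pairs(tokens, window=2)
task2 = skipgram_pairs(tokens, window=2)

# The Task 1 example whose target is "2 o'clock".
context, target = task1[5]
```

A larger window would pull in more of the sentence as context; the window here is kept small only to keep the pairs easy to read.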

The constraint shared by both tasks is that, for a given input, the probabilities of all possible output words sum to 1.
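One common way to satisfy this constraint is to pass the raw scores for each candidate word through a softmax. The text does not name the normalization it uses, so the softmax here is an assumption, and the scores are made-up numbers.

```python
import math

def softmax(scores):
    """Turn arbitrary real scores into probabilities that sum to 1."""
    top = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

# Hypothetical raw scores for four candidate output words.
scores = [2.0, 1.0, 0.1, -1.0]
probs = softmax(scores)
```

Every probability is positive, the highest score gets the highest probability, and the values always add up to 1 regardless of the input scores.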

The word2vec model uses machine learning to improve accuracy on the tasks above. The two tasks correspond to two models: CBOW for Task 1 and skip-gram for Task 2. Unless otherwise specified, the analysis below uses CBOW.

The skip-gram model can be analyzed in the same way.
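To make Task 1 concrete, here is a minimal sketch of a CBOW-style model trained by gradient descent. It assumes the hidden layer is the average of the context word embeddings and that the output is a full softmax over the vocabulary; the class name `TinyCBOW`, the embedding size, the learning rate, and the toy vocabulary are all hypothetical choices, not details taken from the text.

```python
import math
import random

def softmax(scores):
    top = max(scores)
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

class TinyCBOW:
    """Hypothetical minimal CBOW model with a full softmax output."""

    def __init__(self, vocab, dim, seed=0):
        rng = random.Random(seed)
        self.index = {w: i for i, w in enumerate(vocab)}
        # Input embeddings and output weights, one row per word.
        self.w_in = [[rng.uniform(-0.5, 0.5) for _ in range(dim)] for _ in vocab]
        self.w_out = [[rng.uniform(-0.5, 0.5) for _ in range(dim)] for _ in vocab]

    def forward(self, context):
        ids = [self.index[w] for w in context]
        dim = len(self.w_in[0])
        # Hidden layer: average of the context word embeddings.
        hidden = [sum(self.w_in[i][d] for i in ids) / len(ids) for d in range(dim)]
        scores = [sum(h * w for h, w in zip(hidden, row)) for row in self.w_out]
        return ids, hidden, softmax(scores)

    def train_step(self, context, target, lr=0.1):
        """One SGD step on cross-entropy; returns the loss before the update."""
        ids, hidden, probs = self.forward(context)
        t = self.index[target]
        loss = -math.log(probs[t])
        # Gradient of the loss w.r.t. the scores: probs - one_hot(target).
        err = [p - (1.0 if j == t else 0.0) for j, p in enumerate(probs)]
        # Gradient w.r.t. the hidden layer, computed before w_out changes.
        grad_hidden = [
            sum(err[j] * self.w_out[j][d] for j in range(len(err)))
            for d in range(len(hidden))
        ]
        for j, e in enumerate(err):
            for d in range(len(hidden)):
                self.w_out[j][d] -= lr * e * hidden[d]
        for i in ids:
            for d in range(len(hidden)):
                self.w_in[i][d] -= lr * grad_hidden[d] / len(ids)
        return loss

vocab = ["search", "engine", "group", "today", "afternoon", "2 o'clock"]
model = TinyCBOW(vocab, dim=8)
context = ["today", "afternoon", "search", "engine", "group"]

# Repeatedly train on the single example from the text.
losses = [model.train_step(context, "2 o'clock") for _ in range(50)]
_, _, probs = model.forward(context)
```

Training on one example only shows the mechanics: the loss for that example falls as the probability assigned to "2 o'clock" rises. A skip-gram sketch would swap the roles, using the center word's embedding as the hidden layer and each context word as a target.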
