vec2word

Map embedding vector to word

所有的页面崩溃

Syntax

words = vec2word(emb,M)

[words,dist] = vec2word(emb,M)

___= vec2word(emb,M,k)

___= vec2word(___,'Distance',distance)

Description

example

words= vec2word(emb,M)returns the closest words to the embedding vectors in the rows ofM.

example

[words,dist] = vec2word(emb,M)returns the closest words to the embedding vectors inM, and returns the distancesdistof each to their source vectors.

example

___= vec2word(emb,M,k)returns the topkclosest words.

example

___= vec2word(___,'Distance',distance)specifies the distance metric.

Examples

collapse all

Map Words to Vectors and Back

Open Live Script

Load a pretrained word embedding usingfastTextWordEmbedding. This function requires Text Analytics Toolbox™ Modelfor fastText English 16 Billion Token Word Embeddingsupport package. If this support package is not installed, then the function provides a download link.

emb = fastTextWordEmbedding

emb = wordEmbedding with properties: Dimension: 300 Vocabulary: [1×1000000 string]

Map the words "Italy", "Rome", and "Paris" to vectors usingword2vec.

italy = word2vec(emb,"Italy"); rome = word2vec(emb,"Rome"); paris = word2vec(emb,"Paris");

Map the vectoritaly - rome + paristo a word usingvec2word.

word = vec2word(emb,italy - rome + paris)

word = "France"

Find Closest Words to Vector

Open Live Script

Find the top five closest words to a word embedding vector and their distances.

emb = fastTextWordEmbedding;

Map the words "Italy", "Rome", and "Paris" to vectors using word2vec.

italy = word2vec(emb,"Italy"); rome = word2vec(emb,"Rome"); paris = word2vec(emb,"Paris");

Map the vectoritaly - rome + paristo a word usingvec2word. Find the top five closest words using the Euclidean distance metric.

k = 5; M = italy - rome + paris; [words,dist] = vec2word(emb,M,k,'Distance','euclidean');

条形图中的单词和距离。

figure; bar(dist) xticklabels(words) xlabel("Word") ylabel("Distance") title("Distances to Vector")

Input Arguments

collapse all

`emb`—Input word embedding
`wordEmbedding`object

Input word embedding, specified as awordEmbeddingobject.

`M`—Word embedding vectors
matrix

Word embedding vectors, specified as a matrix. Each row ofMis a word embedding vector.Mmust haveemb.Dimensioncolumns.

`k`—Number of closest words
positive integer

Number of closest words to return, specified as a positive integer.

`distance`—Distance metric
`“因为ine'`(default) |`'euclidean'`

Distance metric, specified as“因为ine'or'euclidean'.

Output Arguments

collapse all

`words`— Output words
字符串向量

Output words, returned as a string vector.

`dist`— Distance of words to source vectors
vector

Distance of words to their source vectors, returned as a vector.

Version History

Introduced in R2017b

vec2word

Syntax

Description

Examples

Map Words to Vectors and Back

Find Closest Words to Vector

Input Arguments

`emb`—Input word embedding
`wordEmbedding`object

`M`—Word embedding vectors
matrix

`k`—Number of closest words
positive integer

`distance`—Distance metric
`“因为ine'`(default) |`'euclidean'`

Output Arguments

`words`— Output words
字符串向量

`dist`— Distance of words to source vectors
vector

Version History

See Also

Topics

vec2word

Syntax

Description

Examples

Map Words to Vectors and Back

Find Closest Words to Vector

Input Arguments

emb—Input word embeddingwordEmbeddingobject

M—Word embedding vectorsmatrix

k—Number of closest wordspositive integer

distance—Distance metric“因为ine'(default) |'euclidean'

Output Arguments

words— Output words字符串向量

dist— Distance of words to source vectorsvector

Version History

See Also

Topics

`emb`—Input word embedding
`wordEmbedding`object

`M`—Word embedding vectors
matrix

`k`—Number of closest words
positive integer

`distance`—Distance metric
`“因为ine'`(default) |`'euclidean'`

`words`— Output words
字符串向量

`dist`— Distance of words to source vectors
vector