Extract word matrix and context matrix from output of trainWordEmbedding / word2vec

Question

Daniel Ringel, 13 July 2018

https://kr.mathworks.com/matlabcentral/answers/410192-extract-word-matrix-and-context-matrix-from-output-of-trainwordembedding-word2vec

Answered: Jayanti, 14 February 2025, 14:21

When I use trainWordEmbedding on a set of documents to train a word embedding that I can then use word2vec with, I get an object "emb" as output that I can input into word2vec. Using word2vec I then get, for each word, the vectors that I can then further process.

However, I would like to also receive as output the underlying word matrix and context matrix (as well as the value of the loss of the training). Does anyone know how I can access these data?

Comments: 1

Christopher Creutzig, 26 November 2018

What exactly do you mean by “word matrix” and “context matrix”?

I guess the “context matrix” is what (some) other people call the cooccurrence matrix in the skip-gram model? We do not currently have a way to compute that.

Answer 1

Jayanti, 14 February 2025, 14:21

https://kr.mathworks.com/matlabcentral/answers/410192-extract-word-matrix-and-context-matrix-from-output-of-trainwordembedding-word2vec#answer_1559807

Hi Daniel,

By “word matrix” I assume you mean the unique words in the documents. When you use “trainWordEmbedding” to train a word embedding on a set of documents, it returns a word embedding object, here called “emb”. This object has a property named “Vocabulary”, which holds the unique words in the embedding as a string vector. You can access these words with the following code:

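% filename is the path to the text file used for training
emb = trainWordEmbedding(filename);
% Vocabulary is a string vector of the unique words in the embedding
words = emb.Vocabulary;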
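If “word matrix” instead means the matrix of learned word vectors, it can be assembled by passing the whole vocabulary to “word2vec”. The following is a minimal sketch, not a definitive recipe: the toy corpus and the small training settings are assumptions chosen only so the example is self-contained, and a real corpus would be far larger.

```matlab
% Toy corpus (hypothetical data, for illustration only).
textData = [
    "the quick brown fox jumps over the lazy dog"
    "the lazy dog sleeps while the quick fox runs"
    "a quick brown dog chases the lazy fox"];
documents = tokenizedDocument(textData);

% Small settings so the toy corpus trains; MinCount = 1 keeps every word.
emb = trainWordEmbedding(documents, ...
    'Dimension', 10, 'MinCount', 1, 'NumEpochs', 5, 'Verbose', 0);

words = emb.Vocabulary;      % string vector of the words in the embedding
W = word2vec(emb, words);    % one row per word, emb.Dimension columns

% Row i of W is the embedding vector of words(i).
foxVector = W(words == "fox", :);
```

Here W holds only the word (input) vectors. To save them outside MATLAB, writeWordEmbedding(emb, filename) writes the vocabulary and vectors to a text file.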

By “context matrix” I assume you mean the co-occurrence matrix. However, I couldn't find any documentation on accessing a co-occurrence matrix directly through “trainWordEmbedding” or “word2vec”.
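If window-based co-occurrence counts are what is needed, they can be computed from the raw text independently of the embedding. The sketch below uses only base MATLAB. The toy corpus, the whitespace tokenization, and the symmetric window of 2 tokens are all assumptions for illustration. Note that these are corpus counts, not the context (output) vectors that a skip-gram model learns internally; this thread does not identify a way to access those.

```matlab
% Toy corpus (hypothetical data); each string is one document.
textData = [
    "the quick brown fox"
    "the lazy dog"
    "the quick dog"];
windowSize = 2;   % assumed symmetric context window, in tokens

% Whitespace tokenization; swap in the tokenization used for training.
tokens = arrayfun(@(s) split(s).', textData, 'UniformOutput', false);
vocab = unique([tokens{:}]);   % sorted string vector of unique words
V = numel(vocab);

C = zeros(V);   % C(i,j): number of times vocab(j) occurs near vocab(i)
for d = 1:numel(tokens)
    [~, idx] = ismember(tokens{d}, vocab);   % token -> vocabulary index
    n = numel(idx);
    for t = 1:n
        lo = max(1, t - windowSize);
        hi = min(n, t + windowSize);
        for c = [lo:t-1, t+1:hi]             % context positions, excluding t
            C(idx(t), idx(c)) = C(idx(t), idx(c)) + 1;
        end
    end
end
```

The counts for a single word can then be read off as a row, for example C(vocab == "the", :). For a large vocabulary, a sparse matrix would be a more practical container than the dense zeros(V) used here.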

Hope this will be helpful!
