$$ V = (w_1, w_2, \ldots, w_n) \tag{1} $$

Here, $n$ is the number of words used to represent
the meanings of sentences.
The words used as vector elements are chosen in the same way as in
information retrieval systems that use controlled keywords:
important words are statistically selected by some conventional
method from all of the documents in the
database. The value of each weight $w_i$
is usually determined by
the frequency of appearance of the $i$-th selected word.
We call the vector given by
(1) a "Word-Vector", and the VSM that uses this type of
vector "W-VSM" (Word-Vector Space Model).
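
As a minimal sketch of how a Word-Vector of the form (1) might be built, the Python snippet below counts how often each selected word appears in a document. The function name `word_vector`, the sample vocabulary, the whitespace tokenization, and the use of raw counts as weights are all assumptions made for illustration; the text only says that weights usually depend on word frequency, so this is not the paper's exact weighting scheme.

```python
from collections import Counter


def word_vector(document, vocabulary):
    """Return one weight per vocabulary word, in vocabulary order.

    Assumption: the i-th weight is the raw count of the i-th
    vocabulary word in the document (a simple stand-in for a
    frequency-based weight).
    """
    # Naive tokenization: lowercase, then split on whitespace.
    counts = Counter(document.lower().split())
    # Counter returns 0 for words that never appear.
    return [counts[word] for word in vocabulary]


# Hypothetical vocabulary of statistically selected words.
vocabulary = ["vector", "space", "model", "retrieval"]
document = "The vector space model maps each document to a vector"

print(word_vector(document, vocabulary))  # [2, 1, 1, 0]
```

The vector length equals the vocabulary size, so every document in the database maps to a point in the same space regardless of its own length.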