(1) |
Here, is the number of words which are used to represent the meanings of sentences.
As for words to be used for vector elements, similarly to the information retrieval system which uses controlled Key Words, important words are statistically selected by some conventional method, such as "" from all of the documents in the database. The values of weight are usually determined dependent on the frequency of the appearance of word .
Here, we call the specific vector given by
(1) as "Word-Vector" and the VSM which uses this type of the specific
vector as "W-VSM" (Word-Vector Space Model).