Training


The training process is as follows.

Parallel Corpus
We prepare a Chinese-English parallel corpus.
Rule-based Machine Translation
We translate the Chinese sentences with a Chinese-English rule-based machine translation system, which yields "ENGLISH" sentences. Each "ENGLISH" sentence is paired with the corresponding English sentence of the parallel corpus.
Make "ENGLISH"-English phrase table
We build an "ENGLISH"-English phrase table from these sentence pairs using training-phrase-model.perl [10].
English N-gram model
We build an N-gram model from the English sentences using SRILM [6].
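
The data flow of the steps above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the system described here: `rbmt_translate` is a hypothetical stand-in for the rule-based translator (it looks up invented toy outputs), the two-sentence corpus is made up, and `count_ngrams` is a plain counting routine that stands in for the N-gram statistics SRILM would collect; it does no smoothing. The phrase-table step itself is not reproduced; the sketch only shows the sentence pairs that would be handed to it.

```python
from collections import Counter


def rbmt_translate(chinese_sentence):
    """Hypothetical stand-in for the rule-based MT system (Chinese -> "ENGLISH")."""
    toy_output = {
        "我 喜欢 猫": "i like the cat",
        "他 喜欢 狗": "he like dog",
    }
    return toy_output[chinese_sentence]


def build_training_pairs(parallel_corpus):
    """Turn (Chinese, English) pairs into ("ENGLISH", English) pairs."""
    return [(rbmt_translate(zh), en) for zh, en in parallel_corpus]


def count_ngrams(sentences, n):
    """Count n-grams over whitespace-tokenized sentences with boundary markers."""
    counts = Counter()
    for sentence in sentences:
        tokens = ["<s>"] * (n - 1) + sentence.split() + ["</s>"]
        for i in range(len(tokens) - n + 1):
            counts[tuple(tokens[i:i + n])] += 1
    return counts


# Toy Chinese-English parallel corpus (invented for illustration).
parallel_corpus = [
    ("我 喜欢 猫", "i like cats"),
    ("他 喜欢 狗", "he likes dogs"),
]

# Input to phrase-table training: "ENGLISH"-English sentence pairs.
pairs = build_training_pairs(parallel_corpus)

# Input to language-model training: the English side only.
bigrams = count_ngrams([en for _, en in pairs], n=2)
```

The point of the sketch is that the phrase table is trained on the "ENGLISH" side paired with the English side, while the N-gram model sees only the English side of the corpus.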

Fig. 1 shows the flow chart of the training process.

**Figure 1:** Flowchart of Training
$\fbox{ \includegraphics[width=0.7\columnwidth]{figure/figure1.eps} }$

Jin'ichi Murakami, February 26, 2010