
Removal of long parallel sentences

We sometimes found that long parallel sentences in the training data produced poor or incorrect phrase tables, so we removed these long sentence pairs. This method is effective for the Intrinsic-JE task, but it is less effective for the Intrinsic-EJ task.
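
The removal step can be illustrated with a minimal sketch. It assumes the corpus is a list of (source, target) sentence pairs that are already tokenized and space-separated. The function name `remove_long_pairs` and the default limit of 40 tokens are hypothetical and chosen only for illustration; the length threshold actually used in our experiments is not stated here.

```python
def remove_long_pairs(pairs, max_len=40):
    """Drop sentence pairs in which either side exceeds max_len tokens.

    pairs   -- list of (source, target) strings, assumed pre-tokenized
               and space-separated
    max_len -- hypothetical token limit, not a value from the experiments
    """
    kept = []
    for src, tgt in pairs:
        # Count tokens on each side by splitting on whitespace.
        if len(src.split()) <= max_len and len(tgt.split()) <= max_len:
            kept.append((src, tgt))
    return kept
```

A pair is kept only when both sides are within the limit, so the two sides of the corpus stay aligned line by line after filtering.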

However, we obtained experimental results for many parameter settings, and in many cases the proposed method was effective.



Jin'ichi Murakami 2008-12-22