next up previous
Next: Unknown Words Up: Discussion Previous: Discussion

Removal of long parallel sentences

We sometimes found that poor or wrong phrase tables caused long parallel sentences in training data. So, we removed these long parallel sentences. This method is effective for the Challenge-EC task. However, this method is not so effective for the BTEC-CE and Challenge-CE tasks. This proposed method was effective for IWSLT 2007. So this method may have low reliablilty.



Jin'ichi Murakami 2008-10-28