Initial Experiments on Russian to Kazakh SMT

dc.contributor.authorMyrzakhmetov, Bagdat
dc.contributor.authorMakazhanov, Aibek
dc.date.accessioned2017-01-11T03:53:33Z
dc.date.available2017-01-11T03:53:33Z
dc.date.issued2016
dc.description.abstractWe present our initial experiments on Russian to Kazakh phrase-based statistical machine translation. Following a common approach to SMT between morphologically rich languages, we employ morphological processing techniques. Namely, for our initial experiments, we perform source-side lemmatization. Given a rather humble-sized parallel corpus at hand, we also put some effort in data cleaning and investigate the impact of data quality vs. quantity trade off on the overall performance. Although our experiments mostly focus on source side preprocessing we achieve a substantial, statistically significant improvement over the baseline that operates on raw, unprocessed data.ru_RU
dc.identifier.citationMyrzakhmetov, Bagdat., Makazhanov, Aibek (2016) Initial Experiments on Russian to Kazakh SMT. Research in Computing Science 117. pp. 153–160. http://www.rcs.cic.ipn.mx/rcs/2016_117/ru_RU
dc.identifier.urihttp://nur.nu.edu.kz/handle/123456789/2233
dc.language.isoenru_RU
dc.publisherResearch in Computing Science 117ru_RU
dc.rightsAttribution-NonCommercial-ShareAlike 3.0 United States*
dc.rights.urihttp://creativecommons.org/licenses/by-nc-sa/3.0/us/*
dc.subjectstatistical machine translationru_RU
dc.subjectSMTru_RU
dc.subjectmachine translationru_RU
dc.subjectResearch Subject Categories::SOCIAL SCIENCES::Statistics, computer and systems scienceru_RU
dc.titleInitial Experiments on Russian to Kazakh SMTru_RU
dc.typeArticleru_RU

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Initial Experiments.pdf
Size:
102.83 KB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
6.22 KB
Format:
Item-specific license agreed upon to submission
Description: