Neural Machine Translation For Low-Resource Languages
Source     4 , 5 . cuáles son algunas de las preguntas más importantes que podemos hacernos , y por qué debemos buscar las respuestas ?
Reference  4 , 5 . what are some of the most important questions we can ask in life , and why should we seek the answers ?
SMT        4 , 5 . what are some of the questions that matter most that we can make , and why should we find answers ?
HNMT       4 , 5 . what are some of the bible , and why ?
Our        , 5 . what are some of the special more important that can , and we why should feel the answers

Table 2: Example translations from the different systems (Spanish-English; Watchtower test set, trained on Watchtower data).