我想要一天分享一點「LLM從底層堆疊的技術」,並且每篇文章長度控制在三分鐘以內,讓大家不會壓力太大,但是又能夠每天成長一點。
以下附上參考項目:
- English-German BLEU scores with reference papers and code: https://paperswithcode.com/ sota/machine-translation-on-wmt2014-english-german
- The 2014 Workshop on Machine Translation (WMT): https://www.statmt.org/wmt14/ translation-task.html
- European Parliament Proceedings Parallel Corpus 1996-2011, parallel corpus French-English: https://www.statmt.org/europarl/v7/fr-en.tgz
- Jason Brownlee, Ph.D., How to Prepare a French-to-English Dataset for Machine Translation: https:// machinelearningmastery.com/prepare-french-english-dataset-machine-translation/
- Jason Brownlee, Ph.D., A Gentle Introduction to Calculating the BLEU Score for Text in Python: https://machinelearningmastery.com/calculate-bleu-score-for-text-python/
- Boxing Chen and Colin Cherry, 2014, A Systematic Comparison of Smoothing Techniques for Sen- tence-Level BLEU: http://acl2014.org/acl2014/W14-33/pdf/W14-3346.pdf
- Trax’s repository: https://github.com/google/trax
- Trax tutorial: https://trax-ml.readthedocs.io/en/latest/
以下附上額外閱讀項目:
- Kishore Papineni, Salim Roukos, Todd Ward, and Wei-Jing Zhu, 2002, BLEU: a Method for Automatic Evaluation of Machine Translation: https://aclanthology.org/P02-1040.pdf
- Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Lukasz Kaiser, and Illia Polosukhin, 2017, Attention Is All You Need: https://arxiv.org/abs/1706.03762