author | Marcin Junczys-Dowmunt <junczys@amu.edu.pl> | 2017-03-27 17:47:43 +0300 |
---|---|---|
committer | Marcin Junczys-Dowmunt <junczys@amu.edu.pl> | 2017-03-27 17:47:43 +0300 |
commit | 8a4cad3a10f53b7fa656cf757347c7ca7d1ef0c7 (patch) | |
tree | 17b5c2f29e2ee2a896b353a01e7976f748ae4b30 | |
parent | 550f936d012f7376810b487daa4e87a04c07ce63 (diff) |
better readme
-rw-r--r-- | examples/training/README.md | 12 |
1 files changed, 9 insertions, 3 deletions
diff --git a/examples/training/README.md b/examples/training/README.md
index 571907d2..1716eadc 100644
--- a/examples/training/README.md
+++ b/examples/training/README.md
@@ -1,6 +1,10 @@
 # Example for training with Marian
 
-Files and scripts in this folder have been adapted from the Romanian-English sample from https://github.com/rsennrich/wmt16-scripts.
+Files and scripts in this folder have been adapted from the Romanian-English sample
+from https://github.com/rsennrich/wmt16-scripts. We also add the back-translated data from
+http://data.statmt.org/rsennrich/wmt16_backtranslations/ as desribed in
+http://www.aclweb.org/anthology/W16-2323. The resulting system should be competitive
+or even slightly better than reported in the Edinburgh WMT2016 paper.
 
 To execute the complete example type:
 
@@ -8,9 +12,11 @@ To execute the complete example type:
 ./run-me.sh
 ```
 
-which downloads the Romanian-English training files and preprocesses them (tokenization, truecasing, segmentation into subwords units).
+which downloads the Romanian-English training files and preprocesses them (tokenization,
+truecasing, segmentation into subwords units).
 
-To use with a different GPU than device 0 or more GPUs (here 0 1 2 3) type:
+To use with a different GPU than device 0 or more GPUs (here 0 1 2 3) type the command below.
+Training time of 4 GPUs should be roughly 24 hours.
 
 ```
 ./run-me.sh 0 1 2 3