github.com/marian-nmt/marian.git
author: Marcin Junczys-Dowmunt <junczys@amu.edu.pl> 2017-03-27 17:47:43 +0300
committer: Marcin Junczys-Dowmunt <junczys@amu.edu.pl> 2017-03-27 17:47:43 +0300
commit: 8a4cad3a10f53b7fa656cf757347c7ca7d1ef0c7
tree: 17b5c2f29e2ee2a896b353a01e7976f748ae4b30
parent: 550f936d012f7376810b487daa4e87a04c07ce63
better readme
-rw-r--r-- examples/training/README.md | 12
1 files changed, 9 insertions, 3 deletions
diff --git a/examples/training/README.md b/examples/training/README.md
index 571907d2..1716eadc 100644
--- a/examples/training/README.md
+++ b/examples/training/README.md
@@ -1,6 +1,10 @@
# Example for training with Marian
-Files and scripts in this folder have been adapted from the Romanian-English sample from https://github.com/rsennrich/wmt16-scripts.
+Files and scripts in this folder have been adapted from the Romanian-English sample
+from https://github.com/rsennrich/wmt16-scripts. We also add the back-translated data from
+http://data.statmt.org/rsennrich/wmt16_backtranslations/ as described in
+http://www.aclweb.org/anthology/W16-2323. The resulting system should be competitive
+with or even slightly better than the results reported in the Edinburgh WMT2016 paper.
To execute the complete example type:
@@ -8,9 +12,11 @@ To execute the complete example type:
./run-me.sh
```
-which downloads the Romanian-English training files and preprocesses them (tokenization, truecasing, segmentation into subwords units).
+which downloads the Romanian-English training files and preprocesses them (tokenization,
+truecasing, segmentation into subword units).
-To use with a different GPU than device 0 or more GPUs (here 0 1 2 3) type:
+To use a GPU other than device 0, or several GPUs (here 0 1 2 3), type the command below.
+Training time on 4 GPUs should be roughly 24 hours.
```
./run-me.sh 0 1 2 3