remove superfluous options

author: Marcin Junczys-Dowmunt <marcinjd@microsoft.com> 2018-11-26 22:29:54 +0300
committer: Marcin Junczys-Dowmunt <marcinjd@microsoft.com> 2018-11-26 22:29:54 +0300
commit: e14dc5428f90a5b01525dd9d99bcf421d8a27322 (patch)
tree: 92df144b3efe630e02523fa592bd9b8eeed439f1
parent: 47db5b55d39d7d616b33ab70d0e2c44a91687b97 (diff)
1 files changed, 8 insertions, 4 deletions
diff --git a/training-basics-sentencepiece/README.md b/training-basics-sentencepiece/README.md
index 03f34e5..3d9a99f 100644
--- a/training-basics-sentencepiece/README.md
+++ b/training-basics-sentencepiece/README.md
@@ -185,6 +185,8 @@ for details):
 00EE    69 # î => i
 ```
 
+<!-- @TODO: add example for ../../build/spm_normalize --normalization_rule_tsv=data/romanian.tsv -->
+
 ### Training the NMT model
 
 Next, we execute a training run with `marian`. Note how the training command is called passing the
@@ -237,6 +239,8 @@ mkdir model
 The training should stop if cross-entropy on the validation set
 stops improving. Depending on the number of and generation of GPUs you are using that may take a while.
 
+<!-- @TODO: add example for ../../build/spm_encode/spm_decode --model=model/vocab.roen.spm -->
+
 ### Translating the test and validation sets with evaluation
 
 After training, the model with the highest translation validation score is used
@@ -248,13 +252,13 @@ normalization and segmentation on the fly. Similarly, sacreBLEU expects raw text
 ```
 # translate dev set
 cat data/newsdev2016.ro \
-    | ../../build/marian-decoder -c model/model.npz.best-bleu-detok.npz.decoder.yml -d 0 1 2 3 -b 6 -n0.6 \
-      --mini-batch 64 --maxi-batch 100 --maxi-batch-sort src > data/newsdev2016.ro.output
+    | ../../build/marian-decoder -c model/model.npz.best-bleu-detok.npz.decoder.yml -d 0 1 2 3 \
+    > data/newsdev2016.ro.output
 
 # translate test set
 cat data/newstest2016.ro \
-    | ../../build/marian-decoder -c model/model.npz.best-bleu-detok.npz.decoder.yml -d 0 1 2 3 -b 6 -n0.6 \
-      --mini-batch 64 --maxi-batch 100 --maxi-batch-sort src > data/newstest2016.ro.output
+    | ../../build/marian-decoder -c model/model.npz.best-bleu-detok.npz.decoder.yml -d 0 1 2 3 \
+    > data/newstest2016.ro.output
 ```
 after which BLEU scores for the dev and test set are reported.
 ```
author	Marcin Junczys-Dowmunt <marcinjd@microsoft.com>	2018-11-26 22:29:54 +0300
committer	Marcin Junczys-Dowmunt <marcinjd@microsoft.com>	2018-11-26 22:29:54 +0300
commit	e14dc5428f90a5b01525dd9d99bcf421d8a27322 (patch)
tree	92df144b3efe630e02523fa592bd9b8eeed439f1
parent	47db5b55d39d7d616b33ab70d0e2c44a91687b97 (diff)