github.com/marian-nmt/sentencepiece.git
author Taku Kudo <taku910@users.noreply.github.com> 2018-05-01 15:55:08 +0300
committer GitHub <noreply@github.com> 2018-05-01 15:55:08 +0300
commit 8183193de86fdfa4b335acd089480dce0210fce3
tree cb62b87cffd36af810997f6eceae9f293ba8c046 /README.md
parent 694896108c299d982e8ecdbc24855ce3817a645f
Update README.md
Diffstat (limited to 'README.md')
-rw-r--r-- README.md 2
1 files changed, 1 insertions, 1 deletions
diff --git a/README.md b/README.md
index eb55d04..8d26dc5 100644
--- a/README.md
+++ b/README.md
@@ -48,7 +48,7 @@ vocabulary. Unlike most unsupervised word segmentation algorithms, which
assume an infinite vocabulary, SentencePiece trains the segmentation model such
that the final vocabulary size is fixed, e.g., 8k, 16k, or 32k.
-Note that SentencePices specifies the final vocabulary size for training, which is different from the
+Note that SentencePices specifies the final vocabulary size for training, which is different from
[subword-nmt](https://github.com/rsennrich/subword-nmt) that uses the number of merge operations.
The number of merge operations is a BPE-specific parameter and not applicable to other segmentation algorithms, including unigram, word and character.
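
To illustrate the distinction the README text draws, here is a toy BPE learner in Python. It is only a sketch under stated assumptions: it is not the subword-nmt or SentencePiece implementation, it skips end-of-word markers, and `learn_bpe` is a hypothetical name chosen for this example. It shows that with BPE the user picks the number of merges and the vocabulary size follows from it (at most base characters plus merges), rather than being fixed in advance.

```python
from collections import Counter


def learn_bpe(words, num_merges):
    """Toy BPE (hypothetical helper): apply up to `num_merges` merges.

    Returns (merges, vocab), where vocab is the set of symbols seen.
    """
    # Each word starts as a tuple of single-character symbols.
    corpus = Counter(tuple(w) for w in words)
    vocab = {ch for symbols in corpus for ch in symbols}
    merges = []
    for _ in range(num_merges):
        # Count adjacent symbol pairs, weighted by word frequency.
        pairs = Counter()
        for symbols, freq in corpus.items():
            for a, b in zip(symbols, symbols[1:]):
                pairs[(a, b)] += freq
        if not pairs:
            break  # every word is a single symbol; nothing left to merge
        # Most frequent pair; ties are broken by the pair itself.
        best = max(pairs, key=lambda p: (pairs[p], p))
        merges.append(best)
        vocab.add(best[0] + best[1])
        # Rewrite the corpus with the chosen pair merged.
        new_corpus = Counter()
        for symbols, freq in corpus.items():
            out, i = [], 0
            while i < len(symbols):
                if i + 1 < len(symbols) and (symbols[i], symbols[i + 1]) == best:
                    out.append(symbols[i] + symbols[i + 1])
                    i += 2
                else:
                    out.append(symbols[i])
                    i += 1
            new_corpus[tuple(out)] += freq
        corpus = new_corpus
    return merges, vocab


words = ["low", "lower", "lowest", "newer", "wider"]
base = len({ch for w in words for ch in w})  # number of distinct characters
merges, vocab = learn_bpe(words, num_merges=10)
# The vocabulary size is a by-product of the merge count, not an input.
print(base, len(merges), len(vocab))
```

In this sketch the only knob is `num_merges`. Reaching a target vocabulary size means choosing the merge count to match the base character set, which is why that parameter does not carry over to unigram, word, or character models.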