Welcome to mirror list, hosted at ThFree Co, Russian Federation.

github.com/marian-nmt/sentencepiece.git - Unnamed repository; edit this file 'description' to name the repository.
summaryrefslogtreecommitdiff
diff options
context:
space:
mode:
authorTaku Kudo <taku910@users.noreply.github.com>2018-05-01 16:09:24 +0300
committerGitHub <noreply@github.com>2018-05-01 16:09:24 +0300
commit9aa2b9c7ad606e0f9f6f0b03646197e20c29ca2e (patch)
tree77adba3d7448f1bd6e4aa8a3d8e432b0d738091a /README.md
parent8183193de86fdfa4b335acd089480dce0210fce3 (diff)
Update README.md
Diffstat (limited to 'README.md')
-rw-r--r--README.md4
1 files changed, 2 insertions, 2 deletions
diff --git a/README.md b/README.md
index 8d26dc5..71a5ede 100644
--- a/README.md
+++ b/README.md
@@ -94,8 +94,8 @@ special symbol. Tokenized sequences do not preserve the necessary information to
Subword regularization [[Kudo.](https://arxiv.org/abs/1804.10959)] is a simple regularization method
that virtually augments training data with on-the-fly subword sampling, which helps to improve the accuracy as well as robustness of NMT models.
-To enable subword regularization, you would like to use the SentencePiece library
-([C++](doc/api.md#sampling-subword-regularization)/[Python](python/README.md)) to sample one segmentation for each parameter update, which is different from the standard off-line data preparations. Here's the example of [Python library](python/README.md). You can find that 'New York' is segmented differently on each ``SampleEncode`` call. The details of sampling parameters are found in [sentencepiece_processor.h](src/sentencepiece_processor.h).
+To enable subword regularization, you would like to integrate SentencePiece library
+([C++](doc/api.md#sampling-subword-regularization)/[Python](python/README.md)) into the NMT system to sample one segmentation for each parameter update, which is different from the standard off-line data preparations. Here's the example of [Python library](python/README.md). You can find that 'New York' is segmented differently on each ``SampleEncode`` call. The details of sampling parameters are found in [sentencepiece_processor.h](src/sentencepiece_processor.h).
```
>>> import sentencepiece as spm