github.com/marian-nmt/sentencepiece.git
author    Dmitrii Kravchenko <equivalence1@gmail.com>  2020-10-15 14:51:35 +0300
committer Dmitrii Kravchenko <equivalence1@gmail.com>  2020-10-15 14:51:35 +0300
commit    d796421cbaa4b8f57f8005bb3c1a1ad4173d68d8
tree      75ed571eae158d56238d02f7ba5e9595f7271508
parent    0b5bd12205364c064a93c77d425ac7e6f8e41df3
Fix SIL symbol in code snippet: _ -> ▁
-rw-r--r-- README.md 2
1 file changed, 1 insertion, 1 deletion
diff --git a/README.md b/README.md
index 5543d79..d1873e7 100644
--- a/README.md
+++ b/README.md
@@ -84,7 +84,7 @@ Then, this text is segmented into small pieces, for example:
 Since the whitespace is preserved in the segmented text, we can detokenize the text without any ambiguities.
 ```
- detokenized = ''.join(pieces).replace('_', ' ')
+ detokenized = ''.join(pieces).replace('▁', ' ')
 ```
 This feature makes it possible to perform detokenization without relying on language-specific resources.
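
The change above matters because the whitespace marker is U+2581 (`▁`), not the ASCII underscore (U+005F). A minimal sketch of the corrected snippet follows. The `detokenize` helper, the `WHITESPACE_MARKER` name, and the sample piece lists are illustrative assumptions and are not part of the README or the SentencePiece API.

```python
# Illustrative sketch only: `detokenize` and `WHITESPACE_MARKER` are
# hypothetical names, not part of the SentencePiece library.

# U+2581 LOWER ONE EIGHTH BLOCK, the whitespace marker shown in the diff.
WHITESPACE_MARKER = "\u2581"


def detokenize(pieces):
    """Join pieces and turn each whitespace marker back into a space."""
    return "".join(pieces).replace(WHITESPACE_MARKER, " ")


# Hand-written pieces (assumed for illustration, not real model output).
pieces = ["Hello", "\u2581Wor", "ld", "."]
print(detokenize(pieces))

# A literal ASCII underscore in the text survives detokenization, which
# the old replace('_', ' ') version would have turned into a space.
print(detokenize(["\u2581snake", "_", "case"]))
```

With the pre-fix version, the marker would remain in the output and any genuine underscore in the text would be replaced by a space.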