github.com/stanfordnlp/stanza.git
author John Bauer <horatio@gmail.com> 2022-05-05 09:07:55 +0300
committer John Bauer <horatio@gmail.com> 2022-11-11 09:19:18 +0300
commit 795a0f289cbb23541f02e65e9e94344fccee3672
tree 0c9302aed95b4f1ef62f3fcf3165f4c01ada7b4a
parent 1e462b9b975a20185f1c6dd5ebfa1418593463ac
NFC instead of NFD
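
As a rough illustration of what this one-word change affects, here is a minimal sketch using only the standard library. The sample strings and the variable names `decomposed`, `composed`, `nfc`, and `nfd` are assumptions chosen for the example and do not come from the repository. The commit does not state its motivation; the sketch only shows the observable difference between the two forms.

```python
import unicodedata


def normalize_text(text):
    # Same call as the patched line: compose characters into NFC form.
    return unicodedata.normalize('NFC', text)


# Two spellings of the same visible character (hypothetical sample data).
decomposed = "e\u0301"  # 'e' followed by COMBINING ACUTE ACCENT
composed = "\u00e9"     # precomposed LATIN SMALL LETTER E WITH ACUTE

# NFC merges base letter and combining mark into a single code point.
nfc = normalize_text(decomposed)

# NFD, the form used before this commit, splits them apart again.
nfd = unicodedata.normalize('NFD', composed)

print(len(nfc), len(nfd))
print(normalize_text(composed) == normalize_text(decomposed))
```

Either form makes the two spellings compare equal after normalization. What differs is the resulting string length and code point sequence, which matters to any downstream code that indexes or counts characters.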
-rw-r--r-- stanza/models/common/utils.py 2
1 files changed, 1 insertions, 1 deletions
diff --git a/stanza/models/common/utils.py b/stanza/models/common/utils.py
index 5e2365ce..91954b27 100644
--- a/stanza/models/common/utils.py
+++ b/stanza/models/common/utils.py
@@ -200,7 +200,7 @@ def print_config(config):
     logger.info("\n" + info + "\n")
 
 def normalize_text(text):
-    return unicodedata.normalize('NFD', text)
+    return unicodedata.normalize('NFC', text)
 
 def unmap_with_copy(indices, src_tokens, vocab):
     """