Welcome to mirror list, hosted at ThFree Co, Russian Federation.

github.com/moses-smt/mosesdecoder.git - Unnamed repository; edit this file 'description' to name the repository.
summaryrefslogtreecommitdiff
diff options
context:
space:
mode:
authorHieu Hoang <hieuhoang@gmail.com>2020-06-02 03:19:36 +0300
committerGitHub <noreply@github.com>2020-06-02 03:19:36 +0300
commitd90a8df86240e4352d812d0c89d8c561e427c7e2 (patch)
tree841c26449ad81a3317810f37f87773a1b49b5062 /scripts
parent89b9b4fba2cb11dc2a2602ecdcace17b6ec4a86a (diff)
parentb7038d5f24f8b17ca82f263d5f849cf67258202c (diff)
Merge pull request #221 from HjalmarrSv/master
Added some for sv
Diffstat (limited to 'scripts')
-rw-r--r--scripts/share/nonbreaking_prefixes/nonbreaking_prefix.sv53
1 files changed, 52 insertions, 1 deletions
diff --git a/scripts/share/nonbreaking_prefixes/nonbreaking_prefix.sv b/scripts/share/nonbreaking_prefixes/nonbreaking_prefix.sv
index df5ef2959..f061a2b1a 100644
--- a/scripts/share/nonbreaking_prefixes/nonbreaking_prefix.sv
+++ b/scripts/share/nonbreaking_prefixes/nonbreaking_prefix.sv
@@ -25,22 +25,73 @@ W
X
Y
Z
#misc abbreviations
+#If all words in text are in small case, then tex, mao, tom, maj, may be confused with names, and iaf, etc with named entities.
AB
-G
VG
dvs
+d.v.s
+d. v. s
etc
from
+fr.o.m
+fr. o. m
iaf
+i.a.f
+i. a. f
jfr
kl
kr
mao
+m.a.o
+m. a. o
mfl
+m.fl
+m. fl
mm
+m.m
+m. m.
osv
+o.s.v
+o. s. v
pga
+p.g.a
+p. g. a
tex
+t.ex
+t. ex
+#tom. is risky, as tom is a word, and can be at end of sentence. One recent text has 9 tom., and 52 tom not at end of sentence.
tom
+t.o.m
+t. o. m
vs
+adv
+jur
+kand
+mag
+fil
+lic
+prop
+d
+f
+s
+mha
+m.h.a
+m. h. a
+vol
+#months
+jan
+feb
+mar
+apr
+#maj is a full word
+jun
+jul
+aug
+sep
+okt
+nov
+dec