Welcome to mirror list, hosted at ThFree Co, Russian Federation.

github.com/moses-smt/mosesdecoder.git - Unnamed repository; edit this file 'description' to name the repository.
summaryrefslogtreecommitdiff
diff options
context:
space:
mode:
authorBarry Haddow <barry.haddow@gmail.com>2019-10-31 19:12:59 +0300
committerBarry Haddow <barry.haddow@gmail.com>2019-10-31 19:12:59 +0300
commitd708e26b60ea8496fd41ed7fd512eb67bd9015ac (patch)
tree635869b2f782419b4618a11aea1553c2dd0a47d8 /scripts/ems/support
parent0fef8ebf4cd8fd4ffb3cca69b003ae7bae3a91b5 (diff)
use block notation for indic scripts
Diffstat (limited to 'scripts/ems/support')
-rwxr-xr-xscripts/ems/support/split-sentences.perl2
1 files changed, 1 insertions, 1 deletions
diff --git a/scripts/ems/support/split-sentences.perl b/scripts/ems/support/split-sentences.perl
index 6268e1d68..e553c3b05 100755
--- a/scripts/ems/support/split-sentences.perl
+++ b/scripts/ems/support/split-sentences.perl
@@ -170,7 +170,7 @@ sub preprocess {
)
\s+
( [\'\"\x{201e}\x{bb}\(\[\¿\¡\p{IsPi}]*
- [\x{0900}-\x{097F}\x{0a80}-\x{0aff}]
+ [\p{Block: Devanagari_Extended}\p{Block: Gujarati}]
)
}{$1\n$2}gx;
}