Welcome to mirror list, hosted at ThFree Co, Russian Federation.

github.com/moses-smt/mosesdecoder.git - Unnamed repository; edit this file 'description' to name the repository.
summaryrefslogtreecommitdiff
diff options
context:
space:
mode:
authorRico Sennrich <rico.sennrich@gmx.ch>2014-01-30 03:01:53 +0400
committerRico Sennrich <rico.sennrich@gmx.ch>2014-01-30 03:01:53 +0400
commitd26fe4cc4dc8bc417fcdc91ffe05cd6dc8503f4c (patch)
treed75c32c1b7bfa62feb23e6ab0788ceb8d2d44a2b /scripts/recaser/truecase.perl
parent86ee3e15a441aec72eaebdd0389fa925da2316c7 (diff)
fix truecaser with XML input (didn't do anything depending on formatting/whitespace)
Diffstat (limited to 'scripts/recaser/truecase.perl')
-rwxr-xr-xscripts/recaser/truecase.perl2
1 files changed, 1 insertions, 1 deletions
diff --git a/scripts/recaser/truecase.perl b/scripts/recaser/truecase.perl
index 517f5c7a1..a1340f3b6 100755
--- a/scripts/recaser/truecase.perl
+++ b/scripts/recaser/truecase.perl
@@ -74,7 +74,7 @@ sub split_xml {
$MARKUP[$i] .= $1." ";
$line = $2;
}
- elsif ($line =~ /^\s*(\S+)(.*)$/) {
+ elsif ($line =~ /^\s*([^\s<>]+)(.*)$/) {
$WORD[$i++] = $1;
$MARKUP[$i] = "";
$line = $2;