diff options
author | bgottesman <bgottesman@1f5c12ca-751b-0410-a591-d2e778427230> | 2011-08-08 19:02:56 +0400 |
---|---|---|
committer | bgottesman <bgottesman@1f5c12ca-751b-0410-a591-d2e778427230> | 2011-08-08 19:02:56 +0400 |
commit | 14587cdafc42cdbff9221c07b5545551ecec475b (patch) | |
tree | 94072fc1768258fc972820f41c16a0551e29ac71 /regression-testing | |
parent | 79142d18e644325e0a870610d54652d9618b60de (diff) |
fix a detokenization bug that was preventing the removal of the whitespace following a contracted French or Italian article/pronoun (e.g. "l' immigration") when the contraction was the second-last word in the segment
remove the expectation of failure on the corresponding unit test
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@4133 1f5c12ca-751b-0410-a591-d2e778427230
Diffstat (limited to 'regression-testing')
-rw-r--r-- | regression-testing/run-test-detokenizer.t | 7 |
1 files changed, 1 insertions, 6 deletions
diff --git a/regression-testing/run-test-detokenizer.t b/regression-testing/run-test-detokenizer.t index f9cc3423a..9d677b43e 100644 --- a/regression-testing/run-test-detokenizer.t +++ b/regression-testing/run-test-detokenizer.t @@ -82,9 +82,7 @@ Moi, j'ai une apostrophe. EXP ); -# A (failing) French test involving an apostrophe on the second-last word -{ -my $testCase = +# A French test involving an apostrophe on the second-last word &addDetokenizerTest("TEST_FRENCH_APOSTROPHE_PENULTIMATE", "fr", <<'TOK' de musique rap issus de l' immigration @@ -95,9 +93,6 @@ de musique rap issus de l'immigration EXP ); -$testCase->setExpectedToFail("A bug is causing this to be detokenized wrong."); -} - # A German test involving non-ASCII characters # Note: We don't specify a language because the detokenizer errors if you pass in a language for which it has no special rules, of which German is an example. &addDetokenizerTest("TEST_GERMAN_NONASCII", undef, |