diff options
author | bgottesman <bgottesman@1f5c12ca-751b-0410-a591-d2e778427230> | 2011-08-05 23:14:01 +0400 |
---|---|---|
committer | bgottesman <bgottesman@1f5c12ca-751b-0410-a591-d2e778427230> | 2011-08-05 23:14:01 +0400 |
commit | c030dae094632972ea1d69c03735d10a57f2c1e9 (patch) | |
tree | 685611496c1fd3841a725f9f833f137062ad0312 /regression-testing | |
parent | d7752b44fc52f3dbb7f682a5646e39d6daf11fe0 (diff) |
Allow a test case to have an undefined language, since the detokenizer doesn't require a language to be passed in and, indeed, errors if a language is passed in for which there are no special rules (which seems dubious to me ...). Add test case TEST_GERMAN_NONASCII with an undefined language.
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@4130 1f5c12ca-751b-0410-a591-d2e778427230
Diffstat (limited to 'regression-testing')
-rw-r--r-- | regression-testing/run-test-detokenizer.t | 16 |
1 files changed, 15 insertions, 1 deletions
diff --git a/regression-testing/run-test-detokenizer.t b/regression-testing/run-test-detokenizer.t index 1331b4c19..aaea9c768 100644 --- a/regression-testing/run-test-detokenizer.t +++ b/regression-testing/run-test-detokenizer.t @@ -98,6 +98,20 @@ EXP $testCase->setExpectedToFail("A bug is causing this to be detokenized wrong."); } +# A German test involving non-ASCII characters +# Note: We don't specify a language because the detokenizer errors if you pass in a language for which it has no special rules, of which German is an example. +&addDetokenizerTest("TEST_GERMAN_NONASCII", undef, +<<'TOK' +Ich hoffe , daß Sie schöne Ferien hatten . +Frau Präsidentin ! Frau Díez González und ich hatten einige Anfragen +TOK +, +<<'EXP' +Ich hoffe, daß Sie schöne Ferien hatten. +Frau Präsidentin! Frau Díez González und ich hatten einige Anfragen +EXP +); + ###################################### # Now run those babies ... ###################################### @@ -145,7 +159,7 @@ sub runDetokenizerTest { close TRUTH; &runTest($testCase->getName(), $testOutputDir, $tokenizedFile, sub { - return [$detokenizer, "-l", $testCase->getLanguage()]; + return defined($testCase->getLanguage())? [$detokenizer, "-l", $testCase->getLanguage()] : [$detokenizer]; }, sub { &verifyIdentical($testCase->getName(), $expectedFile, catfile($testOutputDir, "stdout.txt")) }, 1, $testCase->getFailureExplanation()); |