Welcome to mirror list, hosted at ThFree Co, Russian Federation.

github.com/moses-smt/mosesdecoder.git - Unnamed repository; edit this file 'description' to name the repository.
summaryrefslogtreecommitdiff
AgeCommit message (Expand)Author
2017-04-07ignore words where there is nothing to caseOndrej Bojar
2015-06-08Merge branch 'master' of https://github.com/moses-smt/mosesdecoderLexi Birch
2015-06-08Allowing the truecaser to work on uncased ASR input, pass the -a flagLexi Birch
2015-06-02Fixing lint. Only 600 or so lines of errors left!Jeroen Vermeulen
2015-05-29Add license notices to scripts.Jeroen Vermeulen
2015-05-17Fix a lot of lint, mostly trailing whitespace.Jeroen Vermeulen
2015-04-13add use warnings to all perl scriptsHieu Hoang
2015-04-02consistently use 'env perl' command for environments where the 1st perl in PA...Hieu Hoang
2015-03-18Replace truecase-egret.sh with more general tree-converter-wrapper.perlPhil Williams
2015-03-11proper handling of specified configuration filePhilipp Koehn
2015-03-10Add truecase-egret.shPhil Williams
2015-02-10Relative pathKenneth Heafield
2015-02-10default path update in train-recaserCharley C
2015-02-04more efficient default recaser trainingPhilipp Koehn
2014-10-14reduce lmplz memory consumption in recaserHieu Hoang
2014-06-07default kenlm training and inference in recaserphikoehn
2014-06-06allow < > in factorsPhilipp Koehn
2014-04-05Avoid errors in truecaser if input isn't factored and contains vertical bars.Ulrich Germann
2014-02-08don't complain if input contains non-escaped '<' or '>', but is not XMLRico Sennrich
2014-01-30fix truecaser with XML input (didn't do anything depending on formatting/whit...Rico Sennrich
2013-03-04added unbuffered mode for casers (using -b)Christian Buck
2013-01-14bug fix with MML settingsphikoehn
2013-01-14fixed bug in detruecaser / interaction with esacpingphikoehn
2013-01-14bug fixes with escaping / truecasing interactionsphikoehn
2012-10-20use kenlm if sri specifiedHieu Hoang
2012-09-25exit 0 on success. /Henry HuHieu Hoang
2012-07-11distortion limit for recaser should be 0Rico Sennrich
2012-07-11truecase corpus before training recaserRico Sennrich
2012-06-08default pt implementation if no phrase table specifiedHieu Hoang
2011-11-27- Bug fix: when --help set, errors on absence of --corpus or --dir must not b...Jehan
2011-11-27- Exit with failure when a step of train-recaser.sh fails.Jehan
2011-11-25- Help output for train-recaser script.Jehan
2011-11-25- Coding style fix: use the upstream coding style.Jehan
2011-11-25- Recaser train script updated to support IRSTLM as well.Jehan
2010-11-09add --possiblyUseFirstToken option, which, when selected, allows certain sent...bgottesman
2010-10-13delete duplicate detokenizerhieuhoang1972
2010-10-11keep perl scripts with Unix line endingshieuhoang1972
2010-04-16Merge remaining script support for tree-based models from mt3_chart.pjwilliams
2010-03-18set utf8 mode on the input and output files, instead of on stdin and stdout, ...bgottesman
2010-02-03uppercasing first letter even if after punctbojar
2009-02-09bug fixphkoehn
2009-02-09added truecaserphkoehn
2008-02-22added some heuristics for Czech quotation marksbojar
2008-02-22added optional sentence uppercasing (use -u)bojar
2007-04-04added "-v 0" moses flag to decoder call to minimize log output.jdschroeder
2007-03-26Adding simple Czech rules to detokenizer. Making detokenizer 'released'.bojar
2007-03-26Adding detokenizer from WMT07 shared scripts.tgz, hoping there are no copyrightbojar
2007-03-26Proper unicode-based lower and uppercasing.bojar
2007-03-15add svn id comments to start of filehieuhoang1972
2007-03-15add svn id comments to start of filehieuhoang1972