Age | Commit message (Expand) | Author |
2017-04-07 | ignore words where there is nothing to case | Ondrej Bojar |
2015-06-08 | Merge branch 'master' of https://github.com/moses-smt/mosesdecoder | Lexi Birch |
2015-06-08 | Allowing the truecaser to work on uncased ASR input, pass the -a flag | Lexi Birch |
2015-06-02 | Fixing lint. Only 600 or so lines of errors left! | Jeroen Vermeulen |
2015-05-29 | Add license notices to scripts. | Jeroen Vermeulen |
2015-05-17 | Fix a lot of lint, mostly trailing whitespace. | Jeroen Vermeulen |
2015-04-13 | add use warnings to all perl scripts | Hieu Hoang |
2015-04-02 | consistently use 'env perl' command for environments where the 1st perl in PA... | Hieu Hoang |
2015-03-18 | Replace truecase-egret.sh with more general tree-converter-wrapper.perl | Phil Williams |
2015-03-11 | proper handling of specified configuration file | Philipp Koehn |
2015-03-10 | Add truecase-egret.sh | Phil Williams |
2015-02-10 | Relative path | Kenneth Heafield |
2015-02-10 | default path update in train-recaser | Charley C |
2015-02-04 | more efficient default recaser training | Philipp Koehn |
2014-10-14 | reduce lmplz memory consumption in recaser | Hieu Hoang |
2014-06-07 | default kenlm training and inference in recaser | phikoehn |
2014-06-06 | allow < > in factors | Philipp Koehn |
2014-04-05 | Avoid errors in truecaser if input isn't factored and contains vertical bars. | Ulrich Germann |
2014-02-08 | don't complain if input contains non-escaped '<' or '>', but is not XML | Rico Sennrich |
2014-01-30 | fix truecaser with XML input (didn't do anything depending on formatting/whit... | Rico Sennrich |
2013-03-04 | added unbuffered mode for casers (using -b) | Christian Buck |
2013-01-14 | bug fix with MML settings | phikoehn |
2013-01-14 | fixed bug in detruecaser / interaction with esacping | phikoehn |
2013-01-14 | bug fixes with escaping / truecasing interactions | phikoehn |
2012-10-20 | use kenlm if sri specified | Hieu Hoang |
2012-09-25 | exit 0 on success. /Henry Hu | Hieu Hoang |
2012-07-11 | distortion limit for recaser should be 0 | Rico Sennrich |
2012-07-11 | truecase corpus before training recaser | Rico Sennrich |
2012-06-08 | default pt implementation if no phrase table specified | Hieu Hoang |
2011-11-27 | - Bug fix: when --help set, errors on absence of --corpus or --dir must not b... | Jehan |
2011-11-27 | - Exit with failure when a step of train-recaser.sh fails. | Jehan |
2011-11-25 | - Help output for train-recaser script. | Jehan |
2011-11-25 | - Coding style fix: use the upstream coding style. | Jehan |
2011-11-25 | - Recaser train script updated to support IRSTLM as well. | Jehan |
2010-11-09 | add --possiblyUseFirstToken option, which, when selected, allows certain sent... | bgottesman |
2010-10-13 | delete duplicate detokenizer | hieuhoang1972 |
2010-10-11 | keep perl scripts with Unix line endings | hieuhoang1972 |
2010-04-16 | Merge remaining script support for tree-based models from mt3_chart. | pjwilliams |
2010-03-18 | set utf8 mode on the input and output files, instead of on stdin and stdout, ... | bgottesman |
2010-02-03 | uppercasing first letter even if after punct | bojar |
2009-02-09 | bug fix | phkoehn |
2009-02-09 | added truecaser | phkoehn |
2008-02-22 | added some heuristics for Czech quotation marks | bojar |
2008-02-22 | added optional sentence uppercasing (use -u) | bojar |
2007-04-04 | added "-v 0" moses flag to decoder call to minimize log output. | jdschroeder |
2007-03-26 | Adding simple Czech rules to detokenizer. Making detokenizer 'released'. | bojar |
2007-03-26 | Adding detokenizer from WMT07 shared scripts.tgz, hoping there are no copyright | bojar |
2007-03-26 | Proper unicode-based lower and uppercasing. | bojar |
2007-03-15 | add svn id comments to start of file | hieuhoang1972 |
2007-03-15 | add svn id comments to start of file | hieuhoang1972 |