github.com/moses-smt/mosesdecoder.git
Age | Commit message | Author
2019-09-04 | The dot before an acronym should be optional. [alvations-patch-regexes] | alvations
2019-08-22 | Merge pull request #211 from achimr/master | Hieu Hoang
    Support for Urdu in sentence splitter
2019-07-10 | Support for Urdu in sentence splitter | Achim Ruopp
2019-06-08 | tweak readme | Hieu Hoang
2019-04-27 | Merge pull request #210 from mjpost/patch-1 | Hieu Hoang
    escape angle brackets
2019-04-26 | escape angle brackets | Matt Post
    The script doesn't escape angle brackets which can result in bad SGML / XML output. This fixes that, although ideally, this should be implemented with a proper parser and dumper.
2019-03-02 | Merge pull request #209 from joelb-git/multi-bleu-detok-non-ascii-fix | Hieu Hoang
    Fix non-ASCII lowercasing
2019-02-27 | Fix non-ASCII lowercasing | Joel Barry
2019-01-17 | check state object are not null before using it. For alternate weights setting where some feature functions are not used for a particular sentence | Hieu Hoang
2019-01-04 | Revert "use ucfirst instead of defined uppercase function" | Hieu Hoang
    This reverts commit dfbb17e549d4cb4ece452c7224ae47a590b7a4da.
2019-01-03 | Merge pull request #207 from alvations/patch-truecaser | Hieu Hoang
    Reverting split_xml()
2019-01-03 | Reverting split_xml() | alvations
2018-12-30 | consistent output | Hieu Hoang
2018-12-29 | Merge pull request #206 from alvations/patch-truecaser | Hieu Hoang
    Patching truecaser
2018-12-28 | rename file so it appears on github website. Clarify mailing list | Hieu Hoang
2018-12-20 | use ucfirst instead of defined uppercase function | alvations
2018-12-20 | split_xml should be consistent for training and using | alvations
2018-12-10 | increase cores to 16. For bitextor azure pipeline | Hieu Hoang
2018-12-10 | put fix into UnorderedComparer again. Maybe weird template bug | Hieu Hoang
2018-12-10 | fix weird unordered set error on ubuntu 18.04, gcc 7.3.0, boost 1.65. May be over-optimizing or bug in gcc or boost | Hieu Hoang
2018-12-10 | debug | Hieu Hoang
2018-12-08 | ems config for moses2 | Hieu Hoang
2018-12-04 | sacre bleu | Hieu Hoang
2018-12-04 | sacre bleu | Hieu Hoang
2018-12-04 | use --discount_fallback | Hieu Hoang
2018-11-12 | Merge branch 'master' of github.com:moses-smt/mosesdecoder | Hieu Hoang
2018-11-12 | removing python port. Sacremoses is newer | Hieu Hoang
2018-11-11 | Merge pull request #205 from coylz/master | Hieu Hoang
    Add option "-b" (unbuffer output) to tokenizer scripts
2018-11-10 | Add option "-b" (unbuffer output) to tokenizer scripts | Loïc Vial
2018-11-09 | rename directory to work with python import | Hieu Hoang
2018-11-09 | python wrapper works | Hieu Hoang
2018-11-07 | start borging Luis Gomes code | Hieu Hoang
2018-11-07 | Merge pull request #204 from ozancaglayan/nb-fix | Hieu Hoang
    tokenizer.perl: split final dots unconditionally
2018-11-07 | tokenizer.perl: split final dots unconditionally | Ozan Caglayan
    Allow tokenization of non-breaking prefixes at end of sentences. This should be a fair compromise in many cases to construct a cleaner vocabulary.
    EN-old: So am I.
    EN-new: So am I .
    DE-old: ... schwer wie ein iPhone 5.
    DE-new: ... schwer wie ein iPhone 5 .
    FR-old: Des gens admirent une œuvre d' art.
    FR-new: Des gens admirent une œuvre d' art .
    CS-old: Dvě děti, které běží bez bot.
    CS-new: Dvě děti, které běží bez bot .
2018-10-30 | basic support for Gujarati and Hindi, backported from one of the many upstreams | Barry Haddow
2018-10-26 | Merge branch 'master' of github.com:moses-smt/mosesdecoder | Hieu Hoang
2018-10-26 | bump again | Hieu Hoang
2018-10-26 | Merge pull request #203 from maxthomas/contrib-modular-boost | Hieu Hoang
    contrib: make boost variable modular; update version to 1.68.0
2018-10-26 | bump | Hieu Hoang
2018-10-25 | contrib: make boost variable modular; update version to 1.68.0 | max thomas
2018-09-27 | Merge pull request #202 from thuvh/python3_compatible | Hieu Hoang
    fix print to compatible with python2 and python3
2018-09-26 | fix print to compatible with python2 and python3 | Hoai-Thu Vuong
2018-09-26 | multi-bleu-detok should take raw reference | Rico Sennrich
2018-09-16 | grammar | Hieu Hoang
2018-09-10 | Merge branch 'master' of github.com:moses-smt/mosesdecoder | Hieu Hoang
2018-09-10 | unused script | Hieu Hoang
2018-09-06 | Handle glottal stops in Somalian | Barry Haddow
2018-07-05 | Merge pull request #201 from louismartin/bleu-fix-newline | Hieu Hoang
    [BLEU] Fix multi-bleu.perl bug (no newline at end of file)
2018-07-03 | Fix multi-bleu.perl bug when file does not end with newline | Louis MARTIN
    When reading hypothesis and reference files, multi-bleu.perl uses the chop function to remove the trailing newline character. If one of these files happens to not end with a newline, then chop will remove the last character of the last line (instead of the newline). This causes the BLEU score to be slightly off from its theoretical value. Using the safest chomp function solves this problem, i.e. it only removes newlines when present.
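
The chop/chomp difference described in the 2018-07-03 commit can be illustrated with a small sketch. multi-bleu.perl itself is written in Perl; the Python functions `chop` and `chomp` below are hypothetical stand-ins that mimic the Perl built-ins (assuming the default input record separator, a newline) and are not part of Moses.

```python
def chop(line: str) -> str:
    """Mimic Perl's chop: always drop the last character, whatever it is."""
    return line[:-1]


def chomp(line: str) -> str:
    """Mimic Perl's chomp (default separator): drop a trailing newline only if present."""
    return line[:-1] if line.endswith("\n") else line


# Hypothetical last line of a hypothesis file that lacks a trailing newline.
last_line = "the cat sat on the mat"

print(chop(last_line).split())   # ['the', 'cat', 'sat', 'on', 'the', 'ma']
print(chomp(last_line).split())  # ['the', 'cat', 'sat', 'on', 'the', 'mat']

# When the newline is present, both behave the same.
print(chop("the mat\n") == chomp("the mat\n"))  # True
```

With chop, the final token of the file changes from "mat" to "ma", so n-gram matches against the reference are lost on that line; chomp leaves the line untouched.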
2018-06-25 | Merge branch 'RELEASE-4.0' of github.com:jowagner/mosesdecoder | Hieu Hoang