Age | Commit message (Collapse) | Author | |
---|---|---|---|
2017-01-05 | Create a Cantonese version, distinct from Mandarin. | Linas Vepstas | |
The content is identical, at this moment, but having distinct langauge suffixes solves processing-pipeline problems later on. | |||
2017-01-05 | Preliminary support for Chinese. | Linas Vepstas | |
2017-01-05 | More abbreviations for LLithuanian. | Linas Vepstas | |
2017-01-05 | More abbreviations | Linas Vepstas | |
2017-01-05 | New file: Lithuanian | Linas Vepstas | |
2016-07-31 | Single lower-case letter French word | Antoine Dusséaux | |
"a" is a single lower-case letter French word that can be at the end of a sentence: "Oui, il l'a." | |||
2015-09-23 | Create nonbreaking_prefix.ga | Jim Regan | |
2015-08-23 | dos2unix everything | Hieu Hoang | |
2015-01-20 | May is not an abbreviation | Kenneth Heafield | |
2014-12-11 | chmod | Hieu Hoang | |
2014-12-05 | Month abbreviations shouldn't be causing a sentence split. | Kenneth Heafield | |
Yes this will break existing tokenized data :-(. | |||
2014-09-04 | Merge pull request #72 from flammie/master | Hieu Hoang | |
Add Finnish non-breaking prefixes | |||
2014-09-04 | fix location and remove english notes | Flammie Pirinen | |
2014-08-06 | move notice about czech prefixes to share/README | Hieu Hoang | |
2014-01-05 | Tamil tokenization /P.Arththika | Hieu Hoang | |
2013-10-07 | Update nonbreaking_prefix.el | Dimitris Mavroeidis | |
Added non-breaking prefixes for Greek. | |||
2013-08-16 | Fixed bug in tokenizer.perl where comma separated lists of single | Jeremy Gwinnup | |
characters aren't handled correctly input> A,B,C,D,E,F yielded> A, B,C , D,E , F now yields> A, B, C, D, E, F Updated Russian nonbreaking prefixes list with capital letters | |||
2013-03-19 | Hungarian and Latvian non-breaking prefix files | Achim | |
2012-07-17 | Merge branch 'trunk' into miramerge | Barry Haddow | |
Compiles, not tested. Conflicts: Jamroot OnDiskPt/PhraseNode.h OnDiskPt/TargetPhrase.cpp OnDiskPt/TargetPhrase.h OnDiskPt/TargetPhraseCollection.cpp mert/BleuScorer.cpp mert/Data.cpp mert/FeatureData.cpp moses-chart-cmd/src/Main.cpp moses/src/AlignmentInfo.h moses/src/ChartManager.cpp moses/src/LM/Ken.cpp moses/src/LM/Ken.h moses/src/LMList.h moses/src/LexicalReordering.h moses/src/PhraseDictionaryTree.h moses/src/ScoreIndexManager.h moses/src/StaticData.h moses/src/TargetPhrase.h moses/src/Word.cpp scripts/ems/experiment.meta scripts/ems/experiment.perl scripts/training/train-model.perl | |||
2012-07-14 | czech prefixes | Karel Bílek | |
2012-06-26 | lock m_vocab variable access in Encode() and Lookup(). Other functions are ↵ | Hieu Hoang | |
still not threadsafe |