Welcome to mirror list, hosted at ThFree Co, Russian Federation.

github.com/moses-smt/mosesdecoder.git - Unnamed repository; edit this file 'description' to name the repository.
summaryrefslogtreecommitdiff
AgeCommit message (Collapse)Author
2015-05-02beautifyHieu Hoang
2015-05-01sentence-bleu less greedy regarding memoryMatthias Huck
Don't load all references, read them line by line. Corpora with millions of sentences can now be evaluated without consuming gigabytes of RAM.
2015-04-30sentence-bleu-nbestMatthias Huck
2015-03-29Fix some compile warnings (gcc 4.9.2).Jeroen Vermeulen
Mostly signed/unsigned comparisons and reordered member initializations; also a few unused variables. There are more, but if I chip away at them for a while, who knows, it may catch on and warnings may eventually become socially stigmatizing. :)
2014-09-22basic support for more metrics with kbmiraRico Sennrich
metrics need getReferenceLength (for background smoothing) to work with kbmira
2014-09-22calculateScore with float (for smoothing support)Rico Sennrich
2014-07-21Merge 08811deb17337356cd8dae9c59c0160590679a35 from joshuaBarry Haddow
2013-11-18replace CHECK with UTIL_THROW_IF in mertHieu Hoang
2013-05-29beautifyHieu Hoang
2013-02-18Implement brevity penalty smoothing for PROBarry Haddow
As in Nakov et al (Coling 2012)
2012-12-07Further optimization for extractor.Tetsuo Kiso
Fixes inefficient updating N-gram counts. NOTE: Using '--binary' option (this option is not enabled by default yet) for saving outputs would lead to significant speed up.
2012-12-07Use boost::unordered_map instead of std::map.Tetsuo Kiso
For storing the word vocabulary used in computation of BLEU scores. This change will reduce the running time of extractor about 2-3 seconds (9% reduction).
2012-12-06Speed up N-gram counts when running extractor.Tetsuo Kiso
By replacing std::map with boost::unordered_map. Runtime of extractor on 100-best lists of 2679 sentences: Before: real 0m35.314s user 0m34.030s sys 0m1.280s Ater: real 0m26.729s user 0m25.420s sys 0m1.310s
2012-11-04Using namespace std in a header file pollutes the global namespace.Tetsuo Kiso
Using directives should be put into the implementation files.
2012-07-17Merge branch 'trunk' into miramergeBarry Haddow
Compiles, not tested. Conflicts: Jamroot OnDiskPt/PhraseNode.h OnDiskPt/TargetPhrase.cpp OnDiskPt/TargetPhrase.h OnDiskPt/TargetPhraseCollection.cpp mert/BleuScorer.cpp mert/Data.cpp mert/FeatureData.cpp moses-chart-cmd/src/Main.cpp moses/src/AlignmentInfo.h moses/src/ChartManager.cpp moses/src/LM/Ken.cpp moses/src/LM/Ken.h moses/src/LMList.h moses/src/LexicalReordering.h moses/src/PhraseDictionaryTree.h moses/src/ScoreIndexManager.h moses/src/StaticData.h moses/src/TargetPhrase.h moses/src/Word.cpp scripts/ems/experiment.meta scripts/ems/experiment.perl scripts/training/train-model.perl
2012-06-30namespace all classes in mert directoryHieu Hoang
2012-06-24merge Lexi Birch's LRScore from mert_mtm5 branch. Compiles and run. Hack, ↵Hieu Hoang
must double check with barry or lexi
2012-06-07mergeEva Hasler
2012-06-01fix start weights in experiment.perl, add hypothesis queue for picking hope ↵Eva Hasler
and fear translations, add variations to 1slack formulation
2012-05-29Batch k-best MIRA is written and integrated into mert-moses.plColin Cherry
Regression tests all check out, and kbmira seems to work fine on a Hansard French->English task. HypPackEnumerator class may be of interest to pro.cpp and future optimizers, as it abstracts a lot of the boilerplate involved in enumerating multiple k-best lists. MiraWeightVector is not really mira-specific - just a weight vector that enables efficient averaging. Could be useful to a perceptron as well. Same goes for MiraFeatureVector. Interaction with sparse features is written, but untested.
2012-05-10changed permission, everything changed..Eva Hasler
2012-05-10Avoid "using namespace std" in headers.Tetsuo Kiso
2012-05-09Added support for external unix filters to preprocess sentences in mert and ↵Matous Machacek
evaluator
2012-04-30clean up mira, add sampling from hope/model/fearEva
2012-04-29testEva
2012-04-06Add test cases for BLEU and sentence-level BLEU+1.Tetsuo Kiso
- Move a definition of sentenceLevelBleuPlusOne() from pro.cpp to BleuScorer.cpp. - Add check for the length of an input vector.
2012-04-04Add a test case for BLEU's clipped counts.Tetsuo Kiso
- Make BleuScorer::setReferenceFiles() more testable by adding OpenReference() and OpenReferenceStream().
2012-03-20Add thread unsafe Singleton class.Tetsuo Kiso
- Add Vocabulary factory and the unit test. - Remove Scorer::ClearVocabulary().
2012-03-19Change the Encoder class to Vocabulary.Tetsuo Kiso
- Introduce the namespace to avoid naming collisions. The class name is used in KenLM. - Add the unit test.
2012-03-19Move Encoder class from Scorer.h to Ngram.h.Tetsuo Kiso
To add unit tests.
2012-03-19Add some functions to BleuScorer for unit testing.Tetsuo Kiso
This commit also includes - Fix typo. - Fix indentations. - Add 'const' to Scorer::applyFactors().
2012-03-18Create Reference class to clean up BleuScorer.Tetsuo Kiso
- Add an unit test for Reference. - Move functions to calculate the reference length from BleuScorer to Reference.
2012-03-14Clean up BleuScorer.Tetsuo Kiso
2012-03-14Clean up BleuScorer with lookup().Tetsuo Kiso
2012-03-14Create a header file for NgramCounts class.Tetsuo Kiso
The reason is that we want to add the unit test.
2012-03-09Minor change for calculating BLEU.Tetsuo Kiso
To avoid defining the similar variables twice to calculate document-wise BLEU and sentence-wise BLEU scores.
2012-02-28Support for using factors in mert and evaluatorMatous Machacek
example: Use --factor "0|2" to use only first and third factor from nbest list and from reference. If you use interpolated scorer, separate records with comma (e.g. --factor "0|2,1").
2012-02-25Remove obsolete and unused logging statements.Tetsuo Kiso
2012-02-25Don't repeat calling functions many times.Tetsuo Kiso
Consider using constants the result if it is possible.
2012-02-25Minor change the logging utility for n-gram counts.Tetsuo Kiso
Use std::ostream instead of directly using std::cerr.
2012-02-25Clean up calculation effective reference length.Tetsuo Kiso
2012-02-25Add a more efficient member to set up ScoreStats.Tetsuo Kiso
- Remove unnecessary conversions. - Add 'const' to local variables.
2012-02-25Replace string objects with const char[].Tetsuo Kiso
2012-02-25Hide the implementation details of Ngram counts from the header.Tetsuo Kiso
2012-02-25Clean up commented out code snippets for debugging purposes.Tetsuo Kiso
2012-02-01Create private class to encapssulate encoding process.Tetsuo Kiso
Instead of using typedefs inside a class only, it might be better to create a private class to do same things.
2012-02-01Add prefix 'm_' to private and protected members in Scorer classes.Tetsuo Kiso
2012-02-01Move reference length type into a private member of BleuScorer.Tetsuo Kiso
The reason is that the type is used as internal purpose.
2012-02-01Change casts to C++ style casts.Tetsuo Kiso
2012-02-01Change casts to C++ style casts, and delete unnecessary casts.Tetsuo Kiso