Welcome to mirror list, hosted at ThFree Co, Russian Federation.

github.com/moses-smt/mosesdecoder.git - Unnamed repository; edit this file 'description' to name the repository.
summaryrefslogtreecommitdiff
AgeCommit message (Collapse)Author
2015-06-09moses/phrase-extract refactoringPhil Williams
Final commit in this round of refactoring (which started with commit 2f735998...). The main changes are: - a general storage mechanism for attribute/value pairs in XML-style tree / lattice input. E.g. the "pcfg-score" and "semantic-role" attributes in: <tree label="PRP" pcfg-score="1.0" semantic-role="AGENT"> I </tree> - consolidation of the various near-duplicate Tree / XmlTreeParser classes that have accumulated over the years (my fault) - miscellaneous de-crufting
2015-06-04Ongoing moses/phrase-extract refactoringPhil Williams
2015-06-04Ongoing moses/phrase-extract refactoringPhil Williams
2015-06-03Ongoing moses/phrase-extract refactoringPhil Williams
2015-06-03Ongoing moses/phrase-extract refactoringPhil Williams
2015-06-03Ongoing moses/phrase-extract refactoringPhil Williams
2015-06-02Ongoing moses/phrase-extract refactoringPhil Williams
2015-06-01Add LeafIterator and ConstLeafIterator to MosesTraining::Syntax::TreePhil Williams
2015-06-01Add ConstPreOrderIterator to MosesTraining::Syntax::TreePhil Williams
2015-05-29Ongoing moses/phrase-extract refactoringPhil Williams
2015-05-29Rename MosesTraining::SyntaxTree to MosesTraining::SyntaxNodeCollectionPhil Williams
This is the first step in a small-scale refactoring effort that will touch a lot of the syntax-related code in moses/phrase-extract. The end goals are: - a storage mechanism for general attribute/value pairs in XML-style tree / lattice input. E.g. the "pcfg-score" and "semantic-role" attributes in: <tree label="PRP" pcfg-score="1.0" semantic-role="AGENT"> I </tree> - consolidation of the various near-duplicate Tree / XmlTreeParser classes that have accumulated over the years (my fault) - general de-crufting
2015-04-22Support tokenize(const std::string &) as well.Jeroen Vermeulen
Convenience wrapper: the actual function takes a const char[], but many of the call sites want to pass a string and have to call its c_str() first.
2015-04-22Unify tokenize() into util, and unit-test it.Jeroen Vermeulen
The duplicate definition works fine in environments where the inline definition becomes a weak symbol in the object file, but if it gets generated as a regular definition, the duplicate definition causes link problems. In most call sites the return value could easily be made const, which gives both the reader and the compiler a bit more certainty about the code's intentions. In theory this may help performance, but it's mainly for clarity. The comments are based on reverse-engineering, and the unit tests are based on the comments. It's possible that some of what's in there is not essential, in which case, don't feel bad about changing it! I left a third identical definition in place, though I updated it with my changes to avoid creeping divergence, and noted the duplication in a comment. It would be nice to get rid of this definition as well, but it'd introduce headers from the main Moses tree into biconcor, which may be against policy.
2014-12-07phrase-extract: add syntax-common sub-libraryPhil Williams
And remove some (near-)duplicate code from pcfg-common and score-stsg.