github.com/marian-nmt/marian.git - Unnamed repository; edit this file 'description' to name the repository.

Age	Commit message (Collapse)	Author
2022-02-08	update VERSION and CHANGELOG - Release 1.11.0HEAD 1.11.0 master	Marcin Junczys-Dowmunt

2022-02-08	Document Structure (#910)	Graeme Nail
	* Add architectural outline * Update index
2022-02-08	add -DDETERMINISTIC=ON/OFF flag (#912)	Marcin Junczys-Dowmunt
	* Add -DDETERMINISTIC=ON/OFF flag to CMake * Use -DDETERMINISTIC=on in GitHub/Azure workflows Co-authored-by: Roman Grundkiewicz <rgrundkiewicz@gmail.com>
2022-02-07	fix server behaviour	Marcin Junczys-Dowmunt

2022-02-07	update regression tests pointer	Marcin Junczys-Dowmunt

2022-02-06	resolve conflicts	Marcin Junczys-Dowmunt

2022-02-06	merged with master	Marcin Junczys-Dowmunt

2022-01-30	Update list of contributors (#906)	Roman Grundkiewicz

2022-01-28	Cherry picked cleaning/refeactoring patches (#905)	Roman Grundkiewicz
	Cherry-picked updates from pull request #457 Co-authored-by: Mateusz Chudyk <mateuszchudyk@gmail.com>
2022-01-26	Layer documentation (#892)	Qianqian Zhu
	* More examples for MLP layers and docs about RNN layers * Docs about embedding layer and more doxygen code docs * Add layer and factors docs into index.rst * Update layer documentation * Fix typos Co-authored-by: Roman Grundkiewicz <rgrundkiewicz@gmail.com> Co-authored-by: Graeme Nail <graemenail.work@gmail.com>
2022-01-24	Update VERSION	Roman Grundkiewicz

2022-01-24	Improve checks on transformer cache (#881)	Graeme Nail
	* Fix caching in transformer attention * Move hash specialization * Swap comments to doxygen * Include string header
2022-01-18	Update VERSION	Roman Grundkiewicz

2022-01-18	Scorer model loading (#860)	Graeme Nail
	* Add MMAP as an option * Use io::isBin * Allow getYamlFromModel from an Item vector * ScorerWrapper can now load on to a graph from Item vector The interface IEncoderDecoder can now call graph loads directly from an Item Vector. * Translator loads model before creating scorers Scorers are created from an Item vector * Replace model-config try-catch with check using IsNull * Prefer empty vs size * load by items should be pure virtual * Stepwise forward load to encdec * nematus can load from items * amun can load from items * loadItems in TranslateService * Remove logging * Remove by filename scorer functions * Replace by filename createScorer * Explicitly provide default value for get model-mmap * CLI option for model-mmap only for translation and CPU compile * Ensure model-mmap option is CPU only * Remove move on temporary object * Reinstate log messages for model loading in Amun / Nematus * Add log messages for model loading in scorers Co-authored-by: Roman Grundkiewicz <rgrundkiewicz@gmail.com>
2021-12-16	Update VERSION	Roman Grundkiewicz

2021-12-16	Use apple accelerate on MacOs by default (#897)	Nikolay Bogoychev

2021-12-07	Fix AVX2+ detection on Mac (#895)	Nikolay Bogoychev
	MacOS is weird and its CPU flags are separated in two separate fields returned by the sysctl interface. To get around this, we need to test both of them, so here goes Co-authored-by: Roman Grundkiewicz <rgrundkiewicz@gmail.com>
2021-12-07	Documentation about how to write code documentation (#891)	Qianqian Zhu
	* add initial guidelines of code documentation * fix math formula not displayed in Sphinx * remove @name tags which cannot be extracted by exhale and cause function signature errors * fix markdown ref warning and update markdown parser in sphinx * more about doxygen: add Doxygen commands and math formulas * move code doc guide to a new .rst file * add formula image * Set myst-parser version appropriate for the requested sphinx version * Update documentation on how to write Doxygen comments * Add new section to the documentation index * Sphinx 2.4.4 requires myst-parser 0.14 * complete code doc guide and small fixes on reStructuredText formats * More about reStructuredText * Update badges on the documentation frontpage Co-authored-by: Roman Grundkiewicz <rgrundkiewicz@gmail.com>
2021-12-07	Merged PR 21648: Allow for dynamic gradient scaling to fade out after N updates	Marcin Junczys-Dowmunt
	Allow for dynamic gradient scaling to fade out after N updates
2021-12-06	Constrain version of mistune to before v2 in GitHub CI Documentation builds ↵	Graeme Nail
	(#894)
2021-11-25	update CHANGELOG and VERSION	Marcin Junczys-Dowmunt

2021-11-25	Merged PR 21553: Parallelize data reading for training	Marcin Junczys-Dowmunt
	This parallelizes data reading. On very fast GPUs and with small models training speed can be starved by too slow batch creation. Use --data-threads 8 or more, by default currently set to 1 for backcompat.
2021-11-23	Add GCC 11 support (#888)	Nikolay Bogoychev
	* Add GCC 11 support Some C++ Standard Library headers have been changed to no longer include other headers that they do need to depend on. As such, C++ programs that used standard library components without including the right headers will no longer compile. The following headers are used less widely in libstdc++ and may need to be included explicitly when compiled with GCC 11: <limits> (for std::numeric_limits) <memory> (for std::unique_ptr, std::shared_ptr etc.) <utility> (for std::pair, std::tuple_size, std::index_sequence etc.) <thread> (for members of namespace std::this_thread.) Co-authored-by: Roman Grundkiewicz <rgrundkiewicz@gmail.com>
2021-11-22	Task alias validation during training mode (#886)	Nikolay Bogoychev
	* Attempt to validate task alias * Validate allowed options for --task alias * Update comment in aliases.cpp * Show allowed values for alias Co-authored-by: Roman Grundkiewicz <rgrundkiewicz@gmail.com>
2021-11-22	Update submodule regression-tests	Roman Grundkiewicz

2021-11-22	Added pragma to ignore unused-private-field error on elementType_ which ↵	David Meikle
	failed in macOS (#872) Co-authored-by: Roman Grundkiewicz <rgrundkiewicz@gmail.com>
2021-11-22	Merged PR 20729: Add top-k sampling	Marcin Junczys-Dowmunt
	This adds Top-K sampling to Marian and extends the --output-sampling option to take arguments
2021-11-21	Update badges in README.md	Roman Grundkiewicz

2021-10-26	Merged PR 21151: Cleaning up fp16 behavior	Marcin Junczys-Dowmunt
	This PR improves clipping and pruning behavior of NaNs and Infs during fp16 training, ultimately avoiding the underflow problems that we were facing so far.
2021-10-26	Merged PR 21166: Keep building on macOS-10.15	Roman Grundkiewicz
	Marian does not compile on macOS 11.6, so the build has stopped working due to an upgrade from macOS-10.15 to macOS 11.6 in Azure Pipelines: https://github.com/actions/virtual-environments/issues/4060 This PR explicitly set macOS 10.15 in the workflow.
2021-10-13	Merged PR 20933: beam & batch works for n on-factored models	Hieu Hoang

2021-10-11	Remove Ubuntu 16.04 from GitHub workflows (#879)	Roman Grundkiewicz
	* Add --allow-unauthenticated when installing CUDA * Remove workflow with Ubuntu 16.04
2021-09-28	Merged PR 20879: Adjustable ffn width and depth in transformer decoder	Marcin Junczys-Dowmunt

2021-09-28	Merged PR 20839: Do not ignore ignoreEOS for spm decoding	Marcin Junczys-Dowmunt
	With final space this eliminates trailing whitespace caused by appending EOS
2021-09-20	Merged PR 20593: Fix and update Azure pipelines	Roman Grundkiewicz
	- Add `--allow-unauthenticated` to `apt` when installing CUDA on Ubuntu - Removing `ubuntu-16.04` image from Azure pipelines, which will become unavailable after September 20
2021-09-16	Merged PR 20230: Add option for running regression tests only in Azure Pipelines	Roman Grundkiewicz
	This PR adds a checkbox which can be unchecked to skip running compilation checks when triggering them manually. It is useful for generating expected outputs on different CPUs for tests using 8-bit models.
2021-09-08	Factor concatenation improvements and documentation (#748)	Kenneth Heafield
	* concatenation combining option added when embeding using factors * crossMask not used by default * added an option to better clarify when choosing factor predictor options * fixed bug when choosing re-embedding option and not setting embedding size * avoid uncessary string copy * Check in factors documentation * Fix duplication in merge * Self-referential repository * change --factors-predictor to --lemma-dependency. Default behaviour changed. * factor related options are now stored with the model * Update doc/factors.md * add backward compability for the target factors * Move backward compatibility checks for factors to happen after the model.npz config is loaded * Add explicit error msg if using concat on target * Update func comments. Fix spaces * Add Marian version requirement * delete experimental code Co-authored-by: Pedro Coelho <pedrodiascoelho97@gmail.com> Co-authored-by: Pedro Coelho <pedro.coelho@unbabel.com> Co-authored-by: Roman Grundkiewicz <rgrundkiewicz@gmail.com>
2021-09-07	Add --allow-unauthenticated when installing CUDA (#878)	Roman Grundkiewicz

2021-09-02	Merged PR 20560: Update SPM in Marian	Rohit Jain
	Update SPM in Marian
2021-08-06	Merged PR 20070: Run regression tests in Azure Pipelines	Roman Grundkiewicz
	The changes proposed in this pull request: * Added regression testing with internal models into Azure Pipelines on both Windows and Ubuntu * Created https://machinetranslation.visualstudio.com/Marian/_git/marian-prod-tests (more tests will be added over time) * Made regression test outputs (all `.log`, `.out`, `.diff` files) available for inspection as a downloadable artifact. * Made `--build-info` option available in CMake-based Windows builds Warning: I tried to handle multiple cases, but some regression tests may occasionally fail, especially tests using avx2 or avx512 models, because the outputs are system/CPU dependent. I think it's better to merge this already, monitoring the stability of tests, and adding expected outputs variations if necessary, improving the coverage and stability of regression tests over time.
2021-08-04	allow float32 conversion in QS interface	Marcin Junczys-Dowmunt

2021-07-30	Merged PR 19761: Expose SPM Interface from Marian	Rohit Jain
	This PR adds interfaces in Marian to allow it to handle segmentation duties. Related work items: #121418
2021-07-23	Merged PR 19910: Fix training/scoring error with FSM	Martin Junczys-Dowmunt
	Fixes a dimension mismatch during training and scoring introduced in the decoding-only shortlist changes. Related work items: #122643
2021-07-22	Merged PR 19904: Update instructions for building on Windows	Roman Grundkiewicz
	Changes proposed in this pull request: 1. Clarified instructions how to build Marian on Windows in `vs/README.md`. 2. `vs/CheckOrInstallDeps.bat` does not stop if CUDA or MKL are not installed.
2021-07-22	Merged PR 19914: Fix Windows Azure Pipelines	Roman Grundkiewicz
	Updating vcpkg seems to fix the recent issues with Windows builds in Azure Pipelines.
2021-07-21	Merged PR 19864: add bias if it exists	Hieu Hoang
	Fixes backcompat with shortlist and bias.
2021-07-18	Merged PR 19860: Case augmented data, if not using factored vocab must not ↵	Rohit Jain
	set guided alignments This change allows marking SentenceTuples as 'altered', if they were generated or modified by data augmentation internally in such a way so as to impact processing. In particular, for such sentence tuples, we do not want to try setting guided alignments if the externally provided guided alignments might no longer be correct after that alteration.
2021-07-16	Merged PR 19842: Adapt LSH to work with Leaf	Martin Junczys-Dowmunt
	Small changes to make the LSH work with Leaf server and QuickSand.
2021-07-11	Binary shortlist (#856)	Qianqian Zhu
	Co-authored-by: Kenneth Heafield <github@kheafield.com>
2021-07-10	silence unreferenced formal parameter warning on windows	Marcin Junczys-Dowmunt