Welcome to mirror list, hosted at ThFree Co, Russian Federation.

github.com/marian-nmt/marian.git - Unnamed repository; edit this file 'description' to name the repository.
summaryrefslogtreecommitdiff
path: root/src
AgeCommit message (Collapse)Author
2021-04-29start broadcastHieu Hoang
2021-04-29move cache variables into shortlist classHieu Hoang
2021-04-29restartHieu Hoang
2021-04-28Remove unused memoized_ variable (#852)Kenneth Heafield
2021-04-26Merged PR 18612: Early stopping on first, all, or any validation metricsRoman Grundkiewicz
Adds `--early-stopping-on first|all|any` allowing to decide if early stopping should take into account only first, all, or any validation metrics. Feature request: https://github.com/marian-nmt/marian-dev/issues/850 Regression tests: https://github.com/marian-nmt/marian-regression-tests/pull/79
2021-04-21Merge branch 'master' of github.com:marian-nmt/marian-dev into pmasterMarcin Junczys-Dowmunt
2021-04-20fix depth-scaling in FFNMarcin Junczys-Dowmunt
2021-04-14Compute tensor size using integers (#851)Kenneth Heafield
2021-04-11Merge branch 'master' of ↵Marcin Junczys-Dowmunt
vs-ssh.visualstudio.com:v3/machinetranslation/Marian/marian-dev
2021-04-11Merge branch 'pmaster'Marcin Junczys-Dowmunt
2021-04-11fix beam_search ABORT when enable openmp and OMP_NUM_THREADS > 1 (#767)huangjq0617
2021-04-10Merged PR 18505: RMSNorm on GPUMartin Junczys-Dowmunt
Support for RMSNorm as drop-in replace for LayerNorm from _Biao Zhang; Rico Sennrich (2019). Root Mean Square Layer Normalization_. Enabled in Transformer model via `--transformer-postprocess dar` instead of `dan`.
2021-04-09synced with public masterMarcin Junczys-Dowmunt
2021-04-09Adds better Affine support for GPUs when using CUDA 11. Introduces a new ↵rhenry-nv
bias addition kernel for CUDA < 11 (#778) Co-authored-by: Marcin Junczys-Dowmunt <marcinjd@microsoft.com>
2021-04-08Revert "remove TC_MALLOC from optional dependencies (#840)"Marcin Junczys-Dowmunt
This reverts commit 096c48e51cd2e61bb275345d7cca99cbfd6bc5c7.
2021-03-31Merged PR 18366: Fix generation of special control characters for default ↵Rohit Jain
vocabulary This PR extends the --allow-special feature to default vocabulary items as well. If the default vocabulary is provided with symbols ostensibly generated from the SentencePiece Byte Fallback mechanism, we suppress the control characters from that list.
2021-03-26Merged PR 18309: Cleaner suppression of unwanted output wordsMartin Junczys-Dowmunt
This PR adds cleaner suppression of unwanted output words. We identified a situation where SPM with byte-fallback can generate random bytes with output-sampling. That is particularly harmful when that random bytes happens to be a newline symbol. Here we suppress newline in output unless explicitly wanted.
2021-03-24Properly copy the entire vector in the int16_t case (#845)Nikolay Bogoychev
Fixes #842 #843 #844
2021-03-22Fix FBGEMM build with gcc 9.3+ (#836)Young Jin Kim
2021-03-22Display decoder speed statistics with --stat-freq N (#841)Marcin Junczys-Dowmunt
Display decoder time statistics if requested
2021-03-22remove TC_MALLOC from optional dependencies (#840)Marcin Junczys-Dowmunt
There seems to be no benefit from TC_MALLOC any more, hence removing.
2021-03-22Merge branch 'master' into pmasterMarcin Junczys-Dowmunt
2021-03-19Fix model loading on architectures where size_t is 32bits (#825)Nikolay Bogoychev
* fix model loading on architectures where size_t is 32bit * Update the changelog Co-authored-by: Roman Grundkiewicz <rgrundkiewicz@gmail.com>
2021-03-19Add simple unit tests for binary files (#826)Marcin Junczys-Dowmunt
* unit tests for binary file operations * adjust changelog * Set file_ in TemporaryFile for MSVC Co-authored-by: Roman Grundkiewicz <rgrundkiewicz@gmail.com>
2021-03-19Fix broken links to MNIST data sets (#838)Roman Grundkiewicz
2021-03-19Merged PR 18232: Update VS CMake builds and scriptsRoman Grundkiewicz
This PR updates Windows build using Visual Studio CMake compilation with Ninja. It does not affect standard VS compilation or Windows builds on Azure/GitHub CI. List of changes: - Fixed syntax in the script installing dependencies via vcpkg. - Removed installation of Protobuf (already included as a submodule) and Boost 1.72 (the previous solution no longer works with new vcpkg). - Disabled compilation of marian-server in the default setting due to Boost issues. - Disabled compilation of NCCL in the default setting due to an error (see comment in the code). - Updated vs/README.
2021-03-18sync public and internal masterMarcin Junczys-Dowmunt
2021-03-18Merged PR 18185: Support for Microsoft legacy binary shortlistMartin Junczys-Dowmunt
Adds support for Microsoft-internal binary shortlist format.
2021-03-12Fix missing float template specialisation for elem::Plus (#822)Graeme
* Fix missing float template specialisation for elem::Plus * Update CHANGELOG.md
2021-03-12Fix fallback to default paths in MNIST example (#821)Graeme
If --train-sets or --valid-sets are not provided, the fallback to the hard-coded paths does not occur. This commit requires that these entities have a non-empty value.
2021-03-08Update formattingRoman Grundkiewicz
2021-03-06clang-format -iHieu Hoang
2021-03-05add .hHieu Hoang
2021-03-04add logits.cppHieu Hoang
2021-03-04more code from .h -> .cppHieu Hoang
2021-03-04costs.cppHieu Hoang
2021-03-04move logits to its own fileHieu Hoang
2021-03-04move logits to its own fileHieu Hoang
2021-03-04move output to its own fileHieu Hoang
2021-03-04move embedding to its own fileHieu Hoang
2021-03-04chmod -xHieu Hoang
2021-03-03Fix OMP compilation (#824)Kenneth Heafield
* Fix omp variable names
2021-03-02Update simple websocket server (#823)Roman Grundkiewicz
* Update simple-websocket-server submodule * Update VERSION
2021-03-02merge with internal masterMarcin Junczys-Dowmunt
2021-02-28Add graph operations documentation (#801)Graeme
* Doxygen structure for expression graph operators * Document arithmetic expression operations * Document comparison expression operations * Document exp/log and trig operations * Add missing implementation for cos/tan * Document expression manipulation operations * Document misc math operations * Overview of operators * Document activation functions * Document element-wise min/max * Document debugging/checkpoint operators * Document topk/argmin/argmax operations * Document index-based operations * Document reduction operations * Document lambda expression operators * Document product operations * Document softmax, cross-entropy, unlikelihood operations * Document dropout operations * Document scalar product and weighted average operations * Document layer normalization, highway and pooling operations * Document shift expression operator * Extra details on rules for adding specializations to .inc files * Add SinNodeOp example for specialization documentation * Additional details in tensor operator documentation * Remove brief command from doxygen comments * Prefer @ style doxygen functions to \ * Document n-ary function macros * Enable .cu and .inc files in documentation * Add a comment about ONNX mapping * Remove empty lines in doxygen * Update CHANGELOG Co-authored-by: Roman Grundkiewicz <rgrundkiewicz@gmail.com>
2021-02-28Add graph documentations (#788)Qianqian Zhu
* add API docs for expression_graph.h * change API docs to doxygen-readable format * add API docs for node_initializers * update doxygen configure file * add hyperlinks and remove layers section from graph documentation * fixing typos and links on graph doc
2021-02-22Update simple-websocket-server to the latest version (#799)Rihards Krišlauks
This adds support for boost 1.75
2021-02-12do not do dropout at inferenceMarcin Junczys-Dowmunt
2021-02-12enable marian-tgz building for non-static build, fix small dropout bugMarcin Junczys-Dowmunt
2021-02-08Merged PR 17430: Refactors MPI interfaces and adds different types of ↵Martin Junczys-Dowmunt
gradient exchanges * Refactors MPI-related code * Adds node-local updates with occasional inter-node updates * decouples batch-reading across nodes