diff options
author | Marcin Junczys-Dowmunt <marcinjd@microsoft.com> | 2021-11-25 05:42:14 +0300 |
---|---|---|
committer | Marcin Junczys-Dowmunt <marcinjd@microsoft.com> | 2021-11-25 05:42:14 +0300 |
commit | bbc673c50fbf2faa90bdc44003d15087632262bc (patch) | |
tree | df64393365a3052ffa25d5078631390848384da3 | |
parent | 8b8d1b11e28a421b348703d702c9c5206061df9d (diff) |
update CHANGELOG and VERSION
-rw-r--r-- | CHANGELOG.md | 6 | ||||
-rw-r--r-- | VERSION | 2 |
2 files changed, 6 insertions, 2 deletions
diff --git a/CHANGELOG.md b/CHANGELOG.md index 05658fe1..bce24cfc 100644 --- a/CHANGELOG.md +++ b/CHANGELOG.md @@ -9,7 +9,11 @@ and this project adheres to [Semantic Versioning](http://semver.org/spec/v2.0.0. ## [Unreleased] ### Added -- Adds option --add-lsh to marian-conv which allows the LSH to be memory-mapped. +- Parallelized data reading with e.g. `--data-threads 8` +- Top-k sampling during decoding with e.g. `--output-sampling topk 10` +- Improved mixed precision training with `--fp16` +- Set FFN width in decoder independently from encoder with e.g. `--transformer-dim-ffn 4096 --transformer-decoder-dim-ffn 2048` +- Adds option `--add-lsh` to marian-conv which allows the LSH to be memory-mapped. - Early stopping based on first, all, or any validation metrics via `--early-stopping-on` - Compute 8.6 support if using CUDA>=11.1 - Support for RMSNorm as drop-in replace for LayerNorm from `Biao Zhang; Rico Sennrich (2019). Root Mean Square Layer Normalization`. Enabled in Transformer model via `--transformer-postprocess dar` instead of `dan`. @@ -1,2 +1,2 @@ -v1.10.24 +v1.10.42 |