## [Unreleased]

### Breaking changes

### New features

* Introduce hook mechanism for additional customization of workflows
* Sentence-level negative log-likelihood criterion for sequence tagging
* `-` as a filename stands for standard input in the inference tools (translate, lm, tag)
* Optional per-request source features (e.g. for domain control) in the REST translation server
* Display the source and target OOV rates during translation
* Add `-log_tag` option to tag log lines for easier automated processing
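As a rough illustration of the hook mechanism listed above, a hook file could look like the Lua sketch below. The hook name `post_tokenize`, its signature, and the module layout are assumptions made for illustration only, not the documented OpenNMT hook API:

```lua
-- Hypothetical sketch of a hook module: a plain Lua table of functions
-- that the toolkit could call at fixed points in a workflow.
-- The hook name and signature below are illustrative assumptions.
local hooks = {}

-- Example hook: lowercase every token before further processing.
function hooks.post_tokenize(tokens)
  local out = {}
  for i, tok in ipairs(tokens) do
    out[i] = tok:lower()
  end
  return out
end

-- A real hook file would typically end with: return hooks
```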

### Fixes and improvements

* Miscellaneous fixes to lexically constrained beam search
* Fix the batch size option not being applied in `rest_translation_server.lua`
* Introduce `-tokenizer max` option in the scorer for evaluation on non-tokenized test data
* Fix non-deterministic inference of language models
* Fix language model sampling mode
* Fix retraining from a language model
* Fix `-update_vocab` option for language models
* Correct the error handling for all file open operations

## [v0.9.7](https://github.com/OpenNMT/OpenNMT/releases/tag/v0.9.7) (2017-12-19)

### Fixes and improvements

* Fix detokenization when replaced target tokens contain spaces

## [v0.9.6](https://github.com/OpenNMT/OpenNMT/releases/tag/v0.9.6) (2017-12-11)

### Fixes and improvements

* Protected sequence outputs correctly deserialize protected characters (%abcd)
* Bypass case management on protected sequences

## [v0.9.5](https://github.com/OpenNMT/OpenNMT/releases/tag/v0.9.5) (2017-12-07)

### Fixes and improvements

* Enable constrained beam search for protected sequences
* Fix invalid `NOERROR` log level (rename it to `NONE`)

## [v0.9.4](https://github.com/OpenNMT/OpenNMT/releases/tag/v0.9.4) (2017-11-30)

### Fixes and improvements

* Fix regression when normalizing protected sequences

## [v0.9.3](https://github.com/OpenNMT/OpenNMT/releases/tag/v0.9.3) (2017-11-30)

### Fixes and improvements

* Fix vocabulary extraction of protected sequences (#444)

## [v0.9.2](https://github.com/OpenNMT/OpenNMT/releases/tag/v0.9.2) (2017-11-27)

### Fixes and improvements

* Fix empty translation returned by the REST translation server
* Fix random split of protected sequences by BPE (#441)
* Fix error when using `-update_vocab` with additional word features

## [v0.9.1](https://github.com/OpenNMT/OpenNMT/releases/tag/v0.9.1) (2017-11-16)

### Fixes and improvements

* Fix missing normalization during translation
* Fix normalization when the command contains pipes
* Fix incorrect TER normalization (#424)
* Fix error when the file to translate contains empty lines

## [v0.9.0](https://github.com/OpenNMT/OpenNMT/releases/tag/v0.9.0) (2017-11-07)

### Breaking changes

* Learning rate is also decayed when using Adam
* Fix some incorrect tokenization rules (punctuation and numbers)
* `-report_every` option is renamed to `-report_progress_every`
* `-EOT_marker` option is renamed to `-bpe_EOT_marker` for `tokenize.lua`
* `-BOT_marker` option is renamed to `-bpe_BOT_marker` for `tokenize.lua`
* `bit32` package is now required for LuaJIT users

### New features

* Dynamic dataset to train on a large repository of raw training data
* Convolutional encoder
* Shallow fusion of language model in decoder
* Lexically constrained beam search
* TER validation metric
* Protection blocks for tokenization and placeholder support
* Hook to call external normalization
* JSON log formatting when the log file suffix is `.json`
* Training option to save the validation translation to a file
* Training option to reset the optimizer states when the learning rate is decayed
* Training option to update the vocabularies during a retraining
* Translation option to save alignment history
* Translation option to mark replaced tokens with `⦅unk:xxxxx⦆`
* Tokenization option to split numbers on each digit
* Multi-model REST server configured with a YAML file
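To make the multi-model server item concrete, a configuration file might look like the sketch below. Every key, value, and file name here is a hypothetical example for illustration, not the documented schema:

```yaml
# Hypothetical multi-model server configuration sketch.
# Key names and layout are assumptions, not the actual schema.
- model: demo-ende_epoch13.t7   # hypothetical English-German model file
  gpuid: 1
- model: demo-enfr_epoch13.t7   # hypothetical English-French model file
  gpuid: 2
```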

### Fixes and improvements

* Allow disabling gradient clipping with `-max_grad_norm 0`
* Allow disabling global parameters initialization with `-param_init 0`
* Introduce error estimation in scorer for all metrics
* Reduce memory footprint of Adam, Adadelta and Adagrad optimizers
* Make validation data optional for training
* Faster tokenization (up to x2 speedup)
* Fix missing final model with some values of `-save_every_epochs`
* Fix validation score delta that was applied in the incorrect direction
* Fix LuaJIT out of memory issues in `learn_bpe.lua`
* Fix documentation generation of embedded tokenization options
* Fix release of sequence tagger models

## [v0.8.0](https://github.com/OpenNMT/OpenNMT/releases/tag/v0.8.0) (2017-06-28)

### Breaking changes

* Models previously trained with `-pdbrnn` or `-dbrnn` are no longer compatible
* `-start_decay_ppl_delta` option is renamed to `-start_decay_score_delta`
* `-decay perplexity_only` option is renamed to `-decay score_only`

### Deprecations

* `-brnn`, `-dbrnn` and `-pdbrnn` options are replaced by `-encoder_type <type>` for future extensions
* `-sample_tgt_vocab` option is renamed `-sample_vocab` and is extended to language models

### New features

* Implement inference for language models for scoring or sampling
* Support variational dropout and dropout on source sequence
* Support several validation metrics: loss, perplexity, BLEU and Damerau-Levenshtein edit ratio
* Add option in preprocessing to check that lengths of source and target are equal (e.g. for sequence tagging)
* Add `-pdbrnn_merge` option to define how to reduce the time dimension
* Add option to segment mixed cased words
* Add option to segment words of given alphabets or when switching alphabets
* Add Google's NMT encoder
* Add external scorer script for BLEU and Damerau-Levenshtein edit ratio
* Add script to average multiple models
* Add option to save the beam search as JSON

### Fixes and improvements

* Support input vectors for sequence tagging
* Fix incorrect gradients when using variable length batches and bidirectional encoders

## [v0.7.1](https://github.com/OpenNMT/OpenNMT/releases/tag/v0.7.1) (2017-05-29)

### Fixes and improvements

* Fix backward compatibility with older models using target features
* Fix importance sampling when using multiple GPUs
* Fix language model training

## [v0.7.0](https://github.com/OpenNMT/OpenNMT/releases/tag/v0.7.0) (2017-05-19)

### Breaking changes

* `-sample_w_ppl` option is renamed `-sample_type` for future extensions

### New features

* Support vectors as inputs using [Kaldi](http://kaldi-asr.org/) input format
* Support parallel file alignment by index in addition to line-by-line
* Add script to generate pretrained word embeddings:
  * from [Polyglot](https://sites.google.com/site/rmyeid/projects/polyglot) repository
  * from pretrained *word2vec*, *GloVe* or *fastText* files
* Add an option to only fix the pretrained part of word embeddings
* Add a bridge layer between the encoder and decoder to define how encoder states are passed to the decoder
* Add `epoch_only` decay strategy to decay the learning rate based on epochs only
* Make the save frequency of epoch models configurable
* Optimize decoding and training with target vocabulary reduction (importance sampling)
* Introduce `partition` data sampling

### Fixes and improvements

* Improve command line and configuration file parser
  * space-separated list of values
  * boolean arguments
  * disallow duplicate command line options
  * clearer error messages
* Improve correctness of `DBiEncoder` and `PDBiEncoder` implementation
* Improve unicode support for languages using combining marks like Hindi
* Improve logging during preprocessing with dataset statistics
* Fix translation error of models profiled during training
* Fix translation error of models trained without attention
* Fix error when using one-layer GRU
* Fix incorrect coverage normalization formula applied during the beam search

## [v0.6.0](https://github.com/OpenNMT/OpenNMT/releases/tag/v0.6.0) (2017-04-07)

### Breaking changes

* `-fix_word_vecs` option now requires `0` or `1` as its argument for a better retraining experience

### New features

* Add new encoders: deep bidirectional and pyramidal deep bidirectional
* Add attention variants: no attention and *dot*, *general* or *concat* global attention
* Add alternative learning rate decay strategy for SGD training
* Introduce dynamic parameter change for dropout and fixed word embeddings
* Add length and coverage normalization during the beam search
* Add translation option to dump input sentence encoding
* Add TensorBoard metrics visualisation with [Crayon](https://github.com/torrvision/crayon)
* [*experimental*] Add sequence tagger model

### Fixes and improvements

* Check consistency of option settings when training from checkpoints
* Save and restore random number generator states from checkpoints
* Output more dataset metrics during the preprocessing
* Improve error message on invalid options
* Fix missing n-best hypotheses list in the output file
* Fix individual losses that were always computed when using random sampling
* Fix duplicated logs in parallel mode

## [v0.5.3](https://github.com/OpenNMT/OpenNMT/releases/tag/v0.5.3) (2017-03-30)

### Fixes and improvements

* Fix data loading during training

## [v0.5.2](https://github.com/OpenNMT/OpenNMT/releases/tag/v0.5.2) (2017-03-29)

### Fixes and improvements

* Improve compatibility with older Torch versions missing the `fmod` implementation

## [v0.5.1](https://github.com/OpenNMT/OpenNMT/releases/tag/v0.5.1) (2017-03-28)

### Fixes and improvements

* Fix translation with FP16 precision
* Fix regression that made `tds` mandatory for translation

## [v0.5.0](https://github.com/OpenNMT/OpenNMT/releases/tag/v0.5.0) (2017-03-06)

### New features

* Training code is now part of the library
* Add `-fallback_to_cpu` option to continue execution on CPU if GPU can't be used
* Add standalone script to generate vocabularies
* Add script to extract word embeddings
* Add option to prune vocabularies by minimum word frequency
* New REST server
* [*experimental*] Add data sampling during training
* [*experimental*] Add half floating point (fp16) support (with [cutorch@359ee80](https://github.com/torch/cutorch/commit/359ee80be391028ffa098de429cc0533b2f268f5))

### Fixes and improvements

* Make sure released models do not contain any serialized functions
* Reduce size of released BRNN models (up to 2x smaller)
* Reported metrics are no longer averaged over the entire epoch
* Improve logging in asynchronous training
* Allow fixing word embeddings without providing pre-trained embeddings
* Fix pretrained word embeddings that were overridden by parameter initialization
* Fix error when using translation server with GPU model
* Fix gold data perplexity reporting during translation
* Fix wrong number of attention vectors returned by the translator

## [v0.4.1](https://github.com/OpenNMT/OpenNMT/releases/tag/v0.4.1) (2017-02-16)

### Fixes and improvements

* Fix translation server error when clients send escaped unicode sequences
* Fix compatibility issue with the `:split()` function

## [v0.4.0](https://github.com/OpenNMT/OpenNMT/releases/tag/v0.4.0) (2017-02-10)

### Breaking changes

* New translator API for better integration

### New features

* Profiler option
* Support hypotheses filtering during the beam search
* Support individually setting features vocabulary and embedding size
* [*experimental*] Scripts to interact with the [benchmark platform](http://scorer.nmt-benchmark.net/)
* [*experimental*] Language modeling example

### Fixes and improvements

* Improve beam search speed (up to 90% faster)
* Reduce released model size (up to 2x smaller)
* Fix tokenization of text containing the joiner marker character
* Fix `-joiner_new` option when using BPE
* Fix joiner marker generated without the option enabled
* Fix translation server crash on Lua errors
* Fix error when loading configuration files containing the `gpuid` option
* Fix BLEU drop when applying beam search on some models
* Fix error when using asynchronous parallel mode
* Fix non-SGD model serialization after retraining
* Fix error when using `-replace_unk` with empty sentences in the batch
* Fix error when translating empty batch

## [v0.3.0](https://github.com/OpenNMT/OpenNMT/releases/tag/v0.3.0) (2017-01-23)

### Breaking changes

* Rename `-epochs` option to `-end_epoch` to clarify its behavior
* Remove `-nparallel` option and support a list of comma-separated identifiers on `-gpuid`
* Rename `-sep_annotate` option to `-joiner_annotate`

### New features

* ZeroMQ translation server
* Advanced log management
* GRU cell
* Tokenization option to make the token separator an independent token
* Tokenization can run in parallel mode

### Fixes and improvements

* The Zero-Width Joiner Unicode character (ZWJ) is now tokenized, but as a joiner
* Fix Hangul tokenization
* Fix duplicated tokens in aggressive tokenization
* Fix error when using BRNN and multiple source features
* Fix error when preprocessing empty lines and using additional features
* Fix error when translating empty sentences
* Fix error when retraining a BRNN model on multiple GPUs

## [v0.2.0](https://github.com/OpenNMT/OpenNMT/releases/tag/v0.2.0) (2017-01-02)

### Breaking changes

* `-seq_length` option is split into `-src_seq_length` and `-tgt_seq_length`

### New features

* Asynchronous SGD
* Detokenization
* BPE support in tokenization

### Fixes and improvements

* Smaller memory footprint during training
* Smaller released model size after non-SGD training
* Fix out of memory errors in preprocessing
* Fix BRNN models serialization and release
* Fix error when retraining a model
* Fix error when using more than one feature

## [v0.1.0](https://github.com/OpenNMT/OpenNMT/releases/tag/v0.1.0) (2016-12-19)

Initial release.