* Rename TranslatorPool -> Translator and GeneratorPool -> Generator
* Fix option help
* Fix include
* Add ModelLoader helper class
* Reduce diff
* Remove options normalize_scores and allow_early_exit
* Update test output
* Also apply penalties in greedy search to get consistent scores
* Remove commented code
* Integrate the Whisper model from OpenAI
* Reformat test file
* Install cuDNN devel package
* Return Numpy tensors in example
* Push missing __init__.py file
* Add datasets module in test requirements
* Add decoded audio test file in the repo
* Enable REORDER primitive
* Add Conv1D operator and layer
* Fix typo in CMakeLists.txt
* Skip the tests when the backend is not available
* Add a separate CMake option for cuDNN
* Minor code cleanup
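For reference, a valid-mode 1D convolution over a single channel reduces to a sliding dot product. This is a minimal sketch of the operation, not CTranslate2's Conv1D implementation (which also handles channels, padding, and bias):

```python
def conv1d(signal, kernel, stride=1):
    """Valid-mode 1D convolution (really cross-correlation, as in most
    deep learning frameworks): slide the kernel over the input and take
    the dot product at each position."""
    k = len(kernel)
    return [
        sum(signal[i + j] * kernel[j] for j in range(k))
        for i in range(0, len(signal) - k + 1, stride)
    ]

# A [1, -1] kernel computes a discrete difference.
out = conv1d([1.0, 2.0, 3.0, 4.0], [1.0, -1.0])
```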
* Generalize disable_unk implementation to support a list of tokens
* Update test
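Banning an arbitrary list of token ids (rather than only `<unk>`) is typically implemented by masking the logits of every banned id to -inf before the softmax, so neither greedy search nor sampling can select them. A minimal sketch; the helper name is illustrative, not CTranslate2's API:

```python
import math

def suppress_tokens(logits, banned_ids):
    """Return a copy of the logits with the banned token ids set to -inf.

    After masking, softmax assigns zero probability to the banned tokens,
    so they can never be selected during decoding.
    """
    masked = list(logits)
    for token_id in banned_ids:
        masked[token_id] = -math.inf
    return masked

logits = [1.0, 3.0, 2.0, 0.5]
masked = suppress_tokens(logits, banned_ids=[1, 3])
# The best remaining token is now id 2 (id 1 was banned).
best = max(range(len(masked)), key=masked.__getitem__)
```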
* Get more candidates from topk to replace finished hypotheses
* Handle the case where the number of candidates exceeds the vocabulary size
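The usual approach is to take more than `beam_size` candidates from the top-k (commonly 2 × beam_size, capped at the vocabulary size, which is the edge case the second bullet addresses) so that candidates ending in EOS can be replaced by live hypotheses. A rough illustration under those assumptions; the function name is hypothetical:

```python
def pick_live_candidates(scores, beam_size, eos_id):
    """Refill the beam from an oversampled top-k.

    Taking min(2 * beam_size, vocab_size) candidates guarantees enough
    non-EOS entries to keep beam_size hypotheses alive even if the very
    best candidates are finished.
    """
    k = min(2 * beam_size, len(scores))
    top = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)[:k]
    live = [i for i in top if i != eos_id]
    return live[:beam_size]

# Token id 1 (EOS) has the best score, but the beam is refilled
# with the next-best live candidates.
beam = pick_live_candidates([0.1, 0.9, 0.8, 0.7], beam_size=2, eos_id=1)
```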
* Improve support of restricted vocabulary in decoding
* Move mapping to the decoder
* Check before updating ids
* Verify that prefix tokens are included in the restricted vocab
* Restore comment
* Add option to not expand unlikely alternatives
* Reword
* Revert docstring change
* Simply resize state
* Add option no_repeat_ngram_size to prevent repetitions of ngrams
* Remove debug argument
* Update blocks/threads configuration
* Implement a more generic primitive
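No-repeat-ngram blocking works by banning, at each step, any token that would complete an ngram already present in the generated sequence (the banned ids then get their logits masked to -inf). A sketch of the candidate-collection step, assuming the standard algorithm; not CTranslate2's kernel:

```python
def banned_ngram_tokens(sequence, no_repeat_ngram_size):
    """Token ids that would complete an already-generated ngram.

    If the last (n - 1) generated tokens match the prefix of an earlier
    ngram, the token that previously followed that prefix is banned.
    """
    n = no_repeat_ngram_size
    if n == 0 or len(sequence) < n - 1:
        return set()
    prefix = tuple(sequence[-(n - 1):]) if n > 1 else ()
    banned = set()
    for i in range(len(sequence) - n + 1):
        if tuple(sequence[i:i + n - 1]) == prefix:
            banned.add(sequence[i + n - 1])
    return banned

# With n = 2 and history [5, 7, 5], emitting 7 would repeat the
# bigram (5, 7), so 7 is banned.
banned = banned_ngram_tokens([5, 7, 5], no_repeat_ngram_size=2)
```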
* Factorize layer creation
* Fix function signature
* Add tests
* Minor diff cleanup
* Define float16_t type in public headers
The type was hidden in commit
https://github.com/OpenNMT/CTranslate2/commit/b3038b859ed014058033d576157dcf4ad4ece9e4,
but it is actually required by some public functions.
* Cleanup install rule
* Fix crash of scoring methods on empty inputs
* Fix target check
* Define ModelReplica to dissociate the model runtime from the weights
* Keep cached attributes
* Fix diff
* Remove duplicated include
* Cache output layer transformations when possible
* Update removal of excluded ids
* Fix the case where the output size is unchanged but the excluded ids changed
* Minor cleanup
* Manage empty inputs in sample method
* Cleanup JobCreator methods
* Update test
* Also handle return_alternatives case in sample method
* Improve correctness of the scoring output
* Include score of EOS token
* Return the actual tokens that were scored (after vocabulary lookup
and sequence truncation)
* Update doc
At the moment, only the decoding logic cannot run in batch mode.
* Add Swish activation
* Dispatch inside parallel for
* Remove benchmark code
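For reference, Swish is x · sigmoid(x) (also known as SiLU); a scalar sketch of the activation:

```python
import math

def swish(x):
    """Swish activation: x * sigmoid(x), also known as SiLU."""
    return x * (1.0 / (1.0 + math.exp(-x)))

# Unlike ReLU, Swish is smooth and slightly non-monotonic: small
# negative inputs yield small negative outputs instead of being
# clipped to zero.
```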
* Round value before cast in quantization
* Update quantization formula in documentation
* Add missing variables in lambda capture
* Save binary version in named variable
* Defer implicit cast to value assignment
* Fix Python test
* Remove unneeded changes
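Rounding before the cast matters because a plain float-to-int cast truncates toward zero, which systematically biases the quantized values (e.g. 0.9 · scale would truncate to 0 instead of rounding to the nearest level). A sketch of symmetric int8 quantization with the rounded formula; the scale computation here is illustrative, not necessarily CTranslate2's exact formula:

```python
def quantize_int8(values):
    """Symmetric int8 quantization: q = clamp(round(x * scale)).

    Rounding before the integer conversion avoids the truncation
    bias of a bare cast.
    """
    amax = max(abs(v) for v in values) or 1.0
    scale = 127.0 / amax
    quantized = [max(-127, min(127, round(v * scale))) for v in values]
    return quantized, scale

q, scale = quantize_int8([0.5, -1.0, 0.9])
```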
* Generalize batching to support additional input streams
* Use vector as argument
* Add a CUDA kernel for Multinomial op
The current implementation can only return a single sample per batch.
* Install libcurand-devel to build the Python wheels
* Reduce multi-head attention based on options alignment_{layer,heads}
* Factorize number of threads
* Improve variable name
* Improve implementation of the repetition penalty
* Do not support repetition penalty with vmap for now
* Apply penalty on logits during greedy search to match HF implementation
* Also apply penalty on logits during beam search for consistency
* Add test case
* Cleanup test case
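For reference, the repetition penalty as implemented in Hugging Face Transformers divides positive logits of previously generated tokens by the penalty and multiplies negative ones, so both directions make the token less likely when the penalty is greater than 1. A sketch of that formulation:

```python
def apply_repetition_penalty(logits, generated_ids, penalty):
    """Penalize previously generated tokens (HF-style formulation).

    A positive logit is divided by the penalty and a negative logit is
    multiplied by it; with penalty > 1 both become less likely.
    """
    out = list(logits)
    for token_id in set(generated_ids):
        if out[token_id] > 0:
            out[token_id] /= penalty
        else:
            out[token_id] *= penalty
    return out

# Tokens 0 and 1 were already generated; token 2 is untouched.
penalized = apply_repetition_penalty([2.0, -1.0, 0.5], [0, 1], penalty=2.0)
```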
* Add repetition penalty in beam search
* Fix test compilation
* Remove parallel for in CPU primitive
The normalized score was incorrect in the following case:
* in greedy search when `max_decoding_length` is reached
* in `return_alternatives` mode
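Assuming the normalized score is the cumulative log-probability divided by the number of scored tokens (the exact length penalty may differ), the failure mode in both cases above is dividing by the wrong length. A sketch of the intended computation:

```python
import math

def normalized_score(token_log_probs):
    """Length-normalized score: mean log-probability per token.

    The denominator must be the number of tokens actually scored;
    using a stale step count (e.g. when decoding stops early at
    max_decoding_length) yields an incorrect normalized score.
    """
    return sum(token_log_probs) / len(token_log_probs)

score = normalized_score([math.log(0.5), math.log(0.25)])
```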
The in-place implementation should only be selected when the ids are
strictly increasing, not just increasing.
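Assuming this refers to a gather-style selection done in place, the distinction matters because a repeated index lets an earlier write clobber a slot that a later step still needs to read. A predicate for the safety check; the function name is hypothetical:

```python
def can_gather_in_place(ids):
    """In-place gather is safe only for strictly increasing ids.

    With merely non-decreasing ids such as [0, 0, 1], step 1 writes
    slot 1 with a copy of slot 0 before step 2 reads slot 1, so the
    third output is corrupted; strictly increasing ids read each
    source slot before it can be overwritten.
    """
    return all(a < b for a, b in zip(ids, ids[1:]))
```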
* Add CUDA implementation for the Tile operator
* Simplify shape manipulation in expand_to_beam_size
* Cleanup index computation
* Add a scoring API
* Add missing include
* Adapt initial state for non iterative decoding
* Add missing include
* Pass missing argument
* Revert ScopedDeviceSetter changes in this branch
* Add methods to get the aggregated score in ScoringResult
* Rename function to get_results_from_futures
* Update profile name to match function name
* Move check in while loop condition
* Factorize readers function
* Factorize Python callables wrapper
* Assign default values to sample method arguments
* Add JobCreator class to specialize job creation
* Factorize logic of job creation
* Return statistics from file scoring
* Fix indent
* Fix indent
* Update class comments
The SoftMax op should not assume how to broadcast the length vector.
* Add a translation wrapper that buffers and batches incoming inputs
* Notify outside the lock
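A wrapper like this typically buffers incoming requests and flushes them as one batch when a size (or timeout) threshold is reached, notifying waiters outside the lock to avoid contention. A single-threaded sketch of the size-based flush only; class and method names are hypothetical:

```python
class BatchBuffer:
    """Collect incoming items and process them in batches.

    Real implementations also flush on a timeout and coordinate with
    worker threads; this sketch shows only the size-based trigger.
    """
    def __init__(self, max_batch_size, process):
        self.max_batch_size = max_batch_size
        self.process = process  # callback receiving a full batch
        self.pending = []

    def put(self, item):
        self.pending.append(item)
        if len(self.pending) >= self.max_batch_size:
            self.flush()

    def flush(self):
        if self.pending:
            batch, self.pending = self.pending, []
            self.process(batch)

batches = []
buf = BatchBuffer(max_batch_size=2, process=batches.append)
for item in ["a", "b", "c"]:
    buf.put(item)
buf.flush()  # drain the final partial batch
```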
This reverts commit 6f949c07991a41a2663008201d3ffb225276e753.
In some cases this can improve the beam search output.