This PR turns the LSH index and search into a set of operators that live in the expression graph. This makes creation etc. thread-safe (one index per graph) and allows us to implement GPU versions later.
It also allows the LSH index to be mmapped as a Marian parameter, since we now only need to turn the index into something that can be saved to disk using the existing tensors. This happens in marian_conv or in the equivalent interface function of the Quicksand interface.
|
* Doxygen structure for expression graph operators
* Document arithmetic expression operations
* Document comparison expression operations
* Document exp/log and trig operations
* Add missing implementation for cos/tan
* Document expression manipulation operations
* Document misc math operations
* Overview of operators
* Document activation functions
* Document element-wise min/max
* Document debugging/checkpoint operators
* Document topk/argmin/argmax operations
* Document index-based operations
* Document reduction operations
* Document lambda expression operators
* Document product operations
* Document softmax, cross-entropy, unlikelihood operations
* Document dropout operations
* Document scalar product and weighted average operations
* Document layer normalization, highway and pooling operations
* Document shift expression operator
* Extra details on rules for adding specializations to .inc files
* Add SinNodeOp example for specialization documentation
* Additional details in tensor operator documentation
* Remove brief command from doxygen comments
* Prefer @-style Doxygen commands over \ style (see the example after this list)
* Document n-ary function macros
* Enable .cu and .inc files in documentation
* Add a comment about ONNX mapping
* Remove empty lines in doxygen
* Update CHANGELOG
Co-authored-by: Roman Grundkiewicz <rgrundkiewicz@gmail.com>
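As an illustration of the @-style comments preferred above, a minimal sketch modeled on the SinNodeOp example mentioned in this list; the exact signature is assumed here rather than taken from the Marian headers:

```cpp
/**
 * Computes the element-wise sine of an expression.
 *
 * @param a the input expression
 * @return an expression whose forward value is sin(a), element-wise
 */
Expr sin(Expr a);
```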
|
This PR refactors the training graph groups and optimizers to enable and simplify fp16 support.
It deprecates old, unused graph groups and fixes a couple of MPI issues.
|
This branch adds functionality to export ONNX models (with limitations).
|
These are minor comments/fixes I found while working on my ONNX prototype; it would be good to get them out of the way.
|
LSH-based short-list replacement
* Add tuple nodes via views and trickery
* Add `topk` operator, currently unused outside unit tests (its semantics are sketched after this list)
* Add `abs` operator, currently unused outside unit tests
* Change the return type of `Node::allocate()` to `void`. It used to return the number of allocated elements, but that value wasn't used anywhere; to avoid future confusion between elements and bytes, it has been removed for now.
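For reference, a standalone sketch of the `topk` semantics (values and original indices of the k largest entries); this only illustrates what the operator computes and is not the Marian implementation:

```cpp
#include <algorithm>
#include <cstdio>
#include <numeric>
#include <vector>

// Return the values and original indices of the k largest elements of x.
void topk(const std::vector<float>& x, int k,
          std::vector<float>& vals, std::vector<int>& idxs) {
  idxs.resize(x.size());
  std::iota(idxs.begin(), idxs.end(), 0);          // 0, 1, ..., n-1
  std::partial_sort(idxs.begin(), idxs.begin() + k, idxs.end(),
                    [&](int i, int j) { return x[i] > x[j]; });
  idxs.resize(k);
  vals.clear();
  for(int i : idxs)
    vals.push_back(x[i]);
}

int main() {
  std::vector<float> x = {0.1f, 3.0f, -1.0f, 2.5f};
  std::vector<float> vals;
  std::vector<int> idxs;
  topk(x, /*k=*/2, vals, idxs);
  for(size_t i = 0; i < vals.size(); ++i)
    std::printf("top-%zu: x[%d] = %g\n", i + 1, idxs[i], vals[i]);
}
```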
|
* Clear the cache for the RNN object in the transformer; otherwise a stale tensor might be kept around.
* Add missing `hash()` and `equal()` functions everywhere (the usual pattern is sketched after this list).
* Fix a bug found in the deployment test.
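The usual pattern behind those `hash()`/`equal()` additions, sketched with illustrative names (this is not the actual Marian node class):

```cpp
#include <functional>

// A node op with one extra attribute. hash() mixes node-specific state into
// the structural hash over the children so identical subexpressions can be
// found and reused; equal() resolves hash collisions by comparing state.
struct FakeNodeOp {
  size_t childrenHash; // stand-in for the base-class hash over child nodes
  float scalar;        // node-specific state that must enter hash/equal

  size_t hash() const {
    size_t seed = childrenHash;
    // boost-style hash_combine of the extra attribute
    seed ^= std::hash<float>()(scalar) + 0x9e3779b9 + (seed << 6) + (seed >> 2);
    return seed;
  }

  bool equal(const FakeNodeOp& other) const {
    return childrenHash == other.childrenHash && scalar == other.scalar;
  }
};
```

If either function is missing, two nodes that differ only in `scalar` may hash and compare as equal, so graph deduplication can silently reuse the wrong node.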
|
Search must reshape the first step correctly.
|
This PR adds the `logsumexp()` reduction, that is,
y = log(sum_j exp(x_j))
With this, `logsoftmax(z, ax)` can now be written as `z - logsumexp(z, ax)`. I need this for factored projections.
The PR merges the near-duplicates `sum()` and `mean()` into a single `ReduceNodeOpCode`, which I extended, for good measure, to implement additional reductions as well. Since we now need reduction operations besides the sum, this PR changes the current `functional::Add()` operation into a `functional::Aggregate()` operation that takes a second `Functor` for the reduction operation.
This made it straightforward to implement a whole range of reduction operations (the names match NumPy):
* `sum()`
* `mean()`
* `std()`
* `var()`
* `min()`
* `max()`
* `logsumexp()`
I just noticed that I forgot the gradient for `prod()`.
Operator tests have been added and pass.
NOTE: There are no gradient tests. Please review the gradients carefully. I will test `logsumexp()` by replacing `logsoftmax` with the formula above in training; a standalone, numerically stable sketch of the reduction follows below.
Related work items: #98143
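A standalone, numerically stable sketch of the reduction: shifting by the maximum avoids overflow in exp and computes y = m + log(sum_j exp(x_j - m)) with m = max_j x_j, which equals log(sum_j exp(x_j)). The identity from above then gives logsoftmax directly. This is an illustration, not the Marian kernel:

```cpp
#include <algorithm>
#include <cmath>
#include <cstdio>
#include <vector>

// Numerically stable logsumexp over a vector.
float logsumexp(const std::vector<float>& x) {
  float m = *std::max_element(x.begin(), x.end());
  float sum = 0.f;
  for(float v : x)
    sum += std::exp(v - m); // exponents are <= 0, no overflow
  return m + std::log(sum);
}

int main() {
  std::vector<float> z = {1.f, 2.f, 3.f};
  float lse = logsumexp(z);
  // logsoftmax(z) == z - logsumexp(z), the identity used in this PR
  for(float v : z)
    std::printf("logsoftmax: %g\n", v - lse);
}
```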
|
(axis before arg)
|
Merge from https://machinetranslation.visualstudio.com/DefaultCollection/Marian/_git/marian-dev into fseide/factoredembeddings
|
them to RowsNodeOp or ColsNodeOp;
tests updated accordingly;
bug fix: missed an axis normalization;
bug fix: ReshapeNodeOp should pass on the value_type so as to allow reshaping IndexType tensors
|
bug fix: SliceViewNodeOp should use the correct size for its memory piece;
new operation stopGradient() (its semantics are sketched below)
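A sketch of the expected `stopGradient()` semantics, assuming it behaves like the stop-gradient operation in other toolkits (identity in the forward pass, no gradient to the child in the backward pass); this is an illustration, not the Marian node:

```cpp
#include <vector>

// Toy forward/backward pair illustrating stop-gradient behavior.
struct StopGradient {
  // Forward: the value passes through unchanged.
  std::vector<float> forward(const std::vector<float>& in) { return in; }

  // Backward: the incoming adjoint is deliberately not propagated.
  std::vector<float> backward(const std::vector<float>& adj) {
    return std::vector<float>(adj.size(), 0.f); // gradient is cut here
  }
};
```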
|
minibatches are now fed in GPU-sized chunks rather than as one massive joint batch for all GPUs per update;
Adam hyper-parameter adjustment is limited to the learning rate, as momentum adjustment is counterproductive for minibatch-size scaling;
log output now includes the last batch size;
log output now shows the current best for stalled validation metrics;
bug fix: the Adam optimizer should persist its denominators (a sketch of the Adam state follows below);
bug fix: Adam and Adagrad should use the correct element size when persisting;
min and max renamed to minimum and maximum, for consistency with other toolkits;
Pathie now compiles in the manual VS project
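For context on the "persist denominators" item: the standard Adam update keeps two running accumulators per parameter, and the second moment is the denominator in the update rule, so losing it across a checkpoint restart changes the effective per-parameter learning rates. A minimal sketch of the standard update (not the Marian optimizer code):

```cpp
#include <cmath>
#include <vector>

// Standard Adam step. The state mt (first moment) and vt (second moment,
// the "denominator") must be saved with checkpoints and restored on resume.
struct Adam {
  float lr = 1e-3f, beta1 = 0.9f, beta2 = 0.999f, eps = 1e-8f;
  std::vector<float> mt, vt; // optimizer state to persist
  long long t = 0;           // step counter for bias correction

  void step(std::vector<float>& w, const std::vector<float>& g) {
    if(mt.empty()) { mt.assign(w.size(), 0.f); vt.assign(w.size(), 0.f); }
    ++t;
    for(size_t i = 0; i < w.size(); ++i) {
      mt[i] = beta1 * mt[i] + (1.f - beta1) * g[i];
      vt[i] = beta2 * vt[i] + (1.f - beta2) * g[i] * g[i];
      float mHat = mt[i] / (1.f - std::pow(beta1, (float)t));
      float vHat = vt[i] / (1.f - std::pow(beta2, (float)t));
      w[i] -= lr * mHat / (std::sqrt(vHat) + eps);
    }
  }
};
```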