github.com/marian-nmt/FBGEMM.git - Unnamed repository; edit this file 'description' to name the repository.

Age	Commit message (Expand)	Author
2020-05-21	Remove an unnecessary memory allocationyouki/improve-mem-alloc	Young Jin Kim
2020-05-21	Add the missing header for std::runtime_error (#379)	Jianyu Huang
2020-05-18	Drop extraneous "src" from local includes (#377)	Nikita Shulga
2020-05-17	Float16 rowwise sparse adagrad with stochastic rounding (reattempt of D215191...	Yong Wu
2020-05-16	Back out "Float16 rowwise sparse adagrad with stochastic rounding"	Natalia Gimelshein
2020-05-16	Float16 rowwise sparse adagrad with stochastic rounding (#370)	Yong Wu
2020-05-15	Add conv_1d (#369)	Hongzhang Shan
2020-05-14	Minor improvements in GEMM Kernels (#368)	Daya Khudia
2020-05-14	dirsync fbgemm with xplat	Jongsoo Park
2020-05-13	fix benchmark to consider offsets is now default instead of lengths (#373)	Jongsoo Park
2020-05-13	use lea to simplify code (#372)	Jongsoo Park
2020-05-12	minor fix (#371)	Hongzhang Shan
2020-05-06	unified conv to call dw conv with 2 oc per g (#360)	Jongsoo Park
2020-05-06	depthwise convolution with 2 output channels per group (#359)	Jongsoo Park
2020-05-04	Fix convert indexing bugs (#367)	Pawel Garbacki
2020-05-01	use vreg.ymm() instead of ymm(vreg.id()) (#361)	Jongsoo Park
2020-05-01	remove one-stage interface for sparse adagrad (#366)	Jongsoo Park
2020-04-30	implement L2 regularization for Adagrad in fbgemm (#365)	Jongsoo Park
2020-04-30	asmjit submodule update (#364)	Daya Khudia
2020-04-23	Zero point addition after rounding in quantization routines (#362)	Daya Khudia
2020-04-14	Move `transpose_simd` to TransposeUtils.cc (#352)	Nikita Shulga
2020-04-14	move multiplication with b_zp from jit'ed code to requantization (#358)	Jongsoo Park
2020-04-13	minor changes in UniConvTest and GConvTest (#357)	Jongsoo Park
2020-04-13	simplify transposeConvWeights (#356)	Jongsoo Park
2020-04-13	follow EXPECT_EQ(actual, expected) convention (#354)	Jongsoo Park
2020-04-11	use .half (#353)	Jongsoo Park
2020-04-11	Revert D20956364: Move `transpose_simd` to TransposeUtils.cc	Zafar Takhirov
2020-04-10	Move `transpose_simd` to TransposeUtils.cc (#349)	Nikita Shulga
2020-04-10	add option to use int64_t offsets/lengths in embedding operators (#350)	Jongsoo Park
2020-04-10	Move FakeFP16 back to internal to remove dependency on MKL (#36297)	Jianyu Huang
2020-04-10	Bazel build definition to use filelists from defs.bzl (#348)	Nikita Shulga
2020-04-10	change default value of use_offsets to true (to match with PyTorch EmbeddingB...	Jongsoo Park
2020-04-10	Move filelists to defs.bzl file (#344)	Nikita Shulga
2020-04-09	Add bazel build files (#346)	Nikita Shulga
2020-04-09	Merge pull request #345 from jspark1105/fixup-T64383689-master	Daya Khudia
2020-04-09	Re-sync with internal repository	Jongsoo Park
2020-04-09	Open sourcing fbgemm_fp16 ops (#36212)	Yinghai Lu
2020-04-07	add license header (#341)	Jongsoo Park
2020-04-07	JIT depth-wise conv (#338)	Jongsoo Park
2020-04-07	use read-write lock in CodeCache (#339)	Jongsoo Park
2020-04-07	Match numerics of quantize with vector path and fake quantize (#340)	Daya Khudia
2020-04-03	test 1024^3 matrix mulmat performance (#332)	Hongzhang Shan
2020-04-03	add dummy compute to cache evict func (#337)	Jongsoo Park
2020-03-31	disable asan for QuantizeAvx2 (#336)	Jongsoo Park
2020-03-31	add offset-based interface for PyTorch (#334)	Jongsoo Park
2020-03-27	Rename `times` global to avoid conflict	Andrew Gallagher
2020-03-24	remove unnecessary bound check for prefetching; avoid prefetching for -1 inde...	Jongsoo Park
2020-03-23	clamping with 1 comparison trick by treating signed as if it's unsigned (#324)	Jongsoo Park
2020-03-23	use xor instead of mov 0 (#325)	Jongsoo Park
2020-03-23	minor fixes in embedding benchmarks (#326)	Jongsoo Park