github.com/marian-nmt/FBGEMM.git - Unnamed repository; edit this file 'description' to name the repository.

Age	Commit message (Expand)	Author
2019-09-25	Merge remote-tracking branch 'upstream/master' into youki/win-jit-debug-int8	Young Jin Kim
2019-09-24	remove template parameter from PackedDepthWiseConvMatrix (#128)	Jongsoo Park
2019-09-11	API changes to take unquantized bias for depthwise conv	Daya Khudia
2019-09-05	Modifying PackAWithIm2Col to support dilated convolution and adding test cases	Protonu Basu
2019-09-04	remove dw conv refs and use conv_ref instead (#122)	Jongsoo Park
2019-09-03	disable clang formatting in a few array definitions (#121)	Jongsoo Park
2019-08-01	Merge upstream master	Young Jin Kim
2019-07-19	Support pointwise with unified convolution interface as well (#108)	Daya Khudia
2019-07-16	Assume input weights to be in transposed format for convUnified (#104)	Daya Khudia
2019-06-14	Improve some memroy allocation codes	Young Jin Kim
2019-06-13	Compile both on windows and linux	Young Jin Kim
2019-06-05	Unified convolution interface	Daya Khudia
2019-04-19	make sure cpuinfo_initialize called before fbgemmHasAvx2/512Support (#94)	Jongsoo Park
2019-04-02	Exposing tuning parameters in FBGEMM (MCB, NCB, KCB, MR, NR, Row Interleave) ...	Protonu Basu
2019-03-13	optimize requantize for float out processing (#85)	Jongsoo Park
2019-02-26	barebone int8-acc16 and int8-acc32 benchmarks	Daya S Khudia
2019-02-15	simple spmdm optimization (#76)	Jongsoo Park
2019-02-14	clean up depthwise conv interface (#72)	Jongsoo Park
2019-02-13	group conv optimized for 16 channels per group (#68)	Jongsoo Park
2019-02-02	gconv optimized for 8 channels per group (#65)	Jongsoo Park
2019-01-31	use 1 thread in benchmarks if OMP_NUM_THREADS is not explicitly set (#66)	Jongsoo Park
2019-01-31	Add threading for FBGEMM FP16	Jianyu Huang
2019-01-14	Groupwise direct convolution when number of channels per group is small	Daya S Khudia
2019-01-14	FP16Benchmark: Allow fp32 comparison using cblas (#56)	WilliamTambellini
2019-01-12	3x3x3 depthwise convolution with per channel quantization (#15775)	Jongsoo Park
2019-01-04	missing copyright headers	Daya S Khudia
2019-01-03	optimize remainder loops of requantization and rowoffset (#54)	Jongsoo Park
2019-01-02	use 1 omp thread unless OMP_NUM_THREADS is explicitly set (#53)	Jongsoo Park
2018-12-21	Update the profiling format for Acc32 Benchmark (#50)	Jianyu Huang
2018-12-21	Update with clang format (#51)	Jianyu Huang
2018-12-06	File name change for FbgemmI8Depthwise.h and FbgemmI8Depthwise.cc (#14725)	Daya S Khudia
2018-12-04	Fix the group issue in the benchmark and use ResNext101 conv shapes (#32)	Jianyu Huang
2018-11-30	protect omp.h include by a pragma	Daya S Khudia
2018-11-27	per-group and per-channel quantization (#14340)	Jongsoo Park
2018-11-26	remove unnecessary zero_point argument from constructors (#14323)	Jongsoo Park
2018-11-20	Parallelize the benchmark	Jianyu Huang
2018-11-19	clang-format (#11)	Jongsoo Park
2018-11-16	grouped (batched) gemm (#7)	Jongsoo Park
2018-11-08	Sync with internal copy: Asymmetric padding; fbgemm2 -> fbgemm	Jianyu Huang
2018-11-06	generalized conv_param_t and download third party libraries in build dir	dskhudia
2018-11-05	CMake minimum version required update	dskhudia
2018-11-03	Manually syncing with internal copy	dskhudia
2018-10-31	Initial commit	Daya S Khudia