github.com/marian-nmt/FBGEMM.git

Age	Commit message	Author
2019-08-21	Per channel support in fbgemmConv (#119)	Daya Khudia
2019-08-12	fix error message (#117)	Daya Khudia
2019-08-09	Integrate VNNI into FBGEMM master branch (#114)	Jianyu Huang
2019-08-09	Add unpack to PackedGemmMatrixFP16 (#112)	Yinghai Lu
2019-08-06	Back out "[fbgemm] Integrate VNNI into FBGEMM master branch"	Jianyu Huang
2019-08-06	Integrate VNNI into FBGEMM master branch (#113)	Jianyu Huang
2019-08-02	Pass blocking param pointer into packedBufferSize() in PackBMatrix.cc	Mike Tsai
2019-07-19	Support pointwise with unified convolution interface as well (#108)	Daya Khudia
2019-07-17	While calling fbgemmConv with packed weights, packed weights should be compli...	Daya Khudia
2019-07-16	Add functions needed for unpacking in PackWeightsForConv (#106)	Daya Khudia
2019-07-16	unpack through unified convolution interface (#105)	Daya Khudia
2019-07-10	Refactoring unpack weight function (#103)	Jianyu Huang
2019-07-06	Unpack data for 3x3 (and 3x3x3) depthwise convolution	Daya Khudia
2019-07-06	Implement ::unpack() for PackWeightMatrixForGConv	Jaewon Lee
2019-06-20	Per channel and groupwise quantization (#99)	Daya Khudia
2019-06-15	Update the logic of checking valid parameters.	Mike Tsai
2019-06-07	Remove duplicated header and undo some changes in D15399811	Daya Khudia
2019-06-05	Unified convolution interface	Daya Khudia
2019-06-04	Add quantized::fbgemm_linear_unpack operator for serialization (#97)	Jianyu Huang
2019-04-02	Exposing tuning parameters in FBGEMM (MCB, NCB, KCB, MR, NR, Row Interleave) ...	Protonu Basu
2019-03-21	Improves small N cases back to what they were	Daya S Khudia
2019-03-21	Allocate some registers for B matrix loading and reuse loaded results	Daya S Khudia
2019-03-21	Further optimize acc16 kernel and cache blocking dimension for B matrix is no...	Daya S Khudia
2019-03-21	Further optimize acc32 kernel and cache blocking dimension for B matrix is no...	Daya S Khudia
2019-03-13	optimize requantize for float out processing (#85)	Jongsoo Park
2019-03-08	Fixes for FBGEMM FP16 performance (#82)	Jianyu Huang
2019-03-06	Add Avx512BW/VL/DQ check (#84)	Jianyu Huang
2019-03-01	Add documentations for the cache/register blocking parameters (#81)	Jianyu Huang
2019-02-20	optimize PackAWithIm2Col for symmetric b quant	Jongsoo Park
2019-02-13	optimize gconv for b symmetric quantization (#70)	Jongsoo Park
2019-02-13	no need to subtract col offset if a_zp is 0 (#69)	Jongsoo Park
2019-02-02	gconv optimized for 8 channels per group (#65)	Jongsoo Park
2019-02-02	Remove inappropriate consts (#67)	Lu Fang
2019-02-01	specialized requantization for gconv (#61)	Jongsoo Park
2019-01-31	Add threading for FBGEMM FP16	Jianyu Huang
2019-01-14	Groupwise direct convolution when number of channels per group is small	Daya S Khudia
2019-01-11	don't keep conv_param_p member as a const reference (#57)	Jongsoo Park
2019-01-03	fix shared lib build	Daya S Khudia
2018-12-21	Update with clang format (#51)	Jianyu Huang
2018-12-19	Refactor to use FbgemmFP16 in packed gemm operator (#49)	Amy Yang
2018-12-17	add comments on col_offsets (#48)	Jongsoo Park
2018-12-06	Final cleanup for avx2 isolation and consistent file names (#40)	Daya S Khudia
2018-12-06	avx2 intrinsic separation from OutputProcessing-inl.h (#38)	Daya S Khudia
2018-12-05	avx2 specific code in a separate file for QuantUtils (#29)	Daya S Khudia
2018-12-05	Move avx2 specific code in different source files (#28)	Daya S Khudia
2018-12-01	Build fix with fbgemm shared lib (#31)	Daya S Khudia
2018-11-30	Only export symbols that are required while building shared library	Daya S Khudia
2018-11-29	sparse convolution output processing (#27)	Jongsoo Park
2018-11-27	per-group and per-channel quantization (#14340)	Jongsoo Park
2018-11-27	fix group convention in B packing (#26)	Jongsoo Park