Welcome to mirror list, hosted at ThFree Co, Russian Federation.

github.com/marian-nmt/FBGEMM.git - Unnamed repository; edit this file 'description' to name the repository.
summaryrefslogtreecommitdiff
AgeCommit message (Expand)Author
2019-03-08Fixes for FBGEMM FP16 performance (#82)Jianyu Huang
2019-03-06Add Avx512BW/VL/DQ check (#84)Jianyu Huang
2019-03-01Add documentations for the cache/register blocking parameters (#81)Jianyu Huang
2019-02-20optimize PackAWithIm2Col for symmetric b quantJongsoo Park
2019-02-13optimize gconv for b symmetric quantization (#70)Jongsoo Park
2019-02-13no need to subtract col offset if a_zp is 0 (#69)Jongsoo Park
2019-02-02gconv optimized for 8 channels per group (#65)Jongsoo Park
2019-02-02Remove inappropriate consts (#67)Lu Fang
2019-02-01specialized requantization for gconv (#61)Jongsoo Park
2019-01-31Add threading for FBGEMM FP16Jianyu Huang
2019-01-14Groupwise direct convolution when number of channels per group is smallDaya S Khudia
2019-01-11don't keep conv_param_p member as a const reference (#57)Jongsoo Park
2019-01-03fix shared lib buildDaya S Khudia
2018-12-21Update with clang format (#51)Jianyu Huang
2018-12-19Refactor to use FbgemmFP16 in packed gemm operator (#49)Amy Yang
2018-12-17add comments on col_offsets (#48)Jongsoo Park
2018-12-06Final cleanup for avx2 isolation and consistent file names (#40)Daya S Khudia
2018-12-06avx2 intrinsic separation from OutputProcessing-inl.h (#38)Daya S Khudia
2018-12-05avx2 specific code in a separate file for QuantUtils (#29)Daya S Khudia
2018-12-05Move avx2 specific code in different source files (#28)Daya S Khudia
2018-12-01Build fix with fbgemm shared lib (#31)Daya S Khudia
2018-11-30Only export symbols that are required while building shared libraryDaya S Khudia
2018-11-29sparse convolution output processing (#27)Jongsoo Park
2018-11-27per-group and per-channel quantization (#14340)Jongsoo Park
2018-11-27fix group convention in B packing (#26)Jongsoo Park
2018-11-26remove unnecessary zero_point argument from constructors (#14323)Jongsoo Park
2018-11-26minimize code compiled with avx2 and header includes from them (#14313)Jongsoo Park
2018-11-22adding quantization utility functions (#19)Jongsoo Park
2018-11-22optimize for symmetric quantization in requantization (#18)Jongsoo Park
2018-11-20Optimize parallelization performance (#15)Jianyu Huang
2018-11-20A function to check if we are running on a fbgemm supported cpu (#13)Daya S Khudia
2018-11-19clang-format (#11)Jongsoo Park
2018-11-16grouped conv (#8)Jongsoo Park
2018-11-16grouped (batched) gemm (#7)Jongsoo Park
2018-11-13refactoring transpose code and commentJongsoo Park
2018-11-11Fix the issue caused by isA(); Remove use of avx512 cast intrinsics for GCC 4...Jianyu Huang
2018-11-08Sync with internal copy: Asymmetric padding; fbgemm2 -> fbgemmJianyu Huang
2018-11-06Add equals and metaEquals method to PackBMatrixJongsoo Park
2018-11-06generalized conv_param_t and download third party libraries in build dirdskhudia
2018-11-04Syncing with internal version. Fixes for Mac/clang build. Other minor fixesdskhudia
2018-10-31Initial commitDaya S Khudia