Welcome to mirror list, hosted at ThFree Co, Russian Federation.

github.com/marian-nmt/FBGEMM.git - Unnamed repository; edit this file 'description' to name the repository.
summaryrefslogtreecommitdiff
AgeCommit message (Expand)Author
2019-08-21Per channel support in fbgemmConv (#119)Daya Khudia
2019-08-12fix error message (#117)Daya Khudia
2019-08-09Integrate VNNI into FBGEMM master branch (#114)Jianyu Huang
2019-08-09Add unpack to PackedGemmMatrixFP16 (#112)Yinghai Lu
2019-08-06Back out "[fbgemm] Integrate VNNI into FBGEMM master branch"Jianyu Huang
2019-08-06Integrate VNNI into FBGEMM master branch (#113)Jianyu Huang
2019-08-02Pass blocking param pointer into packedBufferSize() in PackBMatrix.ccMike Tsai
2019-07-19Support pointwise with unified convolution interface as well (#108)Daya Khudia
2019-07-17While calling fbgemmConv with packed weights, packed weights should be compli...Daya Khudia
2019-07-16Add functions needed for unpacking in PackWeightsForConv (#106)Daya Khudia
2019-07-16unpack through unified convolution interface (#105)Daya Khudia
2019-07-10Refactoring unpack weight function (#103)Jianyu Huang
2019-07-06Unpack data for 3x3 (and 3x3x3) depthwise convolutionDaya Khudia
2019-07-06Implement ::unpack() for PackWeightMatrixForGConvJaewon Lee
2019-06-20Per channel and groupwise quantization (#99)Daya Khudia
2019-06-15Update the logic of checking valid parameters.Mike Tsai
2019-06-07Remove duplicated header and undo some changes in D15399811Daya Khudia
2019-06-05Unified convolution interfaceDaya Khudia
2019-06-04Add quantized::fbgemm_linear_unpack operator for serialization (#97)Jianyu Huang
2019-04-02Exposing tuning parameters in FBGEMM (MCB, NCB, KCB, MR, NR, Row Interleave) ...Protonu Basu
2019-03-21Improves small N cases back to what they wereDaya S Khudia
2019-03-21Allocate some registers for B matrix loading and reuse loaded resultsDaya S Khudia
2019-03-21Further optimize acc16 kernel and cache blocking dimension for B matrix is no...Daya S Khudia
2019-03-21Further optimize acc32 kernel and cache blocking dimension for B matrix is no...Daya S Khudia
2019-03-13optimize requantize for float out processing (#85)Jongsoo Park
2019-03-08Fixes for FBGEMM FP16 performance (#82)Jianyu Huang
2019-03-06Add Avx512BW/VL/DQ check (#84)Jianyu Huang
2019-03-01Add documentations for the cache/register blocking parameters (#81)Jianyu Huang
2019-02-20optimize PackAWithIm2Col for symmetric b quantJongsoo Park
2019-02-13optimize gconv for b symmetric quantization (#70)Jongsoo Park
2019-02-13no need to subtract col offset if a_zp is 0 (#69)Jongsoo Park
2019-02-02gconv optimized for 8 channels per group (#65)Jongsoo Park
2019-02-02Remove inappropriate consts (#67)Lu Fang
2019-02-01specialized requantization for gconv (#61)Jongsoo Park
2019-01-31Add threading for FBGEMM FP16Jianyu Huang
2019-01-14Groupwise direct convolution when number of channels per group is smallDaya S Khudia
2019-01-11don't keep conv_param_p member as a const reference (#57)Jongsoo Park
2019-01-03fix shared lib buildDaya S Khudia
2018-12-21Update with clang format (#51)Jianyu Huang
2018-12-19Refactor to use FbgemmFP16 in packed gemm operator (#49)Amy Yang
2018-12-17add comments on col_offsets (#48)Jongsoo Park
2018-12-06Final cleanup for avx2 isolation and consistent file names (#40)Daya S Khudia
2018-12-06avx2 intrinsic separation from OutputProcessing-inl.h (#38)Daya S Khudia
2018-12-05avx2 specific code in a separate file for QuantUtils (#29)Daya S Khudia
2018-12-05Move avx2 specific code in different source files (#28)Daya S Khudia
2018-12-01Build fix with fbgemm shared lib (#31)Daya S Khudia
2018-11-30Only export symbols that are required while building shared libraryDaya S Khudia
2018-11-29sparse convolution output processing (#27)Jongsoo Park
2018-11-27per-group and per-channel quantization (#14340)Jongsoo Park
2018-11-27fix group convention in B packing (#26)Jongsoo Park