Welcome to mirror list, hosted at ThFree Co, Russian Federation.

github.com/marian-nmt/FBGEMM.git - Unnamed repository; edit this file 'description' to name the repository.
summaryrefslogtreecommitdiff
path: root/src
AgeCommit message (Expand)Author
2019-01-02Fix a bug in FbgemmFP16 (#52)Feiteng
2018-12-21Update with clang format (#51)Jianyu Huang
2018-12-11instantiate more kernels for PackAmatrix (#47)Jongsoo Park
2018-12-06Fix duplicate symbols for thread local member variables (#43)James Reed
2018-12-06Add missing <algorithm> include (#42)James Reed
2018-12-06Final cleanup for avx2 isolation and consistent file names (#40)Daya S Khudia
2018-12-06avx2 intrinsic separation from OutputProcessing-inl.h (#38)Daya S Khudia
2018-12-06Separate out avx2 code from dense x sparse matrix multiplication (#39)Daya S Khudia
2018-12-06File name change for FbgemmI8Depthwise.h and FbgemmI8Depthwise.cc (#14725)Daya S Khudia
2018-12-06remove usage of c++ stdlib templates from FbgemmI8Depthwise (#37)Daya S Khudia
2018-12-05clean up PackAWithQuantRowOffset from avx2 intrinsics (#36)Daya S Khudia
2018-12-05Move avx2 intrinsics from PackAWithIm2Col (#35)Daya S Khudia
2018-12-05Removed avx2 code from PackAWithRowOffset.cc (#34)Daya S Khudia
2018-12-05avx2 specific code in a separate file for QuantUtils (#29)Daya S Khudia
2018-12-05Move avx2 specific code in different source files (#28)Daya S Khudia
2018-12-01Fix a bug in conv_refJianyu Huang
2018-11-30Only export symbols that are required while building shared libraryDaya S Khudia
2018-11-29sparse convolution output processing (#27)Jongsoo Park
2018-11-27per-group and per-channel quantization (#14340)Jongsoo Park
2018-11-27fix group convention in B packing (#26)Jongsoo Park
2018-11-26remove unnecessary zero_point argument from constructors (#14323)Jongsoo Park
2018-11-26minimize code compiled with avx2 and header includes from them (#14313)Jongsoo Park
2018-11-23parallelization over groups (#23)Jongsoo Park
2018-11-22adding quantization utility functions (#19)Jongsoo Park
2018-11-22use avx512 packing trait in PackWithQuantRowOffset (#20)Jongsoo Park
2018-11-22Unify the PackA file names (#21)Jianyu Huang
2018-11-21Fix assert failureJianyu Huang
2018-11-20Optimize parallelization performance (#15)Jianyu Huang
2018-11-20Simple parallelism, add -openmp flags and omp parallel for Acc16/32 Unit Test...Jianyu Huang
2018-11-20A function to check if we are running on a fbgemm supported cpu (#13)Daya S Khudia
2018-11-19clang-format (#11)Jongsoo Park
2018-11-16grouped conv (#8)Jongsoo Park
2018-11-16grouped (batched) gemm (#7)Jongsoo Park
2018-11-14consistent rounding (#9)Jongsoo Park
2018-11-13refactoring transpose code and commentJongsoo Park
2018-11-11Fix the issue caused by isA(); Remove use of avx512 cast intrinsics for GCC 4...Jianyu Huang
2018-11-09Merge pull request #4 from jamesr66a/fix_isaJianyu Huang
2018-11-09Fix use of isAJames Reed
2018-11-08Sync with internal copy: Asymmetric padding; fbgemm2 -> fbgemmJianyu Huang
2018-11-06Add equals and metaEquals method to PackBMatrixJongsoo Park
2018-11-06generalized conv_param_t and download third party libraries in build dirdskhudia
2018-11-04Syncing with internal version. Fixes for Mac/clang build. Other minor fixesdskhudia
2018-11-03Manually syncing with internal copydskhudia
2018-10-31Initial commitDaya S Khudia