github.com/marian-nmt/FBGEMM.git
path: root/bench
Age | Commit message | Author
2019-09-25 | Merge remote-tracking branch 'upstream/master' into youki/win-jit-debug-int8 | Young Jin Kim
2019-09-24 | remove template parameter from PackedDepthWiseConvMatrix (#128) | Jongsoo Park
2019-09-11 | API changes to take unquantized bias for depthwise conv | Daya Khudia
2019-09-05 | Modifying PackAWithIm2Col to support dilated convolution and adding test cases | Protonu Basu
2019-09-04 | remove dw conv refs and use conv_ref instead (#122) | Jongsoo Park
2019-09-03 | disable clang formatting in a few array definitions (#121) | Jongsoo Park
2019-08-01 | Merge upstream master | Young Jin Kim
2019-07-19 | Support pointwise with unified convolution interface as well (#108) | Daya Khudia
2019-07-16 | Assume input weights to be in transposed format for convUnified (#104) | Daya Khudia
2019-06-14 | Improve some memroy allocation codes | Young Jin Kim
2019-06-13 | Compile both on windows and linux | Young Jin Kim
2019-06-05 | Unified convolution interface | Daya Khudia
2019-04-19 | make sure cpuinfo_initialize called before fbgemmHasAvx2/512Support (#94) | Jongsoo Park
2019-04-02 | Exposing tuning parameters in FBGEMM (MCB, NCB, KCB, MR, NR, Row Interleave) ... | Protonu Basu
2019-03-13 | optimize requantize for float out processing (#85) | Jongsoo Park
2019-02-26 | barebone int8-acc16 and int8-acc32 benchmarks | Daya S Khudia
2019-02-15 | simple spmdm optimization (#76) | Jongsoo Park
2019-02-14 | clean up depthwise conv interface (#72) | Jongsoo Park
2019-02-13 | group conv optimized for 16 channels per group (#68) | Jongsoo Park
2019-02-02 | gconv optimized for 8 channels per group (#65) | Jongsoo Park
2019-01-31 | use 1 thread in benchmarks if OMP_NUM_THREADS is not explicitly set (#66) | Jongsoo Park
2019-01-31 | Add threading for FBGEMM FP16 | Jianyu Huang
2019-01-14 | Groupwise direct convolution when number of channels per group is small | Daya S Khudia
2019-01-14 | FP16Benchmark: Allow fp32 comparison using cblas (#56) | WilliamTambellini
2019-01-12 | 3x3x3 depthwise convolution with per channel quantization (#15775) | Jongsoo Park
2019-01-04 | missing copyright headers | Daya S Khudia
2019-01-03 | optimize remainder loops of requantization and rowoffset (#54) | Jongsoo Park
2019-01-02 | use 1 omp thread unless OMP_NUM_THREADS is explicitly set (#53) | Jongsoo Park
2018-12-21 | Update the profiling format for Acc32 Benchmark (#50) | Jianyu Huang
2018-12-21 | Update with clang format (#51) | Jianyu Huang
2018-12-06 | File name change for FbgemmI8Depthwise.h and FbgemmI8Depthwise.cc (#14725) | Daya S Khudia
2018-12-04 | Fix the group issue in the benchmark and use ResNext101 conv shapes (#32) | Jianyu Huang
2018-11-30 | protect omp.h include by a pragma | Daya S Khudia
2018-11-27 | per-group and per-channel quantization (#14340) | Jongsoo Park
2018-11-26 | remove unnecessary zero_point argument from constructors (#14323) | Jongsoo Park
2018-11-20 | Parallelize the benchmark | Jianyu Huang
2018-11-19 | clang-format (#11) | Jongsoo Park
2018-11-16 | grouped (batched) gemm (#7) | Jongsoo Park
2018-11-08 | Sync with internal copy: Asymmetric padding; fbgemm2 -> fbgemm | Jianyu Huang
2018-11-06 | generalized conv_param_t and download third party libraries in build dir | dskhudia
2018-11-05 | CMake minimum version required update | dskhudia
2018-11-03 | Manually syncing with internal copy | dskhudia
2018-10-31 | Initial commit | Daya S Khudia