Welcome to
mirror list
, hosted at
ThFree Co
, Russian Federation.
github.com/marian-nmt/FBGEMM.git - Unnamed repository; edit this file 'description' to name the repository.
index
:
github.com/marian-nmt/FBGEMM.git
aaronpburke/fix-install-targets2
copyPublic
gcc11support
master
youki/avx512-avx2
youki/benchmarksparse
youki/fix-avx2-fp16
youki/fix-gcc10-compile
youki/fix-gcc9-build
youki/fix-stdexcept
youki/fp16avx512
youki/fp16intrinsic
youki/improve-mem-alloc
youki/improve-mem-alloc-marian
youki/jit-experiments
youki/merge-win-int8
youki/mergemaster01092020
youki/mergemaster1206
youki/static-code-cache
youki/testsparse
youki/unit-test-fix
youki/unordered_map
youki/upstream0217
youki/upstream0509
Unnamed repository; edit this file 'description' to name the repository.
www-data
summary
refs
log
tree
commit
diff
log msg
author
committer
range
path:
root
/
bench
Age
Commit message (
Expand
)
Author
2019-04-19
make sure cpuinfo_initialize called before fbgemmHasAvx2/512Support (#94)
Jongsoo Park
2019-04-02
Exposing tuning parameters in FBGEMM (MCB, NCB, KCB, MR, NR, Row Interleave) ...
Protonu Basu
2019-03-13
optimize requantize for float out processing (#85)
Jongsoo Park
2019-02-26
barebone int8-acc16 and int8-acc32 benchmarks
Daya S Khudia
2019-02-15
simple spmdm optimization (#76)
Jongsoo Park
2019-02-14
clean up depthwise conv interface (#72)
Jongsoo Park
2019-02-13
group conv optimized for 16 channels per group (#68)
Jongsoo Park
2019-02-02
gconv optimized for 8 channels per group (#65)
Jongsoo Park
2019-01-31
use 1 thread in benchmarks if OMP_NUM_THREADS is not explicitly set (#66)
Jongsoo Park
2019-01-31
Add threading for FBGEMM FP16
Jianyu Huang
2019-01-14
Groupwise direct convolution when number of channels per group is small
Daya S Khudia
2019-01-14
FP16Benchmark: Allow fp32 comparison using cblas (#56)
WilliamTambellini
2019-01-12
3x3x3 depthwise convolution with per channel quantization (#15775)
Jongsoo Park
2019-01-04
missing copyright headers
Daya S Khudia
2019-01-03
optimize remainder loops of requantization and rowoffset (#54)
Jongsoo Park
2019-01-02
use 1 omp thread unless OMP_NUM_THREADS is explicitly set (#53)
Jongsoo Park
2018-12-21
Update the profiling format for Acc32 Benchmark (#50)
Jianyu Huang
2018-12-21
Update with clang format (#51)
Jianyu Huang
2018-12-06
File name change for FbgemmI8Depthwise.h and FbgemmI8Depthwise.cc (#14725)
Daya S Khudia
2018-12-04
Fix the group issue in the benchmark and use ResNext101 conv shapes (#32)
Jianyu Huang
2018-11-30
protect omp.h include by a pragma
Daya S Khudia
2018-11-27
per-group and per-channel quantization (#14340)
Jongsoo Park
2018-11-26
remove unnecessary zero_point argument from constructors (#14323)
Jongsoo Park
2018-11-20
Parallelize the benchmark
Jianyu Huang
2018-11-19
clang-format (#11)
Jongsoo Park
2018-11-16
grouped (batched) gemm (#7)
Jongsoo Park
2018-11-08
Sync with internal copy: Asymmetric padding; fbgemm2 -> fbgemm
Jianyu Huang
2018-11-06
generalized conv_param_t and download third party libraries in build dir
dskhudia
2018-11-05
CMake minimum version required update
dskhudia
2018-11-03
Manually syncing with internal copy
dskhudia
2018-10-31
Initial commit
Daya S Khudia