Welcome to
mirror list
, hosted at
ThFree Co
, Russian Federation.
github.com/marian-nmt/FBGEMM.git - Unnamed repository; edit this file 'description' to name the repository.
index
:
github.com/marian-nmt/FBGEMM.git
aaronpburke/fix-install-targets2
copyPublic
gcc11support
master
youki/avx512-avx2
youki/benchmarksparse
youki/fix-avx2-fp16
youki/fix-gcc10-compile
youki/fix-gcc9-build
youki/fix-stdexcept
youki/fp16avx512
youki/fp16intrinsic
youki/improve-mem-alloc
youki/improve-mem-alloc-marian
youki/jit-experiments
youki/merge-win-int8
youki/mergemaster01092020
youki/mergemaster1206
youki/static-code-cache
youki/testsparse
youki/unit-test-fix
youki/unordered_map
youki/upstream0217
youki/upstream0509
Unnamed repository; edit this file 'description' to name the repository.
www-data
summary
refs
log
tree
commit
diff
log msg
author
committer
range
path:
root
/
include
/
fbgemm
Age
Commit message (
Expand
)
Author
2019-08-21
Per channel support in fbgemmConv (#119)
Daya Khudia
2019-08-12
fix error message (#117)
Daya Khudia
2019-08-09
Integrate VNNI into FBGEMM master branch (#114)
Jianyu Huang
2019-08-09
Add unpack to PackedGemmMatrixFP16 (#112)
Yinghai Lu
2019-08-06
Back out "[fbgemm] Integrate VNNI into FBGEMM master branch"
Jianyu Huang
2019-08-06
Integrate VNNI into FBGEMM master branch (#113)
Jianyu Huang
2019-08-02
Pass blocking param pointer into packedBufferSize() in PackBMatrix.cc
Mike Tsai
2019-07-19
Support pointwise with unified convolution interface as well (#108)
Daya Khudia
2019-07-17
While calling fbgemmConv with packed weights, packed weights should be compli...
Daya Khudia
2019-07-16
Add functions needed for unpacking in PackWeightsForConv (#106)
Daya Khudia
2019-07-16
unpack through unified convolution interface (#105)
Daya Khudia
2019-07-10
Refactoring unpack weight function (#103)
Jianyu Huang
2019-07-06
Unpack data for 3x3 (and 3x3x3) depthwise convolution
Daya Khudia
2019-07-06
Implement ::unpack() for PackWeightMatrixForGConv
Jaewon Lee
2019-06-20
Per channel and groupwise quantization (#99)
Daya Khudia
2019-06-15
Update the logic of checking valid parameters.
Mike Tsai
2019-06-07
Remove duplicated header and undo some changes in D15399811
Daya Khudia
2019-06-05
Unified convolution interface
Daya Khudia
2019-06-04
Add quantized::fbgemm_linear_unpack operator for serialization (#97)
Jianyu Huang
2019-04-02
Exposing tuning parameters in FBGEMM (MCB, NCB, KCB, MR, NR, Row Interleave) ...
Protonu Basu
2019-03-21
Improves small N cases back to what they were
Daya S Khudia
2019-03-21
Allocate some registers for B matrix loading and reuse loaded results
Daya S Khudia
2019-03-21
Further optimize acc16 kernel and cache blocking dimension for B matrix is no...
Daya S Khudia
2019-03-21
Further optimize acc32 kernel and cache blocking dimension for B matrix is no...
Daya S Khudia
2019-03-13
optimize requantize for float out processing (#85)
Jongsoo Park
2019-03-08
Fixes for FBGEMM FP16 performance (#82)
Jianyu Huang
2019-03-06
Add Avx512BW/VL/DQ check (#84)
Jianyu Huang
2019-03-01
Add documentations for the cache/register blocking parameters (#81)
Jianyu Huang
2019-02-20
optimize PackAWithIm2Col for symmetric b quant
Jongsoo Park
2019-02-13
optimize gconv for b symmetric quantization (#70)
Jongsoo Park
2019-02-13
no need to subtract col offset if a_zp is 0 (#69)
Jongsoo Park
2019-02-02
gconv optimized for 8 channels per group (#65)
Jongsoo Park
2019-02-02
Remove inappropriate consts (#67)
Lu Fang
2019-02-01
specialized requantization for gconv (#61)
Jongsoo Park
2019-01-31
Add threading for FBGEMM FP16
Jianyu Huang
2019-01-14
Groupwise direct convolution when number of channels per group is small
Daya S Khudia
2019-01-11
don't keep conv_param_p member as a const reference (#57)
Jongsoo Park
2019-01-03
fix shared lib build
Daya S Khudia
2018-12-21
Update with clang format (#51)
Jianyu Huang
2018-12-19
Refactor to use FbgemmFP16 in packed gemm operator (#49)
Amy Yang
2018-12-17
add comments on col_offsets (#48)
Jongsoo Park
2018-12-06
Final cleanup for avx2 isolation and consistent file names (#40)
Daya S Khudia
2018-12-06
avx2 intrinsic separation from OutputProcessing-inl.h (#38)
Daya S Khudia
2018-12-05
avx2 specific code in a separate file for QuantUtils (#29)
Daya S Khudia
2018-12-05
Move avx2 specific code in different source files (#28)
Daya S Khudia
2018-12-01
Build fix with fbgemm shared lib (#31)
Daya S Khudia
2018-11-30
Only export symbols that are required while building shared library
Daya S Khudia
2018-11-29
sparse convolution output processing (#27)
Jongsoo Park
2018-11-27
per-group and per-channel quantization (#14340)
Jongsoo Park
2018-11-27
fix group convention in B packing (#26)
Jongsoo Park
[next]