Welcome to
mirror list
, hosted at
ThFree Co
, Russian Federation.
github.com/marian-nmt/FBGEMM.git - Unnamed repository; edit this file 'description' to name the repository.
index
:
github.com/marian-nmt/FBGEMM.git
aaronpburke/fix-install-targets2
copyPublic
gcc11support
master
youki/avx512-avx2
youki/benchmarksparse
youki/fix-avx2-fp16
youki/fix-gcc10-compile
youki/fix-gcc9-build
youki/fix-stdexcept
youki/fp16avx512
youki/fp16intrinsic
youki/improve-mem-alloc
youki/improve-mem-alloc-marian
youki/jit-experiments
youki/merge-win-int8
youki/mergemaster01092020
youki/mergemaster1206
youki/static-code-cache
youki/testsparse
youki/unit-test-fix
youki/unordered_map
youki/upstream0217
youki/upstream0509
Unnamed repository; edit this file 'description' to name the repository.
www-data
summary
refs
log
tree
commit
diff
log msg
author
committer
range
Age
Commit message (
Expand
)
Author
2020-05-10
Add prepacked B matrix
youki/upstream0509
Young Jin Kim
2020-02-07
clean up temporary code for isa-dependent serialized packed weight (#290)
Jongsoo Park
2020-02-07
row-wise sparse fp32/fp16/int8 EmbeddingSpMDM (#288)
Jongsoo Park
2020-02-05
enable github action build/test on windows (#285)
Hongzhang Shan
2020-02-05
changes for shared release on windows (#284)
Hongzhang Shan
2020-02-04
enable positional weight for EmbeddingSpMDMNBitRowWiseSparse (#281)
Jongsoo Park
2020-02-04
enable EmbeddingSpMDMNBitRowWiseSparse avx2 (#280)
Jongsoo Park
2020-02-02
Fix vector size setting and min max set for randFill (#283)
Hongzhang Shan
2020-02-02
fix "invalid min and max arguments for uniform_int" error (#282)
Hongzhang Shan
2020-02-01
fix overflow in cache flush (#260)
Jongsoo Park
2020-02-01
add 2-stage SparseAdagrad interface (#276)
Jiecao Yu
2020-02-01
clang-format the code (#278)
Jianyu Huang
2020-02-01
Fix the root cause for _aligned_free issues on row_offset_ allocation (#277)
Jianyu Huang
2020-02-01
put cmake cpuinfo vars into cmake cache
Hongzhang Shan
2020-01-31
Fix the _aligned_free() issue on Windows (#275)
Jianyu Huang
2020-01-31
add more instantiation for EmbeddingSpMDMAvx2 (#274)
Hongzhang Shan
2020-01-31
2/4-bit EmbeddingSPMDM with rowwise sparsity (#263)
Jongsoo Park
2020-01-31
Enable FMA operation for windows (#273)
Hongzhang Shan
2020-01-31
2bit JIT'ed SLS kernel (#234)
Jongsoo Park
2020-01-31
cmake changes for windows compiling
Hongzhang Shan
2020-01-29
Add instantiation for template parameters
Hongzhang Shan
2020-01-28
Remove extra paranthesis (#270)
Jianyu Huang
2020-01-28
make gconv work for any W and H (#269)
Jongsoo Park
2020-01-24
use 2-stage EmbeddingSpMDM interface from benchmark and test (#256)
Jongsoo Park
2020-01-23
add missing copyright header (#267)
Jongsoo Park
2020-01-22
EmbeddingSpMDM with float16 input (#265)
Jongsoo Park
2020-01-22
modified instrinsic fp16 kernel for windows build (#259)
Jongsoo Park
2020-01-22
Intrinsic implementation of FP16 kernels for windows build (#254)
Young Jin Kim
2020-01-22
don't need extra register for positional weights (#261)
Jongsoo Park
2020-01-22
specialize fp32 SLS for emb dim 1 (#264)
Jongsoo Park
2020-01-21
avx2 4-bit SLS (#262)
Jongsoo Park
2020-01-21
remove EmbeddingSpMDM4BitKernelSignature (#257)
Jongsoo Park
2020-01-19
Add github action for windows
Hongzhang Shan
2020-01-18
enable positional weight test (#252)
Jongsoo Park
2020-01-18
Update readme with link to wiki page
Daya Khudia
2020-01-17
Bug fixes in default path for fp16 (#251)
Daya Khudia
2020-01-17
add 2-stage EmbeddingSpMDM interface and change CodeCache to use read-write l...
Jongsoo Park
2020-01-16
pass the number of rows to EmbeddingSpMDM4Bit (#245)
Jongsoo Park
2020-01-15
Add FBGEMM_ENUM_CLASS_API (#249)
Hongzhang Shan
2020-01-14
Fix undefined reference error on macos (#247)
Hongzhang Shan
2020-01-14
Cmake changes for MSVC build (#242)
Daya Khudia
2020-01-14
Allocate memory instead of using variable length arrays (#241)
Daya Khudia
2020-01-14
More fixes related to MSVC build (#240)
Daya Khudia
2020-01-14
fix perf regression in resnet50 and resnet101 (#239)
Daya Khudia
2020-01-13
Fix multi-instance benchmarking for fp16 (#246)
Daya Khudia
2020-01-10
attributes not at the right place
Daya Khudia
2020-01-10
4-bit SLS with emb dim not a multiple of 2 (#243)
Jongsoo Park
2020-01-09
Githubaction (#237)
Hongzhang Shan
2020-01-07
4bit JIT'ed SLS kernel (#233)
Jongsoo Park
2020-01-05
start to add github action file
Hongzhang Shan
[next]