Welcome to
mirror list
, hosted at
ThFree Co
, Russian Federation.
github.com/marian-nmt/FBGEMM.git - Unnamed repository; edit this file 'description' to name the repository.
index
:
github.com/marian-nmt/FBGEMM.git
aaronpburke/fix-install-targets2
copyPublic
gcc11support
master
youki/avx512-avx2
youki/benchmarksparse
youki/fix-avx2-fp16
youki/fix-gcc10-compile
youki/fix-gcc9-build
youki/fix-stdexcept
youki/fp16avx512
youki/fp16intrinsic
youki/improve-mem-alloc
youki/improve-mem-alloc-marian
youki/jit-experiments
youki/merge-win-int8
youki/mergemaster01092020
youki/mergemaster1206
youki/static-code-cache
youki/testsparse
youki/unit-test-fix
youki/unordered_map
youki/upstream0217
youki/upstream0509
Unnamed repository; edit this file 'description' to name the repository.
www-data
summary
refs
log
tree
commit
diff
log msg
author
committer
range
Age
Commit message (
Expand
)
Author
2020-05-21
Remove an unnecessary memory allocation
youki/improve-mem-alloc
Young Jin Kim
2020-05-21
Add the missing header for std::runtime_error (#379)
Jianyu Huang
2020-05-18
Drop extraneous "src" from local includes (#377)
Nikita Shulga
2020-05-17
Float16 rowwise sparse adagrad with stochastic rounding (reattempt of D215191...
Yong Wu
2020-05-16
Back out "Float16 rowwise sparse adagrad with stochastic rounding"
Natalia Gimelshein
2020-05-16
Float16 rowwise sparse adagrad with stochastic rounding (#370)
Yong Wu
2020-05-15
Add conv_1d (#369)
Hongzhang Shan
2020-05-14
Minor improvements in GEMM Kernels (#368)
Daya Khudia
2020-05-14
dirsync fbgemm with xplat
Jongsoo Park
2020-05-13
fix benchmark to consider offsets is now default instead of lengths (#373)
Jongsoo Park
2020-05-13
use lea to simplify code (#372)
Jongsoo Park
2020-05-12
minor fix (#371)
Hongzhang Shan
2020-05-06
unified conv to call dw conv with 2 oc per g (#360)
Jongsoo Park
2020-05-06
depthwise convolution with 2 output channels per group (#359)
Jongsoo Park
2020-05-04
Fix convert indexing bugs (#367)
Pawel Garbacki
2020-05-01
use vreg.ymm() instead of ymm(vreg.id()) (#361)
Jongsoo Park
2020-05-01
remove one-stage interface for sparse adagrad (#366)
Jongsoo Park
2020-04-30
implement L2 regularization for Adagrad in fbgemm (#365)
Jongsoo Park
2020-04-30
asmjit submodule update (#364)
Daya Khudia
2020-04-23
Zero point addition after rounding in quantization routines (#362)
Daya Khudia
2020-04-14
Move `transpose_simd` to TransposeUtils.cc (#352)
Nikita Shulga
2020-04-14
move multiplication with b_zp from jit'ed code to requantization (#358)
Jongsoo Park
2020-04-13
minor changes in UniConvTest and GConvTest (#357)
Jongsoo Park
2020-04-13
simplify transposeConvWeights (#356)
Jongsoo Park
2020-04-13
follow EXPECT_EQ(actual, expected) convention (#354)
Jongsoo Park
2020-04-11
use .half (#353)
Jongsoo Park
2020-04-11
Revert D20956364: Move `transpose_simd` to TransposeUtils.cc
Zafar Takhirov
2020-04-10
Move `transpose_simd` to TransposeUtils.cc (#349)
Nikita Shulga
2020-04-10
add option to use int64_t offsets/lengths in embedding operators (#350)
Jongsoo Park
2020-04-10
Move FakeFP16 back to internal to remove dependency on MKL (#36297)
Jianyu Huang
2020-04-10
Bazel build definition to use filelists from defs.bzl (#348)
Nikita Shulga
2020-04-10
change default value of use_offsets to true (to match with PyTorch EmbeddingB...
Jongsoo Park
2020-04-10
Move filelists to defs.bzl file (#344)
Nikita Shulga
2020-04-09
Add bazel build files (#346)
Nikita Shulga
2020-04-09
Merge pull request #345 from jspark1105/fixup-T64383689-master
Daya Khudia
2020-04-09
Re-sync with internal repository
Jongsoo Park
2020-04-09
Open sourcing fbgemm_fp16 ops (#36212)
Yinghai Lu
2020-04-07
add license header (#341)
Jongsoo Park
2020-04-07
JIT depth-wise conv (#338)
Jongsoo Park
2020-04-07
use read-write lock in CodeCache (#339)
Jongsoo Park
2020-04-07
Match numerics of quantize with vector path and fake quantize (#340)
Daya Khudia
2020-04-03
test 1024^3 matrix mulmat performance (#332)
Hongzhang Shan
2020-04-03
add dummy compute to cache evict func (#337)
Jongsoo Park
2020-03-31
disable asan for QuantizeAvx2 (#336)
Jongsoo Park
2020-03-31
add offset-based interface for PyTorch (#334)
Jongsoo Park
2020-03-27
Rename `times` global to avoid conflict
Andrew Gallagher
2020-03-24
remove unnecessary bound check for prefetching; avoid prefetching for -1 inde...
Jongsoo Park
2020-03-23
clamping with 1 comparison trick by treating signed as if it's unsigned (#324)
Jongsoo Park
2020-03-23
use xor instead of mov 0 (#325)
Jongsoo Park
2020-03-23
minor fixes in embedding benchmarks (#326)
Jongsoo Park
[next]