Welcome to mirror list, hosted at ThFree Co, Russian Federation.

github.com/marian-nmt/FBGEMM.git - Unnamed repository; edit this file 'description' to name the repository.
summaryrefslogtreecommitdiff
AgeCommit message (Expand)Author
2020-01-14Fix compile errorsyouki/merge-win-int8Young Jin Kim
2020-01-14Update jit parameters for windowsYoung Jin Kim
2020-01-14Fix undefined reference error on macos (#247)Hongzhang Shan
2020-01-14Cmake changes for MSVC build (#242)Daya Khudia
2020-01-14Allocate memory instead of using variable length arrays (#241)Daya Khudia
2020-01-14More fixes related to MSVC build (#240)Daya Khudia
2020-01-14fix perf regression in resnet50 and resnet101 (#239)Daya Khudia
2020-01-13Fix multi-instance benchmarking for fp16 (#246)Daya Khudia
2020-01-10attributes not at the right placeDaya Khudia
2020-01-104-bit SLS with emb dim not a multiple of 2 (#243)Jongsoo Park
2020-01-09Githubaction (#237)Hongzhang Shan
2020-01-074bit JIT'ed SLS kernel (#233)Jongsoo Park
2020-01-05start to add github action fileHongzhang Shan
2020-01-04Handle remainder loop with masked SIMD instructions (#235)Daya Khudia
2020-01-04Restructure code gen for FBGEMMFP16 (#236)Evgeny Fiksman
2020-01-03Adding missing copyright messageDaya Khudia
2020-01-02simplify SparseAdagrad (#227)Jongsoo Park
2020-01-02simplify EmbeddingSpMDM (#228)Jongsoo Park
2019-12-27add more comments on handling small scale handling; remove redundant if (#232)Jongsoo Park
2019-12-26Fixing cblas.h for mac build in fp16 benchmarkHongzhang Shan
2019-12-26refine special handling of small quant scale (#231)Jongsoo Park
2019-12-20Fix ISA detection mechanism (#225)Evgeny Fiksman
2019-12-20Adding FP32 JIT SparseAdagrad and Rowwise Adagrad (#224)Protonu Basu
2019-12-20Using import/export correctlyDaya Khudia
2019-12-19cleaning up some code in referene implementations WRT SLS that is not neededProtonu Basu
2019-12-19Avoid GNU compiler specific C++ extensionDaya Khudia
2019-12-18Streamline usage of attributes (#222)Daya Khudia
2019-12-18Cast instead of reinterpret_cast in recently added codeDaya Khudia
2019-12-17Fix build with C++11 (#223)Evgeny Fiksman
2019-12-17avoid enforcment of unsupported ISA (#221)Evgeny Fiksman
2019-12-13Use cast intrinsicsDaya Khudia
2019-12-12Suitable declaration for cblas_gemm_computeDaya Khudia
2019-12-12Explicit template parametersDaya Khudia
2019-12-12Things like iota/accumulate are defined in numeric headerDaya Khudia
2019-12-12fbgemm API decorator only needed at declarationDaya Khudia
2019-12-12Aligned memory allocation that works with MSVC as wellDaya Khudia
2019-12-11Back out "Revert D18138146: [caffe2] Add support for AVX512-256(YMM) in FBGEM...Evgeny Fiksman
2019-12-11Revert D18138146: Add support for AVX512-256(YMM) in FBGEMM16Jianyu Huang
2019-12-11Add support for AVX512-256(YMM) in FBGEMM16 (#209)Evgeny Fiksman
2019-12-11Reapply fix for make_unique not supported by C++11 (#217)Evgeny Fiksman
2019-12-11Fix non-MKL build in CI (#216)Evgeny Fiksman
2019-12-10FBGEMM CI avoid call to C++14 (#215)Evgeny Fiksman
2019-12-10Fix FBGEMM OSS CI (#212)Evgeny Fiksman
2019-12-10Fix pytorch build (#214)Evgeny Fiksman
2019-12-10Fix FBGEMM OSS CI (#211)Jianyu Huang
2019-12-10Add additional execution arguments to the benchmark (#207)Evgeny Fiksman
2019-12-10Pre-transpose Weight matrix (B) before sending to MKL routine (#205)Evgeny Fiksman
2019-12-10Enhance benchmark to include cache eviction and multi instance with OMP (#204)Evgeny Fiksman
2019-12-09Adding FP 32 SLS, and unifying it with 8 Bit SLS (#206)Protonu Basu
2019-12-09Add the the FP16 <-> FP32 conversion benchmark (#210)Jianyu Huang