Welcome to
mirror list
, hosted at
ThFree Co
, Russian Federation.
github.com/google/ruy.git - Unnamed repository; edit this file 'description' to name the repository.
index
:
github.com/google/ruy.git
bjacob-patch-1
bjacob-patch-2
bjacob-patch-3
master
revert-233-cmake
revert-243-revert-233-cmake
test_304319643
test_306981662
test_308313346
test_313514670
test_313928014
test_314399108
test_314774544
test_317350670
test_323013778
test_337422027
test_337974156
test_341055751
test_361230515
test_362328919
test_406772541
test_440406010
test_456909969
test_460342347
test_464475210
test_466965492
test_474105012
Unnamed repository; edit this file 'description' to name the repository.
www-data
summary
refs
log
tree
commit
diff
log msg
author
committer
range
Age
Commit message (
Expand
)
Author
2020-10-20
Use movi of dup with zero register
test_337974156
Jaesung Chung
2020-10-19
Update AVX, AVX2, AVX512 Rescale operations with Rounding Right Shift
T.J. Alumbaugh
2020-10-16
move example.cc into one directory
Leslie-Fang
2020-10-16
Use movi NEON instruction to zero out registers
Lukas Geiger
2020-10-16
It's _MSC_VER not __MSC_VER.
Benoit Jacob
2020-10-15
Enable x86 SIMD code paths on MSVC 2019 and similarly-versioned Clang-CL.
Benoit Jacob
2020-10-08
Add a few PMU counters.
Benoit Jacob
2020-09-26
Additional optimizations for AVX 8bit quantized kernel.
T.J. Alumbaugh
2020-09-23
Optimize AVX/AVX2 quantized path
T.J. Alumbaugh
2020-09-22
Fix buffer overrun on asan for AVX512 float.
T.J. Alumbaugh
2020-09-21
Optimize AVX512 float path
T.J. Alumbaugh
2020-09-18
Optimize AVX/AVX2+FMA float path
T.J. Alumbaugh
2020-09-03
AVX 8bit row major/col major packing code
T.J. Alumbaugh
2020-09-01
AVX Pack inherits from StandardCpp
T.J. Alumbaugh
2020-08-31
AVX 8bit kernel. Forked from AVX2+FMA version
T.J. Alumbaugh
2020-08-24
Update README.md
Benoit Jacob
2020-08-21
Changes are excluded via Copybara
Ruy Contributors
2020-08-21
Changes are excluded via Copybara
Ruy Contributors
2020-08-19
Rename kOutOfOrder -> kGeneric, kInOrder -> kA55ish,
Benoit Jacob
2020-08-19
Reimplement :tune on top of :cpuinfo.
Benoit Jacob
2020-08-17
Add bzl_library rules for .bzl files without one.
Ruy Contributors
2020-07-31
Adds AVX float packing code.
T.J. Alumbaugh
2020-07-31
Adds AVX path and AVX float kernel.
T.J. Alumbaugh
2020-07-30
Adds AVX path and AVX float kernel.
T.J. Alumbaugh
2020-07-29
Adds AVX path and AVX float kernel.
T.J. Alumbaugh
2020-07-28
Print extra information in case of disagreeing TestResults.
T.J. Alumbaugh
2020-07-27
check_macros improvements: promote operands before comparisons (avoids -Wsign...
Benoit Jacob
2020-07-27
Add missing #include of <cstring>.
Benoit Jacob
2020-07-25
Simplify bias-loading code now that bias buffers are always rounded up to mul...
Benoit Jacob
2020-07-25
Use lambdas to shorten source code like we did in the avx512 kernel.
Benoit Jacob
2020-07-25
Handle per-column multipliers in the avx512 kernel without transposing the 16...
Benoit Jacob
2020-07-24
Optimized packing code path for row-major 8bit inputs for the x86 paths.
Benoit Jacob
2020-07-24
Optimized packing code path for row-major 8bit inputs for the kNeon path. Wri...
Benoit Jacob
2020-07-21
Use lambdas to shorten Kernel8bitAvx512's source code, and to split the resul...
Benoit Jacob
2020-07-21
Optimized packing code path for row-major float inputs.
Benoit Jacob
2020-07-20
Optimized packing code path for row-major 8bit inputs for the kNeonDotprod path.
Benoit Jacob
2020-07-15
Fix the build on some toolchains - a missing #include<cstring> and some avx51...
Benoit Jacob
2020-07-15
Rename packing code implementation functions now that they are explicitly abo...
Benoit Jacob
2020-07-15
Templatize packing code paths on the source order, so that we support any com...
Benoit Jacob
2020-07-14
Simplification of FallBackToStandardCpp now that we are past the incremental ...
Benoit Jacob
2020-07-14
Efficient support for any channel_dimension for quantized kernels on AVX-512,...
Benoit Jacob
2020-07-14
Efficient support for any channel_dimension for quantized kernels on AVX-512,...
Benoit Jacob
2020-07-14
Efficient support for any channel_dimension for quantized kernels on AVX2.
Benoit Jacob
2020-07-14
Simplify x86 kernels by using the fact that there always is a per-channel buf...
Benoit Jacob
2020-07-14
Simplify x86 kernels thanks to the new fact that perchannel buffers are round...
Benoit Jacob
2020-07-13
Fix runtime detection of support for our AVX2+FMA code path: we were only che...
Benoit Jacob
2020-07-13
FMA is technically a separate ISA extension from AVX2.
Benoit Jacob
2020-07-13
Efficient support for any channel_dimension for float kernels on AVX-512.
Benoit Jacob
2020-07-13
Efficient support for any channel_dimension for float kernels on AVX2.
Benoit Jacob
2020-07-13
Allow the user to specify that they have allocated a slightly larger capacity...
Benoit Jacob
[next]