github.com/google/ruy.git

Date | Commit message | Author
2020-10-20 | Use movi of dup with zero register [test_337974156] | Jaesung Chung
2020-10-19 | Update AVX, AVX2, AVX512 Rescale operations with Rounding Right Shift | T.J. Alumbaugh
2020-10-16 | move example.cc into one directory | Leslie-Fang
2020-10-16 | Use movi NEON instruction to zero out registers | Lukas Geiger
2020-10-16 | It's _MSC_VER not __MSC_VER. | Benoit Jacob
2020-10-15 | Enable x86 SIMD code paths on MSVC 2019 and similarly-versioned Clang-CL. | Benoit Jacob
2020-10-08 | Add a few PMU counters. | Benoit Jacob
2020-09-26 | Additional optimizations for AVX 8bit quantized kernel. | T.J. Alumbaugh
2020-09-23 | Optimize AVX/AVX2 quantized path | T.J. Alumbaugh
2020-09-22 | Fix buffer overrun on asan for AVX512 float. | T.J. Alumbaugh
2020-09-21 | Optimize AVX512 float path | T.J. Alumbaugh
2020-09-18 | Optimize AVX/AVX2+FMA float path | T.J. Alumbaugh
2020-09-03 | AVX 8bit row major/col major packing code | T.J. Alumbaugh
2020-09-01 | AVX Pack inherits from StandardCpp | T.J. Alumbaugh
2020-08-31 | AVX 8bit kernel. Forked from AVX2+FMA version | T.J. Alumbaugh
2020-08-24 | Update README.md | Benoit Jacob
2020-08-21 | Changes are excluded via Copybara | Ruy Contributors
2020-08-21 | Changes are excluded via Copybara | Ruy Contributors
2020-08-19 | Rename kOutOfOrder -> kGeneric, kInOrder -> kA55ish, | Benoit Jacob
2020-08-19 | Reimplement :tune on top of :cpuinfo. | Benoit Jacob
2020-08-17 | Add bzl_library rules for .bzl files without one. | Ruy Contributors
2020-07-31 | Adds AVX float packing code. | T.J. Alumbaugh
2020-07-31 | Adds AVX path and AVX float kernel. | T.J. Alumbaugh
2020-07-30 | Adds AVX path and AVX float kernel. | T.J. Alumbaugh
2020-07-29 | Adds AVX path and AVX float kernel. | T.J. Alumbaugh
2020-07-28 | Print extra information in case of disagreeing TestResults. | T.J. Alumbaugh
2020-07-27 | check_macros improvements: promote operands before comparisons (avoids -Wsign... | Benoit Jacob
2020-07-27 | Add missing #include of <cstring>. | Benoit Jacob
2020-07-25 | Simplify bias-loading code now that bias buffers are always rounded up to mul... | Benoit Jacob
2020-07-25 | Use lambdas to shorten source code like we did in the avx512 kernel. | Benoit Jacob
2020-07-25 | Handle per-column multipliers in the avx512 kernel without transposing the 16... | Benoit Jacob
2020-07-24 | Optimized packing code path for row-major 8bit inputs for the x86 paths. | Benoit Jacob
2020-07-24 | Optimized packing code path for row-major 8bit inputs for the kNeon path. Wri... | Benoit Jacob
2020-07-21 | Use lambdas to shorten Kernel8bitAvx512's source code, and to split the resul... | Benoit Jacob
2020-07-21 | Optimized packing code path for row-major float inputs. | Benoit Jacob
2020-07-20 | Optimized packing code path for row-major 8bit inputs for the kNeonDotprod path. | Benoit Jacob
2020-07-15 | Fix the build on some toolchains - a missing #include<cstring> and some avx51... | Benoit Jacob
2020-07-15 | Rename packing code implementation functions now that they are explicitly abo... | Benoit Jacob
2020-07-15 | Templatize packing code paths on the source order, so that we support any com... | Benoit Jacob
2020-07-14 | Simplification of FallBackToStandardCpp now that we are past the incremental ... | Benoit Jacob
2020-07-14 | Efficient support for any channel_dimension for quantized kernels on AVX-512,... | Benoit Jacob
2020-07-14 | Efficient support for any channel_dimension for quantized kernels on AVX-512,... | Benoit Jacob
2020-07-14 | Efficient support for any channel_dimension for quantized kernels on AVX2. | Benoit Jacob
2020-07-14 | Simplify x86 kernels by using the fact that there always is a per-channel buf... | Benoit Jacob
2020-07-14 | Simplify x86 kernels thanks to the new fact that perchannel buffers are round... | Benoit Jacob
2020-07-13 | Fix runtime detection of support for our AVX2+FMA code path: we were only che... | Benoit Jacob
2020-07-13 | FMA is technically a separate ISA extension from AVX2. | Benoit Jacob
2020-07-13 | Efficient support for any channel_dimension for float kernels on AVX-512. | Benoit Jacob
2020-07-13 | Efficient support for any channel_dimension for float kernels on AVX2. | Benoit Jacob
2020-07-13 | Allow the user to specify that they have allocated a slightly larger capacity... | Benoit Jacob