github.com/google/ruy.git
Age | Commit message | Author
2020-08-21 | Changes are excluded via Copybara | Ruy Contributors
2020-08-19 | Rename kOutOfOrder -> kGeneric, kInOrder -> kA55ish, | Benoit Jacob
2020-08-19 | Reimplement :tune on top of :cpuinfo. | Benoit Jacob
2020-08-17 | Add bzl_library rules for .bzl files without one. | Ruy Contributors
2020-07-31 | Adds AVX float packing code. | T.J. Alumbaugh
2020-07-31 | Adds AVX path and AVX float kernel. | T.J. Alumbaugh
2020-07-30 | Adds AVX path and AVX float kernel. | T.J. Alumbaugh
2020-07-29 | Adds AVX path and AVX float kernel. | T.J. Alumbaugh
2020-07-28 | Print extra information in case of disagreeing TestResults. | T.J. Alumbaugh
2020-07-27 | check_macros improvements: promote operands before comparisons (avoids -Wsign... | Benoit Jacob
2020-07-27 | Add missing #include of <cstring>. | Benoit Jacob
2020-07-25 | Simplify bias-loading code now that bias buffers are always rounded up to mul... | Benoit Jacob
2020-07-25 | Use lambdas to shorten source code like we did in the avx512 kernel. | Benoit Jacob
2020-07-25 | Handle per-column multipliers in the avx512 kernel without transposing the 16... | Benoit Jacob
2020-07-24 | Optimized packing code path for row-major 8bit inputs for the x86 paths. | Benoit Jacob
2020-07-24 | Optimized packing code path for row-major 8bit inputs for the kNeon path. Wri... | Benoit Jacob
2020-07-21 | Use lambdas to shorten Kernel8bitAvx512's source code, and to split the resul... | Benoit Jacob
2020-07-21 | Optimized packing code path for row-major float inputs. | Benoit Jacob
2020-07-20 | Optimized packing code path for row-major 8bit inputs for the kNeonDotprod path. | Benoit Jacob
2020-07-15 | Fix the build on some toolchains - a missing #include<cstring> and some avx51... | Benoit Jacob
2020-07-15 | Rename packing code implementation functions now that they are explicitly abo... | Benoit Jacob
2020-07-15 | Templatize packing code paths on the source order, so that we support any com... | Benoit Jacob
2020-07-14 | Simplification of FallBackToStandardCpp now that we are past the incremental ... | Benoit Jacob
2020-07-14 | Efficient support for any channel_dimension for quantized kernels on AVX-512,... | Benoit Jacob
2020-07-14 | Efficient support for any channel_dimension for quantized kernels on AVX-512,... | Benoit Jacob
2020-07-14 | Efficient support for any channel_dimension for quantized kernels on AVX2. | Benoit Jacob
2020-07-14 | Simplify x86 kernels by using the fact that there always is a per-channel buf... | Benoit Jacob
2020-07-14 | Simplify x86 kernels thanks to the new fact that perchannel buffers are round... | Benoit Jacob
2020-07-13 | Fix runtime detection of support for our AVX2+FMA code path: we were only che... | Benoit Jacob
2020-07-13 | FMA is technically a separate ISA extension from AVX2. | Benoit Jacob
2020-07-13 | Efficient support for any channel_dimension for float kernels on AVX-512. | Benoit Jacob
2020-07-13 | Efficient support for any channel_dimension for float kernels on AVX2. | Benoit Jacob
2020-07-13 | Allow the user to specify that they have allocated a slightly larger capacity... | Benoit Jacob
2020-07-09 | Fix ARM32 packing code reading past the end of the source matrix, and finishi... | Benoit Jacob
2020-07-09 | Add comments and some minor simplications to packing code. | Benoit Jacob
2020-07-09 | Avoid overrunning per-channel buffers, whose size is that of the correspondin... | Benoit Jacob
2020-07-09 | Minor optimization of in-order arm64 kernels, interleave the dup's used in th... | Benoit Jacob
2020-07-09 | Minor simplification of arm32 assembly: the add instruction itself can be con... | Benoit Jacob
2020-07-09 | Efficient support for any channel_dimension for quantized kernels on ARM32. | Benoit Jacob
2020-07-09 | Efficient support for any channel_dimension for float kernels on ARM32. | Benoit Jacob
2020-07-08 | Efficient support for any channel_dimension for kNeonDotprod quantized kernel... | Benoit Jacob
2020-07-08 | Efficient support for any channel_dimension for kNeon quantized kernels on AR... | Benoit Jacob
2020-07-08 | Ensure that the 1Col kernels are not used with channel_dimension==kCol, so th... | Benoit Jacob
2020-07-08 | Efficient support for any channel_dimension for float kernels on ARM64. | Benoit Jacob
2020-07-08 | Groundwork to pass channel_dimension down to kernels and to incrementally ena... | Benoit Jacob
2020-07-07 | Revisiting RUY_OPT(AVOID_ALIASING). | Benoit Jacob
2020-07-07 | Fix benchmarking of caching. | Benoit Jacob
2020-07-06 | Allow benchmarking any combination of storage orders, and disable the randomi... | Benoit Jacob
2020-07-06 | Allow disabling the reference path in the benchmark. | Benoit Jacob
2020-07-02 | Start of a documentation directory. | Benoit Jacob