Welcome to
mirror list
, hosted at
ThFree Co
, Russian Federation.
github.com/marian-nmt/intgemm.git - Unnamed repository; edit this file 'description' to name the repository.
index
:
github.com/marian-nmt/intgemm.git
4bit
absolute_std
add-postprocess-sigmoid
add127
benchmark_refactor
big-refactoring
compile_with_marian
debug_add127
hacky_nonmult8
log4-unstable
mac_support
marian-ssru
master
multiply-tiling
multiply-tiling-8x
multiply-tiling-manymatrices
overb
rearrangement-b
static
static-callbacks
static-multiply1x16
static-selectcolumnsb-colmajor
static-unquantize-and-add-bias
tests_for_kpu
unsigned
working
Unnamed repository; edit this file 'description' to name the repository.
www-data
summary
refs
log
tree
commit
diff
log msg
author
committer
range
Age
Commit message (
Expand
)
Author
2020-04-25
Add SelectColumnsB for column major order
static-selectcolumnsb-colmajor
Mateusz Chudyk
2020-04-24
Rudimentary tile benchmark. Keep in mind Multiply still needs optimization.
Kenneth Heafield
2020-04-24
Silence compiler warnings on 1<< overflow
Kenneth Heafield
2020-04-24
Extract randomly generated matrix class
Kenneth Heafield
2020-04-24
Comment
Kenneth Heafield
2020-04-24
Oops use memcmp in test for whole array
Kenneth Heafield
2020-04-24
Basic general sized multiply, not optimized yet
Kenneth Heafield
2020-04-24
Add empty check for Tile
Kenneth Heafield
2020-04-23
Comment ends of ifdefs
Kenneth Heafield
2020-04-23
General write working on AVX512, at least for tested cases
Kenneth Heafield
2020-04-23
Insane implementation of most cases for writing C. Still missing offset scat...
Kenneth Heafield
2020-04-23
Tests for unrolled inner dimension are tricky
Kenneth Heafield
2020-04-22
Lots of tests, including inner failing
Kenneth Heafield
2020-04-22
Fix TestMultiplyNoOverhangShapes to call kernel
Kenneth Heafield
2020-04-22
Merge remote-tracking branch 'origin/master' into static
Kenneth Heafield
2020-04-20
Merge pull request #73 from kpu/absolute_std
Kenneth Heafield
2020-04-20
Rename and fix interface
absolute_std
Nikolay Bogoychev
2020-04-20
Rename and move the if outside the hot loop
Nikolay Bogoychev
2020-04-20
Merge branch 'master' into absolute_std
Nikolay Bogoychev
2020-04-20
Fix OMP parallel wrap typing for Shift
Kenneth Heafield
2020-04-20
Workaround gcc bug producing extra move instructions
Kenneth Heafield
2020-04-19
Don't catch clang with the gcc hack, move VNNI to a function
Kenneth Heafield
2020-04-19
Fix comment
Kenneth Heafield
2020-04-19
Work around gcc _mm512_dpbusds_epi32 spurious vmovdqa64 instructions
Kenneth Heafield
2020-04-19
template argument for shuffle immediate
Kenneth Heafield
2020-04-19
Remove StaticLoop
Kenneth Heafield
2020-04-19
Change tile_test to variadic index_sequence
Kenneth Heafield
2020-04-19
Sum16To32 using variadic templates
Kenneth Heafield
2020-04-19
Replace StaticLoop with variadic template
Kenneth Heafield
2020-04-19
Document unordered_unfurl
Kenneth Heafield
2020-04-19
Header for std::size_t
Kenneth Heafield
2020-04-19
Change Index to size_t
Kenneth Heafield
2020-04-19
Switch reduce to taking RegisterPair
Kenneth Heafield
2020-04-19
Change to integer sequence for unrolling kernels
Kenneth Heafield
2020-04-18
Even more test configurations
Kenneth Heafield
2020-04-18
Test statically unrolled multiplies too
Kenneth Heafield
2020-04-18
Tiled multiply with basic testing work
Kenneth Heafield
2020-04-18
Merge remote-tracking branch 'origin/master' into static
Kenneth Heafield
2020-04-13
Juse use posix_memalign everywhere
Kenneth Heafield
2020-04-06
Merge pull request #77 from kpuatamazon/master
Kenneth Heafield
2020-04-05
Comments
Kenneth Heafield
2020-04-04
Test SSE2
Kenneth Heafield
2020-04-04
Rename Pack to Reduce
Kenneth Heafield
2020-04-04
More thoroughly test reduction code
Kenneth Heafield
2020-04-04
Does AVX512 reduce work?
Kenneth Heafield
2020-04-04
Reduce working for SSE2 and AVX2, working on AVX512
Kenneth Heafield
2020-04-02
Merge branch 'master' into static
Kenneth Heafield
2020-04-02
Merge pull request #76 from kpu/static-loop-empty-iterator
Mateusz Chudyk
2020-04-02
Add support for empty iterator to static loop
Mateusz Chudyk
2020-04-02
Reduction within 128-bit lanes
Kenneth Heafield
[next]