github.com/torch/cutorch.git
Age | Commit message | Author
2017-09-12 | fix alignment warning (HEAD, master) | Natalia Gimelshein
2017-09-10 | Optimize pow for different exponents and add tests | Francisco Massa
2017-09-07 | fix static linkage and make THD statically linked | Soumith Chintala
2017-08-29 | Fix grid size for batch cat tensor now that getApplyGrid has been changed. | Christian Sarofeen
2017-08-27 | Allowing larger grids for THCApply shows improved performance. | Christian Sarofeen
2017-08-25 | Fix typos. | Zhou Mo
2017-08-25 | add ones_like and zeros_like | Alykhan Tejani
2017-08-25 | cuda 9 hgemm fix | Soumith Chintala
2017-08-25 | Updates for CUDA 9 | Christian Sarofeen
2017-08-17 | fixing the bug with squeezing a singleton dimension in torch.min and torch.max | Anton Osokin
2017-08-17 | Add CUDA version of eye | Francisco Massa
2017-08-15 | accumulate in accType for reductions over dimensions | Natalia Gimelshein
2017-08-15 | Support __neg__, .neg(), and neg_() for Long, Int, Short tensor types. | Gregory Chanan
2017-08-10 | call gemmStridedBatched for cuda >=8 to avoid calling kernels to set up pointers (#794) | ngimel
2017-08-05 | move normal variants to TH/THC | Trevor Killeen
2017-07-21 | Fix torch.inverse when magma is not available | Sam Gross
    Fixes #2156
2017-07-19 | Add CUDA support for arange | Francisco Massa
    Also enables CUDA for range
2017-07-19 | add explicit BLAS linkage to THC when linked against magma (in binary build) | Soumith Chintala
2017-07-19 | move to model with cuda indexing tensors for cuda tensor adv indexing | Trevor Killeen
2017-07-18 | fix baddbmm for expanded tensors | Natalia Gimelshein
2017-07-17 | fix cwrap | soumith
2017-07-15 | fix cwrap for std/var | Soumith Chintala
2017-07-15 | Wrap unbiased flag in var, std, varall, stdall | Luca Antiga
2017-07-14 | add launch_bounds to greedy kernels | Natalia Gimelshein
2017-07-13 | Advanced Indexing: Calculate linear offsets directly on the GPU when working with CUDA Tensors | Trevor Killeen
2017-07-13 | Check for shared_mem size in multinomial single-sample implementation | Pan He
    Handle limited shared memory on function torch.multinomial Update THCTensorRandom.cu
2017-07-12 | Avoid two unnecessary copies in addmm backward | Sam Gross
    The `r_` and `t` tensors become different objects, even though they point to the same data. Avoid the copy whenever beta=0.
2017-07-11 | Alias multinomial sampling in Cuda (#784) | Amartya Sanyal
    * Support Multinomial Alias sampling in cuda Moving benchmark file
    * Review changes
2017-07-04 | add missing definition | Soumith Chintala
2017-07-04 | Have median reduce over all dims and return just the value when dim is not provided | Luca Antiga
2017-07-03 | Add a nonContigDim reduction kernel to improve latency for small tensors. (#768) | Christian Sarofeen
2017-07-03 | Make reduction functors accept only constant arguments (#753) | ngimel
    (similar to MaxValuePair and MinValuePair above).
2017-06-29 | Warp intrinsic fixes (#785) | ngimel
2017-06-26 | support more than 8 gpus (#774) | Sergey Zagoruyko
2017-06-26 | Fp16 fixes for CUDA 9 (#783) | Christian Sarofeen
2017-06-23 | Advanced Indexing Part 1 -- Purely Integer Array Indexing | Trevor Killeen
2017-06-22 | Remove THCTensor_(expand2) and THCTensor_(expand3). | Gregory Chanan
    They are no longer needed and the corresponding TH versions have been removed.
2017-06-22 | btrifact: Make pivoting optional. | Brandon Amos
2017-06-13 | Short-circuit copy if src and dest are equal. | Edward Z. Yang
    Signed-off-by: Edward Z. Yang <ezyang@fb.com>
2017-06-11 | Remove raiseErrors from THTensor functions, have THStorage functions take an error_buffer to return a proper error message while being able to handle memory management correctly from calling function. | Gregory Chanan
2017-06-11 | Incorporate review comments: | Gregory Chanan
    1) Line up trailing dimensions in broadcast docs.
    2) remove unnecessary expand_as in common_nn test.
    3) use view in tensor_str instead of resize_.
    4) newExpand remove raiseErrors change.
    5) clarify expandedSizes/expandedStrides parameters in inferExpandGeometry.
    6) simplify inferSize2/inferSizeN implementations.
    7) use new-style classes for warning.
2017-06-11 | Add broadcasting support for copy_, simplify code generation by moving a lot of currently generated code to expand_utils. | Gregory Chanan
2017-06-11 | Support "fused" ops: addcmul/addcdiv. | Gregory Chanan
2017-06-11 | Expand improvements | Gregory Chanan
    1) Rename calculateExpandGeometry to inferExpandGeometry for consistency
    2) Simplify inferExpandGeometry implementation by using a single pass through dimensions
    3) Implement a two operand expansion, expand2.
    4) Implement versions that return error code to use for fallback to equal nElem support.
2017-06-05 | Fix sharing of CUDA tensors on non-current devices | Sam Gross
2017-06-02 | substitute cudnnFind* functions with cudnnFind*Ex | Alexey Romanenko
2017-05-25 | Add scatterAdd | Adam Paszke
2017-05-15 | Cuda reduce in a consistent direction | Rudy Bunel
2017-05-10 | Make torch.cat not synchronize the host and device | Sam Gross
2017-05-10 | Add keepdim to lua cwrap. (#763) | gchanan