Age | Commit message (Collapse) | Author | |
---|---|---|---|
2017-11-19 | Merge branch 'master' into blender2.8 | Julian Eisel | |
2017-11-17 | Cycles: Add per-tile render time debug pass | Lukas Stockner | |
Reviewers: sergey, brecht Differential Revision: https://developer.blender.org/D2920 | |||
2017-11-13 | Merge remote-tracking branch 'origin/master' into blender2.8 | Dalai Felinto | |
2017-11-12 | Fix T53289: CUDA missing textures not showing pink, after recent changes. | Brecht Van Lommel | |
2017-11-11 | Cycles: Set error if a split kernel fails to load | Mai Lavelle | |
To help catch cases where adding a new kernel is missed for one of the device implementations. | |||
2017-11-10 | Merge branch 'master' into blender2.8 | Bastien Montagne | |
2017-11-09 | Cycles: avoid reallocating tile denoising memory many times during render. | Brecht Van Lommel | |
2017-11-09 | Merge branch 'master' into blender2.8 | Sergey Sharybin | |
2017-11-09 | Cycles: Replace __MAX_CLOSURE__ build option with runtime integrator variable | Mai Lavelle | |
Goal is to reduce OpenCL kernel recompilations. Currently viewport renders are still set to use 64 closures as this seems to be faster and we don't want to cause a performance regression there. Needs to be investigated. Reviewed By: brecht Differential Revision: https://developer.blender.org/D2775 | |||
2017-11-08 | Merge branch 'master' into blender2.8 | Brecht Van Lommel | |
2017-11-08 | Code refactor: rename subsurface to local traversal, for reuse. | Brecht Van Lommel | |
2017-11-08 | Cycles: add an extra CUDA synchronize before rendering. | Brecht Van Lommel | |
It should not be needed as far as I know, but just in case it fixes any of the recent issues like T52572. | |||
2017-11-06 | Merge branch 'master' into blender2.8 | Bastien Montagne | |
Conflicts: intern/cycles/device/device.cpp source/blender/blenkernel/intern/library.c source/blender/blenkernel/intern/material.c source/blender/editors/object/object_add.c source/blender/editors/object/object_relations.c source/blender/editors/space_outliner/outliner_draw.c source/blender/editors/space_outliner/outliner_edit.c source/blender/editors/space_view3d/drawobject.c source/blender/editors/util/ed_util.c source/blender/windowmanager/intern/wm_files_link.c | |||
2017-11-05 | Code refactor: device memory cleanups, preparing for mapped host memory. | Brecht Van Lommel | |
2017-11-05 | Cycles: reserve CUDA local memory ahead of time. | Brecht Van Lommel | |
This way we can log the amount of memory used, and it will be important for host mapped memory support. | |||
2017-11-04 | Merge branch 'master' into blender2.8 | Campbell Barton | |
2017-11-04 | Code refactor: replace CUDA array with linear memory for 1D and 2D textures. | Brecht Van Lommel | |
This is a prequisite for getting host memory allocation to work. There appears to be no support for 3D textures using host memory. The original version of this code was written by Stefan Werner for D2056. | |||
2017-11-03 | Fix T53247: mixed CPU + GPU render wrong texture limits. | Brecht Van Lommel | |
2017-11-02 | Merge branch 'master' into blender2.8 | Campbell Barton | |
2017-11-02 | Cycles: Add another limit to OpenCL memory usage | Mai Lavelle | |
Some drivers may report very large allocation sizes, which could cause unnecessary memory usage. This is now limited to 2gb which should still be enough to get the needed performance benefits without waste. | |||
2017-10-25 | Merge branch 'master' into blender2.8 | Sergey Sharybin | |
2017-10-25 | Fix one more assert being triggered due to recent changes. | Brecht Van Lommel | |
2017-10-25 | Code refactor: remove MEM_WRITE_ONLY, always use MEM_READ_WRITE. | Brecht Van Lommel | |
It's unlikely the driver can do useful optimizations with this, and if we sum multiple samples we are reading from the memory anyway. | |||
2017-10-24 | Fix T53146: incomplete multi GPU and CPU + GPU memory statistics. | Brecht Van Lommel | |
Part due to recent changes, part old bug. | |||
2017-10-24 | Merge branch 'master' into blender2.8 | Sergey Sharybin | |
2017-10-24 | Cycles: Fix compilation in debug mode | Sergey Sharybin | |
Please check compilation before committing refactor changes! | |||
2017-10-24 | Cycles: Fix compilation error without C++11 | Sergey Sharybin | |
2017-10-24 | Fix T53134: denoising with CPU + GPU render leaves some tiles noisy. | Brecht Van Lommel | |
2017-10-24 | Merge branch 'master' into blender2.8 | Brecht Van Lommel | |
2017-10-24 | Code refactor: move more memory allocation logic into device API. | Brecht Van Lommel | |
* Remove tex_* and pixels_* functions, replace by mem_*. * Add MEM_TEXTURE and MEM_PIXELS as memory types recognized by devices. * No longer create device_memory and call mem_* directly, always go through device_only_memory, device_vector and device_pixels. | |||
2017-10-24 | Code refactor: use device_only_memory and device_vector in more places. | Brecht Van Lommel | |
2017-10-24 | Code refactor: store device/interp/extension/type in each device_memory. | Brecht Van Lommel | |
2017-10-24 | Code refactor: pass device to scene, check OSL with device info. | Brecht Van Lommel | |
2017-10-23 | Merge branch 'master' into blender2.8 | Julian Eisel | |
2017-10-21 | Code refactor: avoid some unnecessary device memory copying. | Brecht Van Lommel | |
2017-10-21 | Cycles: combined CPU + GPU rendering support. | Brecht Van Lommel | |
CPU rendering will be restricted to a BVH2, which is not ideal for raytracing performance but can be shared with the GPU. Decoupled volume shading will be disabled to match GPU volume sampling. The number of CPU rendering threads is reduced to leave one core dedicated to each GPU. Viewport rendering will also only use GPU rendering still. So along with the BVH2 usage, perfect scaling should not be expected. Go to User Preferences > System to enable the CPU to render alongside the GPU. Differential Revision: https://developer.blender.org/D2873 | |||
2017-10-19 | Merge branch 'master' into blender2.8 | Sergey Sharybin | |
2017-10-19 | Cycles: Add extra logging in CUDA device detection code | Sergey Sharybin | |
2017-10-18 | Merge branch 'master' into blender2.8 | Campbell Barton | |
2017-10-18 | Fix T53098, T53079: OpenCL world texture errors after recent changes. | Brecht Van Lommel | |
2017-10-11 | Merge branch 'master' into blender2.8 | Sergey Sharybin | |
2017-10-11 | Cycles: Fix possible race condition when initializing devices list | Sergey Sharybin | |
2017-10-09 | Merge branch 'master' into blender2.8 | Campbell Barton | |
2017-10-08 | Cycles: schedule more work for non-display and compute preemption CUDA cards. | Brecht Van Lommel | |
This change affects CUDA GPUs not connected to a display or connected to a display but supporting compute preemption so that the display does not freeze. I couldn't find an official list, but compute preemption seems to be only supported with GTX 1070+ and Linux (not GTX 1060- or Windows). This helps improve small tile rendering performance further if there are sufficient samples x number of pixels in a single tile to keep the GPU busy. | |||
2017-10-08 | Fix T53017: Cycles not detecting AMD GPU when there is an NVidia GPU too. | Mathieu Menuet | |
Best guess is that cuInit() somehow interferes with the AMD graphics driver on Windows, and switching the initialization order to do OpenCL first seems to solve the issue. | |||
2017-10-08 | Code refactor: use DeviceInfo to enable QBVH and decoupled volume shading. | Brecht Van Lommel | |
2017-10-07 | Code refactor: make texture code more consistent between devices. | Brecht Van Lommel | |
* Use common TextureInfo struct for all devices, except CUDA fermi. * Move image sampling code to kernels/*/kernel_*_image.h files. * Use arrays for data textures on Fermi too, so device_vector<Struct> works. | |||
2017-10-06 | Merge branch 'master' into blender28 | Campbell Barton | |
2017-10-05 | Code refactor: split displace/background into separate kernels, remove luma. | Brecht Van Lommel | |
2017-10-05 | Fix incorrect CUDA remaining time estimate after previous commit. | Brecht Van Lommel | |