git.blender.org/blender.git - Unnamed repository; edit this file 'description' to name the repository.

Age	Commit message (Collapse)	Author
2020-11-23	Cycles: Remove Compilation Warning	Jeroen Bakker
	ROCm 3.9 already defined `NULL`. This patch will first check if it was already defined to remove compilation warnings. NOTE: This doesn't add official support for ROCm as it still fails to render correctly (crashes with default cube). Reviewed By: Brecht van Lommel Differential Revision: https://developer.blender.org/D9610
2020-11-09	Cycles: Fix tricubic sampling with NanoVDB	Patrick Mours
	Volumes using tricubic sampling were producing different results with NanoVDB compared to dense textures. This fixes that by using the same tricubic sampling algorithm in both cases. It also fixes some remaining offset issues and some minor things that broke OpenCL kernel compilation on NVIDIA. Reviewed By: brecht Differential Revision: https://developer.blender.org/D9491
2020-05-07	Fix T76469: OpenCL 1.2 Compilation	Jeroen Bakker
	Recent changes assumed OpenCL 2.0 platform. This adds a check to see if we are compiling on an OpenCL 2.0 platform. Patch was tested on: * AMD Radeon Pro WX 7100 with amdgpu-pro-19.50-1011208-ubuntu-18.04 drivers * AMD Vega 64 with amdgpu-pro-20.10-1048554-ubuntu-18.04 drivers * AMD RX 5700 with amdgpu-pro-20.10-1048554-ubuntu-18.04 drivers Reviewed By: Brecht van Lommel Differential Revision: https://developer.blender.org/D7637
2020-04-30	Fix T75895: Unable to Compile Cycles on NAVI/Linux	Jeroen Bakker
	This patch will add some compiler hints to break unrolling in the nestled for loops of the voronoi node. Reviewed by: Brecht van Lommel Differential Revision: https://developer.blender.org/D7574
2019-12-08	Fix T72282: Cycles OpenCL error after recent math node changes	Brecht Van Lommel

2019-08-26	Cycles: code to optionally zero initialize some structs in the kernel	Brecht Van Lommel
	This will be used by Optix to help the compiler figure out scoping. It is not used by other devices currently, but worth testing if it helps there too. Ref D5363
2019-08-26	Cycles: inline more functions on the GPU	Patrick Mours
	This makes little difference for CUDA and OpenCL, but will be helpful for Optix.
2019-04-17	ClangFormat: apply to source, most of intern	Campbell Barton
	Apply clang format as proposed in T53211. For details on usage and instructions for migrating branches without conflicts, see: https://wiki.blender.org/wiki/Tools/ClangFormat
2019-03-18	Fix AMD OpenCL build error after recent changes.	Brecht Van Lommel
	Always use native function since this was already the case due to __CL_USE_NATIVE__ not being defined in time, and seems to have caused no known issues.
2019-03-17	Cleanup: simplify kernel features definition.	Brecht Van Lommel
	No functional changes, logic here got too complex after many changes over the years.
2018-11-09	Cycles: Cleanup, spacing after preprocessor	Sergey Sharybin
	It is supposed to be two spaces before comment stating which if else/endif statements corresponds to. Was mainly violated in the header guards.
2018-07-18	Cycles: add Principled Hair BSDF.	L. E. Segovia
	This is a physically-based, easy-to-use shader for rendering hair and fur, with controls for melanin, roughness and randomization. Based on the paper "A Practical and Controllable Hair and Fur Model for Production Path Tracing". Implemented by Leonardo E. Segovia and Lukas Stockner, part of Google Summer of Code 2018.
2018-07-06	Cycles: Enabled half precision textures for OpenCL devices that support the ↵	Stefan Werner
	cl_khr_fp16 extension.
2018-07-06	Cleanup: strip trailing space for cycles	Campbell Barton

2018-05-24	Cycles: Cleanup: Remove duplicated atan2f definition for OpenCL	Lukas Stockner
	Turns out that atan2f was already defined for OpenCL.
2018-05-24	Cycles/Compositor: Add arctan2 operation to the Math node	Lukas Stockner
	The Math node currently has the normal atan() function, but for actual angles this is fairly useless without additional nodes to handle the signs. Since the node has two inputs anyways, it only makes sense to add an arctan2 option. Reviewers: sergey, brecht Differential Revision: https://developer.blender.org/D3430
2018-03-10	Cycles: support arbitrary number of motion blur steps for cameras.	Brecht Van Lommel

2017-10-07	Code refactor: make texture code more consistent between devices.	Brecht Van Lommel
	* Use common TextureInfo struct for all devices, except CUDA fermi. * Move image sampling code to kernels//kernel__image.h files. * Use arrays for data textures on Fermi too, so device_vector<Struct> works.
2017-10-05	Fix T53001: more workarounds for crash in AMD compiler with recent drivers.	Brecht Van Lommel

2017-08-08	Cycles: Add utility macro ccl_ref	Sergey Sharybin
	It is defined to & for CPU side compilation, and defined to an empty for any GPU platform. The idea here is to use this macro instead of #ifdef block with bunch of duplicated lines just to make it so CPU code is efficient. Eventually we might switch to references on CUDA as well, but that would require some intensive testing.
2017-08-08	Cycles: Pack kernel textures into buffers for OpenCL	Mai Lavelle
	Image textures were being packed into a single buffer for OpenCL, which limited the amount of memory available for images to the size of one buffer (usually 4gb on AMD hardware). By packing textures into multiple buffers that limit is removed, while simultaneously reducing the number of buffers that need to be passed to each kernel. Benchmarks were within 2%. Fixes T51554. Differential Revision: https://developer.blender.org/D2745
2017-08-07	Cycles: Fix compilation error on NVidia OpenCL after recent refactor	Sergey Sharybin
	Still need to verify this is proper thing to do for AMD OpenCL. At least now i can compile OpenCL kernel on my laptop with sm21 card.
2017-05-24	Cycles: Use falltrhough attribute to help catching missing break statements	Sergey Sharybin

2017-05-19	\0;115;0cCycles: Cleanup, use ccl_restrict instead of ccl_restrict_ptr	Sergey Sharybin
	There were following issues with ccl_restrict_ptr: - We already had ccl_restrict for all platforms. - It was secretly adding `const` qualifier to the declaration, which is quite weird since non-const pointer can also be declared as restricted. - We never in Blender are using foo_ptr or FooPtr type definitions, so not sure why we should introduce such a thing here. - It is absolutely wrong from semantic point of view to put pointer into the restrict macro -- const is a part of type, not part of hint for compiler that some pointer is never aliased.
2017-05-07	Cycles: Implement denoising option for reducing noise in the rendered image	Lukas Stockner
	This commit contains the first part of the new Cycles denoising option, which filters the resulting image using information gathered during rendering to get rid of noise while preserving visual features as well as possible. To use the option, enable it in the render layer options. The default settings fit a wide range of scenes, but the user can tweak individual settings to control the tradeoff between a noise-free image, image details, and calculation time. Note that the denoiser may still change in the future and that some features are not implemented yet. The most important missing feature is animation denoising, which uses information from multiple frames at once to produce a flicker-free and smoother result. These features will be added in the future. Finally, thanks to all the people who supported this project: - Google (through the GSoC) and Theory Studios for sponsoring the development - The authors of the papers I used for implementing the denoiser (more details on them will be included in the technical docs) - The other Cycles devs for feedback on the code, especially Sergey for mentoring the GSoC project and Brecht for the code review! - And of course the users who helped with testing, reported bugs and things that could and/or should work better!
2017-03-29	Cycles: Make all #include statements relative to cycles source directory	Sergey Sharybin
	The idea is to make include statements more explicit and obvious where the file is coming from, additionally reducing chance of wrong header being picked up. For example, it was not obvious whether bvh.h was refferring to builder or traversal, whenter node.h is a generic graph node or a shader node and cases like that. Surely this might look obvious for the active developers, but after some time of not touching the code it becomes less obvious where file is coming from. This was briefly mentioned in T50824 and seems @brecht is fine with such explicitness, but need to agree with all active developers before committing this. Please note that this patch is lacking changes related on GPU/OpenCL support. This will be solved if/when we all agree this is a good idea to move forward. Reviewers: brecht, lukasstockner97, maiself, nirved, dingto, juicyfruit, swerner Reviewed By: lukasstockner97, maiself, nirved, dingto Subscribers: brecht Differential Revision: https://developer.blender.org/D2586
2017-03-08	Cycles: OpenCL split kernel refactor	Mai Lavelle
	This does a few things at once: - Refactors host side split kernel logic into a new device agnostic class `DeviceSplitKernel`. - Removes tile splitting, a new work pool implementation takes its place and allows as many threads as will fit in memory regardless of tile size, which can give performance gains. - Refactors split state buffers into one buffer, as well as reduces the number of arguments passed to kernels. Means there's less code to deal with overall. - Moves kernel logic out of OpenCL kernel files so they can later be used by other device types. - Replaced OpenCL specific APIs with new generic versions - Tiles can now be seen updating during rendering
2016-10-03	Fix Cycles CUDA performance on CUDA 8.0.	Brecht Van Lommel
	Mostly this is making inlining match CUDA 7.5 in a few performance critical places. The end result is that performance is now better than before, possibly due to less register spilling or other CUDA 8.0 compiler improvements. On benchmarks scenes, there are 3% to 35% render time reductions. Stack memory usage is reduced a little too. Reviewed By: sergey Differential Revision: https://developer.blender.org/D2269
2016-08-09	Fix Cycles CUDA adaptive kernel not working correctly after recent closure ↵	Brecht Van Lommel
	changes.
2016-07-11	Cycles: Use utility define for restrict pointers	Sergey Sharybin
	This way restrict can be used for CUDA and OpenCL as well. From quick tests in areas i've been testing this it might give some barely measurable %% of speedup, but it increases registers pressure. So use of this qualifier is still really limited.
2016-03-25	Cycles: Cleanup, indent nested preprocessor directives	Sergey Sharybin
	Quite straightforward, main trick is happening in path_source_replace_includes(). Reviewers: brecht, dingto, lukasstockner97, juicyfruit Differential Revision: https://developer.blender.org/D1794
2015-05-09	Cycles: OpenCL kernel split	George Kyriazis
	This commit contains all the work related on the AMD megakernel split work which was mainly done by Varun Sundar, George Kyriazis and Lenny Wang, plus some help from Sergey Sharybin, Martijn Berger, Thomas Dinges and likely someone else which we're forgetting to mention. Currently only AMD cards are enabled for the new split kernel, but it is possible to force split opencl kernel to be used by setting the following environment variable: CYCLES_OPENCL_SPLIT_KERNEL_TEST=1. Not all the features are supported yet, and that being said no motion blur, camera blur, SSS and volumetrics for now. Also transparent shadows are disabled on AMD device because of some compiler bug. This kernel is also only implements regular path tracing and supporting branched one will take a bit. Branched path tracing is exposed to the interface still, which is a bit misleading and will be hidden there soon. More feature will be enabled once they're ported to the split kernel and tested. Neither regular CPU nor CUDA has any difference, they're generating the same exact code, which means no regressions/improvements there. Based on the research paper: https://research.nvidia.com/sites/default/files/publications/laine2013hpg_paper.pdf Here's the documentation: https://docs.google.com/document/d/1LuXW-CV-sVJkQaEGZlMJ86jZ8FmoPfecaMdR-oiWbUY/edit Design discussion of the patch: https://developer.blender.org/T44197 Differential Revision: https://developer.blender.org/D1200
2015-05-09	Cycles: Initial work towards selective nodes support compilation	Sergey Sharybin
	The goal is to be able to compile kernel with nodes which are actually needed to render current scene, hence improving performance of the kernel, The idea is: - Have few node groups, starting with a group which contains nodes are used really often, and then couple of groups which will be extension of this one. - Have feature-based nodes disabling, so it's possible to disable nodes related to features which are not used with the currently used nodes group. This commit only lays down needed routines for this approach, actual split will happen later after gathering statistics from bunch of production scenes.
2014-12-25	Cleanup: Fix Cycles Apache header.	Thomas Dinges
	This was already mixed a bit, but the dot belongs there.
2014-10-08	Cycles: correct math wrappers	Campbell Barton
	include the parens around value before cast, in some cases was causing double/float promotion by only casting the left value.
2014-10-05	Fix T42081, OpenCL supports float3 since the 1.1 specification, not sure why ↵	Thomas Dinges
	we needed this.
2014-06-15	* Fix OpenCL after uchar4 commit.	Thomas Dinges

2014-04-07	OpenCL + AMD adapt kernel to newer driver	Martijn Berger

2014-01-15	Code cleanup: move half float functions to separate header file.	Brecht Van Lommel

2013-11-18	Cycles: change __device and similar qualifiers to ccl_device in kernel code.	Brecht Van Lommel
	This to avoids build conflicts with libc++ on FreeBSD, these __ prefixed values are reserved for compilers. I apologize to anyone who has patches or branches and has to go through the pain of merging this change, it may be easiest to do these same replacements in your code and then apply/merge the patch. Ref T37477.
2013-08-18	Cycles: relicense GNU GPL source code to Apache version 2.0.	Brecht Van Lommel
	More information in this post: http://code.blender.org/ Thanks to all contributes for giving their permission!
2013-06-27	Code cleanup: cycles	Brecht Van Lommel
	* Reshuffle SSE #ifdefs to try to avoid compilation errors enabling SSE on 32 bit. * Remove CUDA kernel launch size exception on Mac, is not needed. * Make OSL file compilation quiet like c/cpp files.
2013-06-08	Cycles / OpenCL:	Thomas Dinges
	* Fix for recent commits, ceilf is not available in OpenCL.
2013-05-27	Cycles OpenCL: patch #35514 by Doug Gale	Brecht Van Lommel
	* Support using devices from all OpenCL platforms, so that you can use e.g. both Intel and NVidia OpenCL implementations if you have them installed. * Fix compile error due to missing fmodf after recent math node change. * Enable advanced shading for Intel OpenCL. * CYCLES_OPENCL_DEBUG environment variable for generating debug symbols so you can debug with gdb. This crashes the compiler with Intel OpenCL on Linux though. To make this work the preprocessed kernel source code is written out, as gdb needs this. * Show OpenCL compiler warnings even if the build succeeded. * Some small fixes to initialize cdDevice to NULL, add missing NULL check when creating buffer and add missing space at end of build options for Apple OpenCL. * Fix crash with multi device + opencl, now e.g. CPU + GPU render should work. I did a few tweaks to the code and also: * Fix viewport render failing sometimes with Apple CPU OpenCL, was not taking workgroup size limits into account properly. * Add compile error when advanced shading in the Blender binary and OpenCL kernel are not in sync.
2013-05-09	Cycles OpenCL: a few fixes to get things compiling after kernel changes,	Brecht Van Lommel
	for Apple OpenCL on OS X 10.8 and simple AO render. Also environment variable CYCLES_OPENCL_TEST can now be set to CPU, GPU, ACCELERATOR, DEFAULT or ALL values to test particuler devices.
2013-04-02	Cycles: initial subsurface multiple scattering support. It's not working as	Brecht Van Lommel
	well as I would like, but it works, just add a subsurface scattering node and you can use it like any other BSDF. It is using fully raytraced sampling compatible with progressive rendering and other more advanced rendering algorithms we might used in the future, and it uses no extra memory so it's suitable for complex scenes. Disadvantage is that it can be quite noisy and slow. Two limitations that will be solved are that it does not work with bump mapping yet, and that the falloff function used is a simple cubic function, it's not using the real BSSRDF falloff function yet. The node has a color input, along with a scattering radius for each RGB color channel along with an overall scale factor for the radii. There is also no GPU support yet, will test if I can get that working later. Node Documentation: http://wiki.blender.org/index.php/Doc:2.6/Manual/Render/Cycles/Nodes/Shaders#BSSRDF Implementation notes: http://wiki.blender.org/index.php/Dev:2.6/Source/Render/Cycles/Subsurface_Scattering
2013-04-02	Cycles: code refactoring to add generic lookup table memory.	Brecht Van Lommel

2012-12-21	Fix cycles aliasing warnings caused by motion blur transforms.	Brecht Van Lommel

2012-04-30	Cycles: remove a few usages of double, to fix opencl warnings.	Brecht Van Lommel

2011-12-20	Cycles: some tweaks for apple opencl with ATI cards, to get it working up to	Brecht Van Lommel
	the level of ambient occlusion render, shaders still fail. Fixes found with much help from Jens and Dalai.