git.blender.org/blender.git

Age	Commit message	Author
2021-10-26	Cycles: add additive AO support through Fast GI settings	Brecht Van Lommel
	Add a Fast GI Method, either Replace for the existing behavior, or Add to add ambient occlusion like the old world settings. This replaces the old Ambient Occlusion settings in the world properties.
2021-10-26	Cycles: restore Denoising Depth pass, when enabling Denoising Data passes	Brecht Van Lommel
	This is still useful in some cases even if not used by OpenImageDenoise. In the future this may be replaced with a more generic system to control render passes and filtering, but for now this just does what it did before.
2021-10-26	Cycles: change Position render pass to be not antialiased	Brecht Van Lommel
	Similar to the Depth pass: for compositing, values interpolated between a far and a near object can be nonsensical.
2021-10-22	Fix Cycles HIP binaries always recompiling	Brecht Van Lommel

2021-10-22	Cleanup: refactor float/half conversions for clarity	Brecht Van Lommel

2021-10-22	Cycles: various fixes for HIP and compilation of HIP binaries	Sayak Biswas
	* Additional structs added to the hipew loader for device props
	* Adds hipRTC functions to the loader for future usage
	* Enables CPU+GPU usage for HIP
	* Cleanup to the adaptive kernel compilation process
	* Fix for kernel compilation failures with HIP with latest master
	Ref T92393, D12958
2021-10-21	Fix T92363: OptiX fails with ambient occlusion node, after recent changes	Brecht Van Lommel
	This triggered a compiler bug where it does not handle the sub.s16 PTX instruction. Instead refactor the code so we don't need to do uint16_t subtraction at all. Also update OptiX device to remove the AO pass direct callable. Thanks Patrick Mours for figuring this out.
2021-10-21	Cycles: add shadow path compaction for GPU rendering	Brecht Van Lommel
	Similar to main path compaction that happens before adding work tiles, this compacts shadow paths before launching kernels that may add shadow paths. Only do it when more than 50% of space is wasted. It's not a clear win in all scenes; some are up to 1.5% slower, likely because a different kernel scheduling order has an unpredictable performance impact. Compaction still seems like the right thing to do, to avoid cases where a few shadow paths can hold up a lot of main paths. Differential Revision: https://developer.blender.org/D12944
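The 50% rule in this commit can be illustrated with a small sketch. This is not the Cycles implementation: `EMPTY`, `used_range`, `should_compact` and `compact` are hypothetical names, and "wasted space" is assumed here to mean empty slots below the highest slot still in use.

```cpp
#include <algorithm>
#include <cstddef>
#include <vector>

// Hypothetical marker for a slot that holds no shadow path.
constexpr int EMPTY = -1;

// Slots that a kernel would have to visit: index of the last live slot + 1.
inline std::size_t used_range(const std::vector<int> &slots)
{
  std::size_t n = slots.size();
  while (n > 0 && slots[n - 1] == EMPTY) {
    --n;
  }
  return n;
}

// Assumed policy: compact only if more than half of the used range is empty.
inline bool should_compact(const std::vector<int> &slots)
{
  const std::size_t range = used_range(slots);
  const std::size_t live = static_cast<std::size_t>(
      std::count_if(slots.begin(), slots.begin() + range, [](int s) { return s != EMPTY; }));
  return (range - live) * 2 > range;
}

// Move live paths to the front, keeping their order, and clear the tail.
inline void compact(std::vector<int> &slots)
{
  auto tail = std::remove(slots.begin(), slots.end(), EMPTY);
  std::fill(tail, slots.end(), EMPTY);
}
```

With slots `{7, EMPTY, EMPTY, EMPTY, 9, EMPTY}` the used range is 5 and 3 of those slots are empty, so the sketch would compact; afterwards the used range shrinks to 2.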
2021-10-20	Cleanup: remove unused code	Brecht Van Lommel

2021-10-20	Cleanup: some renaming to better distinguish main and shadow paths	Brecht Van Lommel

2021-10-20	Cycles: make ambient occlusion pass take into account transparency again	Brecht Van Lommel
	This takes advantage of the new decoupled main and shadow paths. For CPU we just store two nested structs in the integrator state, one for direct light shadows and one for AO. For the GPU we restrict the number of shade surface states to be executed based on available space in the shadow paths queue. This also helps improve performance in benchmark scenes with an AO pass, since the shader raytracing kernel, which has worse performance, is no longer needed there. Differential Revision: https://developer.blender.org/D12900
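The GPU restriction described in this commit can be pictured as a cap on how many shade surface states are launched at once. A minimal sketch, assuming each state may add at most two shadow paths (one for direct light, one for AO); the function name and that per-state maximum are illustrative assumptions, not taken from the Cycles source.

```cpp
#include <algorithm>

// Assumed upper bound on shadow paths created by one shade surface state.
constexpr int MAX_SHADOW_PATHS_PER_STATE = 2;

// How many queued shade surface states can run without overflowing the
// shadow paths queue (hypothetical helper).
inline int num_states_to_shade(int queued_states, int shadow_capacity, int shadow_in_use)
{
  const int free_slots = std::max(shadow_capacity - shadow_in_use, 0);
  return std::min(queued_states, free_slots / MAX_SHADOW_PATHS_PER_STATE);
}
```

When the result is zero, a scheduler built this way would have to run shadow kernels first to free up space before shading more surfaces.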
2021-10-20	HIP device code cleanup and fix for high VRAM usage	Sayak Biswas
	This patch cleans up code for the HIP device and makes it more consistent with the CUDA code. It also fixes the issue with high VRAM usage on AMD cards using HIP, allowing better performance and usage on cards like the 6600XT. Added a check in intern/cycles/kernel/bvh/bvh_util.h to prevent a compiler error with hipcc.
	Reviewed By: brecht, leesonw
	Maniphest Tasks: T92124
	Differential Revision: https://developer.blender.org/D12834
2021-10-19	Cycles: bake transparent shadows for hair	Brecht Van Lommel
	These transparent shadows can be expensive to evaluate. Especially on the GPU they can lead to poor occupancy when only some pixels require many kernel launches to trace and evaluate many layers of transparency. Baked transparency allows tracing a single ray in many cases by accumulating the throughput directly in the intersection program without recording hits or evaluating shaders. Transparency is baked at curve vertices and interpolated; for most shaders this will look practically the same as actual shader evaluation. Fixes T91428, performance regression with spring demo file due to transparent hair, and makes it render significantly faster than Blender 2.93. Differential Revision: https://developer.blender.org/D12880
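The idea of accumulating throughput from baked per-vertex transparency can be sketched as below. This is an illustration under assumptions: `CurveHit` and both function names are hypothetical, transparency is reduced to a single float, and linear interpolation along the segment is assumed.

```cpp
#include <vector>

// Hypothetical record of one curve segment crossed by the shadow ray:
// transparency baked at the two segment vertices, and the hit position
// u in [0, 1] along the segment.
struct CurveHit {
  float transparency_v0;
  float transparency_v1;
  float u;
};

// Linear interpolation of the baked vertex values at the hit position.
inline float interpolate_baked(const CurveHit &hit)
{
  return (1.0f - hit.u) * hit.transparency_v0 + hit.u * hit.transparency_v1;
}

// Multiply throughput across all hits; no hit is stored for later shader
// evaluation, which is the point of baking.
inline float shadow_throughput(const std::vector<CurveHit> &hits)
{
  float throughput = 1.0f;
  for (const CurveHit &hit : hits) {
    throughput *= interpolate_baked(hit);
  }
  return throughput;
}
```

In a real intersection program the multiplication would happen as each hit is found, so the list of hits never needs to exist; the vector here only keeps the sketch self-contained.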
2021-10-19	Cycles: avoid intermediate stack array for writing shadow intersections	Brecht Van Lommel
	Helps save one OptiX payload and is a bit more efficient. Differential Revision: https://developer.blender.org/D12909
2021-10-19	Cycles: decouple shadow paths from main path on GPU	Brecht Van Lommel
	The motivation for this is twofold. It improves performance (5-10% on most benchmark scenes), and will help to bring back transparency support for the ambient occlusion pass.
	* Duplicate some members from the main path state in the shadow path state.
	* Add shadow paths incrementally to the array similar to what we do for the shadow catchers.
	* For the scheduling, allow running shade surface and shade volume kernels as long as there is enough space in the shadow paths array. If not, execute shadow kernels until it is empty.
	* Add IntegratorShadowState and ConstIntegratorShadowState typedefs that can be different between CPU and GPU. For GPU both main and shadow paths just have an integer for SoA access. But with CPU it's a different pointer type so we get type safety checks in code shared between CPU and GPU.
	* For CPU, add a separate IntegratorShadowStateCPU struct embedded in IntegratorShadowState.
	* Update various functions to take the shadow state, and make SVM take either type of state using templates.
	Differential Revision: https://developer.blender.org/D12889
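The type safety point in this commit can be shown with a reduced sketch. The typedef names follow the commit message; the `SKETCH_GPU` switch, the struct fields, the `IntegratorStateCPU` layout and the helper function are assumptions made for illustration only.

```cpp
#include <type_traits>

#ifdef SKETCH_GPU /* hypothetical build switch */
/* Struct-of-arrays access: both kinds of state are plain indices, so the
 * compiler cannot tell them apart. */
typedef int IntegratorState;
typedef int IntegratorShadowState;
static float shadow_throughput_soa[1024];
#else
/* CPU: distinct struct types give distinct pointer types. */
struct IntegratorStateCPU {
  float throughput;
};
struct IntegratorShadowStateCPU {
  float throughput;
};
typedef IntegratorStateCPU *IntegratorState;
typedef IntegratorShadowStateCPU *IntegratorShadowState;
#endif

/* Shared code written against the typedef. On the CPU build, passing a main
 * path state here is a compile error. */
inline void shadow_scale_throughput(IntegratorShadowState state, float factor)
{
#ifdef SKETCH_GPU
  shadow_throughput_soa[state] *= factor;
#else
  state->throughput *= factor;
#endif
}
```

Compiling shared code for the CPU configuration therefore catches a mixed-up state argument even though the GPU configuration would accept it silently.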
2021-10-19	Cleanup: fix compiler warnings	Brecht Van Lommel

2021-10-19	Fix invalid principled diffuse in Cycles OSL	Sergey Sharybin
	Need to initialize components for the full Diffuse BSDF.
	Steps to reproduce:
	- Default cube scene
	- Switch to Cycles renderer
	- Enable OSL backend
	- Start viewport render
	- Observe the cube rendering much too dark
	Differential Revision: https://developer.blender.org/D12921
2021-10-19	Cleanup: More readable Cycles OSL BSDF definition	Sergey Sharybin
	Adds a Clang-Format configuration so that the closure definition block is properly recognized as such, plus a small wrapper macro to avoid a comma in the actual definition code, which was causing unwanted indentation of the parameter definitions. Requires Clang-Format 7 or newer. The version we ship in the libs is 12, so the recommended development setup should be fine. Differential Revision: https://developer.blender.org/D12920
2021-10-19	Cleanup: clang-format	Campbell Barton

2021-10-18	Revert "Cycles: optimize volume stack copying for shadow catcher/compaction"	Brecht Van Lommel
	This reverts commit 3065d2609700d14100490a16c91152a6e71790e8. Causing crashes in the spring scene.
2021-10-18	Cleanup: minor refactoring in preparation of main and shadow path decoupling	Brecht Van Lommel
	Ref D12889
2021-10-18	Cycles: reduce GPU state memory a little	Brecht Van Lommel
	* isect Ng is no longer needed for shadows, for main path needed for SSS only
	* Reduce rng_offset and queued_kernel to 16 bits
	Ref D12889
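As a rough illustration of why narrowing two fields matters, the arithmetic below estimates the saving for a struct-of-arrays state. The path count and the helper name are assumptions; only the two field names and the 16-bit width come from the commit message, and the previous width is assumed to have been 32 bits.

```cpp
#include <cstddef>
#include <cstdint>

// Bytes per path for the two fields, before (assumed 32-bit) and after.
constexpr std::size_t WIDE_BYTES = sizeof(uint32_t) /* rng_offset */ +
                                   sizeof(uint32_t) /* queued_kernel */;
constexpr std::size_t NARROW_BYTES = sizeof(uint16_t) /* rng_offset */ +
                                     sizeof(uint16_t) /* queued_kernel */;

// In a struct-of-arrays layout every field is its own array with one element
// per path, so the total is simply paths times bytes per path.
inline std::size_t soa_bytes(std::size_t num_paths, std::size_t bytes_per_path)
{
  return num_paths * bytes_per_path;
}
```

For a hypothetical one million (2^20) concurrent paths this works out to 4 MiB less state memory. The narrower fields only work if every stored value fits in 16 bits.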
2021-10-18	Cycles: optimize volume stack copying for shadow catcher/compaction	Brecht Van Lommel
	Only copy the number of items used instead of the max items. Ref D12889
2021-10-18	Cleanup: consistently use uint32_t for path flag	Brecht Van Lommel

2021-10-18	Cycles: replace integrator state argument macros	Brecht Van Lommel
	* Rename struct KernelGlobals to struct KernelGlobalsCPU
	* Add KernelGlobals, IntegratorState and ConstIntegratorState typedefs that every device can define in its own way.
	* Remove INTEGRATOR_STATE_ARGS and INTEGRATOR_STATE_PASS macros and replace with these new typedefs.
	* Add explicit state argument to INTEGRATOR_STATE and similar macros
	In preparation for decoupling main and shadow paths.
	Differential Revision: https://developer.blender.org/D12888
2021-10-15	Cycles: Voronoi noise, fix uninitialised variable	Charlie Jolly
	Caused a debug crash in Windows MSVS. Reviewed By: brecht Differential Revision: https://developer.blender.org/D12873
2021-10-15	Cleanup: refactor BVH2 shadow intersection for upcoming changes	Brecht Van Lommel

2021-10-15	Cleanup: refactor OptiX shadow intersection for upcoming changes	Brecht Van Lommel

2021-10-15	Cleanup: add utility functions for packing integers	Brecht Van Lommel

2021-10-15	Cleanup: refactor to make number of channels for shader evaluation variable	Brecht Van Lommel

2021-10-15	Fix T92128: Cycles CUDA wrong hair attributes, after recent changes	Brecht Van Lommel

2021-10-14	Cycles: Kernel address space changes for MSL	Michael Jones
	This is the first of a sequence of changes to support compiling Cycles kernels as MSL (Metal Shading Language) in preparation for a Metal GPU device implementation.
	MSL requires that all pointer types be declared with explicit address space attributes (device, thread, etc...). There is already precedent for this with Cycles' address space macros (ccl_global, ccl_private, etc...), therefore the first step of MSL-enablement is to apply these consistently. Line-for-line this represents the largest change required to enable MSL. Applying this change first will simplify future patches as well as offering the emergent benefit of enhanced descriptiveness.
	The vast majority of deltas in this patch fall into one of two cases:
	- Ensuring ccl_private is specified for thread-local pointer types
	- Ensuring ccl_global is specified for device-wide pointer types
	Additionally, the ccl_addr_space qualifier can be removed. Prior to Cycles X, ccl_addr_space was used as a context-dependent address space qualifier, but now it is either redundant (e.g. in struct typedefs), or can be replaced by ccl_global in the case of pointer types. Associated function variants (e.g. lcg_step_float_addrspace) are also redundant.
	In cases where address space qualifiers are chained with "const", this patch places the address space qualifier first. The rationale for this is that the choice of address space is likely to have the greater impact on runtime performance and overall architecture.
	The final part of this patch is the addition of a metal/compat.h header. This is partially complete and will be extended in future patches, paving the way for the full Metal implementation.
	Ref T92212
	Reviewed By: brecht
	Maniphest Tasks: T92212
	Differential Revision: https://developer.blender.org/D12864
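The two cases and the qualifier ordering described in this commit might look like the sketch below on a CPU build. The macro names come from the commit message; defining them as empty for the host and the example function itself are assumptions for illustration.

```cpp
// On a host (CPU) build the qualifiers can expand to nothing. A Metal build
// would instead map them to MSL address spaces, presumably `device` for
// ccl_global and `thread` for ccl_private (mapping assumed here).
#define ccl_global
#define ccl_private

// ccl_global marks a pointer to device-wide memory, ccl_private a pointer to
// thread-local memory. The address space qualifier is written before `const`,
// following the convention stated in the commit message.
inline void add_to_buffer(ccl_global float *buffer, ccl_private const float *value, int index)
{
  buffer[index] += *value;
}
```

Because the macros vanish on the CPU, the same function signature compiles unchanged for every backend that defines them.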
2021-10-14	Fix shadow catcher behind transparent object on GPU	Sergey Sharybin
	The assumption about absent shadow path was wrong. The rest of the changes are to ensure shadow paths are finished prior to the split, so that they write to the proper passes. The issue was caught by running regression tests on OptiX. Differential Revision: https://developer.blender.org/D12857
2021-10-12	Cleanup: spelling in comments	Campbell Barton

2021-10-11	Cycles: improve SSS Fresnel and retro-reflection in Principled BSDF	Brecht Van Lommel
	For details see the "Extending the Disney BRDF to a BSDF with Integrated Subsurface Scattering" paper. We split the diffuse BSDF into a lambertian and retro-reflection component. The retro-reflection component is always handled as a BSDF, while the lambertian component can be replaced by a BSSRDF. For the BSSRDF case, we compute Fresnel separately at the entry and exit points, which may have different normals. As the scattering radius decreases this converges to the BSDF case. A downside is that this increases noise for subsurface scattering in the Principled BSDF, due to some samples going to the retro-reflection component. However the previous logic (also in 2.93) was simply wrong, using a nonsensical view direction vector at the exit point. We use an importance sampling weight estimate for the retro-reflection to try to better balance samples between the BSDF and BSSRDF. Differential Revision: https://developer.blender.org/D12801
2021-10-11	Cycles: restore Christensen-Burley SSS	Brecht Van Lommel
	There is not enough time before the release to improve Random Walk to handle all cases this was used for, so restore it for now. Since there is no more path splitting in cycles-x, this can increase noise in non-flat areas for the same number of samples, though fewer rays will be traced also. This is fundamentally a trade-off we made in the new design and why Random Walk is a better fit. However the importance resampling we do now does help to reduce noise. Differential Revision: https://developer.blender.org/D12800
2021-10-08	Fix T91997: Cycles glass + SSS not rendering correctly	Brecht Van Lommel

2021-10-08	Fix Cycles speed regression after dynamic volume stack change	Sergey Sharybin
	Only copy the required part of the volume stack instead of the entire stack. Solves the time regression introduced by D12759 and avoids the need to implement a volume stack calculation that exactly matches what the path tracing will do (and potentially makes scenes with a lot of volumes and a tiny bit of deeply nested ones render faster). Still need to look into the memory aspect of the regression, but that is for a separate patch. Ref T92014 Maniphest Tasks: T92014 Differential Revision: https://developer.blender.org/D12790
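A partial copy of this kind can be sketched as follows. The sketch assumes the stack is terminated by a sentinel entry; `SHADER_NONE`, `MAX_VOLUME_STACK_SIZE`, the entry layout and the function name are stand-ins chosen for illustration and are not verified against the Cycles source.

```cpp
// Assumed sentinel marking the end of the used part of the stack.
constexpr int SHADER_NONE = -1;
// Assumed compile-time capacity of the stack.
constexpr int MAX_VOLUME_STACK_SIZE = 32;

struct VolumeStackEntry {
  int object;
  int shader;
};

// Copy entries up to and including the terminating sentinel and return how
// many were copied. Entries past the sentinel are left untouched.
inline int copy_volume_stack(const VolumeStackEntry *src, VolumeStackEntry *dst)
{
  int i = 0;
  for (; i < MAX_VOLUME_STACK_SIZE; ++i) {
    dst[i] = src[i];
    if (src[i].shader == SHADER_NONE) {
      return i + 1;
    }
  }
  return i;
}
```

For a scene with two nested volumes only three entries are copied instead of the full capacity, which is where the time saving would come from.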
2021-10-08	Cleanup: spelling	Campbell Barton

2021-10-07	Cleanup: remove unnecessary data from LocalIntersection	Brecht Van Lommel

2021-10-06	Cycles: fully decouple triangle and curve primitive storage from BVH2	Brecht Van Lommel
	Previously the storage here was optimized to avoid indirections in BVH2 traversal. This helps improve performance a bit, but makes performance and memory usage of Embree and OptiX BVHs a bit worse also. It also adds code complexity in other parts of the code.
	Now decouple triangle and curve primitive storage from BVH2.
	* Reduced peak memory usage on all devices
	* Bit better performance for OptiX and Embree
	* Bit worse performance for CUDA
	* Simplified code:
	  * Intersection.prim/object now matches ShaderData.prim/object
	  * No more offset manipulation for mesh displacement before a BVH is built
	  * Remove primitive packing code and flags for Embree and OptiX
	  * Curve segments are now stored in a KernelCurve struct
	* Also happens to fix a bug in baking with incorrect prim/object
	Fixes T91968, T91770, T91902
	Differential Revision: https://developer.blender.org/D12766
2021-10-06	Fix compilation error with MSVC	Sergey Sharybin
	MSVC does not support variable-size array definitions. Use the maximum possible stack size, similar to the GPU case. Not expected to make a user-measurable difference.
2021-10-06	Fix T91922: Cycles artifacts with high volume nested level	Sergey Sharybin
	Make the volume stack allocated conditionally, potentially based on the actual nesting level of objects in the scene. Currently the nesting level is estimated by the number of volume objects. This is an inexpensive check which is probably enough in practice to get almost perfect memory usage and performance. The conditional allocation is a bit tricky. For the CPU we declare and define the maximum possible volume stack, because there are only that many integrator states on the CPU. On the GPU we declare the outer SoA to have all volume stack elements, but only allocate the ones actually needed. The volume stack size actually used is passed as a pre-processor define, which seems to be the easiest and fastest option for the GPU state copy. There seems to be no speed regression in the demo files on RTX6000. Note that scenes with a high volume nesting level will now be slower but correct. Differential Revision: https://developer.blender.org/D12759
2021-10-06	Build: add ccache support for CUDA kernels on Linux	Brecht Van Lommel

2021-10-06	Fix T91064: Cycles low poly meshes having black edges when shade smoothed	Mikhail Matrosov
	Fixes: {T91064}
	Caused by {rBcd118c5581f482afc8554ff88b5b6f3b552b1682}
	- Applies `ensure_valid_reflection()` to the normal input on all BSDFs for CPU and GPU.
	- This doesn't affect hair.
	- Removes `ensure_valid_reflection()` from the output of Bump Map and Normal Map nodes for CPU/GPU as it is not needed.
	- The fix doesn't touch OSL.
	Reviewed By: brecht, leesonw
	Maniphest Tasks: T91064
	Differential Revision: https://developer.blender.org/D12403
2021-10-06	Cleanup: spelling in comments	Campbell Barton

2021-10-06	Cleanup: Remove data duplication from various lookup tables in Cycles	Jesse Yurkovich
	This effectively undoes some of the following commit: rB4537e8558468c71a03bf53f59c60f888b3412de2 The tables in question were duplicated 5-6 times into the blender executable due to the headers being used in multiple translation units. This contributes ~6.3kb worth of duplicate data into the binary. Some further details are in the below revision. Differential Revision: https://developer.blender.org/D12724
2021-10-05	Fix adaptive sampling artifacts on tile boundaries	Sergey Sharybin
	Implement overscan support for tiles, so that adaptive sampling can rely on the pixel neighbourhood. Differential Revision: https://developer.blender.org/D12599
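Tile overscan can be sketched as growing each tile rectangle by a margin so that filters near the tile edge see real neighbouring pixels. The `Rect` type, the function name and the clamping to the image bounds are assumptions for illustration, not the Cycles implementation.

```cpp
#include <algorithm>

// Hypothetical pixel rectangle: top-left corner plus size.
struct Rect {
  int x, y, width, height;
};

// Grow a tile by `overscan` pixels on every side, clamped to the image.
inline Rect tile_with_overscan(const Rect &tile, int overscan, int image_width, int image_height)
{
  const int x0 = std::max(tile.x - overscan, 0);
  const int y0 = std::max(tile.y - overscan, 0);
  const int x1 = std::min(tile.x + tile.width + overscan, image_width);
  const int y1 = std::min(tile.y + tile.height + overscan, image_height);
  return {x0, y0, x1 - x0, y1 - y0};
}
```

A 256x256 tile in the image corner with a 16 pixel overscan grows only on its two inner sides, while an interior tile grows on all four. Only the original tile area would be written to the final image; the margin exists for the neighbourhood lookups.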
2021-10-05	Cycles: improve detection of HIP compiler for buildbot	Brecht Van Lommel
	And fix various broken things in the HIP kernel compilation.
2021-10-04	Fix T91861: Black environment behind shadow catcher	Sergey Sharybin
	Always sample the background pass behind the shadow catcher (if the pass exists, of course), regardless of whether the shadow catcher will be used as approximate or accurate. Allows combining accurate shadows into an environment map. Differential Revision: https://developer.blender.org/D12747