git.blender.org/blender.git

Age	Commit message	Author
2017-11-09	Cycles: Replace __MAX_CLOSURE__ build option with runtime integrator variable	Mai Lavelle
	Goal is to reduce OpenCL kernel recompilations. Currently viewport renders are still set to use 64 closures as this seems to be faster and we don't want to cause a performance regression there. Needs to be investigated. Reviewed By: brecht Differential Revision: https://developer.blender.org/D2775
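The idea in the commit above, replacing a compile-time closure limit with a value read at run time, can be illustrated with a minimal sketch. This is not Cycles code: `ShaderData`, `closure_alloc`, and `max_closures` as written here are hypothetical Python stand-ins, and the sketch only assumes that the limit is supplied when shader data is set up instead of being fixed in the build.

```python
class ShaderData:
    """Hypothetical stand-in for per-shading-point closure storage."""

    def __init__(self, max_closures):
        # The limit arrives as a runtime value (assumed here to come from
        # integrator settings), so changing it needs no recompilation.
        self.max_closures = max_closures
        self.closures = []

    def closure_alloc(self, closure):
        """Store a closure if there is room; return True on success."""
        if len(self.closures) >= self.max_closures:
            return False  # limit reached, closure is dropped
        self.closures.append(closure)
        return True


# Two setups with different limits share the same code path.
viewport = ShaderData(max_closures=64)
final = ShaderData(max_closures=2)
results = [final.closure_alloc(name) for name in ("diffuse", "glossy", "emission")]
```

With a limit of 2, the third allocation is rejected, while the same class instantiated with 64 would accept all three.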
2017-11-08	Code refactor: rename subsurface to local traversal, for reuse.	Brecht Van Lommel

2017-11-05	Cycles: reduce closure memory usage for emission/shadow shader data.	Brecht Van Lommel
	With a Titan Xp, reduces path trace local memory from 1092MB to 840MB. Benchmark performance was within 1% with both RX 480 and Titan Xp. Original patch was implemented by Sergey. Differential Revision: https://developer.blender.org/D2249
2017-09-20	Cycles: slightly improve BSDF sample stratification for path tracing.	Brecht Van Lommel
	Similar to what we did for area lights previously, this should help preserve stratification when using multiple BSDFs in theory. Improvements are not easily noticeable in practice though, because the number of BSDFs is usually low. Still nice to eliminate one sampling dimension.
2017-09-20	Code cleanup: refactor BSSRDF closure sampling, for next commit.	Brecht Van Lommel

2017-08-21	Fix T52470: cycles OpenCL hair rendering not working after recent changes.	Brecht Van Lommel

2017-08-19	Code cleanup: move rng into path state.	Brecht Van Lommel
	Also pass by value and don't write back now that it is just a hash for seeding and no longer an LCG state. Together this makes CUDA a tiny bit faster in my tests, but mainly simplifies code.
2017-06-13	Cycles: Cleanup, indentation	Sergey Sharybin

2017-06-10	Cycles: Faster split branched path tracing by sharing samples with inactive threads	Mai Lavelle
	Unlike regular path tracing, branched path tracing is usually used with lower sample counts, at least for primary rays. This means that there are fewer samples for the GPU to work on in parallel and rendering is slower. As there is less work overall, there are also more inactive threads during rendering with BPT. This patch makes use of those inactive rays to render branched samples in parallel with other samples. Each thread that is preparing for a branched sample will attempt to find an inactive thread, and if one is found the state for the sample is copied to that thread. Potentially, if there are enough inactive threads, 100s of branched samples could be generated from the same originating thread and run in parallel, giving large speedups. Gives 70% faster render for pavillion midday scene. 20-60% faster on BMW with car paint replaced with SSS/volumes.
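The sharing scheme described in the commit above can be sketched sequentially as follows. This is an illustration only, not the split kernel implementation: `ThreadSlot`, `find_inactive`, and `share_branched_samples` are hypothetical names, and a real GPU version would need atomic operations to claim a slot, which this single-threaded sketch omits.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class ThreadSlot:
    """Hypothetical per-thread state slot."""
    active: bool = False
    origin: Optional[int] = None  # index of the thread that shared the sample
    sample: Optional[int] = None  # which branched sample this slot renders


def find_inactive(slots: List[ThreadSlot], start: int) -> Optional[int]:
    """Return the index of an inactive slot, searching from `start`, or None."""
    n = len(slots)
    for offset in range(1, n):
        idx = (start + offset) % n
        if not slots[idx].active:
            return idx
    return None


def share_branched_samples(slots: List[ThreadSlot], origin: int, num_samples: int) -> List[int]:
    """Copy branched samples into inactive slots.

    Returns the samples that found no free slot and must still be
    rendered by the originating thread itself.
    """
    leftover = []
    for sample in range(num_samples):
        idx = find_inactive(slots, origin)
        if idx is None:
            leftover.append(sample)
            continue
        # Copy the sample state to the inactive thread and activate it.
        slots[idx] = ThreadSlot(active=True, origin=origin, sample=sample)
    return leftover


slots = [ThreadSlot(active=True), ThreadSlot(), ThreadSlot(), ThreadSlot(active=True)]
leftover = share_branched_samples(slots, origin=0, num_samples=3)
```

Here two of the three branched samples land in the free slots 1 and 2; the third finds no inactive slot and stays with the originating thread.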
2017-05-05	Cycles: Fix access array index of -1 in SSS and volume split kernels	Sergey Sharybin

2017-05-05	Cycles: Cleanup, indentation	Sergey Sharybin

2017-05-02	Cycles: Branched path tracing for the split kernel	Mai Lavelle
	This implements branched path tracing for the split kernel. General approach is to store the ray state at a branch point, trace the branched ray as normal, then restore the state as necessary before iterating to the next part of the path. A state machine is used to advance the indirect loop state, which avoids the need to add any new kernels. Each iteration the state machine recreates as much state as possible from the stored ray to keep overall storage down. It's kind of hard to keep all the different integration loops in sync, so this needs lots of testing to make sure everything is working correctly. We should probably start trying to deduplicate the integration loops more now. Nonbranched BMW is ~2% slower, while classroom is ~2% faster; other scenes could still use more testing. Reviewers: sergey, nirved Reviewed By: nirved Subscribers: Blendify, bliblubli Differential Revision: https://developer.blender.org/D2611
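The store, trace, restore pattern from the commit above can be sketched as a small state machine. This is a simplified illustration under assumptions, not the kernel code: the `Phase` values, `RayState` fields, and `trace_branch` are hypothetical, and the real integrator tracks far more state.

```python
from dataclasses import dataclass, replace
from enum import Enum


class Phase(Enum):
    """Hypothetical phases of the indirect loop at one branch point."""
    LIGHT = 0
    SURFACE = 1
    DONE = 2


@dataclass(frozen=True)
class RayState:
    throughput: float
    bounce: int


def trace_branch(state: RayState, phase: Phase) -> RayState:
    """Stand-in for tracing one branched ray; returns the modified state."""
    return replace(state, throughput=state.throughput * 0.5, bounce=state.bounce + 1)


def run_branch_point(state: RayState):
    saved = state          # store the ray state at the branch point
    phase = Phase.LIGHT
    visited = []
    while phase is not Phase.DONE:
        # Each iteration starts again from the stored ray, which is the
        # "restore" step; only the phase has to be kept between iterations.
        working = trace_branch(saved, phase)
        visited.append((phase.name, working.bounce))
        phase = Phase(phase.value + 1)  # advance the state machine
    return saved, visited


saved, visited = run_branch_point(RayState(throughput=1.0, bounce=0))
```

Because every branch is traced from the stored state, the branch point itself is unchanged once the loop finishes, and the only extra storage is one saved ray plus the current phase.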
2017-03-27	Cycles: Remove ccl_addr_space from RNG passed to functions	Hristo Gueorguiev
	Simplifies code quite a bit, making it shorter and easier to extend. Currently no functional changes for users, but is required for the upcoming work of shadow catcher support with OpenCL.
2017-03-17	Cycles: Fix handling of barriers	Mai Lavelle

2017-03-16	Cycles: Define ccl_local variables in kernel functions	Sergey Sharybin
	Declaring ccl_local in a device function is not supported by certain compilers.
2017-03-13	Cycles: Add missing header in the file	Sergey Sharybin

2017-03-09	Cycles: SSS and Volume rendering in split kernel	Hristo Gueorguiev
	Decoupled ray marching is not supported yet. Transparent shadows are always enabled for volume rendering. Changes in kernel/bvh and kernel/geom are from Sergey. This simplifies code significantly, and prepares it for the record-all transparent shadow function in the split kernel.