git.blender.org/blender.git - Unnamed repository; edit this file 'description' to name the repository.

Age	Commit message (Collapse)	Author
2017-10-16	Merge remote-tracking branch 'origin/master' into openvdbopenvdb	Lukas Tönne

2017-10-08	Code refactor: use DeviceInfo to enable QBVH and decoupled volume shading.	Brecht Van Lommel

2017-10-04	Code refactor: remove rng_state buffer and compute hash on the fly.	Brecht Van Lommel
	A little faster on some benchmark scenes, a little slower on others, seems about performance neutral on average and saves a little memory.
2017-10-04	Code refactor: add WorkTile struct for passing work to kernel.	Brecht Van Lommel
	This makes sharing some code between mega/split in following commits a bit easier, and also paves the way for rendering multiple tiles later.
2017-09-29	Fix Cycles OpenCL compiler error after recent changes.	Brecht Van Lommel

2017-09-28	Cycles: reduce subsurface stack memory usage.	Brecht Van Lommel
	This is done by storing only a subset of PathRadiance, and by storing direct light immediately in the main PathRadiance. Saves about 10% of CUDA stack memory, and simplifies subsurface indirect ray code.
2017-09-20	Cycles: slightly improve BSDF sample stratification for path tracing.	Brecht Van Lommel
	Similar to what we did for area lights previously, this should help preserve stratification when using multiple BSDFs in theory. Improvements are not easily noticeable in practice though, because the number of BSDFs is usually low. Still nice to eliminate one sampling dimension.
2017-09-13	Code cleanup: store branch factor in PathState.	Brecht Van Lommel

2017-09-13	Code cleanup: abstract shadow catcher logic more into accumulation code.	Brecht Van Lommel

2017-09-12	Cycles: improve sample stratification on area lights for path tracing.	Brecht Van Lommel
	Previously we used a 1D sequence to select a light, and another 2D sequence to sample a point on the light. For multiple lights this meant each light would get a random subset of a 2D stratified sequence, which is not guaranteed to be stratified anymore. Now we use only a 2D sequence, split into segments along the X axis, one for each light. The samples that fall within a segment then each are a stratified sequence, at least in the limit. So for example for two lights, we split up the unit square into two segments [0,0.5[ x [0,1[ and [0.5,1[ x [0,1[. This doesn't make much difference in most scenes, mainly helps if you have a few large area lights or some types of HDR backgrounds.
2017-08-24	Code cleanup: remove shader context.	Brecht Van Lommel
	This was needed when we accessed OSL closure memory after shader evaluation, which could get overwritten by another shader evaluation. But all closures are immediatley converted to ShaderClosure now, so no longer needed.
2017-08-20	Cycles: support baking normals plugged into BSDFs, averaged with closure weight.	Brecht Van Lommel

2017-08-19	Code cleanup: move rng into path state.	Brecht Van Lommel
	Also pass by value and don't write back now that it is just a hash for seeding and no longer an LCG state. Together this makes CUDA a tiny bit faster in my tests, but mainly simplifies code.
2017-08-13	Code cleanup: make L_transparent part of PathRadiance.	Brecht Van Lommel

2017-08-13	Code cleanup: make DebugData part of PathRadiance.	Brecht Van Lommel

2017-08-11	Cycles: Clarify new argument in PathRadiance	Sergey Sharybin

2017-08-11	Fix T52229: Shadow Catcher artifacts when under transparency	Sergey Sharybin
	Added some extra tirckery to avoid background being tinted dark with transparent surface. Maybe a bit hacky, but seems to work fine.
2017-08-07	Fix Cycles shadow catcher objects influencing each other.	Brecht Van Lommel
	Since all the shadow catchers are already assumed to be in the footage, the shadows they cast on each other are already in the footage too. So don't just let shadow catchers skip self, but all shadow catchers. Another justification is that it should not matter if the shadow catcher is modeled as one object or multiple separate objects, the resulting render should be the same. Differential Revision: https://developer.blender.org/D2763
2017-08-05	Cycles: remove min bounces, modify RR to terminate less.	Brecht Van Lommel
	Differential Revision: https://developer.blender.org/D2766
2017-07-20	Fix T52125: principled BSDF missing with macOS OpenCL.	Brecht Van Lommel

2017-07-18	Fix T52021: Shadow catcher renders wrong when catcher object is behind ↵	Sergey Sharybin
	transparent object Tweaked the path radiance summing and alpha to accommodate for possible contribution of light by transparent surface bounces happening prior to shadow catcher intersection. This commit will change the way how shadow catcher results looks when was behind semi transparent object, but the old result seemed to be fully wrong: there were big artifacts when alpha-overing the result on some actual footage.
2017-06-14	Cycles: Fix typo in comment	Sergey Sharybin

2017-06-10	Cycles: Selectively include denoising in kernel	Sergey Sharybin

2017-06-10	Cycles: Faster split branched path tracing by sharing samples with inactive ↵	Mai Lavelle
	threads Unlike regular path tracing, branched path tracing is usually used with lower sample counts, at least for primary rays. This means that are less samples for the GPU to work on in parallel and rendering is slower. As there is less work overall there is also more inactive threads during rendering with BPT. This patch makes use of those inactive rays to render branched samples in parallel with other samples. Each thread that is preparing for a branched sample will attempt to find an inactive thread and if one is found the state for the sample is copied to that thread. Potentially, if there are enough inactive threads, 100s of branched samples could be generated from the same originating thread and ran in parallel giving large speed ups. Gives 70% faster render for pavillion midday scene. 20-60% faster on BMW with car paint replaced with SSS/volumes.
2017-06-10	Cycles: Add kernel to enqueue inactive rays	Mai Lavelle
	The queue will be used to make reuse of inactive threads to keep the GPU more busy.
2017-05-09	Cycles: Enable BPT for NVidia OpenCL	Sergey Sharybin

2017-05-07	Cycles: Implement denoising option for reducing noise in the rendered image	Lukas Stockner
	This commit contains the first part of the new Cycles denoising option, which filters the resulting image using information gathered during rendering to get rid of noise while preserving visual features as well as possible. To use the option, enable it in the render layer options. The default settings fit a wide range of scenes, but the user can tweak individual settings to control the tradeoff between a noise-free image, image details, and calculation time. Note that the denoiser may still change in the future and that some features are not implemented yet. The most important missing feature is animation denoising, which uses information from multiple frames at once to produce a flicker-free and smoother result. These features will be added in the future. Finally, thanks to all the people who supported this project: - Google (through the GSoC) and Theory Studios for sponsoring the development - The authors of the papers I used for implementing the denoiser (more details on them will be included in the technical docs) - The other Cycles devs for feedback on the code, especially Sergey for mentoring the GSoC project and Brecht for the code review! - And of course the users who helped with testing, reported bugs and things that could and/or should work better!
2017-05-03	Cycles: Split kernel - sort shaders	Hristo Gueorguiev
	Reduce thread divergence in kernel_shader_eval. Rays are sorted in blocks of 2048 according to shader->id. On R9 290 Classroom is ~30% faster, and Pabellon Barcelone is ~8% faster. No sorting for CUDA split kernel. Reviewers: sergey, maiself Reviewed By: maiself Differential Revision: https://developer.blender.org/D2598
2017-05-02	Cycles: Branched path tracing for the split kernel	Mai Lavelle
	This implements branched path tracing for the split kernel. General approach is to store the ray state at a branch point, trace the branched ray as normal, then restore the state as necessary before iterating to the next part of the path. A state machine is used to advance the indirect loop state, which avoids the need to add any new kernels. Each iteration the state machine recreates as much state as possible from the stored ray to keep overall storage down. Its kind of hard to keep all the different integration loops in sync, so this needs lots of testing to make sure everything is working correctly. We should probably start trying to deduplicate the integration loops more now. Nonbranched BMW is ~2% slower, while classroom is ~2% faster, other scenes could use more testing still. Reviewers: sergey, nirved Reviewed By: nirved Subscribers: Blendify, bliblubli Differential Revision: https://developer.blender.org/D2611
2017-04-26	Cycles: Enable Correlated Multi Jitter for OpenCL and split kernel	Mai Lavelle
	Testing showed no issues so there's no reason to not have this.
2017-04-21	Cycles: Solve speed regression of classroom scene after principled commit	Sergey Sharybin
	This way we can skip it from compiling into OpenCL kernels by making this shader compile-time feature.
2017-04-21	Cycles: Cleanup, indentation in preprocessor	Sergey Sharybin

2017-04-07	Cycles: Change work pool and global size of split CPU for easier debugging	Mai Lavelle

2017-03-29	Cycles: Make all #include statements relative to cycles source directory	Sergey Sharybin
	The idea is to make include statements more explicit and obvious where the file is coming from, additionally reducing chance of wrong header being picked up. For example, it was not obvious whether bvh.h was refferring to builder or traversal, whenter node.h is a generic graph node or a shader node and cases like that. Surely this might look obvious for the active developers, but after some time of not touching the code it becomes less obvious where file is coming from. This was briefly mentioned in T50824 and seems @brecht is fine with such explicitness, but need to agree with all active developers before committing this. Please note that this patch is lacking changes related on GPU/OpenCL support. This will be solved if/when we all agree this is a good idea to move forward. Reviewers: brecht, lukasstockner97, maiself, nirved, dingto, juicyfruit, swerner Reviewed By: lukasstockner97, maiself, nirved, dingto Subscribers: brecht Differential Revision: https://developer.blender.org/D2586
2017-03-27	Cycles: Make shadow catcher an optional feature for OpenCL	Sergey Sharybin
	Solves majority of speed regression on AMD OpenCL.
2017-03-27	Cycles: Add OpenCL support for shadow catcher feature	Hristo Gueorguiev
	The title says it all actually.
2017-03-27	Cycles: First implementation of shadow catcher	Sergey Sharybin
	It uses an idea of accumulating all possible light reachable across the light path (without taking shadow blocked into account) and accumulating total shaded light across the path. Dividing second figure by first one seems to be giving good estimate of the shadow. In fact, to my knowledge, it's something really similar to what is happening in the denoising branch, so we are aligned here which is good. The workflow is following: - Create an object which matches real-life object on which shadow is to be catched. - Create approximate similar material on that object. This is needed to make indirect light properly affecting CG objects in the scene. - Mark object as Shadow Catcher in the Object properties. Ideally, after doing that it will be possible to render the image and simply alpha-over it on top of real footage.
2017-03-17	Cycles: Fix handling of barriers	Mai Lavelle

2017-03-14	Cycles: Try to avoid infinite loops by catching invalid ray states	Mai Lavelle

2017-03-10	Cycles: Enable SSS and volumes for CUDA and Nvidia OpenCL split kernel	Mai Lavelle

2017-03-09	Cycles: Speedup transparent shadows in split kernel	Hristo Gueorguiev
	This commit enables record-all transparent shadows rays. Perfromance results: R9 290 render time (without synchronization), seconds Before After Change BMW 261.5 262.5 +0.4 % Classroom 869.6 867.3 -0.3 % Fishy Cat 657.4 639.8 -2.7 % Koro 1909.8 692.8 -63.7 % Pabellon Barcelona 1633.3 1238.0 -24.2 % Pabellon Barcelona() 1158.1 903.8 -22.0 % () without glossy connected to volume
2017-03-09	Cycles: SSS and Volume rendering in split kernel	Hristo Gueorguiev
	Decoupled ray marching is not supported yet. Transparent shadows are always enabled for volume rendering. Changes in kernel/bvh and kernel/geom are from Sergey. This simiplifies code significantly, and prepares it for record-all transparent shadow function in split kernel.
2017-03-08	Cycles: CUDA implementation of split kernel	Mai Lavelle

2017-03-08	Cycles: CPU implementation of split kernel	Mai Lavelle

2017-03-08	Cycles: Remove ccl_fetch and SOA	Mai Lavelle

2017-03-08	Cycles: OpenCL split kernel refactor	Mai Lavelle
	This does a few things at once: - Refactors host side split kernel logic into a new device agnostic class `DeviceSplitKernel`. - Removes tile splitting, a new work pool implementation takes its place and allows as many threads as will fit in memory regardless of tile size, which can give performance gains. - Refactors split state buffers into one buffer, as well as reduces the number of arguments passed to kernels. Means there's less code to deal with overall. - Moves kernel logic out of OpenCL kernel files so they can later be used by other device types. - Replaced OpenCL specific APIs with new generic versions - Tiles can now be seen updating during rendering
2017-02-15	Cycles: Pass special flag whether BVH motion steps are used	Sergey Sharybin
	Doesn't currently change anything, but would need for some future work here. It uses existing padding in kernel BVH structure, so there is nothing changed memory-wise.
2017-02-08	Cycles: Speedup transparent shadows on CUDA	Sergey Sharybin
	This commit enables record-all behavior of transparent shadows rays. Render times difference goes as following: GTX 1080 render time BMW -0.5% Fishy Cat -0.0% Pabellon Barcelona -11.6% Classroom +1.2% Koro -58.6% Kernel will now use some extra VRAM memory to store the intersection array (200MB on my configuration). This we can optimize out with some further commits.
2017-02-01	Cycles: Fix rng_state initialization when using resumable rendering	Lukas Stockner

2017-01-27	Cycles: Add option to replace GI with AO approximation after certain amount ↵	Sergey Sharybin
	of bounces This is a speed up option which is mainly useful for viewport. Gives nice speedup in the barbershop scene of 2x when replacing GI with AO after 2nd bounce without loosing too much details. Reviewers: brecht Subscribers: eyecandy, venomgfx Differential Revision: https://developer.blender.org/D2383