git.blender.org/blender.git - Unnamed repository; edit this file 'description' to name the repository.

Age	Commit message (Collapse)	Author
2017-08-10	Cycles: Fix instanced shadow catcher objects influencing each other	Sergey Sharybin

2017-08-10	Cycles: De-duplicate trace-time object visibility calculation	Sergey Sharybin
	We already have enough files to worry about in BVH builders. no need to add yet another copy-paste code which is tempting to be running out of sync.
2017-08-09	Cycles: Remove ulong usage	Sergey Sharybin
	This is a bit confusing, especially when one mixes OpenCL code where ulong equals to uint64_t with CPU side code where ulong is expected to be something else from the naming. This commit makes it so we use explicit name, common on all platforms.
2017-08-09	Cycles: Proper fix for recent OpenCL image crash	Mai Lavelle
	Problem was that some code checks to see if device_pointer is null or not and the new allocator wasn't even setting the pointer to anything as it tracks memory location separately. Setting the pointer to non null keeps all users of device_pointer happy.
2017-08-09	Revert "Cycles: Fix crash changing image after recent OpenCL changes"	Mai Lavelle
	This reverts commit f2809ae0a671057caa1005e2b9cc91648c33dd1f.
2017-08-08	Cycles: More fixes for Windows 32 bit	Sergey Sharybin
	- Apparently MSVC does not support compound literals in C++ (at least by the looks of it). - Not sure how opencl_device_assert was managing to set protected property of the Device class.
2017-08-08	Cycles: Fix compilation error of filter kernels on 32 bit Windows	Sergey Sharybin
	We don't enable global SSE optimizations in regular kernel, and we keep those disabled on Linux 32bit. One possible workaround would be to pass arguments by ccl_ref, but that is quite a few of code which better be done accurately.
2017-08-08	Cycles: Fix crash changing image after recent OpenCL changes	Sergey Sharybin
	Steps to reproduce: - Create shader Image texture -> Diffuse BSDF -> Output. Do NOT select image yet! - Start viewport render. - Select image from the ID browser of Image Texture node. Thing is: with the memory manager we always need to inform device that memory was freed.
2017-08-08	Cycles: Fix compilation error without C++11	Sergey Sharybin
	Common folks, nobody considered master a C++11 only branch. Such decision is to be done officially and will involve changes in quite a few infrastructure related areas.
2017-08-08	Cycles: Cleanup, de-duplicate function parameter list	Sergey Sharybin
	Was only needed to sue const reference on CPU. Now it is done using ccl_ref.
2017-08-08	Cycles: Add utility macro ccl_ref	Sergey Sharybin
	It is defined to & for CPU side compilation, and defined to an empty for any GPU platform. The idea here is to use this macro instead of #ifdef block with bunch of duplicated lines just to make it so CPU code is efficient. Eventually we might switch to references on CUDA as well, but that would require some intensive testing.
2017-08-08	Cycles: Pack kernel textures into buffers for OpenCL	Mai Lavelle
	Image textures were being packed into a single buffer for OpenCL, which limited the amount of memory available for images to the size of one buffer (usually 4gb on AMD hardware). By packing textures into multiple buffers that limit is removed, while simultaneously reducing the number of buffers that need to be passed to each kernel. Benchmarks were within 2%. Fixes T51554. Differential Revision: https://developer.blender.org/D2745
2017-08-07	Cycles: Cleanup, move curve intersection functions to own file	Sergey Sharybin
	This way curve file becomes much shorter and it's also easier to write a benchmark application to check performance before/after future changes.
2017-08-07	Cycles: Cleanup, trailign whitespace	Sergey Sharybin

2017-08-07	Cycles: Cleanup, remove bvh prefix from curve functions	Sergey Sharybin
	Those are nothing to do with BVH, and can be used separately.
2017-08-07	Cycles: Fix compilation error on NVidia OpenCL after recent refactor	Sergey Sharybin
	Still need to verify this is proper thing to do for AMD OpenCL. At least now i can compile OpenCL kernel on my laptop with sm21 card.
2017-08-07	Fix Cycles shadow catcher objects influencing each other.	Brecht Van Lommel
	Since all the shadow catchers are already assumed to be in the footage, the shadows they cast on each other are already in the footage too. So don't just let shadow catchers skip self, but all shadow catchers. Another justification is that it should not matter if the shadow catcher is modeled as one object or multiple separate objects, the resulting render should be the same. Differential Revision: https://developer.blender.org/D2763
2017-08-07	Fix Windows build errors with recent Cycles SIMD refactoring.	Brecht Van Lommel

2017-08-07	Cycles: Cleanup, space after keyword	Sergey Sharybin

2017-08-07	Code refactor: use float4 instead of intrinsics for CPU denoise filtering.	Brecht Van Lommel
	Differential Revision: https://developer.blender.org/D2764
2017-08-07	Code refactor: add, remove, optimize various SSE functions.	Brecht Van Lommel
	* Remove some unnecessary SSE emulation defines. * Use full precision float division so we can enable it. * Add sqrt(), sqr(), fabs(), shuffle variations, mask(). * Optimize reduce_add(), select(). Differential Revision: https://developer.blender.org/D2764
2017-08-07	Code refactor: split defines into separate header, changes to SSE type headers.	Brecht Van Lommel
	I need to use some macros defined in util_simd.h for float3/float4, to emulate SSE4 instructions on SSE2. But due to issues with order of header includes this was not possible, this does some refactoring to make it work. Differential Revision: https://developer.blender.org/D2764
2017-08-07	Cycles: remove option to disable transparent shadows globally.	Brecht Van Lommel
	We already detect this automatically based on shading nodes and per shader settings, and performance of this option is ok now all devices. Differential Revision: https://developer.blender.org/D2767
2017-08-07	Fix Cycles CUDA adaptive megakernel build error.	Brecht Van Lommel

2017-08-05	Cycles: CUDA split performance tweaks, still far from megakernel.	Brecht Van Lommel
	On Pabellon, 25.8s mega, 35.4s split before, 32.7s split after.
2017-08-05	Cycles: remove min bounces, modify RR to terminate less.	Brecht Van Lommel
	Differential Revision: https://developer.blender.org/D2766
2017-08-03	Cycles: Extra tweaks to performance of header expansion	Sergey Sharybin
	Two main things here: 1. Replace all unsafe for #line directive characters into a single loop, avoiding multiple iterations and multiple temporary strings created. 2. Don't merge token char by char but calculate start and end point and then copy all substring at once. This gives about 15% speedup of source processing time. At this point (with all previous commits from today) we've shrinked down compiled sources size from 108 MB down to ~5.5 MB and lowered processing time from 4.5 sec down to 0.047 sec on my laptop running Linux (this was a constant time which Blender will always spent first time loading kernel, even if we've got compiled clbin).
2017-08-02	Cycles: Avoid some expensive operations in header expansions	Sergey Sharybin
	Basically gather lines as-is during traversal, avoiding allocating memory for all the lines in headers. Brings additional performance improvement abut 20%.
2017-08-02	Cycles: Support "precompiled" headers in include expansion algorithm	Sergey Sharybin
	The idea here is that it is possible to mark certain include statements as "precompiled" which means all subsequent includes of that file will be replaced with an empty string. This is a way to deal with tricky include pattern happening in single program OpenCL split kernel which was including bunch of headers about 10 times. This brings preprocessing time from ~1sec to ~0.1sec on my laptop.
2017-08-02	Cycles: Speed up #include expansion algorithm	Sergey Sharybin
	The idea is to re-use files which were already processed. Gives about 4x speedup of processing time (~4.5sec vs 1.0sec) on my laptop for the whole OpenCL kernel. For users it will mean lower delay before OpenCL rendering might start.
2017-07-25	Fix T51450: viewport render time keeps increasing after render is done.	Jeff Knox
	Reviewed By: brecht Differential Revision: https://developer.blender.org/D2747
2017-07-24	Fix Cycles multi scatter GGX different render results with Clang and GCC.	Brecht Van Lommel
	The order of evaluation of function arguments is undefined, and the order was reversed between these compilers. This was causing regressions tests to give different results between Linux and macOS.
2017-07-23	Fix T52152: allow zero roughness for Cycles principled BSDF, don't clamp.	Brecht Van Lommel

2017-07-23	Fix Cycles denoising NaNs with a 1 sample renders.	Brecht Van Lommel
	This was causing different render results with different compilers. We can't do much useful with 1 sample, but better for debugging.
2017-07-22	Fix use of uninitialized value in Cycles, probably did not cause a bug.	Brecht Van Lommel

2017-07-21	Fix a few harmless maybe uninitialized warnings with GCC 5.4.	Brecht Van Lommel
	GCC seems to detect uninitialized into function calls now, but then isn't always smart enough to see that it is actually initialized. Disabling this warning entirely seems a bit too much, so initialize a bit more now.
2017-07-21	Fix T52135: Cycles should not keep generated/packed images in memory after ↵	Brecht Van Lommel
	render.
2017-07-21	Fix potential memory leak in Cycles loading of packed/generated images.	Brecht Van Lommel

2017-07-20	Fix T52125: principled BSDF missing with macOS OpenCL.	Brecht Van Lommel

2017-07-20	Cycles: fixed the SM_2x CUDA kernel build that I broke in my previous commit	Stefan Werner

2017-07-20	Fix T52107: Color management difference when using multiple and different ↵	Stefan Werner
	GPUs together This commit unifies the flattened texture slot names for bindless and regular CUDA textures. Texture indices are now identical across all CUDA architectures, where before Fermi used different indices, which lead to problems when rendering on multi-GPU setups mixing Fermi with newer hardware.
2017-07-18	Fix T52021: Shadow catcher renders wrong when catcher object is behind ↵	Sergey Sharybin
	transparent object Tweaked the path radiance summing and alpha to accommodate for possible contribution of light by transparent surface bounces happening prior to shadow catcher intersection. This commit will change the way how shadow catcher results looks when was behind semi transparent object, but the old result seemed to be fully wrong: there were big artifacts when alpha-overing the result on some actual footage.
2017-07-18	Cycles: Remove meaningless camera ray check	Sergey Sharybin
	In branched path tracing main loop is always a camera ray, with varying number of transparent bounces.
2017-07-12	Cycles: Enable SSS from Principled BSDF only when actually in use	Mai Lavelle
	This gives speed up for the split kernel in scenes using the principled BSDF but without subsurface scattering.
2017-07-12	Cycles: Fix comparison in principled BSDF	Mai Lavelle
	Could have lead to black pixels.
2017-07-11	Fix T52027: OSL getattribute() crash, when optimizer calls it before rendering.	Brecht Van Lommel

2017-07-11	Cycles: Disable OpenCL clFlush workarounds	Sergey Sharybin
	This is something which was reported to work fine by Mai, Benjamin and confirmed by myself. Disabling this workaround gains us some speedup: Before Now bmw27 04:28.42 04:07.79 classroom 09:26.48 08:54.53 fishy_cat 08:44.01 08:18.70 koro 09:17.98 08:57.18 pavillon_barcelone 12:26.64 11:52.81 Test environment is: - Ubuntu 16.04, with all updates installed - AMD RX 480 GPU - amdgpu pro driver version 17.10-450821
2017-07-08	Fix T51967: OSL crash after rendering finished (mainly on Windows).	Brecht Van Lommel

2017-07-07	Cycles: Fix ambiguity in call of min() function	Sergey Sharybin

2017-07-06	Cycles: Add artificial memory limit debug option for OpenCL	Mai Lavelle