git.blender.org/blender.git - Unnamed repository; edit this file 'description' to name the repository.

Age	Commit message (Collapse)	Author
2017-03-09	Cycles: SSS and Volume rendering in split kernel	Hristo Gueorguiev
	Decoupled ray marching is not supported yet. Transparent shadows are always enabled for volume rendering. Changes in kernel/bvh and kernel/geom are from Sergey. This simiplifies code significantly, and prepares it for record-all transparent shadow function in split kernel.
2017-03-09	Remove (ifdef) draw_documentation from text_draw.c	Dalai Felinto
	This was no longer supported.
2017-03-09	3D View: wrap GPU_select cache calls	Campbell Barton
	Avoids including GPU_select and makes it more clear that the cache is needed for view3d_opengl_select calls. Also use typed enum for select mode.
2017-03-09	3D View: use cache for armature select	Campbell Barton

2017-03-09	Cycles: Fix CUDA build error for some compilers	Mai Lavelle
	Needed to include `util_types.h` before using `uint`.
2017-03-08	3D View: new nethod of opengl selection	Campbell Barton
	Intended to replace legacy GL_SELECT, without the limitations of sample queries which can't access depth information. This commit adds VIEW3D_SELECT_PICK_NEAREST and VIEW3D_SELECT_PICK_ALL which access the depth buffers to detect whats under the pointer, so initial selection is always the closest item. The performance of this method depends a lot on the OpenGL implementations glReadPixels. Since reading depth can be slow, buffers are cached for object picking so selecting re-uses depth data, performing 1 draw instead of 3 (for 24, 18, 10 px regions, picking with many items under the pointer). Occlusion queries draw twice when picking nearest, so worst case 6x draw calls per selection. Even with these improvements occlusion queries is faster on AMD hardware. Depth selection is disabled by default, toggle option under select method. May enable by default if this works well on different hardware. Reviewed as D2543
2017-03-08	Fix T50849: Transparent background produces artifacts in this compositing setup	Sergey Sharybin
	The issue was caused by sometimes negative color returned by the filter node. Seems to be caused by precision issues. Don't see any reason why we would want negative colors in output. Those only causing issues later on.
2017-03-08	Cycles: Make it more obvious message which initialization failed	Sergey Sharybin

2017-03-08	Fix T49603: Blender/Cycles 2.78 CUDA error on Jetson-TX1~	Sergey Sharybin
	Patch by Bruno d'Arcangeli (@arcangeli), thanks!
2017-03-08	OpenGL Select: integer rect for passing region	Campbell Barton

2017-03-08	Cleanup: replace short -> int for selection hits	Campbell Barton

2017-03-08	Rename BLI_rct*_init_pt_size -> radius	Campbell Barton

2017-03-08	Cycles: Use 1-based line number for #line directives	Sergey Sharybin
	AMD CPU platform was complaining about #line 0 directives in the code.
2017-03-08	Cycles: Log which device kernels are being loaded for	Sergey Sharybin

2017-03-08	Cycles: Make it possible to access KernelGlobals from split data ↵	Sergey Sharybin
	initialization function
2017-03-08	Cycles: Cleanup, remove residue of previous split kernel data	Sergey Sharybin
	This is all in split data state array.
2017-03-08	Fix T50886: Blender crashes on render	Sergey Sharybin
	Was a mistake in one of the previous TLS commits. See comment in the pool_create to see some details why it was crashing.
2017-03-08	update theme back to black re: T50869	meta-androcto

2017-03-08	Cycles: Fix indentation	Mai Lavelle

2017-03-08	Cycles: Fix strict warning about unused variable	Mai Lavelle

2017-03-08	Cycles: Calculate size of split state buffer kernel side	Mai Lavelle
	By calculating the size of the state buffer in the kernel rather than the host less code is needed and the size actually reflects the requested features. Will also be a little faster in some cases because of larger global work size.
2017-03-08	Cycles: Fix crash after failed kernel build	Mai Lavelle
	Pointers to kernels were uninitialized leading to freeing of random memory addresses. Another reason it would be good to use smart pointers.
2017-03-08	Cycles: Faster building of split kernel	Mai Lavelle
	Simple change to make it so that only kernels that have been modified are rebuilt. Might only be useful during development.
2017-03-08	Cycles: Initialize rng_state for split kernel	Mai Lavelle
	Because the split kernel can render multiple samples in parallel it is necessary to have everything initialized before rendering of any samples begins. The code that normally handles initialization of `rng_state` (`kernel_path_trace_setup()`) only does so for the first sample, which was causing artifacts in the split kernel due to uninitialized `rng_state` for some samples. Note that because the split kernel can render samples in parallel this means that the split kernel is incompatible with the LCG.
2017-03-08	Cycles: Remove sum_all_radiance kernel	Mai Lavelle
	This was only needed for the previous implementation of parallel samples. As we don't have that any more it can be removed. Real reason for removal tho is this: `per_sample_output_buffers` was being calculated too small and artifacts resulted. The tile buffer is already the correct size and calculating the size for `per_sample_output_buffers` is a bit difficult with the current layout of the code. As `per_sample_output_buffers` was only needed for `sum_all_radiance`, removing that kernel and writing output to the tile buffer directly fixes the artifacts.
2017-03-08	Cycles: Split path initialization into own kernel	Mai Lavelle
	This makes it easier to initialize things correctly in the data_init kernel before they are needed by path tracing.
2017-03-08	Cycles: Seperate kernel loading time from render time	Mai Lavelle

2017-03-08	Cycles: Add names to buffer allocations	Mai Lavelle
	This is to help debug and track memory usage for generic buffers. We have similar for textures already since those require a name, but for buffers the name is only for debugging proposes.
2017-03-08	Cycles: CUDA implementation of split kernel	Mai Lavelle

2017-03-08	Cycles: CPU implementation of split kernel	Mai Lavelle

2017-03-08	Cycles: Remove ccl_fetch and SOA	Mai Lavelle

2017-03-08	Cycles: Report device maximum allocation and detected global size	Sergey Sharybin

2017-03-08	Cycles: Workaround for driver hangs	Mai Lavelle
	Simple workaround for some issues we've been having with AMD drivers hanging and rendering systems unresponsive. Unfortunately this makes things a bit slower, but its better than having to do hard reboots. Will be removed when drivers have been fixed. Define CYCLES_DISABLE_DRIVER_WORKAROUNDS to disable for testing purposes.
2017-03-08	Cycles: OpenCL split kernel refactor	Mai Lavelle
	This does a few things at once: - Refactors host side split kernel logic into a new device agnostic class `DeviceSplitKernel`. - Removes tile splitting, a new work pool implementation takes its place and allows as many threads as will fit in memory regardless of tile size, which can give performance gains. - Refactors split state buffers into one buffer, as well as reduces the number of arguments passed to kernels. Means there's less code to deal with overall. - Moves kernel logic out of OpenCL kernel files so they can later be used by other device types. - Replaced OpenCL specific APIs with new generic versions - Tiles can now be seen updating during rendering
2017-03-08	Cycles: Add OpenCL kernel for zeroing memory buffers	Mai Lavelle
	Transferring memory to the device was very slow and there's really no need when only zeroing a buffer.
2017-03-08	Cycles: Add more atomic operations	Mai Lavelle

2017-03-08	Cycles: Expose passes size to device tasks	Mai Lavelle
	This is needed so devices can know the size of a tile buffer before any tiles are acquired.
2017-03-08	Cycles: Allow device_memory to be used directly	Mai Lavelle
	This is useful for when theres no host side memory attched to the buffer
2017-03-07	Task scheduler: Add concept of suspended pools	Sergey Sharybin
	Suspended pools allows to push huge amount of initial tasks without any threading synchronization and hence overhead. This gives ~50% speedup of cached rigid body with file from T50027 and seems to have no negative affect in other scenes here.
2017-03-07	Depsgraph: Remove workarounds from depsgraph for keeping threads alive	Sergey Sharybin
	This is something what should be done in the task scheduler instead with local thread queues so we handle this in a single place.
2017-03-07	Task scheduler: Initial implementation of local tasks queues	Sergey Sharybin
	The idea is to allow some amount of tasks to be pushed from working thread to it's local queue, so we can acquire some work without doing whole mutex lock. This should allow us to remove some hacks from depsgraph which was added there to keep threads alive.
2017-03-07	Task scheduler: Use real pthread's TLS to access active thread's data	Sergey Sharybin
	This allows us to avoid TLS stored in pool which gives us advantage of using pre-allocated tasks pool for the pools created from non-main thread. Even on systems with slow pthread TLS it should not be a problem because we access it once at a pool construction time. If we want to use this more often (for example, to get rid of push_from_thread) we'll have to do much more accurate benchmark.
2017-03-07	Task scheduler: Refactor the way we store thread-spedific data	Sergey Sharybin
	Basically move all thread-specific data (currently it's only task memory pool) from a dedicated array of taskScheduler to TaskThread. This way we can add more thread-specific data in the future with less of a hassle.
2017-03-07	Task scheduler: Remove per-pool threads limit	Sergey Sharybin
	This feature was adding extra complexity to task scheduling which required yet extra variables to be worried about to be modified in atomic manner, which resulted in following issues: - More complex code to maintain, which increases risks of something going wrong when we modify the code. - Extra barriers and/or locks during task scheduling, which causes extra threading overhead. - Unable to use some other implementation (such as TBB) even for the comparison tests. Notes about other changes. There are two places where we really had to use that limit. One of them is the single threaded dependency graph. This will now construct a single-threaded scheduler at evaluation time. This shouldn't be a problem because it only happens when using debugging command line arguments and the code simply don't run in regular Blender operation. The code seems a bit duplicated here across old and new depsgraph, but think it's OK since the old depsgraph is already gone in 2.8 branch and i don't see where else we might want to use such a single-threaded scheduler. When/if we'll want to do so, we can move it to a centralized single-threaded scheduler in threads.c. OpenGL render was a bit more tricky to port, but basically we are using conditional variables to wait background thread to do all the job.
2017-03-07	Fix typo in command line arg list	Aaron Carlisle

2017-03-07	Update keymap presets for recent transform manipulator changes	Julian Eisel
	Part of T50565.
2017-03-07	Once more T50565: Allow using planar constraints for scale manipulator	Julian Eisel

2017-03-06	Rigid body: fix viewport not updating on properties change.	Clément Foucault

2017-03-06	Fix width calculation for split layouts	raa

2017-03-06	Cycles: Fix strict -Wpedantic warnings with GCC	Sergey Sharybin
	Patch by Stefan Werner, thanks!