git.blender.org/blender.git - Unnamed repository; edit this file 'description' to name the repository.

Age	Commit message (Collapse)	Author
2017-03-08	Cycles: Remove ccl_fetch and SOA	Mai Lavelle

2017-02-15	Cycles: Fix CUDA compilation error after recent changes	Sergey Sharybin

2016-11-03	Cycles: Fix T49901: OpenCL build error after recent light texture coordinate ↵	Lukas Stockner
	commit Basically, the problem here was that the transform that's used to bring texture coordinates to world space is either fetched while setting up the shader (with Object Motion is enabled) or fetched when needed (otherwise). That helps to save ShaderData memory on OpenCL when Object Motion isn't needed. Now, if OM is enabled, the Lamp transform can just be stored inside the ShaderData as well. The original commit just assumed it is. However, when it's not (on OpenCL by default, for example), there is no easy way to fetch it when needed, since the ShaderData doesn't store the Lamp index. So, for now the lamps just don't support local texture coordinates anymore when Object Motion is disabled. To fix and support this properly, one of the following could be done: - Just always pre-fetch the transform. Downside: Memory Usage increases when not using OM on OpenCL - Add a variable to ShaderData that stores the Lamp ID to allow fetching it when needed - Store the Lamp ID inside prim or object. Problem: Cycles currently checks these for whether an object was hit - these checks would need to be changed. - Enable OM whenever a Texture Coordinate's Normal output is used. Downside: Might not actually be needed.
2016-10-23	Cycles: OpenCL 3d textures support.	Hristo Gueorguiev
	Note that volume rendering is not supported yet, this is a step towards that. Reviewed By: brecht Differential Revision: https://developer.blender.org/D2299
2016-09-04	Fix a few OpenCL compiler warnings.	Brecht Van Lommel

2016-08-14	Cycles: Add single channel texture support for OpenCL.	Thomas Dinges
	This way OpenCL devices can also benefit from a smaller memory footprint, when using e.g. bumpmaps (greyscale, 1 channel). Additional target for my GSoC 2016.
2016-08-11	Cycles: Enable half float support (4 channels and 1 channel) on CUDA.	Thomas Dinges
	Atm OpenEXR half files benefit from this and will use only 1/2 of the memory now. More space for HDRs! Part of my GSoC 2016.
2016-08-11	Cycles: Change code order for Image Data Types.	Thomas Dinges
	Now we have the 4 component ones first (float4, byte4, half4) followed by the 1 component ones (float, byte, half). Makes code a bit more consistent and also reduces code a bit when enabling half support on GPU in next commit. This also exposed a typo in half CPU images for 3D textures, which wasn't used yet, but good to have that one fixed anyway.
2016-07-29	Cycles microdisplacement: ngons and attributes for subdivision meshes	Mai Lavelle
	This adds support for ngons and attributes on subdivision meshes. Ngons are needed for proper attribute interpolation as well as correct Catmull-Clark subdivision. Several changes are made to achieve this: - new primitive `SubdFace` added to `Mesh` - 3 more textures are used to store info on patches from subd meshes - Blender export uses loop interface instead of tessface for subd meshes - `Attribute` class is updated with a simplified way to pass primitive counts around and to support ngons. - extra points for ngons are generated for O(1) attribute interpolation - curves are temporally disabled on subd meshes to avoid various bugs with implementation - old unneeded code is removed from `subd/` - various fixes and improvements Reviewed By: brecht Differential Revision: https://developer.blender.org/D2108
2016-07-11	Cycles: Fix Extend image extension mode on OpenCL	Sergey Sharybin

2016-07-02	Fix Cycles OpenCL not taking Extend and Clip extension types into account.	Thomas Dinges
	(See T48720).
2016-06-21	Fix T48691: Cycles - OpenCL - HDR Image mapping does not match CUDA rendering	Lukas Stockner
	The OpenCL texture code didn't offset the coordinates by half a pixel like the CPU code does.
2016-05-27	Cleanup: Shorten texture variables, tex and image was kinda redundant.	Thomas Dinges
	Also make prefix consistent, so it starts with either TEX_NUM or TEX_START, followed by texture type and architecture.
2016-05-19	Cycles: Add support for bindless textures.	Thomas Dinges
	This adds support for CUDA Texture objects (also known as Bindless textures) for Kepler GPUs (Geforce 6xx and above). This is used for all 2D/3D textures, data still uses arrays as before. User benefits: * No more limits of image textures on Kepler. We had 5 float4 and 145 byte4 slots there before, now we have 1024 float4 and 1024 byte4. This can be extended further if we need to (just change the define). * Single channel textures slots (byte and float) are now supported on Kepler as well (1024 slots for each type). ToDo / Issues: * 3D textures don't work yet, at least don't show up during render. I have no idea whats wrong yet. * Dynamically allocate bindless_mapping array? I hope Fermi still works fine, but that should be tested on a Fermi card before pushing to master. Part of my GSoC 2016. Reviewers: sergey, #cycles, brecht Subscribers: swerner, jtheninja, brecht, sergey Differential Revision: https://developer.blender.org/D1999
2016-05-10	Cycles: Add support for float4 textures on OpenCL.	Thomas Dinges
	Title says it all, this adds OpenCL float4 texture support. There is a bug in the code still, I get a "Out of ressources error" on nvidia hardware here, not sure whats wrong yet. Will investigate further, but maybe someone else has an idea. :) Reviewers: #cycles, brecht Subscribers: brecht, candreacchio Differential Revision: https://developer.blender.org/D1983
2016-05-09	Cleanup: More byte -> byte4 renaming for consistency.	Thomas Dinges

2016-05-06	Cleanup: Rename texture slots to float4 and byte, to distinguish from future ↵	Thomas Dinges
	float (single channel) and half_float slots. Should be no functional changes, tested CPU and CUDA.
2016-04-16	Cycles: Refactor Image Texture limits.	Thomas Dinges
	Instead of treating Fermi GPU limits as default, and overriding them for other devices, we now nicely set them for each platform. * Due to setting values for all platforms, we don't have to offset the slot id for OpenCL anymore, as the image manager wont add float images for OpenCL now. * Bugfix: TEX_NUM_FLOAT_IMAGES was always 5, even for CPU, so the code in svm_image.h clamped float textures with alpha on CPU after the 5th slot. Reviewers: #cycles, brecht Reviewed By: #cycles, brecht Subscribers: brecht Differential Revision: https://developer.blender.org/D1925
2016-03-25	Cycles: Cleanup, indent nested preprocessor directives	Sergey Sharybin
	Quite straightforward, main trick is happening in path_source_replace_includes(). Reviewers: brecht, dingto, lukasstockner97, juicyfruit Differential Revision: https://developer.blender.org/D1794
2016-02-15	Cycles: Initial support of 3D textures for CUDA rendering	Sergey Sharybin
	Supports both smoke/fire and point density textures now. Reduces number of textures available for sm_20 and sm_21, but you have to compromise somewhere on such a limited hardware. Currently limited to linear interpolation only, and decoupled ray marching is not supported yet. Think those could be considered just a further improvement. Some quick example: https://developer.blender.org/F282934 Code is minimal and we can fully consider it a fix for missing support of 3D textures with CUDA. Reviewers: lukasstockner97, brecht, juicyfruit, dingto Reviewed By: brecht, juicyfruit, dingto Subscribers: mib2berlin Differential Revision: https://developer.blender.org/D1806
2015-05-09	Cycles: OpenCL kernel split	George Kyriazis
	This commit contains all the work related on the AMD megakernel split work which was mainly done by Varun Sundar, George Kyriazis and Lenny Wang, plus some help from Sergey Sharybin, Martijn Berger, Thomas Dinges and likely someone else which we're forgetting to mention. Currently only AMD cards are enabled for the new split kernel, but it is possible to force split opencl kernel to be used by setting the following environment variable: CYCLES_OPENCL_SPLIT_KERNEL_TEST=1. Not all the features are supported yet, and that being said no motion blur, camera blur, SSS and volumetrics for now. Also transparent shadows are disabled on AMD device because of some compiler bug. This kernel is also only implements regular path tracing and supporting branched one will take a bit. Branched path tracing is exposed to the interface still, which is a bit misleading and will be hidden there soon. More feature will be enabled once they're ported to the split kernel and tested. Neither regular CPU nor CUDA has any difference, they're generating the same exact code, which means no regressions/improvements there. Based on the research paper: https://research.nvidia.com/sites/default/files/publications/laine2013hpg_paper.pdf Here's the documentation: https://docs.google.com/document/d/1LuXW-CV-sVJkQaEGZlMJ86jZ8FmoPfecaMdR-oiWbUY/edit Design discussion of the patch: https://developer.blender.org/T44197 Differential Revision: https://developer.blender.org/D1200
2015-04-27	Cycles: Use native saturate function for CUDA	Sergey Sharybin
	This more a workaround for CUDA optimizer which can't optimize clamp(x, 0, 1) into a single instruction and uses 4 instructions instead. Original patch by @lockal with own modification: Don't make changes outside of the kernel. They don't make any difference anyway and term saturate() has a bit different meaning outside of kernel. This gives around 2% of speedup in Barcelona file, but in more complex shader setups with lots of math nodes with clamping speedup could be much nicer. Subscribers: dingto Projects: #cycles Differential Revision: https://developer.blender.org/D1224
2015-04-20	Cycles: Split BVH nodes storage into inner and leaf nodes	Sergey Sharybin
	This way we can get rid of inefficient memory usage caused by BVH boundbox part being unused by leaf nodes but still being allocated for them. Doing such split allows to save 6 of float4 values for QBVH per leaf node and 3 of float4 values for regular BVH per leaf node. This translates into following memory save using 01.01.01.G rendered without hair: Device memory size Device memory peak Global memory peak Before the patch: 4957 5051 7668 With the patch: 4467 4562 7332 The measurements are done against current master. Still need to run speed tests and it's hard to predict if it's faster or not: on the one hand leaf nodes are now much more coherent in cache, on the other hand they're not so much coherent with regular nodes anymore. Reviewers: brecht, juicyfruit Subscribers: venomgfx, eyecandy Differential Revision: https://developer.blender.org/D1236
2015-03-27	Cycles: Code cleanup, spaces around keywords	Sergey Sharybin
	This inconsistency drove me totally crazy, it's really confusing when it's inconsistent especially when you work on both Cycles and Blender sides. Shouldn;t cause merge PITA, it's whitespace changes only, Git should be able to merge it nicely.
2015-02-19	Cycles: Make sphere and tube image mapping friendly with OpenCL	Sergey Sharybin
	OpenCL doesn't let you to get address of vector components, which is kinda annoying. On the other hand, maybe now compiler will have more chances to optimize something out.
2015-01-21	Cycles: Support tube projection for images	Sergey Sharybin
	This way Cycles finally becomes feature-full on image projections compared to Blender Internal and Gooseberry Project Team could finally finish the movie.
2015-01-21	Cycles: Support sphere mapping for the image texture	Sergey Sharybin

2014-12-25	Cleanup: Fix Cycles Apache header.	Thomas Dinges
	This was already mixed a bit, but the dot belongs there.
2014-10-07	Fix T42106: Box image mapping shows black triangles if they point to a ↵	Sergey Sharybin
	corner and blend is 0 After discussion with cambo here we decided it's better to choose arbitrary side of the box (in this case it's X-axis) and use image from it. That's better than doing a blackness. P.S. This is literally a corner case anyway.
2014-06-14	Cycles: Add support for uchar4 attributes.	Thomas Dinges
	* Added support for uchar4 attributes to Cycles' attribute system. * This is used for Vertex Colors now, which saves some memory (4 unsigned characters, instead of 4 floats). * GPU Texture Limit on sm_20 and sm_21 decreased from 95 to 94, because we need a new texture for the uchar4 attributes. This is no problem for sm_30 or newer. Part of my GSoC 2014.
2014-06-13	Cycles Refactor: Add SSE Utility code from Embree for cleaner SSE code.	Thomas Dinges
	This makes the code a bit easier to understand, and might come in handy if we want to reuse more Embree code. Differential Revision: https://developer.blender.org/D482 Code by Brecht, with fixes by Lockal, Sergey and myself.
2014-05-11	Quiet warnings with __CUDA_ARCH__ use	Campbell Barton

2014-05-11	Cycles / CUDA: Increase maximum image textures on GPU.	Thomas Dinges
	Instead of 95, we can use 145 images now. This only affects Kepler and above (sm30, sm_35 and sm_50). This can be increased further if needed, but let's first test if this does not come with a performance impact. Originally developed during my GSoC 2013.
2014-05-04	Style cleanup: indentation, braces	Campbell Barton

2014-03-29	Cycles code refactor: replace magic ~0 values in the code with defines.	Brecht Van Lommel

2014-03-13	Fix cycles texture interpolation mode closest constant offset on some devices	Martijn Berger

2014-03-08	Add support for multiple interpolation modes on cycles image textures	Martijn Berger
	All textures are sampled bi-linear currently with the exception of OSL there texture sampling is fixed and set to smart bi-cubic. This patch adds user control to this setting. Added: - bits to DNA / RNA in the form of an enum for supporting multiple interpolations types - changes to the image texture node drawing code ( add enum) - to ImageManager (this needs to know to allocate second texture when interpolation type is different) - to node compiler (pass on interpolation type) - to device tex_alloc this also needs to get the concept of multiple interpolation types - implementation for doing non interpolated lookup for cuda and cpu - implementation where we pass this along to osl ( this makes OSL also do linear untill I add smartcubic to the interface / DNA/ RNA) Reviewers: brecht, dingto Reviewed By: brecht CC: dingto, venomgfx Differential Revision: https://developer.blender.org/D317
2014-02-11	Better fix for T38501: blender crashes right after adding image texture to	Sv. Lockal
	material in cycles Buggy MSVC 2008 in 32-bit mode ignores stack align attribute for float3. Now it uses reference to __m128, which is always aligned.
2014-02-10	Fix T38501: blender crashes right after adding image texture to material	Sv. Lockal
	in cycles Also fix very similar problem in half-float SSE conversion.
2014-01-13	Fix cycles texture crash on win x86-64 + msvc 11	Sv. Lockal
	Use union for __m128 aliasing; while gcc supports no-strict-aliasing attribute, unions are the most common way to deal with __m128 in msvc.
2014-01-06	Cycles: Minor optimization (~1%) for texture access on CPU	Sv. Lockal

2014-01-06	Cycles: SSE optimization for sRGB conversion (gives 7% speedup on CPU for ↵	Sv. Lockal
	pavillon_barcelone scene) Thanks brecht/dingto/juicyfruit et al. for testing and reviewing this patch in T38034.
2013-11-18	Cycles: change __device and similar qualifiers to ccl_device in kernel code.	Brecht Van Lommel
	This to avoids build conflicts with libc++ on FreeBSD, these __ prefixed values are reserved for compilers. I apologize to anyone who has patches or branches and has to go through the pain of merging this change, it may be easiest to do these same replacements in your code and then apply/merge the patch. Ref T37477.
2013-08-18	Cycles: relicense GNU GPL source code to Apache version 2.0.	Brecht Van Lommel
	More information in this post: http://code.blender.org/ Thanks to all contributes for giving their permission!
2013-06-09	Cycles:	Thomas Dinges
	* Use float_to_int() functions in a few more places.
2013-04-17	Fix #35004: fireflies with .tif image in cycles, try to avoid extreme values ↵	Brecht Van Lommel
	when openimageio can't detect premul/straight alpha correct.
2013-04-05	Fix #34679: cycles image texture alpha fringes. New rule is now that color ↵	Brecht Van Lommel
	output will not give straight RGB values if the alpha output is used, so that mixing with a transparent BSDF gives the correct result.
2012-11-05	Cycles: fix crash rendering textured objects in OpenCL	Sergey Sharybin
	Issue was caused by changed order of texture slots -- float textures have got lower slots indices than byte textures. OpenCL was still assuming byte textures goes before float.
2012-10-20	Cycles OSL: light path, texture coordinate, bump and blended box mapping now up	Brecht Van Lommel
	to date and working.
2012-09-04	Cycles: merge of changes from tomato branch.	Brecht Van Lommel
	Regular rendering now works tiled, and supports save buffers to save memory during render and cache render results. Brick texture node by Thomas. http://wiki.blender.org/index.php/Doc:2.6/Manual/Render/Cycles/Nodes/Textures#Brick_Texture Image texture Blended Box Mapping. http://wiki.blender.org/index.php/Doc:2.6/Manual/Render/Cycles/Nodes/Textures#Image_Texture http://mango.blender.org/production/blended_box/ Various bug fixes by Sergey and Campbell. * Fix for reading freed memory in some node setups. * Fix incorrect memory read when synchronizing mesh motion. * Fix crash appearing when direct light usage is different on different layers. * Fix for vector pass gives wrong result in some circumstances. * Fix for wrong resolution used for rendering Render Layer node. * Option to cancel rendering when doing initial synchronization. * No more texture limit when using CPU render. * Many fixes for new tiled rendering.