git.blender.org/blender.git - Unnamed repository; edit this file 'description' to name the repository.

Age	Commit message (Collapse)	Author
2022-06-18	Cleanup: Remove unused argument, unnecessary struct keyword	Hans Goudey

2022-06-05	Cleanup: Move remaining mesh draw code to C++	Hans Goudey
	After this commit, all mesh data extraction and drawing code is in C++, including headers, making it possible to use improved types for future performance improvements and simplifications. The only non-trivial changes are in `draw_cache_impl_mesh.cc`, where use of certain features and macros in C necessitated larger changes. Differential Revision: https://developer.blender.org/D15088
2022-02-11	File headers: SPDX License migration	Campbell Barton
	Use a shorter/simpler license convention, stops the header taking so much space. Follow the SPDX license specification: https://spdx.org/licenses - C/C++/objc/objc++ - Python - Shell Scripts - CMake, GNUmakefile While most of the source tree has been included - `./extern/` was left out. - `./intern/cycles` & `./intern/atomic` are also excluded because they use different header conventions. doc/license/SPDX-license-identifiers.txt has been added to list SPDX all used identifiers. See P2788 for the script that automated these edits. Reviewed By: brecht, mont29, sergey Ref D14069
2021-12-27	OpenSubDiv: add support for an OpenGL evaluator	Kévin Dietrich
	This evaluator is used in order to evaluate subdivision at render time, allowing for faster renders of meshes with a subdivision surface modifier placed at the last position in the modifier list. When evaluating the subsurf modifier, we detect whether we can delegate evaluation to the draw code. If so, the subdivision is first evaluated on the GPU using our own custom evaluator (only the coarse data needs to be initially sent to the GPU), then, buffers for the final `MeshBufferCache` are filled on the GPU using a set of compute shaders. However, some buffers are still filled on the CPU side, if doing so on the GPU is impractical (e.g. the line adjacency buffer used for x-ray, whose logic is hardly GPU compatible). This is done at the mesh buffer extraction level so that the result can be readily used in the various OpenGL engines, without having to write custom geometry or tesselation shaders. We use our own subdivision evaluation shaders, instead of OpenSubDiv's vanilla one, in order to control the data layout, and interpolation. For example, we store vertex colors as compressed 16-bit integers, while OpenSubDiv's default evaluator only work for float types. In order to still access the modified geometry on the CPU side, for use in modifiers or transform operators, a dedicated wrapper type is added `MESH_WRAPPER_TYPE_SUBD`. Subdivision will be lazily evaluated via `BKE_object_get_evaluated_mesh` which will create such a wrapper if possible. If the final subdivision surface is not needed on the CPU side, `BKE_object_get_evaluated_mesh_no_subsurf` should be used. Enabling or disabling GPU subdivision can be done through the user preferences (under Viewport -> Subdivision). See patch description for benchmarks. Reviewed By: campbellbarton, jbakker, fclem, brecht, #eevee_viewport Differential Revision: https://developer.blender.org/D12406
2021-08-23	Cleanup: rename 'MeshBufferCache' to 'MeshBufferList'	Germano Cavalcante
	`MeshBufferList` is more specific and can avoid confusion with `MeshBufferExtractionCache`.
2021-08-23	Cleanup: Move 'tris_per_mat' member out of 'MeshBufferCache'	Germano Cavalcante
	`MeshBufferCache` is a struct representing a list of buffers. As such, `GPUIndexBuf tris_per_mat` is out of place as it does not represent one of the buffers in the list. In fact this member should be close to `GPUBatch surface_per_mat` as they are related. The code for dependencies between buffer and batch had to be reworked as it relies on the member's position. Differential Revision: https://developer.blender.org/D12227
2021-07-26	Cleanup: Rearrange mesh extraction files	Germano Cavalcante
	In the draw module, it's not easy to identify what its header is, and where the shared functions are. So move `draw_cache_extract_mesh_extractors.c` and `draw_cache_extract_mesh_private.h` to the same folder as the extractors and rename these files to make them more identifiable. Reviewed By: jbakker Differential Revision: https://developer.blender.org/D11991
2021-07-21	Draw Cache: extract tris in parallel ranges	Germano Cavalcante
	The `ibo.tris` extraction in multithread is currently only done if the mesh has only 1 material. Now we cache a map indicating the index of each polygon after sort and thus allow the extraction of tris with materials in multithreaded. As caching is a heavy operation and was already being performed in multi-thread for triangle offsets, no significant improvements are expected. The benefit will be much greater when we can skip updating the cache while transforming a geometry. Profiling: \|\|master:\|PATCH: \|---\|---\|---\| \|large_mesh_editing_materials:\|Average: 13.855380 FPS\|Average: 15.525684 FPS \|\|rdata 9ms iter 36ms (frame 71ms)\|rdata 9ms iter 29ms (frame 64ms) \|subdiv_mesh_final_only_materials:\|Average: 28.113742 FPS\|Average: 28.633599 FPS \|\|rdata 0ms iter 1ms (frame 36ms)\|rdata 0ms iter 1ms (frame 35ms) 1.1x overall speedup Differential Revision: https://developer.blender.org/D11445
2021-06-18	Cleanup: clang-tidy	Campbell Barton

2021-06-15	DrawManager: Cache material offsets.	Jeroen Bakker
	When using multiple materials in a single mesh the most time is spend in counting the offsets of each material for the sorting. This patch moves the counting of the offsets to render mesh data and caches it as long as the geometry doesn't change. This patch doesn't include multithreading of this code. Reviewed By: mano-wii Differential Revision: https://developer.blender.org/D11612
2021-06-11	Refactor: use 'BLI_task_parallel_range' in Draw Cache	Germano Cavalcante
	One drawback to trying to predict the number of threads that will be used in the `task_graph` is that we are only sure of the number when the threads are running. Using `BLI_task_parallel_range` allows the driver to choose the best thread distribution through `parallel_reduce`. The benefit is most evident on hardware with fewer cores. This is the result on an 4-core laptop: \|\|before:\|after: \|---\|---\|---\| \|large_mesh_editing:\|Average: 5.203638 FPS\|Average: 5.398925 FPS \|\|rdata 15ms iter 43ms (frame 193ms)\|rdata 14ms iter 36ms (frame 187ms) Differential Revision: https://developer.blender.org/D11558
2021-06-11	Refactor: Draw Cache: use 'BLI_task_parallel_range'	Germano Cavalcante
	This is an adaptation of {D11488}. A disadvantage of manually setting the iter ranges per thread is that we don't know how many threads are running in the background and so we don't know how to best distribute the ranges. To solve this limitation we can use `parallel_reduce` and thus let the driver choose the best distribution of ranges among the threads. This proved to be especially beneficial for computers with few cores. Benchmarking: Here's the result on an 4-core laptop: \|\|master:\|PATCH: \|---\|---\|---\| \|large_mesh_editing:\|Average: 5.203638 FPS\|Average: 5.398925 FPS \|\|rdata 15ms iter 43ms (frame 193ms)\|rdata 14ms iter 36ms (frame 187ms) Here's the result on an 8-core PC: \|\|master:\|PATCH: \|---\|---\|---\| \|large_mesh_editing:\|Average: 15.267482 FPS\|Average: 15.906881 FPS \|\|rdata 9ms iter 28ms (frame 65ms)\|rdata 9ms iter 25ms (frame 63ms) \|large_mesh_editing_ledge: \|Average: 15.145966 FPS\|Average: 15.520474 FPS \|\|rdata 9ms iter 29ms (frame 65ms)\|rdata 9ms iter 25ms (frame 64ms) \|looptris_test:\|Average: 4.001917 FPS\|Average: 4.061105 FPS \|\|rdata 12ms iter 90ms (frame 236ms)\|rdata 12ms iter 87ms (frame 230ms) \|subdiv_mesh_cage_and_final:\|Average: 1.917769 FPS\|Average: 1.971790 FPS \|\|rdata 7ms iter 37ms (frame 261ms)\|rdata 7ms iter 31ms (frame 258ms) \|\|rdata 7ms iter 38ms (frame 252ms)\|rdata 7ms iter 33ms (frame 249ms) \|subdiv_mesh_final_only:\|Average: 6.387240 FPS\|Average: 6.591251 FPS \|\|rdata 3ms iter 25ms (frame 151ms)\|rdata 3ms iter 16ms (frame 145ms) \|subdiv_mesh_final_only_ledge:\|Average: 6.247393 FPS\|Average: 6.596024 FPS \|\|rdata 3ms iter 26ms (frame 158ms)\|rdata 3ms iter 16ms (frame 148ms) Notes: - The improvement can only be noticed if all extracts are multithreaded. - This patch touches different areas of the code, so it can be split into another patch if the idea is accepted. These screenshots show how threads behave in a quadcore: Master: {F10164664} Patch: {F10164666} Differential Revision: https://developer.blender.org/D11558
2021-06-09	T88352: Use threaded ibo.tris extraction for single material meshes.	Jeroen Bakker
	This patch adds a specific extraction method when the mesh has only one material. This method is multi-threaded. There is a trade-off in this patch as the ibo isn't compressed (it adds restart indexes for hidden faces). So it depends if threading is faster than the additional GPU buffer upload. # Subdivided cube I used a cube subdivided 7 times, modifiers applied. that gives around 400000 faces. The test is selecting some vertices and move them. During this test the next buffers are updated on each frame: * vbo.pos_nor * vbo.lnor * vbo.edit_data * ibo.tris * ibo.points System info: \|platform\| Linux-5.11.0-7614-generic-x86_64-with-glibc2.33\| \| renderer\| AMD SIENNA_CICHLID (DRM 3.40.0, 5.11.0-7614-generic, LLVM 11.0.1)\| \|vendor\| AMD\| \|version\| 4.6 (Core Profile) Mesa 21.0.1\| \|cpu\| Intel(R) Core(TM) i7-6700 CPU @ 3.40GHz\| \|compiler\| gcc version 10.3.0\| Timing have been measured using DEBUG_TIME in `draw_cache_extract_mesh`. master: `rdata 8ms iter 45ms (frame 153ms)` this patch `rdata 6ms iter 36ms (frame 132ms)` Reviewed By: mano-wii Maniphest Tasks: T88352 Differential Revision: https://developer.blender.org/D11290
2021-06-08	Revert "Cleanup: use cpp new/delete."	Jeroen Bakker
	This reverts commit 43464c94f4def8689dd99a9e459f5ff77420d27b.
2021-06-08	Cleanup: replace NULL with nullptr.	Jeroen Bakker

2021-06-08	Cleanup: use cpp new/delete.	Jeroen Bakker

2021-06-07	Cleanup: Move extract tris in own compile unit.	Jeroen Bakker