LUMIERA.clone

Author	SHA1	Message	Date
Ichthyostega	c5d1a7d0df	Library: rearrange into standard allocation factory - code spelled out as intended, according to generic scheme - can now encode the »unmanaged« case directly as `null`-deleter, because in all other cases a deleter function is mandatory now - add default constructor to `ArrayBucket`, detailing the default spread	2024-06-07 01:53:38 +02:00
Ichthyostega	0e2ca6ee1c	Library: consider to align this with the »Factory« concept even while at first sight only a ''deleter instance'' is required, it seems prudent to rearrange the code in accordance to the prospective Allocator / Object Factory concept, and rather try to incorporate the specifics of the memory layout into this generic view, thereby abstracting the actual allocator away. This can be achieved by using a standard-allocator for `std::byte` as the base allocator and treat each individual element allocator as a specialised cross-allocator (assuming that this cross adaptation is actually trivial in almost all cases)	2024-06-07 01:14:55 +02:00
Ichthyostega	bf74ba6292	Library: sketch for a deleter trampoline for simple allocators this can be static	2024-06-06 18:41:07 +02:00
Ichthyostega	98d5b2962c	Library: analyse options for passing a deleter function The fundamental decision is that we want to have a single generic front-end, meaning that we must jump dynamically into a configured deleter function. And on top of that comes the additional requirement that ''some allocators'' are in fact tied to a specific instance, while other allocators are monostate. However, we can distinguish both by probing if the allocator can be default constructed, and if a default constructed allocator is equivalent to the currently used alloctor instance. If this test fails, we must indeed maintain a single allocator instance, and (to avoid overengineering for this rather special use case) we will place this allocator instance into heap memory then, with a self-cleanup mechanism On the other hand, all monostate allocators can be handled through static trapolines.	2024-06-06 02:46:05 +02:00
Ichthyostega	802fef9b7c	Library: work out Skeleton for memory-handling strategy - the basic decision is to implement ''realloc'' similar to `std::vector` - however the situation is complicated by the desire to allow arbitrary element types - ⟹ must build a strategy based on the properties of the target type - the completely dynamic growth is only possibly for trivially-movable types - can introduce a dedicated ''element type'' though, and store a trampolin handler	2024-06-05 02:24:39 +02:00
Ichthyostega	2abbae77d7	Library: draft memory rearrangements not clear yet how to handle the classical "realloc" situation	2024-05-29 01:01:16 +02:00
Ichthyostega	feeee4096d	Library: draft skeleton of builder operations - create by forwarding allocator arguments to policy - builder-Op to append from iterator - decide to collapse the ArrayBucket class, since access is going through unsafe pointer arithmetic anyway	2024-05-28 18:52:01 +02:00
Ichthyostega	27b36f0679	Library: implement access to storage and subscript ...not sure if this approach works out OK, since we can not make a safe downcast to a size known at runtime	2024-05-28 18:07:08 +02:00
Ichthyostega	f6e4358259	Library: data layout for the new `Several` container - favour dynamic polymorphism - use additional memory for management data alongside the element allocation - encode a flag and a deleter pointer to enable ownership of the allocation - inherit base container privately into builder, so the build ends with a slice	2024-05-28 17:20:34 +02:00
Ichthyostega	73dd24ecef	Library: start design draft to replace `RefArray` Some decisions - use a single template with policy base - population via separate builder class - implemented similar to vector (start/end) - but able to hold larger (subclass) objects	2024-05-28 04:03:51 +02:00
Ichthyostega	e4f91ecb4d	Library: document usage of AllocationCluster for STL containers - basically works out-of-the-box now - the hard wired fixed Extent size is a serious limitation - however, this is not the intended primary use, rather complementary	2024-05-28 00:36:32 +02:00
Ichthyostega	db4fbde201	Library: adapt further `AllocationCluster` tests ...especially demonstrate that destructors can optionally be invoked or not invoked...	2024-05-27 21:21:03 +02:00
Ichthyostega	178107e8b9	Library: enable empty-base optimisation for allocator ...this is an important detail: quite commonly, a custom allocator is actually implemented as monostate, to avoid bloating every client container with a backlink pointer; by inheriting the `StdFactory` adapter from the allocator, the empty-base optimisation can be exploited. In the standard case thus LinkedElements is the same size as a single pointer, which is already exploited at several places in the code base. Notably `AllocationCluster` uses a »virtual overlay« to dress-up the position pointer as `LinkedElements`, allowing to delegate most of the administration and memory management to existing and verified code. With this adjustments, `LinkedElements` pass the tests again and the rework of `AllocationCluster` is considered complete.	2024-05-27 19:02:31 +02:00
Ichthyostega	cf80a292c1	Library: adapt `LinkedElements` to use the new `StdFactory` adaptor This is the first validation of the new design: the policy to take ownership can be reimplemented simply by delegating to the adaptor for a C++ standard allocator	2024-05-27 02:06:06 +02:00
Ichthyostega	8ca3c61c6f	Library: Analysis and planning towards a generic Allocator Concept The following structure can be expected, after __switching to C++20__ * Concept Allocator deals with the bare memory allocation * Concept Factory handles object creation and disposal by delegation * Concept Handle is a ready-made functor for dependency-injection Right now, an implementation of the ''prospective Factory Concept'' can be provided, by delegating through `std::allocator_traits` to a given `std::allocator` or compatible object	2024-05-26 19:31:30 +02:00
Ichthyostega	5259000bc4	Library: how to use a standard allocator for `LinkedElements` By default, LinkedElements uses a policy OwningHeapAllocated; while retaining this interface, this policy should be recast to rely on a standard compliant allocator, with a default fallback to `std::allocator<T>` This way, a single policy would serve all the cases where objects are actually owned and managed by `LinkedElements`, and most special policies would be redundant. This turns out to be quite tedious and technical however, since the newer standard mandates to use std::allocator_traits as front-end, and moreover the standard allocators are always tied to one specific target type, while `LinkedElements` is deliberately used to maintain a polymorphic sequence.	2024-05-26 02:51:47 +02:00
Ichthyostega	3edf727c23	Library: simplify diagnostics using a transient list since the calculation to find the current block start has been recast as a private method, it is now possible to calculate the allocation statistics without mutating the pos pointer. To enable such usages, add a wrapper for `LinkedElements` to expose an element-pointer temporarily as a immutable `LinkedElements` list, allowing to iterate or use subscript and size information functions	2024-05-25 23:35:12 +02:00
Ichthyostega	08e0f52e61	Library: low-level implementation internals covered ...including overflow into new extents, alignment padding and chaining and invocation of destructors.	2024-05-25 20:01:23 +02:00
Ichthyostega	be398e950a	Library: better let C++ handle the destructors ...what I've implemented yesterday is effectively the same functionality as provided automatically by the C++ object system when using a virtual destructor. Thus a much cleaner solution is to turn `Destructor` into a interface and let C++ do all the hard work. Verified in test: works as intended	2024-05-25 19:27:17 +02:00
Ichthyostega	71d5851701	Library: implement optional invocation of destructors This is the first draft, implementing the invocation explicitly through a trampoline function. While it seems to work, the formulation can probably be simplified....	2024-05-25 05:14:36 +02:00
Ichthyostega	31f8664725	Library: verify overflow to second extent	2024-05-25 01:28:29 +02:00
Ichthyostega	841234684b	Library: verify further simple allocations and alignment	2024-05-24 19:06:33 +02:00
Ichthyostega	037a5f2dd0	Library: verify the simplest possible allocation ...by inspecting the raw memory locations -- looks good thus far...	2024-05-24 18:05:21 +02:00
Ichthyostega	e8d9dcd9bf	Library: implement accounting of storage size These diagnostics helpers must rely on low-level trickery, since the implementation strives at avoiding unnecessary storage overhead. Since `AllocationCluster` is move-only (for good reasons) and `StorageManager` can not be constructed independently, a »backdoor« is created by forced cast, relying on the known memory layout	2024-05-24 17:05:50 +02:00
Ichthyostega	13e22f315a	Library: clarify details of the low-level allocation - rather accept hard-wired limits than making the implementation excessively generic - by exploiting the layout, the administrative overhead can be reduced significantly - the trick with the "virtual managment overlay" allows to hand-off most of the clean-up work to C++ destructor invocation - it is important to verify these low-level arrangements explicitly by unit-test	2024-05-19 17:53:51 +02:00
Ichthyostega	72aea53ac3	Library: idea for an extent management mechanism * this is pure old-style low-level trickery * using a layout trick, the `AllocationCluster` can be operated with the bare minimum of overhead * this trick relies on the memory layout of `lib::LinkedElements`	2024-05-18 02:35:20 +02:00
Ichthyostega	c623298ac8	Library: draft standard case for actual memory allocation - attempt to use the minimal possible storage for the management itself - rely on the `std::align()` function to do the actual placement	2024-05-16 02:47:01 +02:00
Ichthyostega	eeda3aaa56	Library: remove elaborate allocation logic ...due to the decision to use a much simpler allocation scheme to increase probability for actual savings, after switching the API and removing all trading related aspects, a lot of further code is obsoleted	2024-05-16 01:46:41 +02:00
Ichthyostega	13f51c910c	Library: work out ramifications of the changed design Notably this raises the difficult question, whether to ensure invocation of destructors. Not invoking dtors ''breaks one of the most fundamental contracts'' of the C++ language — yet the infrastructure to invoke dtors in such a heterogeneous cluster of allocations creates a hugely significant overhead and is bound to poison the caches (objects to be deallocated typically sit in cold memory pages). What makes this decision especially daunting is the fact that the low-level-Model can be expected to be one of the largest systemic data structures (letting aside the media buffers). I am leaning towards a compromise: turn down this decision towards the user of the `AllocationCluster`	2024-05-15 19:59:05 +02:00
Ichthyostega	fd49c68096	Library: draft a simpler STL complient allocation interface After some analysis, it became clear that the existing code for `AllocationCluster` (while in itself valid) will likely miss the point for the expected usage in the low-level Model: most segments of the model will be rather small, and thus there is not enough potential for amortisation when using such a per-type and per-segment scheme; a rather simplistic linear allocator will be sufficient. On the other hand, with the current C++ standard it is easy to provide a complient allocator implementation for STL containers, and thus the interface should be retro-fitted accordingly.	2024-05-15 01:38:16 +02:00
Ichthyostega	db30da90ce	Invocation: consider storage and allocation of fan-in/fan-out At the time of the initial design attempts, I naively created a classic interface to describe an fixed container allocated ''elsewhere.'' Meanwhile the C++ language has evolved and this whole idea looks much more as if it could be a ''Concept'' (C++20). Moreover, having several implementations of such a container interface is deemed inadequate, since it would necessitate ''at least two indirections'' — while going the Concept + Template route would allow to work without any indirection, given our current understanding that the `ProcNode` itself is ''not an interface'' — rather a building block.	2024-05-13 18:34:42 +02:00
Ichthyostega	c0d5341b15	Invocation: capture idea for sharpened invocation structure - the starting point is the idea to build a dedicated ''turnout system'' - `StateAdapter`, `BuffTable` ⟶ `FeedManifold` and _Invocation_ will be fused - actually, the `TurnoutSystem` will be ''pulled'' and orchestrate the invocation - the structure is assumed to be recursive The essence of the Node-Invocation, as developed 2009 / 2011 remains intact, yet it will be organised along a clearer structure	2024-05-12 17:27:07 +02:00
Ichthyostega	bd9527716a	Invocation: segregate first and second buffer feed implementation Within the existing body of code, there are two unfinished attempts towards building a node invocation and management of data buffers. The first attempt was entirely driven from the angle of invoking a processing function, while the second one draws from a wider scope and can be considered the solution to build upon regarding data buffers in general. However, the results of the first approach are well suited for their specific purpose, so both solutions will be combined. Thus the arrangement of data feeds going in and out of the render node shall be renamed into `BuffTable` -> `FeedManifold`	2024-05-11 17:06:12 +02:00
Ichthyostega	9a435a667e	Invocation: start with some rename-refactorings ... to plot a clearer understanding of the intended usage	2024-05-11 16:39:58 +02:00
Ichthyostega	0d5f95704f	DOC: add short inline summary for Mar / Apr Developer Meetings ..some conclusions from Scheduler performance testing	2024-05-07 16:17:19 +02:00
Ichthyostega	bb3d565436	Invocation: Reassessment of existing code ...which seems to be basically fine thus far ...beyond some renaming and rearranging ''it turns out that the final, crucial links, necessary to tie all together, are yet to be developed''	2024-05-05 15:12:23 +02:00
Ichthyostega	a11ee34fc8	Invocation: forge a path for integration Facing quite some difficulties here, since there are (at least) two abandoned past efforts towards building a render node network in the code base; the structure and architecture decisions from these previous attempts seem largely valid still, yet on a technical level, the style of construction evolved considerably in the meantime. Moreover, these old fragments of code, written during the early stages of the project, were lacking clear goals and anchor points at places; the situation is quite different now in this respect. Sticking to well proven practice, the rework will be driven by a test setup, and will progress over three steps with increasing levels of integration.	2024-04-23 01:13:40 +02:00
Ichthyostega	47e26e2a65	Invocation: initial considerations... Looks like some code archaeology is required to sort apart the various effort to get this topic started....	2024-04-21 02:58:30 +02:00
Ichthyostega	d71eb37b52	Scheduler-test: complete and document stress testing effort (closes #1344 ) The initial effort of building a Scheduler can now be considered complete Reaching this milestone required considerable time and effort, including an extended series of tests to weld out obvious design and implementation flaws. While the assessment of the new Scheduler's limitation and traits is ''far from complete,'' some basic achievements could be confirmed through this extended testing effort: * the Scheduler is able to follow a given schedule effectively, until close up to the load limit * the ''stochastic load management'' causes some latency on isolated events, in the order of magnitude < 5ms * the Scheduler is susceptible to degradation through Contention * as mitigation, the Scheduler prefers to reduce capacity in such a situation * operating the Scheduler effectively thus requires a minimum job size of 2ms * the ability for sustained operation under full nominal load has been confirmed by performing test sequences with over 80 seconds * beyond the mentioned latency (<5ms) and a typical turnaround of 100µs per job (for debug builds), no further significant overhead was found. Design, Implementation and Testing were documented extensively in the [https://lumiera.org/wiki/renderengine.html#Scheduler%20SchedulerProcessing%20SchedulerTest%20SchedulerWorker%20SchedulerMemory%20RenderActivity%20JobPlanningPipeline%20PlayProcess%20Rendering »TiddlyWiki« #Scheduler]	2024-04-20 01:56:54 +02:00
Ichthyostega	a46449d5ac	Scheduler-test: stable-state run > 1sec This test completes the stress-testing effort and summarises the findings * Scheduler performs within relevant parameter range without significant overhead * Scheduler can operate with full load in stable state, with 100% correct result	2024-04-18 01:39:28 +02:00
Ichthyostega	177e241060	Scheduler-test: investigate extended loads with different patterns The behaviour seems consistent and the schedule breaks at the expected point. At first sight, concurrency seems slightly to low; detailed investigation however shows that this is due to the structure of the load graph, and in fact the run time comes close to optimal values.	2024-04-18 01:39:28 +02:00
Ichthyostega	c934e7f079	Scheduler-test: reduce impact of scale adjustments on breakpoint-search the `BreakingPoint` tool conducts a binary search to find the ''stress factor'' where a given schedule breaks. There are some known deviations related to the measurement setup, which unfortunately impact the interpretation of the ''stress factor'' scale. Earlier, an attempt was made, to watch those factors empirically and work a ''form factor'' into the ''effective stress factor'' used to guide this measurement method. Closer investigation with extended and elastic load patters now revealed a strong tendency of the Scheduler to scale down the work resources when not fully loaded. This may be mistaken by the above mentioned adjustments as a sign of a structural limiation of the possible concurrency. Thus, as a mitigation, those adjustments are now only performed at the beginning of the measurement series, and also only when the stress factor is high (implying that the scheduler is actually overloaded and thus has no incentive for scaling down). These observations indicate that the »Breaking Point« search must be taken with a grain of salt: Especially when the test load does ''not'' contain a high degree of inter dependencies, it will be ''stretched elastically'' rather than outright broken. And under such circumstances, this measurement actually gauges the Scheduler's ability to comply to an established load and computation goal.	2024-04-18 01:39:27 +02:00
Ichthyostega	55842a8d4f	Scheduler-test: identify and fix bug in allocator ...well — more of a logical contradiction, not so much a bug. The underlying problematic situation arises when meanwhile the Extent storage has been expanded, and especially the active slots are in »wrapped state«. In this case, the newly allocated extents must be rotated in, which invalidates existing index numbers. This problem was amended by exploting a chaching mechanism, allowing to re-attach and validate an index position still stored in an old iterator; especially this can happen when attempting to attach a follow-up dependency onto a job planned earlier, but not yet scheduled. The problem here was an assertion failure, which was triggered with a high probability; the fix for the problem detailed above used the yield() function, while it actually was only interested in retrieving the Extent's address to probe if the extent matches an known storage location. The solution is to provide a dedicated function for this check, which can then skip the sanity check (because in this case we do not want to use the Extent, and thus can touch obsoleted/inactive Extents without problem)	2024-04-18 01:39:27 +02:00
Ichthyostega	7c2b9a8ba5	Scheduler-test: investigate extended load patterns ...this seems to be the last topic for this investigation of Scheduler behaviour; the goal is to demonstrate readiness for stable-state operation over an extended period of time	2024-04-18 01:39:26 +02:00
Ichthyostega	1d4f6afd18	Scheduler-test: complete and document the Load-peak tests - use parameters known to produce a clean linear model - assert on properties of this linear model Add extended documentation into the !TiddlyWiki, with a textual account of the various findings, also including some of the images and diagrams, rendered as SVG	2024-04-12 02:23:31 +02:00
Ichthyostega	7798ef499c	Scheduler-test: adapt assertions to changes in load generation This amends test code, which was commented-out for some time, and was affected by the changes in load-graph generation: `a983a506b` These changes typically lead to a simplified topology at the end of the load graph, since open ends are no longer connected to a single exit node. In the case here, level 27 is no longer generate, and level 26 is now comprised of three nodes, two of them with load=2	2024-04-12 02:23:31 +02:00
Ichthyostega	5b62438eb4	Scheduler-test: investigate logic problem related to the »Tick« deadline In the end, I decided that it ''is to early to decide anything'' in this respect... The actual situation encountered is a Catch-22: * in its current form, the »Tick« handler detects compulsory jobs beyond deadline * since such a Job ''must not be touched anymore,'' there is no way scheduling can proceed * so this would constitute a ''Scheduler Emergency'' All fine — just the »Tick« handler ''itself is a compulsory job'' — and being a job, it can well be driven beyond its deadline. In fact this situation was encountered as part of stress testing. Several mitigations or real solutions are conceivable, but in the end, too little is known yet regarding the integration of the scheduler within the Engine Thus I'll marked the problematic location and opened #1362	2024-04-12 02:23:31 +02:00
Ichthyostega	1316ee2c7f	Scheduler-test: adjust contention mitigation as result of testing Investigate the behaviour over a wider range of job loads, job count and worker pool sizes. Seemingly the processing can not fully utilise the available worker pool capacity. By inspection of trace-dumps, one impeding mechanism could be identified: the »stickiness« of the contention mitigation. Whenever a worker encounters repeated contention, it steps up and adds more and more wait cycles to remove pressure from the schedule coordination. As such this is fine and prevents further degradation of performance by repeated atomic synchronisation. However, this throttling was kept up needlessly after further successful work-pulls. Since job times of several milliseconds can be expected on average in media processing, such a long retention would spread a performance degradation over a duration of several frames. Thus, the scheme for step-down was changed to decrease the throttling by a power series rather than just documenting the level.	2024-04-12 02:23:31 +02:00
Ichthyostega	a6a9155cd9	Scheduler-test: measurements documented	2024-04-09 17:10:21 +02:00
Ichthyostega	6e7f9edf43	Scheduler-test: calculate linear model as test result Use the statistic functions imported recently from Yoshimi-test to compute a linear regression model as immediate test result. Combining several measurement series, this allows to draw conclusions about some generic traits and limitations of the scheduler.	2024-04-09 17:10:21 +02:00

... 7 8 9 10 11 ...

7036 commits