LUMIERA.clone

Author	SHA1	Message	Date
Ichthyostega	c183045dfa	Library: switch Microbenchmark setup to C++17 threads Over time, a collection of microbenchmark helper functions was extracted from occasional use -- including a variant to perform parallelised microbenchmarks. While not used beyond sporadic experiments yet, this framework seems a perfect fit for measuring the SyncBarrier performance. There is only one catch: - it uses the old Threadpool + POSIX thread support - these require the Threadpool service to be started... - which in turn prohibits using them for library tests And last but not least: this setup already requires a barrier. ==> switch the existing microbenchmark setup to C++17 threads preliminarily (until the thread-wrapper has been reworked). ==> also introduce the new SyncBarrier here immediately ==> use this as a validation test of the setup + SyncBarrier	2023-09-24 18:07:28 +02:00
Ichthyostega	963dc38088	Activity-Lang: introduce some shorthand notation ...regarding the kind of activity (the verb), and also for some special case access of payload data; deliberately asserting the correct verb, but no mandatory check, since this whole Activity-Language is conceived as cohesive and essentially sealed (not meant to be extended)	2023-09-01 17:41:40 +02:00
Ichthyostega	cda1cdd975	Activity-Lang: verify memory allocation and connectivity	2023-08-29 18:46:37 +02:00
Ichthyostega	26c2e835c3	Activity-Lang: setup skeleton of the activation function - complete spec of Activity processing - define the invocation structure - implement basic cases of activation	2023-07-30 22:06:06 +02:00
Ichthyostega	28b3900284	Block-Flow: final adjustments from performance test (closes: #1311) Further extensive testing with parameter variations, using the test setup in `BlockFlow_test::storageFlow()` - Tweaks to improve convergence under extreme overload; sudden load peaks are now accommodated typically < 5 sec - Make the test definition parametric, to simplify variations - Extract the generic microbenchmark helper function - Documentation	2023-07-22 06:07:35 +02:00
Ichthyostega	049ca833a0	Block-Flow: optimise parameters for performance There seems to be a ''sweet spot'' for somewhat larger Epoch sizes around 500 slots. At least in the test setup used here, which works with a load of 200 Frames / sec, which is significantly over the typical value of 50fps (video + audio) for simple playback. The averaged allocation times cannot be improved much below 30ns. Overall, this can be considered a good result, since this allocation scheme does way more than just allocate memory, it also provides a means to track dependencies and lifecycle. __For context__: - we should strive to process one frame in ~ 10ms - for 10 Activity records per Frame, we currently use < 0.5 µs for memory and dependency management in the scheduler - this leaves enough room for the further administrative efforts (priority queue, job planning, buffer management)	2023-07-21 04:34:04 +02:00
Ichthyostega	2977076b7f	Block-Flow: switch to using the reworked config BUT -> +50% runtime in -O3 (+20ns) Investigation seems to indicate - that the increased (+1 Epochs, 10 -> 11) moving average caused the Algo to perform worse (strong effect) - that the Optimiser has problems with boost::rational, which however yields only a minute effect (+5ns), and only on the critical path The access via Meyers Singleton has no adverse effect, rather the new setup gives a tiny benefit (46ns -> 37ns). Surprisingly, the increased pre-allocation has no observable effect.	2023-07-20 21:47:18 +02:00
Ichthyostega	ca502aa826	Block-Flow: introduce config through a policy mix-in ...measured running time reproduced unaltered for -O3	2023-07-20 19:28:20 +02:00
Ichthyostega	5803fed544	Block-Flow: draft for re-arranged configuration In the long run, there will be a central Render Engine parametrisation; some parameters can even be expected to be dynamic; thus prepare the BlockFlow allocator to fit in with this expectation	2023-07-20 16:46:54 +02:00
Ichthyostega	14a5200cc0	Block-Flow: more runtime observation and fine-tuning For comparison: use individual management by refcount. This supports the conclusion that BlockFlow is more than just a custom allocator; it also supports a non-trivial lifetime management, and this comes at a cost. Playing around with various load patterns uncovers further weak spots in the regulation mechanism. As a remedy, introduce a stronger feed-back and especially set the target load factor from 100% -> 90% to add some headroom to absorb intermittent load peaks. Presumably ''much more observation and fine-tuning'' will be necessary under real-world load conditions (⟹ Ticket #1318 for later)	2023-07-19 03:29:09 +02:00
Ichthyostega	c008858d8f	Block-Flow: investigate, fix and fine-tune Epoch size control - BUG: must prevent the Epoch size from becoming excessively low - Problem: feedback signal should not be overly aggressive Fine-Tuning: - Dose for Overflow-compensation is delicate - Moving average and Overflow should be balanced - ideally the compensatory actions should be one order of magnitude slower than the characteristic regulation time Improvement: perform Moving-Average calculations in doubles	2023-07-18 21:23:00 +02:00
Ichthyostega	c7d6f3e24c	Block-Flow: Load-Test indicates problem in Epoch control ...leading to PATHETICALLY bad timing comparison ...it seems clear that the Epoch-Step went to zero (which was neither anticipated, nor protected against) However, even individual heap allocations fare surprisingly well under full optimisation; they just don't solve our problem with tracking dependencies; the simplest solution that would also fulfil this requirement would be using shared_ptr	2023-07-18 01:59:17 +02:00
Ichthyostega	c1001064e3	Block-Flow: draft Load/Stress-Test - use a midrange load scenario - but play this at saturation level	2023-07-17 18:36:12 +02:00
Ichthyostega	a4365a24f8	Block-Flow: feed size regulation on clean-up Generate a signal based on actual Epoch length and observed fill ratio, assuming even distribution of load.	2023-07-17 04:32:10 +02:00
Ichthyostega	9d040dc49c	Block-Flow: compute exponential moving average ..as a heuristic to regulate optimal Epoch duration; when Epochs are discarded, the effective fill factor can be used to guess an Epoch duration time, which would (in hindsight) lead to perfect usage of storage space	2023-07-17 03:00:56 +02:00
Ichthyostega	bd353d768a	Block-Flow: detect and react on Epoch overflow ..using a simplistic implementation for now: scale down the Epoch-stepping by 0.9 to increase capacity accordingly. This is done on each separate overflow event, and will be counterbalanced by the observation of Epoch fill ratio performed later on clean-up of completed Epochs	2023-07-16 20:47:39 +02:00
Ichthyostega	6d75a82932	Block-Flow: introduce backlink into AllocationHandle further implementation makes clear that the AllocationHandle, which is the primary usage front-end, has to rely both on services of the underlying ExtentFamily allocator, as well as on the BlockFlow itself for managing the Epoch spacing.	2023-07-16 18:03:27 +02:00
Ichthyostega	e4b74f3ae1	Block-Flow: handle Epoch overflow ...draft of control logic, does not work correctly in all cases	2023-07-16 03:06:02 +02:00
Ichthyostega	dce65104aa	Block-Flow: select suitable Epoch for new allocation	2023-07-15 21:37:58 +02:00
Ichthyostega	cb2ee9466b	Block-Flow: add diagnostics and define further expectations - fix a bug in IterExplorer: when iterating a »state core« directly, the helper CoreYield passed the detected type through ValueTypeBindings. This is logically wrong, because we never want to pick up some typedefs, rather we always want to use the type directly returned from CORE::yield() Here the iterator returns an Epoch&, which itself is again iterable (it inherits from std::array<Activity, N>). However, it is clear that we must not descend into such a "flatMap" style recursive expansion - draft a simple scheme how to regulate Epoch lengths dynamically - add diagnostics to pinpoint a given Activity and find out into which Epoch it has been allocated; used to cover the allocator behaviour	2023-07-15 18:54:59 +02:00
Ichthyostega	7167ad6d96	Block-Flow: define expected behaviour for Epoch association ...how new Activities are placed into Epochs, incl. overflow	2023-07-14 05:03:01 +02:00
Ichthyostega	d0fd7f32a9	Block-Flow: verify handling of Activity records within the Epoch	2023-07-14 01:51:00 +02:00
Ichthyostega	af8f84a72d	Block-Flow: complete simple use case (see #1311) - add preliminary deadline-check (directly instead of using the Activity) - with this shortcut, now able to implement discarding obsoleted Epochs - Iteration and use of the underlying `ExtentFamily` is also settled by now 💡 ''Implementation concept for the allocation scheme complete and validated''	2023-07-13 19:43:22 +02:00
Ichthyostega	946f7c17f7	Block-Flow: implement opening a new Epoch ..this is the most simple case, where no Epochs are opened yet ..add diagnostics to inspect alloc count and deadlines ..add accessors for the first/last underlying Extent	2023-07-13 04:41:58 +02:00
Ichthyostega	180c6b8d84	Block-Flow: define next steps to construct ...continue to proceed test-driven ...scheduler internals turn out to be intricate and cohesive, and thus the only hope is to adhere to strict testing discipline	2023-07-13 01:51:21 +02:00
Ichthyostega	ccf0710903	Block-Flow: maintain an »Epoch« within the raw allocation Extent - the idea is to use slot-0 in each extent for administrative metadata - to that end, a specialised GATE-Activity is placed into slot-0 - decision to use the next-pointer for managing the next free slot - thus we need the help of the underlying ExtentFamily for navigating Extents Decision to refrain from any attempt to "fix" excessive memory usage, caused by Epochs still blocked by pending IO operations. Rather, we assume the engine uses sane parametrisation (possibly with dynamic adjustment) Yet still there will be some safety limit, but when exceeding this limit, the allocator will just throw, thereby killing the playback/render process	2023-07-09 01:32:27 +02:00
Ichthyostega	4ac995548a	Block-Flow: identify required API operations - decision how to handle the Extent storage (by forced-cast) - decision to place the administrative record directly into the Extent TODO not clear yet how to handle the implicit limitation for future deadlines	2023-07-05 15:12:20 +02:00
Ichthyostega	23a6fbdf4f	Scheduler: investigate modes of operation - analysis of Activity usage - derive possible memory management schemes - research regarding asynchronous IO - decision regarding the memory management scheme	2023-07-03 18:40:37 +02:00

28 commits
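
The Block-Flow commits from 2023-07-16 to 2023-07-19 describe a two-sided regulation of the Epoch duration: each overflow scales the Epoch step down by 0.9, while the clean-up of completed Epochs feeds a moving average, derived from the observed fill ratio, towards a 90% target load. The following is only a minimal sketch of that idea, assuming an even distribution of load. The class name `EpochStepRegulator`, its member functions, the millisecond unit and the averaging window of 10 are hypothetical choices for illustration and are not taken from the Lumiera code base.

```cpp
#include <algorithm>
#include <cassert>

// Hypothetical illustration: all names and constants are assumptions,
// not the actual BlockFlow implementation.
class EpochStepRegulator
{
    double step_ms_;      // current Epoch duration
    double minStep_ms_;   // lower bound, prevents the step from collapsing to zero
    double targetFill_;   // desired fill ratio; 0.9 leaves headroom for load peaks
    double window_;       // number of samples weighted in the moving average

public:
    EpochStepRegulator (double initialStep_ms, double minStep_ms,
                        double targetFill = 0.9, double window = 10.0)
      : step_ms_{std::max (initialStep_ms, minStep_ms)}
      , minStep_ms_{minStep_ms}
      , targetFill_{targetFill}
      , window_{window}
    {
        assert (minStep_ms > 0.0);
        assert (window >= 1.0);
    }

    double step()  const { return step_ms_; }

    // Overflow event: shorten the Epoch step by factor 0.9,
    // so that more Epochs (and thus more capacity) cover the same time span.
    void
    markOverflow()
    {
        step_ms_ = std::max (minStep_ms_, step_ms_ * 0.9);
    }

    // Clean-up of a completed Epoch: guess the duration which, in hindsight,
    // would have filled the Epoch exactly to the target ratio, and blend this
    // guess into the current step as an exponential moving average (in doubles).
    void
    markEpochDiscarded (double actualLength_ms, double fillRatio)
    {
        fillRatio = std::clamp (fillRatio, 0.01, 1.0);   // avoid division by ~zero
        double ideal_ms = actualLength_ms * targetFill_ / fillRatio;
        step_ms_ = (step_ms_ * (window_ - 1.0) + ideal_ms) / window_;
        step_ms_ = std::max (minStep_ms_, step_ms_);
    }
};
```

In this sketch a single overflow acts immediately, whereas a single clean-up observation contributes only one tenth of its value, so the averaged signal moves more slowly than the overflow compensation. The explicit lower bound mirrors the bug noted in commit c008858d8f, where the Epoch size must not become excessively low.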