LUMIERA.clone

Author	SHA1	Message	Date
Ichthyostega	da57e3dfcd	Scheduler-test: ''can demonstrate running a synthetic load'' (closes #1346 ) * added benchmark over synchronous execution as point of reference * verified running times and execution pattern * Scheduler behaves as expected for this example	2023-12-11 23:53:25 +01:00
Ichthyostega	347b9b24be	Scheduler-test: complete and integrate computational load This basically completes the Chain-Load framework; a simple Scheduler integration run with all relevant features can now be demonstrated.	2023-12-11 19:42:23 +01:00
Ichthyostega	db1ff7ded7	Scheduler-test: incremental calibration of both variants - Generally speaking, the calibration uses current baseline settings; - There are now two different load generation methods, thus both must be calibrated - Performance contains some socked and non-linear effects, thus calibration should be done close to the work point, which can be achieved by incremental calibration until the error is < 5% Interestingly, longer time-base values run slightly faster than predicted, which is consistent with the expectation (socket cost). And using a larger memory block increases time values, which is also plausible, since cache effects will be diminishing	2023-12-11 04:43:05 +01:00
Ichthyostega	9ef8e78459	Scheduler-test: implement memory-accessing load ...use an array of volatiles, and repeatedly add neighbouring cells ...bake the base allocation size configurable, and tie the alloc to the scale-step	2023-12-11 03:13:28 +01:00
Ichthyostega	df4ee5e9c1	Scheduler-test: implement pure computation load ..initial gauging is a tricky subject, since existing computer's performance spans a wide scale Allowing - pre calibration -98% .. +190% - single run ±20% - benchmark ±5%	2023-12-11 03:10:42 +01:00
Ichthyostega	beebf51ac7	Scheduler-test: draft a configurable CPU load component ...which can be deliberately attached (or not attached) to the individual node invocation functor, allowing to study the effect of actual load vs. zero-load and worker contention	2023-12-10 19:58:18 +01:00
Ichthyostega	fcfdf97853	Chain-Load: prepare infrastructure for computational load Within Chain-Load, the infrastructure to add this crucial feature is minimal: each node gets a `weight` parameter, which is assigned using another RandomDraw-Rule (by default `weight==0`). The actual computation load will be developed as a separate component and tied in from the node calculation job functor.	2023-12-09 03:13:48 +01:00
Ichthyostega	fe6f2af7bb	Chain-Load: combine all exit-hashes into a single global hash ...during development of the Chain-Load, it became clear that we'll often need a collection of small trees rather than one huge graph. Thus a rule for pruning nodes and finishing graphs was added. This has the consequence that there might now be several exit nodes scattered all over the graph; we still want one single global hash value to verify computations, thus those exit hashes must now be picked up from the nodes and combined into a single value. All existing hash values hard coded into tests must be updated	2023-12-09 02:36:14 +01:00
Ichthyostega	9e25283b72	Chain-Load: precise pre-roll for planning-job ...with this change, processing is ''ahead of schedule'' from the beginning, which has the nice side effect that the problematic contention situation with these very short computation jobs can not arise, and most of the schedule is processed by a single worker. Processing pattern is now pretty much as expected	2023-12-09 01:20:53 +01:00
Ichthyostega	1df328cfc1	Chain-Load: switch planning chunk-size from level to node This is a trick to get much better scheduling and timing guesses. Instead of targeting a specific level, rather a fixed number of nodes is processed in each chunk, yet still always processing complete levels. The final level number to expect can be retrieved from the chain-load graph. With this refactoring, we can now schedule a wake-up job precisely after the expected completion of the last level	2023-12-08 23:52:57 +01:00
Ichthyostega	34d6423660	Scheduler-test: first successful complete run Scheduling a wake-up job behind the end of the planned schedule did the trick. Sometimes there is ''strong contention'' immediately after full provision of the WorkForce, but this seems to be as expected, since the »Jobs« currently used have no actually relevant run time on their own. It is even more surprising that the Capacity-control logic is able to cope with this situation in a matter of just some milliseconds, bringing the average Lag at ~ 300µs	2023-12-08 04:22:12 +01:00
Ichthyostega	7eca3ffe42	Scheduler-test: a helper for one-time operations Invent a special JobFunctor... - can be created / bound from a λ - self-manages its storage on the heap - can be invoked once, then discards itself Intention is to pass such one-time actions to the Scheduler to cause some ad-hoc transitions tied to curren circumstances; a notable example will be the callback after load-test completion.	2023-12-08 03:16:57 +01:00
Ichthyostega	030e9aa8a2	Scheduler / Activity-Lang: simplify handling of blocked Gate In the first draft version, a blocked Gate was handled by »polling« the Gate regularly by scheduling a re-invocation repeatedly into the future (by a stepping defined through ExecutionCtx::getWaitDelay()). Yet the further development of the Activity-Language indicates that the ''Notification mechanism'' is sufficient to handle all foreseeable aspects of dependency management. Consequently this ''Gate poling is no longer necessary,'' since on Notification the Gate is automatically checked and the activation impulse is immediately passed on; thus the re-scheduled check would never get an opportunity actually to trigger the Gate; such an active polling would only be necessary if the count down latch in the Gate is changed by "external forces". Moreover, the first Scheduler integration tests with TestChainLoad indicate that the rescheduled polling can create a considerable additional load when longer dependency chains miss one early prerequisite, and this additional load (albeit processed comparatively fast by the Scheduler) will be shifted along needlessly for quite some time, until all of the activities from the failed chain have passed their deadline. And what is even more concerning, these useless checks have a tendency to miss-focus the capacity management, as it seems there is much work to do in a near horizon, which in fact may not be the case altogether. Thus the Gate implementation is now changed to just SKIP when blocked. This helped to drastically improve the behaviour of the Scheduler immediately after start-up -- further observation indicated another adjustment: the first Tick-duty-cycle is now shortened, because (after the additional "noise" from gate-rescheduling was removed), the newly scaled-up work capacity has the tendency to focus in the time horizon directly behind the first jobs added to the timeline, which typically is now the first »Tick«. ð¡ this leads to a recommendation, to arrange the first job-planning chunk in such a way that the first actual work jobs appear in the area between 5ms and 10ms after triggering the Scheduler start-up.Scheduler¡	2023-12-07 22:12:41 +01:00
Ichthyostega	fa86228057	Scheduler: rework load-regulation The first complete integration test with Chain-Load highlighted some difficulties with the overall load regulation: - it works well in the standard case (but is possibly to eager to scale up) - the scale-up sometimes needs several cycles to get "off the ground" - when the first job is dispatched immediately instead of going through the queue, the scheduler fails to boot up	2023-12-07 03:55:20 +01:00
Ichthyostega	21fbe09ee0	Chain-Load: fix planning and wait logic two rather obvious bugfixes (well, after watching the Scheduler in action...) - the first planning-chunk needs an offset - the future to block on must be setup before any dispatch happens	2023-12-07 02:39:40 +01:00
Ichthyostega	72f11549e6	Chain-Load: Scheduler instrumentation for observation - prime diagnostics with the first time invocation - print timings relative to this first invocation - DUMP output to watch the crucial scheduling operations	2023-12-06 23:54:33 +01:00
Ichthyostega	e761447a25	Chain-Load: setup simple integration test - use a chain-load with 64 steps - use a simple topology - trigger test run with default stepping TODO: Test hangs -> Timeout	2023-12-06 07:24:30 +01:00
Ichthyostega	481e35a597	Chain-Load: implement translation into Scheduler invocations ... so this (finally) is the missing cornerstone ... traverse the calculation graph and generate render jobs ... provide a chunk-wise pre-planning of the next batch ... use a future to block the (test) thread until completed	2023-12-06 01:54:35 +01:00
Ichthyostega	5e9b115283	Chain-Load: verify correct operation of planning logic - test setup without actual scheduler - wire the callbacks such to verify + all nodes are touched + levels are processed to completion + the planning chunk stops at the expected level + all node dependencies are properly reported through the callbacks	2023-12-05 01:31:54 +01:00
Ichthyostega	29ca3a485f	Chain-Load: implement planning JobFunctor - decided to abstract the scheduler invocations as λ - so this functor contains the bare loop logic Investigation regarding hash-framework: It turns out that boost::hash uses a different hash_combine, than what we have extracted/duplicated in lib/hash-value.hpp (either this was a mistake, or boost::hash did use this weaker function at that time and supplied a dedicated 64bit implementation later) Anyway, should use boost::hash for the time being maybe also fix the duplicated impl in lib/hash-value.hpp	2023-12-04 16:29:57 +01:00
Ichthyostega	2e6712e816	Chain-Load: implement invocation through JobFunctor - use a ''special encoding'' to marshal the specific coordinates for this test setup - use a fixed Frame-Grid to represent the ''time level'' - invoke hash calculation through a specialised JobFunctor subclass	2023-12-04 03:57:04 +01:00
Ichthyostega	7d5242f604	Chain-Load: remove excess template argument The number of nodes was just defined as template argument to get a cheap implementation through std::array... But actually this number of nodes is ''not a characteristics of the type;'' we'd end up with a distinct JobFunctor type for each different test size, which is plain nonsensical. Usage analysis reveals, now that the implementation is ''basically complete,'' that all of the topology generation and statistic calculation code does not integrate deeply with the node storage, but rather just iterates over all nodes and uses the ''first'' and ''last'' node. This can actually be achieved very easy with a heap-allocated plain array, relying on the magic of lib::IterExplorer for all iteration and transformation.	2023-12-04 04:16:16 +01:00
Ichthyostega	e0766f2262	Chain-Load: draft usage for Scheduler testing - use a dedicated context "dropped off" the TestChainLoad instance - encode the node-idx into the InvocationInstanceID - build an invocation- and a planning-job-functor - let planning progress over an lib::UninitialisedStorage array - plant the ActivityTerm instances into that array as Scheduling progresses	2023-12-04 00:34:06 +01:00
Ichthyostega	6707962bca	Chain-Load: work out a set of comprehensible example patterns Since Chain-Load shall be used for performance testing of the scheduler, we need a catalogue of realistic load patterns. This extended effort started with some parameter configurations and developed various graph shapes with different degree of connectivity and concurrency, ranging from a stable sequence of very short chains to large and excessively interconnected dependency networks.	2023-12-01 23:43:00 +01:00
Ichthyostega	bb69cf02e3	Chain-Load: demonstrate pruning and separated graph segments Through introduction of a ''pruning rule'', it is possible to create exit nodes in the middle of the graph. With increased intensity of pruning, it is possible to ''choke off'' the generation and terminate the graph; in such a case a new seed node is injected automatically. By combination with seed rules, an equilibrium of graph start and graph termination can be achieved. Following this path, it should be possible to produce a pattern, which is random but overall stable and well suited to simulate a realistic processing load. However, finding proper parameters turns out quite hard in practice, since the behaviour is essentially contingent and most combinations either lead to uninteresting trivial small graph chunks, or to large, interconnected and exponentially expanding networks	2023-12-01 04:50:11 +01:00
Ichthyostega	38f27f967f	Chain-Load: demonstrate seeding new chains ... seeding happens at random points in the middle of the chain ... when combined with reduction, the resulting processing pattern resembles the real processing pattern of media calcualtions	2023-11-30 21:06:10 +01:00
Ichthyostega	229541859d	Chain-Load: demonstrate use of reduction rule ... special rule to generate a fixed expansion on each seed ... consecutive reductions join everything back into one chain ... can counterbalance expansions and reductions	2023-11-30 03:20:23 +01:00
Ichthyostega	aafd277ebe	Chain-Load: rework the pattern for dynamic rules ...as it turns out, the solution embraced first was the cleanest way to handle dynamic configuration of parameters; just it did not work at that time, due to the reference binding problem in the Lambdas. Meanwhile, the latter has been resolved by relying on the LazyInit mechanism. Thus it is now possible to abandon the manipulation by side effect and rather require the dynamic rule to return a ''pristine instance''. With these adjustments, it is now possible to install a rule which expands only for some kinds of nodes; this is used here to crate a starting point for a reduction rule to kick in.	2023-11-30 02:13:39 +01:00
Ichthyostega	3d5fdce1c7	Chain-Load: demonstrate use of the expansion rule ...played a lot with the parameters ...behaviour and DOT graphs look plausible ...document three typical examples with statistics	2023-11-29 02:58:55 +01:00
Ichthyostega	dd6929ccc5	Chain-Load: validate and improve statistics - present the weight centres relative to overall level count - detect sub-graphs and add statistics per subgraph - include an evaluation for ''all nodes'' - include number of levels and subgraphs	2023-11-28 22:46:59 +01:00
Ichthyostega	852a86bbda	Chain-Load: generate statistics report ...test and fix the statistics computation...	2023-11-28 16:25:22 +01:00
Ichthyostega	c3bef6d344	Chain-Load: implement graph statistic computation - iterate over all nodes and classify them - group per level - book in per level statistics into the Indicator records - close global averages ...just coded, not yet tested...	2023-11-28 03:03:55 +01:00
Ichthyostega	d968da989e	Chain-Load: define data structure for graph statistics The graph will be used to generate a computational load for testing the Scheduler; thus we need to compute some statistical indicators to characterise this load. As starting point sum counts and averages will be aggregated, accounting for particular characterisation of nodes per level.	2023-11-28 02:18:38 +01:00
Ichthyostega	a780d696e5	Chain-Load: verify connectivity and recalculation It seams indicated to verify the generated connectivity and the hash calculation and recalculation explicitly at least for one example topology; choosing a topology comprised of several sub-graphs, to also verify the propagation of seed values to further start-nodes. In order to avoid addressing nodes directly by index number, those sub-graphs can be processed by ''grouping of nodes''; all parts are congruent because topology is determined by the node hashes and thus a regular pattern can be exploited. To allow for easy processing of groups, I have developed a simplistic grouping device within the IterExplorer framework.	2023-11-27 21:58:37 +01:00
Ichthyostega	619a5173b0	Chain-Load: handle node seed and recalculation - with the new pruning option, start-Nodes can now be anywhere - introduce predicates to detect start-Nodes and exit-Nodes - ensure each new seed node gets the global seed on graph construction - provide functionality to re-propagate a seed and clear hashes - provide functionality to recalculate the hashes over the graph	2023-11-26 22:28:12 +01:00
Ichthyostega	1ff9225086	Chain-Load: ability to prune chains ...using an additional pruneRule... ...allows to generate a wood instead of a single graph ...without shuffling, all part-graphs will be identical	2023-11-26 20:57:13 +01:00
Ichthyostega	5af2279271	Chain-Load: ability to inject further shuffling up to now, random values were completely determined by the Node's hash, leading to completely symmetrical topology. This is fine, but sometimes additional randomness is desirable, while still keeping everything deterministic; the obvious solution is to make the results optionally dependent on the invocation order, which is simply to achieve with an additional state field. After some tinkering, I decided to use the most simplistic solution, which is just a multiplication with the state.	2023-11-26 19:46:48 +01:00
Ichthyostega	ecbe5e5855	Chain-Load: generate new start node automatically this is only a minor rearrangement in the Algorithm, but allows to re-boot computation should node connectivity go to zero. With current capabilities, this could not happen, but I'm considering to add a »pruning« parameter to create the possibility to generate multiple shorter chains instead of one complete chain -- which more closely emulates reality for Scheduler load patterns.	2023-11-26 18:25:10 +01:00
Ichthyostega	dbe71029b7	Chain-Load: now able to define RandomDraw rules ...all existing tests reproduced ...yet notation is hopefully more readable Old: graph.expansionRule([](size_t h,double){ return Cap{8, h%16, 63}; }) New: graph.expansionRule(graph.rule().probability(0.5).maxVal(4))	2023-11-26 03:04:59 +01:00
Ichthyostega	f1c156b4cd	Chain-Load: lazy init of functional configuration now complete ...so this was yet another digression, caused by the desire somehow to salvage this problematic component design. Using a DSL token fluently, while internally maintaining a complex and totally open function based configuration is a bit of a stretch.	2023-11-25 23:47:20 +01:00
Ichthyostega	659441fa88	Chain-Load: verify (and bugfix)	2023-12-03 04:59:18 +01:00
Ichthyostega	04ca79fd65	Chain-Load: verify re-initialisation and copy ...this is a more realistic demo example, which mimics some of the patterns present in RandomDraw. The test also uses lambdas linking to the actual storage location, so that the invocation would crash on a copy; LazyInit was invented to safeguard against this, while still allowing leeway during the initialisation phase in a DSL.	2023-12-03 04:59:18 +01:00
Ichthyostega	e95f729ad0	Chain-Load: verify simple usage of LazyInit ...turns out I'd used the wrong Opaque buffer component; ...but other than that, the freaky mechanism seems to work	2023-12-03 04:59:18 +01:00
Ichthyostega	c658512d7b	Chain-Load: verify building blocks of lazy-init	2023-12-03 04:59:18 +01:00
Ichthyostega	8de3fe21bb	Chain-Load: detect small-object optimisation - Helper function to find out of two objects are located "close to each other" -- which can be used as heuristics to distinguish heap vs. stack storage - further investigation shows that libstdc++ applies the small-object optimisation for functor up to »two slots« in size -- but only if the copy-ctor is trivial. Thus a lambda capturing a shared_ptr by value will always be maintained in heap storage (and LazyInit must be redesigned accordingly)... - the verify_inlineStorage() unit test will now trigger if some implementation does not apply small-object optimisation under these minimal assumptions	2023-12-03 04:59:18 +01:00
Ichthyostega	98078b9bb6	Chain-Load: investigate std::function inline-storage ...which is crucial for the solution pursued at the moment; std::function is known to apply a small-object optimisation, yet unfortunately there are no guarantees by the C++ standard (it is only mandated that std::function handles a bare function pointer without overhead) Other people have investigated that behaviour already, indicating that at least one additional »slot« of data can be handled with embedded storage in all known implementations (while libstdc++ seemingly imposes the strongest limitations) https://stackoverflow.com/a/77202545/444796 This experiment in the unit-test shows that for my setup (libstdc++ and GCC-8) only a lambda capturing a single pointer is handled entirely embedded into the std::function; already a lambda capturing a shared-ptr leads to overflow into heap	2023-12-03 04:59:18 +01:00
Ichthyostega	3c713a4739	Chain-Load: invent the heart of the trap-mechanism ...the intention is to plant a »trojan lambda« into the target functor, to set off initialisation (and possibly relocation) on demand.	2023-12-03 04:59:18 +01:00
Ichthyostega	1892d1beb5	Chain-Load: safety problems with rule initialisation the RandomDraw rules developed last days are meant to be used with user-provided λ-adapters; employing these in a context of a DSL runs danger of producing dangling references. Attempting to resolve this fundamental problem through late-initialisation, and then locking the component into a fixed memory location prior to actual usage. Driven by the goal of a self-contained component, some advanced trickery is required -- which again indicates better to write a library component with adequate test coverage.	2023-12-03 04:59:18 +01:00
Ichthyostega	5033674b00	Chain-Load: define bindings to use the new RandomDraw component RandomDraw as a library component was extracted and (grossly) augmented to cut down the complexity exposed to the user of TestChainLoad. To control the generated topology, random-selected parameters must be configured, defining a probability profile; while this can be achieved with simple math, getting it correct turned out surprisingly difficult.	2023-12-03 04:59:18 +01:00
Ichthyostega	8b1326129a	Library: RandomDraw - implementation complete and tested.	2023-12-03 04:59:17 +01:00

1 2 3 4 5 ...

2724 commits