LUMIERA.clone

Author	SHA1	Message	Date
Ichthyostega	2a60f77bdf	Library: improve formulation of the parsing regexp - allow additional leading and trailing whitespace within token - more precise on the sequence of keywords - clearer build-up of the regexp syntax	2024-03-23 02:55:28 +01:00
Ichthyostega	10bda3a400	Library: develop a token-parsing regular expression oh my!	2024-03-23 02:55:28 +01:00
Ichthyostega	9790feb822	Library: remould MatchSeq into a _Lumiera Forward Iterator_ MatchSeq was imported recently from the Yoshimi-testsuite, as supporting helper for the CSV table component. Actually this is just a thin wrapper on top of std::regex_iterator, which in turn has properties and behaviour very similar to Lumiera's »Forward Iterator« concept (in fact, it was a source of inspiration to generalise such a pattern). So this is an obvious round out and cleanup, as it requires just some minor additions and adjustments to allow processing a sequence of matches through a for-loop or some elaborate pipelining setup.	2024-03-23 02:55:28 +01:00
Ichthyostega	a2749adbc9	Library: also cover the smart-ptr usage variant The way I've written this helper template, as a byproduct it is also possible to maintain the back-refrence to the container through a smart-ptr. In this case, the iterator-handle also manages the ownership automatically.	2024-03-21 19:57:34 +01:00
Ichthyostega	f716fb0bee	Library: build a helper to encapsulate container access by index ...mostly we want the usual convenient handling pattern for iterators, but with the proviso actually to perform an access by subscript, and the ability to re-set to another current index	2024-03-21 19:57:34 +01:00
Ichthyostega	76bd9ba6ce	Library: establish evaluation and iteration on application	2024-03-21 19:57:34 +01:00
Ichthyostega	7893cc99c3	Library: further work out the framework for TextTemplate instantiation	2024-03-21 19:57:34 +01:00
Ichthyostega	d2dcf6c163	Library: draft the interpretation of a compiled TextTemplate will use an iteration-pipeline, based on the »State Core« concept	2024-03-21 19:57:34 +01:00
Ichthyostega	5881b014fe	Library: work out a treatment for text template substitution (see: #1359 ) * establish the feature set to provide * choose scheme for runtime representation * break down analysis to individual parsing and execution steps * conclude which actions to conduct and the necessary data * derive the abstract binding API required	2024-03-21 19:57:34 +01:00
Ichthyostega	af1f549190	Library: Assessment and plan for a text templating engine Conducted an extended investigation regarding text templating and the library solutions available and still maintained today. The conclusion is * there are some mature and widely used solutions available for C++ * all of these are considered a mismatch for the task at hand, which is to generate Gnuplot scripts for test data visualisation Points of contention * all solutions offer a massive feature set, oriented towards web content generation * all solutions provide their own structured data type or custom property-tree framework Decision 🠲 better to write a minimalistic templating engine from scratch rather	2024-03-21 19:57:34 +01:00
Ichthyostega	d3fda114f8	Library: Research -- Gnuplot Read the documentation and find out how to generate the kind of diagram necessary for visualisation of Scheduler-Stress-Test observations. I used to have basic Gnuplot knowledge, and thus had to find out about - reading CSV - supported diagram types - layering and styling Conclusion: will use Gnuplot and generate a script from Test code	2024-03-21 19:57:34 +01:00
Ichthyostega	a90b9e5f16	Library: uniform definition scheme for error-IDs In the Lumiera code base, we use C-String constants as unique error-IDs. Basically this allows to create new unique error IDs anywhere in the code. However, definition of such IDs in arbitrary namespaces tends to create slight confusion and ambiguities, while maintaining the proper use statements requires some manual work. Thus I introduce a new standard scheme * Error-IDs for widespread use shall be defined _exclusively_ into `namespace lumiera::error` * The shorthand-Macro `LERR_()` can now be used to simplify inclusion and referral * (for local or single-usage errors, a local or even hidden definition is OK)	2024-03-21 19:57:34 +01:00
Ichthyostega	59390cd2f8	Library: reorder some pervasively used includes reduce footprint of lib/util.hpp (Note: it is not possible to forward-declare std::string here) define the shorthand "cStr()" in lib/symbol.hpp reorder relevant includes to ensure std::hash is "hijacked" first	2024-03-21 19:57:34 +01:00
Ichthyostega	aa93bf9285	Library: cover statistic functions and linear regression	2024-03-16 03:05:49 +01:00
Ichthyostega	1f5518e2c8	Library: introduce functions for floating-point comparisons - rough numeric comparisons (optionally given precision) - epsilon comparison based on numeric_limits	2024-03-16 03:03:29 +01:00
Ichthyostega	95198c5f2a	Library: need to exclude C++ stream sources from string conversion In the Lumiera code base, a convenient string conversion is used an many places, and is also ''magically'' integrated into the usual C++ style output with `<<` operators. However, there is a ''gotcha'' — in the ''rare cases'' when we actually want to use the C++ input/output framework to copy stream data from an input source into an output sink, obviously we do not want the input source to be »string converted«....	2024-03-15 02:47:46 +01:00
Ichthyostega	599407deea	Library: complete coverage of CSV data table including storage also encompasses some coverage for the simplistic CSV format implemented as storage backend for this data table	2024-03-15 02:45:45 +01:00
Ichthyostega	3b3600379a	Library: introduce formating variants for decimal10 showDecimal -> decimal10 (maximal precision to survive round-trip through decimal representation= showComplete -> max_decimal10 (enough decimal places to capture each possible distinct floating-point value) Use these new functions to rewrite the format4csv() helper	2024-03-14 17:32:22 +01:00
Ichthyostega	01a098db99	Library: cover row handling of data table ...this uncovered one inconsistency: when directly adding values into one of the embedded data vectors, the inconsistent size was allowed to persist even when adding / removing lines. This is in contradiction to the behavior for the CSV dump, which uses index positions from the front of all vectors uniformely. Thus changed the behaviour of adding a new row, so that it now caps all vectors to a common size also added function to clear the table	2024-03-14 17:29:16 +01:00
Ichthyostega	4a8364e422	Library: extend the DataFile to allow using it without storage ...seems obvious and does not compromise the simplistic design... ...we do check the file path anyway, just need to add saveAs()...	2024-03-13 18:57:48 +01:00
Ichthyostega	1c2cbd4d47	Library: coverage for some filesystem convenience helpers ...created as a byproduct of the TempDir feature, which in turn is required to do ''any'' meaningful unit-test of filesystem related functionality.	2024-03-13 03:52:35 +01:00
Ichthyostega	a6aad5261c	Library: complete and verify temp-dir helper verify also that clean-up happens in case of exceptions thrown; as an aside, add Macro to check for ''any'' exception and match on something in the message (as opposed to just a Lumiera Exception)	2024-03-13 02:50:13 +01:00
Ichthyostega	18b1d37a3d	Library: also create unique temporary files ...using the same method for sake of uniformity Also move the permissions helpers to the file.hpp support functions and setup a separate unit test for these	2024-03-12 22:54:05 +01:00
Ichthyostega	a1832b1cb9	Library: base implementation of temp-dir creation Inspired by https://stackoverflow.com/a/58454949 Verified behaviour of fs::create_directory --> it returns true only if it ''indeed could create'' a new directory --> it returns false if the directory exists already --> it throws when some other obstacle shows up As an aside: the Header include/limits.h could be cleaned up, and it is used solely from C++ code, thus could be typed, namespaced etc.	2024-03-12 20:14:29 +01:00
Ichthyostega	b426ea4921	Library: simple default implementation for random sequences Since this is a much more complicated topic, for now I decided to establish two instances through global variables: * a sequence seeded with a fixed starting value * another sequence seeded from a true entropy source What we actually need however is some kind of execution framework to define points of random-seeding and to capture seed values for reproducible tests.	2024-03-12 02:34:19 +01:00
Ichthyostega	7a3e4098c8	Library: some first thoughts regarding random number generation Relying on random numbers for verification and measurements is known to be problematic. At some point we are bound to control the seed values -- and in the actual application usage we want to record sequence seeding in the event log. Some initial thoughts regarding this intricate topic. * a low-ceremony drop-in replacement for rand() is required * we want the ability to pick-up and control each and every usage eventually * however, some usages explicitly require true randomness * the ability to use separate streams of random-number generation is desirable	2024-03-12 00:48:11 +01:00
Ichthyostega	6e8c07ccd6	Library: draft tests to document the new features Yesterday I decided to include some facilities I have written in 2022 for the Yoshimi-Testsuite. The intention is to use these as-is, and just to adapt them stylistically to the Lumiera code base. However — at least some basic documentation in the form of very basic unit-tests can be considered »acceptance criteria«	2024-03-11 17:44:19 +01:00
Ichthyostega	0e88dec28a	Library: integrate into the Lumiera code base - reformat in Lumieara-GNU style - use the Lumiera exceptions - use Lumiera format-string frontend - use lib/util NOTE: I am the original author of the code introduced here, and thus I can re-license it under GPL 2+	2024-03-11 17:38:30 +01:00
Ichthyostega	8c344b6a51	Library: bring in statistics helper from Yoshimi-test [http://yoshimi.sourceforge.net/ Yoshimi] is a software sound synthesizer, derived from `ZynAddSubFx` and developed by an OpenSource community. The Repository [https://github.com/Ichthyostega/yoshimi-test/ Yoshimi-test] is used by the Yoshimi developers to maintain a suite of automated acceptance tests for the Yoshimi application. This task involves watching execution times to detect long-term performance trends, which in turn requires to maintain time-series data in CSV files and to perfrom some simple statistic calculations, including linear regression. Requiring any external statistics package as dependency was not deemed adequate for such a simple task, and thus a set of self-contained helper functions was created as a byproduct. This task attaches an excerpt of the Yoshimi-test history with those helpers.	2024-03-10 23:20:58 +01:00
Ichthyostega	fda24faa73	add a rounding helper ...as part of: Breaking Change: adapt runtime.csv storage to improve readability	2024-03-10 23:36:38 +01:00
Ichthyostega	ef91088fb7	Documentation clean-up and fixes more clean-up and polishing after some further test regarding the topic of timing measurements Improved handling: filter test cases to be performed	2024-03-10 23:36:38 +01:00
Ichthyostega	48d74f261f	heuristics to establish a tolerance band for watching global trends After the individual tests, we calculate the averaged delta over the whole test suite, to detect changes to the overall timings. As it turned out, using the error propagation for the calculation of the averaged delta yields the right tolerance band to ignore random fluctuations but trigger alarm on real changes. Moreover, add several further timing test cases to verify the calibration via "platform model" works as intended	2024-03-10 23:36:38 +01:00
Ichthyostega	f2adfc6406	calculate statistics and trend for the complete testsuite	2024-03-10 23:36:38 +01:00
Ichthyostega	e9eae205f0	use regression to monitor short term and long term trends Since the platform calibration inevitably incurs some additional error band, a linear regresssion over the time series of measurements can additionally be used to spot ongoing systematic changes below this general error band, while leveling out local statistical fluctuations.	2024-03-10 23:36:38 +01:00
Ichthyostega	8f9e54b7c7	implement fitting the platform model by linear regression * triggered by --calibrate * normalise away any known expense factors, but use them as weight * calculate simple linear regression from statistic data	2024-03-10 23:36:38 +01:00
Ichthyostega	015a6ed6f2	add global storage and apply existing platform model ...this is largely just wiring of components built thus far ...TODO build platform model by linear regression	2024-03-10 23:36:38 +01:00
Ichthyostega	75767a3a97	capture and store individual timings as time series Note: work-in-progress... TODO: derive the expense factor and delta	2024-03-10 23:36:38 +01:00
Ichthyostega	7639ac4172	complete implementation of CSV backed data table ...mostly routine after solving the tricky design challenge - for usage, instantiate the template DataFile with a Storage record - object is created with filename, and immediately slurps in existing data - data storage is optimised for readability (not speed); newest value at top Note: some kind of testcase is "hidden" in this changeset only; next changeset will remove research-experiment.hpp	2024-03-10 23:36:38 +01:00
Ichthyostega	a42de3ee1b	integrate CSV handling into the new data table type	2024-03-10 23:36:38 +01:00
Ichthyostega	b6a2eec94c	parse a simplified variant of CSV rationale: the purpose is to read back our own values, yet it should be reasonably standard, to allow investigating and tweaking values with a spreadsheet - first line is a header line and used to verify the number of columns - one record per line, embedded line breaks prohibited - fields separated by comma, semicolon tolerated - fields are trimmed and may be empty - a field may be double quoted - only quoted fields may contain whitespace or comma - no escaping of quotes, i.e. no quotes within quotes	2024-03-10 23:36:38 +01:00
Ichthyostega	a523861428	completed foundations of the testrunner - build the parametrisation from several config sources - read and interpret the test specifications in a subdirectory structure - as a result build a complete test suite definition as a sequence of test steps - setup a stage, also based on the parametrisation - actually perform the test suite and capture results - generate a test report	2024-03-10 23:36:38 +01:00
Ichthyostega	a983a506b0	Scheduler-test: simplify graph generation yet more Initially the model was that of a single graph starting with one seed node and joining all chains into a single exit node. This however is not well suited to simulate realistic calculations, and thus the ability for injecting additional seeds and to randomly sever some chains was added -- which overthrows the assumption of a single exit node at the end, where the final hash can be retrieved. The topology generation used to pick up all open ends, in order to join them explicitly into a reserved last node; in the light of the above changes, this seems like an superfluous complexity, and adds a lot of redundant checks to the code, since the main body of the algorithm, in its current form, already does all the necessary bound checks. It suffices thus to just terminate the processing when the complete node space is visited and wired. Unfortunately this requires to fix basically all node hashes and a lot of the statistics values of the test; yet overall the generated graphs are much more logical; so this change is deemed worth the effort.	2024-03-10 02:47:32 +01:00
Ichthyostega	d8eb334b17	Scheduler-test: preconfigured graph with unconnected nodes Allow easily to generate a Chain-Load with all nodes unconnected, yet each node on a separate level. Fix a deficiency in the graph generation, which caused spurious connections to be added at the last node, since the prune rule was not checked	2024-03-09 18:06:08 +01:00
Ichthyostega	ab5900c82e	Scheduler-test: fix error with topology ...the previous setup produced a single linear chain instead of a set of unconnected nodes. With this, the behaviour is more like expected, but concurrency is still too low	2024-03-08 18:16:18 +01:00
Ichthyostega	605d747b8d	Scheduler-test: attempt to find a viable Scheduler setup for this measurement - better use a Test-Chain-Load without any dependencies - schedule all at once - employ instrumentation - use the inner »overall time« as dependent result variable The timing results now show an almost perfect linear dependency. Also the inner overall time seems to omit the setup and tear-down time. But other observed values (notably the avgConcurrency) do not line up	2024-03-08 01:30:12 +01:00
Ichthyostega	2556151304	Scheduler-test: simple implementation of range coverage - fill the range randomly with probe points - use the node count as independent parameter - measurement method works as intended - results indeed show a linear relationship Results are ''interesting'' however, since the (par,time) points seem to be arranged into two lines, implying that about half of the runs were somehow ''degraded'' and performed way slower.	2024-02-24 04:17:05 +01:00
Ichthyostega	a117e6e8c5	Scheduler-test: consider using a complementary measurement method With the latest improvements, the »breaking point search« works as expected and yields meaningful data; however — it seems to be well suited rather for specific setups, which involve an extended graph with massive dependencies, because only such a setup produces a clearly defined ''breaking point.'' Thus I'm considering to complement this research by another measurement setup to establish a linear regression model of the Scheduler expense. To allow integration of this different setup into the existing stress-test-rig, some rearrangements of the builder notation are necessary; especially we need to pass the type name of the actual tool, and it seems indicated to reorder the source code to provide the config base class `StressRig` at the top, followed by a long (and very technical) implementation namespace.	2024-02-23 17:29:50 +01:00
Ichthyostega	93729e5667	Scheduler-test: more precise accounting for expected concurrency It turns out to be not correct using all the divergence in concurrency as a form factor, since it is quite common that not all cores can be active at every level, given the structural constraints as dictated by the load graph. On the other hand, if the empirical work (non wait-time) concurrency systematically differs from the simple model used for establishing the schedule, then this should indeed be considered a form factor and deduced from the effective stress factor, since it is not a reserve available for speed-up The solution entertained here is to derive an effective compounded sum of weights from the calculation used to build the schedule. This compounded weight sum is typically lower than the plain sum of all node weights, which is precisely due to the theoretical amount of expense reduction assumed in the schedule generation. So this gives us a handle at the theoretically expected expense and through the plain weight sum, we may draw conclusion about the effective concurrency expected in this schedule. Taking only this part as base for the empirical deviations yields search results very close to stressFactor ~1 -- implying that the test setup now observes what was intended to observe...	2024-02-23 02:02:20 +01:00
Ichthyostega	2d1bd2b765	Scheduler-test: fix deficiencies in search control mechanism In binary search, in order to establish the invariant initially, a loop is necessary, since a single step might not be sufficient. Moreover, the ongoing adjustments jeopardise detection of the statistical breaking point condition, by causing a negative delta due to gradually approaching the point of convergence -- leading to an ongoing search in a region beyond the actual breaking point.	2024-02-19 17:38:04 +01:00
Ichthyostega	ff39aed7ea	Scheduler-test: fix feedback adjustments for breaking point search Various misconceptions identified in the feedback path of the test algorithm. - statistics are cumulative, which must be incorporated by norming on time base - average concurrency includes idle times, which is besides the point within this test setup, since additional wait-phases are injected when reducing stress	2024-02-19 17:38:04 +01:00

1 2 3 4 5 ...

6496 commits