I know of the existence of aunit, ahven, gnattest, but my experience is anecdotal and outdated. Before doing my own research, maybe some of you already have the answer ready.
I’m looking for something that generates the boilerplate (harness project? and test stubs) and is able to find new subprograms in subsequent generation runs without botching existing tests.
Ada for the tooling is not mandatory, as long as the generated test stubs are in Ada. Not sure if such a beast exists though…
Trendy Test doesn’t use any generated code, but instead hijacks the exception system to handle registration and running, relying on GNAT unused procedure warnings to ensure that all tests get added to the test pools.
The most likely way to get auto-registration working as-is would be to write a test framework that follows what Criterion does: it uses macros to write test functions into a special, separate linker section and then reads them back out of the ELF or COFF at test startup, which effectively lets it determine the full list of tests to run dynamically. This relies on having appropriate binary analysis for either of these file formats at startup.
Thinking about this some more, you wouldn't even have to parse the ELF/COFF. If you made a custom pragma for unit test registration (similar to #[test] in Rust; IIRC C# and F# have something similar), the implementation should ignore it, so you could write a script to extract all of these and dump them into an entry procedure (with some wrapper for catching exceptions and reporting); a sketch of such a generated entry procedure follows the snippet below. I had thought of pitching an RFC a while ago for a Unit_Test attribute, but thought it might be too specialized.
pragma Unit_Test (Test_Function);
-- Maybe it should be a function instead, idk.
procedure Test_Function is begin ... end;
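For illustration, here is a rough sketch of what a script-generated entry procedure could look like. Everything in it is hypothetical (the My_Tests package, the custom pragma, and Run_All_Tests are made up for the example); the point is just the one-call-per-pragma registration and the exception-catching wrapper.

--  Hypothetical package holding the pragma-marked tests (placeholder body).
package My_Tests is
   procedure Test_Function;
   pragma Unit_Test (Test_Function);  --  custom pragma: ignored by the compiler, found by the script
end My_Tests;

package body My_Tests is
   procedure Test_Function is
   begin
      null;  --  real assertions would go here
   end Test_Function;
end My_Tests;

--  Generated harness: one Run call per pragma Unit_Test found by the script.
with Ada.Text_IO;
with Ada.Exceptions;
with My_Tests;

procedure Run_All_Tests is
   Failures : Natural := 0;

   procedure Run (Name : String; Test : access procedure) is
   begin
      Test.all;
      Ada.Text_IO.Put_Line ("PASS " & Name);
   exception
      when E : others =>
         Failures := Failures + 1;
         Ada.Text_IO.Put_Line
           ("FAIL " & Name & ": " & Ada.Exceptions.Exception_Message (E));
   end Run;
begin
   Run ("My_Tests.Test_Function", My_Tests.Test_Function'Access);
   Ada.Text_IO.Put_Line (Natural'Image (Failures) & " failure(s)");
end Run_All_Tests;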
Frankly, before I reach the point of registering custom tests, I’d like to have automatically registered tests for every visible subprogram that XFAIL until I fill them.
Just curious: in what domains are people actually using extensive unit tests (as opposed to functional or integration tests that exercise whole packages or projects)? I have started a few times with the intention of having unit tests for a library or something, but it generally becomes quite clear that this is a major task that's not worth the trouble in general. It also somewhat deters refactoring the code and splitting it into more subprograms.
For me it depends on what is being developed. I work on a lot of custom message protocols, so when we have a brand new one, I do heavy unit testing on them to ensure there are no bugs in the implementation (and to some extent catch any conflicts in the design). If it’s a more vetted system, I’ll rely only on functional testing.
On my hobby projects: The NES emulator that I am putzing around with has a ton of unit tests for the CPU. I wanted to make darn sure my instruction execution was as sound as possible as I figured it was the hardest place to debug later on once I integrated the other parts. Now that I am past that part I’ll probably rely more on functional testing.
I find a good balance works well for me. I don't know of any good tools for Mosteo's use case though. I've always rolled my own, which is time consuming.
EDIT: I guess if someone was really feeling froggy they could make an application based on libadalang that just parsed files, looked for library-level function/procedure declarations, and generated the stubs. Might be a fun project for someone.
I don’t know about libadalang, but I could probably do it in Raffle. I was planning on doing this to be able to automatically make Lua bindings for packages, and also to make better searchable docs based on spec files, like “I have a Foo, what subprograms return a Foo and which take a Foo as a parameter?”
Oh, I do unit tests differently where I focus tests on cases, not necessarily specific subprograms. I see what you mean now.
My experience has also been that full end-to-end tests are much more powerful and long-lived. BBT has been good for this. Unit tests are great for “did I write this thing correctly?”, but I also very liberally use Pre/Post and constraints and run with all checks on.
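To make that concrete, here is a minimal sketch of that style (a hypothetical Stacks package, nothing from an actual project): building the test executable with assertions enabled (e.g. GNAT's -gnata) turns every call the tests make into a contract check as well.

package Stacks is
   type Stack is private;
   function Size (S : Stack) return Natural;
   procedure Push (S : in out Stack; Item : Integer)
     with Pre  => Size (S) < 100,
          Post => Size (S) = Size (S'Old) + 1;
   procedure Pop (S : in out Stack; Item : out Integer)
     with Pre  => Size (S) > 0,
          Post => Size (S) = Size (S'Old) - 1;
private
   type Int_Array is array (1 .. 100) of Integer;
   type Stack is record
      Data : Int_Array := (others => 0);
      Top  : Natural   := 0;
   end record;
end Stacks;

package body Stacks is
   function Size (S : Stack) return Natural is (S.Top);

   procedure Push (S : in out Stack; Item : Integer) is
   begin
      S.Top := S.Top + 1;
      S.Data (S.Top) := Item;
   end Push;

   procedure Pop (S : in out Stack; Item : out Integer) is
   begin
      Item := S.Data (S.Top);
      S.Top := S.Top - 1;
   end Pop;
end Stacks;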
In my case it's small, focused libraries with relatively few public subprograms. I was thinking that it would be better to start with these tests rather than with an ad-hoc, haphazard collection of tests (which is what I usually end up doing).
But it may well turn out that what I want to try is not really practical.
My three small open source apps (acc, smk and bbt) are CLI oriented, and easy to end-to-end test.
I have very, very rare needs for unit testing. The last example is a function to compute a relative shortest path from, let's say, /home/lionel/bin/x to /home/lionel/y/z: taking into account the Windows drive, Windows vs Unix separators, etc.
This easily leads to many test cases.
So there is one unit test, not even for a package, just for this procedure.
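Roughly, that kind of case-driven test has the shape sketched below; the Relative_Path function here is only a placeholder for the real subprogram, and the expected strings are illustrative, not authoritative.

with Ada.Text_IO;

procedure Test_Relative_Path is

   --  Placeholder for the real subprogram under test.
   function Relative_Path (From, To : String) return String is
     (if From = To then "." else To);

   Failures : Natural := 0;

   procedure Check (From, To, Expected : String) is
      Got : constant String := Relative_Path (From, To);
   begin
      if Got /= Expected then
         Failures := Failures + 1;
         Ada.Text_IO.Put_Line
           ("FAIL: " & From & " -> " & To & " gave " & Got
            & ", expected " & Expected);
      end if;
   end Check;

begin
   Check ("/home/lionel/bin", "/home/lionel/y/z", "../y/z");
   Check ("/home/lionel",     "/home/lionel",     ".");
   Check ("c:\users\bin",     "c:\users\doc",     "..\doc");
   --  ... plus mixed separators, different drives, etc.
   Ada.Text_IO.Put_Line (Natural'Image (Failures) & " failure(s)");
end Test_Relative_Path;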
I noticed that in CLI apps there are often commands or options that do not execute the full process the app is intended to run, and activate only a small part of the code. For example dry-run options, or a command to list the input files recursively found by the app.
Those commands or options sometimes offer a way to have a stable end-to-end test on a limited scope, which would otherwise have been an integration or a unit test.
Going further, I don't exclude cheating if the need arises, and having a secret option exposing some stable internal data only for test purposes.
Using that internal data in (pseudo) end-to-end tests, which depend on the command-line interface and not on the package/procedure profiles, seems to me less prone to change when the code evolves, and does not deter refactoring.
All in all, I have 98% end-to-end tests, 2% unit testing, no integration tests.
My small CLI utilities seem to me an easy case for testing; things would probably not be so simple for a larger app, a server, an app with a GUI, an app with high coverage requirements, etc.
We have an asserts package (a generic of course), based on GNATCOLL.Asserts. Much nicer to use than a straight pragma Assert, because in case of errors you see both sides of the “=” (or any other operator). Also has support for retrying (for those times when you are dealing with asynchronous things), comparing arrays,… I am sure a lot of people have similar packages, or maybe use directly GNATCOLL.Asserts… @Fabien.C maybe Cesar could also gather ideas and later improve that package to make writing tests even nicer.
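The idea can be sketched roughly as follows; this is not GNATCOLL.Asserts' actual API, just a homemade illustration of why reporting both operands beats a bare pragma Assert.

generic
   type Item is private;
   with function Image (V : Item) return String;
package Equality_Checks is
   procedure Assert_Equal (Left, Right : Item; Message : String := "");
end Equality_Checks;

with Ada.Text_IO;
package body Equality_Checks is
   procedure Assert_Equal (Left, Right : Item; Message : String := "") is
   begin
      if Left /= Right then
         --  Showing both operands is the whole point versus pragma Assert.
         Ada.Text_IO.Put_Line
           ("FAILED " & Message & ": "
            & Image (Left) & " /= " & Image (Right));
      end if;
   end Assert_Equal;
end Equality_Checks;

--  Example instantiation:
--     package Int_Checks is new Equality_Checks (Integer, Integer'Image);
--     Int_Checks.Assert_Equal (Result, 42, "the answer");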
I just use AUnit and just write my own. Yes, AUnit needs a lot of boilerplate, but it's mostly copy-paste and I wrote my own extensions to make it more bearable.
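For the curious, the per-package boilerplate typically looks like the skeleton below (written from memory of AUnit's documented pattern, so treat the details as approximate; the Math_Tests package and its assertion are made up). On top of this you still need a suite function and a harness main, which is where most of the copy-paste lives.

--  Spec: one tagged test-case type per package under test.
with AUnit;
with AUnit.Test_Cases;

package Math_Tests is
   type Test is new AUnit.Test_Cases.Test_Case with null record;
   overriding procedure Register_Tests (T : in out Test);
   overriding function Name (T : Test) return AUnit.Message_String;
end Math_Tests;

--  Body: the actual test routines plus their registration.
with AUnit.Assertions; use AUnit.Assertions;

package body Math_Tests is

   procedure Test_Addition (T : in out AUnit.Test_Cases.Test_Case'Class) is
      pragma Unreferenced (T);
   begin
      Assert (2 + 2 = 4, "addition is broken");
   end Test_Addition;

   overriding procedure Register_Tests (T : in out Test) is
      use AUnit.Test_Cases.Registration;
   begin
      Register_Routine (T, Test_Addition'Access, "Addition");
   end Register_Tests;

   overriding function Name (T : Test) return AUnit.Message_String is
   begin
      return AUnit.Format ("Math_Tests");
   end Name;

end Math_Tests;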
That would indeed be nice. There is a reason all other unit test frameworks I know of do it that way.
It does. It's called test-driven development, where you write the test first and then the method. But more importantly: it's not either/or. You can do both.
Which is what I suggest for Android, with the unit tests designed to run without the actual device, since testing on-device is significantly slower.
I too have the contracts active when running unit tests and it did indeed find problems I would not have found otherwise.
I hope that won’t break my AUnit tests I already have in my Alire crates and already run by the Alire build server.
I posted a first draft of what I had in mind for a built-in testing feature, on the Alire github repository. This first draft is intended to collect ideas and feedback, in order to build something that would be a good fit for actual crates in the wild so feel free to comment and add ideas!
Following on my RFC, I implemented a demo of what I had in mind for a default test runner in alire.
The point is still to have a minimal runner in alire, with opinionated-ish defaults (but easy to replace with a different runner whenever those choices don’t align with a project’s specific needs), and I’m still very much looking for feedback
In this draft PR there’s just the runner, with no manifest integration yet. That will come soon, I hope