Fuego wiki

KernelCI Notes

Here is some miscellaneous information about KernelCI.

discussion about kernelci test schema [edit section]

See the message thread at:

https://lists.linuxfoundation.org/pipermail/fuego/2017-April/000448.html

Milo Casagrande wrote:

[KernelCI test schema] doesn't store raw outputs or logs directly in the schema.

The test schema includes an "attachment" sub-schema that can be used to define where those outputs/files are stored. We have a separate system (storage.kernelci.org) that is used to "archive" artifacts from the build/boot and potentially from the test phases.

We don't rely on the build/boot/test system (Jenkins in this case) to handle that: we extract what we need and store it where we need it. You could even store it somewhere else and point the attachment to the correct URL, then it's up to a visualization implementation to handle that.

...

"parameters" is used to store something like the environment variables set and their values, command line options passed, etc.

Daniel wrote about Fuego's schema and intentions:

> For kernelci.org, builds and boots are a special kind of "test",
> that's why we have always been keeping them separate from everything
> else. Builds and boots are what we started building kernelci.org on.
> After the build and boot phase, a "test" can be reduced to whatever
> else can be run - and gives an output - on a board after it booted
> successfully.

Ok, so my understanding now is that there are multiple schemas (batch, boot, boot_regressions,
build, build_logs, build_logs_summary, compare, job, lab, report, send, token), some of them
containing two sub-schemas (GET and POST) but for non kernel build/boot tests we would only
need to care about the 3+3+3 schemas at https://api.kernelci.org/schema-test.html.
Is that correct?

How many individual JSON files would be needed to be generated/POST'ed for a multi test case test suite like LTP.
# For example, suppose  1 testsuite made of 6 test sets with 100 test cases each
# Note: in Fuego we only generate 1 JSON file.

If we make Fuego and KernelCI interact together, Fuego would mainly POST results but
the reporting tool would also GET them.

By the way, where can I find more information about the "non special" tests?
# I can only see kernel build/boot test tabs at https://kernelci.org.
I have prepared a virtual machine with KernelCI and I want to start making experiments
by POST'ing Fuego results from different tests (not just build/boot tests) to KernelCI.
Is that supposed to work out of the box?

> > Actually, the current JSON output goes as follows:
> >
> > testsuite (e.g.: Functional.LTP)
> > --board (e.g. Beaglebone black)
> > ----kernel version (e.g.: CIP kernel 4.4.55 ...)
> > ------spec (e.g.: default or quick)
> > --------build number (like KernelCI build id)
> > ----------groupname <-- we do have groups! (e.g.: 2048b_sector_size)
> > ------------test1 (e.g.: reads)
> > -------------- measurement
> > -------------- reference value (e.g. a threshold of Mb/s)
> > ------------test2 (e.g. writes)
> > ------------test3 (e.g.: re-writes)
> >
> > [Note] We also have the concept of testplans where you can group testsuites
> > and their specs for a specific board. This is quite useful.
> >
> > Apart from this information we also store timestamps, the test raw output,
> > fuego variables (this needs improvements but it will be useful for "replaying" tests),
> > and a developers log (including syslog errors, cat /proc/cpuinfo, ps output etc..).
>
> We don't store raw outputs or logs directly in the schema, if that's
> what you meant.

Yeah, we don't either. We just package them (something like LAVA's bundles) and
(proof-of-concept work) send them to a central server.

> The test schema includes an "attachment" sub-schema that can be used
> to define where those outputs/files are stored. We have a separate
> system (storage.kernelci.org) that is used to "archive" artifacts from
> the build/boot and potentially from the test phases.

It's good to have that decoupling and possibly (?) default to local host when
the user doesn't have a separate storage server.

> We don't rely on the build/boot/test system (Jenkins in this case) to
> handle that: we extract what we need and store it where we need it.
> You could even store it somewhere else and point the attachment to the
> correct URL, then it's up to a visualization implementation to handle
> that.
>
> > I am checking Kernel CI's output schema(s) from the link you sent:
> >
> > 1) parameters: seems to be the equivalent to our specs
>
> I'm not sure what the "spec" is for Fuego, but the "parameters" for us
> is used to store something like the environment variables set and
> their values, command line options passed...

Yes, exactly the same. But we are not storing the spec in the results yet, just
the name of the spec. We will have to send the spec as well somehow
when we want to share the results with a centralized server.

> > 2) minimum, maximum, number of samples, samples_sum, samples_swr_sum: we don't store
> >      information that can be inferred from the data at the moment, just calculate it when making a report.
>
> I don't remember when we introduced those (keep in mind that they are
> not required fields), but the idea was to store some statistical
> analysis directly into the tests.
> I think the "samples_sqr_sum" description is a little bit off though.
>
> > 5) kvm_guest: this would be just another board name in Fuego, so we don't include such specific parameter.
>
> It's not required field, but needed for us since we might run tests on
> KVM and need to keep track where exactly they ran.

        "kvm_guest": {
            "type": "string",
            "description": "The name of the KVM guest this test case has been executed on"
        },

Do you think it could be changed to something more generic such as "the board" or "the node"?
By the way, is KernelCI a community project with for example a mailing list where I can send patches and there is a reviewer etc..?

> > 6) definition_uri: the URI is inferred from the data in our case right now. In other words, the folder where the
> >      data is stored is a combination of the board's name, testname, spec, build number etc..
>
> >From what I could see and understand, Fuego is tightly coupled with
> Jenkins: kernelci.org is not (or at least tries not to as much as it
> can).
> kernelci.org doesn't know where the builds are running, nor where the
> boots are happening and which systems are being used to do all that.
> The same can be extended to the test phase: they can be run anywhere
> on completely different systems.
>
> Potentially we can swap Jenkins out and use another build system,
> that's why we need to keep track of measurements like this one because
> we don't rely on the other systems.

From the architecture point of view, Fuego does not depend on Jenkins anymore.
There are some quirks that need to be fixed in the implementation, but basically
we are going to be decoupled like KernelCI. In fact we will be able to run and
report results from the command line without GUIs or web applications.

> > 8) attachments: we have something similar (success_links and fail_links in the spec) that are used to present a link on
> >      the jenkins interface. This way the user can download the results (e.g.: excel file, a tar.gz file, a log file, a png..).
>
> See above for the "attachment". I'm not sure it's the same as
> "[success|fail]_links", but I'm lacking some background info here.

It's kind of similar but at the moment we are assuming that the files are stored in the host so the links
are to local files. This is fine for most people, but probably we should support external links like KernelCI does with
the storage server in the future.

> > 9) metadata: we don't have this at the moment, but I think it's covered by the testlog, devlog, and links.
> > 10) kernel: we have this as fwver (we use the word firmware since it doesn't need to be the linux kernel)
> > 11) defconfig: we do not store this at the moment. In the kernel_build test the spec has a "config" parameter that
> >   has similar functionality though.
> > 12) arch: this is stored as part of the board parameters (the board files contain other variables
> >    such as the toolchain used, path used for tests, etc..)
>
> We extract all those values from either the build or the boot data,
> it's mostly needed for searching/indexing capabilities.
> The test schemas are probably a little bit tightly coupled with our
> concepts of build and boot.
>
> > 13) created_on: this information is probably stored inside jenkins.

Actually, this will be stored in the variable FUEGO_HOST.

> > 14) lab_name: this seems similar to the information that Tim wants to add for sharing tests.
> > 15) test_set: this looks similar to fuego's groupnames.

> > The reporting part in Fuego needs to be improved as well, I will be working on this soon.
> > I think that reports should be based on templates, so that the user can prepare his/her
> > own template (e.g.: in Japanese) and Fuego will create the document filling the gaps
> > with data.
> The email we send out are based on some custom templates (built with
> Jinja) that potentially could be translated into different languages:
> we are using gettext to implement plurals/singular, and most of the
> strings in the email template are marked for translation.
>
> We never had the use case for that (nor the time/resources do do
> that), but with some work - and some translations - it could be done.

Actually, what you have is fine I think. KernelCI has a GET interface, so the local reporting
tool would just download the necessary results and create the report according
to a template provided by the user. If we have the same GET interface, we can share
the reporting tool.

> >   + Option 2: Use the KernelCI web app
> >      -> KernelCI web app is a strong option but we may need to extend
> >           some parts. In that case, I would like to work with you and the KernelCI
> >           maintainers because it is too complex to maintain a fork.
> >           Fuego could have a command like "ftc push -f kernelci -s 172.34.5.2" where the
> >           internal custom format would be converted to KernelCI schema and POST'ed
> >      -> The web app must be portable and easy to deploy. We don't want only one
> >            single server on the whole Internet. This work at the CIP project is very
> >            valuable in this sense: https://github.com/cip-project/cip-kernelci
>
> We have plans to move everything to a container based approach, and
> that should be more portable than it is now.

That's great.

Actually, it's already virtualized here (the previous link was outdated).
https://gitlab.com/cip-project/board-at-desk-single-dev

I already got KernelCI working with that, but I had to make a small modification to
KernelCI because I work behind a proxy.

Milo responded with more information:

> Ok, so my understanding now is that there are multiple schemas (batch, boot, boot_regressions,
> build, build_logs, build_logs_summary, compare, job, lab, report, send, token), some of them
> containing two sub-schemas (GET and POST) but for non kernel build/boot tests we would only
> need to care about the 3+3+3 schemas at https://api.kernelci.org/schema-test.html.
> Is that correct?

Yes, you are correct.
For the general tests, you would only need those schemas.

The POST and GET schemas are slightly different because when you POST
something, we extract data from the database and include what we can
infer into the data you GET. That is true if the test data includes
references to build and/or boot reports.

> How many individual JSON files would be needed to be generated/POST'ed for a multi test case test suite like LTP.
> # For example, suppose  1 testsuite made of 6 test sets with 100 test cases each
> # Note: in Fuego we only generate 1 JSON file.

You can send 1, 2 or 3: it depends on how you want to send the data.
You should have a more general overview here of how it can work:
https://api.kernelci.org/collection-test.html

In case of just 1 JSON payload, everything else is "embedded": you
would include into the test_set array everything else (and the
test_set will itself include all the other tests in the test_case
array). Or you could split it up into 3 separate payloads.

> If we make Fuego and KernelCI interact together, Fuego would mainly POST results but
> the reporting tool would also GET them.

Right now though, to extract the same data, you would need at least 2
GET requests: once you upload everything, in "embedded" mode or even
with 3 separate POSTs, we only include the references to the other
data.

Due to lack of support on the database side, we couldn't do the over
way around and re-embed the data back. Now the database supports that,
but we haven't got around to actually implement it.

> By the way, where can I find more information about the "non special" tests?
> # I can only see kernel build/boot test tabs at https://kernelci.org.

That's the missing part on kernelci.org: we never had the time to
implement a visualization for the general tests.
I have a branch on github where I started to play around with that,
but it never saw the light of day.

> I have prepared a virtual machine with KernelCI and I want to start making experiments
> by POST'ing Fuego results from different tests (not just build/boot tests) to KernelCI.
> Is that supposed to work out of the box?

More or less yes: you would need to create a token for yourself first
in order to be able to POST the results into the database.

> It's good to have that decoupling and possibly (?) default to local host when
> the user doesn't have a separate storage server.

It is possible yes, but that is a detail on the visualization side, at
least from our POV.

If nothing is defined, we default to storage.kernelci.org, but if you
are running locally, you can tweak the frontend config files and make
it so that it will point to "localhost". It will probably need more
logic to build the correct path, but if using the kernelci API to
upload files it has a fixed location.

I think the CIP project with their VMs are already doing something similar.

>         "kvm_guest": {
>             "type": "string",
>             "description": "The name of the KVM guest this test case has been executed on"
>         },
>
> Do you think it could be changed to something more generic such as "the board" or "the node"?

There should already be a "board" field, at least at the "test_suite" level.

We were asked to include those as well in the "test_set" and
"test_case" schemas, but never got around doing it: we don't have that
much traffic on the test data.

We only included it at the "test_suite" level because for us, a suite
with all its sets and test cases will be run for (almost) each
successfully booted board.

> By the way, is KernelCI a community project with for example a mailing list where I can send patches and there is a reviewer etc..?

We don't have a mailing list, but the code is publicly available on
github [1] [2], and you can submit pull requests there.
We will look at them.

> That's great.
>
> Actually, it's already virtualized here (the previous link was outdated).
> https://gitlab.com/cip-project/board-at-desk-single-dev

Yeah, I know the project. :-)
They are doing a slightly different thing from what we have in mind
(plus they include LAVA in the VM). What we wanted to achieve is a
"docker pull" style of packaging.

We have a basic wireframe of how this should be done here:
https://github.com/kernelci/kernelci-docker
We never really tested it, and it might be a little bit more complex
than a simple "docker pull" command: there are many moving parts, but
with Docker compose that should be achievable.

> Let me summarize some action items
>   - I will try POST'ing Fuego's kernel_build results to KernelCI (I will use CIP's board-at-desk-single-dev VM)
>   - Is the generic test interface ready to use out of the box?

It should, yes, but it definitely is not bugs or quirks free.
The more we use it the more we will discover things to tweak and fix.

>     + If not, is the KernelCI project willing to (or have time/resources for) patching or reviewing patches?

We are willing to review patches, it might take a little bit of time
to do though.

>     + if yes, I will try POST'ing Fuego's Dhrystone and Bonnie results
>   - Will  the KernelCI project collaborate on the board-at-desk-single-dev VM or create a new container?
>     + If creating a new one, do you have enough resources or can you give us an approximate date?

We don't really collaborate with them on that, strictly speaking, but
we try to help them whenever we can and whenever they need help.

Our approach with the container is slightly different compared to the
VM one: we are looking at how to "containerize" the system in order to
deploy it, but more "on the cloud" side than "on the desk" side
(although, with Docker, it should be fairly portable).

[1] https://github.com/kernelci/kernelci-frontend [2] https://github.com/kernelci/kernelci-backend

KernelCI json examples [edit section]

kernelci test output [edit section]

This is for hackbench:

{
    "_id": ObjectId("584eed9259b5148514f6c3ad"),
    "name": "./common/scripts/min_max_avg_parse.py hackbench_res.txt 'Time:' 'Seconds'",
    "kvm_guest": null,
    "samples_sqr_sum": null,
    "version": "1.0",
    "parameters": {},
    "attachments": [],
    "samples_sum": null,
    "test_suite_id": ObjectId("584eed9259b5141b906315f8"),
    "vcs_commit": null,
    "maximum": null,
    "test_suite_name": "hackbench",
    "test_set_id": ObjectId("584eed9259b5148514f6c3a9"),
    "created_on": ISODate("2016-12-12T18:33:54.985Z"),
    "measurements": [{
        "name": "vbus_max",
        "measure": "5087.50",
        "units": "mV"
    }, {
        "name": "energy",
        "measure": "87816.52",
        "units": "mJ"
    }, {
        "name": "power_min",
        "measure": "1225.00",
        "units": "mW"
    }, {
        "name": "power_max",
        "measure": "3900.00",
        "units": "mW"
    }, {
        "name": "power_avg",
        "measure": "3447.00",
        "units": "mW"
    }, {
        "name": "current_min",
        "measure": "245.00",
        "units": "mA"
    }, {
        "name": "current_max",
        "measure": "776.00",
        "units": "mA"
    }, {
        "name": "hackbench_res_min",
        "measure": "0.533",
        "units": "Seconds"
    }, {
        "name": "hackbench_res_max",
        "measure": "0.722",
        "units": "Seconds"
    }, {
        "name": "hackbench_res_avg",
        "measure": "0.551936",
        "units": "Seconds"
    }, {
        "name": "hackbench_res_mdn",
        "measure": "0.55",
        "units": "Seconds"
    }, {
        "name": "hackbench_res_std",
        "measure": "0.0115152900094",
        "units": "Seconds"
    }],
    "samples": null,
    "definition_uri": null,
    "time": -1,
    "status": "PASS",
    "minimum": null,
    "metadata": {}
}

kernelci testsuite [edit section]

{
    "_id": ObjectId("584eed9259b5141b906315f8"),
    "lab_name": "lab-baylibre",
    "build_id": ObjectId(""),
    "name": "hackbench",
    "job_id": ObjectId(""),
    "definition_uri": null,
    "job": "",
    "board_instance": null,
    "defconfig": "defconfig",
    "test_set": [ObjectId("584eed9259b5148514f6c3a9")],
    "defconfig_full": "defconfig",
    "created_on": ISODate("2016-12-12T18:33:54.976Z"),
    "kernel": "",
    "metadata": {},
    "board": "",
    "arch": "arm64",
    "version": "1.0",
    "time": -1,
    "boot_id": null,
    "test_case": [],
    "vcs_commit": null
}

kernelci testset [edit section]

{
    "_id": ObjectId("584eed9259b5148514f6c3a9"),
    "name": "hackbench",
    "version": "1.0",
    "parameters": {},
    "test_job_id": null,
    "test_case": [
		ObjectId("584eed9259b5148514f6c3aa"),
		ObjectId("584eed9259b5148514f6c3ab"),
		ObjectId("584eed9259b5148514f6c3ac"),
		ObjectId("584eed9259b5148514f6c3ad")
	],
    "time": -1,
    "test_suite_name": "hackbench",
    "test_job_path": null,
    "created_on": ISODate("2016-12-12T18:33:54.983Z"),
    "defects": [],
    "metadata": {},
    "test_job_url": null,
    "vcs_commit": null,
    "definition_uri": null,
    "test_suite_id": ObjectId("584eed9259b5141b906315f8")
}