Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat(cmd): Implemented a CLI for task management #4318

Open
wants to merge 2 commits into
base: master
Choose a base branch
from

Conversation

whisperity
Copy link
Member

@whisperity whisperity commented Aug 19, 2024

Important

⛔ Blocked by #4317.

This is patch 2 of the Asynchronous Store Protocol (#3672).

This patch extends CodeChecker cmd with a new sub-command, serverside-tasks, which lets users and administrators deal with querying the status of running server-side tasks.

By default, the CLI queries the information of the task(s) specified by their token(s) in the --token argument from the server using getTaskInfo(token), and shows this information in either verbose "plain text" (available if precisely one task was specified), "table" or JSON formats.

In addition to --token, it also supports 19 more parameters, each of which correspond to a filter option in the TaskFilter API type. If any filters in addition to --token is specified, it will exercise getTasks(filter) instead. This mode is only available to administrators. The resulting more detailed information structs are printed in "table" or JSON formats.

Apart from querying the current status, two additional flags are available, irrespective of which query method is used to obtain a list of "matching tasks" after filtering:

  • --kill will call cancelTask(token) for each task.
  • --await will block execution until the specified task(s) terminate (in one way or another).

--await is implemented by calling the new await_task_termination library function, which is implemented with the goal of being reusable by other clients later.

Example outputs

Query that results in using the filters by default shows table format

Tip

Notice the showing of Machine ID in the output, as this is a query run by an administrator.

❯ build/CodeChecker/bin/CodeChecker cmd serverside-tasks --enqueued-after 2024:08:19 --status cancelled
----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
Token                                                            | Machine            | Type                   | Summary                         | Status    | Product | User | Enqueued            | Started             | Last seen           | Completed           | Cancelled?
----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
8b62497c7d1b7e3945445f5b9c3951d97ae07e58f97cad60a0187221e7d1e2ba | xxxxxxxxxxxxx:8001 | taskService::DummyTask | Dummy task for testing purposes | CANCELLED |         |      | 2024-08-19 15:55:34 | 2024-08-19 15:55:34 | 2024-08-19 15:55:35 | 2024-08-19 15:55:35 | Yes
6fa0097a9bd1799572c7ccd2afc0272684ed036c11145da7eaf40cc8a07c7241 | xxxxxxxxxxxxx:8001 | taskService::DummyTask | Dummy task for testing purposes | CANCELLED |         |      | 2024-08-19 15:55:53 | 2024-08-19 15:55:53 | 2024-08-19 15:55:53 | 2024-08-19 15:55:53 | Yes
----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------

User can modify the output format via the usual --output parameter

❯ build/CodeChecker/bin/CodeChecker cmd serverside-tasks --token 8b62497c7d1b7e3945445f5b9c3951d97ae07e58f97cad60a0187221e7d1e2ba --output json
[{"token": "8b62497c7d1b7e3945445f5b9c3951d97ae07e58f97cad60a0187221e7d1e2ba", "taskKind": "taskService::DummyTask", "actorUsername": null, "summary": "Dummy task for testing purposes", "comments": "<unknown> at 2024-08-19 15:55:34.887164+00:00:\nSUPERUSER requested cancellation.\n----------\nSYSTEM[AbstractTask::execute()] at 2024-08-19 15:55:35.881162+00:00:\nCANCEL!\nCancel request of admin honoured by task.", "cancelFlagSet": true, "enqueuedAt": "2024-08-19 15:55:34", "startedAt": "2024-08-19 15:55:34", "completedAt": "2024-08-19 15:55:35", "lastHeartbeat": "2024-08-19 15:55:35", "status": "CANCELLED", "productEndpoint": null}]

Default behaviour when searching for a specific task (identified by token)

Tip

Comments are formatted as if they were e-mails or chat messages for better visual separation.

Caution

This output is only meant to be consumed by humans!

❯ build/CodeChecker/bin/CodeChecker cmd serverside-tasks --token 8b62497c7d1b7e3945445f5b9c3951d97ae07e58f97cad60a0187221e7d1e2ba
Task '8b62497c7d1b7e3945445f5b9c3951d97ae07e58f97cad60a0187221e7d1e2ba':
    - Type:         taskService::DummyTask
    - Summary:      Dummy task for testing purposes
    - Status:       CANCELLED
    - Enqueued at:  2024-08-19 15:55:34
    - Started at:   2024-08-19 15:55:34
    - Last seen:    2024-08-19 15:55:35
    - Completed at: 2024-08-19 15:55:35
 
Comments on task '8b62497c7d1b7e3945445f5b9c3951d97ae07e58f97cad60a0187221e7d1e2ba'...
 
<unknown> at 2024-08-19 15:55:34.887164+00:00:
> SUPERUSER requested cancellation.
----------
SYSTEM[AbstractTask::execute()] at 2024-08-19 15:55:35.881162+00:00:
> CANCEL!
> Cancel request of admin honoured by task.

@whisperity whisperity added enhancement 🌟 RDY-OnHold 🛑 Patch reviewed and ready, but don't merge due to having to merge a dependent patch first. CLI 💻 Related to the command-line interface, such as the cmd, store, etc. commands web 🌍 Related to the web app labels Aug 19, 2024
@whisperity whisperity added this to the release 6.25.0 milestone Aug 19, 2024
@whisperity whisperity force-pushed the feat/server/asynchronous-store-protocol/patch/3-task-management-cli branch 2 times, most recently from f6c7f99 to 529a0dd Compare August 20, 2024 13:04
@whisperity whisperity removed the web 🌍 Related to the web app label Aug 25, 2024
This patch implements the whole support ecosystem for server-side
background tasks, in order to help lessen the load (and blocking) of API
handlers in the web-server for long-running operations.

A **Task** is represented by two things in strict co-existence: a
lightweight, `pickle`-able implementation in the server's code (a
subclass of `AbstractTask`) and a corresponding `BackgroundTask`
database entity, which resides in the "configuration" database (shared
across all products).
A Task is created by API request handlers and then the user is
instructed to retain the `TaskToken`: the task's unique identifier.
Following, the server will dispatch execution of the object into a
background worker process, and keep status synchronisation via the
database.
Even in a service cluster deployment, load balancing will not interfere
with users' ability to query a task's status.

While normal users can only query the status of a single task (which is
usually automatically done by client code, and not the user manually
executing something); product administrators, and especially server
administrators have the ability to query an arbitrary set of tasks using
the potential filters, with a dedicated API function (`getTasks()`) for
this purpose.

Tasks can be cancelled only by `SUPERUSER`s, at which point a special
binary flag is set in the status record.
However, to prevent complicating inter-process communication,
cancellation is supposed to be implemented by `AbstractTask` subclasses
in a co-operative way.
The execution of tasks in a process and a `Task`'s ability to
"communicate" with its execution environment is achieved through the new
`TaskManager` instance, which is created for every process of a server's
deployment.

Unfortunately, tasks can die gracelessly if the server is terminated
(either internally, or even externally).
For this reason, the `DROPPED` status will indicate that the server has
terminated prior to, or during a task's execution, and it was unable to
produce results.
The server was refactored significantly around the handling of subprocesses
in order to support various server shutdown scenarios.

Servers will start `background_worker_processes` number of task handling
subprocesses, which are distinct from the already existing "API
handling" subprocesses.
By default, if unconfigured, `background_worker_processes` is equal to
`worker_processes` (the number of API processes to spawn), which is
equal to `$(nproc)` (CPU count in the system).

This patch includes a `TestingDummyTask` demonstrative subclass of
`AbstractTask` which counts up to an input number of seconds, and each
second it gracefully checks whether it is being killed.
The corresponding testing API endpoint, `createDummyTask()` can specify
whether the task should simulate a failing status.
This endpoint can only be used from, but is used extensively, the unit
testing of the project.

This patch does not include "nice" or "ergonomic" facilities for admins
to manage the tasks, and so far, only the server-side of the
corresponding API calls are supported.
This patch extends `CodeChecker cmd` with a new sub-command,
`serverside-tasks`, which lets users and administrators deal with
querying the status of running server-side tasks.

By default, the CLI queries the information of the task(s) specified by
their token(s) in the `--token` argument from the server using
`getTaskInfo(token)`, and shows this information in either verbose
"plain text" (available if precisely **one** task was specified), "table"
or JSON formats.

In addition to `--token`, it also supports 19 more parameters, each of
which correspond to a filter option in the `TaskFilter` API type.
If any filters in addition to `--token` is specified, it will exercise
`getTasks(filter)` instead.
This mode is only available to administrators.
The resulting, more detailed information structs are printed in "table"
or JSON formats.

Apart from querying the current status, two additional flags are
available, irrespective of which query method is used to obtain a list
of "matching tasks":

  * `--kill` will call `cancelTask(token)` for each task.
  * `--await` will block execution until the specified task(s) terminate
    (in one way or another).

`--await` is implemented by calling the new **`await_task_termination`**
library function, which is implemented with the goal of being reusable
by other clients later.
@whisperity whisperity force-pushed the feat/server/asynchronous-store-protocol/patch/3-task-management-cli branch from 529a0dd to d73c1da Compare September 18, 2024 07:59
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
CLI 💻 Related to the command-line interface, such as the cmd, store, etc. commands enhancement 🌟 RDY-OnHold 🛑 Patch reviewed and ready, but don't merge due to having to merge a dependent patch first.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant