Add a handler for the OpenAPI `taskOutputProduced` operation, and an
image thumbnailing goroutine.
The queue of images to process, plus the function that handles queued images,
is managed by `last_rendered.LastRenderedProcessor`. This queue currently
simply allows 3 requests; this should be improved such that it also keeps
track of job IDs, as with the current approach a spammy job can starve the
updates from a calmer job.
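A minimal sketch of such a bounded queue, assuming a buffered channel is used
as the limit; `queueSize`, `thumbnailJob`, and the method names are
illustrative, not the actual Flamenco identifiers:

```go
package last_rendered

import "errors"

// queueSize limits how many thumbnailing requests may be pending at once.
const queueSize = 3

type thumbnailJob struct {
	JobUUID   string
	ImagePath string
}

type LastRenderedProcessor struct {
	queue chan thumbnailJob
}

func NewLastRenderedProcessor() *LastRenderedProcessor {
	return &LastRenderedProcessor{queue: make(chan thumbnailJob, queueSize)}
}

var ErrQueueFull = errors.New("image processing queue is full")

// QueueImage queues an image for thumbnailing, or rejects it when the queue
// is full. Tracking job IDs here would prevent one job from starving others.
func (lrp *LastRenderedProcessor) QueueImage(job thumbnailJob) error {
	select {
	case lrp.queue <- job:
		return nil
	default:
		return ErrQueueFull
	}
}

// Run is meant to run as the thumbnailing goroutine, processing queued
// images until the queue channel is closed.
func (lrp *LastRenderedProcessor) Run() {
	for job := range lrp.queue {
		processImage(job) // Downscale & store the thumbnail (not shown).
	}
}

func processImage(thumbnailJob) {}
```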
Add a `local_storage` package that finds a suitable place to put files.
Currently it just looks at the location of the currently running executable;
it can later be extended to do other things. It can be queried for a directory
in which to put job-specific files.
It is intended to be used by the under-development "last rendered output"
processing system, to store an image file per job. Later we should also
refactor the task log handling system to use this.
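A rough sketch of what such a package could look like, assuming the base
directory sits next to the executable; `NewNextToExe` and `ForJob` are
hypothetical names:

```go
package local_storage

import (
	"fmt"
	"os"
	"path/filepath"
)

type StorageInfo struct {
	rootPath string
}

// NewNextToExe returns storage rooted at a subdirectory next to the
// currently running executable.
func NewNextToExe(subdir string) (StorageInfo, error) {
	exe, err := os.Executable()
	if err != nil {
		return StorageInfo{}, fmt.Errorf("finding running executable: %w", err)
	}
	return StorageInfo{rootPath: filepath.Join(filepath.Dir(exe), subdir)}, nil
}

// ForJob returns a directory for files belonging to a specific job,
// creating it if necessary.
func (si StorageInfo) ForJob(jobUUID string) (string, error) {
	dir := filepath.Join(si.rootPath, "job-"+jobUUID)
	return dir, os.MkdirAll(dir, 0o750)
}
```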
If there is a `scripts` directory next to the current executable, load
scripts from that directory as well.
Picking up changes to those scripts (including new/removed files) still
requires restarting the Manager, plus a refresh in the add-on.
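Under those assumptions, the loading order could look roughly like the sketch
below; `loadScriptsFrom` is a stand-in for the existing script-parsing code
and is not a real Flamenco function name:

```go
package job_compilers

import (
	"io/fs"
	"os"
	"path/filepath"
)

// loadScriptsFrom stands in for the existing code that parses JS files
// found in a filesystem.
func loadScriptsFrom(filesystem fs.FS) error { return nil }

// loadAllScripts loads the embedded scripts first, then any `scripts`
// directory that sits next to the running executable.
func loadAllScripts(embedded fs.FS) error {
	if err := loadScriptsFrom(embedded); err != nil {
		return err
	}

	exe, err := os.Executable()
	if err != nil {
		return err
	}
	onDisk := filepath.Join(filepath.Dir(exe), "scripts")
	if stat, err := os.Stat(onDisk); err != nil || !stat.IsDir() {
		return nil // No on-disk scripts directory; only embedded scripts are used.
	}
	return loadScriptsFrom(os.DirFS(onDisk))
}
```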
When on-disk job compiler scripts are supported, they will be reloaded
often, so it becomes more important that access to the map of loaded job
compilers is thread-safe.
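A sketch of what that thread-safe access could look like, guarding the map
with a `sync.RWMutex`; the `Compiler` type and the method names are
assumptions:

```go
package job_compilers

import "sync"

type Compiler struct{} // Stands in for a compiled JS job type.

type Service struct {
	mutex     sync.RWMutex
	compilers map[string]Compiler // Keyed by job type name.
}

// compilerForJobType is safe to call while scripts are being reloaded.
func (s *Service) compilerForJobType(jobType string) (Compiler, bool) {
	s.mutex.RLock()
	defer s.mutex.RUnlock()
	compiler, ok := s.compilers[jobType]
	return compiler, ok
}

// replaceCompilers swaps in a freshly loaded set of job compilers.
func (s *Service) replaceCompilers(newCompilers map[string]Compiler) {
	s.mutex.Lock()
	defer s.mutex.Unlock()
	s.compilers = newCompilers
}
```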
The global `scriptFS` variable was too easy to access directly, which caused
an issue where the mandatory `"scripts"` subdirectory was not passed.
Accessing it via a getter function that hides this requirement prevents such
mistakes.
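A sketch of such a getter, assuming `fs.Sub` is used to root the filesystem at
the `"scripts"` subdirectory; the names are illustrative:

```go
package job_compilers

import (
	"embed"
	"io/fs"
)

//go:embed scripts
var scriptFS embed.FS

// embeddedScriptFS returns the embedded filesystem rooted at its mandatory
// "scripts" subdirectory, so callers can no longer forget to pass it.
func embeddedScriptFS() fs.FS {
	subFS, err := fs.Sub(scriptFS, "scripts")
	if err != nil {
		panic("embedded scripts directory missing: " + err.Error())
	}
	return subFS
}
```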
Take some functions out of the `Service` struct, as they are more or less
standalone anyway. This will also make it easier later to make things
thread-safe, as that'll become important when files can get live-reloaded.
The `Load()` function returns a `*Service`, and it was confusing that the
local variable was named `compiler` instead. Now it's called `service`.
No functional changes.
Refactor the JS script file loading code so that it's tied to the `fs.FS`
interface for longer, and less to the specifics of our `embed.FS` instance.
This should make it possible to use other filesystems, like a real on-disk
one, to load scripts.
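A sketch of a loader written against `fs.FS` only, so it works equally well
for the `embed.FS` instance and for `os.DirFS` over a real directory; the
function name and return type are assumptions:

```go
package job_compilers

import (
	"io/fs"
	"path"
)

// loadScripts collects the contents of all .js files in the given filesystem,
// regardless of whether that filesystem is embedded or on disk.
func loadScripts(filesystem fs.FS) (map[string][]byte, error) {
	scripts := map[string][]byte{}
	err := fs.WalkDir(filesystem, ".", func(filePath string, d fs.DirEntry, err error) error {
		if err != nil {
			return err
		}
		if d.IsDir() || path.Ext(filePath) != ".js" {
			return nil
		}
		contents, err := fs.ReadFile(filesystem, filePath)
		if err != nil {
			return err
		}
		scripts[filePath] = contents
		return nil
	})
	return scripts, err
}
```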
Move the code related to task updates from workers to
`worker_task_updates.go`. It's going to get more complex with the
blocklisting in there; this prepares for that.
No functional changes.
When a job or task gets requeued from the web interface, its task
failure lists (i.e. the list of workers that previously failed this
task) will be cleared.
This clearing doesn't happen in other situations; for example, when a worker
signs off and its task gets requeued, the task's failure list remains as-is.
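A sketch of that clearing, assuming the failure list lives in a GORM-managed
`task_failures` table; the model and method names are illustrative:

```go
package persistence

import (
	"context"

	"gorm.io/gorm"
)

type DB struct{ gormDB *gorm.DB }

type Task struct{ ID uint }

type TaskFailure struct {
	TaskID   uint
	WorkerID uint
}

// ClearFailureListOfTask removes all recorded failures of this task, so that
// previously-failing workers may pick it up again after a manual requeue.
func (db *DB) ClearFailureListOfTask(ctx context.Context, t *Task) error {
	tx := db.gormDB.WithContext(ctx).
		Where("task_id = ?", t.ID).
		Delete(&TaskFailure{})
	return tx.Error
}
```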
Rename functions `onTaskStatusX` to `updateJobOnTaskStatusX` to clarify
their responsibility is to update the job in reaction to a task status
change.
No functional changes.
This fixes a bug where 'Worker undefined changed status' was logged in
the web interface, because the name was taken from `workerupdate.name`, which
was incorrect at the time. That code is now correct.
Before checking whether the Worker is allowed to do work (i.e. is in
`awake` state), check for any queued-up status changes. Those should be
communicated before saying "no work for you", so that the Worker can
actually respond to them.
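A minimal sketch of that ordering, assuming an Echo-style handler; the types,
status codes, and helper names here are illustrative rather than Flamenco's
actual API:

```go
package worker_mgt_sketch

import (
	"net/http"

	"github.com/labstack/echo/v4"
)

type Worker struct {
	Status          string
	StatusRequested string // Non-empty when a status change is queued up.
}

type workerStateChange struct {
	StatusRequested string `json:"status_requested"`
}

func scheduleTask(e echo.Context, worker *Worker) error {
	// Communicate any queued-up status change first, so the Worker can
	// actually respond to it.
	if worker.StatusRequested != "" {
		return e.JSON(http.StatusLocked, workerStateChange{
			StatusRequested: worker.StatusRequested,
		})
	}

	// Only now check whether the Worker is allowed to do any work at all.
	if worker.Status != "awake" {
		return e.JSON(http.StatusConflict, "no work for you")
	}

	// ... actual task scheduling would happen here ...
	return e.NoContent(http.StatusNoContent)
}
```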
When a Worker indicates a task failed, mark it as `soft-failed` until
enough workers have tried & failed at the same task.
This is the first step in a blocklisting system, where tasks of an
often-failing worker will be requeued to be retried by others.
NOTE: currently the failure list of a task is NOT reset whenever it is
requeued! This will be implemented in a future commit, and is tracked in
`FEATURES.md`.
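The decision itself could be as simple as the sketch below; the threshold
value and the names are illustrative, and the actual threshold may differ:

```go
package task_state_machine

// failureThreshold is how many distinct workers must fail a task before it
// is considered properly failed; below that it stays soft-failed.
const failureThreshold = 3

func taskStatusAfterFailure(numFailedWorkers int) string {
	if numFailedWorkers < failureThreshold {
		// Not enough workers have failed this task yet; let another worker retry.
		return "soft-failed"
	}
	return "failed"
}
```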
The persistence layer can now store which worker failed which task, as
preparation for a blocklisting system. Such a system should be able to
determine whether there are still any workers left to do the work.
This is needed for a future unit test, and exposed the fact that SQLite
didn't enforce foreign key constraints (and thus also didn't handle
on-delete-cascade attributes). This has been fixed in the previous commit.
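A sketch, assuming GORM, of how the worker/task failure association could be
persisted and queried; the `TaskFailure` model and the method name are
illustrative:

```go
package persistence

import (
	"context"

	"gorm.io/gorm"
	"gorm.io/gorm/clause"
)

// TaskFailure links a task to a worker that failed it. In the real schema
// these columns would be foreign keys with on-delete-cascade behaviour.
type TaskFailure struct {
	TaskID   uint `gorm:"primaryKey"`
	WorkerID uint `gorm:"primaryKey"`
}

type DB struct{ gormDB *gorm.DB }

// AddWorkerToTaskFailedList records that this worker failed this task, and
// returns how many distinct workers have failed it so far.
func (db *DB) AddWorkerToTaskFailedList(ctx context.Context, taskID, workerID uint) (int64, error) {
	tx := db.gormDB.WithContext(ctx).
		Clauses(clause.OnConflict{DoNothing: true}).
		Create(&TaskFailure{TaskID: taskID, WorkerID: workerID})
	if tx.Error != nil {
		return 0, tx.Error
	}

	var numFailed int64
	tx = db.gormDB.WithContext(ctx).
		Model(&TaskFailure{}).
		Where("task_id = ?", taskID).
		Count(&numFailed)
	return numFailed, tx.Error
}
```

Note that SQLite only enforces such foreign key constraints when
`PRAGMA foreign_keys = ON` is set for the connection.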
When creating tasks, the inter-task dependencies are saved in a second pass,
by updating the tasks in the database. This now only saves those dependencies,
and no longer saves the entire task again.
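A sketch of such a dependencies-only second pass, assuming the dependencies
are a GORM many-to-many association on the Task model; the names are
illustrative:

```go
package persistence

import "gorm.io/gorm"

type Task struct {
	gorm.Model
	Name         string
	Dependencies []*Task `gorm:"many2many:task_dependencies;"`
}

// saveTaskDependencies writes only the dependency association, without
// re-saving the entire task row.
func saveTaskDependencies(db *gorm.DB, task *Task, deps []*Task) error {
	return db.Model(task).Association("Dependencies").Replace(deps)
}
```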
`persistence.Model` contains the common database fields for most model
structs. It is a copy of `gorm.Model`, but without the `DeletedAt`
field (which triggers Gorm's soft deletion).
Soft deletion is not used by Flamenco. If it ever becomes necessary to
support soft-deletion, see https://gorm.io/docs/delete.html#Soft-Delete
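Given that description, `persistence.Model` would look roughly like
`gorm.Model` with the `DeletedAt` field removed:

```go
package persistence

import "time"

// Model contains the common database fields for most model structs. It is a
// copy of gorm.Model, but without the DeletedAt field that would enable
// Gorm's soft deletion.
type Model struct {
	ID        uint `gorm:"primarykey"`
	CreatedAt time.Time
	UpdatedAt time.Time
}
```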
When receiving a `TaskUpdate` from a Worker, write to the task log before
handling any task status change.
If both log and task status change are sent, the log will likely contain
the cause of the task state change. Any subsequent task logs, for example
generated by the Manager in response to the status change, should be
logged after that.
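A sketch of that ordering; the field names, helpers, and types below are
stand-ins, not the actual Flamenco API:

```go
package task_update_sketch

import "context"

type Task struct{ UUID string }

type TaskUpdate struct {
	Log        string
	TaskStatus string
}

func writeTaskLog(ctx context.Context, t *Task, logText string) error          { return nil }
func handleTaskStatusChange(ctx context.Context, t *Task, status string) error { return nil }

func handleTaskUpdate(ctx context.Context, task *Task, update TaskUpdate) error {
	// The Worker's log likely contains the cause of the status change,
	// so write it first.
	if update.Log != "" {
		if err := writeTaskLog(ctx, task, update.Log); err != nil {
			return err
		}
	}
	// Handle the status change afterwards; any log entries generated by the
	// Manager in response will then appear after the Worker's own log.
	if update.TaskStatus != "" {
		return handleTaskStatusChange(ctx, task, update.TaskStatus)
	}
	return nil
}
```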
The canary test asserts that certain constants still have their expected
values. Lowering those constants is useful for testing the timeout handling
with the actual Flamenco Manager + Worker (without having to wait 5 minutes
for it to kick in), but it's too easy to leave those constants low,
accidentally run the unit tests, and get cryptic errors about everything
failing horribly and miserably.
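A sketch of such a canary test, with a hypothetical `workerTimeout` constant
standing in for the real ones; it fails loudly when the constant was
temporarily lowered for manual testing:

```go
package timeout_checker_sketch

import (
	"testing"
	"time"
)

// workerTimeout stands in for the real constant; it may be lowered
// temporarily when manually testing the timeout behaviour.
const workerTimeout = 5 * time.Minute

func TestCanaryConstants(t *testing.T) {
	if workerTimeout != 5*time.Minute {
		t.Fatalf("workerTimeout is %v, expected 5m; "+
			"did you forget to restore it after manual timeout testing?", workerTimeout)
	}
}
```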