The task state machine expects that `task.Job` is set correctly. Since
SQLC does not automatically fill this field (and rightfully so), I've added
a bit of Go code that fetches the job in a separate query.
A TODO is added as a reminder that it would be better for the task state
machine itself to fetch the job when needed.
This is a bit more work than for other queries, as it also splits the
fetching of the job and of the worker into separate queries. In other words,
the persistence layer API changes internally.
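As a rough illustration, the pattern looks like the sketch below. All names are illustrative; the real code uses the SQLC-generated query methods and the persistence layer's own types.

```go
package persistence

import "context"

// Minimal stand-ins for the real job & task models.
type Job struct{ ID int64 }

type Task struct {
	JobID int64
	Job   *Job // the task state machine expects this to be set
}

// fetchTaskWithJob fetches a task and then its job in a separate query,
// because SQLC does not fill nested structs like Task.Job. The fetch
// functions stand in for the SQLC-generated query methods.
func fetchTaskWithJob(
	ctx context.Context,
	fetchTask func(context.Context) (*Task, error),
	fetchJob func(context.Context, int64) (*Job, error),
) (*Task, error) {
	task, err := fetchTask(ctx)
	if err != nil {
		return nil, err
	}

	// TODO: it would be better for the task state machine to fetch the job
	// itself, only when it actually needs it.
	job, err := fetchJob(ctx, task.JobID)
	if err != nil {
		return nil, err
	}
	task.Job = job
	return task, nil
}
```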
GORM has certain downsides:
- Code-first approach, where queries have to be translated to the Go code
required to execute them.
- GORM comes with its own SQLite implementation, which doesn't provide an
on-connect callback. This means that new connections cannot correctly
enable foreign key constraints, causing database consistency issues.
[SQLC](https://sqlc.dev/) solves these issues for us.
This commit doesn't fully replace GORM with SQLC, but introduces it for
a few queries. Once all queries have been converted, GORM can be removed
completely.
Just as a safety measure, before deleting a job, check that foreign key
constraints are enabled. These are optional in SQLite, and the deletion
function assumes that they are on.
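A minimal sketch of such a check, using `database/sql` directly (the actual persistence layer wraps its connections differently):

```go
package persistence

import (
	"context"
	"database/sql"
	"errors"
	"fmt"
)

// checkForeignKeysEnabled returns an error when the current SQLite connection
// does not enforce foreign key constraints. Deletion should be refused in
// that case, as it relies on ON DELETE CASCADE behaviour.
func checkForeignKeysEnabled(ctx context.Context, db *sql.DB) error {
	var enabled int
	if err := db.QueryRowContext(ctx, "PRAGMA foreign_keys").Scan(&enabled); err != nil {
		return fmt.Errorf("querying SQLite foreign key setting: %w", err)
	}
	if enabled != 1 {
		return errors.New("foreign key constraints are not enabled, refusing to delete")
	}
	return nil
}
```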
Implement the API function to mass-mark jobs for deletion, based on
their 'updated_at' timestamp.
Note that the `last_updated_max` parameter is rounded up to entire
seconds. This may mark more jobs for deletion than you expect, if their
`updated_at` timestamps differ by less than a second.
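The rounding could look like the following sketch (the function name is illustrative):

```go
package persistence

import "time"

// ceilToSecond rounds a timestamp up to the next whole second. Jobs whose
// updated_at timestamp is not after the rounded value get marked for
// deletion, which is why more jobs may be matched than the caller expects.
func ceilToSecond(t time.Time) time.Time {
	truncated := t.Truncate(time.Second)
	if truncated.Equal(t) {
		return t
	}
	return truncated.Add(time.Second)
}
```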
Change the package base name of the Go code, from
`git.blender.org/flamenco` to `projects.blender.org/studio/flamenco`.
The old location, `git.blender.org`, has not been in use since the
[migration to Gitea][1]. The new package names now reflect the actual
location where Flamenco is hosted.
[1]: https://code.blender.org/2023/02/new-blender-development-infrastructure/
Due to an issue (which has been fixed in the previous commit), all tasks
in the database were deleted when starting Flamenco. This tool attempts
to recompile the job and recreate its tasks.
The statuses of the tasks are set based on the job status. Basically:
- job active → tasks queued
- job completed → tasks completed
- job cancelled / failed → tasks cancelled
- otherwise → tasks queued
To ensure that the tool is only used to create tasks from scratch, it
refuses to work on a job that still has tasks in the database.
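In code, the mapping is essentially a small switch, sketched here with plain status strings (the real tool uses Flamenco's API status types, so the names below are illustrative):

```go
package recreator

// taskStatusForRecreatedTasks maps a job status to the status given to the
// recreated tasks, following the mapping above. Unknown job statuses default
// to queueing the tasks.
func taskStatusForRecreatedTasks(jobStatus string) string {
	switch jobStatus {
	case "active":
		return "queued"
	case "completed":
		return "completed"
	case "cancelled", "failed":
		return "cancelled"
	default:
		return "queued"
	}
}
```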
As it was decided that the name "tags" would be better for the clarity
of the feature, all files and code referring to "cluster" or "worker cluster"
have been renamed to use "tag" and "worker tag". This is only a name change;
no other features were touched.
This addresses part of #104204.
Reviewed-on: https://projects.blender.org/studio/flamenco/pulls/104223
As a note to anyone who already ran a pre-release version of Flamenco
and configured some worker clusters, with the help of an SQLite client
you can migrate the clusters to tags. First build Flamenco Manager and
start it, to create the new database schema. Then run these SQL queries
via an SQLite command-line client:
```sql
insert into worker_tags
(id, created_at, updated_at, uuid, name, description)
select id, created_at, updated_at, uuid, name, description
from worker_clusters;
insert into worker_tag_membership (worker_tag_id, worker_id)
select worker_cluster_id, worker_id from worker_cluster_membership;
```
The Shaman Checkout ID setter shouldn't update a job's "updated at"
timestamp. Its goal is to fake that the job was submitted with a new
enough Flamenco version, and thus should not touch the timestamps.
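One way to do this with GORM is to update only the relevant column, since `UpdateColumn` skips hooks and does not track the update time. A sketch with a minimal stand-in model (field and column names are illustrative):

```go
package persistence

import "gorm.io/gorm"

// Job is a minimal stand-in for the real job model.
type Job struct {
	ID               uint
	ShamanCheckoutID string
}

// setShamanCheckoutID writes only the checkout ID column. GORM's
// UpdateColumn skips hooks and does not touch updated_at, so the job's
// timestamps stay exactly as they were.
func setShamanCheckoutID(db *gorm.DB, job *Job, checkoutID string) error {
	tx := db.Model(job).UpdateColumn("shaman_checkout_id", checkoutID)
	return tx.Error
}
```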
This is a command that can be run to retroactively set the Shaman
Checkout ID of jobs, allowing the job deletion to also remove the job's
Shaman checkout directory.
This is highly experimental, and not built by default or shipped with
Flamenco releases. It's only been used once at Blender Animation Studio
to help clean up. Run at your own risk. Make backups first.
- Add a little confirmation overlay before deleting a job. This overlay
also shows information about whether the Shaman checkout directory
will be deleted or not.
- Send job updates to the web frontend when jobs are marked for
deletion, and when they are actually deleted.
- Respond to those updates, and handle some corner cases where job info
is missing (because it just got deleted).
This closes T99401.
Implement the `deleteJob` API endpoint. Calling this endpoint will mark
the job as "deletion requested", after which it's queued for actual
deletion. This makes the API response fast, even when there is a lot of
work to do in the background.
A new background service "job deleter" keeps track of the queue of such
jobs, and performs the actual deletion. It removes:
- Shaman checkout for the job (but see below)
- Manager-local files of the job (task logs, last-rendered images)
- The job itself
The removal is done in the above order, so the job is only removed from the
database if the rest of the removal was successful.
Shaman checkouts are only removed if the job was submitted with Flamenco
version 3.2. Earlier versions did not record enough information to reliably
do this.
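Structurally, the deletion order can be sketched like this (interface and function names are illustrative, not the Manager's actual API):

```go
package jobdeleter

import "context"

// Storage abstracts what the job deleter needs. All names here are
// illustrative, not Flamenco's actual interfaces.
type Storage interface {
	RemoveShamanCheckout(ctx context.Context, jobUUID string) error
	RemoveJobLocalFiles(ctx context.Context, jobUUID string) error // task logs, last-rendered images
	DeleteJobFromDB(ctx context.Context, jobUUID string) error
}

// deleteJob removes everything belonging to a job. The database record is
// only removed after the files are gone, so a failure leaves the job queued
// for another deletion attempt.
func deleteJob(ctx context.Context, store Storage, jobUUID string) error {
	if err := store.RemoveShamanCheckout(ctx, jobUUID); err != nil {
		return err
	}
	if err := store.RemoveJobLocalFiles(ctx, jobUUID); err != nil {
		return err
	}
	return store.DeleteJobFromDB(ctx, jobUUID)
}
```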
If Shaman is used to submit the job files, store the job's checkout ID
(i.e. the path relative to the checkout root) in the database. This will
make it possible in the future to remove the Shaman checkout along with
the job itself.
The priority of an existing job can now be changed. It will be taken into
account when assigning tasks to workers, but it will not reassign tasks
that are already active.
Workers can now be soft-deleted. Tasks assigned to the worker will remain
associated with that Worker. Active tasks will be re-queued so other
workers can pick them up.
Be more selective in what's saved to the database to speed some things up.
Most importantly, this avoids saving the entire job when a task status is
updated or a task is assigned.
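For example, a task status change can be written by selecting only the affected column instead of saving the whole task (and its job) again. A sketch of one way to do this with GORM, using a minimal stand-in model:

```go
package persistence

import "gorm.io/gorm"

// Task is a minimal stand-in for the real task model.
type Task struct {
	ID     uint
	Status string
}

// saveTaskStatus writes only the status column of the task, rather than
// saving the entire task record (and its associations) again.
func saveTaskStatus(db *gorm.DB, task *Task) error {
	tx := db.Model(task).Select("status").Updates(Task{Status: task.Status})
	return tx.Error
}
```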
When a job or task gets requeued from the web interface, its task
failure lists (i.e. the list of workers that previously failed this
task) will be cleared.
This clearing doesn't happen in other situations, e.g. when a worker
signs off and its task gets requeued, the task's failure list will
remain as-is.
The persistence layer can now store which worker failed which task, as
preparation for a blocklisting system. Such a system should be able to
determine whether there are still any workers left to do the work.
This is needed for a future unit test, and exposed the fact that SQLite
didn't enforce foreign key constraints (and thus also didn't handle
on-delete-cascade attributes). This has been fixed in the previous commit.
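A sketch of such a model, based on the description above (field names and GORM tags are illustrative; minimal stand-in models are included to keep it self-contained):

```go
package persistence

import "time"

// Minimal stand-ins for the real models.
type Task struct{ ID uint }
type Worker struct{ ID uint }

// TaskFailure records that a particular worker failed a particular task.
// Its ON DELETE CASCADE constraints only work when SQLite actually enforces
// foreign keys, hence the fix in the previous commit.
type TaskFailure struct {
	CreatedAt time.Time
	TaskID    uint   `gorm:"primaryKey;autoIncrement:false"`
	Task      Task   `gorm:"constraint:OnDelete:CASCADE"`
	WorkerID  uint   `gorm:"primaryKey;autoIncrement:false"`
	Worker    Worker `gorm:"constraint:OnDelete:CASCADE"`
}
```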
When creating tasks, the inter-task dependencies are saved as a second pass, by
updating the tasks in the database. This now only saves those dependencies,
and no longer saves the entire task again.
`persistence.Model` contains the common database fields for most model
structs. It is a copy of `gorm.Model`, but without the `DeletedAt`
field (which triggers GORM's soft deletion).
Soft deletion is not used by Flamenco. If it ever becomes necessary to
support soft-deletion, see https://gorm.io/docs/delete.html#Soft-Delete
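Given that description, the struct is essentially `gorm.Model` without the soft-deletion field:

```go
package persistence

import "time"

// Model contains the common fields for most persistence model structs. It is
// a copy of gorm.Model without the DeletedAt field, so GORM's soft deletion
// is never triggered.
type Model struct {
	ID        uint `gorm:"primarykey"`
	CreatedAt time.Time
	UpdatedAt time.Time
}
```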
Tasks that are in state `active` but haven't been 'touched' by a Worker
for 10 minutes or longer will transition to state `failed`.
In the future, it might be better to move the decision about which state
is suitable to the Task State Machine service, so that it can be smarter
and take the history of the task into account. Going to `soft-failed`
first might be a nice touch.
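The core of the check is a comparison of the task's last-touched timestamp against the timeout; a sketch with an illustrative constant and function name:

```go
package timeout

import "time"

// taskTimeout is the period after which an untouched active task is failed.
const taskTimeout = 10 * time.Minute

// taskIsTimedOut reports whether an active task has gone without a worker
// 'touch' for at least the timeout period.
func taskIsTimedOut(lastTouchedAt, now time.Time) bool {
	return now.Sub(lastTouchedAt) >= taskTimeout
}
```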
Send & handle `JobUpdate.refresh_tasks = true` when many tasks are
updated simultaneously. This applies to things like cancelling &
requeueing an entire job.
This partially rolls back 67bf77de13d99b1bc5d7344951068822c4fadd88, as
it was too slow when 1000+ tasks were being updated all at once.
Check for jobs in 'cancel-requested' or 'requeued' statuses, and ensure
they transition to the right status. This happens at startup, before
even starting the web interface, so that a consistent state is presented.
When the job status changes, it impacts the task statuses as well. These
status changes are now no longer done with a single database query, but
instead each affected task is fetched, changed, and saved. This unifies
the regular & mass updates to the tasks, and causes the resulting task
changes to be broadcast to SocketIO clients.
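Conceptually, the mass update now loops over the affected tasks instead of issuing a single UPDATE query. A sketch with illustrative names; the save and broadcast functions stand in for the persistence layer and the event broadcaster:

```go
package statemachine

import "context"

// Task is a minimal stand-in for the real task model.
type Task struct {
	UUID   string
	Status string
}

// updateTasksForJobStatusChange changes and saves each affected task
// individually, so every change can also be broadcast to SocketIO clients.
func updateTasksForJobStatusChange(
	ctx context.Context,
	tasks []*Task,
	newTaskStatus string,
	save func(context.Context, *Task) error,
	broadcast func(*Task),
) error {
	for _, task := range tasks {
		task.Status = newTaskStatus
		if err := save(ctx, task); err != nil {
			return err
		}
		broadcast(task)
	}
	return nil
}
```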