
Add support for recurring tasks (cron style jobs) #155

Merged
merged 24 commits into main from cron-jobs-take-2 on Mar 20, 2024
Conversation

rosa
Member

@rosa rosa commented Feb 20, 2024

This PR introduces support for recurring (aka. cron-style) tasks. They can be included in the dispatcher's configuration as:

  dispatchers:
    - polling_interval: 1
      batch_size: 500
      recurring_tasks:
        my_periodic_job:
          class: MyJob
          args: [ 42, { status: "custom_status" } ]
          schedule: every second

recurring_tasks is a hash/dictionary, and the key will be the task key internally. Each task needs to have a class, which will be the job class to enqueue, and a schedule. The schedule is parsed using Fugit, so it accepts anything that Fugit accepts as a cron. You can also provide arguments to be passed to the job, as a single argument, a hash, or an array of arguments that can also include kwargs as the last element in the array.

The job in the example configuration above will be enqueued every second as:

MyJob.perform_later(42, status: "custom_status")
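For illustration, the accepted args forms described above could be written like this (the job names here are hypothetical, not part of the PR):

```yaml
recurring_tasks:
  single_arg_job:
    class: SingleArgJob                         # hypothetical job class
    args: 42                                    # a single argument
    schedule: every minute
  hash_arg_job:
    class: HashArgJob
    args: { status: "custom_status" }           # a hash argument
    schedule: every minute
  kwargs_job:
    class: KwargsJob
    args: [ 42, { status: "custom_status" } ]   # trailing hash becomes kwargs
    schedule: every minute
```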

Tasks are enqueued at their corresponding times by the dispatcher that owns them, and each task schedules the next one. This is pretty much inspired by what GoodJob does.

It's possible to run multiple dispatchers with the same recurring_tasks configuration. To avoid enqueuing duplicate tasks at the same time, an entry in a new solid_queue_recurring_executions table is created in the same transaction as the job is enqueued. This table has a unique index on task_key and run_at, ensuring only one entry per task per time will be created. This only works if you have preserve_finished_jobs set to true (the default), and the guarantee applies as long as you keep the jobs around.
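The uniqueness guarantee can be sketched in plain Ruby (this is a simulation of the idea, not Solid Queue's actual implementation): whichever dispatcher records the (task_key, run_at) pair first gets to enqueue the job, and any later attempt for the same pair is rejected, just as the unique index rejects a duplicate row.

```ruby
require "set"

# Simulates the unique index on (task_key, run_at): the first dispatcher
# to insert the pair wins; a second insert for the same pair is rejected,
# so the job is enqueued exactly once per task per scheduled time.
class RecurringExecutions
  def initialize
    @index = Set.new
  end

  # Returns true if the entry was recorded (caller should enqueue the job),
  # false if another dispatcher already recorded it.
  def record(task_key, run_at)
    key = [task_key, run_at]
    return false if @index.include?(key)
    @index << key
    true
  end
end

executions = RecurringExecutions.new
run_at = "2024-03-20T00:00:00Z"

puts executions.record("my_periodic_job", run_at) # first dispatcher: true
puts executions.record("my_periodic_job", run_at) # second dispatcher: false
```

In the real implementation the insert happens in the same transaction as the enqueue, so a rejected insert rolls back the duplicate job as well.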

Finally, it's possible to configure jobs that aren't handled by Solid Queue. That is, you can have a job like this in your app:

class MyResqueJob < ApplicationJob
  self.queue_adapter = :resque

  def perform(arg)
    # ..
  end
end

You can still configure this in Solid Queue:

  dispatchers:
    - recurring_tasks:
        my_periodic_resque_job:
          class: MyResqueJob
          args: 22
          schedule: "*/5 * * * *"

and the job will be enqueued via perform_later so it'll run in Resque. However, in this case we won't track any solid_queue_recurring_execution record for it and there won't be any guarantees that the job is enqueued only once each time.

This pull request also introduces a new configuration option for the dispatcher to opt out of concurrency maintenance, via concurrency_maintenance: false (it's true by default). You can run multiple dispatchers and have only some of them perform concurrency maintenance, and likewise have only some of them in charge of dispatching recurring tasks.
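For example, a configuration where one dispatcher handles both concurrency maintenance and recurring tasks while a second one only dispatches could look like this (a sketch based on the options described above):

```yaml
dispatchers:
  - polling_interval: 1
    batch_size: 500
    concurrency_maintenance: true   # the default
    recurring_tasks:
      my_periodic_job:
        class: MyJob
        schedule: every second
  - polling_interval: 1
    batch_size: 500
    concurrency_maintenance: false  # opt out of concurrency maintenance
```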

Closes #104.

Pending:

  • Update README with this new feature.

@rosa rosa force-pushed the cron-jobs-take-2 branch 10 times, most recently from f0089cd to 3ba1861 on February 20, 2024 18:49
@rosa rosa force-pushed the cron-jobs-take-2 branch 5 times, most recently from f2f10f5 to 974a112 on February 27, 2024 17:20
require "active_job"
require "active_job/queue_adapters"

require "zeitwerk"
Member

❤️

@klenis
Contributor

klenis commented Mar 4, 2024

I'm testing this branch with a frequent cart expiry job that runs every minute. My concern is the amount of noise generated in the jobs table. What are your thoughts on either having the option to add a condition to the task definition such as:

recurring_tasks:
  expire_carts_job:
    class: ExpireCartsJob
    schedule: every minute
    if: -> { Cart.candidates_for_expiry.any? }

or a task specific version of the clear_finished_jobs_after setting for automatic cleanup

@rosa
Member Author

rosa commented Mar 5, 2024

@klenis, this would depend on your job volume, but a job every minute would be 1,440 jobs per day and 10,080 jobs after one week. How would this compare to your current job volume?

As a comparison, in HEY, the noise corresponding to recurring jobs is about ~2,000 per day, but that's negligible compared to regular jobs (over 10M / day). I think this might be the case for most users because the lowest time interval you can schedule jobs to run recurringly is 1 second.

In case it helps, this is what we use on HEY to delete jobs that finished over 3 days ago:

# config/application.rb

# Keep finished Solid Queue jobs for 3 days
config.solid_queue.clear_finished_jobs_after = 3.days

And then as part of our recurring tasks:

clear_solid_queue_finished_jobs:
  class: "CronJob"
  schedule: "42 * * * *"
  args: "SolidQueue::Job.clear_finished_in_batches(batch_size: 1000)"

Would something like this work for you?

@klenis
Contributor

klenis commented Mar 5, 2024

I guess the scheduled cleaner could be a viable solution. It would be nice to be able to pass class_name: to clear_finished_in_batches to simplify custom cleaning logic but I understand if you want to keep the public interface for simplicity.

Thank you for taking the time to respond and great job with Solid Queue 👏

@rosa
Member Author

rosa commented Mar 6, 2024

It would be nice to be able to pass class_name: to clear_finished_in_batches to simplify custom cleaning logic

Ohh, interesting idea! I hadn't thought about that as we didn't need that granularity when clearing jobs, but it's something I can certainly add 😊 Thank you!
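The proposed filter could be sketched in plain Ruby like this; the method name mirrors clear_finished_in_batches but the signature, the class_name: option, and the data shapes are illustrative only, not Solid Queue's actual API:

```ruby
# Batched clearing of finished jobs, optionally restricted to one job class.
# Jobs are modeled as plain hashes here; the real thing operates on AR rows.
def clear_finished_in_batches(jobs, batch_size: 500, class_name: nil)
  candidates = jobs.select do |job|
    job[:finished] && (class_name.nil? || job[:class_name] == class_name)
  end
  cleared = 0
  candidates.each_slice(batch_size) do |batch|
    # The real implementation would issue one DELETE per batch;
    # here we just drop the entries from the in-memory array.
    batch.each { |job| jobs.delete(job) }
    cleared += batch.size
  end
  cleared
end

jobs = [
  { class_name: "ExpireCartsJob", finished: true },
  { class_name: "OtherJob",       finished: true },
  { class_name: "ExpireCartsJob", finished: false },
]

puts clear_finished_in_batches(jobs, class_name: "ExpireCartsJob") # prints 1
```

Filtering before slicing keeps each batch full of deletable rows, so the number of delete round-trips stays proportional to the matching jobs rather than the whole table.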

@weilandia


Hey @rosa 👋

Does BC run SolidQueue on a dedicated db?

@rosa
Member Author

rosa commented Mar 9, 2024

Does BC run SolidQueue on a dedicated db?

Yes! We use it for HEY only (for now), and it has its own DB that shares the hardware with the app's main DB.

rosa added 16 commits March 14, 2024 17:21
It was always zero for the default polling interval, so it was doing nothing
and we didn't even realise ^_^U
Using concurrent-ruby's scheduled tasks. Each task schedules the next one,
like GoodJob does. Add a simple test and allow dispatcher to be initialized
without having to pass instantiated recurring tasks.
To avoid any confusion with Active Record's id.
To keep track of the jobs associated with each recurring task and to
avoid creating duplicate ones.
…an once

Only when the recurring job being enqueued is using Solid Queue as the adapter.
This supports other adapters as well, but in that case we can't guarantee unique
runs of the same task at the same time.
If we don't explicitly add a ruby2_keywords flag, Active Job will serialize any
hash included in the arguments array with keys as `_aj_symbol_keys`, and when
deserialized, it'd always be treated as a hash argument instead of keyword
arguments. Depending on the job, this might work fine, but if the job uses
keyword arguments, trying to execute the job with deserialized arguments
will fail. However, the opposite is not true: if the job accepts a hash
argument and we pass a hash with the ruby2_keywords flag, it'll work just
fine, as Active Job will serialize that with keys as `_aj_ruby2_keywords`,
so we take advantage of that to simplify the task definition and avoid
having to distinguish between args and kwargs.
It'll be handy in Mission Control when we want to show the configured
tasks because we need to aggregate them across dispatchers that might
have different configurations.
For example, if we don't keep finished jobs around.
This is useful for those who decide not to have FKs that ensure recurring
executions are deleted when jobs are cleared up, so they can just
call this method periodically to clear orphaned executions.
Somehow I hadn't noticed that until now ^_^U
Make the loop part of Poller. Allow other Runnable
processes that don't need an infinite loop.

I'm still not super happy with these concerns. This needs more
work that will come when I properly implement async mode. Right now
this is all interleaved in the modules and it shouldn't be.
@weilandia

Yes! We use it for HEY only (for now), and it has its own DB that shares the hardware with the app's main DB.

Thanks @rosa!

We have similar job volumes per day. How big is your dedicated queue DB? Did y'all consider sharing your main DB?

@rosa
Member Author

rosa commented Mar 20, 2024

We have similar job amounts/day--How big is your dedicated queue db? Did y'all consider sharing your main db?

We did consider it; in the beginning, when we started using Solid Queue in production, we were running it there (about ~1M jobs per day). We looked into what the write load would look like when moving all the jobs, compared it to the load from the application, and realised it'd be a little less than doubling the existing write load, leaving less margin for peaks. In the end, we decided to be cautious and moved it to its own DB, which shares the hardware with the main app's DB and other DBs (it's just a separate database), as we still had plenty of margin there in terms of IOPS supported by our disks, CPU and memory.

@rosa rosa merged commit 7fca542 into main Mar 20, 2024
6 checks passed
@rosa rosa deleted the cron-jobs-take-2 branch March 20, 2024 16:53
@@ -265,3 +267,48 @@ Solid Queue has been inspired by [resque](https://github.com/resque/resque) and

## License
Contributor

Shouldn't this be at the end?

Member Author

Oops, @brunoprietog thanks for spotting this! It should totally be at the end 😆

@n-at-han-k

Does BC run SolidQueue on a dedicated db?

Yes! We use it for HEY only (for now), and it has its own DB that shares the hardware with the app's main DB.

Am I missing something in the solid_queue README? Not quite sure how you'd set Solid Queue up to use a separate database from the one that stores Rails model data.

@rosa
Member Author

rosa commented Mar 29, 2024

@n-at-han-k you can use the connects_to config option described in this section:

# Use a separate DB for Solid Queue
config.solid_queue.connects_to = { database: { writing: :solid_queue_primary, reading: :solid_queue_replica } }
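That option assumes matching entries in config/database.yml. A sketch of what those entries might look like, assuming the names from the snippet above (your database names and defaults will differ):

```yaml
# config/database.yml (sketch; assumes a shared "default" anchor)
production:
  primary:
    <<: *default
    database: app_production
  solid_queue_primary:
    <<: *default
    database: solid_queue_production
  solid_queue_replica:
    <<: *default
    database: solid_queue_production
    replica: true
```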
