celery-pool-asyncio

Celery pool to run coroutine tasks


Keywords
celery, asyncio, pool, python3, background-jobs, concurrency
License
Apache-2.0
Install
pip install celery-pool-asyncio==0.1.12

Documentation

Celery Pool AsyncIO

Logo

  • Free software: Apache Software License 2.0

Features

import asyncio
from celery import Celery

# celery_pool_asyncio importing is optional
# It imports when you run worker or beat if you define pool or scheduler
# but it does not imports when you open REPL or when you run web application.
# If you want to apply monkey patches anyway to make identical environment
# when you use REPL or run web application app it's good idea to import
# celery_pool_asyncio module
import celery_pool_asyncio  # noqa
# Sometimes noqa does not disable linter (Spyder IDE)
celery_pool_asyncio.__package__


app = Celery()


@app.task(
    bind=True,
    soft_time_limit=42,  # raises celery.exceptions.SoftTimeLimitExceeded inside the coroutine
    time_limit=300,  # breaks coroutine execution with futures.TimeoutError
)
async def my_task(self, *args, **kwargs):
    await asyncio.sleep(5)


@app.task
async def my_simple_task(*args, **kwargs):
    await asyncio.sleep(5)

Then run celery:

$ celery worker -A hello_async_celery.app -P celery_pool_asyncio:TaskPool

Monkey patching: wtf and why

There are many monkey patches should be applied to make application working, and some of them should be applied as early as possible. You are able to switch off any of them by setting environment variable CPA_MONKEY_DENY. Remember you should have a great reason to do it.

Except critical for work features it allows:

# await data sending to broker
async_result = await my_simple_task.delay()

# await wainting for AsyncResult
result = await async_result.get()

You can manually disable any of them by enumerating it comma separated:

$ env CPA_MONKEY_DENY=CELERY.SEND_TASK,ALL_BACKENDS celery worker -A hello_async_celery.app -P celery_pool_asyncio:TaskPool

Disabling is available for:

  • CELERY.SEND_TASK
  • WORKCONTROLLER.USE_EVENTLOOP
  • BASERESULTCONSUMER.WAIT_FOR_PENDING
  • BASERESULTCONSUMER.DRAIN_EVENTS_UNTIL
  • ASYNCBACKENDMIXIN.WAIT_FOR_PENDING
  • ALL_BACKENDS
  • BEAT.SERVICE.START
  • BEAT.SERVICE.STOP
  • BUILD_TRACER
  • KOMBU.UTILS.COMPAT
  • RPC.RESULTCONSUMER.DRAIN_EVENTS
  • AMQPBACKEND.DRAIN_EVENTS
  • AMQPBACKEND.GET_MANY
  • AMQP_BACKEND
  • RPC_BACKEND

Scheduling

Default scheduler doesn't work. PersistentScheduler is subclass of default celery scheduler.

Running celery with scheduler:

$ celery worker -A hello_async_celery.app -P celery_pool_asyncio:TaskPool --scheduler celery_pool_asyncio:PersistentScheduler
$ celery beat -A hello_async_celery.app --scheduler celery_pool_asyncio:PersistentScheduler

Embeding also supported:

$ celery worker -A hello_async_celery.app -P celery_pool_asyncio:TaskPool --scheduler celery_pool_asyncio:PersistentScheduler -B

WARNING: embeded scheduler startup is not stable. It starts correctly in ~50% of cases. It looks like race condition. But after correct startup it works well. That's why it's good idea to run scheduler in separated process.

More examples

There is an example project uses celery-pool-asyncio: https://github.com/kai3341/celery-decorator-taskcls-example