Frequently Asked Questions
General
What kinds of things should I use Celery for?
Answer: Queue everything and delight everyone is a good article describing why you'd use a queue in a web context.
These are some common use cases:
Running something in the background. For example, to finish the web request as soon as possible, then update the user's page incrementally. This gives the user the impression of good performance and "snappiness", even though the real work might actually take some time.
Running something after the web request has finished.
Making sure something is done, by executing it asynchronously and using retries.
Scheduling periodic work.
And to some degree:
Distributed computing.
Parallel execution.
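As a minimal sketch of the first two points (the broker URL, the send_welcome_email task, and the mail-sending body are illustrative assumptions, not part of the FAQ), a task runs in the background and is retried until it succeeds:

from celery import Celery

app = Celery('proj', broker='amqp://localhost')  # assumed broker URL

@app.task(bind=True, max_retries=3, default_retry_delay=60)
def send_welcome_email(self, user_id):
    try:
        # ... call your mail gateway here ...
        print(f"sending welcome mail to user {user_id}")
    except ConnectionError as exc:
        # Expected, transient failure: ask Celery to retry later.
        raise self.retry(exc=exc)

Calling send_welcome_email.delay(42) returns immediately, so the web request can finish while a worker does the actual work.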
Misconceptions
Does Celery really consist of 50,000 lines of code?
Answer: No, this and similarly large numbers have been reported at various locations.
The numbers as of this writing are:
core: 7,141 lines of code.
tests: 14,209 lines.
backends, contrib, compat utilities: 9,032 lines.
Lines of code isn't a useful metric, so even if Celery did consist of 50k lines of code you wouldn't be able to draw any conclusions from such a number.
Does Celery have many dependencies?
A common criticism is that Celery uses too many dependencies. The rationale behind such a fear is hard to imagine, especially considering code reuse as the established way to combat complexity in modern software development, and that the cost of adding dependencies is very low now that package managers like pip and PyPI make installing and maintaining dependencies a thing of the past.
Celery has replaced several dependencies along the way, and the current list of dependencies is:
celery
https://pypi.org/project/kombu/
Kombu is part of the Celery ecosystem and is the library used to send and receive messages. It's also the library that enables us to support many different message brokers. It's also used by the OpenStack project, and many others, validating the choice to separate it from the Celery code-base.
https://pypi.org/project/billiard/
Billiard is a fork of the Python multiprocessing module containing many performance and stability improvements. It's an eventual goal that these improvements will be merged back into Python one day.
It's also used for compatibility with older Python versions that don't come with the multiprocessing module.
kombu
Kombu depends on the following packages:
https://pypi.org/project/amqp/
The underlying pure-Python amqp client implementation. AMQP being the default broker, this is a natural dependency.
Note
To handle the dependencies for popular configuration choices, Celery defines a number of "bundle" packages; see Bundles.
Is Celery heavy-weight?
Is Celery dependent on pickle?
Answer: No, Celery can support any serialization scheme.
We have built-in support for JSON, YAML, Pickle, and msgpack. Every task is associated with a content type, so you can even send one task using pickle and another using JSON.
The default serialization format used to be pickle, but since version 4.0 the default is now JSON. If you require sending complex Python objects as task arguments, you can use pickle as the serialization format, but see the security notes in Serializers.
If you need to communicate with other languages you should use a serialization format suited to that task, which pretty much means any serializer that isn't pickle.
You can set a global default serializer, the default serializer for a particular task, or even the serializer to use when sending a single task instance.
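A rough sketch of those three levels (the app, the report task, and the chosen values are illustrative assumptions; the YAML serializer additionally needs PyYAML installed):

from celery import Celery

app = Celery('proj', broker='amqp://localhost')  # assumed broker URL

# Global default for every task sent by this app.
app.conf.task_serializer = 'json'
app.conf.accept_content = ['json', 'yaml']  # content types the workers will accept

@app.task(serializer='yaml')  # default for this particular task
def report(data):
    return data

# Override the serializer for a single call.
report.apply_async(args=[{'x': 1}], serializer='json')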
Is Celery for Django only?
Answer: No, you can use Celery with any framework, web or otherwise.
Do I have to use AMQP/RabbitMQ?
Answer: No, although using RabbitMQ is recommended you can also use Redis, SQS, or Qpid.
See Backends and Brokers for more information.
Redis as a broker won't perform as well as an AMQP broker, but the combination of RabbitMQ as broker and Redis as result store is commonly used. If you have strict reliability requirements you're encouraged to use RabbitMQ or another AMQP broker. Some transports also use polling, so they're likely to consume more resources. However, if you for some reason aren't able to use AMQP, feel free to use these alternatives. They will probably work fine for most use cases. Note that the above points aren't specific to Celery: if using Redis or a database as a queue worked fine for you before, it probably will now. You can always upgrade later if you need to.
Is Celery multilingual?
Answer: Yes.
worker is an implementation of Celery in Python. If the language has an AMQP client, there shouldn't be much work required to create a worker in your language. A Celery worker is just a program connecting to the broker to process messages.
Also, there's another way to be language-independent, and that's to use REST tasks: instead of your tasks being functions, they're URLs. With this information you can even create simple web servers that enable preloading of code. Simply expose an endpoint that performs an operation, and create a task that just performs an HTTP request to that endpoint.
Troubleshooting
MySQL is throwing deadlock errors, what can I do?
Answer: MySQL has its default isolation level set to REPEATABLE-READ; if you don't really need that, set it to READ-COMMITTED.
You can do that by adding the following to your my.cnf:
[mysqld]
transaction-isolation = READ-COMMITTED
For more information about InnoDB's transaction model, see MySQL - The InnoDB Transaction Model and Locking in the MySQL user manual.
(Thanks to Honza Kral and Anton Tsigularov for this solution)
The worker isn't doing anything, just hanging
Task results aren't reliably returning
Answer: If you're using the database backend for results, and in particular using MySQL, see MySQL is throwing deadlock errors, what can I do? above.
Why is Task.delay/apply*/the worker just hanging?
Answer: There's a bug in some AMQP clients that'll make them hang if they're not able to authenticate the current user, the password doesn't match, or the user doesn't have access to the virtual host specified. Be sure to check your broker logs (for RabbitMQ that's /var/log/rabbitmq/rabbit.log on most systems); they usually contain a message describing the reason.
Does it work on FreeBSD?
Answer: Depends.
When using the RabbitMQ (AMQP) or Redis transports it should work out of the box.
For other transports the compatibility prefork pool is used, which requires a working POSIX semaphore implementation; this has been enabled in FreeBSD by default since FreeBSD 8.x. For older versions of FreeBSD you have to enable POSIX semaphores in the kernel and manually recompile billiard.
Luckily, Viktor Petersson has written a tutorial to get you started with Celery on FreeBSD here: http://www.playingwithwire.com/2009/10/how-to-get-celeryd-to-work-on-freebsd/
I'm having IntegrityError: Duplicate Key errors. Why?
Answer: See MySQL is throwing deadlock errors, what can I do? above. Thanks to @howsthedotcom.
Why aren't my tasks processed?
Answer: With RabbitMQ you can see how many consumers are currently receiving tasks by running the following command:
$ rabbitmqctl list_queues -p <myvhost> name messages consumers
Listing queues ...
celery 2891 2
This shows that there are 2891 messages waiting to be processed in the task queue, and two consumers processing them.
One reason the queue is never emptied could be that you have a stale worker process taking the messages hostage. This could happen if the worker wasn't properly shut down.
When a message is received by a worker, the broker waits for it to be acknowledged before marking the message as processed. The broker won't re-send the message to another consumer until the original consumer is shut down properly.
If you hit this problem you have to kill all workers manually and restart them:
$ pkill 'celery worker'
$ # - If you don't have pkill use:
$ # ps auxww | awk '/celery worker/ {print $2}' | xargs kill
You may have to wait a while until all workers have finished executing tasks. If it's still hanging after a long time you can kill them by force with:
$ pkill -9 'celery worker'
$ # - If you don't have pkill use:
$ # ps auxww | awk '/celery worker/ {print $2}' | xargs kill -9
Why won't my Task run?
Answer: There might be syntax errors preventing the tasks module from being imported.
You can find out if Celery is able to run the task by executing the task manually:
>>> from myapp.tasks import MyPeriodicTask
>>> MyPeriodicTask.delay()
Watch the worker's log file to see if it's able to find the task, or if some other error is happening.
Why won't my periodic task run?
Answer: See Why won't my Task run? above.
How do I purge all waiting tasks?
Answer: You can use the celery purge command to purge all configured task queues:
$ celery -A proj purge
or programmatically:
>>> from proj.celery import app
>>> app.control.purge()
1753
If you only want to purge messages from a specific queue you have to use the AMQP API or the celery amqp utility:
$ celery -A proj amqp queue.purge <queue name>
The number 1753 is the number of messages deleted.
You can also start the worker with the --purge option enabled to purge messages when the worker starts.
I've purged messages, but there are still messages left in the queue?
Answer: Tasks are acknowledged (removed from the queue) as soon as they're actually executed. After the worker has received a task, it will take some time until it's actually executed, especially if there are a lot of tasks already waiting for execution. Messages that aren't acknowledged are held on to by the worker until it closes the connection to the broker (AMQP server). When that connection is closed (e.g., because the worker was stopped), the tasks will be re-sent by the broker to the next available worker (or the same worker when it has been restarted), so to properly purge the queue of waiting tasks you have to stop all the workers, and then purge the tasks using celery.control.purge().
Results
How do I get the result of a task if I have the ID that points there?
Answer: Use task.AsyncResult:
>>> result = my_task.AsyncResult(task_id)
>>> result.get()
This will give you an AsyncResult instance using the task's current result backend.
If you need to specify a custom result backend, or you want to use the current application's default backend, you can use app.AsyncResult:
>>> result = app.AsyncResult(task_id)
>>> result.get()
Security
Isn't using pickle a security concern?
Answer: Indeed, since Celery 4.0 the default serializer is JSON, to make sure people are choosing serializers consciously and are aware of this concern.
It's essential that you protect against unauthorized access to your broker, databases, and other services transmitting pickled data.
Note that this isn't just something you should be aware of with Celery; for example, Django also uses pickle for its cache client.
For task messages you can set the task_serializer setting to "json" or "yaml" instead of pickle.
Similarly, for task results you can set the result_serializer setting.
For more details about the formats used and the lookup order when checking what format to use for a task, see Serializers.
Can messages be encrypted?
Answer: Some AMQP brokers support using SSL (including RabbitMQ). You can enable this using the broker_use_ssl setting.
It's also possible to add additional encryption and security to messages; if you have a need for this, you should contact the mailing-list.
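A minimal sketch of enabling SSL toward the broker, assuming an app instance; the broker URL and certificate paths below are placeholders:

import ssl

from celery import Celery

app = Celery('proj', broker='amqps://rabbitmq.example.com:5671')  # assumed broker URL

# broker_use_ssl may be True, or a dict of options for the SSL connection.
app.conf.broker_use_ssl = {
    'keyfile': '/etc/ssl/private/client.key',   # placeholder paths
    'certfile': '/etc/ssl/certs/client.pem',
    'ca_certs': '/etc/ssl/certs/ca.pem',
    'cert_reqs': ssl.CERT_REQUIRED,
}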
Is it safe to run celery worker as root?
Answer: No!
We're not currently aware of any security issues, but it would be incredibly naive to assume that they don't exist, so running the Celery services (celery worker, celery beat, celeryev, etc) as an unprivileged user is recommended.
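For illustration (the celery user and group names are assumptions), the worker can be started by a privileged init system and told to drop privileges with the --uid and --gid options:
$ celery -A proj worker --uid=celery --gid=celery -l INFO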
Brokers
Why is RabbitMQ crashing?
Answer: RabbitMQ will crash if it runs out of memory. This will be fixed in a future release of RabbitMQ. Please refer to the RabbitMQ FAQ: https://www.rabbitmq.com/faq.html#node-runs-out-of-memory
Note
This is no longer the case: RabbitMQ versions 2.0 and above include a new persister that's tolerant to out-of-memory errors. RabbitMQ 2.1 or higher is recommended for Celery.
If you're still running an older version of RabbitMQ and experience crashes, then please upgrade!
Misconfiguration of Celery can eventually lead to a crash on older versions of RabbitMQ. Even if it doesn't crash, it can still consume a lot of resources, so it's important that you're aware of the common pitfalls.
Events.
Running the worker with the -E option will send messages for events happening inside of the worker.
Events should only be enabled if you have an active monitor consuming them, or if you purge the event queue periodically.
AMQP backend results.
When running with the AMQP result backend, every task result will be sent as a message. If you don't collect these results, they will build up and RabbitMQ will eventually run out of memory.
This result backend is now deprecated, so you shouldn't be using it. Use either the RPC backend for rpc-style calls, or a persistent backend if you need multi-consumer access to results.
Results expire after 1 day by default. It may be a good idea to lower this value by configuring the result_expires setting.
If you don't use the results for a task, make sure you set the ignore_result option:

@app.task(ignore_result=True)
def mytask():
    pass

class MyTask(Task):
    ignore_result = True
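A short sketch of lowering the expiry, assuming an app instance (one hour is an arbitrary example value):

app.conf.result_expires = 3600  # seconds; stored results are deleted after one hour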
Can I use Celery with ActiveMQ/STOMP?
Answer: No. It used to be supported by https://pypi.org/project/Carrot/ (our old messaging library) but isn't currently supported in https://pypi.org/project/Kombu/ (our new messaging library).
What features aren't supported when not using an AMQP broker?
This is an incomplete list of features not available when using the virtual transports:
Remote control commands (supported only by Redis).
Monitoring with events may not work in all virtual transports.
The header and fanout exchange types (fanout is supported by Redis).
Tasks
How can I reuse the same connection when calling tasks?
Answer: See the broker_pool_limit setting.
The connection pool is enabled by default since version 2.5.
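A one-line sketch, assuming an app instance (10 is just an example value; setting it to None disables the pool):

app.conf.broker_pool_limit = 10  # maximum number of broker connections to keep open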
sudo in a subprocess returns None
There's a sudo configuration option that makes it illegal for processes without a tty to run sudo:
Defaults requiretty
If you have this configuration in your /etc/sudoers file, then tasks won't be able to call sudo when the worker is running as a daemon. If you want to enable that, then you need to remove the line from /etc/sudoers.
Why do workers delete tasks from the queue if they're unable to process them?
Answer:
The worker rejects unknown tasks, messages with encoding errors, and messages that don't contain the proper fields (as per the task message protocol).
If it didn't reject them they could be redelivered again and again, causing a loop.
Recent versions of RabbitMQ have the ability to configure a dead-letter queue for an exchange, so that rejected messages are moved there.
Can I call a task by name?
Answer: Yes, use app.send_task().
You can also call a task by name, from any language, using an AMQP client:
>>> app.send_task('tasks.add', args=[2, 2], kwargs={})
<AsyncResult: 373550e8-b9a0-4666-bc61-ace01fa4f91d>
To use chain, chord, or group with tasks called by name, use the Celery.signature() method:
>>> chain(
...     app.signature('tasks.add', args=[2, 2], kwargs={}),
...     app.signature('tasks.add', args=[1, 1], kwargs={})
... ).apply_async()
<AsyncResult: e9d52312-c161-46f0-9013-2713e6df812d>
Can I get the task id of the current task?
Answer: Yes, the current id and more is available in the task request:

@app.task(bind=True)
def mytask(self):
    cache.set(self.request.id, "Running")

For more information see Task Requests.
If you don't have a reference to the task instance you can use app.current_task:
>>> app.current_task.request.id
But note that this will be any task, be it one executed by the worker, a task called directly by that task, or a task called eagerly.
To get the task currently being worked on specifically, use current_worker_task:
>>> app.current_worker_task.request.id
Note
Both current_task and current_worker_task can be None.
Can I specify a custom task_id?
Answer: Yes, use the task_id argument to Task.apply_async():
>>> task.apply_async(args, kwargs, task_id='…')
Can I use decorators with tasks?
Can I use natural task ids?
Answer: Yes, but make sure it's unique, as the behavior for two tasks existing with the same id is undefined.
The world will probably not explode, but they can definitely overwrite each other's results.
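For illustration (the import_contacts task and the id format are made-up examples), a natural key can double as the task id so that re-submitting the same logical job reuses the same id:
>>> import_contacts.apply_async(args=[user_id], task_id=f'import-contacts-{user_id}')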
Can I run a task once another task has finished?
Answer: Yes, you can safely launch a task inside a task.
A common pattern is to add callbacks to tasks:

from celery.utils.log import get_task_logger

logger = get_task_logger(__name__)

@app.task
def add(x, y):
    return x + y

@app.task(ignore_result=True)
def log_result(result):
    logger.info("log_result got: %r", result)

Invocation:
>>> (add.s(2, 2) | log_result.s()).delay()
See Canvas: Designing Work-flows for more information.
Can I cancel the execution of a task?
Answer: Yes, use result.revoke():
>>> result = add.apply_async(args=[2, 2], countdown=120)
>>> result.revoke()
or if you only have the task id:
>>> from proj.celery import app
>>> app.control.revoke(task_id)
The latter also supports passing a list of task ids as argument.
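For example (the ids below are placeholders):
>>> app.control.revoke(['d9078da5-9915-40a0-bfa1-392c7bde42ed',
...                     'f565793e-b041-4b2b-9ca4-dca22762a55d'])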
Why aren't my remote control commands received by all workers?
Answer: To receive broadcast remote control commands, every worker node creates a unique queue name based on the nodename of the worker.
If you have more than one worker with the same host name, the control commands will be received in round-robin between them.
To work around this you can explicitly set the nodename for every worker using the -n argument to worker:
$ celery -A proj worker -n worker1@%h
$ celery -A proj worker -n worker2@%h
where %h expands into the current hostname.
Can I send some tasks to only some servers?
Answer: Yes, you can route tasks to one or more workers, using different message routing topologies, and a worker instance can bind to multiple queues.
See Routing Tasks for more information.
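A minimal sketch of queue-based routing (assuming an app instance; the tasks.generate_report task and the reports queue name are made up):

app.conf.task_routes = {
    'tasks.generate_report': {'queue': 'reports'},
}

A worker started with -Q then consumes only from that queue:
$ celery -A proj worker -Q reports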
Can I disable prefetching of tasks?
Answer: Maybe! The AMQP term "prefetch" is confusing, as it's only used to describe the task prefetching limit. There's no actual prefetching involved.
Disabling the prefetch limits is possible, but that means the worker will consume as many tasks as it can, as fast as possible.
A discussion on prefetch limits, and the configuration settings for a worker that only reserves one task at a time, can be found at Prefetch Limits.
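As a sketch of the "reserve only one task at a time" configuration discussed there (assuming an app instance):

app.conf.task_acks_late = True            # acknowledge only after the task has run
app.conf.worker_prefetch_multiplier = 1   # reserve one message per worker process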
Can I change the interval of a periodic task at runtime?
Answer: Yes, you can use the Django database scheduler, or you can create a new schedule subclass and override is_due():

from celery.schedules import schedule

class my_schedule(schedule):

    def is_due(self, last_run_at):
        return run_now, next_time_to_check
Does Celery support task priorities?
Answer: Yes, RabbitMQ supports priorities since version 3.5.0, and the Redis transport emulates priority support.
You can also prioritize work by routing high priority tasks to different workers. In the real world this usually works better than per message priorities. You can use this in combination with rate limiting, and per message priorities to achieve a responsive system.
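As an illustration (assuming RabbitMQ as the broker, an app instance, and a hypothetical add task; the numbers are arbitrary), queues need a maximum priority declared before per-message priorities take effect, and individual calls can then pass one:

app.conf.task_queue_max_priority = 10  # declare queues with x-max-priority
app.conf.task_default_priority = 5     # priority used when a call doesn't set one

add.apply_async(args=[2, 2], priority=9)  # priority for this one call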
Should I use retry or acks_late?
Answer: Depends. It's not necessarily one or the other, you may want to use both.
Task.retry is used to retry tasks, notably for expected errors that are catchable with a try block. The AMQP transaction isn't used for these errors: if the task raises an exception it's still acknowledged!
The acks_late setting would be used when you need the task to be executed again if the worker (for some reason) crashes mid-execution. It's important to note that the worker isn't known to crash, and if it does it's usually an unrecoverable error that requires human intervention (a bug in the worker, or in the task code).
In an ideal world you could safely retry any task that's failed, but this is rarely the case. Imagine the following task:

@app.task
def process_upload(filename, tmpfile):
    # Increment a file count stored in a database
    increment_file_counter()
    add_file_metadata_to_db(filename, tmpfile)
    copy_file_to_destination(filename, tmpfile)

If this crashed in the middle of copying the file to its destination, the world would contain incomplete state. This isn't a critical scenario of course, but you can probably imagine something far more sinister. So for ease of programming we have less reliability; it's a good default, and users who require it and know what they're doing can still enable acks_late (and in the future hopefully use manual acknowledgment).
In addition, Task.retry has features not available in AMQP transactions: delay between retries, max retries, etc.
So use retry for Python errors, and if your task is idempotent combine that with acks_late if that level of reliability is required.
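A rough sketch of combining the two (assuming an app instance; the sync_page task is made up, and fetching a URL stands in for idempotent work):

import urllib.request

@app.task(bind=True, acks_late=True, max_retries=5, default_retry_delay=30)
def sync_page(self, url):
    try:
        # Idempotent work: re-running it is safe if the worker crashes mid-way.
        return urllib.request.urlopen(url).read().decode()
    except OSError as exc:  # expected, catchable error: use Task.retry
        raise self.retry(exc=exc)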
Can I schedule tasks to execute at a specific time?
Answer: Yes. You can use the eta argument of Task.apply_async().
Note that using distant eta times isn't recommended; in such cases periodic tasks should be preferred.
See ETA and Countdown for more details.
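For example (assuming a hypothetical add task; eta should be a timezone-aware datetime):
>>> from datetime import datetime, timedelta, timezone
>>> add.apply_async(args=[2, 2], eta=datetime.now(timezone.utc) + timedelta(hours=1))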
Can I safely shut down the worker?
Answer: Yes, use the TERM signal.
This will tell the worker to finish all currently executing jobs and shut down as soon as possible. No tasks should be lost, even with experimental transports, as long as the shutdown completes.
You should never stop the worker with the KILL signal (kill -9), unless you've tried TERM a few times and waited a few minutes to let it get a chance to shut down.
Also make sure you kill the main worker process only, not any of its child processes. You can direct a kill signal to a specific child process if you know the process is currently executing a task the worker shutdown is depending on, but this also means that a WorkerLostError state will be set for the task, so the task won't run again.
Identifying the type of process is easier if you have the setproctitle module (https://pypi.org/project/setproctitle/) installed:
$ pip install setproctitle
With this library installed you'll be able to see the type of process in ps listings, but the worker must be restarted for this to take effect.
Can I run the worker in the background on [platform]?
Django
What purpose do the database tables created by django-celery-beat have?
When the database-backed schedule is used, the periodic task schedule is taken from the PeriodicTask model; there are also several other helper tables (IntervalSchedule, CrontabSchedule, PeriodicTasks).
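These tables are created by the django_celery_beat app's migrations. A typical setup (sketched from the django-celery-beat documentation; the proj project name is an assumption) adds 'django_celery_beat' to INSTALLED_APPS, runs the migrations, and starts beat with the database scheduler:
$ pip install django-celery-beat
$ python manage.py migrate django_celery_beat
$ celery -A proj beat --scheduler django_celery_beat.schedulers:DatabaseScheduler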
What purpose do the database tables created by django-celery-results have?
The Django database result backend extension requires two extra models: TaskResult and GroupResult.
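They ship with the django_celery_results app. A typical setup (sketched from the django-celery-results documentation) adds 'django_celery_results' to INSTALLED_APPS, runs its migrations, and sets CELERY_RESULT_BACKEND = 'django-db' in the Django settings:
$ pip install django-celery-results
$ python manage.py migrate django_celery_results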
Windows
Does Celery support Windows?
Answer: No.
Since Celery 4.x, Windows is no longer supported due to lack of resources.
But it may still work and we are happy to accept patches.