system-design-primer/resources/noat.cards/Asynchronism.md

49 lines
3.2 KiB
Markdown
Raw Normal View History

2021-03-14 13:08:05 +03:00
+++
noatcards = True
isdraft = False
2021-03-26 18:15:35 +03:00
weight = 100
2021-03-14 13:08:05 +03:00
+++
# Asynchronism
2021-03-21 13:45:39 +03:00
![](https://camo.githubusercontent.com/c01ec137453216bbc188e3a8f16da39ec9131234/687474703a2f2f692e696d6775722e636f6d2f353447597353782e706e67)
[Source: Intro to architecting systems for scale](http://lethain.com/introduction-to-architecting-systems-for-scale/#platform_layer)
2021-03-14 13:08:05 +03:00
Asynchronous workflows help reduce request times for expensive operations that would otherwise be performed in-line. They can also help by doing time-consuming work in advance, such as periodic aggregation of data.
2021-03-21 13:45:39 +03:00
## Message queues
2021-03-14 13:08:05 +03:00
Message queues receive, hold, and deliver messages. If an operation is too slow to perform inline, you can use a message queue with the following workflow:
2021-03-14 18:13:28 +03:00
- An application publishes a job to the queue, then notifies the user of job status
- A worker picks up the job from the queue, processes it, then signals the job is complete
2021-03-14 13:08:05 +03:00
The user is not blocked and the job is processed in the background. During this time, the client might optionally do a small amount of processing to make it seem like the task has completed. For example, if posting a tweet, the tweet could be instantly posted to your timeline, but it could take some time before your tweet is actually delivered to all of your followers.
2021-03-21 13:12:09 +03:00
Redis is useful as a simple message broker but messages can be lost.
2021-03-14 13:08:05 +03:00
2021-03-21 13:12:09 +03:00
RabbitMQ is popular but requires you to adapt to the 'AMQP' protocol and manage your own nodes.
2021-03-14 13:08:05 +03:00
Amazon SQS, is hosted but can have high latency and has the possibility of messages being delivered twice.
2021-03-21 13:45:39 +03:00
## Task queues
2021-03-14 13:08:05 +03:00
Tasks queues receive tasks and their related data, runs them, then delivers their results. They can support scheduling and can be used to run computationally-intensive jobs in the background.
2021-03-21 13:12:09 +03:00
Celery has support for scheduling and primarily has python support.
2021-03-14 13:08:05 +03:00
2021-03-21 13:45:39 +03:00
## Back pressure
2021-03-14 13:08:05 +03:00
2021-03-21 13:12:09 +03:00
If queues start to grow significantly, the queue size can become larger than memory, resulting in cache misses, disk reads, and even slower performance. [Back pressure](http://mechanical-sympathy.blogspot.com/2012/05/apply-back-pressure-when-overloaded.html) can help by limiting the queue size, thereby maintaining a high throughput rate and good response times for jobs already in the queue. Once the queue fills up, clients get a server busy or HTTP 503 status code to try again later. Clients can retry the request at a later time, perhaps with [exponential backoff](https://en.wikipedia.org/wiki/Exponential_backoff) .
2021-03-14 13:08:05 +03:00
2021-03-21 13:45:39 +03:00
## Disadvantage(s) : asynchronism
2021-03-14 13:08:05 +03:00
2021-03-14 18:13:28 +03:00
- Use cases such as inexpensive calculations and realtime workflows might be better suited for synchronous operations, as introducing queues can add delays and complexity.
2021-03-14 13:08:05 +03:00
2021-03-21 13:45:39 +03:00
## Source(s) and further reading
2021-03-14 13:08:05 +03:00
2021-03-14 18:13:28 +03:00
- [It's all a numbers game](https://www.youtube.com/watch?v=1KRYH75wgy4)
- [Applying back pressure when overloaded](http://mechanical-sympathy.blogspot.com/2012/05/apply-back-pressure-when-overloaded.html)
- [Little's law](https://en.wikipedia.org/wiki/Little%27s_law)
- [What is the difference between a message queue and a task queue?](https://www.quora.com/What-is-the-difference-between-a-message-queue-and-a-task-queue-Why-would-a-task-queue-require-a-message-broker-like-RabbitMQ-Redis-Celery-or-IronMQ-to-function)