Archive

The Dulin Report

Browsable archive from the WordPress export.

Results (56)

On Amazon Prime Video’s move to a monolith May 14, 2023 One size does not fit all: neither cloud nor on-prem Apr 10, 2023 Comparing AWS SQS, SNS, and Kinesis: A Technical Breakdown for Enterprise Developers Feb 11, 2023 Stop Shakespearizing Sep 16, 2022 Using GNU Make with JavaScript and Node.js to build AWS Lambda functions Sep 4, 2022 Monolithic repository vs a monolith Aug 23, 2022 Keep your caching simple and inexpensive Jun 12, 2022 Java is no longer relevant May 29, 2022 There is no such thing as one grand unified full-stack programming language May 27, 2022 Best practices for building a microservice architecture Apr 25, 2022 TypeScript is a productivity problem in and of itself Apr 20, 2022 In most cases, there is no need for NoSQL Apr 18, 2022 Node.js and Lambda deployment size restrictions Mar 1, 2021 Should we abolish Section 230 ? Feb 1, 2021 TDWI 2019: Architecting Modern Big Data API Ecosystems May 30, 2019 Microsoft acquires Citus Data Jan 26, 2019 Which AWS messaging and queuing service to use? Jan 25, 2019 Using Markov Chain Generator to create Donald Trump's state of union speech Jan 20, 2019 Let’s talk cloud neutrality Sep 17, 2018 A conservative version of Facebook? Aug 30, 2018 TypeScript starts where JavaScript leaves off Aug 2, 2017 Design patterns in TypeScript: Chain of Responsibility Jul 22, 2017 I built an ultimate development environment for iPad Pro. Here is how. Jul 21, 2017 Rather than innovating Walmart bullies their tech vendors to leave AWS Jun 27, 2017 Emails, politics, and common sense Jan 14, 2017 Don't trust your cloud service until you've read the terms Sep 27, 2016 I am addicted to Medium, and I am tempted to move my entire blog to it Sep 9, 2016 What I learned from using Amazon Alexa for a month Sep 7, 2016 Amazon Alexa is eating the retailers alive Jun 22, 2016 In search for the mythical neutrality among top-tier public cloud providers Jun 18, 2016 What can we learn from the last week's salesforce.com outage ? May 15, 2016 Why it makes perfect sense for Dropbox to leave AWS May 7, 2016 Managed IT is not the future of the cloud Apr 9, 2016 JavaScript as the language of the cloud Feb 20, 2016 Our civilization has a single point of failure Dec 16, 2015 Operations costs are the Achille's heel of NoSQL Nov 23, 2015 IT departments must transform in the face of the cloud revolution Nov 9, 2015 Setting Up Cross-Region Replication of AWS RDS for PostgreSQL Sep 12, 2015 Top Ten Differences Between ActiveMQ and Amazon SQS Sep 5, 2015 Ten Questions to Consider Before Choosing Cassandra Aug 8, 2015 The Three Myths About JavaScript Simplicity Jul 10, 2015 Big Data is not all about Hadoop May 30, 2015 Smart IT Departments Own Their Business API and Take Ownership of Data Governance May 13, 2015 Guaranteeing Delivery of Messages with AWS SQS May 9, 2015 We Need a Cloud Version of Cassandra May 7, 2015 Building a Supercomputer in AWS: Is it even worth it ? Apr 13, 2015 Ordered Sets and Logs in Cassandra vs SQL Apr 8, 2015 Exploration of the Software Engineering as a Profession Apr 8, 2015 Finding Unused Elastic Load Balancers Mar 24, 2015 Where AWS Elastic BeanStalk Could be Better Mar 3, 2015 Trying to Replace Cassandra with DynamoDB ? Not so fast Feb 2, 2015 Why I am Tempted to Replace Cassandra With DynamoDB Nov 13, 2014 How We Overcomplicated Web Design Oct 8, 2014 Infrastructure in the cloud vs on-premise Aug 25, 2014 Cassandra: a key puzzle piece in a design for failure Aug 18, 2014 Cassandra: Lessons Learned Jun 6, 2014

Which AWS messaging and queuing service to use?

January 25, 2019

AWS offers a number of messaging and queuing services, each one with its own pros and cons. This post describes each service and when to use it.

AWS SQS: managed durable queues

img_0528

AWS SQS stands for Simple Queue Service. It is as simple as it gets -- you can send a message to a queue, you can receive it from a queue, and you can acknowledge the message by deleting it from the queue. Only one consumer can process the same message at a time. It is a native AWS service that requires the use of AWS REST API or an AWS SDK for your programming language.

It is a genuinely server-less managed service. The developers don't need to worry about scalability, storage, or performance. SQS queues scale as required. You only pay for what you use which turns out to be pennies per million messages.

SQS integrates well with other AWS services such as AWS Lambda. For example, SQS will trigger a lambda function for each message in the queue. While SQS is very simple in that it only supports "send," "receive," and "delete" operations, I recommend abstracting these routines at a bit of a higher level.

SQS scales horizontally. Roughly speaking, if you have a single consumer able to process, say, 1000 messages per second, two instances of the consumer will process 2000 messages per second. If you use Lambda triggers, AWS will scale both the queues and the consumers for you.

SQS is durable and supports Dead Letter Queues and configurable re-delivery policy. If for some reason your consumer took a message off the queue but failed to correctly process it, SQS will re-attempt delivery a few times (configurable) before eventually delivering the failed message to the Dead Letter Queue.

You can find SQS documentation here.

Also see: Top ten differences between ActiveMQ and AWS SQS

When to use SQS?



  • Your architecture requires asynchronous processing with guarantees that all tasks get processed

  • You need a durable and reliable queuing solution that will scale at low cost and require no management on your part

  • You are building a brand new AWS-native application

  • You are porting an existing application to server-less architecture

  • You are adding queueing functionality to your architecture in AWS


When not to use SQS?



  • Your application is sensitive to the latency associated with using AWS APIs via HTTP protocol.


AWS SNS

img_0530

AWS SNS is a Simple Notification Service. It delivers messages published to topics to one or more destinations. Destinations can be SQS queues, Lambda functions, HTTP POST endpoints, SMS text messages, mobile device push notifications, and more.

SNS offers some degree of durability. Messages are saved before producers get a confirmation. There is a configurable re-delivery policy depending on the destination. Applications requiring assurances that messages will eventually get delivered to their destinations should use SNS to route to SQS and then consume from SQS.

When to use SNS?



  • Your architecture requires delivering messages asynchronously to multiple consumers at a time but does not require journaling or strong delivery guarantees (if you need delivery guarantees you should use SQS queues)

  • You need to trigger asynchronous events across different parts of your application

  • You need to send mobile push notifications in a cross-platform manner

  • You need to send SMS text messages

  • You are building a brand new AWS-native application

  • You are porting an existing application to server-less architecture


When not to use SNS?



  • Your application sensitive to the latency associated with using AWS APIs via HTTP protocol and you have no need for specialized functions like SMS or mobile push


AWS Kinesis

img_0532

AWS Kinesis is a managed data streaming service. Although it does support video and multi-media streams, it is beyond the scope of this article.

Producers put data on a stream using Kinesis client library. Multiple different Kinesis data stream consumers can then process data from the stream concurrently. This is an important distinction from queues where only one kind of a consumer can take messages off the same queue.

For example, one consumer can archive product order data while another analyzes data for fraud in real-time, and yet another one uses the data to dynamically update pricing and inventory data.

It is possible to also arrange streams into direct acyclic graphs (DAG) such that messages are forwarded to other streams to create complex stream processing. This is another distinction from queues.

Kinesis integrates well with other AWS services, including AWS Lambda. Kinesis can trigger Lambda functions in response to messages. Additionally, AWS services such as S3 or Dynamo can publish data to Kinesis streams.

When to use Kinesis?



  • Your architecture requires complex real-time processing of data streams

  • You have complex real-time analytics requirements

  • You need to trigger asynchronous events across different parts of your application

  • You are building a brand new AWS-native application

  • You are adding stream-processing functionality to your AWS-native application

  • You are porting an existing application to AWS-native architecture


When not to use Kinesis?



  • You only need simple queues

  • You just need simple topics

  • Your architecture is sensitive to latencies. If that is the case consider AWS MSK (see below).


AWS MSK (managed Kafka)

img_0532

AWS MSK stands for "AWS Managed Streaming for Kafka." Conceptually, Kafka is similar to Kinesis: producers publish messages on Kafka topics (streams), while multiple different consumers can process messages concurrently.

Kafka is famous but can be "Kafkaesque" to maintain in production. MSK takes a lot of the operational difficulties out of running a Kafka cluster. MSK takes care of various maintenance tasks such as backups and routine upgrades.

MSK works similarly to AWS RDS (managed database service). MSK provisions and manages the underlying compute infrastructure on your behalf. You pay for this compute capacity regardless of the workload on your Kafka cluster. In other words, MSK is a managed service, but it is not server-less.

When to use AWS MSK?



  • You are migrating an existing Kafka-based application to AWS

  • You already use Kafka and have a significant investment in Kafka-dependent codebase

  • You want to reduce operational costs and difficulties out of running a Kafka cluster in AWS

  • Your application is sensitive to the latency associated with using AWS APIs via HTTP protocol.


When not to use AWS MSK?

There is little reason to introduce Kafka to an application that is not already using it and does not have requirements to use Kafka specifically (for example due to latency and performance considerations). Consider using Kinesis instead.

  • You are building a brand new AWS-native application

  • You are adding streaming functionality to an existing AWS-native application


AWS MQ (managed Apache ActiveMQ)

AWS MQ is a managed ActiveMQ service. Similar to MSK for Kafka, it takes operational complexity out of running an ActiveMQ cluster. It supports JMS, NMS, AMQP, STOMP, MQTT and other industry standard messaging protocols. Most legacy applications do not require significant changes to work in AWS.

AWS MQ supports various broker topologies, including mesh networks. By being ActiveMQ-based, it supports the full range of services including queues, topics, and multiple configurations of durability, re-delivery, and replication.

Also see: Top ten differences between ActiveMQ and AWS SQS (note: I wrote that post in 2015, long before Amazon had a managed ActiveMQ service)

When to use AWS MQ (Apache ActiveMQ)?



  • You are migrating an existing JMS or NMS-based application to AWS

  • You already use ActiveMQ or a compatible product and have a significant investment in the codebase

  • You want to reduce operational costs and difficulties out of running an ActiveMQ cluster in AWS

  • Your application is sensitive to the latency associated with using AWS APIs via HTTP protocol.


When not to use AWS MQ?



  • You are adding message queues to your AWS-native architecture and need something simple. Consider using SQS instead.

  • Your application is not sensitive to the latency associated with AWS SQS API.






Liked this post? In September 2020 I will ride my bicycle for 275 miles from Boston to NYC to help end AIDS. Please consider contributing to my fundraising goals.