Archive

The Dulin Report

Browsable archive from the WordPress export.

Results (56)

On Amazon Prime Video’s move to a monolith May 14, 2023 One size does not fit all: neither cloud nor on-prem Apr 10, 2023 Comparing AWS SQS, SNS, and Kinesis: A Technical Breakdown for Enterprise Developers Feb 11, 2023 Stop Shakespearizing Sep 16, 2022 Using GNU Make with JavaScript and Node.js to build AWS Lambda functions Sep 4, 2022 Monolithic repository vs a monolith Aug 23, 2022 Keep your caching simple and inexpensive Jun 12, 2022 Java is no longer relevant May 29, 2022 There is no such thing as one grand unified full-stack programming language May 27, 2022 Best practices for building a microservice architecture Apr 25, 2022 TypeScript is a productivity problem in and of itself Apr 20, 2022 In most cases, there is no need for NoSQL Apr 18, 2022 Node.js and Lambda deployment size restrictions Mar 1, 2021 Should we abolish Section 230 ? Feb 1, 2021 TDWI 2019: Architecting Modern Big Data API Ecosystems May 30, 2019 Microsoft acquires Citus Data Jan 26, 2019 Which AWS messaging and queuing service to use? Jan 25, 2019 Using Markov Chain Generator to create Donald Trump's state of union speech Jan 20, 2019 Let’s talk cloud neutrality Sep 17, 2018 A conservative version of Facebook? Aug 30, 2018 TypeScript starts where JavaScript leaves off Aug 2, 2017 Design patterns in TypeScript: Chain of Responsibility Jul 22, 2017 I built an ultimate development environment for iPad Pro. Here is how. Jul 21, 2017 Rather than innovating Walmart bullies their tech vendors to leave AWS Jun 27, 2017 Emails, politics, and common sense Jan 14, 2017 Don't trust your cloud service until you've read the terms Sep 27, 2016 I am addicted to Medium, and I am tempted to move my entire blog to it Sep 9, 2016 What I learned from using Amazon Alexa for a month Sep 7, 2016 Amazon Alexa is eating the retailers alive Jun 22, 2016 In search for the mythical neutrality among top-tier public cloud providers Jun 18, 2016 What can we learn from the last week's salesforce.com outage ? May 15, 2016 Why it makes perfect sense for Dropbox to leave AWS May 7, 2016 Managed IT is not the future of the cloud Apr 9, 2016 JavaScript as the language of the cloud Feb 20, 2016 Our civilization has a single point of failure Dec 16, 2015 Operations costs are the Achille's heel of NoSQL Nov 23, 2015 IT departments must transform in the face of the cloud revolution Nov 9, 2015 Setting Up Cross-Region Replication of AWS RDS for PostgreSQL Sep 12, 2015 Top Ten Differences Between ActiveMQ and Amazon SQS Sep 5, 2015 Ten Questions to Consider Before Choosing Cassandra Aug 8, 2015 The Three Myths About JavaScript Simplicity Jul 10, 2015 Big Data is not all about Hadoop May 30, 2015 Smart IT Departments Own Their Business API and Take Ownership of Data Governance May 13, 2015 Guaranteeing Delivery of Messages with AWS SQS May 9, 2015 We Need a Cloud Version of Cassandra May 7, 2015 Building a Supercomputer in AWS: Is it even worth it ? Apr 13, 2015 Ordered Sets and Logs in Cassandra vs SQL Apr 8, 2015 Exploration of the Software Engineering as a Profession Apr 8, 2015 Finding Unused Elastic Load Balancers Mar 24, 2015 Where AWS Elastic BeanStalk Could be Better Mar 3, 2015 Trying to Replace Cassandra with DynamoDB ? Not so fast Feb 2, 2015 Why I am Tempted to Replace Cassandra With DynamoDB Nov 13, 2014 How We Overcomplicated Web Design Oct 8, 2014 Infrastructure in the cloud vs on-premise Aug 25, 2014 Cassandra: a key puzzle piece in a design for failure Aug 18, 2014 Cassandra: Lessons Learned Jun 6, 2014

Best practices for building a microservice architecture

April 25, 2022

I have been building micro-service enterprise applications my entire career - 25 years as of this writing. Over the years, I learned that there is a balance between pure adherence to design patterns and practice.



Most micro-service architecture articles, such as this one about what they do at Netflix appear to consider enterprise-scale architectures. Enterprise architects must consider enterprise as a whole, but it is independent, self-contained applications that make up an enterprise architecture. Each such application contributes APIs to the enterprise, but microservices drive it's internal workings.




Loose coupling with stable shared contracts




A change in one micro-service should not require changing others. By declaring and adhering to API contracts, you balance continuous evolution and backward compatibility.



The API contracts are not merely human-readable documentation, though human readability is essential. The contracts must be machine-readable and usable for runtime and static validation of API requests and responses. 



If you standardize your application ecosystem on a specific programming language and platform, then use that language to declare and reuse all interfaces. On the other hand, if you have a multi-lingual architecture in which components are written in different languages, you can utilize a cross-platform mechanism for declaring data structures and generating code, such as Apache Thriftprotobufs or OpenAPI / swagger.




Datastore abstraction




While a separate data store per micro-service may seem like a good idea from the micro-service perspective, it inevitably turns out to be a horrible idea from data integrity, transactions and reporting. 



I do not subscribe to the philosophy that each microservice should have its datastore. Instead, I prefer an architecture in which the datastore is abstracted from all microservices. 



I highly recommend using GraphQL for queries and mutations as an abstraction layer. The abstraction layer can be a set of micro-services hidden behind a GraphQL URL endpoint. The underlying data store itself can be flexible and adapted as the project evolves without having to rebuild any of the business logic in the micro-services.



Moreover, GraphQL imposes a degree of discipline on managing the backward compatibility of the logical data model by providing tools for the continuous evolution of the schema:




While nothing prevents a GraphQL service from being versioned just like any other REST API, GraphQL takes a strong opinion on avoiding versioning by providing the tools for the continuous evolution of a GraphQL schema.





Monorepo with dedicated micro-service build and deployment lifecycle




I recommend placing the entire application ecosystem into a single Git monorepo. I will discuss structuring such a monorepo in another post.



The reason for a monorepo is that it facilitates code reuse and signifies a microservice ecosystem known to work together. That does not mean that all microservices are always built and deployed together.



Though all of your microservices will live in the same monorepo, they each need to have their lifecycle. Following the loose coupling principle described above, changing one microservice should not require changes to others under most circumstances. Only modified microservices get deployed together.




Microservice granularity




A single microservice may perform more than one task. I think it's an overkill to limit microservices to one individual function. The tasks should be related and have the following common characteristics:




  • The tasks are related and tightly coupled. Usually, such tasks are modified together. If you frequently find yourself changing multiple microservices at the same time, it's a good indication that they should either be a single microservice or you need to rethink their coupling;
  • The tasks have similar performance and scaling characteristics. Suppose your microservice serves, say, 5 APIs, of which three always complete in 500 milliseconds and must serve thousands of requests per second. One requires 20 seconds to run but only runs once an hour, and another one is a long-running asynchronous task that runs overnight. In this example you have 3 tasks that share code and have similar performance characteristics, and the other two don't. That is 3 independent microservices;
  • The tasks have similar development lifecycles. Suppose your microservice serves 5 seemingly related APIs. Four of these rarely change. But one changes with every release. As a result, due to changes to one API, you have to rebuild and redeploy the other four. It is time to refactor;
  • Periodic reviews of performance, scalability, and development lifecycle. You should periodically review the data from your cloud service and code commit history to see whether you need to refactor or combine microservices. You do not need to stick to some permanent architecture. Microservice architecture should be fluid and easily movable, depending on performance, scalability, and development lifecycle characteristics.




Each microservice is its own deployable asset




Though the entire ecosystem lives in the same monorepo, each microservice is its own deployable asset. It can be a container or a AWS Lambda function. Only microservices that are modified should be rebuilt and redeployed.



A choice between a container or a Lambda is something I'd like to explore in another post.




Strive for stateless micro-services




Micro-services should be stateless. There could be a state associated with interacting with a micro-service, but the micro-service itself should not be the one to maintain that state.



My approach is to pass the state around between interactions in the form of a session object. You could also use something like Redis, but I don't like using Redis for things that cannot be restored without enabling persistence — I exclusively use Redis as an LRU cache. Using Redis for durable storage is another topic we should explore in another post.




Final thoughts




What I described in this post is my philosophy for building microservice architectures. I do not consider myself a purist, and my views are very pragmatic. I do not like team silos, and I like architectures that are natural to create and evolve in practice and do not impose contrived constraints. The best practices I described above are based on years of practical hands-on experience.