Archive

The Dulin Report

Browsable archive from the WordPress export.

Results (57)

Strategic activity mapping for software architects May 25, 2025 On the role of Distinguished Engineer and CTO Mindset Apr 27, 2025 The future is bright Mar 30, 2025 Software Engineering is here to stay Mar 3, 2024 Some thoughts on recent RTO announcements Jun 22, 2023 Comparing AWS SQS, SNS, and Kinesis: A Technical Breakdown for Enterprise Developers Feb 11, 2023 Should today’s developers worry about AI code generators taking their jobs? Dec 11, 2022 Things to be Thankful for Nov 24, 2022 Book review: Clojure for the Brave and True Oct 2, 2022 Monolithic repository vs a monolith Aug 23, 2022 Scripting languages are tools for tying APIs together, not building complex systems Jun 8, 2022 There is no such thing as one grand unified full-stack programming language May 27, 2022 Most terrifying professional artifact May 14, 2022 Best practices for building a microservice architecture Apr 25, 2022 True identity verification should require a human Mar 16, 2020 On elephant graveyards Feb 15, 2020 TDWI 2019: Architecting Modern Big Data API Ecosystems May 30, 2019 Returning security back to the user Feb 2, 2019 Which AWS messaging and queuing service to use? Jan 25, 2019 Using Markov Chain Generator to create Donald Trump's state of union speech Jan 20, 2019 The religion of JavaScript Nov 26, 2018 Leaving Facebook and Twitter: here are the alternatives Mar 25, 2018 When politics and technology intersect Mar 24, 2018 TypeScript starts where JavaScript leaves off Aug 2, 2017 Node.js is a perfect enterprise application platform Jul 30, 2017 Rather than innovating Walmart bullies their tech vendors to leave AWS Jun 27, 2017 Architecting API ecosystems: my interview with Anthony Brovchenko of R. Culturi Jun 5, 2017 TDWI 2017, Chicago, IL: Architecting Modern Big Data API Ecosystems May 30, 2017 Apple’s recent announcements have been underwhelming Oct 29, 2016 Why I switched to Android and Google Project Fi and why should you Aug 28, 2016 Amazon Alexa is eating the retailers alive Jun 22, 2016 What can we learn from the last week's salesforce.com outage ? May 15, 2016 Why it makes perfect sense for Dropbox to leave AWS May 7, 2016 JEE in the cloud era: building application servers Apr 22, 2016 Managed IT is not the future of the cloud Apr 9, 2016 JavaScript as the language of the cloud Feb 20, 2016 OAuth 2.0: the protocol at the center of the universe Jan 1, 2016 Operations costs are the Achille's heel of NoSQL Nov 23, 2015 IT departments must transform in the face of the cloud revolution Nov 9, 2015 Banking Technology is in Dire Need of Standartization and Openness Sep 28, 2015 Top Ten Differences Between ActiveMQ and Amazon SQS Sep 5, 2015 We Live in a Mobile Device Notification Hell Aug 22, 2015 What Every College Computer Science Freshman Should Know Aug 14, 2015 The Three Myths About JavaScript Simplicity Jul 10, 2015 Book Review: "Shop Class As Soulcraft" By Matthew B. Crawford Jul 5, 2015 Your IT Department's Kodak Moment Jun 17, 2015 The longer the chain of responsibility the less likely there is anyone in the hierarchy who can actually accept it Jun 7, 2015 Smart IT Departments Own Their Business API and Take Ownership of Data Governance May 13, 2015 We Need a Cloud Version of Cassandra May 7, 2015 Building a Supercomputer in AWS: Is it even worth it ? Apr 13, 2015 Ordered Sets and Logs in Cassandra vs SQL Apr 8, 2015 Exploration of the Software Engineering as a Profession Apr 8, 2015 What can Evernote Teach Us About Enterprise App Architecture Apr 2, 2015 Why I am Tempted to Replace Cassandra With DynamoDB Nov 13, 2014 Infrastructure in the cloud vs on-premise Aug 25, 2014 Wall St. wakes up to underinvestment in OMS Aug 21, 2014 Cassandra: Lessons Learned Jun 6, 2014

Operations costs are the Achille's heel of NoSQL

November 23, 2015

This article was originally published on my Cloud Power blog at Computerworld on October 27th, 2015

NoSQL databases scale by adding more commodity servers. With more commodity servers come increased costs and complexities. Some NoSQL systems are better at this than others and need less.

Consider the size of the Apple Cassandra installation that is reported at 75000 nodes and over 10 petabytes of data. The complexity of the operations, monitoring, upgrades and other maintenance tasks must be overwhelming. Apple bought FoundationDB to cut their own costs while improving performance. Julie Bort writes:
While both Cassandra/DataStax and FoundationDB are noSQL databases, FoundationDB had some unique technology. It works super-fast but needs far less hardware than Cassandra, making it even cheaper to use, even as it scales. (In geek speak, it’s an “in-memory” database that runs on flash storage.)

Goldmacher says it needs somewhere between 5% to 10% less hardware than Cassandra.

At Apple’s scale 10% of 75000 is 7500 nodes and it is not something to ignore. The most popular post on my blog is my article on how I’d like to replace Cassandra with DynamoDB in the AWS environment. The long term costs of operating Cassandra are on the minds of Cassandra adopters.

MongoDB is under pressure from customers to reduce operations costs as well. Viber migrated their MongoDB cluster to Couchbase cutting the number of AWS EC2 instances in half. At Viber’s scale that is not a small number.

Companies interested in adopting NoSQL should consider their options carefully. The vast majority of database use cases do not need massive horizontal scalability. Most applications could be better off with traditional SQL databases. In the cloud, there are NoSQL alternatives that cost less and are easier to maintain. Let’s review just a few examples.

AWS RDS for PostgreSQL


PostgreSQL has been offering NoSQL capabilities like MongoDB since version 9.3. That includes ACID, hierarchical document data and ability to index JSON documents. AWS RDS service of PostgreSQL offers high availability, redundancy, and fail-over. Being a managed service it requires very little attention. Many tasks such as backups and fail-over are fully automated. Rich management API and monitoring tools provide for customization of scaling behavior.

Redis


As John Martin of Computerworld wrote, “When it comes to storage, cache is king”. Azure, AWS and Google offer managed cache services. AWS Elasticache in particular offers a choice of Memcached and Redis. Redis is an interesting alternative to NoSQL since its low level data model is similar to that of Cassandra for some of the use cases. Redis database has to fit entirely in-memory but it can be persisted to disk and recovered upon reboot. Redis can be configured in clusters for high availability and performance. On master failure one of the slaves becomes the new master.

AWS DynamoDB and Google BigTable


AWS DynamoDB and Google BigTable offer a similar data model to Cassandra as well as infinite scalability. Neither service requires any administration or devops. One has to be on the look-out for burst performance, however. Burst capacity is one area where a custom configured NoSQL database can shine.

Object storage


An object storage tool like AWS S3 is a long term infinitely large key/value store. As a corner stone of AWS, S3 can integrate with CloudFront, RedShift and many other AWS services. It scales horizontally without any questions asked and can store JSON and binary documents as well as logs. S3 is also ridiculously cheap and can be used to store terabytes of data.

Final thoughts


Companies should keep in mind the costs associated with NoSQL technology. It is important to consider not only the technical merits but also the costs. Development teams that choose the right tool for the right job will always win.