
The Dulin Report


Things I wish Apache Cassandra was better at

February 12, 2014

At first I wanted to name this article “My personal list of grievances against Apache Cassandra”, but I decided to take a more positive approach.

Only Java developers need apply

Sure, you can use Cassandra with non-Java clients, but the reality is that administering and maintaining Cassandra itself requires very significant knowledge of Java, and there is no way around it. The engineer must know Java garbage collection in great detail, and this requires significant experience. As for myself, with twenty-plus years of experience with Java, I still don’t know how to prevent long GC pauses in Cassandra completely, and to learn how I have no choice but to run Cassandra in a JVM profiler and familiarize myself with its source code.

The point is, Cassandra needs to be self-tuning, and it should dynamically adjust itself to the workload. It should not require above-average Java expertise to maintain and tune. Ideally, it shouldn’t require any Java expertise at all.

I don’t want to become an ops engineer

Cassandra is far from zero-maintenance. It requires a regular “repair” process to ensure consistency and to make sure that deleted items don’t come back. Compactions can temporarily double disk space utilization. And if you have disk space issues, deleting data doesn’t mean you reclaim the space. Again, as I mentioned above, Cassandra needs to be more self-tuning, dynamically adjust itself to the workload, and not require babysitting.
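To make the “deleted items come back” problem concrete, here is a toy Python model of it. This is only a sketch, not Cassandra’s actual implementation: the Replica class, the repair and read functions, and the GC_GRACE constant are hypothetical names invented for illustration (GC_GRACE loosely stands in for Cassandra’s gc_grace_seconds table option). The idea is that a delete is stored as a tombstone, tombstones are eventually purged by compaction, and a replica that missed the delete can hand the old value back to its peers if no repair ran in time.

```python
from dataclasses import dataclass, field

GC_GRACE = 10  # toy stand-in for gc_grace_seconds (assumption, arbitrary units)


@dataclass
class Replica:
    # key -> (timestamp, value); a value of None marks a tombstone
    cells: dict = field(default_factory=dict)

    def write(self, key, value, ts):
        current = self.cells.get(key)
        if current is None or ts > current[0]:  # last write wins
            self.cells[key] = (ts, value)

    def delete(self, key, ts):
        self.write(key, None, ts)  # a delete is just a tombstone write

    def compact(self, now):
        # Purge tombstones that are older than the grace period
        self.cells = {
            k: (ts, v)
            for k, (ts, v) in self.cells.items()
            if not (v is None and now - ts > GC_GRACE)
        }


def repair(replicas):
    # Naive anti-entropy: every replica adopts the newest cell per key
    merged = {}
    for r in replicas:
        for k, (ts, v) in r.cells.items():
            if k not in merged or ts > merged[k][0]:
                merged[k] = (ts, v)
    for r in replicas:
        r.cells = dict(merged)


def read(replicas, key):
    cells = [r.cells[key] for r in replicas if key in r.cells]
    if not cells:
        return None
    return max(cells, key=lambda c: c[0])[1]


def scenario(repair_in_time):
    a, b, c = Replica(), Replica(), Replica()
    replicas = [a, b, c]
    for r in replicas:
        r.write("user:1", "alice", ts=1)
    # Replica c is down and misses the delete
    a.delete("user:1", ts=2)
    b.delete("user:1", ts=2)
    if repair_in_time:
        repair(replicas)  # c learns about the tombstone before it is purged
    now = 2 + GC_GRACE + 1
    for r in replicas:
        r.compact(now)  # tombstones are now old enough to be dropped
    repair(replicas)  # a late repair spreads whatever data is left
    return read(replicas, "user:1")
```

In this model, scenario(True) leaves the row deleted, while scenario(False) brings "alice" back on all three replicas. That is the kind of scheduling burden the paragraph above complains about: someone has to make sure repair runs on every node within the grace period.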

I want to see a proper cloud version of Cassandra

Today’s cloud environments like Google App Engine, Heroku, and even Amazon give you standard JDBC access to a SQL database without requiring a developer to become an ops engineer or a DBA. To put it bluntly, I couldn’t care less about the nitty-gritty details of cassandra.yaml. I want to use Cassandra to build applications, not spend an eternity tuning YAML files. I want Thrift and CQL access to a cloud Cassandra cluster maintained by someone else. That’s it.