The Dulin Report

Things I wish Apache Cassandra was better at

February 12, 2014

At first I wanted to name this article “My personal list of grievances against Apache Cassandra”, but I decided to take a more positive approach.

Only Java developers need apply

Sure, you can use Cassandra with non-Java clients, but the reality is that administering and maintaining Cassandra itself requires very significant knowledge of Java, and there is no way around it. The engineer must know Java garbage collection in great detail, and this requires significant experience. As for myself, with twenty+ years of experience with Java, I still don’t know how to prevent long GC pauses in Cassandra completely. In order to do that, I have no choice but to run Cassandra in a JVM profiler and familiarize myself with its source code.

Point is, Cassandra needs to be self-tuning, and it should dynamically adjust itself to the workload. It should not require above-average Java expertise to maintain and tune. Ideally, it shouldn’t require any Java expertise at all.

I don’t want to become an ops engineer

Cassandra is far from zero-maintenance. It requires a regular “repair” process to ensure consistency and to make sure that deleted items don’t come back. Compactions can temporarily double disk space utilization. If you have disk space issues, deleting data doesn’t mean you reclaim the space. Again, just as I mentioned above, Cassandra needs to be more self-tuning, dynamically adjust itself to the workload, and not require babysitting.
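To make the babysitting concrete, here is a minimal back-of-the-envelope sketch in Python of two checks an operator ends up doing by hand. The function names are hypothetical and the numbers are illustrative. It assumes the worst case described above, where a compaction temporarily doubles disk usage, and it assumes that repairs should finish within gc_grace_seconds (10 days by default in Cassandra) so that deleted items don’t come back.

```python
# Hypothetical helpers for illustration only; not part of Cassandra or any driver.

# Cassandra's default gc_grace_seconds (10 days). After this window,
# tombstones (delete markers) become eligible for removal by compaction.
GC_GRACE_SECONDS_DEFAULT = 864_000


def peak_disk_usage_gb(data_gb: float, compaction_overhead: float = 1.0) -> float:
    """Estimate disk usage at the peak of a compaction.

    Assumes the compaction writes `compaction_overhead` times the live data
    before the old files are deleted (1.0 means usage doubles).
    """
    return data_gb * (1.0 + compaction_overhead)


def has_compaction_headroom(disk_capacity_gb: float, data_gb: float) -> bool:
    """True if a worst-case (doubling) compaction still fits on the disk."""
    return peak_disk_usage_gb(data_gb) <= disk_capacity_gb


def repair_is_frequent_enough(
    repair_interval_seconds: int,
    gc_grace_seconds: int = GC_GRACE_SECONDS_DEFAULT,
) -> bool:
    """True if repairs run more often than tombstones become purgeable."""
    return repair_interval_seconds < gc_grace_seconds


if __name__ == "__main__":
    # A 1 TB disk holding 400 GB versus 600 GB of data.
    print(has_compaction_headroom(1000, 400))
    print(has_compaction_headroom(1000, 600))
    # A weekly repair schedule against the default grace period.
    print(repair_is_frequent_enough(7 * 24 * 3600))
```

Under these assumptions, a node that is more than half full may not survive its own compaction, and a repair schedule slower than the grace period risks resurrecting deleted data. Neither check is something a developer should have to run by hand.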

I want to see a proper cloud version of Cassandra

Today’s cloud environments like Google App Engine, Heroku and even Amazon give you standard JDBC access to a SQL database without requiring a developer to become an ops engineer or a DBA. To put it bluntly, I couldn’t care less about the nitty-gritty details of cassandra.yaml. I want to use Cassandra to build applications, not spend an eternity tuning YAML files. I want Thrift and CQL access to a cloud Cassandra cluster maintained by someone else. That’s it.