Jepsen: Strangeloop Hangout

2013-09-28

Since the Strangeloop talks won’t be available for a few months, I recorded a new version of the talk as a Google Hangout.

Continue reading (29 words)

Jepsen: RICON East talk

Software Riak Distributed Systems Jepsen Redis MongoDB Postgres

2013-09-28

Continue reading (6 words)

Jepsen: Cassandra

Software Network Distributed Systems Jepsen Cassandra

2013-09-24

Previously on Jepsen, we learned about Kafka’s proposed replication design.

Cassandra is a Dynamo system; like Riak, it divides a hash ring into a several chunks, and keeps N replicas of each chunk on different nodes. It uses tunable quorums, hinted handoff, and active anti-entropy to keep replicas up to date. Unlike the Dynamo paper and some of its peers, Cassandra eschews vector clocks in favor of a pure last-write-wins approach.

Continue reading (3189 words)

Jepsen: Kafka

Software Network Distributed Systems Jepsen Kafka

2013-09-24

In the last Jepsen post, we learned about NuoDB. Now it’s time to switch gears and discuss Kafka. Up next: Cassandra.

Kafka is a messaging system which provides an immutable, linearizable, sharded log of messages. Throughput and storage capacity scale linearly with nodes, and thanks to some impressive engineering tricks, Kafka can push astonishingly high volume through each node; often saturating disk, network, or both. Consumers use Zookeeper to coordinate their reads over the message log, providing efficient at-least-once delivery–and some other nice properties, like replayability.

Continue reading (1881 words)

Jepsen: NuoDB

Software Network Distributed Systems Jepsen NuoDB

2013-09-23

Previously on Jepsen, we explored Zookeeper. Next up: Kafka.

NuoDB came to my attention through an amazing mailing list thread by the famous database engineer Jim Starkey, in which he argues that he has disproved the CAP theorem:

Continue reading (1497 words)

Jepsen: Zookeeper

Software Network Distributed Systems Jepsen Zookeeper

2013-09-23

In this Jepsen post, we’ll explore Zookeeper. Up next: NuoDB.

Update 2019-07-23: @insumity explains that ZooKeeper sync+read is not, in fact, linearizable–there are conditions under which it might return stale reads.

Continue reading (772 words)

A letter on NSA surveillance

Politics Security

2013-07-04

I wish I could make more concrete policy recommendations, but in this case all I can say is “this looks troubling.” Here’s the letter I sent to my representatives today:

Dear Senator Feinstein,

Continue reading (637 words)

Automating Jepsen

Databases Jepsen

2013-06-28

If you, as a database vendor, implement a few features in your API, I can probably offer repeatable automated tests of your DB’s partition tolerance through Jepsen.

The outcome of these tests would be a set of normalized metrics for each DB like “supports linearizability”, “available for writes when a majority partition exists”, “available for writes when no majority available”, “fraction of writes successful”, “fraction of writes denied”, “fraction of writes acked then lost”, “95th latency during condition X”, and so forth. I’m thinking this would be a single-page web site–a spreadsheet, really–making it easy to compare and contrast DBs and find one that fits your safety needs.

Continue reading (335 words)

The network is reliable

Network Jepsen

2013-06-02

I’ve been discussing Jepsen and partition tolerance with Peter Bailis over the past few weeks, and I’m honored to present this post as a collaboration between the two of us. We’d also like to extend our sincere appreciation to everyone who contributed their research and experience to this piece.

Continue reading (5635 words)

Asynchronous replication with failover

Software Network Distributed Systems Jepsen Redis

2013-05-21

Continue reading (2437 words)

Jepsen: final thoughts

Software Network Distributed Systems Jepsen

2013-05-20

Previously in Jepsen, we discussed Riak. Now we’ll review and integrate our findings.

This was a capstone post for the first four Jepsen posts; it is not the last post in the series. I’ve continued this work in the years since and produced several more posts.

Continue reading (1548 words)

Jepsen: Riak

Software Network Riak Distributed Systems Jepsen

2013-05-19

Previously in Jepsen, we discussed MongoDB. Today, we’ll see how last-write-wins in Riak can lead to unbounded data loss.

Continue reading (2742 words)