Blog
Notes from working on data platforms, infrastructure, and side projects.
·6 min read· AI-assisted
What does grep stand for, and the seventy-five-year history of the regular expression
grep is short for g/re/p — the ed command syntax for global regular expression print. Regular expressions themselves go back to a 1951 RAND memo by Stephen Kleene. The thirty-year flavour war is a footnote to the original math.
#regex
#history
#unix
·5 min read· AI-assisted
Where the name Docker comes from
A docker is a longshoreman — the worker at a port who loads and unloads shipping containers. The software took its name from the work; the work took its metaphor from the 1956 standardisation of physical shipping containers. The container ecosystem inherited the maritime vocabulary.
#docker
#history
#containers
#naming
·5 min read· AI-assisted
SSH: the 1995 protocol nothing has displaced
Tatu Ylönen wrote SSH at Helsinki University of Technology in July 1995 after a password-sniffing incident. Thirty years later it is the default remote-access protocol on every Unix-like operating system, including Windows. Almost nothing else in security tooling has lasted as long unchanged.
#ssh
#security
#history
#networking
·5 min read· AI-assisted
POSIX, the standard nobody reads
POSIX stands for Portable Operating System Interface, plus an X added at the IEEE's request. Richard Stallman coined the name in 1988. The standard codifies the Unix interface every modern operating system claims to implement and almost none follow strictly.
#posix
#unix
#history
#standards
·6 min read· AI-assisted
The editor wars were never really about the editor
vim came from ed (1969) through ex and vi. Emacs came from a different lineage starting at MIT in 1976. The thirty-year fight between them was a disagreement about whether modal editing is worth the learning curve. Nano is the third option that quietly took the casual market.
#vim
#emacs
#nano
#history
·6 min read· AI-assisted
What we mean when we say 'the cloud'
The cloud was a network-diagram icon for two decades before it was a product. Compaq coined cloud computing in 1996, AWS made it real in 2006, and the surrounding vocabulary — VPS, hyperscaler, colocation — each carries its own history.
#cloud
#infrastructure
#history
#opinion
·5 min read· AI-assisted
Why Kubernetes is called K8s, and what the Greek means
K8s is a numeronym — K, eight letters, s. Kubernetes itself is the ancient Greek κυβερνήτης, the helmsman of a ship. The seven-spoked logo is a Star Trek easter egg. The thing it names is the third generation of Google's internal scheduler.
#kubernetes
#k8s
#history
#naming
·5 min read· AI-assisted
JSON: discovered, not invented
Douglas Crockford has said for twenty years that he did not invent JSON, he discovered it. The format was sitting inside JavaScript the whole time, waiting for someone to extract it. The story of how a 2001 footnote in a browser scripting language ate XML's lunch is shorter than most people think.
#json
#history
#data
·6 min read· AI-assisted
YAML vs YML, and what 'markup language' actually means
The .yml extension is a 1990s DOS artifact. The 'YAML Ain't Markup Language' acronym is a 2002 self-correction. Both questions resolve cleanly once you know markup languages and data serialisation formats are different categories with different ancestors.
#yaml
#markup
#history
#data
#opinion
·5 min read· AI-assisted
The cold-start tax: serverless warehouses vs an always-on box
A serverless warehouse that auto-provisions three nodes for a SELECT * spends most of the bill on the time you waited for it. A self-hosted ClickHouse on EC2 trades elasticity for sub-second latency and a fixed monthly line item — favourable any time queries are even mildly steady.
#data
#warehouse
#databricks
#clickhouse
#cost