-
Instagram sharding: https://instagram-engineering.com/sharding-ids-at-instagram-1cf5a71e5a5c
-
Dynamo: https://www.allthingsdistributed.com/files/amazon-dynamo-sosp2007.pdf
-
DynamoDB: https://www.usenix.org/system/files/atc22-elhemali.pdf
-
Anti-entropy without Merkle trees: https://repositorium.sdum.uminho.pt/bitstream/1822/51489/1/DottedDB-SRDS2017.pdf
-
Zab protocol (ZooKeeper Atomic Broadcast)
-
Paxos, Multi-Paxos, Paxos groups (Google Spanner)
-
Sparse indexes in databases
-
Pack files?
-
Conflict-free replicated data types: https://arxiv.org/pdf/1805.06358
-
A commutative replicated data type for cooperative editing: https://inria.hal.science/file/index/docid/445975/filename/icdcs09-treedoc.pdf
-
Database engineering knowledge base: https://arpitbhayani.me/knowledge-base/database-engineering
-
Preventing Postgres SQL Race Conditions with SELECT FOR UPDATE: https://on-systems.tech/blog/128-preventing-read-committed-sql-concurrency-errors/
-
Postgres anti-patterns: https://www.enterprisedb.com/blog/postgresql-anti-patterns-read-modify-write-cycles
-
https://www.uber.com/en-IN/blog/ringpop-open-source-nodejs-library/
-
Cassandra consistency levels: https://docs.datastax.com/en/cassandra-oss/3.0/cassandra/dml/dmlConfigConsistency.html
-
Notion sharding: https://www.notion.com/blog/the-great-re-shard
-
Advisory locking in Postgres
-
Distributed locks during transactions
-
https://grafana.com/blog/2024/02/09/how-i-write-http-services-in-go-after-13-years/
-
https://www.dragonflydb.io/guides/message-broker-vs-message-queue
-
https://benjamincongdon.me/blog/2021/08/17/B-Trees-More-Than-I-Thought-Id-Want-to-Know/
-
Go scheduler https://www.youtube.com/watch?v=YHRO5WQGh0k
-
Lock-free data structures/programming
-
RAG
-
Distributed storage/systems
-
https://www.smashingmagazine.com/2017/04/guide-http2-server-push/
-
https://www.confluent.io/events/kafka-summit-london-2023/an-introduction-to-kafka-cruise-control/
-
Error wrapping in Go: https://bitfieldconsulting.com/posts/wrapping-errors
-
DDD + SOLID:
-
Google Spanner: https://dl.acm.org/doi/pdf/10.1145/3035918.3056103
-
Swiss hash tables
-
Valkey hash table
-
Papers
- Dynamo: Amazon’s Highly Available Key-value Store ✅ 2025-05-21
- Google File System: Insights into a highly scalable file system
- Scaling Memcached at Facebook: A look at the complexities of Caching
- BigTable: The design principles behind a distributed storage system
- Borg - Large Scale Cluster Management at Google
- Cassandra: A look at the design and architecture of a distributed NoSQL database
- Attention Is All You Need: An introduction to a new deep learning architecture known as the transformer
- Kafka: Internals of the distributed messaging platform
- FoundationDB: A look at how a distributed database
- Amazon Aurora: To learn how Amazon provides high-availability and performance
- Spanner: Design and architecture of Google’s globally distributed database
- MapReduce: A detailed look at how MapReduce enables parallel processing of massive volumes of data
- Shard Manager: Understanding the generic shard management framework
- Dapper: Insights into Google’s distributed systems tracing infrastructure
- Flink: A detailed look at the unified architecture of stream and batch processing
- A Comprehensive Survey on Vector Databases
- Zanzibar: A look at the design, implementation and deployment of a global system for managing access control lists at Google
- Monarch: Architecture of Google’s in-memory time series database
- Thrift: Explore the design choices behind Facebook’s code-generation tool
- Bitcoin: The ground-breaking introduction to the peer-to-peer electronic cash system
- WTF - Who to Follow Service at Twitter: Twitter’s (now X) user recommendation system
- MyRocks: LSM-Tree Database Storage Engine
- Go To Statement Considered Harmful
- Raft Consensus Algorithm: To learn about the more understandable consensus algorithm
- Time, Clocks, and the Ordering of Events: The extremely important paper that explains the concept of time and event ordering in a distributed system
-
Debezium
-
CDC messages (change data capture)
-
https://research.google/pubs/f1-the-fault-tolerant-distributed-rdbms-supporting-googles-ad-business/
-
https://research.google/pubs/spanner-googles-globally-distributed-database-2/
-
Demystifying distributed SQL and why companies choose CockroachDB? https://www.youtube.com/watch?v=OrZszgOt5Fs
- Write-behind logging: https://db.cs.cmu.edu/papers/2016/p337-arulraj.pdf
-
What Every Programmer Should Know about How CPUs Work: https://www.youtube.com/watch?v=-HNpim5x-IE ✅ 2025-05-19
-
IPFS, IPFN, OrbitDB, Web3 storage, stablecoins, smart contracts
-
DBOS
-
SingleStore
-
Zero copy I/O
-
SpacetimeDB
-
Multi-tenant databases
-
Serializability and Linearizability https://aerospike.com/blog/concurrency-control-in-dbms/
-
ZeroMQ
-
Terragrunt
-
Lakehouse: A New Generation of Open Platforms that Unify Data Warehousing and Advanced Analytics: https://www.cidrdb.org/cidr2021/papers/cidr2021_paper17.pdf
-
CXL for disaggregated memory in cloud native databases vs RDMA (Remote Direct Memory Access)
-
Golden master approach