One paper. One change of model. Fifty years of consequence.
In June 1970, in the Communications of the ACM, an IBM researcher named Edgar F. Codd published a paper titled "A Relational Model of Data for Large Shared Data Banks." It was twelve pages long, packed with set-theoretic notation, and proposed something that sounded almost like a complaint: that the databases of the day were forcing programmers to think about how data was stored when they should only have to think about what data they wanted. Codd called the existing approach navigational — programmers wrote code that walked pointer to pointer through tree-shaped or network-shaped data structures, and any change to the storage layout broke every program above it. He proposed replacing all of that with a single abstraction: the relation, a mathematical set of tuples, queryable by a small algebra of operations that returned more relations.
The world Codd was reacting to was real and dominant. IMS, IBM's hierarchical database that ran on the System/360 and was used by banks and airlines, modelled data as a tree — to find a customer's orders you walked from a root to a customer node to an orders subtree. CODASYL, the network data model standardised in 1969, modelled data as a graph — to find a customer's orders you traversed a "set" relationship from customer to orders, following pointers. Both worked well enough; neither survived as the default way to build new systems (IMS, remarkably, still runs at some banks today). Codd's complaint was not that they were slow but that they were brittle: the schema, the queries, and the storage layout were entangled, and changing any one broke the others.
"Future users of large data banks must be protected from having to know how the data is organized in the machine."
— E. F. Codd, "A Relational Model of Data for Large Shared Data Banks," Communications of the ACM, June 1970
The paper proposed three things at once. First, a data model: every piece of data is a row (tuple) in a table (relation), and there is no other structure — no pointers, no hierarchies, no embedded references. Second, a query language based on set operations on relations — selection, projection, join, union, difference. Third, an independence principle: queries should describe the result, not the procedure to obtain it; the database engine should be free to choose any execution strategy that produces the same answer. Each idea by itself was conventional in mathematics; the combination, applied to commercial computing, was revolutionary.
IBM was unenthusiastic. The internal IMS team viewed Codd's work as academic, abstract, and unmarketable; an IBM prototype called System R was begun in 1973 but did not ship as a product until 1981 (as SQL/DS). In the meantime, in 1977, a 32-year-old programmer at a small consulting firm read Codd's paper, recognised its commercial potential, and founded a company specifically to build a relational database before IBM did. His name was Larry Ellison, the company was Software Development Laboratories (renamed Oracle in 1982 after the product), and Oracle V2 shipped in 1979 — two years before IBM's. The relational database industry began as a startup beating its inventor to market.
A second prototype, separate from IBM, was started at the University of California, Berkeley by Michael Stonebraker and Eugene Wong in 1973. Called Ingres, it became the basis for the open-source Postgres (1986) — which became PostgreSQL (1996) — and indirectly for Sybase, which Microsoft licensed and turned into SQL Server. So the modern relational database landscape descends from two prototypes built within fifty miles of each other in the 1970s: System R at IBM Almaden, and Ingres at Berkeley. Almost every database in production in 2026 — Oracle, SQL Server, DB2, PostgreSQL, MySQL, SQLite, Snowflake, ClickHouse — counts one of those two among its ancestors, by code or by influence. Codd's paper produced an entire industry.
Two prototypes spawned the entire industry. System R at IBM Almaden begat DB2 (still at IBM) and indirectly Oracle (Ellison read Codd's paper, founded a company, beat IBM to market). Ingres at Berkeley begat Postgres (Stonebraker again, ten years later), which became PostgreSQL — now the most-used open-source database — and, separately, Sybase, which Microsoft licensed and renamed SQL Server. SQLite, MySQL, Snowflake, ClickHouse, BigQuery: all descend, by code or by influence, from one of these two trees. Codd's twelve-page paper produced a fifty-billion-dollar-a-year industry, of which he received approximately none.
Codd, IBM, and the Turing Award. Codd received the ACM Turing Award in 1981 for the relational model, the same year IBM finally shipped a relational product. He spent his last decade increasingly frustrated that commercial databases — including IBM's — implemented incomplete versions of his model and used "relational" as a marketing term for systems that were not quite. In 1985 he published his twelve rules for relational databases as a definitional response, listing properties any system claiming the title must satisfy. Most commercial systems satisfied perhaps eight. The gap between Codd's mathematics and the working systems descended from his idea — index hints, query planner overrides, vendor-specific dialects — has never closed.
The math under SQL · three operators do most of the work.
A relation is a set of tuples that all share the same structure — a table, in everyday language, where each row is a tuple and each column is an attribute. The math of relations, called relational algebra, is a small collection of operators that take one or two relations as input and produce a new relation as output. The crucial property — the property that makes SQL composable — is closure: the result of any relational operation is itself a relation, and so can be the input to the next operation. Relational algebra has six fundamental operators (and several derived ones), but three matter especially. Most working SQL is built almost entirely from them.
Three operators do most of the work in any real query. Selection (σ, sigma) keeps the rows where a predicate holds; the rest are dropped. Projection (π, pi) keeps only the named columns; the rest disappear, and duplicates are eliminated because the result is a set. Join (⨝) takes two relations and produces every pair of rows whose values match on the named attributes — so combining a table of users with a table of orders gives you one row per (user, order) match. Closure says: each output is a relation, so any output can be the input to the next operator. The whole of SQL is, in spirit, repeated application of these three.
The other operators fall out of the same pattern. Union (∪) combines two relations of the same shape into one, keeping no duplicates. Difference (−) returns rows in the first relation that are not in the second. Cross product (×) returns every pairing of rows from two relations — the join without a matching predicate, mostly useless directly but the mathematical foundation of the join (a join is just a selection applied to a cross product). Rename (ρ) lets a column or relation be referred to by a different name, useful for self-joins. With these six operators — σ, π, ⨝, ∪, −, × — plus ρ as a renaming utility, every relational query can be expressed. The relational algebra is, in the formal sense Codd defined, a complete query language.
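Closure is easy to see in a toy implementation. The sketch below (plain Python; the relations, attributes, and values are invented for illustration, and a real engine works on disk pages rather than Python sets) models a relation as a set of rows and implements σ, π, and a natural join; every function returns the same kind of object it takes, so the calls compose exactly as the algebra says.

    # A relation as a frozenset of rows; each row is a tuple of (attribute, value)
    # pairs so it can live inside a set. Illustrative toy, not an engine.
    def rows(*dicts):
        return frozenset(tuple(sorted(d.items())) for d in dicts)

    def select(rel, pred):                      # sigma: keep rows where pred holds
        return frozenset(r for r in rel if pred(dict(r)))

    def project(rel, cols):                     # pi: keep named columns; duplicates vanish
        return frozenset(tuple((k, v) for k, v in r if k in cols) for r in rel)

    def join(r_rel, s_rel):                     # natural join on shared attribute names
        out = set()
        for r in r_rel:
            for s in s_rel:
                rd, sd = dict(r), dict(s)
                if all(rd[k] == sd[k] for k in rd.keys() & sd.keys()):
                    out.add(tuple(sorted({**rd, **sd}.items())))
        return frozenset(out)

    users  = rows({"uid": 1, "name": "Ada"}, {"uid": 2, "name": "Bo"})
    orders = rows({"uid": 1, "total": 40}, {"uid": 1, "total": 5}, {"uid": 2, "total": 9})

    # Closure: each result is itself a relation, so the operators nest freely.
    big_spenders = project(select(join(users, orders),
                                  lambda row: row["total"] > 10), {"name"})
    print(big_spenders)    # only Ada spent more than 10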
One subtlety is worth flagging. Relational algebra works on sets: no duplicate rows. SQL, by contrast, works on multisets (bags): duplicates are allowed unless you write SELECT DISTINCT. The decision was pragmatic — many applications want to count "how many orders have status='paid'," which requires keeping duplicates — but it created a long-running tension between SQL's behaviour and the formal theory underneath. Codd considered this a flaw; modern SQL standards have largely lived with it. Most working SQL programmers never know the distinction exists; query optimisers know it intimately, because converting multiset operations into set operations is one of the optimisations that can dramatically speed up a query.
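The difference is easy to see with SQLite (the table and values below are invented): the plain projection keeps duplicates, the aggregate needs them, and DISTINCT restores set semantics.

    import sqlite3

    con = sqlite3.connect(":memory:")
    con.execute("CREATE TABLE orders (id INTEGER PRIMARY KEY, status TEXT)")
    con.executemany("INSERT INTO orders (status) VALUES (?)",
                    [("paid",), ("paid",), ("pending",), ("paid",)])

    # Bag semantics: duplicates survive a plain projection ...
    print(con.execute("SELECT status FROM orders").fetchall())
    # e.g. [('paid',), ('paid',), ('pending',), ('paid',)]

    # ... which is exactly what counting needs,
    print(con.execute("SELECT COUNT(*) FROM orders WHERE status = 'paid'").fetchone())
    # (3,)

    # while DISTINCT restores Codd's set semantics.
    print(con.execute("SELECT DISTINCT status FROM orders").fetchall())
    # [('paid',), ('pending',)]  (row order not guaranteed)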
Relational algebra has the same role in databases that Boolean algebra has in circuits (Chapter 2): it is the smallest mathematical structure that captures the essential operations and is closed under composition. Closure means the algebra is compositional: the result of any expression is the same kind of object as its inputs, so expressions can be nested without bound. This is the property that lets a query optimiser take a complex SQL statement, parse it into a tree of relational operators, and then rewrite that tree algebraically into an equivalent but faster tree.
Two algebraic identities the optimiser leans on heavily: selection pushdown — σ_p(R ⨝ S) = σ_p(R) ⨝ S whenever the predicate p mentions only attributes of R — lets the engine filter R before joining, dramatically reducing the work. Projection pushdown — π_L(R ⨝ S) = π_L(π_{L_R}(R) ⨝ π_{L_S}(S)), where L_R and L_S are the attributes of R and S that appear either in L or in the join condition — lets the engine throw away unneeded columns early. The transformations are correct because the algebra is mathematically rigorous; the speedups are real because they reduce the size of intermediate relations.
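The selection-pushdown identity can be checked directly in SQL. In the sqlite3 sketch below (tables, rows, and the predicate are invented), filtering after the join and filtering before it return the same relation, which is precisely the freedom the optimiser exploits when it rewrites the tree.

    import sqlite3

    con = sqlite3.connect(":memory:")
    con.executescript("""
        CREATE TABLE r (id INTEGER, category TEXT);
        CREATE TABLE s (id INTEGER, price INTEGER);
        INSERT INTO r VALUES (1, 'book'), (2, 'food'), (3, 'book');
        INSERT INTO s VALUES (1, 12), (2, 7), (3, 30);
    """)

    # sigma_p(R join S): filter after joining.
    after = con.execute("""
        SELECT r.id, r.category, s.price
        FROM r JOIN s ON r.id = s.id
        WHERE r.category = 'book'
        ORDER BY r.id
    """).fetchall()

    # sigma_p(R) join S: filter R first, then join -- the pushed-down form.
    before = con.execute("""
        SELECT r2.id, r2.category, s.price
        FROM (SELECT * FROM r WHERE category = 'book') AS r2
        JOIN s ON r2.id = s.id
        ORDER BY r2.id
    """).fetchall()

    assert before == after       # same relation, cheaper evaluation order
    print(after)                 # [(1, 'book', 12), (3, 'book', 30)]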
Declarative · standardised · still arguing about itself.
Codd's relational algebra is mathematics. SQL — Structured Query Language — is the working programmer's interface to it, designed at IBM in the mid-1970s alongside System R, originally called SEQUEL (Structured English Query Language) and renamed for trademark reasons. SQL is not a one-to-one transliteration of relational algebra; it is a deliberately English-shaped declarative language meant to be writable by analysts, not just by programmers. The compiler reads the English, parses it into a relational-algebra tree, runs an optimiser over the tree, and produces an execution plan. The English is the contract; the plan is private to the engine and is the engine's problem to optimise.
The language has two main halves. DDL — Data Definition Language — is for changing the schema: CREATE TABLE, ALTER TABLE, DROP TABLE, CREATE INDEX. DML — Data Manipulation Language — is for changing the data: SELECT, INSERT, UPDATE, DELETE. A typical application uses DDL during deployment (when schemas change, which should be rare) and DML during operation (every request the application handles). A typical SQL statement someone writes looks deceptively close to English, something like SELECT name, email FROM users WHERE signup_date >= '2026-01-01' ORDER BY name.
A SQL statement is parsed into a relational-algebra expression, which is then handed to the optimiser. The optimiser does not execute the algebra literally — it rewrites the tree using algebraic identities (selection pushdown, projection pushdown, join reordering, choice of join algorithm), uses statistics about each table's size and value distribution, and produces an execution plan. The plan is what actually runs. Two SQL statements that mean the same thing usually compile to the same plan; the optimiser does not care about the surface syntax. EXPLAIN in PostgreSQL or MySQL prints the plan — every working SQL programmer eventually learns to read it, because the difference between a fast query and a slow one is almost always the plan, not the SQL.
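A small illustration with SQLite, whose equivalent of EXPLAIN is EXPLAIN QUERY PLAN (the tables here are invented and empty; the planner does not need data to plan): two different spellings of the same join are handed to the engine, and their plans can be compared.

    import sqlite3

    con = sqlite3.connect(":memory:")
    con.executescript("""
        CREATE TABLE users  (id INTEGER PRIMARY KEY, name TEXT);
        CREATE TABLE orders (id INTEGER PRIMARY KEY, user_id INTEGER, total INTEGER);
    """)

    explicit_join = """
        SELECT users.name, orders.total
        FROM users JOIN orders ON orders.user_id = users.id
    """
    comma_join = """
        SELECT users.name, orders.total
        FROM users, orders WHERE orders.user_id = users.id
    """

    # EXPLAIN QUERY PLAN asks the optimiser what it would do, without running the query.
    for sql in (explicit_join, comma_join):
        print([row[-1] for row in con.execute("EXPLAIN QUERY PLAN " + sql)])
        # Both spellings typically produce the same plan: the optimiser works on
        # the algebra tree, not on the surface syntax.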
Three join algorithms
A JOIN in SQL is a single keyword. Underneath, it can be executed three different ways, with very different cost profiles. The optimiser picks one based on the size of the tables and the presence of indexes. Working SQL programmers should at least know that all three exist, because reading an EXPLAIN plan is mostly the act of recognising which one was chosen and whether it was the right choice.
Same SQL JOIN, three execution strategies. Nested-loop is naive but optimal when one side is tiny or has a useful index — every row of the smaller side does an indexed lookup against the larger. Hash join builds an in-memory hash table on the smaller side keyed by the join column, then streams the larger side and probes the hash; linear in both sides, fast on equality joins, requires the small side to fit in memory. Merge join sorts both sides on the join column and then walks them in lockstep; linear once sorted, indispensable for column-store OLAP engines and for range joins. The optimiser picks based on table sizes, indexes available, and the join condition. Reading EXPLAIN output is, almost always, the act of figuring out which strategy was chosen — and whether it was the right one.
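A sketch of two of the three strategies in plain Python (the rows are invented tuples; a real engine operates on pages and can spill to disk): the nested loop compares every pair, while the hash join builds a table on one side and probes it with the other. Merge join, which sorts both sides and walks them in lockstep, is omitted for brevity.

    # Toy rows: (user_id, name) and (user_id, total). Illustrative only.
    users  = [(1, "Ada"), (2, "Bo"), (3, "Cy")]
    orders = [(1, 40), (3, 5), (1, 12), (2, 9)]

    def nested_loop_join(users, orders):
        # O(len(users) * len(orders)) comparisons -- fine when one side is tiny
        # or when the inner side has an index to probe instead of a full scan.
        return [(u, o) for u in users for o in orders if u[0] == o[0]]

    def hash_join(users, orders):
        # Build a hash table on the smaller side, probe with the larger:
        # linear in both inputs, but only works for equality predicates.
        table = {}
        for u in users:
            table.setdefault(u[0], []).append(u)
        out = []
        for o in orders:
            for u in table.get(o[0], []):
                out.append((u, o))
        return out

    assert sorted(nested_loop_join(users, orders)) == sorted(hash_join(users, orders))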
The 1979 Selinger paper. The optimiser-as-component, with statistics-based cost estimates and dynamic-programming join-order search, was invented for IBM's System R by Patricia Selinger and her colleagues and described in a 1979 SIGMOD paper that is among the most-cited in database history ("Access Path Selection in a Relational Database Management System"). Every relational query optimiser since — Oracle's, PostgreSQL's, SQL Server's, Snowflake's, BigQuery's — descends from Selinger's design. The paper is forty-seven years old in 2026 and is still required reading in graduate database courses, for the same reason Codd's is. Both turned out to be definitive on first publication.
The four properties that make money safe to move.
A bank transfer reads like one operation but is at least two: subtract from the source account, add to the destination account. If the database crashes after the subtraction but before the addition, the money has vanished. If the addition succeeds but the subtraction fails, money has been duplicated. Two simultaneous transfers from the same account, racing each other, can read the same balance and double-spend it. Multiply these failure modes across millions of transfers per day and the financial system stops working. The transaction concept, articulated by IBM's Jim Gray in a foundational 1981 paper, is the machinery that prevents this — and the four-letter acronym ACID (Atomicity, Consistency, Isolation, Durability) is the contract every relational database makes with its users.
Each letter is a precise guarantee. Atomicity: a transaction either commits all of its operations or none of them. No half-applied transactions can ever be observed. Consistency: a transaction that begins in a valid database state ends in a valid database state — referential integrity holds, constraints are satisfied, data types are preserved. Isolation: concurrent transactions execute as if they ran one at a time, in some serial order, even though the engine actually overlaps their work. Durability: once a transaction has been told it committed, the data survives any subsequent crash, power loss, or hardware failure short of total disk loss. Atomicity and durability protect against failure; isolation and consistency protect against concurrency. Together they let an application reason about state changes as if the world stopped while each transaction ran.
The same logical operation — transfer €100 — without and with transactional guarantees. Without them, a power failure between the two updates leaves the database in an impossible state: money missing. With them, the database recovers either by replaying the whole transaction (if it had committed) or by rolling it back entirely (if it had not). The user sees one of two consistent results: the transfer happened, or it didn't. There is no "half-happened." Jim Gray made these properties precise; modern relational databases (and many non-relational ones — Spanner, FoundationDB, CockroachDB) implement them, with the techniques varying enormously underneath. The applications above sleep peacefully on top.
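A sqlite3 sketch of the transfer (account numbers, balances, and the schema are invented): both updates run inside one transaction, so a failure between them rolls the whole thing back instead of losing the money.

    import sqlite3

    con = sqlite3.connect(":memory:")
    con.execute("CREATE TABLE accounts (id INTEGER PRIMARY KEY, balance INTEGER)")
    con.executemany("INSERT INTO accounts VALUES (?, ?)", [(1, 500), (2, 100)])
    con.commit()

    def transfer(con, src, dst, amount):
        try:
            with con:    # sqlite3: connection as context manager = one transaction
                con.execute("UPDATE accounts SET balance = balance - ? WHERE id = ?",
                            (amount, src))
                if con.execute("SELECT balance FROM accounts WHERE id = ?",
                               (src,)).fetchone()[0] < 0:
                    raise ValueError("insufficient funds")
                con.execute("UPDATE accounts SET balance = balance + ? WHERE id = ?",
                            (amount, dst))
        except ValueError:
            pass         # the 'with' block already rolled everything back

    transfer(con, 1, 2, 100)      # commits: balances become 400 and 200
    transfer(con, 1, 2, 9999)     # aborts:  balances stay 400 and 200
    print(con.execute("SELECT id, balance FROM accounts ORDER BY id").fetchall())
    # [(1, 400), (2, 200)]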
How durability is actually achieved · the write-ahead log
Durability and atomicity look like opposite problems — one wants changes persisted, the other wants them reversible — but the same mechanism handles both: the write-ahead log (WAL). Before any change is made to the actual data pages on disk, the engine writes a record of the intended change to a sequential append-only log file, and flushes that log record to physical storage with fsync(). Only after the log is durably on disk does the engine modify the data pages — and those modifications can be lazy, written back at leisure. If the database crashes, recovery reads the log forward from the last checkpoint, replaying every committed transaction whose changes might not have made it to the data pages, and rolling back every uncommitted transaction whose partial changes might have escaped to disk. The log is the source of truth; the data pages are a cache.
The trick is that sequential appends to the log can be fsynced orders of magnitude faster than scattered random writes to the data pages. The engine takes the cost of one cheap, sequential, durable log write per transaction and amortises the expensive data-page writes asynchronously, batching them and writing them back when convenient. On crash, the log tells recovery exactly what to replay forward (committed but not yet flushed to data pages) and what to undo (uncommitted changes that leaked to data pages). The same mechanism appears in filesystems (Chapter 4 §04, journaled ext4), in distributed databases (Raft and Paxos logs in Chapter 17), and in event-sourcing application architectures. "Write the intent before doing the deed" turns out to be one of the deepest patterns in systems engineering.
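A toy version of the discipline in plain Python (file name and record format invented; a real WAL adds transaction begin/commit markers, checksums, LSNs, and checkpoints): append the intent, fsync it, and only then touch the "data pages", which here are just a dict that replay can always rebuild from the log.

    import json, os

    LOG = "toy.wal"

    def log_write(record):
        # 1. Append the intended change to the log and force it to disk ...
        with open(LOG, "a") as f:
            f.write(json.dumps(record) + "\n")
            f.flush()
            os.fsync(f.fileno())

    def apply(pages, record):
        # 2. ... only then modify the "data pages" (which may be written back lazily).
        pages[record["key"]] = record["value"]

    def recover():
        # On restart, replay the log from the start: the log is the source of truth.
        pages = {}
        if os.path.exists(LOG):
            with open(LOG) as f:
                for line in f:
                    apply(pages, json.loads(line))
        return pages

    pages = recover()
    record = {"txid": 1, "key": "balance:42", "value": 400}
    log_write(record)       # durable once this returns
    apply(pages, record)    # a crash can now happen at any point without loss
    print(recover())        # {'balance:42': 400}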
Isolation levels · the trade between safety and speed
Isolation in the strict sense — serializable isolation — means concurrent transactions behave indistinguishably from some serial execution of those transactions. It is the easiest to reason about and the most expensive to enforce. The SQL standard defines four isolation levels; each step down from Serializable permits more anomalies in exchange for more concurrency. The choice of level is a deliberate engineering trade: how much anomalous behaviour is the application willing to tolerate to gain throughput?
The isolation levels are defined by which anomalies they permit. Read Uncommitted (almost never used) lets you see another transaction's uncommitted changes — your transaction reads "dirty" data that may be rolled back. Read Committed (the PostgreSQL default) only shows committed data, but a row you read might be modified by another transaction before you read it again. Repeatable Read holds the rows you have already read stable, but new rows matching your filter can appear ("phantoms"). Serializable, the strictest, behaves as if every transaction ran by itself. Most production code uses Read Committed (Postgres) or Repeatable Read (MySQL InnoDB); Serializable is reserved for code where the cost of an anomaly outweighs the cost of throughput. Picking the right level is the working SQL programmer's most important concurrency decision — and one most programmers never explicitly think about, because the database default is usually adequate.
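A hedged sketch of how the level is chosen in practice, using PostgreSQL via psycopg2 (the connection string, table, and columns are hypothetical): the transaction asks for SERIALIZABLE, and the application retries if the database aborts it to preserve serializability.

    import psycopg2
    from psycopg2 import errors

    # Hypothetical connection string and schema -- a sketch, not a recipe.
    conn = psycopg2.connect("dbname=shop user=app")

    def withdraw(amount, account_id, retries=3):
        for _ in range(retries):
            try:
                with conn, conn.cursor() as cur:
                    # Must be the first statement of the transaction.
                    cur.execute("SET TRANSACTION ISOLATION LEVEL SERIALIZABLE")
                    cur.execute("SELECT balance FROM accounts WHERE id = %s",
                                (account_id,))
                    (balance,) = cur.fetchone()
                    cur.execute("UPDATE accounts SET balance = %s WHERE id = %s",
                                (balance - amount, account_id))
                return
            except errors.SerializationFailure:
                continue    # another transaction won the race; try again
        raise RuntimeError("could not commit after retries")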
The data structure that turns O(N) into O(log N).
Without indexes, every WHERE clause is a full-table scan. Find one user out of ten million by name? Read all ten million rows. The query works correctly; it just takes minutes instead of milliseconds. An index is an auxiliary data structure that maps values of one or more columns to the rows where those values appear. The structure makes lookups, range scans, and sorts dramatically faster, at the cost of disk space and slower writes (each insert or update has to keep the index in sync). Two index structures dominate every relational database: the B-tree and the hash index. Each wins in a different regime.
B-tree · the workhorse
The B-tree, invented by Rudolf Bayer and Edward McCreight at Boeing in 1971, is the index of choice for relational databases — and for filesystems, key-value stores, and most other systems that keep ordered data on disk. It is a balanced tree designed for the memory hierarchy of Chapter 1: each node holds dozens or hundreds of keys (one disk page worth, typically 4 to 16 KB), the tree is shallow (depth O(log_B N) for branching factor B), and every lookup or insertion traverses a small constant number of pages from root to leaf. A modern B-tree on a billion rows is typically only three to four levels deep; finding any single row is three or four page reads. Most pages are cached in RAM by the time the tree has been used for a while, so lookups become essentially free. Range scans are linear-time in the number of rows returned, because the leaves are linked left-to-right (the B+ tree variant — what every commercial database actually implements).
A B+ tree on the age column. Internal nodes route lookups by comparing against keys; leaf nodes hold the actual (key, row-pointer) pairs. Crucially, the leaves are linked left-to-right, so a range scan — "find every user aged 25 to 40" — does one tree descent to the first matching key (3 page reads at depth 3) and then walks leaves rightward emitting matches until it overshoots. The total cost is roughly the depth of the tree plus the number of matches divided by the leaf size. On a billion-row table that's typically 3–4 page reads plus a small linear scan, milliseconds in total. Without the index, the same query would scan the whole table — minutes. The B+ tree is the reason every "find by ID" or "show me orders from last week" feels instant; without it, the modern web does not work.
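A back-of-envelope illustration in plain Python (synthetic data; a sorted list stands in for the B+ tree's ordered leaf level): the full scan touches every row, while the "index" does one binary-search descent and then walks only the matching range.

    import bisect, random

    random.seed(0)
    N = 1_000_000
    table = [(random.randint(0, 99), uid) for uid in range(N)]    # (age, user_id)

    # Full scan: examine every row.
    scan_hits = [uid for age, uid in table if 25 <= age <= 40]

    # "Index": the same rows kept sorted by age, standing in for the B+ tree's
    # leaf level. One binary-search descent, then a short walk rightward.
    index = sorted(table)
    lo = bisect.bisect_left(index, (25,))
    hi = bisect.bisect_right(index, (40, float("inf")))
    index_hits = [uid for age, uid in index[lo:hi]]

    assert sorted(scan_hits) == sorted(index_hits)
    print(len(index_hits), "matches; scan examined", len(table),
          "rows, index examined about", hi - lo)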
Hash index · O(1) lookup, no ranges
A hash index, by contrast, applies a hash function to the key and uses the result as a lookup into an array. Equality lookups are effectively O(1) — one hash, one array access — but the structure has no ordering, so range queries (WHERE id BETWEEN 100 AND 200) become impossible: the hashed positions are scattered across the array, with no way to walk neighbours. Hash indexes win when the queries are exclusively equality lookups (typically primary-key joins, dictionary lookups, session tables keyed by session ID); B-trees win everywhere else. In PostgreSQL hash indexes exist but are rarely created, because the modern B-tree is so well optimised that its O(log N) cost is invisible in practice. In specialised systems (Redis, in-memory KV stores, hash join intermediates), hash structures are the dominant choice precisely because no range semantics are needed.
In practice nearly every secondary index in a relational database is a B-tree. Hash indexes appear in cases where every access is an exact-match equality on a single column — session tables, primary-key joins after the planner has decided on a hash join, in-memory caches like Redis. PostgreSQL has hash indexes (with WAL logging since version 10), but practitioners rarely create them explicitly because the B-tree's range capabilities are too valuable and the constant factor is small. The optimiser picks; the user mostly does not have to know. But the user does have to know that creating an index on a column the application queries against is the single most effective performance change available — and that creating too many indexes slows every write, because every index has to be kept in sync.
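The effect is easy to measure with SQLite (synthetic table and email values; absolute timings will vary by machine): the same equality lookup goes from a full-table scan to a B-tree search once the column is indexed.

    import sqlite3, time

    con = sqlite3.connect(":memory:")
    con.execute("CREATE TABLE users (id INTEGER PRIMARY KEY, email TEXT)")
    con.executemany("INSERT INTO users (email) VALUES (?)",
                    ((f"user{i}@example.com",) for i in range(1_000_000)))
    con.commit()

    def lookup():
        t0 = time.perf_counter()
        con.execute("SELECT id FROM users WHERE email = ?",
                    ("user999999@example.com",)).fetchone()
        return time.perf_counter() - t0

    print(f"without index: {lookup() * 1000:.2f} ms")   # full-table scan
    con.execute("CREATE INDEX idx_users_email ON users (email)")
    print(f"with index:    {lookup() * 1000:.2f} ms")   # index search, typically far faster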
String concatenation as the attack vector that will not die.
SQL injection is the most consequential vulnerability class in the history of web security and the one with the simplest underlying cause. An application that builds a SQL query by concatenating user input into a string lets the user supply arbitrary SQL. The database, having no way to distinguish "code the developer wrote" from "code the user supplied," dutifully executes both. The result is the attacker reading any data, modifying any data, or in the worst case taking over the database server entirely. The vulnerability has been documented since the late 1990s, the fix has been universally available since the early 2000s, and SQL injection is still on the OWASP Top 10 in 2026. The reason is sociological as much as technical: every new generation of programmers has to discover for themselves that string concatenation is the wrong tool.
The mechanism is small enough to fit on a card. Suppose a login form takes a username and password and constructs the query by string concatenation:
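A sketch of the vulnerable pattern (schema, values, and the login function are invented; the plaintext password is only to keep the example short):

    import sqlite3

    def login(con, username, password):
        # WRONG: user input is concatenated into the SQL text itself.
        query = ("SELECT id, username FROM users "
                 "WHERE username = '" + username + "' AND password = '" + password + "'")
        return con.execute(query).fetchone()

    con = sqlite3.connect(":memory:")
    con.execute("CREATE TABLE users (id INTEGER PRIMARY KEY, username TEXT, password TEXT)")
    con.execute("INSERT INTO users VALUES (1, 'admin', 's3cret')")

    # The attacker types this into the password field:
    print(login(con, "admin", "' OR '1'='1"))
    # (1, 'admin') -- logged in without knowing the password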
The simplest SQL injection: an apostrophe in the input terminates the developer's quoted string, and OR '1'='1 turns the WHERE clause into a tautology that matches every row. The login check returns the first user — usually an admin — and the attacker is logged in. More elaborate payloads use UNION SELECT to exfiltrate data from other tables, DROP TABLE to destroy data, or "blind" techniques that infer data one bit at a time from timing or error messages. The 2007 TJX breach (94 million payment cards), the 2008 Heartland breach (130 million), the 2011 Sony PlayStation Network breach (77 million), and dozens of smaller incidents — including, in 2023, a major US municipal water utility — have all been attributed, in whole or in part, to SQL injection. The cumulative direct cost runs into tens of billions of dollars.
The fix · parameterised queries · and only that
The defence is clean and absolute: never build queries by string concatenation. Use parameterised queries (also called prepared statements), where the SQL text and the data values are sent to the database as separate things. The database parses the SQL once, with placeholder slots, and then receives the actual values through a separate channel that does not interact with the parser. No matter what the attacker sends as a value — an apostrophe, a SQL keyword, a multi-statement payload — it is treated as data, not parsed as code. The predicate is always exactly what the developer wrote.
Parameterised queries solve SQL injection at the protocol level. The SQL text and the data values arrive at the database through different paths in the wire protocol; the parser only sees the SQL text with ? placeholders, and the values are bound to those placeholders after parsing has already produced the execution plan. The same attacker payload that bypassed login in Fig 13.10 — ' OR '1'='1 — becomes a literal string the database is asked to match exactly. No user has that name. The query returns zero rows. Every modern database driver supports prepared statements. Every modern web framework wraps them. The pattern is universally available; the only reason SQL injection still exists is that programmers still occasionally bypass the safe path.
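The same lookup written safely (same invented schema): the SQL text is fixed, the values are bound to ? placeholders, and the attacker's payload is just a string that matches nothing.

    import sqlite3

    def login(con, username, password):
        # RIGHT: the SQL text never changes; values are bound to placeholders
        # after parsing, so they can never be interpreted as SQL.
        query = "SELECT id, username FROM users WHERE username = ? AND password = ?"
        return con.execute(query, (username, password)).fetchone()

    con = sqlite3.connect(":memory:")
    con.execute("CREATE TABLE users (id INTEGER PRIMARY KEY, username TEXT, password TEXT)")
    con.execute("INSERT INTO users VALUES (1, 'admin', 's3cret')")

    print(login(con, "admin", "' OR '1'='1"))    # None -- the payload is just a wrong password
    print(login(con, "admin", "s3cret"))         # (1, 'admin')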
Defence in depth. Parameterised queries close the primary vulnerability, but a serious application stacks more layers. ORMs (object-relational mappers — Django ORM, SQLAlchemy, ActiveRecord, Entity Framework) wrap parameterisation by default; programmers using them rarely write raw SQL at all, which makes a SQLi mistake harder to introduce. Least-privilege database accounts ensure that even a successful SQLi cannot DROP TABLE — the application's connection has only SELECT, INSERT, UPDATE on its own tables. Web Application Firewalls (Cloudflare, AWS WAF, ModSecurity) attempt to detect SQL-injection-shaped strings in HTTP requests and block them before they reach the application. Input validation rejects requests that don't match the expected shape (numeric IDs are numeric; UUIDs match the UUID regex). None of these alone is sufficient; together they make SQLi rare even when one layer fails.
Closing the chapter · seam to Chapter 14
Chapter 13 closes the data-at-rest half of Part IV. The relational model is fifty years old and shows no sign of being replaced. NoSQL systems came and went; most of them eventually added schemas and JOINs and ACID. The reasons are not nostalgia: relational algebra is mathematically complete, the transaction model is mathematically precise, and the optimisation tooling — Selinger's planner, the WAL, the B-tree — has had fifty years of battle-testing. SQL is one of the most durable interfaces in computing.
What it does not do, by itself, is keep data secret. The database stores rows, and anyone with read access to the rows can read the rows. To keep secrets — passwords, payment card numbers, private keys, medical records, any of the data the database is legally and ethically bound to protect — we need cryptography. We have already met it briefly, in Chapter 11 §04, as the substrate of TLS. Chapter 14 takes it apart at full mathematical depth: hash functions worked through, AES round by round, public-key cryptography with actual numbers, elliptic curves drawn, and the post-quantum standardisation now underway. Cryptography is, in a precise sense, the only mathematics in this book whose entire purpose is to be hard to reverse.