Part Three

The Network

Parts I and II built one machine — sand to transistor, kernel to language. Part III steps off the motherboard. The voltage that flips a transistor inside a CPU is, at a different scale, the voltage that crosses a continent. We follow the wire out of the chip and into the world: the physics of the substrate, the architectural decision to send packets rather than circuits, the protocol that pretends a noisy network is reliable, the conventions that make a "website" work, and the language that runs in every browser on Earth. By the end you will be able to read every layer of an HTTPS request from voltage to JSON.

CHAPTER 08
The Wire
Physics of the substrate — copper, fiber, radio. Shannon's information theory. Encoding. Ethernet. The OSI and TCP/IP layer models that organise everything above.
CHAPTER 09
Packets — How the Internet Decided to Work
Circuit-switching vs packet-switching. ARPANET 1969. The IP header byte-by-byte. BGP and the planet-scale graph nobody owns. IPv4 → IPv6. Spoofing and hijacking.
CHAPTER 10
TCP — The Problem of Reliability
Reliability built on top of unreliable. The three-way handshake. Flow and congestion control as control theory. CUBIC, BBR, QUIC. SYN floods.
CHAPTER 11
HTTP, DNS, TLS — The Web
Berners-Lee 1989. What "a website" is at the protocol level. DNS as the internet's phone book. TLS handshake step-by-step. HTTP/2, HTTP/3, QUIC.
CHAPTER 12
JavaScript — The Language That Shouldn't Have Worked
Brendan Eich's ten days. The event loop. Node.js. The DOM. Web security: same-origin, XSS, CSRF, CSP.
Chapter 08

The Wire

The voltage that flips a transistor inside a CPU is, at a different scale, the voltage that crosses a continent. We follow the signal out of the chip and into the world: copper, fiber, radio. The mathematics of how much information any of them can carry. The encodings that turn ones and zeros into wave shapes. The local-network protocols that share the substrate. And the layer model that lets every conversation about networking refer back to a single picture.

Topics: Substrates · Shannon · encoding · Ethernet · OSI
Era covered: 1948 → present
01 — Bridge from Part I

The voltage on a transistor extends to the voltage on a continent.

Part I closed inside a CPU. Part III leaves the CPU. The same voltage transition that switched a transistor in Chapter 1 — a small step from one level to another, interpreted by everything downstream as "the bit changed" — now travels: across the motherboard, out a network interface, down a cable, across an ocean, into another machine on the other side of the planet. The physics is unchanged. What grows is the discipline required to keep the signal intact over that distance.

A wire, electrically, is just a conductor with two ends. Push voltage in at one; some attenuated, slightly delayed version of that voltage shows up at the other. The delay is a function of length and cable type — roughly 5 ns per metre on both copper and fibre, since signals in each travel at about two-thirds the speed of light. The attenuation is a function of frequency, distance, and material. The noise added along the way is a function of everything else in the universe nearby: lightning, fluorescent lights, microwave ovens, other wires, cosmic rays. A one-metre USB cable inside a desk hides all of this. A two-thousand-kilometre undersea cable cannot.

The story of the network is, at its core, the engineering of scaling this single phenomenon. How do you keep a voltage signal recognisable after a kilometre of copper? After a kilometre of fibre? After a hundred kilometres of either? After a wireless link through a forest in the rain? The answers come from physics, mathematics, and a great deal of practitioner cleverness. The voltage on a wire is the substrate; everything else in the next five chapters is what we build on top of it.

Fig 8.1 — One signal, two scales: the same physics, separated by fifteen orders of magnitude in distance — a ~5 nm transistor channel inside a CPU vs ~5 000 km of repeatered cable under the ocean.

A bit travelling between two transistors inside a CPU and a bit travelling between New York and London are the same physical phenomenon — a voltage transition propagating along a conductor — observed at scales fifteen orders of magnitude apart. The CPU version completes in roughly a nanosecond and is barely degraded. The transatlantic version takes about 28 milliseconds, passes through dozens of optical amplifiers, and arrives noticeably degraded — but still recognisable as the original bit pattern. Everything in this chapter is the engineering that makes the second version possible.

02 — Three substrates

Electrons in copper, photons in glass, waves through air.

All networking happens over one of three substrates. Copper carries electrons. Fibre carries photons. Wireless carries electromagnetic waves through space. They look very different. They all do the same thing: transport bits as variations in some physical quantity that someone at the other end can measure.

Copper is the original. A pair of conductors, twisted together to cancel out external electromagnetic interference (this is the "twisted pair" in Cat 5 and Cat 6 cable). The signal is voltage between the two wires. Cheap to manufacture, easy to terminate, but attenuates fast — by about 30 dB per kilometre at the frequencies modern Ethernet uses, which limits a single copper run to roughly 100 metres before a switch or repeater is required. Almost every cable inside a house or office is copper.

Fibre is glass. The signal is light, pulsed at frequencies around 200 THz. The light is launched into a thin glass core surrounded by a cladding with a slightly lower refractive index, so total internal reflection traps the light inside the core all the way to the receiver. Attenuation is dramatically lower — modern fibres lose about 0.2 dB per kilometre, allowing tens to hundreds of kilometres between repeaters. Every undersea cable, every long-distance internet trunk, every data-centre backbone is fibre. The first transatlantic fibre cable, TAT-8, went into service in 1988 and carried 280 megabits per second — at the time, more than the combined capacity of every previous transatlantic cable in history.
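To make the decibel figures concrete, here is a minimal Python sketch that converts the per-kilometre losses quoted above into the fraction of signal power that survives a run:

```python
# Fraction of signal power surviving a cable run, from attenuation in dB.
# Rough per-km losses quoted in this section: ~30 dB/km for copper at
# Ethernet frequencies, ~0.2 dB/km for modern fibre.

def surviving_fraction(db_per_km: float, km: float) -> float:
    """Power out / power in after `km` of cable."""
    return 10 ** (-(db_per_km * km) / 10)

for medium, loss in [("copper", 30.0), ("fibre", 0.2)]:
    for km in (0.1, 1.0, 100.0):
        frac = surviving_fraction(loss, km)
        print(f"{medium:6s} {km:7.1f} km -> {frac:.3e} of the power remains")
```

One kilometre of copper leaves a thousandth of the power; one kilometre of fibre leaves about 95%. That ratio is the whole reason the long-haul internet is glass.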

Wireless is the absence of any medium at all — bits as variations in an electromagnetic wave propagating through space. Wi-Fi runs around 2.4 GHz and 5 GHz; cellular at 700 MHz to several GHz; satellite up to tens of GHz. The trade is fundamental: no wire to install, but every receiver in range hears every transmission, the medium is shared with every microwave oven and every Bluetooth device, and atmospheric absorption is a factor at the higher frequencies. The mathematics of how to share that single medium efficiently is the bulk of cellular and Wi-Fi design.

🛡️

The substrate is the first attack surface. Each of these three media leaks differently. Copper wires emit small amounts of electromagnetic radiation that a receiver nearby can demodulate back into the original signal — the basis of TEMPEST, a US classification programme dating to the 1960s for shielding sensitive equipment against precisely this. (TEMPEST-style attacks against unshielded VGA cables can be performed with off-the-shelf radios from across a room, and with more capable modern software-defined radios, from across a parking lot.) Optical fibres are harder but not immune: bending a fibre slightly leaks a small fraction of its light through the cladding, and the Snowden documents in 2013 revealed that the NSA and GCHQ had been systematically tapping undersea fibre cables at landing stations for years. Wireless is hardest of all to secure: every transmission goes to every receiver in range. The whole story of Wi-Fi security — WEP (Wired Equivalent Privacy, 1997, comprehensively broken by 2001), WPA (2003, broken 2008), WPA2 (2004, broken 2017 by the KRACK attack), WPA3 (2018, still standing) — is the engineering response to the fact that on a radio link, the attacker is always already in the room.

Fig 8.2 — Three substrates, one job: copper carries electrons (~30 dB/km, 100 m runs, houses and offices), fibre carries photons (~0.2 dB/km, hundreds of km, undersea and backbone), wireless carries EM waves (2.4–60 GHz, shared and noisy). Every bit you send travels over one of these three, often all three in succession.

A request from your laptop to a server in Tokyo crosses all three substrates — Wi-Fi from laptop to router, copper or fibre from router to ISP, fibre across the Pacific, more fibre into the destination data centre, finally copper or fibre to the server's NIC. Each hop is a different combination of physical medium and encoding. The application layer never sees any of it; it sees a TCP socket, which sees an IP route, which sees a series of link-layer frames, which sees, finally, a stream of voltage transitions or photon pulses or radio symbols. The whole stack exists to abstract away which substrate is in use at any moment.

03 — Shannon's information theory

1948. Bell Labs again. The number behind every wire.

In July and October of 1948, Claude Shannon — a thirty-two-year-old mathematician at Bell Labs — published "A Mathematical Theory of Communication" in two parts of the Bell System Technical Journal. It is the founding document of information as a quantifiable thing. Before this paper, "amount of information" was a metaphor. After it, information had units (bits) and a formula (entropy). And Shannon proved a result that still defines every modern network: every channel has a maximum bit rate, you can send reliably below it, and you cannot send reliably above it.

Shannon himself is one of the strangest figures in twentieth-century science, and worth knowing. He grew up in Gaylord, Michigan, the son of a probate judge and a high-school principal. He arrived at MIT in 1936 as a graduate student and, in a master's thesis written the next year, did something that no one had thought to do: he showed that the ideas of George Boole's nineteenth-century algebra of logic could be implemented as electrical relay circuits, and that any logical proposition could therefore be computed by a network of switches. Howard Gardner later called this "possibly the most important master's thesis of the twentieth century." Shannon was twenty-one. The thesis is the bridge between Chapter 2 of this book and everything that followed it: it is the moment Boolean logic stopped being philosophy and started being engineering.

He stayed strange. At Bell Labs through the 1940s, where he wrote the information theory paper essentially on his own (the whole field sprang from one head, in one paper, more or less complete), he was known for riding a unicycle through the corridors while juggling. He built an electromechanical mouse named Theseus that could solve a maze and remember the solution — arguably the first artificial learning machine, 1950. He later built rocket-powered pogo sticks, flame-throwing trumpets, and a chess-playing automaton. The papers he wrote in his "spare time" included the first paper on computer chess, the first paper on cryptography as information theory (written during the war and declassified in 1949), and, decades later, a playful mathematical analysis of the Rubik's cube. He worked alone, refused most academic politics, and was difficult to interview. He died in 2001, having lived to see the entire networked world he had given the mathematics to. There is no later figure as singular.

The key definition in his 1948 paper, called entropy after the thermodynamics quantity it resembles, measures how much "surprise" — how much information — is contained in a probability distribution. For a discrete source emitting symbols with probabilities p₁, p₂, …, pₙ, the entropy is:

Shannon entropy

H = −Σ pᵢ log₂(pᵢ)

Measured in bits per symbol. A fair coin (½, ½) has H = 1 bit per flip — maximum surprise. A coin that always lands heads (1, 0) has H = 0 — no information. A biased coin (¾, ¼) has H ≈ 0.81 bits — between the two. The deeper insight is that this number is also the minimum average code length any lossless compression scheme can achieve, asymptotically. Shannon proved you can compress a source down to its entropy, and no further. ZIP, gzip, JPEG, MP3, H.264, every modern compressor — all sit somewhere on Shannon's curve, fighting for the last bit.
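A few lines of Python reproduce the coin numbers above. This is a sketch of the definition itself, not of any particular compressor:

```python
from math import log2

def entropy(probs):
    """Shannon entropy in bits per symbol: H = -sum(p * log2(p))."""
    return -sum(p * log2(p) for p in probs if p > 0)  # 0·log 0 -> 0 by convention

print(entropy([0.5, 0.5]))    # fair coin    -> 1.0
print(entropy([1.0, 0.0]))    # always heads -> 0.0
print(entropy([0.75, 0.25]))  # biased coin  -> ~0.811
print(entropy([0.9, 0.1]))    # 90/10 source -> ~0.469, as quoted for Fig 8.3
```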

Fig 8.3 — Entropy of a binary source as the probability of "1" varies from 0 to 1: zero at the extremes, a maximum of one bit per symbol at the fair coin, ~0.81 bits at ¾/¼.

Entropy of a binary source as the probability of "1" varies from 0 to 1. At the extremes (always 0 or always 1) there is no surprise — every bit is predictable, so each symbol carries zero information. At p = ½ — a fair coin — every flip is maximally surprising and carries one full bit of information. The curve is symmetric: a 70% / 30% source carries the same entropy as a 30% / 70% source, since "uncertainty" doesn't care which side you bet on. This single curve is the lower bound on lossless compression for any binary source. Compress a fair coin and you cannot do better than one bit per flip. Compress a 90/10 source and the theoretical minimum is about 0.47 bits per flip — actually achievable by arithmetic coding to within a few percent.

Shannon's second result, the noisy channel coding theorem, put a number on what a wire can carry. Given a channel of bandwidth B hertz and signal-to-noise ratio SNR, the maximum reliable bit rate — the channel capacity — is:

Shannon–Hartley channel capacity

C = B · log₂(1 + SNR)

If you transmit at rate R < C, there exists a coding scheme that achieves arbitrarily low error rate. If R > C, there is no such scheme — errors are inevitable, no matter how clever the encoding. The proof is non-constructive: Shannon proved good codes exist, decades before anyone knew how to build them. Modern error-correcting codes (LDPC, Polar, Turbo) approach within a fraction of a dB of Shannon's limit. The race to close that gap is the story of half of late-twentieth-century communications engineering.

Fig 8.4 — Channel capacity as a function of SNR: C/B in bits per Hz, from noisy radio (~3 b/Hz) through Wi-Fi 6 (~6.5 b/Hz) and phone modems (~10 b/Hz) to fibre (~12–14 b/Hz). Above the curve C = B · log₂(1 + SNR), errors are inevitable.

Shannon's channel capacity curve: the maximum number of bits per second per Hertz of bandwidth as a function of signal-to-noise ratio. A 56k phone modem operated at roughly 30 dB SNR over a 4 kHz channel; the math gives 4000 × 10 ≈ 40 kbps, close to what was achieved in practice. Modern Wi-Fi 6 at ~20 dB SNR over 80 MHz channels achieves several hundred Mbps. Coherent fibre at ~15 dB effective SNR over tens of GHz of bandwidth pushes terabits per second on a single strand. The curve says nothing about how to achieve these rates — only that they cannot be exceeded. Every modulation, coding, and equalisation technique invented since 1948 is, in the end, a different way to climb closer to this single curve.
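The same arithmetic as a sketch in Python, using the rough SNR and bandwidth figures from the caption:

```python
from math import log2

def capacity_bps(bandwidth_hz: float, snr_db: float) -> float:
    """Shannon-Hartley: C = B * log2(1 + SNR), with SNR given in decibels."""
    return bandwidth_hz * log2(1 + 10 ** (snr_db / 10))

print(f"{capacity_bps(4e3, 30) / 1e3:.1f} kbps")   # 4 kHz phone line -> ~39.9
print(f"{capacity_bps(80e6, 20) / 1e6:.0f} Mbps")  # 80 MHz Wi-Fi 6  -> ~533
```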

04 — Encoding

How a one-or-zero rides on a continuous wave.

Knowing the channel can carry ten million bits per second tells you nothing about how to actually represent those bits as voltage, light, or radio. Encoding is the layer that maps abstract ones and zeros onto physical waveforms the receiver can decode. Different encodings exist because they trade different properties: clock recovery, bandwidth efficiency, robustness to noise, ease of synchronisation. None is optimal everywhere; each is right somewhere.

The most naive encoding is NRZ — Non-Return to Zero. High voltage means 1; low voltage means 0. The signal sits at one of two levels and only changes when a new bit arrives. It is dead simple. It also has a fatal weakness: a long run of identical bits looks like a flat line. The receiver, reading bits at some clock rate, has no way to know when one bit ends and the next begins. Drift even slightly out of sync and the entire stream is corrupted from there onward.

Manchester encoding solves the synchronisation problem by embedding the clock in the data. Every bit is split into two halves; a transition in the middle carries the bit value (high-to-low for one polarity, low-to-high for the other). There is a transition in every bit period, no matter what the data is. The receiver locks onto those transitions and stays in sync indefinitely. The cost is double bandwidth: every bit needs two half-bit slots. Original Ethernet (10 Mbps over coax) used Manchester encoding for exactly this reason — it could not afford to lose synchronisation on a long run of zeros.

Fig 8.5 — Manchester: every bit is a transition in the middle of its bit period, so clock recovery is automatic. Convention: high→low = 1, low→high = 0; the mid-bit edge carries the value.

Manchester encoding represents each bit as a transition in the middle of a fixed time slot. A high-to-low transition is one value (often "1"); a low-to-high is the other. Because there is always a transition mid-bit, the receiver can lock its clock onto the data stream regardless of how many identical bits arrive in a row. The cost is that the signal alternates twice as fast as the underlying bit rate, requiring double the bandwidth — a 10 Mbps Ethernet stream actually carries 20 million signal transitions per second. This is why Ethernet over Manchester encoding maxed out at 10 Mbps over typical Cat 3 twisted pair; the next jump to 100 Mbps required moving to a more efficient encoding (4B5B over Cat 5) plus clever line coding.

Fig 8.6 — Same bits, two waveforms, for the stream 1 0 0 0 0 1 1: NRZ uses half the bandwidth but needs a separate clock (four zeros in a row are a flat line); Manchester uses double the bandwidth but is self-clocking.

The same bit stream — 1 0 0 0 0 1 1 — encoded two ways. In NRZ, four consecutive zeros produce four identical time slots of low voltage; nothing in the signal tells the receiver where one zero ends and the next begins. If the receiver's clock drifts even slightly, the count is lost. In Manchester, every bit is a transition, so the receiver re-syncs its clock on every bit period regardless of the data. The trade is bandwidth: Manchester doubles the rate of signal changes per second. Modern high-speed encodings (8B/10B, 64B/66B) combine the bandwidth efficiency of NRZ with periodic forced transitions to maintain synchronisation — the best of both worlds, at the cost of a small overhead in encoded bits.
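A minimal sketch of the two encodings, modelling each bit period as two half-bit slots at level 0 or 1 (the Manchester polarity follows the convention in Fig 8.5):

```python
def nrz(bits):
    """NRZ: one voltage level per bit, held for both half-slots."""
    return [level for bit in bits for level in (bit, bit)]

def manchester(bits):
    """Manchester, per Fig 8.5: 1 -> high-then-low, 0 -> low-then-high."""
    return [level for bit in bits for level in ((1, 0) if bit else (0, 1))]

bits = [1, 0, 0, 0, 0, 1, 1]
print(nrz(bits))         # four zeros -> eight identical low half-slots (flat line)
print(manchester(bits))  # a mid-bit transition in every single bit period
```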

When the channel cannot carry baseband signals — radio, DSL, cable modems, optical fibre with multiple wavelengths — the bits ride on a carrier wave, a high-frequency sinusoid that the medium can transport. Modulation is the process of varying some property of the carrier in step with the data: amplitude (AM), frequency (FM), or phase (PSK). More elaborate schemes combine these — QAM (Quadrature Amplitude Modulation) varies amplitude and phase simultaneously, packing multiple bits per symbol.

Fig 8.7 — Three ways to ride bits on a carrier: AM (amplitude), FM (frequency), PSK (phase shift). QAM, used in Wi-Fi, cable, and LTE, combines amplitude and phase for 6, 8, or 10 bits per symbol.

Three ways to map digital bits onto a sinusoidal carrier. AM changes the amplitude — easy to see on an oscilloscope, vulnerable to noise (which mostly affects amplitude). FM changes the frequency — invented by Edwin Armstrong in 1933 specifically because it tolerates amplitude noise; this is why FM radio sounds clean during thunderstorms while AM does not. PSK changes the phase — robust, efficient, and the basis of every modern digital radio. Real-world systems compose these: QAM changes amplitude and phase together to pack 4, 6, 8, even 10 bits into a single symbol. Wi-Fi 6 uses 1024-QAM (10 bits per symbol) under good conditions, falling back to 64-QAM or QPSK as the channel degrades.

05 — Local networks

Metcalfe, 1973: many computers, one cable.

In 1973, Bob Metcalfe at Xerox PARC — fresh from doctoral work on packet networks — invented Ethernet. The original idea was elegant in its plainness: any number of computers share one cable; they all listen; whenever no one else is talking, anyone can send; if two start at the same time and the signals collide, both detect it, both back off for a random interval, and both try again. Carrier Sense, Multiple Access, with Collision Detection — CSMA/CD. That single idea, wrapped in increasingly fast versions of itself, became the dominant local-area networking technology on the planet.

Metcalfe arrived at the idea by an unusual route. His Harvard PhD thesis on packet networking had been rejected — the committee felt it lacked theoretical depth — and the rejection sent him into a foul mood and onto a flight to Hawaii, where he had heard about a packet-radio network called ALOHAnet built by Norm Abramson at the University of Hawaii. ALOHAnet let the islands' campuses share one radio frequency by transmitting whenever they had data and retransmitting after random delays when collisions occurred. Metcalfe analysed the math on the plane home, realised that smarter retransmission timing could push the protocol's utilisation far beyond what Abramson's analysis suggested, wrote the corrections into a revised thesis, and got his doctorate. He then took the same insight, improved with carrier sensing ("listen before transmit"), and applied it to a wire instead of radio. That was Ethernet. The thesis Harvard rejected became, with the addition of practical engineering, the protocol that connected most offices in the world.

The Xerox PARC of 1973 was, in retrospect, the densest concentration of computing talent ever assembled in one place. Down the hall from Metcalfe, Alan Kay's group was inventing the personal computer (the Alto, 1973 — the machine that taught Steve Jobs what a graphical interface was), Adele Goldberg and Dan Ingalls were building Smalltalk, Charles Simonyi was writing the editor that would lead to Microsoft Word, and Gary Starkweather was building the laser printer. Metcalfe's job was to network these machines together. His original Ethernet sketch is one of the artefacts in the Computer History Museum: a hand-drawn diagram on a piece of yellow paper, the date "May 22, 1973" in the corner, the basic architecture of every wired LAN since.

The first standardised Ethernet, 10BASE5, ran at 10 megabits per second over a single thick coaxial cable up to 500 metres long — the "thicknet" of the early 1980s. By the 1990s, switches replaced the shared cable; each device got its own dedicated cable to the switch, and collisions stopped happening because there was no one else on the wire. The collision-detection machinery is now mostly vestigial. But every Ethernet frame still carries the same header structure invented for that shared cable — including the famous 48-bit MAC addresses every network card has.

"Networking is inter-galactic."

— Bob Metcalfe, on a whiteboard at Xerox PARC, 1973
Fig 8.8 — Two stations collide; both detect it, both send a JAM, and both back off for different random intervals before retransmitting. Truncated binary exponential backoff: wait = random(0, 2ᵏ − 1) slot-times after the k-th collision, with k capped at 10.

Two stations sense an idle wire and start sending almost simultaneously. The signals collide somewhere along the cable. Both stations detect the collision (the voltage on the wire goes higher than either alone could produce), both stop, both broadcast a brief JAM signal so any third party knows to discard the partial frame, and both pick a random delay before retrying. The randomness is critical — if both used the same delay, they would collide again immediately. Ethernet uses truncated binary exponential backoff: after the k-th consecutive collision, each station waits a random number of slot-times in the range [0, 2ᵏ − 1]. The expected wait grows exponentially with congestion, so heavily-contended networks self-throttle. With switched Ethernet, this whole machinery is dormant — but it is still in every NIC's firmware, and the same random-backoff idea is what makes Wi-Fi (which still has shared-medium collisions) work.
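The backoff rule fits in a few lines. A sketch, assuming the classic 10 Mbps slot time of 51.2 µs (512 bit-times) and the standard give-up threshold of 16 attempts:

```python
import random

SLOT_US = 51.2  # one slot on classic 10 Mbps Ethernet: 512 bit-times

def backoff_slots(collisions: int) -> int:
    """Truncated binary exponential backoff: after the k-th consecutive
    collision, wait random(0, 2^k - 1) slots; k is capped at 10, and the
    frame is dropped after 16 attempts."""
    if collisions > 16:
        raise RuntimeError("excessive collisions: frame dropped")
    k = min(collisions, 10)
    return random.randint(0, 2**k - 1)

for attempt in range(1, 6):
    print(f"collision {attempt}: wait {backoff_slots(attempt) * SLOT_US:.1f} µs")
```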

Every Ethernet device has a MAC address — a 48-bit identifier burned into the network card at manufacture. The first 24 bits are the OUI (Organizationally Unique Identifier), assigned to the manufacturer by the IEEE; the next 24 are the device-specific portion. The MAC address tells you who built the network card. Apple and Cisco each own hundreds of prefixes; Raspberry Pi devices begin B8:27:EB. There are public databases mapping every OUI to its owner.
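Splitting a MAC into its halves takes one line each; here is a sketch with a toy lookup table (only the Raspberry Pi prefix comes from the text; the other entry is hypothetical):

```python
# Splitting a MAC into OUI + device halves, with a toy lookup table.
# Real databases hold tens of thousands of OUIs; the second entry here
# is hypothetical, for illustration only.
OUI_DB = {
    "B8:27:EB": "Raspberry Pi Foundation",
    "AA:BB:CC": "ExampleCo (hypothetical)",
}

def describe(mac: str) -> str:
    mac = mac.upper()
    oui, device = mac[:8], mac[9:]
    return f"OUI {oui} -> {OUI_DB.get(oui, 'unknown vendor')}, device {device}"

print(describe("b8:27:eb:9c:da:c7"))
# OUI B8:27:EB -> Raspberry Pi Foundation, device 9C:DA:C7
```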

MAC addresses identify devices on the local link. IP addresses, which we'll meet in Chapter 9, identify devices on the global network. ARP — Address Resolution Protocol — is the mechanism that bridges the two. When a host wants to send a packet to 192.168.1.5 on its own subnet, it broadcasts a question to every device on the link: "Who has 192.168.1.5? Tell me your MAC." The owner replies; the asker caches the answer for a few minutes; subsequent packets to that IP go straight to the matching MAC.

Fig 8.9 — A MAC address, and the ARP cache that resolves it. The 48 bits split into an OUI half (B8:27:EB, Raspberry Pi Foundation) and a device-specific half (9C:DA:C7); the ARP cache maps local IPs to MACs ("who has 192.168.1.5?" is broadcast, the owner replies, and the answer is cached for ~5 minutes).

A MAC address splits cleanly into manufacturer (OUI) and device-specific halves. The OUI B8:27:EB is owned by the Raspberry Pi Foundation, so this address is on a Pi. ARP keeps a small cache mapping known IP addresses on the local subnet to their MAC addresses. When a host wants to send to an IP not in the cache, it broadcasts an ARP request ("who has 192.168.1.5?"); the owner replies with a unicast ARP response. Entries expire after a few minutes so that disconnected devices don't accumulate. ARP has no authentication — anyone on the link can claim any IP, which is the basis of ARP spoofing (Chapter 15) — but on a properly switched modern LAN, the threat is contained because spoofing requires being on the local segment to begin with.

Fig 8.10 — Hub vs switch: a hub repeats every frame to every port; a switch learns which MAC is on which port by observing traffic — a self-configuring table — and forwards each frame only where it belongs.

A hub is a dumb electrical repeater: every signal that arrives on any port is rebroadcast on every other port. Four hosts on a hub means every frame is heard by all four; collisions are routine; bandwidth is shared. A switch is intelligent: it observes the source MAC of every frame that arrives, learns which MAC sits on which port, and forwards each subsequent frame only to the port leading to its destination. Other ports stay quiet. Modern Ethernet is universally switched, with each cable a private link between one device and one port. The collision-detection machinery still exists in every NIC for compatibility, but on switched links it never fires. This single change — hub to switch, late 1990s — is what made Ethernet scale from megabits to gigabits to terabits per second.

🛡️

MAC flooding — when a switch forgets it is a switch. A switch's MAC table has finite size — usually a few thousand entries on consumer hardware, tens of thousands on enterprise gear. An attacker with access to one port can rapidly send frames with a million different fabricated source MACs. The switch dutifully records each one, fills the table to capacity, and falls back to what it does when it does not know which port a destination MAC sits on: broadcast to every port. The switch has been downgraded to a hub. Every frame on the LAN is now visible to the attacker. This attack — called MAC flooding or sometimes CAM-table overflow — was demonstrated by Mike Beekey in 2000 and is the reason serious networks deploy port security (sticky MAC, max-MACs-per-port) on every access switch. It is also a useful illustration of a recurring pattern in networking: the polite, table-based, learning-by-listening designs of the 1990s assumed every participant played fair, and an attacker who declined to play fair could turn cooperative infrastructure against its users.

06 — OSI & TCP/IP layer models

Why every textbook draws the same cake.

The reason network engineers can talk about network problems precisely is the layer model. Each layer does one thing, and only one thing. Each layer is a customer of the layer below and a provider for the layer above. Once you've drawn the layers, every protocol falls into exactly one slot, and every conversation about networking becomes "which layer is this happening at?"

The canonical reference is the OSI seven-layer model, standardised by the International Organization for Standardization in 1984. Bottom to top: Physical, Data Link, Network, Transport, Session, Presentation, Application. Each layer is named after the abstraction it provides. Physical moves bits over a medium. Data Link moves frames between adjacent nodes. Network moves packets across a global graph of nodes. Transport moves byte streams reliably or messages unreliably end to end. Session, Presentation, and Application are about what the bytes mean.

The TCP/IP four-layer model is the one that actually runs the internet. It collapses Physical and Data Link into a single "Link" layer (since the IP layer doesn't care which kind of link is below it), and collapses Session, Presentation, and Application into a single "Application" layer (since real applications usually handle their own session semantics). The result is shorter, more honest, and describes deployed reality better — but the OSI model remains the teaching reference because its separations are pedagogically cleaner.

Fig 8.11 — The OSI seven-layer cake (ISO, 1984): 7 Application (HTTP, DNS, SSH, SMTP) · 6 Presentation (TLS, gzip, MIME, UTF-8) · 5 Session (RPC, NetBIOS, SOCKS) · 4 Transport (TCP, UDP, QUIC) · 3 Network (IP, ICMP, BGP, OSPF) · 2 Data Link (Ethernet, Wi-Fi, ARP, MAC) · 1 Physical (copper, fibre, radio, NRZ).

Each OSI layer is named after the abstraction it provides to the layer above. Physical turns bits into voltage / light / radio. Data Link bundles bits into frames addressed by MAC and shipped across a single hop. Network bundles frames into packets that find their way across the global graph by IP address. Transport turns packets into reliable byte streams (TCP) or messages (UDP). Session manages connection setup and teardown. Presentation handles data formatting, encryption, compression. Application is where the real protocol lives — HTTP, SSH, DNS. In practice the top three layers blur together — most application protocols handle their own session and presentation logic — which is why the simpler TCP/IP four-layer model is what's actually deployed.

Fig 8.12 — OSI vs TCP/IP, side by side: OSI's seven textbook layers against the four that actually run — Application (HTTP, DNS, TLS, SSH), Transport, Internet, Link (Ethernet plus physical). OSI is pedagogically clean; TCP/IP is honest about what real protocols do.

The seven OSI layers map to the four TCP/IP layers in a clear way. The bottom two (Physical + Data Link) collapse into TCP/IP's Link layer because, from IP's perspective, anything that delivers a frame to the next hop is good enough. The top three (Session + Presentation + Application) collapse into TCP/IP's Application layer because in practice every real application protocol handles its own session and formatting — HTTP, for instance, does its own connection management, its own content negotiation, its own compression. Use OSI when you need to be precise about which layer a feature belongs to. Use TCP/IP when you're describing what's actually on the wire.
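Encapsulation, the mechanical heart of any layer model, can be sketched as nested envelopes. The field subsets below are illustrative only; real headers carry far more, as Fig 9.4 shows for IP:

```python
# Encapsulation as nested envelopes. Illustrative field subsets only --
# real headers carry far more (Fig 9.4 shows the full IP layout).
def http_request() -> bytes:                  # Application layer
    return b"GET / HTTP/1.1\r\nHost: example.com\r\n\r\n"

def tcp_segment(payload: bytes) -> bytes:     # Transport: ports around the data
    return b"[TCP 49152->80]" + payload

def ip_packet(payload: bytes) -> bytes:       # Network: IPs around the segment
    return b"[IP 192.0.2.42->198.51.100.7]" + payload

def ethernet_frame(payload: bytes) -> bytes:  # Link: MACs, valid for one hop only
    return b"[ETH B8:27:EB:9C:DA:C7->8C:85:90:1F:33:42]" + payload

wire = ethernet_frame(ip_packet(tcp_segment(http_request())))
print(wire)  # each layer wraps the one above; Physical finally sends the bytes
```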

The seam to Chapter 9

Chapter 8 has been about the physical and link layers — how a single bit makes it across one hop. Chapter 9 climbs to the network layer. Up to now, every diagram has assumed two devices on the same wire, or at most a few devices on the same Ethernet segment. The internet is something else: billions of devices, on tens of thousands of independently-administered networks, none of them aware of most of the others, and packets that need to find their way across that graph from any source to any destination. The idea that made this work was small, strange, and explicitly Cold War. It is the subject of the next chapter.

Chapter 09

Packets — How the Internet Decided to Work

The phone company spent a century building a network where every call got its own dedicated wire. ARPANET, in 1969, made a different bet: chop messages into pieces, label each piece with a destination, and let it find its own way through the graph. That single architectural decision — packets not circuits — made the modern internet possible. It also made it possible for a misconfigured router in Pakistan to take YouTube off the air for the whole world for two hours.

Topics: Packet switching · IP · BGP · IPv4/v6 · spoofing & hijacks
Era covered: 1964 → present
01 — The phone company's view

A century of dedicated wires.

Before the internet, the dominant model for carrying messages between two points across a continent was the telephone network — and the telephone network was, architecturally, a vast machine for setting up dedicated end-to-end paths between exactly two parties. When you placed a call, a sequence of mechanical or electronic switches in central offices found a free wire from your handset, through trunk lines, to the handset on the other end. That entire path was reserved for your call, moment by moment, until you hung up. The model is called circuit switching, and it had been the right answer to the question of how to carry a voice for almost exactly a hundred years.

Circuit switching has elegant properties. The path is established once, so the latency through the call is constant. There is no congestion mid-call: the bandwidth is yours alone for the duration. But it scales poorly for what computers need to do. A computer-to-computer conversation is not a continuous voice signal; it is a burst of activity followed by a long silence, then another burst. Holding a dedicated circuit for a connection that is silent 99% of the time wastes 99% of the wire. And every new pair of correspondents needs its own end-to-end reservation, which means the network has to know in advance how many simultaneous connections to plan for.

The alternative was packet switching: chop every message into small numbered chunks, write the destination on each chunk, and dump them into the network. The network's only job is to get each chunk closer to its destination. Different chunks may take different paths, arrive out of order, get duplicated, or get lost — and the receiving end is responsible for reassembling and asking for retransmissions. The wires are shared by everyone. A burst from one sender uses bandwidth that a moment later is being used by someone else's burst. There are no reservations, no setup phase, no central authority that has to grant a circuit before traffic flows.

Fig 9.1 — One call, two architectures: circuit switching reserves one dedicated path, held for the duration; packet switching lets chunks find their own paths through a shared graph and reassemble at the far end. Circuits win on latency guarantees; packets win on utilisation and scale.

In circuit switching, the network reserves a fixed path from caller to callee for the entire call. The wires along that path carry only this conversation, even when nobody is speaking. In packet switching, the network is a shared graph; each chunk of message is labelled with its destination and routed independently. Different chunks may take different paths, may arrive out of order, may even get lost — and the receiver is responsible for putting things back together. The phone network ran on circuits for a hundred years and worked beautifully for voice. Computers are bursty and many — packets fit them better. The decision to build the early ARPANET on packets rather than circuits is the architectural choice that made the modern internet possible.

02 — ARPANET 1969

Four nodes. A Cold War. The internet's first heartbeat.

The first message on what became the internet was two letters long and broke the receiving computer. The date was 29 October 1969, the time 22:30 Pacific. Charley Kline, a graduate student in Leonard Kleinrock's lab at UCLA, sat at a terminal connected by a leased 50-kbps telephone line to the Stanford Research Institute, 560 kilometres up the coast. On the other end was Bill Duvall. Their goal was modest: log in remotely, type the word LOGIN, see if the host accepted it. Kline typed L. It arrived. He typed O. It arrived. He typed G — and the SRI machine, which had been trying to be helpful by autocompleting LOGIN when it saw the third letter, crashed under the surprise. The two letters that had crossed the line, LO, were the first packets ever sent on what would become the internet. They got the system back up an hour later, and the connection held for the rest of the evening. By December there were four nodes — UCLA, Stanford Research Institute, UC Santa Barbara, the University of Utah — and the network had a name: ARPANET.

The intellectual ground had been laid years earlier, by three people working independently. Paul Baran at the RAND Corporation, between 1960 and 1964, was working on the most pressing strategic problem of the age: how to keep the United States' command structure functioning during and after a Soviet nuclear strike. The Cuban Missile Crisis had just shown the world how close to thermonuclear war it was; the existing AT&T long-distance telephone network was a hierarchical tree of switching centres, and the loss of any few of those centres would have cut the country into pieces. Baran's RAND memoranda — eleven of them, eventually published as On Distributed Communications — argued that the answer was a different architecture entirely: a richly-connected mesh, no central authority, messages broken into "message blocks" each routed independently, survivable in the face of arbitrary node loss. The Air Force liked the idea; AT&T hated it (their position was, more or less, that you couldn't run a real network without circuits, and they refused to build one); the project stalled politically. Baran moved on. The papers sat in RAND's library waiting to be picked up by the right reader.

Donald Davies at the UK's National Physical Laboratory arrived at the same architecture independently in 1965, named the thing — "packet switching," his phrase — and built a small working network at NPL by 1969. Leonard Kleinrock at MIT, in his 1962 doctoral thesis, had developed the queueing-theoretic mathematics showing that statistical multiplexing on a packet-switched network would utilise bandwidth far more efficiently than circuit switching could ever achieve. Kleinrock then took a faculty position at UCLA. When Larry Roberts at ARPA — the Advanced Research Projects Agency of the US Department of Defense — was tasked in 1966 with building the experimental network the agency wanted, Roberts read all three sets of work, and the design that emerged was unmistakably Baran's mesh, Davies's terminology, and Kleinrock's math, with BBN's engineering providing the IMP (Interface Message Processor) routers. Kline was Kleinrock's grad student. The "LO" message was, in some sense, Baran's idea finally going live, seven years after the Cuban Missile Crisis that had motivated it.

Fig 9.2 — The first internet: ARPANET's four nodes in December 1969 (UCLA, SRI, UCSB, Utah), and how it grew — 15 nodes by 1971, trans-Atlantic links to UCL and NORSAR in 1973, the Cerf–Kahn TCP paper in 1974, ~300 hosts by 1981, the TCP/IP cutover in 1983, decommissioning in 1990. "LO", the first packets, went UCLA → SRI on 29 Oct 1969 at 22:30 PT.

The original ARPANET: four nodes connected by leased 50 kbps lines, switched by Interface Message Processors (IMPs) — refrigerator-sized minicomputers from BBN that handled the packet routing. The first message went from UCLA to SRI; the receiving system crashed after the second character, but the packets had been delivered correctly. Within four years the network spanned the continent and crossed the Atlantic. In 1974 Vint Cerf and Bob Kahn published the paper that defined TCP/IP. In 1983 ARPANET completed its switch from the original NCP protocol to TCP/IP — the day the protocols of the modern internet became the actual protocols on the wire. By 1990 ARPANET was decommissioned, having long since been absorbed into the larger network it had spawned.

Fig 9.3 — Baran 1964: three architectural shapes, drawn for the US Air Force — centralized (destroy the centre, the network dies), decentralized (destroy any one hub, the others continue), distributed (destroy any subset, the rest reroute). The third shape was the internet's topology argument.

Baran's 1964 RAND memoranda included these three sketches. A centralised network has one big switch through which all traffic flows; lose the centre and everything dies. A decentralised network has several regional hubs, each handling its area; lose any one hub and only that region is hurt. A distributed network has no hubs at all — every node connects to several neighbours, traffic finds its way through the mesh. Lose any subset of nodes and the rest reroute around the gaps. Baran's argument: only the third topology survives a nuclear strike. The argument generalised: it also survives router failures, fibre cuts, ISP bankruptcy, BGP misconfigurations, and every other quotidian way networks break. The internet is a Baran-shaped network.

03 — IP, the Internet Protocol

The packet, byte by byte.

A packet on the wire is a precisely-shaped structure of bytes. The first twenty bytes are the IP header — a fixed-format arrangement of fields that every router on the path reads to figure out where the packet should go next, whether to fragment it, when to drop it, and what the higher-layer protocol is. Everything after the header is payload — what the sender actually wanted to deliver. The header is tiny by modern standards, designed in 1981 when every byte cost real money to transmit, and it is essentially unchanged four decades later.

Fig 9.4 — The IPv4 header, byte by byte: 20 bytes, RFC 791, 1981. Version, IHL, Type of Service, Total Length; Identification, Flags, Fragment Offset; TTL, Protocol, Header Checksum; then the 32-bit source and destination addresses. Routers really care about three fields — TTL (decrement), Total Length (forward), Destination (which way?). The source field is set by the sender and never checked — hence IP spoofing.

The IPv4 header is 20 bytes, packed tight, dating to 1981. Reading byte-by-byte: Version (4 = IPv4), IHL (header length in 4-byte units, almost always 5), ToS (Type of Service — historically priority hints, now used for DSCP and ECN), Total Length (header + payload, up to 65 535 bytes). Then Identification + Flags + Fragment Offset for fragmentation reassembly. Then TTL (decremented at every router; if it reaches 0, the packet is dropped — this is what bounds traceroute), Protocol (6 = TCP, 17 = UDP, 1 = ICMP — what's in the payload), and Header Checksum. The last eight bytes are the source and destination IP addresses. The destination is the only field most routers care about. The source is set by the sender and never verified — which is why IP spoofing exists.
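The fixed part of the header is small enough to unpack in a dozen lines of Python. A sketch; the example header is hand-built, with the checksum left at zero:

```python
import socket
import struct

def parse_ipv4_header(raw: bytes) -> dict:
    """Unpack the fixed 20-byte IPv4 header of Fig 9.4 (RFC 791)."""
    (ver_ihl, tos, total_len, ident, flags_frag,
     ttl, proto, checksum, src, dst) = struct.unpack("!BBHHHBBH4s4s", raw[:20])
    return {
        "version": ver_ihl >> 4,
        "ihl": ver_ihl & 0xF,                 # header length, in 4-byte words
        "total_length": total_len,
        "id": ident,
        "flags": flags_frag >> 13,
        "frag_offset": flags_frag & 0x1FFF,   # in 8-byte units
        "ttl": ttl,
        "protocol": {1: "ICMP", 6: "TCP", 17: "UDP"}.get(proto, proto),
        "src": socket.inet_ntoa(src),         # set by sender, never verified
        "dst": socket.inet_ntoa(dst),         # the field routers route on
    }

# Hand-built example: v4, IHL=5, 40 bytes total, ID=42, TTL 64, TCP,
# 192.0.2.42 -> 198.51.100.7, checksum left at zero for the sketch.
hdr = struct.pack("!BBHHHBBH4s4s", 0x45, 0, 40, 42, 0, 64, 6, 0,
                  socket.inet_aton("192.0.2.42"),
                  socket.inet_aton("198.51.100.7"))
print(parse_ipv4_header(hdr))
```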

A packet larger than the maximum transmission unit (MTU) of a link cannot be sent in one piece. Ethernet's standard MTU is 1500 bytes; some tunnel configurations are smaller. When a 4 KB packet must traverse a 1500-byte link, the router (or, increasingly, the original sender) splits it into fragments — each a complete IP packet with the same Identification field, but with the Fragment Offset indicating where in the original payload it sits, and a "More Fragments" flag in the Flags field. The receiving end reassembles. Fragmentation is generally avoided in modern networks; IPv6 forbids router-side fragmentation entirely.

Fig 9.5 — A 4 KB packet, fragmented onto a 1500-byte link: three fragments, all carrying ID=42, at offsets 0, 185, and 370 (in 8-byte units), with the More-Fragments flag clear only on the last. Lose any fragment and the whole packet is lost. IPv6 forbids router fragmentation; the sender must discover the path MTU upfront.

Fragmentation. A 4000-byte packet enters a link whose MTU is 1500 bytes. The router (or sender) breaks it into three fragments; each is a complete IP packet with the same Identification field (42 here) and a Fragment Offset that says where in the original payload it sits. Offsets are in 8-byte units, so 1480 bytes of payload corresponds to offset 185. The "More Fragments" flag is set on every fragment except the last. The receiver buffers fragments by ID, sorts them by offset, and reassembles when it sees MF=0. If any fragment is lost in flight, the entire packet is lost — IP has no way to ask for just one fragment to be retransmitted. Modern systems avoid fragmentation by discovering the smallest MTU on the path (Path MTU Discovery) and sending packets that fit. IPv6 forbids router-side fragmentation entirely; only the original sender can fragment, and only after probing.
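The offset arithmetic is worth working through once. A sketch that reproduces the three fragments of Fig 9.5:

```python
def fragment(payload_len: int, mtu: int, header_len: int = 20):
    """Split a payload for a link with the given MTU, as in Fig 9.5.
    Fragment data must be a multiple of 8 bytes (except the last piece)
    because the offset field counts 8-byte units."""
    max_data = (mtu - header_len) // 8 * 8    # 1500 -> 1480 bytes per fragment
    frags, pos = [], 0
    while pos < payload_len:
        size = min(max_data, payload_len - pos)
        frags.append({"offset": pos // 8, "bytes": size,
                      "MF": int(pos + size < payload_len)})
        pos += size
    return frags

for frag in fragment(4000, 1500):
    print(frag)
# {'offset': 0,   'bytes': 1480, 'MF': 1}
# {'offset': 185, 'bytes': 1480, 'MF': 1}
# {'offset': 370, 'bytes': 1040, 'MF': 0}
```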

04 — BGP & routing

The graph that nobody owns.

The internet is not one network. It is tens of thousands of independent networks, each operated by a different organisation, none of them under common ownership or control, all glued together by a single protocol that lets them announce routes to one another. That protocol is BGP, the Border Gateway Protocol. It dates to 1989, runs on every backbone router on the planet, and was designed with essentially no security at all — a fact the internet still has not fully recovered from.

Each independent network is called an Autonomous System (AS) and gets a number — Cogent is AS 174, Google is AS 15169, your ISP has its own. ASes connect to each other at peering points; the connections form a graph where the nodes are ASes and the edges are direct relationships. BGP is how an AS tells its neighbours: "I can reach the following IP prefixes; here is the path of ASes the traffic will take." Each AS that hears the announcement adds itself to the path and re-announces to its own neighbours, propagating the route across the whole graph.

Fig 9.6 — BGP path-vector announcements: AS 65001 originates 198.51.0.0/16 with path [65001]; transit ASes 65010 and 65020 prepend themselves; AS 65030 picks one and announces [65030, 65010, 65001] onward to AS 65099. Each AS prepends itself, shorter paths win by default, and loops are detected in the list.

AS 65001 announces the prefix 198.51.0.0/16 with path [65001] — "I am the origin." Two transit ASes hear the announcement and prepend themselves: AS 65010 announces [65010, 65001], AS 65020 announces [65020, 65001]. Both reach AS 65030, which now sees two paths to the same destination and picks the shorter one (or applies local policy — most BGP routing decisions are policy, not pure shortest-path). AS 65030 prepends itself and announces [65030, 65010, 65001] onward to AS 65099, which now knows it can reach the prefix in three AS hops. Loop detection is automatic: an AS that sees its own number in the path drops the announcement. This is BGP at its simplest. The protocol has a few hundred more pages, but the path-vector core is this.
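The path-vector core really is small. A toy simulation of the graph in Fig 9.6, under the simplifying assumption that shortest-path is the only policy:

```python
# Toy path-vector propagation over the AS graph of Fig 9.6: prepend
# yourself, prefer the shortest path, drop any path containing your own
# number. (Real BGP layers policy on top of all three steps.)
neighbours = {65001: [65010, 65020], 65010: [65030], 65020: [65030],
              65030: [65099], 65099: []}

routes = {}                                              # AS -> selected AS_PATH
queue = [(peer, [65001]) for peer in neighbours[65001]]  # origin's adverts
while queue:
    asn, path = queue.pop(0)
    if asn in path:                                 # loop detection
        continue
    if asn not in routes or len(path) < len(routes[asn]):
        routes[asn] = path                          # shorter path wins
        for peer in neighbours[asn]:
            queue.append((peer, [asn] + path))      # prepend self, re-announce

for asn in sorted(routes):
    print(f"AS {asn} reaches 198.51.0.0/16 via {routes[asn]}")
# AS 65099 ends up with a three-hop path such as [65030, 65010, 65001]
```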

Fig 9.7 — The internet's tier hierarchy: roughly 10–20 Tier-1 ASes (global, settlement-free peering — Cogent, Lumen, NTT, Telia, Tata), a few hundred Tier-2 (regional — national ISPs, CDNs, cloud providers, paying Tier 1 for transit), and tens of thousands of Tier-3 (local — your ISP, your enterprise, your home router's upstream).

The commercial structure of the internet, simplified. Tier 1 is the small set of ASes that have settlement-free peering with each other and reach every other AS without paying anyone. There are roughly a dozen — Cogent, Lumen (formerly Level 3), NTT, Telia, Tata, Telecom Italia, GTT, and a few more. Tier 2 are large regional or national operators who peer with each other but pay one or more Tier-1s for traffic to the rest of the internet. Tier 3 is everyone else — the ISP your home is on, your university, your cloud provider's edge. The packet your laptop just sent climbs this hierarchy: from your home router up through your ISP (Tier 3 → Tier 2), maybe across one Tier 1 to another Tier 1, back down to a Tier 2 in the destination's region, into a Tier 3 ISP, and finally to the server. The graph is held together by BGP announcements and commercial contracts. There is no central authority.

05 — IPv4 vs IPv6

The thirty-year migration that still isn't done.

IPv4 addresses are 32 bits long. That is 4 294 967 296 possible addresses, which sounded like a lot in 1981 — and in practice the usable total is well below that, because of subnetting overhead and reserved ranges. The world ran out of fresh IPv4 allocations at IANA's central pool on 3 February 2011, and at the regional registries shortly after. The successor — IPv6, defined in 1995 — uses 128-bit addresses, 2⁹⁶ times as many. The migration to IPv6 has been underway for thirty years and is still incomplete; depending on how you count, somewhere between 35% and 45% of internet traffic is IPv6 in 2025.

Fig 9.8 — IPv4 ran out (IANA's free pool exhausted on 3 February 2011), then CGNAT papered over it: thousands of customers behind one public IPv4 address, while IPv6 deployment climbed to roughly 40%. IPv4 never really ran out — it got multiplexed via NAT, at the cost of end-to-end connectivity.

The IANA central pool of IPv4 allocations exhausted in February 2011. The regional registries followed over the next several years. The world did not break — instead, ISPs adopted Carrier-Grade NAT (CGNAT), which shares one public IPv4 address among hundreds or thousands of customers behind a translation layer. CGNAT works for outgoing connections (web browsing, video streaming) but breaks any protocol that needs incoming connections to specific addresses (peer-to-peer, hosting servers from home, SIP). IPv6 deployment has grown steadily since — about 40% of traffic in 2025 — but full migration is still distant. The internet runs on a hybrid where most users are IPv6-capable but most servers are still dual-stack, and the translation between them is happening every nanosecond at every CGNAT box on the planet.

Fig 9.9 — The shape of an IPv4 vs an IPv6 address: 32 bits written as four decimal octets (192.0.2.42, 4.3 billion possibilities) vs 128 bits written as eight 16-bit hex groups (2001:0db8:85a3:0000:0000:8a2e:0370:7334, abbreviable via "::"). 2¹²⁸ ≈ 3.4 × 10³⁸ — enough for every grain of sand on Earth.

An IPv4 address is 32 bits, conventionally written as four decimal octets like 192.0.2.42. An IPv6 address is 128 bits, conventionally written as eight 16-bit groups in hexadecimal, separated by colons. IPv6's address space is so large — 2¹²⁸ ≈ 3.4 × 10³⁸ — that the engineering trade-off completely flips: addresses are no longer scarce, so the protocol can spend them lavishly. Every device gets a public address. Subnets are 64 bits wide on every link. Stateless autoconfiguration lets a device pick a globally unique address automatically. The price for the abundance is that every router, every operating system, every firewall, every networking library on Earth had to be modified — which is why thirty years in, the migration is still not done.
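Python's standard ipaddress module makes the size difference tangible:

```python
import ipaddress

v4 = ipaddress.ip_address("192.0.2.42")
v6 = ipaddress.ip_address("2001:0db8:85a3:0000:0000:8a2e:0370:7334")

print(int(v4))  # 3221226026 -- the 32-bit integer under the dotted quads
print(v6)       # 2001:db8:85a3::8a2e:370:7334 -- "::" collapses the zero run
print(f"{2**32:,} IPv4 addresses vs {2**128:.1e} IPv6 addresses")
```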

06 — Spoofing, hijacks, route leaks

Trust the source field at your peril.

Recall from Fig 9.4 that the IP source address is set by the sender and not verified anywhere along the path. There is no equivalent of a return address being checked against the postmark; the receiver simply sees whatever the sender wrote. This is IP spoofing, and it is built into the protocol. Most modern ISPs filter outgoing packets to stop their own customers from spoofing arbitrary sources (BCP 38 — best current practice from 2000), but enforcement is uneven. Spoofed packets still flow.

🛡️

The Morris worm — 2 November 1988 — the moment computer security became a discipline. Robert Tappan Morris, a twenty-three-year-old graduate student at Cornell whose father was the chief scientist at the NSA's National Computer Security Center, released a self-replicating program onto the early internet that exploited three different vulnerabilities at once: a buffer overflow in the UNIX fingerd service, a debug feature accidentally left on in sendmail, and weak passwords in rsh. Each instance of the worm, on landing on a new host, would replicate itself to nearby hosts. A design miscalculation — Morris made the worm re-infect an already-compromised machine one time in seven, to defeat administrators who might fake an infection — meant machines were re-infected dozens of times, until they collapsed under the load. Within hours, somewhere between five thousand and ten thousand machines (rough estimates suggest about ten percent of the entire internet of the time) were unusable. Morris was traced within days and became the first person ever convicted under the US Computer Fraud and Abuse Act of 1986. He received three years' probation, four hundred hours of community service, and a $10 050 fine. He is now a tenured professor at MIT. Within a week of the worm's release, DARPA funded the creation of the Computer Emergency Response Team (CERT) at Carnegie Mellon — the first organisation specifically tasked with coordinating response to internet security incidents. Every national CERT in every country today traces directly to that decision. The protocols of Chapter 9 had been built in an era of institutional trust; the Morris worm was the proof that the era of institutional trust was over.

Fig 9.10 — IP spoofing: a forged source field
Fig 9.10 — IP spoofing: a forged source field the sender writes the source field — and nobody checks it attacker 203.0.113.7 "real" IP SRC: 198.51.100.5 (forged!) DST: 192.0.2.10 (target) target 192.0.2.10 reply (if any) goes to 198.51.100.5 — not to the attacker → no two-way conversation, BUT plenty of one-way attacks work fine → DNS amplification, SYN floods, NTP reflection — all use spoofed sources BCP 38 (2000) ASKS ISPS TO FILTER · ENFORCEMENT IS UNEVEN · SPOOFED TRAFFIC IS STILL ROUTINE

An attacker writes any source IP they want into the IP header and sends. Routers in between forward the packet based on the destination only. The target receives the packet with the forged source. This is fine for one-way attacks: a TCP SYN flood (Chapter 10) needs only to cause the target to allocate state for a half-open connection, never to receive the reply. DNS and NTP amplification attacks send small spoofed queries that elicit large responses — all directed at the spoofed victim, not the attacker. BCP 38 (2000) recommends every ISP filter outgoing packets so that customers cannot spoof addresses that are not theirs to use; a quarter-century later, deployment of BCP 38 is incomplete and spoofed-source attacks remain routine. The fix exists. The internet has not finished installing it.
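To make the forged-source point concrete, here is what "writing the source field" literally looks like: twenty bytes packed by hand. A sketch in Python; the checksum is the real internet checksum, but actually transmitting this would require a raw socket and root privileges, which is deliberately omitted.

import struct, socket

def ipv4_header(src, dst, payload_len):
    version_ihl = (4 << 4) | 5                   # IPv4, five 32-bit words of header
    header = struct.pack("!BBHHHBBH4s4s",
        version_ihl, 0, 20 + payload_len,        # version/IHL, TOS, total length
        0x1234, 0,                               # identification, flags/fragment offset
        64, socket.IPPROTO_UDP, 0,               # TTL, protocol, checksum placeholder
        socket.inet_aton(src),                   # source: four bytes the sender chooses
        socket.inet_aton(dst))                   # destination: the only field routers use
    s = sum(struct.unpack("!10H", header))       # internet checksum: ones'-complement sum
    while s >> 16:
        s = (s & 0xFFFF) + (s >> 16)
    return header[:10] + struct.pack("!H", ~s & 0xFFFF) + header[12:]

forged = ipv4_header(src="198.51.100.5", dst="192.0.2.10", payload_len=0)
print(forged.hex())                              # a valid header wearing someone else's source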

A more dramatic class of attack exploits BGP itself. Recall: each AS announces which prefixes it can reach, and other ASes trust those announcements. There is, in the original protocol, no mechanism for the hearer to verify that the announcer actually owns the prefix it claims. An AS can announce anything and its neighbours will believe it. If the announcement is more specific than the legitimate one, the hijacker wins the global routing table. This is BGP hijacking, and the canonical example happened on 24 February 2008.

Fig 9.11 — Pakistan/YouTube, 24 February 2008
Fig 9.11 — Pakistan/YouTube, 24 February 2008 how a domestic block became a global outage in two minutes AS 36561 YouTube AS 17557 Pakistan Telecom AS 3491 PCCW (HK) ① 18:47 UTC — Pakistan Telecom announces 208.65.153.0/24 to block YouTube domestically ② their upstream PCCW (AS 3491) leaks the announcement to the global internet ③ /24 is more specific than YouTube's /22 → wins everywhere → YouTube globally offline ~2 hours

On 24 February 2008, Pakistan Telecom (AS 17557) decided to block YouTube domestically by announcing the prefix 208.65.153.0/24 to its own routers, redirecting traffic to a null route. By accident, the announcement was leaked upstream to PCCW (AS 3491), which propagated it to the rest of the internet. Because /24 is more specific than YouTube's legitimate /22, every BGP router on Earth preferred the more-specific route — and now sent YouTube's traffic to Pakistan, where it was being null-routed. YouTube went off the air globally for about two hours, until the announcement was withdrawn and the legitimate routes propagated back. There was no malice and no exploit; one local engineering decision, one configuration mistake, and a protocol with no source authentication produced a planet-scale outage. RPKI (Resource Public Key Infrastructure) and BGPsec are partial answers, deployed slowly. As of 2025, the underlying issue — that BGP trusts whatever an AS says — remains.
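The "more specific wins" rule is small enough to demonstrate. A sketch with Python's standard ipaddress module; the host address is an illustrative value that sits inside both prefixes:

import ipaddress

routes = {
    ipaddress.ip_network("208.65.152.0/22"): "AS 36561 (YouTube, legitimate)",
    ipaddress.ip_network("208.65.153.0/24"): "AS 17557 (Pakistan Telecom, leaked)",
}

addr = ipaddress.ip_address("208.65.153.80")     # an address covered by both prefixes
best = max((net for net in routes if addr in net), key=lambda net: net.prefixlen)
print(best, "->", routes[best])                  # 208.65.153.0/24 -> AS 17557: the /24 wins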

Fig 9.12 — A route leak: traffic takes the wrong door
Fig 9.12 — A route leak: traffic takes the wrong door a route leak detours traffic through an AS that should not be transit SRC DST normal path — direct, ~50 ms leak AS leaking customer route as if it were transit ↓ leaked path — packet takes the wrong door

A route leak is the gentler cousin of a hijack: an AS that has heard a route announcement re-announces it to a peer that should not have received it. Most commonly, an AS that buys transit from two providers accidentally re-announces routes from one provider to the other, becoming, in effect, a free transit provider between them. Traffic that was supposed to take a direct, fast path now detours through the leaking AS — sometimes through entirely the wrong continent. There is no malice; just a misconfiguration in a router somewhere. Because BGP has no notion of "this announcement should not be propagated" baked into the protocol, leaks ripple. The IETF's RFC 7908 catalogues the standard leak types. The defences (route filters, the IRR database, RPKI ROAs, ASPA) all live in the same uncomfortable place: technically possible, partially deployed, defeated by the fact that the protocol fundamentally trusts what its peers say.

The seam to Chapter 10

Chapter 9 has built the layer that gets a packet from any source to any destination — best-effort, possibly out-of-order, possibly lost. Chapter 10 builds reliability on top of that unreliability. The protocol it describes — TCP — is older than the web, older than most of the people reading this, and still carries somewhere over half of all internet traffic, with the rest split between UDP-based applications and the new QUIC protocol that abandons TCP entirely. The mathematics that makes TCP work is control theory; the engineering that scales it is forty years of careful refinement. We will spend Chapter 10 inside it.

Chapter 10

TCP — The
Problem of
Reliability

IP delivers packets best-effort: lossy, out-of-order, sometimes duplicated. For voice and DNS that's fine — UDP rides directly on top and applications cope. For everything else — file transfer, web pages, long-lived connections — somebody has to make an unreliable network behave reliably. TCP is that somebody. Its mathematics is control theory; its engineering is forty years of careful refinement; its attack surface is the reason every server has a SYN flood story.

TopicsUDP · 3-way handshake · AIMD · CUBIC · BBR · QUIC · SYN flood
Era covered1974 → present
Chapter 10 hero · TCP — The Problem of Reliability client seq=X server seq=Y SYN SYN-ACK ACK three segments. one connection.
01 — UDP first

The simplest thing that could possibly work.

Before TCP, before the elaborate machinery of reliability, there is the simplest possible thing you can do with IP: just send packets. Don't number them. Don't track them. Don't ask whether they arrived. That is UDP — the User Datagram Protocol, RFC 768, 1980, three pages long. An eight-byte header on top of IP. No connection. No retransmission. No ordering. No back-pressure. And yet UDP carries an enormous fraction of modern internet traffic — DNS, every video call, every game, every voice chat, every QUIC connection — because for these applications, the simplicity is the feature.

The trade is explicit. UDP says: I will hand your bytes to the IP layer; whether they arrive, in what order, possibly duplicated, possibly lost, is your problem. Most applications find this terrifying. A few find it liberating. A live phone call cannot wait for retransmission — a packet that arrives 200 milliseconds late is worse than a packet that didn't arrive at all (the human ear notices a hiccup; it does not notice a 20-millisecond gap). A DNS lookup is a single small request and a single small response — TCP's three-segment setup overhead would double the lookup's latency to no benefit. For these workloads, UDP is not a downgrade from TCP; it is the right tool.

Fig 10.1 — UDP: eight bytes, no state, no apologies
Fig 10.1 — UDP: eight bytes, no state, no apologies UDP header — RFC 768, August 1980, three pages 0 16 31 Source Port 16 bits · sender's port (0 if unused) Destination Port 16 bits · receiving service (e.g. 53 = DNS) Length 16 bits · header + data Checksum 16 bits · optional in IPv4, mandatory in IPv6 payload — whatever the application wants opaque to UDP — DNS query, RTP audio frame, QUIC packet… what UDP does NOT do: — no connection setup — no retransmission of lost packets — no ordering, no flow control, no state where it wins: — DNS (single round trip · 53/udp) — voice / video (loss < latency) — games · QUIC · NTP · DHCP

UDP is essentially "an envelope around your bytes." Eight bytes of header — source port (where it came from), destination port (which service to deliver to), length, optional checksum — and then your payload. There is no concept of a "connection" between two UDP endpoints; each packet stands alone. There is no notion that a previous packet was sent, or that a future packet will be. For applications that need exactly that — fire-and-forget, real-time, single-shot — UDP is the right protocol. For applications that need a reliable byte stream, TCP picks up where UDP refuses to.
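The whole protocol fits in a few lines of socket code. A sketch of both ends on the loopback interface (the port number is arbitrary); notice everything that is not here: no connect, no handshake, no retransmission, no ordering.

import socket

# receiver (one terminal): bind, wait for one datagram, answer it
rx = socket.socket(socket.AF_INET, socket.SOCK_DGRAM)
rx.bind(("127.0.0.1", 9999))
data, peer = rx.recvfrom(2048)       # blocks until a datagram arrives
rx.sendto(data.upper(), peer)        # no connection: every reply is addressed per-packet

# sender (another terminal): no handshake, no state, just send
tx = socket.socket(socket.AF_INET, socket.SOCK_DGRAM)
tx.sendto(b"hello", ("127.0.0.1", 9999))
print(tx.recvfrom(2048)[0])          # b'HELLO', if both datagrams survived the trip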

02 — Why TCP

Four engineering goals, written 1974, still in force.

In 1974, Vint Cerf and Bob Kahn published "A Protocol for Packet Network Intercommunication" in the IEEE Transactions on Communications. The paper described a protocol — they called it simply TCP — designed to do four specific things on top of an unreliable packet network: segment a long byte stream into manageable chunks, order them so the receiver sees them in the right sequence, ensure their reliable delivery despite arbitrary loss, and apply flow control so a fast sender does not overwhelm a slow receiver. Every piece of machinery TCP carries is in service of one of those four goals.

The architecture follows. Every byte of the application's stream is assigned a sequence number — a 32-bit counter, incremented for each byte, that lets either side say exactly which part of the stream a segment carries. The sender retains a buffer of sent-but-not-yet-acknowledged bytes; if no acknowledgement arrives within a timeout, it retransmits. The receiver maintains a buffer of out-of-order arrivals and reassembles in sequence-number order. Receivers periodically advertise a window — how many more bytes they have buffer space to accept — and senders never have more than that many in flight. Each of these is straightforward in isolation. Together, on a network that loses ten percent of its packets, they create the illusion of a reliable byte stream that loses nothing.

The cost of all this is state. Every TCP connection — client and server — keeps a kernel-level data structure called the TCB (Transmission Control Block) tracking sequence numbers, window sizes, retransmit timers, the path's measured round-trip time, the congestion-control state, and a few dozen other values. A modern Linux server holding a million TCP connections simultaneously holds a million TCBs in kernel memory, with all the bookkeeping that implies. UDP's strength is that it has no state. TCP's strength is that it has all the state that reliability requires.

03 — The three-way handshake

Three segments. Two synchronised counters. One conversation.

A TCP connection begins with a three-segment exchange. SYN, SYN-ACK, ACK. Three round-trip halves, ending with both sides knowing each other's starting sequence numbers, both sides having allocated a TCB, both sides ready to stream. Why three? With two segments, the server's starting sequence number would never be acknowledged. Four would be redundant. Three is exactly what you need so that each side has both proven the other is reachable and acknowledged the other's starting counter.

Fig 10.2 — The three-way handshake, in time
Fig 10.2 — The three-way handshake, in time CLIENT SERVER CLOSED LISTEN SYN, seq=X "I want to talk; my ISN is X" SYN_SENT SYN_RCVD SYN-ACK, seq=Y, ack=X+1 "my ISN is Y; I got yours" ACK, seq=X+1, ack=Y+1 "got yours; counters synced" ESTABLISHED ESTABLISHED 1.5 RTTS BEFORE THE FIRST APPLICATION BYTE — A PERMANENT TAX TCP CHARGES PER CONNECTION

Three segments cross the network. The SYN carries the client's chosen Initial Sequence Number (ISN), randomly selected for security. The SYN-ACK carries the server's ISN and acknowledges the client's by setting ack = X+1. The final ACK closes the loop the same way, acknowledging the server's ISN with ack = Y+1. After this, both sides know both sequence numbers and the connection is ESTABLISHED. The whole exchange takes 1.5 round trips before a single byte of application data can flow — a permanent tax that motivated TFO (TCP Fast Open) and, eventually, the design of QUIC, which folds connection setup into the encryption handshake to save round trips.

The state both endpoints traverse during connection setup, data transfer, and teardown is captured in TCP's finite state machine — a diagram every networking student commits to memory at some point. There are eleven states. Most connections move through them in a single well-trodden path: CLOSED → SYN_SENT → ESTABLISHED → FIN_WAIT_1 → FIN_WAIT_2 → TIME_WAIT → CLOSED. The off-path states exist to handle simultaneous opens, simultaneous closes, lost FINs, and the dozen other partial-failure modes the protocol was designed to survive.

Fig 10.3 — TCP's state machine
Fig 10.3 — TCP's state machine CLOSED LISTEN SYN_SENT SYN_RCVD ESTABLISHED FIN_WAIT_1 CLOSE_WAIT FIN_WAIT_2 LAST_ACK TIME_WAIT CLOSING listen() recv SYN recv ACK connect() recv SYN-ACK / send ACK close() recv FIN recv FIN close() recv ACK 2·MSL timeout SERVER PATH (RED) · CLIENT PATH (GREEN) · CLOSE PATH (GOLD) — 11 STATES, 1981 RFC 793

TCP's eleven-state finite state machine, the canonical version from RFC 793 (1981). The active opener (client) goes CLOSED → SYN_SENT → ESTABLISHED. The passive opener (server) goes CLOSED → LISTEN → SYN_RCVD → ESTABLISHED. Connection teardown is the four-state dance on the right and bottom: each side independently decides to close, sends a FIN, waits for the peer's ACK, then waits for the peer's FIN, then waits a final 2·MSL in TIME_WAIT (twice the Maximum Segment Lifetime: two minutes per RFC 793, though Linux waits 60 seconds in total) before fully closing — to ensure any straggler packets from the previous incarnation of this connection have died out before the four-tuple (src IP, src port, dst IP, dst port) can be reused.
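The well-trodden paths fit in a small transition table. A sketch, not the full diagram: the simultaneous-open and simultaneous-close corners of RFC 793 are omitted, and the event names are informal labels.

TRANSITIONS = {
    ("CLOSED",      "connect/send SYN"):      "SYN_SENT",
    ("SYN_SENT",    "recv SYN-ACK/send ACK"): "ESTABLISHED",
    ("CLOSED",      "listen"):                "LISTEN",
    ("LISTEN",      "recv SYN/send SYN-ACK"): "SYN_RCVD",
    ("SYN_RCVD",    "recv ACK"):              "ESTABLISHED",
    ("ESTABLISHED", "close/send FIN"):        "FIN_WAIT_1",
    ("FIN_WAIT_1",  "recv ACK"):              "FIN_WAIT_2",
    ("FIN_WAIT_2",  "recv FIN/send ACK"):     "TIME_WAIT",
    ("TIME_WAIT",   "2MSL timeout"):          "CLOSED",
    ("ESTABLISHED", "recv FIN/send ACK"):     "CLOSE_WAIT",
    ("CLOSE_WAIT",  "close/send FIN"):        "LAST_ACK",
    ("LAST_ACK",    "recv ACK"):              "CLOSED",
}

state = "CLOSED"                 # the active opener's whole life, event by event
for event in ["connect/send SYN", "recv SYN-ACK/send ACK",
              "close/send FIN", "recv ACK", "recv FIN/send ACK", "2MSL timeout"]:
    state = TRANSITIONS[(state, event)]
    print(f"{event:24} -> {state}")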

04 — Flow and congestion control

How fast is too fast? Ask the network.

After the handshake, the real engineering begins. The sender must decide how fast to send. Too fast: the receiver drops packets, queues in the network overflow, and the connection wastes work and degrades. Too slow: bandwidth sits idle. The right speed changes from millisecond to millisecond as other flows enter and leave the network. TCP has no central authority telling it the answer; each connection must figure it out empirically, in real time, using only what it can observe — acknowledgements arriving (or not arriving) — as feedback. The result is one of the most beautiful pieces of distributed feedback control ever deployed.

The first piece is the sliding window. The receiver advertises, in every ACK it sends, the size of its receive buffer that remains free — the receive window. The sender promises never to have more than that many bytes in flight (sent but not yet acknowledged). As ACKs arrive, the trailing edge of the window advances; new bytes can be sent at the leading edge. The window slides forward through the byte stream as data is acknowledged. Receiver-side flow control ensures the sender never overwhelms the receiver's ability to consume.

Fig 10.4 — The sliding window
Fig 10.4 — The sliding window sender's view of the byte stream: 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 acknowledged in flight (unACKed) window space future / can't send yet ← window: 8 bytes → ACK 104 arrives → window slides right by 4: 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 ← window slides forward 4 bytes → RECEIVER ADVERTISES WINDOW IN EVERY ACK · SENDER NEVER EXCEEDS IT · NATURAL FLOW CONTROL

The sender's view of the byte stream is divided into four regions: bytes already acknowledged (safe to forget), bytes in flight (sent but unconfirmed — kept in retransmit buffer), bytes within the window (allowed to send but not yet sent), and the future (cannot send yet — outside the window). When ACK 104 arrives, the window's trailing edge advances, freeing the in-flight bytes and creating room at the leading edge for new bytes to enter the window. The window slides through the stream as ACKs arrive. The receiver controls the window size, so a slow consumer naturally throttles a fast producer — flow control with no extra messaging.
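The four regions and the slide are a few lines of arithmetic. A sketch mirroring Fig 10.4's numbers: a window of 8 bytes, the stream at position 100, then ACK 104 arrives.

una, nxt, wnd = 100, 104, 8      # oldest unACKed byte, next byte to send, window size

def report(una, nxt, wnd):
    print(f"acked <{una}  |  in flight {una}..{nxt - 1}  |  "
          f"may send {nxt}..{una + wnd - 1}  |  blocked >={una + wnd}")

report(una, nxt, wnd)            # in flight 100..103, may send 104..107
una = 104                        # ACK 104 arrives: everything below 104 is confirmed
report(una, nxt, wnd)            # nothing in flight (104..103 is empty), may send 104..111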

The sliding window prevents the sender from outrunning the receiver. It does nothing to prevent the sender from outrunning the network. That is what congestion control is for, and it is the deeper of the two problems. The story of how it entered TCP is worth telling in detail.

In October 1986, the bandwidth between Lawrence Berkeley National Laboratory and UC Berkeley — two sites four hundred yards and a few router hops apart — collapsed from its design capacity of 32 kilobits per second to roughly 40 bits per second. A factor of eight hundred. The collapse was sudden, the path was lightly loaded, no hardware had failed, and no one knew why. The two endpoints assumed they could simply send at their advertised window size; the network's queues filled; packets were dropped; TCP retransmitted; the retransmits piled into the same queues; the queues overflowed again. Each retransmit produced more retransmits. The connection's effective throughput fell by three orders of magnitude while neither side was, locally, doing anything wrong. Van Jacobson at LBL, working with Michael Karels of Berkeley's UNIX group, spent the better part of the next two years investigating what was happening. Their observation — published in 1988 as "Congestion Avoidance and Control" — was one of the great results of computer networking: TCP's existing flow-control machinery (the receive window) accounted for the receiver but said nothing about the network in between, and a sender with no reason ever to slow down for the network would always, eventually, congest it to death. The paper's fix added two pieces — slow start and AIMD — that became part of every TCP stack in the world over the next four years. Without that fix, the internet of the 1990s would not have existed; every link of significant utilisation would have spent its life collapsing. The 1986 incident is one of those quiet moments where computer science had to grow a new sub-discipline overnight, in response to a phenomenon nobody had seen before. The sub-discipline it grew is the one this section describes.

The algorithm has two phases. Slow start begins cautiously — a tiny congestion window (cwnd) of one segment — and doubles on every successful round trip. Exponential growth probes the network's capacity quickly. When packet loss is detected (a timeout, or three duplicate ACKs), the connection assumes it has hit the network's capacity and switches to congestion avoidance: cwnd is halved, then increases by one segment per RTT — linearly. Slow probing replaces fast probing. Another loss triggers another halving. The window oscillates around the true capacity in a sawtooth pattern.

Fig 10.5 — The TCP sawtooth
Fig 10.5 — The TCP sawtooth cwnd (segments) 0 10 20 30 time (RTTs) slow start cwnd × 2 / RTT loss → ½ congestion avoidance — cwnd + 1 / RTT — until next loss true capacity (unknown to TCP)

Slow start (green): exponential — cwnd doubles every round trip until the first loss. Congestion avoidance (red): additive — cwnd grows by one segment per RTT, until loss. Each loss halves the window. Repeat. The window oscillates around the network's true capacity in this sawtooth shape, sometimes called the AIMD curve. TCP never knows the capacity; it discovers it by overshooting and being punished for the overshoot, then backing off and probing slowly upward. The whole machinery runs in every flow simultaneously, in distributed harmony. Loss is the only feedback signal, and loss is the universal language every router in between speaks (by dropping packets when its queues fill).
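The sawtooth takes about ten lines to reproduce numerically. A toy model under one invented assumption: a fixed capacity of 32 segments that the sender never sees directly, with loss whenever cwnd exceeds it.

capacity = 32                        # the network's true capacity: TCP never sees this
cwnd, ssthresh = 1, 64
for rtt in range(40):
    if cwnd > capacity:              # a queue overflowed somewhere: loss detected
        ssthresh = max(cwnd // 2, 1) # remember half the window that failed
        cwnd = ssthresh              # multiplicative decrease
    elif cwnd < ssthresh:
        cwnd *= 2                    # slow start: exponential probing
    else:
        cwnd += 1                    # congestion avoidance: additive increase
    print(rtt, cwnd)                 # plot this column and Fig 10.5 appears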

Why AIMD — Chiu & Jain, 1989

The genius of AIMD — Additive Increase, Multiplicative Decrease — is not just that it works, but that it is the only combination that works. Chiu and Jain proved in 1989 that, of the four possible combinations of additive/multiplicative increase paired with additive/multiplicative decrease, only AIMD converges to both efficiency (the network is fully utilised) and fairness (every flow gets an equal share). MIMD preserves the ratio between flows, so an unfair start stays unfair forever. AIAD preserves the absolute gap between them — the same problem in different clothes. MIAD diverges outright. AIMD is the unique stable point. It is one of those rare results in engineering where the right answer is forced by the mathematics, not chosen by taste.

Fig 10.6 — AIMD's fairness, in phase space
Fig 10.6 — AIMD's fairness, in phase space flow A's allocated bandwidth → flow B's bandwidth → A + B = C capacity A = B (fair) optimal: efficient AND fair start: A > B add → up the 45° diagonal multiplicative ½ → toward origin moves closer to fairness line why AIMD converges — Chiu & Jain 1989

A two-flow phase plane (also called the Chiu–Jain plot). The horizontal axis is flow A's bandwidth allocation; the vertical, flow B's. Points above the red line exceed total capacity (someone loses). Points on the green line are fair (A = B). The optimal target is the intersection: full capacity, equal share. AIMD's geometry forces convergence: additive increase moves both flows up the 45° diagonal (the absolute gap between them stays the same but shrinks as a proportion of each). Multiplicative decrease on loss moves both flows toward the origin along a ray (preserving the ratio A:B, which means the absolute gap shrinks). Each cycle moves the system closer to the fair-and-efficient point. No central coordination. Just two flows, both running AIMD locally, both converging to fairness automatically. Among the most elegant distributed-systems results in computing.
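The convergence can be watched numerically too. Two flows sharing a toy 100-unit link, one starting with eight times the other's share, both blindly running AIMD; the capacity and increment are invented values for illustration:

capacity = 100.0
a, b = 80.0, 10.0                   # flow A starts with eight times flow B's share
for step in range(30):
    if a + b > capacity:            # overload: both flows see loss, both halve
        a, b = a / 2, b / 2
    else:                           # spare capacity: both add the same increment
        a, b = a + 5, b + 5
    print(f"A={a:6.2f}  B={b:6.2f}  ratio={a/b:5.2f}")
# the ratio falls toward 1.00: additive steps shrink the gap as a proportion,
# multiplicative halving keeps the proportion fixed, so every cycle is fairer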

05 — CUBIC, BBR, QUIC

Three answers to TCP's mid-life crisis.

Classic TCP — Tahoe, Reno, NewReno — was designed for the late-1980s internet, where round-trip times were tens of milliseconds and bandwidths a few megabits. The modern internet has flows that span continents (200 ms RTT), gigabit endpoints, satellite links with 700 ms RTTs, and mobile connections whose radio conditions change every second. Classic TCP loses badly under these conditions; the slow linear growth phase takes too long to fill a fat-pipe long-RTT link, and the loss-as-signal model fails on lossy wireless links where most loss has nothing to do with congestion. The last two decades produced three answers — each more radical than the last.

CUBIC (2008, Linux's default) keeps loss-based detection but replaces the linear additive-increase phase with a cubic function. After a loss, the window grows slowly at first, then faster, then slows again as it approaches the previous maximum. The shape is tuned to fill a long-fat pipe quickly without overshooting. Modern Linux defaults to CUBIC; it is what the great majority of internet TCP traffic actually runs.

Fig 10.7 — CUBIC vs Reno: same loss, different recovery
Fig 10.7 — CUBIC vs Reno: same loss, different recovery time after loss event (RTTs) cwnd loss · cwnd halved Reno — linear · slow on fat pipes CUBIC — slow near old peak, then aggressive probe upward previous peak (W_max)

After the same loss event, Reno's window grows linearly (one segment per RTT) — fine on the slow links of 1988, terrible on a 100 ms / 1 Gbps link where it takes thousands of RTTs to refill. CUBIC's window growth is a cubic polynomial centred on the previous loss-causing window: it grows slowly near that previous peak (cautiously revisiting where it failed), then accelerates above it (probing for new capacity). The shape is tuned by the cubic constant and the saved W_max. CUBIC fills modern long-fat-pipe links in a few seconds where Reno would take minutes. It has been Linux's default since kernel 2.6.19 (2006).
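The curve itself is one formula from the CUBIC paper: W(t) = C·(t − K)³ + W_max, where K is chosen so the curve starts at the post-loss window and crosses W_max exactly at t = K. A sketch with the paper's standard constants:

C, beta = 0.4, 0.3                  # the paper's standard constants
W_max = 100.0                       # cwnd (segments) at the last loss event

K = (W_max * beta / C) ** (1 / 3)   # seconds until the curve returns to W_max
for t in [0.0, 1.0, 2.0, 3.0, round(K, 2), 5.0, 6.0, 7.0]:
    W = C * (t - K) ** 3 + W_max
    print(f"t={t:5.2f}s  cwnd={W:6.1f}")
# t=0 gives 70.0 (the post-loss window, 0.7 of W_max), t=K gives 100.0,
# and beyond K the cubic term takes off: cautious near the old peak, then fast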

BBR (2016, Google) is more radical: forget loss as a signal. Instead, measure the network directly — bandwidth and round trip time — and aim for the bandwidth-delay product (BDP), the precise amount of in-flight data that fills the pipe without overflowing buffers. BBR is what Google deploys at its edge for YouTube and Search. In challenging environments (lossy Wi-Fi, transcontinental links, buffer-bloated home routers), BBR can be two to ten times faster than CUBIC. It is also more aggressive against competing CUBIC flows, which remains a source of debate.

Fig 10.8 — BBR aims for the bandwidth-delay product
Fig 10.8 — BBR aims for the bandwidth-delay product "the pipe" — bandwidth × RTT = in-flight bytes that fill it exactly RTT (one-way time × 2) ≈ 100 ms bandwidth · 1 Gbps BDP = bandwidth × RTT = 1 Gbps × 100 ms = 12.5 MB of in-flight bytes — exactly the right amount BBR measures both directly · keeps cwnd at this value · avoids buffer-bloat entirely

A network "pipe" between two endpoints has two dimensions: bandwidth (how many bytes per second can flow) and round-trip time (how long the bytes stay in transit). The product, BDP = bandwidth × RTT, is the amount of in-flight data that fills the pipe exactly. Less than BDP and the pipe is partly empty (under-utilisation). More than BDP and the excess piles up in some router's queue, increasing latency and eventually causing loss (this is "buffer bloat"). BBR — Bottleneck Bandwidth and Round-trip propagation time — measures both quantities continuously and tries to keep cwnd at exactly BDP. It is loss-agnostic: a packet lost to wireless interference is not interpreted as a congestion signal, only as a packet to retransmit. Google deploys BBR at the edges of its own infrastructure; YouTube playback in the developing world is meaningfully smoother because of it.

QUIC (Google 2012, IETF standard 2021) is the most radical answer of all: abandon TCP entirely. QUIC runs over UDP and builds reliability, encryption, multiplexing, and congestion control all in one combined protocol that lives in user space, not the kernel. Connection setup folds into the TLS 1.3 handshake — zero-RTT setup is possible for repeat connections. Loss in one HTTP request stream doesn't block other streams (the "head-of-line blocking" that hampered HTTP/2 over TCP). QUIC is what HTTP/3 runs on. By 2024 it carried somewhere over 25% of internet traffic, and the share is growing.

Fig 10.9 — TCP+TLS+HTTP vs QUIC: the same job, two stacks
Fig 10.9 — TCP+TLS+HTTP vs QUIC: the same job, two stacks classic — three layers, three handshakes HTTP/1.1 or HTTP/2 request multiplexing in HTTP/2 TLS 1.2 / 1.3 encryption · auth · 1–2 RTT setup TCP reliability · ordering · congestion · 1 RTT setup IP routing · best-effort delivery TCP setup + TLS setup = 2–3 RTTs before first byte QUIC — TLS, multiplexing, transport in one HTTP/3 stream API on top of QUIC QUIC reliability + ordering + congestion control + TLS 1.3 + multiplexed streams + 0-RTT resumption · in user space connection ID survives IP changes UDP just packets · no state · no setup first request can ship in 1 RTT, or 0-RTT on resume QUIC moves transport into user space, where it can iterate as fast as the application HTTP/3 = HTTP/2 SEMANTICS · QUIC TRANSPORT · UDP PACKETS · HARDWIRED TLS

QUIC collapses three classical layers (TCP, TLS, HTTP/2) into one user-space protocol over UDP. The benefits are concrete: 0-RTT resumption (a returning client can ship its first request alongside the handshake), per-stream multiplexing (loss in one HTTP request doesn't block the others, fixing HTTP/2-over-TCP head-of-line blocking), connection migration (the connection ID lives above IP, so a phone switching from Wi-Fi to cellular keeps its QUIC connection alive — TCP, whose connections are named by the IP-and-port four-tuple, could not), and iterability (because QUIC is in user space, both endpoints can deploy new versions without needing OS kernel updates). The trade is that QUIC reimplements all of TCP's reliability machinery in user space, including congestion control — Linux kernel TCP is still faster per packet because it's been hand-optimised for thirty-five years. As of 2025, all major browsers, all major CDNs, and Google's whole edge run HTTP/3 over QUIC.

06 — SYN floods, cookies, and the rest

The price of state.

TCP's strength is that the kernel remembers every connection. TCP's weakness is the same: an attacker who can make the kernel allocate state without ever completing a connection can, with very little effort, exhaust the server's connection tables and lock out legitimate users. The classical attack — the SYN flood — is more than thirty years old and still works against unprotected servers.

The attack uses one feature of the three-way handshake: the server allocates a TCB the moment it receives a SYN, well before the third ACK arrives. An attacker sends thousands of SYN packets per second, each with a different (often spoofed) source IP. The server replies with SYN-ACK to each spoofed source — which goes nowhere or to an uninvolved third party — and waits for the third ACK that never arrives. Each half-open connection consumes a TCB slot for tens of seconds (the SYN_RCVD timeout). With a sufficient SYN rate, the server's connection table fills and legitimate clients cannot connect.

Fig 10.10 — SYN flood: half-open connections fill the table
Fig 10.10 — SYN flood: half-open connections fill the table attacker spoofed sources 10 000 SYN/sec target's connection table (SYN_RCVD slots) half-open from 198.51.100.7 half-open from 203.0.113.3 half-open from 192.0.2.45 half-open from 198.51.100.99 half-open from 203.0.113.41 half-open from 198.51.100.5 half-open from 192.0.2.11 half-open from 203.0.113.78 TABLE FULL legitimate clients refused EACH SLOT HELD ~30S WAITING FOR THIRD ACK · ATTACKER SUSTAINS THE FLOOD INDEFINITELY

A flood of SYN packets, each with a different (often spoofed) source IP, arrives at the target. The target's kernel allocates a TCB for each one and parks it in SYN_RCVD waiting for the third ACK. Because the source addresses are spoofed, the SYN-ACK responses go to addresses that didn't ask for anything; no third ACK ever comes. Each half-open entry sits in the table until the SYN_RCVD timeout fires (typically tens of seconds). At a few thousand SYN packets per second — easy for a single home connection to generate — the table fills. New legitimate connections are refused. The classical attack from the 1990s; still works against unprotected servers; the reason every modern OS ships with SYN cookie support and the reason cloud DDoS protection (Cloudflare, AWS Shield) exists.

The defence — invented by Daniel J. Bernstein in 1996 — is SYN cookies. The trick: when the connection table is under pressure, the server stops allocating state for incoming SYNs at all. Instead, it encodes the necessary state into the Initial Sequence Number it returns in the SYN-ACK. The encoding is a keyed hash of the (source IP, source port, destination IP, destination port, time bucket) — small enough to fit in 32 bits, hard to forge without knowing the server's secret key. If the attacker's third ACK ever arrives (it won't, because the source was spoofed), the server can verify the cookie, reconstruct the connection state from scratch, and proceed. If the legitimate client's third ACK arrives, same thing. The server kept no per-connection state during the flood.

Fig 10.11 — SYN cookies: the state goes into the wire
Fig 10.11 — SYN cookies: the state goes into the wire when the SYN backlog gets full, the server stops allocating state — and lets the client carry it ① SYN arrives src=A, port=p, seq=X ② compute cookie cookie = hash(secret, A, p, … + time bucket + MSS hint) ③ SYN-ACK with seq = cookie no TCB allocated if 3rd ACK ever comes back… ④ ACK arrives, ack=cookie+1 recompute hash, compare if matches → reconstruct state ⑤ now allocate the TCB connection becomes ESTABLISHED all state derived from the cookie no per-connection memory was used during the flood — only verified ACKs cause state to be created Daniel J. Bernstein, 1996 · enabled by default in modern Linux when the SYN backlog overflows

Under SYN-flood pressure, the server skips the TCB allocation entirely. It computes a cryptographic cookie — a keyed hash of the connection's identifying tuple plus a time bucket plus a few bits encoding the MSS hint the server would normally have remembered. The cookie is returned as the server's Initial Sequence Number. Now the state is in the wire, not in the kernel. If a real client completes the handshake, its ACK number will be cookie+1; the server recomputes the hash from the ACK's tuple, verifies it matches, and only then allocates the TCB. An attacker cannot forge cookies (the secret is unknown) and cannot benefit from spoofed sources (the matching ACK never comes back). The server keeps zero per-connection state during the flood — and legitimate connections still work. Daniel J. Bernstein, 1996. One of the most beautiful pieces of security engineering ever shipped.
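The trick is small enough to sketch. A toy version in Python that uses an HMAC where real kernels use a purpose-built hash, and ignores the MSS-hint bits they squeeze into the low-order positions; the shape is the same: the ISN is a keyed function of the tuple and a coarse time bucket, so the server can verify without remembering.

import hmac, hashlib, os, time

SECRET = os.urandom(16)                   # the server's private key

def make_cookie(src_ip, src_port, dst_ip, dst_port, t=None):
    bucket = int(t if t is not None else time.time()) // 64    # 64-second buckets
    msg = f"{src_ip}:{src_port}>{dst_ip}:{dst_port}@{bucket}".encode()
    return int.from_bytes(hmac.new(SECRET, msg, hashlib.sha256).digest()[:4], "big")

def verify(ack, src_ip, src_port, dst_ip, dst_port):
    now = time.time()                     # accept the current or previous bucket,
    return any(                           # in case the handshake straddled an edge
        ack == make_cookie(src_ip, src_port, dst_ip, dst_port, now - d) + 1
        for d in (0, 64))

isn = make_cookie("198.51.100.5", 40321, "192.0.2.10", 443)        # the SYN-ACK's seq
print(verify(isn + 1, "198.51.100.5", 40321, "192.0.2.10", 443))   # True: allocate the TCB
print(verify(12345,   "198.51.100.5", 40321, "192.0.2.10", 443))   # False, almost surely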

Beyond the SYN flood, two other classical TCP attacks deserve note. RST injection — an attacker on the path injects a TCP RST segment with a sequence number in the receiver's window, causing the receiver to tear down the connection. The Great Firewall of China uses this for censorship: when the firewall sees a request it dislikes, it sends RST segments to both ends, killing the connection and blaming the network. TCP hijacking — when an attacker can predict the sequence numbers of an existing connection, they can inject data into it pretending to be one of the endpoints. The attack had been described academically by Robert T. Morris (later of the worm) in a 1985 Bell Labs technical report, but it was considered theoretical for a decade after — until Kevin Mitnick, on Christmas Day 1994, used it on the live internet for the highest-profile intrusion of the era.

Mitnick's target was Tsutomu Shimomura, a security researcher at the San Diego Supercomputer Center known for his work on cellular phone security. Shimomura's home network ran on trusted rsh — a UNIX command that, on the assumption that the source IP could not be forged, granted login access without a password to authorised hosts. The catch: the source IP could be forged, if you also knew TCP's sequence numbers, and Mitnick had spent weeks fingerprinting the not-quite-random ISN generator on Shimomura's machines. On Christmas Day, Mitnick mounted a multi-step attack: he SYN-flooded a trusted host on Shimomura's network to take it out of the conversation, then opened a TCP connection to Shimomura's main workstation while spoofing the source IP of the now-incapacitated trusted host. Predicting the ISN, Mitnick blind-sent the third ACK plus an immediate echo + + >> /.rhosts command — adding a wildcard entry to the trust file that would let him log in normally afterward. The attack worked. Shimomura discovered the intrusion two days later, traced Mitnick's activity across multiple sites (some of it through cellular networks Shimomura himself had earlier helped secure), and led the FBI to Mitnick's apartment in Raleigh, North Carolina, on 15 February 1995. The case made the front page of The New York Times; a book (Shimomura and John Markoff's Takedown) and a film based on it followed. The technical lessons that came out of it — randomise ISNs, do not trust source IPs for authentication, authenticate connections cryptographically rather than by network identity — are the foundation on which TLS, the subject of Chapter 11, would be built.

The random ISN requirement was retrofitted into every operating system within a year of Mitnick's arrest. Modern stacks generate ISNs cryptographically (RFC 6528, 2012) so that sequence numbers cannot be predicted even with full knowledge of previous connections. Sequence-number-based hijacking, which had been one of the canonical TCP attacks for fifteen years, became mostly historical. The deeper conclusion — that the network can never authenticate who you are, only what you say, and so applications must authenticate cryptographically on top — is the conclusion that produced TLS, certificates, and the entire modern HTTPS web.

🛡️

The recurring pattern. TCP gives you reliability, ordering, and back-pressure — but each strength is built from kernel-side state, and that state is the attack surface. The defences are nearly all about making the protocol stateless under attack: SYN cookies (state in the wire), random ISNs (state in the unguessable), TLS underneath (state cryptographically authenticated). Every layer of the network stack, from here onward, exhibits the same arc: protocol invented for a reason; protocol exploited; protocol patched in a way that pushes state into harder places. Chapter 11's TLS handshake will follow the same shape, only with mathematics, not just hashes, doing the cryptographic heavy lifting.

The seam to Chapter 11

Chapter 10 has built a reliable byte stream on top of an unreliable packet network. Chapter 11 puts the actual web on top: HTTP for fetching documents, DNS for finding servers by name, TLS for keeping every byte of the conversation private and authenticated. The stack is now: voltage on copper (Chapter 8) → IP packet (Chapter 9) → reliable TCP byte stream (Chapter 10) → HTTP request and TLS encryption (Chapter 11). Two chapters from now the stack is complete and we'll have followed every byte of an HTTPS request from voltage to JSON.

Chapter 11

HTTP · DNS · TLS
The Web

The internet is not the web. The internet is the packet network of Chapters 8 through 10 — voltage on copper, IP datagrams, reliable TCP streams. The web is the layer above it: three protocols invented in 1989 at a particle physics lab, plus a handful of cryptographic primitives invented in the 1970s by mathematicians, that together turn a network of packets into a worldwide library of trustworthy documents. By the end of this chapter you will be able to read every byte of an HTTPS request from voltage to JSON.

TopicsHTTP · DNS · TLS · CA chain · HTTP/2 · QUIC
Era covered1989 → present
Chapter 11 hero · HTTP · DNS · TLS The Web IP · best-effort packets TCP · reliable byte stream TLS · encrypted, authenticated GET / HTTP/1.1 every byte verified by mathematics
01 — Berners-Lee 1989

A proposal that nobody asked for.

In March 1989, a 33-year-old British physicist working at CERN — the European nuclear research lab outside Geneva — submitted a 17-page document to his manager titled Information Management: A Proposal. His name was Tim Berners-Lee. The document described a system for linking documents across computers using any network — extending an idea already implemented in CERN's internal database. His manager, Mike Sendall, wrote in the margin of the cover page: "Vague but exciting." He approved the project, partly to give Berners-Lee something to do. By Christmas 1990 the world's first web server was running on a black NeXT workstation in Building 31 at CERN.

The setting matters. CERN in 1989 was the world's largest particle physics laboratory: thousands of scientists from dozens of countries collaborating on experiments that took years to plan and produced terabytes of data. The work was inherently distributed — a detector built in Italy ran software written in France against data analysed in Sweden, all reporting to a paper in Physical Review with co-authors in twelve countries. Documents lived everywhere: research papers on FTP servers, calibration tables in flat files on shared disks, equipment manuals on a secretary's PC, design notes in a homegrown CERN database called ENQUIRE. Finding anything required knowing where to look. Sharing anything new required emailing it as an attachment to people you guessed might want it. The information existed; following it from one document to another required a human acting as the link.

Berners-Lee's proposal was, in essence, that the link itself should be part of the document. A reference in one document should point at another document on another computer — and a click on that reference should fetch and display the target. This is the idea of hypertext, and it was not new: Vannevar Bush had described something like it in 1945 ("As We May Think," The Atlantic); Ted Nelson had named it in 1965 and spent three decades trying to build a perfect version (Project Xanadu, which never shipped). What Berners-Lee added was the recognition that hypertext's bottleneck was no longer the idea — it was the plumbing. TCP/IP was widespread by 1989. UNIX workstations were common. The hard problem was just defining a few simple conventions — how documents are addressed, how they are requested, how they are formatted — and then implementing them.

He defined three:

  • URL — Uniform Resource Locator. A way to write down where any document lives, in any system. http://info.cern.ch/hypertext/WWW/TheProject.html — protocol, host, path. The address-on-an-envelope of the web.
  • HTTP — HyperText Transfer Protocol. The procedure for one machine to ask another for a document at a URL and receive it back. Originally five lines of text on the wire.
  • HTML — HyperText Markup Language. A simple format for documents that includes links to other documents as inline elements rather than appendices.

Each by itself was uninteresting. Together they made the web. By December 1990 he had implemented all three on a NeXT workstation in his office at CERN. The server was a program called httpd; the client was a program he called WorldWideWeb — both browser and editor in one window. The first page he served was a description of the project, at the address info.cern.ch. (The URL still resolves; the original page was reconstructed in 2013 and is still online.) On 6 August 1991 he posted to the Usenet newsgroup alt.hypertext a short message announcing the existence of the World Wide Web project and inviting other implementations. That message is the public birth of the web.

Fig 11.1 — March 1989 to August 1991 · a quiet two and a half years
Fig 11.1 — March 1989 to August 1991 · a quiet two and a half years CERN Building 31, third floor — one developer, one workstation, three protocols THE MACHINE NeXT NeXT cube 25 MHz · 8 MB RAM "This machine is a server" — do not power off THE FIRST WEBPAGE info.cern.ch/hypertext/ WWW/TheProject.html World Wide Web The WorldWideWeb (W3) is a wide- area hypermedia information retrieval initiative aiming to give universal access to a large universe of documents. See also: What's out there? [underlined links to other docs] Mar 1989 proposal "vague but exciting" Oct 1990 code begins URL · HTTP · HTML Dec 1990 first server live info.cern.ch Aug 1991 Usenet announce the web is public

The whole web in 1991 was a NeXT cube in Berners-Lee's office, a server program he had written, a browser-editor he had also written, and a few HTML files describing the project. The machine had a sticky note on its case: "This machine is a server. DO NOT POWER IT DOWN!!" The first publicly announced URL was info.cern.ch/hypertext/WWW/TheProject.html. By the time Berners-Lee posted to alt.hypertext on 6 August 1991, the protocols and the content had existed for nine months and one user had been using them. Two years later there were five hundred web servers. Five years later it was unstoppable.

"The Web is more a social creation than a technical one. I designed it for a social effect — to help people work together — and not as a technical toy."

— Tim Berners-Lee, Weaving the Web (1999)

Crucially, none of the three protocols required anyone else's permission. HTTP runs on top of TCP, which runs on top of IP, which runs on the existing physical internet. URLs needed no central registry — any owner of any host could just start serving documents under their own domain. HTML was just text; anyone could write it. There was no Web Consortium yet. There was no licence. There was no fee. The web spread because the cost of joining was zero and the value of having joined was that you could now find things that other people had decided to publish. Within five years of that Usenet post the number of web servers was doubling every four months. CERN had built a worldwide library by accident, while looking for a better way to organise lab notes.

The architectural choice that mattered

One decision separates the web from the dozen earlier hypertext systems that never caught on. Berners-Lee allowed dangling links. In Project Xanadu, every link required the target document to formally register itself with the link system; if the target moved or disappeared, the link broke and the publisher was supposed to update it. In the web, a link is just a URL; if the URL no longer resolves, the browser returns a 404 and the publisher is none the wiser. This sounds like a flaw — and famously it produces "link rot" as the web ages — but it was the price of decentralisation. A system that demands a working link to every published target is a system with a central authority. A system that tolerates 404s is a system anyone can join without asking.

The trade was correct. The web exists; Xanadu does not.

📜

The 1993 release. On 30 April 1993 CERN released the entire web — code, protocols, specifications — into the public domain, with a one-page document declaring no royalties were owed and no patents would be filed. That document is now in the CERN archive. It is, by some measures, the most economically significant piece of paper of the late twentieth century. Releasing the technology rather than licensing it foreclosed the possibility that any single company could capture it — and is the reason every later attempt to build a "private web" (AOL, MSN, Compuserve in their walled-garden phases) eventually surrendered.

02 — HTTP

Five lines of text. Forever.

HTTP is the simplest protocol in this book. It is so simple you can speak it by hand, with a keyboard, against a real web server, and watch the bytes come back. Open a TCP connection on port 80 to any web server, type GET / HTTP/1.0, press Enter twice, and the server will reply with a status line, a few headers, and the document. That transcript is HTTP. Three decades of evolution have added optional headers, content negotiation, persistent connections, multiplexing, compression, and finally a binary wire format — but the conversational shape "client asks, server answers" has not changed. HTTP/1.0 from 1996 still works against modern servers; you can still speak HTTP/1.1 over a plain socket by hand.
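Here is that transcript as a program rather than a keyboard session. A sketch: a plain TCP socket, port 80, the request typed out as bytes (example.com really does answer this):

import socket

s = socket.create_connection(("example.com", 80))
s.sendall(b"GET / HTTP/1.0\r\n"
          b"Host: example.com\r\n"
          b"\r\n")                        # the blank line ends the headers
reply = b""
while chunk := s.recv(4096):              # HTTP/1.0: the server closes when done
    reply += chunk
s.close()
print(reply.decode(errors="replace")[:300])   # status line, headers, then the HTML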

A request has a structure. The first line is the request line: a verb (called a method), a path, and a protocol version. After it come headers — one per line, each a name-colon-value pair, terminated by a blank line. After the blank line, optionally, comes a body (for methods like POST and PUT that send data). A response has the same shape: a status line with the protocol version, a numeric status code (200, 404, 500, …) and a short reason phrase, followed by headers, then a blank line, then the response body.

Fig 11.2 — A complete HTTP exchange · request and response, byte for byte
Fig 11.2 — A complete HTTP exchange · request and response, byte for byte CLIENT → SERVER GET /index.html HTTP/1.1 Host: example.com User-Agent: curl/8.4.0 Accept: text/html, */*;q=0.8 Accept-Encoding: gzip, br Accept-Language: en, fi Connection: keep-alive [blank line — header / body separator] (GET has no body) ↑ method · path · version ↑ headers — name: value, one per line ↑ blank line marks end of headers over TCP port 80 SERVER → CLIENT HTTP/1.1 200 OK Date: Fri, 01 May 2026 10:00:00 Server: nginx/1.24.0 Content-Type: text/html Content-Length: 1256 Cache-Control: max-age=3600 Connection: keep-alive [blank line] <!DOCTYPE html> <html>…1256 bytes…</html> ↑ version · status code · reason ↑ headers describe the body ↑ body — the actual document STATUS CODE FAMILIES 1xx — informational 100 Continue, 101 Switching Protocols 2xx — success 200 OK, 201 Created, 204 No Content 3xx — redirection 301 Moved Permanently, 304 Not Modified 4xx — client error 400 Bad Request, 401, 403, 404, 429 Too Many 5xx — server error 500 Internal Server Error, 502 Bad Gateway, 503 Unavailable first digit categorises; the rest just identifies. Three digits, ~70 codes assigned, hundreds reserved.

The entire HTTP wire format. Three things make a request: a request line (method, path, version), zero-or-more name-value headers, and an optional body — separated from the headers by a single blank line. The response has the same shape with a status line replacing the request line. The 200 above means "the request succeeded and the body is the requested document"; a 404 would mean "the document does not exist"; a 500 would mean "the server crashed trying." Every web request you have ever made has this exact shape underneath, even if a browser is hiding it from you.

The big idea: stateless

One design decision in HTTP echoes through the rest of the web's architecture. HTTP is stateless: the server treats every request as independent. There is no concept of a "session" at the protocol level. The server is not required to remember that you asked for /index.html ten seconds ago when you ask for /style.css now. Each request stands alone and contains everything the server needs to answer it.

This sounds inconvenient — and it is, for any application that needs to remember a logged-in user. Every workaround we use to fake state on top (cookies, session IDs, JWT tokens, OAuth flows) exists to paper over this fundamental statelessness. But the property is what made the web scale. A stateless server can handle requests from a million different clients in any order, on any thread, on any of a thousand servers behind a load balancer, without any of them needing to share memory. It is the reason the web survived Facebook becoming popular and Wikipedia becoming popular and YouTube becoming popular without requiring fundamental redesigns of the protocol. Statefulness scales with engineering effort; statelessness scales with money.

What actually happens when you press Enter

Type https://example.com/ into a browser and press Enter. Watching with a packet sniffer, the sequence of events is precise and ordered. The browser does not "open the page." It performs about twenty separate operations across four layers, and the page only appears at the end.

Fig 11.3 — One curl, fully traced
Fig 11.3 — One curl, fully traced curl https://example.com — every layer, in time order t = 0 ~80 ms ① DNS — turn "example.com" into 93.184.216.34 UDP query to OS resolver, often hits the local cache · ~5 ms warm, ~50 ms cold ② TCP — three-way handshake to 93.184.216.34:443 SYN, SYN-ACK, ACK · one round-trip · ~20 ms over a continent (Chapter 10) ③ TLS 1.3 — encrypted, authenticated channel ClientHello + key share → server certificate, key share, encrypted Finished verify cert chain, derive session keys · one round-trip · ~20 ms (we will pull this apart in §05) ④ HTTP — encrypted GET / HTTP/1.1 sent over the TLS channel to TLS this is just opaque bytes; to the server inside, it's the request from Fig 11.2 ⑤ HTTP — server sends 200 OK and the document body, encrypted by TLS curl decrypts, prints the body to stdout · one more half-round-trip ⑥ TCP close (or keep-alive idle for the next request) FIN, ACK, FIN, ACK — or pool the connection for the next GET

Eighty milliseconds of work between the moment you press Enter and the moment the page begins to paint. Most of it is round-trips: one for DNS, one for TCP, one for TLS 1.3, one for HTTP. The browser and server are exchanging little structured messages at every layer. From the wire, the HTTP request inside step ④ is indistinguishable from random bytes — TLS is doing its job. From the user's chair, all of this looks like "the page loaded." The whole rest of this chapter is the inside view of stages ①, ③, and ④ above.
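The stages can also be performed one at a time from the standard library, which makes the layering visible in a way curl hides. A sketch; certificate verification happens inside wrap_socket:

import socket, ssl

host = "example.com"
addr = socket.getaddrinfo(host, 443)[0][4][0]        # stage 1: DNS, name to address
raw = socket.create_connection((addr, 443))          # stage 2: TCP, SYN / SYN-ACK / ACK
ctx = ssl.create_default_context()                   # stage 3: TLS handshake plus
tls = ctx.wrap_socket(raw, server_hostname=host)     #          certificate verification
print(tls.version())                                 # e.g. 'TLSv1.3'
tls.sendall((f"GET / HTTP/1.1\r\nHost: {host}\r\n"   # stage 4: the request from Fig 11.2,
             "Connection: close\r\n\r\n").encode())  #          opaque bytes on the wire
page = b""
while chunk := tls.recv(4096):                       # stage 5: the encrypted 200 OK
    page += chunk
tls.close()
print(page.split(b"\r\n")[0])                        # b'HTTP/1.1 200 OK'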

The methods, and why some of them matter

A method is a verb. It tells the server what kind of operation the request represents. HTTP defines about a dozen, but four are dominant: GET (fetch a resource), POST (submit data, possibly creating a new resource), PUT (replace a resource with the supplied data), and DELETE (remove a resource). The choice of method is not arbitrary; it is part of the contract with caches, proxies, and CDNs. Two properties matter especially: safety (the request does not change the server's state — GET and HEAD are safe) and idempotency (repeating the request has the same effect as a single request — GET, PUT, and DELETE are idempotent; POST typically is not). Caches assume safety; retry logic assumes idempotency. A POST that should have been a PUT will, on a flaky network, occasionally charge a credit card twice.

Fig 11.4 — HTTP methods · what they promise, what caches and clients assume
Fig 11.4 — HTTP methods · what they promise, what caches and clients assume the verb tells everyone — caches, proxies, retry loops — what to do on failure METHOD USED FOR SAFE? IDEMPOTENT? CACHEABLE? GET read a resource yes yes yes HEAD like GET, headers only — used for cache validation yes yes yes POST submit · creates a new resource · process data no no rarely PUT replace a resource entirely with the body no yes no DELETE remove a resource no yes no PATCH apply a partial modification to a resource no no* no OPTIONS describe what methods/headers the server accepts (CORS preflight) yes yes no * PATCH is idempotent only when the patch format itself is — JSON Merge Patch is, JSON Patch generally is not. CONNECT, TRACE omitted — tunnel and debug methods, rarely visible to applications.

REST APIs lean on this table heavily: each HTTP method maps to one CRUD operation (GET=read, POST=create, PUT=update, DELETE=delete) and the method's properties are the contract with the rest of the network. A reverse proxy like nginx or Cloudflare can cache GETs aggressively because they are safe and idempotent; it must never cache POSTs because the same POST body sent twice may create two records. A retry library should resend GETs on failure but should be cautious with POSTs — the server may have processed the first one but failed before sending its 200. Idempotency keys (a header applications add to make POSTs effectively idempotent) exist to paper over that gap.

⚠️

The double-charge problem. Stripe's idempotency-key documentation (and every payment processor's equivalent) exists because of one specific failure mode: client sends POST /charge, server processes the charge, server crashes before sending the 200, client times out, client retries — and now the customer is charged twice. The fix is for the client to attach a unique Idempotency-Key header; the server records the key alongside the result of the first execution and, on a duplicate request with the same key, returns the stored result instead of re-executing. The mechanism turns a non-idempotent operation into an effectively idempotent one. Every serious payment, signup, or stateful POST in production runs through some version of this dance — because HTTP itself does not guarantee what its method properties promise; the network does that, and the network is not entirely reliable.
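The server side of the dance reduces to a lookup before the work and a store after it. A sketch in which every name is invented for illustration, not any particular processor's API; a real deployment persists the key-to-result table and scopes keys per client.

import uuid

results = {}                               # key -> stored outcome; a real store persists this

def charge(card, amount_cents, idempotency_key):
    if idempotency_key in results:         # a retry of a request we already finished
        return results[idempotency_key]    # replay the stored answer, never re-execute
    receipt = f"charged {card} {amount_cents} (txn {uuid.uuid4().hex[:8]})"
    results[idempotency_key] = receipt     # record the result before replying
    return receipt

key = uuid.uuid4().hex                     # the client picks one key per intent
first = charge("4242", 1999, key)
retry = charge("4242", 1999, key)          # the network flaked; the client resent
print(first == retry)                      # True: one charge, not two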

03 — DNS

The phone book of the internet.

HTTP needs a destination. The destination, on the wire, is an IP address — a 32-bit number on the IPv4 internet, 128 bits on IPv6. Humans do not remember 93.184.216.34. Humans remember example.com. Something has to translate the second into the first, and it has to do this billions of times per second across the planet, with sub-second latency, and almost never lie. That something is the Domain Name System — a hierarchical, replicated, cached, mostly decentralised distributed database, designed by Paul Mockapetris in 1983 (RFC 882, RFC 883, then refined into RFC 1034 and RFC 1035 in 1987). DNS is older than the web, older than HTTP, and older than most readers of this book. It is also, structurally, the most fragile critical piece of the modern internet — and the one most often abused.

Before DNS, the ARPANET kept a single text file called HOSTS.TXT at the Stanford Research Institute. Every machine on the network downloaded it periodically; it listed every other machine's name and IP. By the early 1980s the file had grown to thousands of entries, was being edited by hand, and was distributed by FTP. A typo at SRI could break name resolution for the entire network. Mockapetris's design replaced this with a tree: the responsibility for each piece of the name space is delegated to a different organisation, and each organisation runs the servers authoritative for its own piece. The translation of www.example.com is not done by one machine; it is done by a chain of them, each pointing the asker one step closer to the answer.

The hierarchy

Names in DNS are read right to left. The rightmost label is closer to the root of the tree; the leftmost is the leaf. A trailing dot — almost always omitted in writing — represents the root itself. So www.example.com. means: the root, then the com top-level domain, then example registered inside com, then a host called www inside example.com. Each level of the tree is run by different operators. The root is run jointly by twelve organisations (VeriSign, ICANN, university and government bodies) operating thirteen logically named servers (a.root-servers.net through m.root-servers.net) replicated across hundreds of physical locations worldwide via anycast. The com servers are run by VeriSign on contract with ICANN. The example.com servers are run by whoever owns example.com — a hosting company, a corporate IT department, a CDN.

Fig 11.5 — The DNS namespace · a tree, read right to left · every dot in a domain name is a hand-off to a different operator

The whole DNS namespace is one tree, roughly 360 million registered names wide at the second level. The root is run by twelve organisations cooperatively. Each top-level domain is run by an operator chosen by ICANN — VeriSign for .com and .net, Public Interest Registry for .org, a national authority for each country code. Each registered domain is run by whoever bought it. Each subdomain is run by them too, or by a CDN they delegated to. There is no single point of authority below the root. There is also no consensus mechanism — each operator is trusted to answer correctly for their own subtree. The system works only because each operator wants their own subtree to keep working.

How a name becomes an address

The mechanics are straightforward, and almost always cached. When your laptop wants to know example.com's IP, it asks its configured resolver — usually your home router, your ISP, or a public service like Cloudflare's 1.1.1.1 or Google's 8.8.8.8. The resolver may already have the answer in its cache; if so, it returns it in under a millisecond. If not, the resolver does the actual work, called recursion. It asks one of the root servers for example.com; the root replies, "I don't know, but the com servers do — here are their IP addresses." The resolver asks one of the com servers; com replies, "I don't know, but the example.com servers do — here are theirs." The resolver asks one of those; that server is authoritative for example.com and returns the actual IP. The resolver caches the answer for the duration specified by the authoritative server's TTL (typically 5 minutes to 24 hours), then returns it to the laptop. Total time: usually under 50 milliseconds the first time; under a millisecond on every subsequent lookup until the TTL expires.
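The whole transaction is a few lines from Node's built-in dns module — a sketch pointing at Cloudflare's public resolver, which performs the recursive walk and the caching described above (run as an ES module for the top-level await):

import { Resolver } from 'node:dns/promises';

const resolver = new Resolver();
resolver.setServers(['1.1.1.1']); // this resolver does the root → TLD → authoritative walk

// { ttl: true } returns the remaining cache lifetime alongside each address.
const answers = await resolver.resolve4('example.com', { ttl: true });
console.log(answers); // e.g. [ { address: '93.184.216.34', ttl: 3600 } ]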

Fig 11.6 — Recursive resolution · four servers, four questions, one answer · cold cache about 50 ms, then ~0 ms until the TTL expires

The resolver does the recursion; the laptop just asks once and gets the answer. The four-stage walk happens only on a cache miss; with reasonable TTLs and a busy resolver, hit rates are typically 80–99%. The whole protocol fits in tiny UDP datagrams — query and reply are usually under 100 bytes — which is why DNS is fast enough to feel free, and also why it has security problems: UDP makes it cheap to spoof.

Where the trust breaks

DNS was designed in 1983, before adversarial thinking became standard in network protocol design. The resolver believes whatever the authoritative server told it, signed by nothing. The query goes out as a UDP packet with a 16-bit transaction ID; whichever server replies first with the matching ID and a plausible answer is believed. If an attacker can guess the ID and beat the real reply to the resolver, the resolver caches the attacker's lie — and serves it to every user behind the resolver until the TTL expires. This is cache poisoning, and it has been known in principle since the 1990s. In 2008, security researcher Dan Kaminsky discovered a practical, fast variant that worked against essentially every DNS resolver in deployment. He disclosed it privately to vendors first; Microsoft, Cisco, ISC (the maintainers of BIND), and dozens of others released coordinated patches on a single day in July 2008. The patch did not fix the underlying weakness — UDP queries are still spoofable in principle — but added source port randomisation, multiplying the attacker's guessing space from 2¹⁶ to 2³² and pushing the attack from "a few minutes" to "thousands of years."

Fig 11.7 — Cache poisoning · the attacker races the real answer · DNS believes whoever replies first with a matching transaction ID

DNS cache poisoning in one picture. The resolver sends a query out; the attacker, who has guessed (or been told) a transaction ID, fires forged replies at the resolver from a spoofed source IP. If a forged reply with a matching ID arrives before the real one, the resolver caches the lie and serves it to every user behind it for the TTL duration — typically hours. Kaminsky's 2008 disclosure showed how to make this practical against unpatched resolvers in minutes. The fix — randomising the source port as well as the transaction ID — multiplied the attacker's search space ~65,000-fold and pushed the practical attack from minutes to centuries. DNSSEC is the deeper fix: cryptographically sign every record so resolvers can verify authenticity, not just guess less. DNSSEC has been deployed for two decades and still covers under half the global namespace, because the deployment cost is real and the perceived risk has fallen.

🛡️

The 2016 Dyn attack. On 21 October 2016, the Mirai botnet (~100,000 compromised IoT devices) directed sustained UDP floods at Dyn, a major DNS provider that hosted authoritative servers for Twitter, Netflix, Reddit, GitHub, Spotify, and dozens of others. For most of the morning on the US East Coast, those sites were unreachable — not because their own servers were down, but because nobody could resolve their names. DNS is a centralised dependency for half the internet's user-visible names. Take down a major DNS operator and a thousand sites go dark together. The attack ended when Dyn engineers manually reconfigured anycast routing to absorb the load; aftershocks ran for a week. The lesson — that even a "decentralised" tree has heavy concentration points — drove the modern push toward redundant DNS providers (multiple authoritative NS records pointing at independent operators) and the long, slow rollout of DNS-over-HTTPS, which puts the hop between your laptop and resolver out of an attacker's reach.

04 — Cryptography intro

The mathematics that makes the rest of the chapter possible.

Up to here, every protocol we have built — IP, TCP, HTTP, DNS — operates in cleartext. Anyone with access to any wire between you and the server can read every byte you send and every byte you receive, and can forge bytes pretending to be either side. This was acceptable when the internet was a few thousand academics. It is not acceptable when the internet is your bank account and your medical records. Solving this requires cryptography — and the next section, on TLS, assumes a working understanding of three primitives: hashing, symmetric encryption, and public-key cryptography. This section is a fast briefing. Chapter 14 takes them apart in mathematical depth. For now we want enough to read the TLS handshake.

One-way functions: the mathematical asymmetry that makes everything possible

Modern cryptography rests on operations that are easy to do in one direction and computationally infeasible to undo. The canonical example: multiplication. Take two large prime numbers, say 200-digit ones, and multiply them. A laptop does this in microseconds. Now hand someone the product alone — without telling them the primes — and ask them to find the factors. There is no fast algorithm. The best known methods take longer than the age of the universe for primes of that size. The asymmetry between "easy forward" and "hard backward" is the single mathematical lever that all of public-key cryptography pulls on.
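The asymmetry is easy to feel from any JavaScript console. A sketch with BigInt, using two small, well-known Mersenne primes standing in for the 200-digit ones:

// Forward direction: instant, even for numbers far larger than these.
const p = 2n ** 127n - 1n; // a 39-digit Mersenne prime
const q = 2n ** 89n - 1n;  // a 27-digit Mersenne prime
const n = p * q;           // microseconds
console.log(n.toString().length); // 66 — a 66-digit product

// Backward direction: given only n, recovering p and q has no known fast
// algorithm. At real key sizes (2048-bit n) the best classical attack, the
// General Number Field Sieve, needs longer than the age of the universe.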

Fig 11.8 — One-way function · easy forward, infeasible backward · multiplication is easy; factoring is not

The whole of RSA encryption hangs from one observation: if you choose two large primes and multiply them, anyone can use the product (the public key) to encrypt a message — but only someone who knows the original primes (the private key) can decrypt it. The privacy is not protected by secrecy of the algorithm; the algorithm is published. It is protected by the computational gap between forward and reverse. Quantum computers, in principle, can close this specific gap (Shor's algorithm, 1994); this is why the post-quantum cryptography effort matters. Chapter 14 unpacks the actual modular arithmetic; here, the picture is enough.

Hashing: the digital fingerprint

A cryptographic hash function is the simplest one-way operation. Feed in any data — a paragraph, a movie, a database — and the function returns a fixed-length string of bits (256 bits for SHA-256). Three properties matter: determinism (the same input always produces the same hash), preimage resistance (given the hash, you cannot practically find any input that would produce it), and the avalanche effect (a single-bit change in the input produces a completely different output, with no predictable relationship). The hash is, in effect, a fingerprint of the data — uniquely identifying it without revealing it. SHA-256 in particular is the workhorse of modern systems: it is what Bitcoin uses, what certificate authorities use, and what every TLS handshake uses; Git does the same job with its older cousin SHA-1 to identify every commit.

Fig 11.9 — SHA-256 avalanche · one bit changed, all 256 bits scrambled · flip one bit of input and expect ~50% of the output bits to flip, every time

The avalanche property is what makes SHA-256 useful for tamper detection. Two inputs that differ by a single bit produce hashes with no recognisable relationship — there is no algebraic shortcut, no "hash patch" you could add to one to get the other. If you publish the hash of a file alongside the file, anyone who downloads it can re-hash and compare; any modification, even a single bit corrupted in transit or substituted by an attacker, produces a completely different hash. The same property powers Bitcoin's mining (find an input whose hash starts with N zeros), Git's commit IDs (a hash — SHA-1 in Git's case — of the tree plus the parent commit plus the commit message), and the Merkle trees that Certificate Transparency logs use to bind one signed root to millions of issued certificates. Cryptographic hashes are the universal "this is exactly what I sent" check.
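The avalanche effect is a one-liner to watch with Node's built-in crypto module — two sentences differing by a single letter:

import { createHash } from 'node:crypto';

const sha256 = (s) => createHash('sha256').update(s).digest('hex');

console.log(sha256('The quick brown fox jumps over the lazy dog'));
// d7a8fbb307d7809469ca9abcb0082e4f8d5651e46d3cdb762d02d0bf37c9e592
console.log(sha256('The quick brown fox jumps over the lazy cog'));
// one letter changed → a completely unrelated 64-character digest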

Symmetric vs asymmetric: who has the key

Encryption comes in two flavours, and TLS uses both. Symmetric encryption uses a single shared secret key: both parties have it, both can encrypt and decrypt with it, anyone who gets a copy can read everything. The flagship algorithm is AES (Advanced Encryption Standard, NIST 2001), which encrypts 128-bit blocks using a 128-, 192-, or 256-bit key in 10, 12, or 14 rounds of byte-level mixing, depending on key size. AES is fast — your CPU has hardware instructions for it (AES-NI on Intel/AMD, equivalent on ARM); a modern laptop encrypts gigabytes per second per core. The hard part is not the encryption itself; it is getting the shared key into both parties' hands without anyone watching.

Asymmetric (public-key) encryption solves exactly that bootstrap problem. Each party has a key pair: a public key they share with the world, and a private key they keep secret. Data encrypted with the public key can only be decrypted with the matching private key. Two strangers who have never met can establish a shared secret over a public channel: send each other their public keys, do a small mathematical dance (Diffie-Hellman, 1976), and both end up knowing a shared value that no eavesdropper can derive even with full transcripts. Public-key operations are slow — RSA is a thousand times slower than AES for the same data volume, ECDH is faster but still slower — so in practice we use public-key cryptography only to agree on a symmetric key, and then encrypt the actual conversation with AES. This is the core insight that makes TLS practical.

Fig 11.10 — Symmetric vs asymmetric · one shared key, or two halves of one secret · one is fast, the other solves the bootstrap problem

The two flavours have complementary strengths. Symmetric encryption is fast; everyone needs the same key. Asymmetric encryption solves the key-distribution problem; it is much too slow to encrypt a video call. The trick that powers TLS — and SSH, and Signal, and PGP, and essentially every encrypted protocol shipped since the 1990s — is to use the slow asymmetric primitive only to agree on a shared symmetric key, and then run the bulk of the conversation through fast symmetric encryption with that key. The next section traces this exact dance through a real TLS 1.3 handshake.
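Node's crypto module can run the whole dance in a dozen lines — a sketch with both "parties" in one process for brevity: ephemeral X25519 key agreement, HKDF to turn the shared secret into an AES key, then AES-256-GCM for the bulk data.

import { generateKeyPairSync, diffieHellman, hkdfSync, randomBytes, createCipheriv } from 'node:crypto';

// Each side generates an ephemeral key pair and sends the other its public half.
const alice = generateKeyPairSync('x25519');
const bob   = generateKeyPairSync('x25519');

// Both compute the same shared secret; an eavesdropper holding both public keys cannot.
const secret = diffieHellman({ privateKey: alice.privateKey, publicKey: bob.publicKey });
const check  = diffieHellman({ privateKey: bob.privateKey, publicKey: alice.publicKey });
console.log(secret.equals(check)); // true

// The slow asymmetric part is done. Derive a symmetric key and encrypt the bulk data fast.
const key = Buffer.from(hkdfSync('sha256', secret, Buffer.alloc(32), 'demo', 32));
const iv = randomBytes(12);
const cipher = createCipheriv('aes-256-gcm', key, iv);
const ciphertext = Buffer.concat([cipher.update('the actual conversation'), cipher.final()]);
const tag = cipher.getAuthTag(); // authenticity check, as AES-GCM provides in TLS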

05 — TLS

One round trip · two strangers · a perfect channel.

TLS — Transport Layer Security — is the protocol that turns the cleartext byte stream of TCP into an encrypted, authenticated one. It is what the lock icon in your browser refers to. It is the protocol that runs underneath HTTPS, IMAPS, SMTPS, and roughly every "S"-suffixed protocol since the late 1990s. Its lineage starts with SSL 1.0 (Netscape, 1994, never released because it was discovered to be broken before launch), through SSL 2.0 (1995, broken), SSL 3.0 (1996, eventually broken via POODLE in 2014), then renamed TLS 1.0 (1999), TLS 1.1, TLS 1.2 (2008, dominant for a decade), and finally TLS 1.3 (RFC 8446, August 2018) — the version your browser uses today, the one that is actually clean.

The protocol's job is simple to state: two parties who have never met should end up sharing a secret key, mutually authenticated, with full confidence that no third party listening to or manipulating the conversation can read or alter the result. The mechanism is breathtakingly compact in TLS 1.3 — one round trip, not the two of TLS 1.2 — and combines every primitive from §04: a key exchange to bootstrap a shared secret, a digital signature over a certificate chain to authenticate the server, and symmetric encryption to actually carry the data once the handshake is done. The whole thing finishes in about 20 milliseconds on a continental link.

The TLS 1.3 handshake

Fig 11.11 — TLS 1.3 · one round trip from "hello" to encrypted data · RFC 8446

① ClientHello — supported ciphers, supported groups, client_random, key_share (an ephemeral X25519 public key). "Here is half of a Diffie-Hellman, plus what I can do."
② ServerHello + {EncryptedExtensions, Certificate, CertificateVerify, Finished} — server_random, the server's ephemeral key_share, the chosen cipher_suite, the X.509 certificate chain, and a signature over the handshake transcript made with the server's private key. "Here is my half. Verify my certificate chain. The rest is encrypted." Both sides can now compute the same shared key — ECDH(client_priv, server_pub) = ECDH(server_priv, client_pub) — and derive the AES keys and IVs via HKDF.
③ {Finished} — a MAC over the entire handshake transcript, encrypted with the new keys; proves the client derived them too.
④ Application data — HTTP/1.1 GET / and the rest of the conversation, encrypted and authenticated with AES-GCM, ChaCha20-Poly1305, etc.

The whole TLS 1.3 handshake: one round trip. The client sends a hello with half of an ephemeral Diffie-Hellman key exchange (an X25519 public key) plus the list of cipher suites it supports. The server replies with the other half plus its X.509 certificate plus a signature, with most of that already encrypted under the freshly derived shared key — possible because both sides can compute the shared key as soon as they see each other's public halves. The client verifies the certificate against its trusted root store (Fig 11.12), checks the signature, and replies Finished. From this moment forward, every byte is encrypted with AES-GCM (or ChaCha20 on phones without AES hardware) and authenticated against the agreed key. TLS 1.2 needed two round trips to do the same job; TLS 1.3 saved one round trip by being smarter about message ordering.
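The negotiation result is directly observable from Node (a sketch; example.com stands in for any modern host):

import { connect } from 'node:tls';

const socket = connect({ host: 'example.com', port: 443, servername: 'example.com' }, () => {
  console.log(socket.getProtocol());    // e.g. 'TLSv1.3'
  console.log(socket.getCipher().name); // e.g. 'TLS_AES_256_GCM_SHA384'
  console.log(socket.authorized);       // true — the certificate chain verified
  socket.end();
});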

The certificate chain

The handshake includes a step the diagram glossed over: the client "verifies the certificate." What does that actually mean? The server's certificate is a small file containing the server's public key, the hostnames it claims to serve, validity dates, and — critically — a digital signature by some certificate authority the client already trusts. The certificate authority's own certificate is signed by another, more trusted authority. That one is signed by another. The chain ends at a root certificate — a self-signed certificate that the client trusts because it was shipped with the operating system or browser. There are roughly 150 such roots in the major stores; they are run by companies like DigiCert, Let's Encrypt, GlobalSign, Sectigo, and Google. The whole system rests on the trust placed in those ~150 organisations.

Fig 11.12 — The certificate chain · why your browser trusts a stranger · three signatures, walked from leaf to root, must all verify. Failure modes — hostname mismatch, expiry, invalid signature, root not in the trust store, revocation (CRL/OCSP) — all end in a browser warning page.

A web server presents not one certificate but a chain. The leaf certificate is for the actual hostname (example.com); it is signed by an intermediate CA's private key. The intermediate's certificate is signed by a root CA's private key. The root is self-signed and shipped in your browser's trust store. The client walks the chain from leaf to root, verifying every signature; if any link fails, the connection is aborted with a scary browser warning. The whole system rests on ~150 root operators behaving themselves — and on the Certificate Transparency infrastructure (mandatory since 2018) that publicly logs every certificate issued, so misbehaviour becomes detectable. The 2011 DigiNotar compromise — a Dutch CA whose private keys were stolen and used to issue fraudulent *.google.com certificates — destroyed the company within weeks and was the event that drove modern transparency requirements.
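The walk itself is visible from Node: getPeerCertificate(true) returns the leaf with a linked issuerCertificate chain. A sketch, covering the chain the server actually sent:

import { connect } from 'node:tls';

const socket = connect({ host: 'example.com', port: 443, servername: 'example.com' }, () => {
  let cert = socket.getPeerCertificate(true); // true → include the full chain
  while (cert && Object.keys(cert).length > 0) {
    console.log(`${cert.subject.CN} ← signed by ${cert.issuer.CN}`);
    if (cert.issuerCertificate === cert) break; // self-signed root: issuer is itself
    cert = cert.issuerCertificate;
  }
  socket.end();
});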

Forward secrecy: protecting yesterday's data

One subtlety in the TLS 1.3 design deserves explicit attention. The key exchange uses ephemeral Diffie-Hellman keys — fresh random values generated at the start of each connection, never reused, discarded the moment the connection ends. This property is called forward secrecy, and it has a profound consequence: even if an attacker records every encrypted byte of every TLS connection your server has ever served, and then years later steals your server's private key, they still cannot decrypt any of those past connections. The private key authenticated the server during the handshake, but it did not derive the session key. The session key was derived from ephemeral material that no longer exists anywhere.

Fig 11.13 — Forward secrecy · stealing the long-term key tomorrow does not unlock yesterday · a recorded session, decades later, is still unreadable

Forward secrecy is the property that recorded encrypted traffic stays encrypted even if the long-term server key is later compromised. The server's private key signs the handshake to authenticate; it does not decrypt the data. Decryption depends on the ephemeral keys, which are generated fresh per session and discarded after. An attacker recording all your bank's TLS traffic for ten years and then stealing the bank's private key gets nothing — every session's keys are gone. This property is mandatory in TLS 1.3 (and was optional but common in TLS 1.2). One caveat: forward secrecy protects against key theft, not against cryptanalysis. A future quantum computer running Shor's algorithm could break the recorded ephemeral key exchange itself and re-derive the session keys — which is why "store now, decrypt later" archiving of TLS traffic is taken seriously, and why post-quantum key exchanges are now being folded into the TLS handshake.

06 — HTTP/2 & HTTP/3

The two upgrades that broke compatibility on purpose.

HTTP/1.1 from 1999 was good enough for nearly two decades. Then mobile networks happened, JavaScript-heavy sites happened, the average web page grew from 50 KB to 3 MB, and the protocol's serial-request model started costing real time. Two new versions followed — HTTP/2 in 2015 and HTTP/3 in 2022 — each addressing a specific bottleneck the previous one couldn't. They are not "new HTTPs"; they are new wire formats for the same request-response semantics. The headers, methods, and status codes you saw in §02 are unchanged. What changes underneath is how those messages are serialised onto the network, multiplexed, and transported.

HTTP/2: binary, multiplexed, on the same TCP connection

The headline problem with HTTP/1.1 was head-of-line blocking at the application layer. A browser opening a typical page needed to fetch dozens of resources — HTML, then 10 stylesheets, then 30 JavaScript files, then 50 images. HTTP/1.1 sent these one at a time down a single TCP connection (with at most six parallel connections per host as a workaround). A slow response held up everything queued behind it. HTTP/2 (RFC 7540, 2015 — derived from Google's SPDY) replaced the text-based wire format with a binary one and introduced streams: many independent request/response pairs interleaved on the same TCP connection, each chopped into binary frames tagged with a stream ID. The server can deliver frame 4 of stream A, then frame 1 of stream B, then frame 5 of stream A, in any order — and the client reassembles each stream from its frames. Sixty parallel requests now share one connection, with no head-of-line blocking at the HTTP layer.

Fig 11.14 — HTTP/1.1 vs HTTP/2 · serial vs multiplexed on the same TCP connection · six requests, one connection — two ways to schedule them

The same six resources, fetched two different ways. HTTP/1.1 sends one request, waits for the full response, then sends the next; a slow resource holds up everything queued behind it. HTTP/2 chops every request and response into binary frames tagged with a stream ID and interleaves them on a single TCP connection — small responses can fly past slow ones, and the browser reassembles each stream as its frames arrive. Same TCP, same TLS, same HTTP semantics — different wire format. Real-world page-load improvements range from 10% to 50% depending on resource shape. Server push (the protocol's other headline feature) was added and then quietly abandoned — Chrome removed support in 2022 — because it never delivered the predicted benefits.
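Node's http2 module shows the multiplexing directly — several requests as independent streams on one session (a sketch; example.com and the paths are stand-ins):

import { connect } from 'node:http2';

const session = connect('https://example.com');
const paths = ['/', '/style.css', '/app.js']; // hypothetical resources
let pending = paths.length;

for (const path of paths) {
  const stream = session.request({ ':path': path }); // one stream each, one TCP connection
  stream.on('response', (headers) => console.log(path, headers[':status']));
  stream.resume(); // drain the body
  stream.on('close', () => { if (--pending === 0) session.close(); });
}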

HTTP/3: still TCP's fault

HTTP/2 solved head-of-line blocking at the HTTP layer. It did not solve it at the TCP layer. TCP delivers a single ordered byte stream; if one packet is dropped, every byte after it is held back until the missing packet is retransmitted — even if those later bytes belong to entirely different HTTP/2 streams that have no logical dependency on the missing data. On a clean network the cost is invisible. On a flaky mobile connection — 5% packet loss, intermittent coverage — the TCP-layer head-of-line blocking dominates page load time. The fix had to happen below HTTP, at the transport layer itself.

QUIC (Quick UDP Internet Connections, RFC 9000, 2021) replaces TCP entirely for HTTP traffic. It runs on top of UDP — that "fire and forget" protocol from §01 of Chapter 10 — and reimplements everything TCP did (reliable ordered delivery, flow control, congestion control, retransmission) plus everything TLS did (encryption, authentication) plus the multiplexing of HTTP/2 — but with one crucial change: each HTTP/2-style stream is independently ordered, so a lost packet on stream A no longer holds back stream B. QUIC was developed by Google starting around 2012, deployed inside Chrome and to Google's servers around 2016, and standardised by the IETF in 2021. HTTP/3 (RFC 9114, 2022) is just HTTP semantics over QUIC. As of 2026 it handles a third of all web traffic — almost everything to and from Google, Cloudflare, and Meta — and is rapidly catching up to HTTP/2 everywhere else.

Fig 11.15 — HTTP/3 over QUIC · escaping TCP at the protocol layer · three layers of the legacy stack collapse into one, built on UDP

HTTP/3 is what you get when you take HTTP/2's good ideas and refuse to inherit TCP's bad ones. The three independent layers of HTTP/2 (HTTP, TLS, TCP) collapse into one (HTTP/3 over QUIC), which itself runs on UDP because UDP, in §01 of Chapter 10, was the protocol that didn't get in the way. The handshake is one round trip in the cold case, zero in the resumed case. Each stream has its own ordering, so packet loss on one image's bytes doesn't stall a different image's bytes. The catch — the reason this transition has taken a decade — is that QUIC is implemented in user space rather than the kernel, every server and CDN had to rewrite its transport layer, and corporate firewalls had to learn to allow UDP/443. The transition is happening, slowly, the way the IPv6 migration in Chapter 9 happens: not with a flag day, but with a steady year-on-year shift of the largest operators dragging the rest of the network behind them.
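Node has no stable QUIC client yet, but the discovery mechanism is easy to observe: servers advertise HTTP/3 in the Alt-Svc response header, and browsers upgrade their next request to QUIC on UDP/443. A two-line sketch with the built-in fetch (run as an ES module; cloudflare.com is just a convenient host that speaks h3):

const res = await fetch('https://cloudflare.com/');
console.log(res.headers.get('alt-svc')); // e.g. h3=":443"; ma=86400 — "I also speak HTTP/3"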

🔁

The recurring pattern. Every protocol in this chapter follows the same arc. Berners-Lee's HTTP — minimal, text-based, perfect for 1991 — got progressively pushed past its limits and replaced with denser, harder, more cryptographic versions. DNS — designed in 1983 with no adversarial model — got progressively patched with port randomisation, DNSSEC, DoH, DoT. TLS — born as Netscape's SSL 1.0, never released because broken — went through six numbered versions before reaching the clean 1.3 most browsers now use. None of these protocols was correctly designed at first issue; all of them earned their current shape from twenty-plus years of attacks and patches. This is especially true of network protocols, because their designers cannot iterate quickly: every change has to be deployed across millions of independently operated machines without breaking the existing ones. The slow grace of protocol evolution is one of the more remarkable forms of engineering humans have ever managed at planetary scale.

The seam to Chapter 12

Chapter 11 has built the web's transport. We can now send a typed URL through a verified, encrypted tunnel; we can negotiate a fresh shared key with a stranger every time; we can name and find any machine on Earth by composing a few labels. What we have not yet done is run code inside the document we just fetched. HTML describes a structure of text and images. The page is alive only because something else — a programming language baked into every browser — animates it. That language is JavaScript. It was invented in ten days. It runs everywhere. It powers the most popular development platform in computing. And it never should have worked. Chapter 12 is its story.

Chapter 12

JavaScript
The Language That
Shouldn't Have Worked

Brendan Eich was given ten working days, in May 1995, to design a scripting language for the Netscape browser. The result — committed to a release branch before it had a stable name and shipped before it had a specification — is now the most widely deployed programming language in history. It runs in every browser. It runs on servers via Node.js. It runs in every database that speaks JSON. The fact that this happened is a triumph of practical engineering over good taste, and the fact that it works is a series of architectural decisions made under deadline that turned out to be unexpectedly durable.

Topics Event loop · Promises · V8 · Node · DOM · same-origin · XSS · CSP
Era covered 1995 → present
Chapter 12 hero · JavaScript — The Language That Shouldn't Have Worked · Mocha, May 1995 · LiveScript, Sep 1995 · JavaScript, Dec 1995 — three names, one language, all of the web
01 — The browser problem

HTML can describe; it cannot do.

By 1994 the web that Berners-Lee had launched in Chapter 11 was growing exponentially — but it was growing as a system of static documents. A web page in 1994 was an HTML file: text, images, links, perhaps a form that POSTed somewhere. Click a link, the browser fetched a new page; submit a form, the server replied with another page. Every interaction was a full round trip and a full re-render. For a research-paper repository this was fine. For anything closer to an application — a calculator, a stock quote that updates, validation of a form before submission, a clock — it was hopeless. The browser needed to be able to run code.

The first attempt was Sun Microsystems' Java applet. In late 1994 Sun announced HotJava, a browser written in Java that could embed small Java programs (applets) inside web pages. Each applet ran inside a Java Virtual Machine, isolated from the host, with a defined API for drawing and user input. Conceptually it was the right answer — a real, type-safe, sandboxed language with a proper runtime. In practice it was hopeless for the early web. The JVM took five to ten seconds to start. Applets were heavyweight, hard to write in small pieces, and required the server to host compiled .class files. Every applet was a separate island; you couldn't easily reach into the surrounding HTML page from inside Java, or vice versa. Applets shipped, were used for games and animation, and slowly died: by 2000 they were a niche, by 2010 obsolete, by 2017 removed from browsers entirely.

Netscape — the company whose browser had become the market-share leader through 1994 and 1995 — wanted something different. Marc Andreessen, Netscape's twenty-three-year-old co-founder, believed the web's killer feature would be the ability to write quick scripts that lived inside the page itself — small, untyped, easily embedded snippets that any HTML author could add without compilers, without classpaths, without ten seconds of JVM warm-up. The language should look enough like Java that the buzzword "Java" could be used in marketing, but should be far simpler. It should run instantly. It should fail loudly but not crash the browser. It should be easy enough that a designer who had learned HTML over a weekend could pick it up. Netscape needed it shipped in the upcoming Netscape 2.0 release. The release window was eleven weeks away.

Fig 12.1 — The web in 1994 · static documents and a tab-out to Java · two ways to make a page do something — neither was working

By 1994 the two paths to interactive web pages had failed. Pure HTML could describe a form but every interaction round-tripped to the server. Java applets could run real code but they took ten seconds to warm up, were hard to embed, and lived in a sealed island that couldn't easily talk to the surrounding page. Netscape's bet was that the right answer was something a designer could write in five lines, embedded directly between <script> tags, that ran the moment the page loaded. They needed the language designed and shipped in eleven weeks. They hired Brendan Eich.

02 — Eich 1995

Ten days. Three names. One language.

Brendan Eich joined Netscape in April 1995. He was 33, a programming language enthusiast, and had been promised — when the recruiters pitched him — that he would get to work on bringing Scheme (a Lisp dialect) to the Netscape browser. What he got instead, on his first week, was a different brief: Marc Andreessen and Bill Joy (Sun co-founder, brought into the Netscape-Sun partnership) wanted a language with C-like syntax — to look familiar to working programmers — but with the dynamic, interpreted, untyped flexibility of Scheme or Self underneath. They wanted it to embed inside HTML. They wanted it to ship with Netscape 2.0, currently scheduled for September. Eich had ten working days to produce a prototype.

He delivered. The first version, internally called Mocha, was a working interpreter by mid-May 1995. The language Eich actually designed was — under the C-like syntax — a deeply Scheme-influenced lexically-scoped language with first-class functions, closures, and prototypal inheritance taken from David Ungar and Randall Smith's Self. It had no classes (those came twenty years later, as sugar). It had no integers (only floating-point numbers, IEEE 754 doubles all the way). It had implicit type coercion that anyone who has used JavaScript has cursed. It was a pragmatic compromise: genuine elegance underneath surprising syntactic awkwardness — the shape you would expect of a language born in ten days under the wrong brief.

Then came the naming. Mocha was the internal name. In September 1995, shortly before launch, Netscape renamed it LiveScript. Then, in December 1995, with great fanfare, Netscape announced a partnership with Sun Microsystems and renamed it again, to JavaScript. The name change was pure marketing: Netscape wanted to ride Sun's Java publicity. The two languages were and remain unrelated — JavaScript shares almost nothing with Java besides the C-derived syntax that hundreds of languages share — but the name stuck, and three decades later it is the dominant programming language by raw deployment count and the source of endless confusion for first-time learners. Eich himself has said publicly that the name was the worst part of the whole project.

Fig 12.2 — May 1995 to ECMAScript 2026 · the slow legitimisation · a marketing rename, a standardisation truce, and three decades of catching up

JavaScript's first decade was a slow walk from "thrown together for a release deadline" to "actually standardised." ECMA International became the neutral standards body in 1997 because Netscape did not own the trademark "Java" (Sun did) and the language needed a non-Netscape home. Microsoft shipped a near-clone called JScript in Internet Explorer; the standardisation kept the two compatible enough that web pages worked in both. The really transformative version was ES6 in 2015 — twenty years after the original — which finally added classes, modules, native Promises, and arrow functions. Since 2015 the language has shipped a yearly small release; the days of waiting six years for the next ECMAScript are over.

"I had to be done in ten days or something worse than JS would have happened."

— Brendan Eich, on the early Netscape design
⏱️

The ten-day legacy. Several JavaScript quirks trace directly to the original ten-day window. typeof null === "object" is a bug that became compatibility-locked: Eich described it as a "leftover" he never got to fix. 0.1 + 0.2 !== 0.3 is just IEEE 754 (Chapter 2.6), but the choice to have no integer type at all — every number is a 64-bit float — meant JavaScript could never expose the difference. Implicit coercion ([] + {} === "[object Object]") is a pragmatic shortcut that did not get a sober second look. None of these is unfixable in principle; all of them are unfixable in practice, because billions of pages on the live web rely on the exact existing behaviour. The shape JavaScript will have in fifty years is the shape it has today, plus careful additions; it cannot remove anything.
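Each of these is reproducible in any console:

console.log(typeof null);       // "object" — the ten-day leftover, compatibility-locked
console.log(0.1 + 0.2);         // 0.30000000000000004 — every number is an IEEE 754 double
console.log(0.1 + 0.2 === 0.3); // false
console.log([] + {});           // "[object Object]" — implicit coercion at work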

03 — The event loop

One thread. Never blocking. Forever in motion.

JavaScript runs on a single thread. There is one call stack, one execution context. There is no fork(), no pthread_create(), no parallel Java-style worker threads sharing variables. This sounds like a fatal weakness — and would be, if JavaScript ever blocked. It does not. Every operation that might take time — fetching a URL, reading a file, waiting for a timer — returns immediately and arranges to be told when the work is done. The thread, freed from waiting, picks up the next ready callback and runs it. The mechanism that orchestrates this is the event loop. It is the architectural choice that makes JavaScript work for browsers, and it is the choice that Node.js exported back to the server world.

The model has four moving parts. The call stack holds the function frames currently executing — the same stack Chapter 3 dissected, in JavaScript form. The macrotask queue (also called the callback queue) holds callbacks ready to run from completed I/O, expired timers, and DOM events. The microtask queue holds callbacks from settled Promises and from queueMicrotask(); it is drained completely after every macrotask, before any rendering. And the render step is when the browser, between macrotasks, recomputes layout and paints any visual changes. The event loop's job is to coordinate them: pull one task off the macrotask queue, run it to completion, drain all microtasks it produces, optionally render, repeat. Forever.

Fig 12.3 — The event loop · stack, microtasks, macrotasks, render · one thread, three queues, ~60 turns per second

while (true) {
  task = macrotaskQueue.shift();  // one macrotask per iteration
  run(task);                      // to completion, on the call stack
  drainMicrotasks();              // every Promise.then, every queueMicrotask
  maybeRender();                  // layout · paint · composite, ~16 ms cadence
}

The whole algorithm. Pull one macrotask off the queue (a click handler, a timer, a network response). Run it on the call stack. When it returns, drain the microtask queue completely — every .then, every awaited promise resolution. Then let the browser render if it's been ~16 ms since the last frame. Repeat. The single thread never blocks because nothing on the queues blocks; long-running computation (a heavy JSON.parse, a tight loop) does block the loop, which is why frozen UI in browsers and event-loop lag in Node.js are the same bug — the thread is stuck on the call stack instead of returning to the loop.
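The ordering rules are easy to verify — a four-line experiment whose output follows directly from Fig 12.3:

console.log('1: synchronous');                              // runs on the current stack
setTimeout(() => console.log('4: macrotask'), 0);           // queued for a later loop turn
Promise.resolve().then(() => console.log('3: microtask'));  // drained once the stack empties
console.log('2: synchronous');
// prints 1, 2, 3, 4 — stack first, then all microtasks, then the next macrotask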

Three eras of asynchrony

The event loop has been the same since 1995. What has changed three times is the way JavaScript code spells "do this when the I/O is done." The first era was callbacks: pass a function to whoever does the work, and it will call your function back when finished. Simple, and the source of the famous callback pyramid of doom — three or four callbacks nested inside each other, indentation marching across the screen, error handling impossible. The second era was Promises (standardised in ES6, 2015): an object representing a future value, with .then() and .catch() chained horizontally. The third era is async/await (ES2017): syntactic sugar that lets you write asynchronous code as if it were synchronous — const x = await fetch(url); — while the compiler invisibly transforms it into a Promise chain underneath. All three eras coexist in any modern codebase, because legacy never dies; understanding all three is one of the shibboleth skills of working in the language.

Fig 12.4 — The same fetch · three eras of async syntax · underneath, the event loop is identical — only the surface syntax has changed

ERA 1 — CALLBACKS (1995 →)
getUser(id, function (user) {
  getOrders(user, function (orders) {
    getInvoice(orders[0], function (invoice) {
      render(invoice); // pyramid of doom · errors lost
    });
  });
});

ERA 2 — PROMISES (ES6, 2015)
getUser(id)
  .then(user => getOrders(user))
  .then(orders => getInvoice(orders[0]))
  .then(render)
  .catch(err => alert(err)); // single shared error path

ERA 3 — ASYNC / AWAIT (ES2017)
async function showInvoice(id) {
  try {
    const user = await getUser(id);
    const orders = await getOrders(user);
    const invoice = await getInvoice(orders[0]);
    render(invoice);
  } catch (err) {
    alert(err);
  } // looks synchronous · isn't
}

Three ways to spell the same thing. All three compile, eventually, to operations on the macrotask and microtask queues from Fig 12.3 — async/await is sugar over Promises, Promises are sugar over callbacks, and callbacks were the bare metal. The underlying mechanism never changed; the language did. Most JavaScript a working developer touches today uses async/await for control flow, Promises for combinators (Promise.all, Promise.race), and old-style callbacks only at the lowest layers (DOM events, Node.js APIs that predate Promises). Reading older code or an Express middleware function still requires fluency in all three.

04 — V8 and Node.js

An interpreter that pretended to be a compiler · then a runtime that escaped the browser.

JavaScript was originally interpreted line by line by Netscape's SpiderMonkey engine — fast enough for the kind of small validation script and DOM manipulation people wrote in 1996, hopelessly slow for anything resembling an actual application. By 2008, when Google released V8 alongside the first Chrome browser, JavaScript engines had become aggressive optimising compilers masquerading as interpreters. V8's design — and the design of every major engine since — is a four-stage pipeline that takes source code to fast native machine code, with a feedback loop that re-optimises whichever functions turn out to be hot.

Fig 12.5 — V8 · from source to optimised native code, with a feedback loop · parse, interpret, profile, optimise — and deoptimise the moment a type assumption is violated

V8's two-tier architecture mirrors Java's HotSpot, Lua's LuaJIT, and most modern dynamic-language engines. The bytecode interpreter (Ignition) is fast to start up — code begins running immediately, no compilation pause — but slow per operation. The JIT compiler (TurboFan) is slow to compile but produces machine code competitive with C. The whole pipeline runs concurrently with the program: hot functions get optimised in a background thread, and the running program is patched mid-execution to point at the new optimised version. Speculative type assumptions ("this counter has always been an integer; assume integer") deliver most of the speedup; broken assumptions trigger deoptimisation, which throws the optimised code away and falls back to the bytecode interpreter. The whole machinery is invisible to user code — until you write a function that intermittently violates V8's expectations and notice it suddenly running at 1/10 speed.

Node.js: V8 outside the browser

In 2009 a developer named Ryan Dahl watched the progress bar on a file upload in a browser and realised the standard server-side approaches — Apache forking a process per request, blocking on disk I/O — were exactly the wrong shape for systems with many slow connections. The browsers had already solved this problem: single-threaded, event-loop, never-blocking. Why not move that model to the server? He took V8 — Google's then-new JavaScript engine — stripped away the browser, added a C library called libuv that provided a cross-platform asynchronous event loop and file/network I/O, and bound them together with a small JavaScript standard library. He called it Node.js and showed it at JSConf EU in Berlin that November. The community reaction was immediate.

Within five years Node had become the default backend for Silicon Valley startups. Within ten years it had taken a large share of the server-side landscape. The reasons are practical, not aesthetic: a single-threaded async-I/O model handles thousands of slow connections (the WebSocket connections of a chat app, the open HTTP requests of a real-time feed) on a single CPU core where a thread-per-request model would have collapsed; and sharing one language between browser and server cut the cognitive cost of building web applications roughly in half. Node is not the best server for everything (CPU-bound workloads are still better served by Go, Rust, or threaded Java), but for the sweet spot of "many connections, mostly waiting on I/O" it is hard to beat.

Fig 12.6 — Node.js · V8 + libuv + a JavaScript stdlib · three pieces glued together — one came from Chrome, one wraps the operating system

Node is small in concept and large in consequence. V8 (originally written for Chrome) executes the JavaScript. libuv (built for Node specifically, to add Windows support on top of the libev/libeio Unix codebase Dahl had been using) handles the OS-specific async I/O — epoll on Linux, kqueue on macOS/BSD, IOCP on Windows — wrapped in one common interface. Node's standard library glues the two together and exposes them to user code as JavaScript APIs. Everything else — Express, Next.js, npm's million packages — is just user code on top. The architecture is the reason JavaScript could become a server-side language in the first place: V8 was already fast and free, and libuv abstracted away the per-OS event-loop differences that would have otherwise made Node a per-platform port.
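The sweet spot in miniature — a server that can hold thousands of slow connections on one thread, because nothing in the handler blocks:

import { createServer } from 'node:http';

createServer((req, res) => {
  // Each request is just a callback on the event loop. While one response
  // waits on this timer (standing in for a slow database query), every
  // other connection proceeds on the same thread.
  setTimeout(() => {
    res.setHeader('Content-Type', 'application/json');
    res.end(JSON.stringify({ ok: true }));
  }, 100);
}).listen(3000);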

05 — The DOM and rendering

From bytes to pixels in five steps.

Loading a web page does not "show" the HTML. The browser, after fetching the bytes, runs them through a five-stage pipeline called the critical rendering path. The HTML is parsed into a tree (the DOM). The CSS is parsed into another tree (the CSSOM). The two are combined into a render tree describing what actually gets drawn. Layout computes where each box goes. Paint draws the pixels. Composite assembles layers. JavaScript can intervene at any stage — modifying the DOM, modifying the CSSOM, forcing a re-layout, triggering a repaint. Understanding this pipeline is the difference between a page that renders in 60 milliseconds and a page that stutters; understanding it is also necessary to read any modern frontend performance literature.

Fig 12.7 — Critical rendering path · five stages, ~16 ms per frame · parse, style, layout, paint, composite

Every frame the browser renders, it walks some prefix of this pipeline. Adding a new DOM element forces all five stages — parse, style, layout, paint, composite. Changing a CSS color forces paint and composite but skips layout. Animating transform: translateX() only touches composite — the GPU shifts an existing layer without re-laying-out anything. The performance gospel of the modern web ("animate transform and opacity, never width or height") falls out directly: cheap properties skip the expensive stages. Tools like the Chrome DevTools Performance panel let you watch this pipeline run frame by frame and see exactly which property change forced layout when.

The DOM itself is, mechanically, just a tree of objects exposed to JavaScript through methods like document.querySelector() and element.appendChild(). Modifying a DOM node triggers the relevant prefix of the pipeline; that is why naive "loop-and-append" code in JavaScript can be hundreds of times slower than building up a string and assigning to innerHTML once. React, Vue, Svelte, and the rest of the modern frontend framework family exist primarily to batch DOM changes — the programmer writes declarative rules ("the page should look like X"), the framework computes the minimum set of DOM mutations needed, and the browser pipeline runs once instead of fifty times. The DOM is slow only when you ask it to be.
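The batching idea in miniature — a sketch where list and items are assumed to exist on the page:

// Naive: N separate insertions, each invalidating the rendering pipeline.
for (const item of items) {
  const li = document.createElement('li');
  li.textContent = item;
  list.appendChild(li);
}

// Batched: build the subtree off-DOM, attach once — the pipeline runs once.
const fragment = document.createDocumentFragment();
for (const item of items) {
  const li = document.createElement('li');
  li.textContent = item;
  fragment.appendChild(li);
}
list.appendChild(fragment);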

06 — Web security

Four boundaries that keep the web from being one big shared computer.

The browser executes JavaScript from arbitrary websites with no pre-arranged trust. You visit news.example.com and it runs a hundred kilobytes of code on your machine. The same browser tab might be logged into your bank in another window. The same machine might have access to your filesystem, your camera, your microphone. The fact that visiting a webpage does not automatically compromise everything else you care about is not magic; it is four specific, layered defences invented over thirty years in response to attacks that worked. This section walks through them in the order they were added.

The same-origin policy: the foundational wall

The oldest and most important browser security boundary is the same-origin policy, introduced in Netscape 2.0 (1996) — the same release that shipped JavaScript itself. The rule is conceptually simple: code loaded from one origin can read and modify only resources from that same origin. An origin is the triple (scheme, host, port). Two URLs share an origin if and only if all three match exactly. https://a.example.com and https://b.example.com are different origins. http://example.com and https://example.com are different origins. http://example.com:80 and http://example.com:8080 are different origins. Without this rule, JavaScript on any tab could read the cookies and DOM of any other tab — including your bank's.

Fig 12.8 — Same-origin policy · scheme + host + port must all match · three components, all-or-nothing

Reference origin: https://www.example.com:443 (scheme · host · port)
https://www.example.com:443/page → ✓ same origin
http://www.example.com:443/page → ✗ scheme differs
https://api.example.com:443/page → ✗ host differs
https://www.example.com:8443/page → ✗ port differs
CORS — Cross-Origin Resource Sharing — is the explicit opt-in mechanism for cross-origin requests when both sides agree.

The same-origin policy is the boundary every other browser security mechanism builds on top of. JavaScript on https://news.com cannot make an XHR/fetch to https://bank.com and read the response. It cannot read the DOM of an iframe pointing at https://bank.com. It cannot read cookies set by https://bank.com. There are explicit, opt-in escape hatches — CORS for cross-origin XHRs (the server can send Access-Control-Allow-Origin headers), postMessage for cross-origin iframe messaging, JSONP (deprecated) for cross-origin script loading. Everything else is firmly fenced off. This is the rule that makes it safe to have multiple tabs open on different sites at the same time.
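The CORS opt-in is a single response header. A minimal sketch of a server at a hypothetical api.example.com that chooses to let one foreign origin read its responses:

import { createServer } from 'node:http';

createServer((req, res) => {
  // Without this header, the browser would fetch the bytes but refuse to
  // expose them to cross-origin JavaScript.
  res.setHeader('Access-Control-Allow-Origin', 'https://www.example.com');
  res.setHeader('Content-Type', 'application/json');
  res.end(JSON.stringify({ ok: true }));
}).listen(8080);

// In a page served from https://www.example.com:
//   const data = await fetch('https://api.example.com/data').then(r => r.json());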

Cross-site scripting (XSS): when the page itself is the attack

Same-origin policy says: code from origin A cannot touch origin B. But what if the attacker can inject their own code into origin B's page? Then their code is, by definition, same-origin with B — and can do everything B's legitimate code can do. This is cross-site scripting (XSS), the most common web vulnerability for two decades running. It comes in three flavours. Reflected XSS: the attacker crafts a URL with malicious script in the query string; the server echoes the query string back into the page; the script executes on whoever visits the URL. Stored XSS: the attacker submits malicious script as a comment or profile field; the server stores it in the database; every subsequent visitor's browser executes it. DOM-based XSS: a similar attack carried out entirely client-side via JavaScript that interpolates URL parameters into the DOM unsafely.

Fig 12.9 — XSS · three ways for attacker code to run with your origin's privileges

Three pipelines, three injection points, one outcome: attacker code in your origin.

REFLECTED — query string echoed into HTML
① attacker → victim: https://search.example.com/?q=<script>steal()</script>
② server response: <p>Results for: <script>steal()</script></p>
③ browser executes the injected script as same-origin with example.com
Defense: HTML-escape any user input echoed into HTML; never trust query strings.

STORED — payload persisted in database, served to every visitor
① attacker posts comment: Nice post! <script>steal()</script>
② server stores it · every later visitor's HTML contains the script
③ all subsequent visitors' browsers execute the script on every page view
Defense: same as reflected, plus output-encode at render time, not at storage time.

DOM-BASED — injection happens entirely in the browser, never touches the server
① page JS reads location.hash: document.body.innerHTML = location.hash;
② attacker's URL: https://example.com/#<img src=x onerror=steal()>
③ browser parses the hash as HTML, runs the onerror — server saw nothing suspicious
Defense: never assign untrusted strings to innerHTML; use textContent or DOM APIs that don't parse HTML.

All three XSS variants share the same shape — attacker-controlled string ends up where the browser parses it as code — and all three are exploitable as long as user input flows into HTML, JavaScript, or attribute contexts without proper output encoding. The 2010s saw an industry-wide push toward template engines that escape by default (React's JSX, Vue's mustaches, Svelte's curly braces all encode by default), and toward tainted-string static analysis at build time. Even so, XSS still appears every year on the OWASP Top 10. The deeper fix — the one this section is heading toward — is Content Security Policy, which lets the server tell the browser "do not run inline scripts at all, and only run scripts from these specific origins." Even if an attacker injects a <script>, CSP prevents the browser from executing it.
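The defenses in Fig 12.9 reduce to one discipline: encode at the moment a string crosses into an HTML context. A minimal sketch; escapeHtml is a hypothetical helper (not a built-in), and userQuery stands for any untrusted input:

// Replace the five characters HTML treats as syntax with entity references.
function escapeHtml(s) {
  return s.replace(/&/g, '&amp;')
          .replace(/</g, '&lt;')
          .replace(/>/g, '&gt;')
          .replace(/"/g, '&quot;')
          .replace(/'/g, '&#39;');
}

// Reflected/stored: encode at render time, never at storage time.
const page = `<p>Results for: ${escapeHtml(userQuery)}</p>`;

// DOM-based: textContent treats its input as text, never as markup.
document.getElementById('results').textContent = userQuery;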

CSRF: making your browser the attacker

Same-origin policy stops origin A's JavaScript from reading origin B's responses. It does not stop origin A from making requests to origin B. A page on evil.com can make a POST request to bank.com/transfer, and the browser will cheerfully attach bank.com's session cookie to the request. If the bank trusts cookies as the only authentication, the request succeeds — the attacker has just made the user transfer money from their authenticated session, even though the user only visited evil.com. This is cross-site request forgery (CSRF), and the standard defence is the CSRF token: the server includes a random unguessable token in the form HTML; the form submission must echo that token in a hidden field; the server verifies the token before processing. Since evil.com cannot read bank.com's pages (same-origin policy), evil.com cannot read the token, cannot forge a valid submission, and the attack fails.

Fig 12.10 — CSRF · the cookie attaches to the cross-site request unless we add a token

An unguessable token in the form means the attacker can submit but cannot forge.

WITHOUT CSRF TOKEN — attack succeeds
① user logs into bank.com, browser stores session cookie
② user visits evil.com (in another tab, while the bank session is still valid)
③ evil.com runs: <form action="bank.com/transfer" method="POST">…</form> · auto-submit
④ browser sends the POST with bank.com's cookie attached
⑤ bank processes the transfer · attacker won

WITH CSRF TOKEN — attack fails
① bank.com renders the form with hidden <input name="csrf" value="x9k2A…"> · token tied to session
② evil.com cannot read bank.com's HTML (same-origin policy blocks it)
③ evil.com submits a POST without the token (or with a guessed, wrong token)
④ bank receives the POST, validates the token against the session — mismatch
⑤ bank rejects · 403 Forbidden · attack fails

Modern alternative: the SameSite=Strict cookie attribute · the cookie is not sent on a cross-site POST at all.
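A minimal sketch of the token round-trip in Node.js. The in-memory session store and the function names (renderTransferForm, validateCsrf) are illustrative, not a real framework API:

const crypto = require('node:crypto');

const sessions = new Map(); // sessionId -> { csrfToken }

// On GET: mint a fresh token, tie it to the session, embed it in the form.
function renderTransferForm(sessionId) {
  const token = crypto.randomBytes(32).toString('hex');
  sessions.set(sessionId, { csrfToken: token });
  return `<form method="POST" action="/transfer">
  <input type="hidden" name="csrf" value="${token}">
</form>`;
}

// On POST: reject unless the submitted token matches the session's token.
function validateCsrf(sessionId, submitted) {
  const expected = sessions.get(sessionId)?.csrfToken;
  if (!expected || typeof submitted !== 'string') return false;
  const a = Buffer.from(submitted, 'utf8');
  const b = Buffer.from(expected, 'utf8');
  // timingSafeEqual requires equal lengths; comparing this way avoids
  // leaking the token byte-by-byte through response timing.
  return a.length === b.length && crypto.timingSafeEqual(a, b);
}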

CSRF tokens are an old defence (~2002); the modern complement is the SameSite cookie attribute, introduced in 2016 and defaulted to Lax by Chromium-based browsers since 2020, with other engines following to varying degrees. SameSite=Lax means: attach this cookie to cross-site requests only when they are top-level GET navigations, never to cross-site POSTs or subresource requests. SameSite=Strict goes further: never attach the cookie to any cross-site request at all. Together with CSRF tokens, they make the classical CSRF attack mostly obsolete on a properly configured site. The deeper lesson is that browser security has, over twenty years, moved from "defend at the application layer" (CSRF tokens, anti-XSS escaping) toward "defend at the platform layer" (SameSite, CSP, Trusted Types). The web's own primitives are getting safer; the application code matters less than it used to.
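On the wire, the whole mechanism is one attribute on the Set-Cookie response header. A minimal sketch with illustrative cookie names and values:

Set-Cookie: session=abc123; Secure; HttpOnly; SameSite=Lax
  → attached to top-level cross-site navigations, never to cross-site POSTs or subresources
Set-Cookie: session=abc123; Secure; HttpOnly; SameSite=Strict
  → never attached to any cross-site request
Set-Cookie: embed=abc123; Secure; SameSite=None
  → always attached cross-site; browsers require Secure alongside None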

CSP: telling the browser which scripts you trust

The strongest mitigation in the modern web security stack is Content Security Policy (CSP), introduced 2010, standardised 2014, deployed everywhere by the late 2010s. CSP is a response header (Content-Security-Policy: …) in which the server declares, in a small declarative language, exactly which origins the browser is permitted to load resources from — separately for scripts, styles, images, frames, fonts, fetch destinations, and every other resource type. A typical strict policy: script-src 'self' https://cdn.example.com; style-src 'self' means: "execute scripts only from the same origin and from cdn.example.com; load styles only from the same origin; reject everything else." If an attacker manages to inject a <script src="evil.com/x.js">, the browser sees that evil.com is not in the policy and refuses to load the script. The XSS attack from Fig 12.9 is neutralised even though the injection succeeded.

Fig 12.11 — CSP · the server tells the browser which scripts to trust, the browser refuses the rest

A short header that defangs most XSS even after injection succeeds.

SERVER RESPONSE HEADER
Content-Security-Policy: default-src 'self'; script-src 'self' https://cdn.example.com; style-src 'self' 'unsafe-inline';

BROWSER ENFORCES — every resource load checked against the policy
<script src="/app.js">                         ✓ self origin
<script src="https://cdn.example.com/lib.js">  ✓ allow-listed
<script src="https://evil.com/x.js">           ✗ not in policy
<script>steal()</script> (inline)              ✗ no 'unsafe-inline'
eval("…")                                      ✗ no 'unsafe-eval'

CSP also reports violations to a configurable URL — XSS attempts become alerts in your monitoring system.

CSP is unique among the defences in this section: it makes the browser an active enforcer of an application-level policy. Even if attacker code is injected into the page (a server bug let it through; a third-party script was compromised; a stored-XSS payload made it past sanitisation), the browser refuses to execute anything not on the allow-list. Strict CSP — script-src 'self' with no 'unsafe-inline', all scripts in external files — eliminates entire classes of XSS. The cost is real: legacy code that uses inline event handlers (onclick="…") breaks; inline scripts must move to external files or be allowed per-response via a cryptographic nonce; third-party widgets need their origins enumerated. But the security gain has been enough to push serious production sites toward strict CSP over the past decade. As of 2026, a major site shipping without CSP is the exception.
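A minimal sketch of serving a nonce-based strict policy with Node's built-in http module; the port and page content are illustrative. The nonce changes on every response, so an attacker who injects markup cannot guess it:

const http = require('node:http');
const crypto = require('node:crypto');

http.createServer((req, res) => {
  // Fresh per-response nonce; only scripts carrying it may execute.
  const nonce = crypto.randomBytes(16).toString('base64');
  res.setHeader('Content-Security-Policy',
    `default-src 'self'; script-src 'self' 'nonce-${nonce}'; style-src 'self'`);
  res.setHeader('Content-Type', 'text/html');
  res.end(`<!doctype html>
<script nonce="${nonce}">console.log('runs: nonce matches');</script>
<script>console.log('blocked: inline, no nonce');</script>`);
}).listen(8080);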

Closing the chapter, closing the part

JavaScript was designed in ten days by one person under the wrong brief, given an unrelated marketing name to ride a competitor's hype, shipped before specification, and exported from the browser to the server fourteen years after birth. The event loop it inherited from the early browser is now the dominant concurrency model on the modern web. The engines that run it are masterpieces of compiler engineering that exist only because the language is everywhere. The security mechanisms that surround it — same-origin policy, CSRF tokens, Content Security Policy, Trusted Types, Subresource Integrity — were retrofitted over thirty years in response to attacks that worked. None of this was inevitable. All of it became, by accumulation, the shape of the modern web.

Part III is now complete. We started with voltage on a wire (Chapter 8), promoted it into routed packets (Chapter 9), built reliability on top of unreliability (Chapter 10), turned the network into a worldwide library of trustworthy documents (Chapter 11), and made those documents alive (Chapter 12). The reader who has followed all five chapters now has, in principle, the ability to read every layer of an HTTPS request from voltage on copper to a piece of JavaScript modifying the DOM and rendering at 60 frames per second. Part IV picks up where Part III left off: where information lives at rest, and how it stays trustworthy when nobody is watching the connection. The relational database. The cryptographic primitives in mathematical depth. And the unified security chapter that ties together every attack we have foreshadowed in Parts I, II and III into a single working theory of why systems break and how we keep them standing.

End of Part III

The network is built.

Voltage to packets to TCP to HTTP to JavaScript. Five chapters, every layer from copper to pixels. Part IV turns to where information lives at rest — the relational database, real cryptography, and the unified security chapter that ties Parts I through III together.