Turnaround Time, TTFB, and Latency — Three Terms, Three Problems

easy

Learn with your AI

Open this lesson in your favourite AI. It'll walk you through the why, explain the demo, and quiz you on the try-it list.

Open in Claude Open in ChatGPT

Why this matters

These three terms appear on every APM dashboard and SRE runbook, yet engineers use them interchangeably while meaning completely different things. TTFB (Time to First Byte) measures the gap from when the client sends a request to when it receives the first byte of the response — it captures server think-time plus one round-trip of network travel, but excludes how long the body takes to transfer. Turnaround time is the full wall-clock duration from first byte sent to last byte received; it includes TTFB plus body-transfer time. Latency, strictly, is one-way propagation delay; in practice it is used loosely to mean round-trip time or TTFB. Getting these wrong costs you: a 50 KB JSON payload with a 5 ms TTFB and 280 ms transfer time has a 285 ms turnaround — optimising server logic (which affects TTFB) will not help you because the body transfer is the bottleneck.

Demo

HTTP responses consist of two distinct durations: TTFB (the server think-time plus one network round-trip) and body transfer (proportional to payload size). Treating them as a single "latency" number masks which half is the bottleneck — a 5 ms TTFB with 280 ms transfer means server-side optimization is wasted effort. Splitting the measurement first tells you exactly where to spend engineering time.

Try it yourself

Run the demo. Compare /posts (100-item array) vs /posts/1 (single object) — which number changes: TTFB, transfer, or both? Explain why.

Swap in https://httpbin.org/delay/0.3 (300 ms artificial server delay) and observe which metric grows — does transfer time change too? Why or why not?

Try https://httpbin.org/bytes/500000 (500 KB) — watch transfer time climb while TTFB holds steady. Calculate roughly: at what body size does transfer time exceed your observed TTFB?

Prompt your AI

Use these three in order. Each builds on the one before.

1. Basics & terminology

In one paragraph, explain the difference between TTFB, turnaround time, and latency as if I'm new to API performance.

2. Why it works (the mechanism)

Walk me through exactly what happens between the moment a client sends an HTTP GET and the moment it receives the first response byte — every network and server step.

3. Advanced — application & what's next

My API's p99 TTFB is 12 ms but p99 turnaround time is 1,400 ms. What does that tell me about the bottleneck, and what should I investigate first?