Concurrency model per language

medium

Learn with your AI

Open this lesson in your favourite AI. It'll walk you through the why, explain the demo, and quiz you on the try-it list.

Open in Claude Open in ChatGPT

Why this matters

The reason your server scales (or doesn't) is almost always about how it handles concurrency. Go uses lightweight goroutines scheduled onto OS threads. Python historically fights the GIL — modern Python uses asyncio and workers. Rust uses native OS threads or an async runtime (Tokio). Node runs a single-threaded event loop with worker threads for CPU work. Each model has a sweet spot and a failure mode. Knowing which one you're working in is the first step to knowing how many connections your laptop can actually handle.

Demo

A handler that sleeps for 100ms, under 50 concurrent requests, exposes the difference between threading, async, and the event loop — not theoretically but numerically. Go schedules goroutines onto threads and handles them all; Python's asyncio suspends coroutines cooperatively; Node's event loop queues them; Rust's Tokio does the same with zero-cost futures. The throughput numbers you see are why these models exist.

Try it yourself

Install wrk (brew install wrk on mac). Run your server, then: wrk -c100 -t4 -d10s http://localhost:8080/slow. Note the req/sec.

Replace the async sleep with a blocking sleep (time.sleep in Python, a busy loop in Node). Rerun wrk. Why does throughput crash?

In Go, try -c10000 connections. The numbers hold. In Python/Node with a blocking handler, try the same — the server likely hangs.

In each language, look up how many OS threads the runtime is actually using. ps -M <pid> on mac, top -H -p <pid> on linux. Surprising?

Think about your own apps. When you've hit 'the server is slow,' was it I/O (most common) or CPU? The right fix is different for each.

Prompt your AI

Use these three in order. Each builds on the one before.

1. Basics & terminology

Explain Go's goroutines, Python's asyncio, Rust's Tokio, and Node's event loop as concurrency models. Where does each put its concurrency — threads, tasks, callbacks?

2. Why it works (the mechanism)

A single Go process on a 4-core machine can run tens of thousands of goroutines. How? Walk me through the Go runtime's M:N scheduler — how goroutines map to OS threads, how the scheduler preempts, and why the cost of a goroutine is measured in bytes, not kilobytes.

3. Advanced — application & what's next

When your service spends 95% of its time on I/O (DB, downstream HTTP), any async model works. When it spends 95% on CPU, the model matters a lot. Pick a realistic service (image resizer, data transformer, ML inference) and explain which concurrency model is wrong for it, and why.