How do I confirm it's mDNSResponder and not my code or API key on macOS?

Run dscacheutil -q host -a name . It uses the same daemon path as curl and Python, so an empty response while nslookup returns a valid IP confirms the daemon is the culprit — not your code or credentials. A second sanity check: curl --resolve :443: https:// — if that succeeds, routing and TLS are fine and mDNSResponder is the sole problem.

How do I fix the macOS DNS resolver when curl and pip can't resolve hosts?

Restart the mDNSResponder daemon, escalating weakest-first: (1) sudo dscacheutil -flushcache && sudo killall -HUP mDNSResponder; (2) sudo launchctl kickstart -k system/com.apple.mDNSResponder; (3) if that prints 'Could not find service' (the service path differs across macOS versions), the universal fix is sudo killall -9 mDNSResponder mDNSResponderHelper — launchd's KeepAlive restarts it with a fresh PID. Verify with pgrep -l mDNSResponder plus a repeat dscacheutil or curl. Do not touch your API keys, SDK versions, or code.

Why does my AI agent, MCP server, or long-running dev box keep hitting this DNS failure?

Connection-pool exhaustion. Many concurrent outbound long-poll connections — several MCP servers, a local inference/LLM server, and background jobs all holding sockets open — push mDNSResponder to high CPU until it stops answering. In one measured case, 30+ concurrent long-poll connections drove the daemon to 77% CPU and into the zombie state. Restarting the daemon recovers it, but recurrence only stops when you reduce the concurrent-connection source. Watch lsof -i -P -n | wc -l (80+ is a warning sign) and mDNSResponder CPU above ~30%.

Should I edit /etc/resolv.conf or reinstall my SDK to fix macOS DNS resolution?

No. macOS does not use /etc/resolv.conf for its system resolver (the file even says so in its own comments), so editing it won't help. Reinstalling an SDK or changing library versions doesn't touch the mDNSResponder daemon either. The actual fix is restarting mDNSResponder and, for recurrence, capping the number of concurrent long-poll connections. Switching to a public resolver like 8.8.8.8 is only a diagnostic step to rule out a bad upstream resolver, not a default.

macOS: nslookup works but curl and Python "Could not resolve host" — the mDNSResponder zombie

Q: Why does nslookup work but curl says 'Could not resolve host' on the same macOS machine?

Because they use different DNS paths. nslookup and dig query DNS servers directly and bypass the mDNSResponder daemon, while curl, Python (requests/httpx), and pip use getaddrinfo(), which routes through mDNSResponder. When the daemon becomes non-responsive — its process ID stays alive but it silently stops answering — the direct path keeps resolving while the system-resolver path fails on the exact same host. The failure is the daemon, not your network, API keys, or code.

If nslookup resolves a host fine but curl, pip, and Python (requests/httpx) fail with "Could not resolve host," your mDNSResponder daemon has almost certainly entered a non-responsive "zombie" state — and the fix is to restart it (sudo killall -9 mDNSResponder), not to touch your API keys, SDK versions, or code.

← hexisteme · notes · June 23, 2026 · 7 min read

This failure is maddening because every signal points the wrong way. nslookup example.com returns a clean IP, so DNS "works." Your network is up. Your code didn't change. Yet curl, pip install, and every Python HTTP call die with "Could not resolve host." People burn hours rotating API keys, downgrading SDKs, and editing config files. None of that is the problem.

The two-DNS-path asymmetry (your fastest diagnosis)

macOS resolves DNS over two independent paths, and the asymmetry is the diagnosis:

Path	Who uses it	Goes through mDNSResponder?
Direct	`nslookup`, `dig`	No — queries DNS servers directly
System resolver	`curl`, Python, `pip`, most apps (`getaddrinfo()`)	Yes — routes to the daemon

The daemon can keep its PID alive while silently refusing to answer. So the direct path (nslookup) succeeds and the system-resolver path (curl) fails on the exact same host. When one path works and the other fails, you are not looking at a network, key, or code problem — you are looking at a sick daemon.

Confirm it in 10 seconds

Use the daemon path directly so you're testing the same route curl uses:

# Uses the mDNSResponder path (same as curl/Python). Empty result = zombie.
dscacheutil -q host -a name example.com

# ...while the direct path still returns a valid IP:
nslookup example.com

If dscacheutil comes back empty but nslookup returns an IP, the daemon is confirmed as the culprit. One more sanity check proves routing and TLS are fine and isolates the daemon as the sole cause:

# Bypass resolution entirely with the IP nslookup gave you:
curl --resolve example.com:443:<IP-from-nslookup> https://example.com
# Succeeds? Then DNS resolution is the only broken thing.

Fix it weakest-tool-first

Start gentle and escalate only as needed:

# 1) flush + reload (least disruptive)
sudo dscacheutil -flushcache && sudo killall -HUP mDNSResponder

# 2) kickstart the service
sudo launchctl kickstart -k system/com.apple.mDNSResponder

# 3) universal hammer (if kickstart prints "Could not find service" —
#    the service path differs across macOS versions)
sudo killall -9 mDNSResponder mDNSResponderHelper
#    launchd's KeepAlive immediately restarts it with a fresh PID.

# verify
pgrep -l mDNSResponder && dscacheutil -q host -a name example.com

Why agents and long-lived dev boxes hit this repeatedly

The zombie is usually triggered by connection-pool exhaustion. Many concurrent outbound long-poll connections — say several MCP servers, a local inference/LLM server, and an ingest or daily-report job all holding sockets open at once — push mDNSResponder into high CPU until it stops answering.

Measured case: 30+ concurrent long-poll connections drove mDNSResponder to 77% CPU and into the non-responsive state. The recovery (restarting the daemon) alone doesn't stop recurrence — you have to find and cut the connection source. Watch for the active connection count climbing (lsof -i -P -n | wc -l in the 80+ range) and mDNSResponder CPU above ~30%.

"Disabled" in config does not mean the process is dead

A subtle trap when hunting the connection source: a server can be flagged off at the code or config level (an ENABLED=False switch) while its OS process keeps running and keeps holding its long-poll connections. The flag stopped new work from being dispatched but never killed the process, so it kept feeding the pile-up. When auditing, check ps/pgrep for the actual process and its elapsed time (etime) — not just the config flag.

# the flag says off; the process says otherwise
pgrep -lf <server-name>        # still there?
ps -o pid,etime,%cpu,command -p <pid>

Prevent recurrence — and the honest limitation

Two durable fixes: reduce concurrent long-poll load (trim always-on servers/MCP endpoints, kill stale inference servers, and on a box that runs for many days, restart mDNSResponder on a schedule), or install a watchdog LaunchDaemon that kickstarts the daemon when its CPU crosses a threshold.

Honest caveat: this exact two-path asymmetry is macOS-specific — it's mDNSResponder/getaddrinfo behavior, not Linux's nsswitch/resolv.conf. And a watchdog that calls launchctl kickstart needs root: implement it as a LaunchDaemon running as root with a narrowly scoped script, not a broad sudo NOPASSWD rule. The watchdog is a band-aid — the real fix is capping concurrent connections.

FAQ

Q. Why does nslookup work but curl says "Could not resolve host" on the same Mac?
Different DNS paths. nslookup/dig query DNS servers directly; curl/Python/pip use getaddrinfo() → the mDNSResponder daemon. A zombie daemon (PID alive, not answering) breaks the second path only.

Q. How do I confirm it's mDNSResponder, not my code or key?
dscacheutil -q host -a name <host> uses the daemon path — empty result while nslookup returns an IP confirms the daemon. curl --resolve with that IP succeeding proves routing/TLS are fine.

Q. How do I fix it?
Weakest-first: flushcache + killall -HUP → launchctl kickstart -k system/com.apple.mDNSResponder → sudo killall -9 mDNSResponder mDNSResponderHelper (launchd restarts it).

Q. Why does my agent/MCP box keep hitting this?
Connection-pool exhaustion from many concurrent long-poll connections (measured: 30+ → 77% CPU → zombie). Cut the connection source; restarting alone doesn't prevent recurrence.

Q. Edit /etc/resolv.conf or reinstall the SDK?
No — macOS ignores /etc/resolv.conf for its system resolver, and the SDK isn't involved. Restart the daemon and cap concurrent connections.

← hexisteme · notes · CC-BY 4.0