
Your AI services are listening on 0.0.0.0


Most local AI tools split between two defaults: loopback-only (safe but breaks remote access) and all-interfaces (works everywhere, reachable from anywhere). The split matters less than this: most developers don’t choose. They accept whatever the tool shipped with, or they copy a Docker snippet from a tutorial that maps 11434:11434 with no bind prefix, and the result is the same either way. An LLM inference server, or an MCP SSE endpoint, ends up reachable on whatever interface the host happens to expose.

rigscore’s network-exposure check exists because that exposure pattern is common enough to warrant a dedicated detector. It’s also advisory rather than scored, and the reason for that is part of the story.

The problem: 0.0.0.0 vs 127.0.0.1

Binding to 0.0.0.0 means the service accepts connections on any network interface the host has. Binding to 127.0.0.1 means localhost only: not reachable from the network at all. On a laptop with Wi-Fi, 0.0.0.0 means the coffee shop next to you. On a cloud VM without firewall rules, it means the internet. Inside a container orchestrator, it usually means the host’s bridge interface, which may or may not be what you wanted, depending on your network config.

None of this is new. What’s new is that AI services are now something worth reaching.

Which AI tools default where

Verified against rigscore’s AI_SERVICE_PORTS map in src/constants.js:

| Tool | Default port | Default bind | Notes |
|---|---|---|---|
| Ollama | 11434 | 127.0.0.1 | OLLAMA_HOST=0.0.0.0 is commonly set for Docker or remote access and is the single most common finding. |
| LM Studio | 1234 (1235 alt) | Server toggle | Off by default; once enabled, the bind address depends on the build. |
| Open WebUI | 8080 | Container bind | Almost always run in Docker; the compose file’s port mapping is the real bind. |
| MCP SSE servers | 3001 (heuristic 3000–3999) | Varies | Most ship loopback; a non-loopback bind is rarely intentional. |
| LiteLLM | 4000 | Varies | Proxy deployments legitimately bind to all interfaces; flag for review. |
| LocalAI | 5001 | 0.0.0.0 | Drop-in OpenAI-compatible server; ships broad by default. |
| vLLM | 9090 | 0.0.0.0 | Inference server intended for deployment; expects a firewall. |
| FastChat | 8000 | 0.0.0.0 | Research-oriented; binds broadly. |

The takeaway from the table is not “these defaults are wrong.” It’s that the defaults vary, they are under-documented, and nobody audits them twice.

Four detection surfaces

rigscore reads four sources and reconciles them:

  1. MCP client config URL parsing. .mcp.json, .vscode/mcp.json, and six client configs under $HOME are scanned for SSE or streamable-HTTP URLs that target non-loopback hosts. This is the one surface that raises CRITICAL: a non-loopback MCP endpoint is almost never intentional.
  2. Docker compose port mapping. docker-compose.yml and related files are parsed for AI-service ports declared without a 127.0.0.1: prefix. "11434:11434" becomes a finding; "127.0.0.1:11434:11434" does not.
  3. Ollama config files. Systemd drop-ins and .env-style config that set OLLAMA_HOST=0.0.0.0.
  4. Live listener scan. ss on Linux or lsof on macOS, filtered to known AI ports plus the MCP SSE heuristic range 3000–3999. This is the one surface that catches “whatever the running process is actually doing” regardless of what the config files claim.

The four-layer approach exists because any single surface misses something. Config-only misses runtime overrides; runtime-only misses services that happen to be down when you scan. The deploy-gating audit script was updated to verify these controls directly, checking container configurations and network bindings rather than package installation status. The same logic applies here.
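Reconciling the surfaces comes down to one question asked of every host string: is it loopback? A minimal sketch of that classification and the compose-mapping rule built on it (illustrative function names, not rigscore’s actual code; IPv6 bracket syntax is ignored for brevity):

```javascript
// Decide whether a configured host string is loopback-only.
// Hypothetical helper, not rigscore's real implementation.
function isLoopback(host) {
  if (host === 'localhost' || host === '::1') return true;
  // Any 127.x.x.x address is loopback (RFC 5735).
  return /^127\.\d{1,3}\.\d{1,3}\.\d{1,3}$/.test(host);
}

// A compose mapping with no bind prefix ("11434:11434") publishes on
// 0.0.0.0, so "no prefix" is treated as non-loopback.
function composeMappingIsExposed(mapping) {
  const parts = mapping.split(':');
  // host:hostPort:containerPort has 3 parts; hostPort:containerPort has 2.
  if (parts.length < 3) return true;
  return !isLoopback(parts[0]);
}

console.log(composeMappingIsExposed('11434:11434'));           // true (exposed)
console.log(composeMappingIsExposed('127.0.0.1:11434:11434')); // false
```

The same predicate serves the MCP URL surface: parse the endpoint, extract the hostname, and a false from isLoopback is what escalates to CRITICAL.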

Attack scenarios

  • LAN exposure. Another device on the same network queries your local LLM. No auth, full inference access.
  • Public Wi-Fi. Coffee shop attacker scans the subnet and finds your Ollama instance on 11434.
  • Cloud VM. AI service on a VPS without firewall rules is internet-exposed. This one generates abuse reports fast.
  • Lateral movement. A compromised container reaches AI services on the host bridge that were assumed private.

The MCP SSE case is the sharpest. An SSE endpoint on 0.0.0.0 lets any process on any reachable interface impersonate a legitimate agent client, which maps to OWASP Agentic Top 10 ASI07 (Insecure Inter-Agent Communication). If the agent is authorized to run tools, the attacker is too.

Sample output

npx github:Back-Road-Creative/rigscore --check network-exposure
ⓘ network-exposure (advisory)
  CRITICAL MCP server "local-rag" SSE endpoint on non-loopback host
    Server "local-rag" in .mcp.json targets 192.168.1.20:3001.
    Non-loopback MCP endpoints are reachable from the network.
  WARNING Docker port 11434 (Ollama) exposed without loopback bind
    Container "ollama" in docker-compose.yml maps port 11434 without
    explicit 127.0.0.1 bind. It will listen on all interfaces.
    Fix: change "11434:11434" to "127.0.0.1:11434:11434".
  WARNING Live listener on 0.0.0.0:11434 (Ollama)
    Process is currently bound to all interfaces.

Per-tool fix instructions

  • Ollama. OLLAMA_HOST=127.0.0.1, or remove the override entirely.
  • Docker ports. Change "11434:11434" to "127.0.0.1:11434:11434". Same pattern for every AI port.
  • MCP SSE servers. Bind the server itself to 127.0.0.1 in its own config. Fix the URL in the client config to match.
  • General. If remote access is actually intentional, put a reverse proxy with auth in front. The problem isn’t the bind; it’s the bind without a gate.
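For the Docker case, the fix is a one-token change in the port mapping. A compose fragment showing both forms (service name and image tag are illustrative):

```yaml
services:
  ollama:
    image: ollama/ollama
    ports:
      # Exposed: no bind prefix, so Docker publishes the port on 0.0.0.0.
      # - "11434:11434"
      # Loopback-only: reachable from this host, not from the network.
      - "127.0.0.1:11434:11434"
```

The same prefix works for any AI-service port in the table above; nothing else in the service definition has to change.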

Why advisory, not scored

The check is weight 0. It does not affect your rigscore number. That is a deliberate design decision, not a TODO.

Local development legitimately binds to 0.0.0.0 all the time. Docker bridge networking requires it. WSL2 VM networking often requires it. Codespaces and devcontainer setups default to it. Team members on a shared LAN who want to hit each other’s Ollama instances for ad-hoc testing configure it deliberately. The false-positive rate on “is this bind intentional?” is too high to score without penalizing the majority of legitimate workflows.

Scoring it would do one of two bad things: penalize normal setups (eroding trust in the overall score) or force every user to maintain an allowlist (governance overhead without clear upside). Advisory is the honest middle: the check runs, the findings surface, the fix instructions are there, and you decide whether the exposure is intentional. Documentation reflects the actual security posture. When intent matters more than the signal, the right move is to report clearly and refuse to grade.

The one exception is the MCP SSE CRITICAL finding. That specific pattern is rarely intentional, so it emits at CRITICAL level inside the advisory check to stay visible, but it still doesn’t feed the score. If you want a single line that tells you whether your AI services are exposed, this is the check. If you want it to change your number, it won’t. That’s the point.

More detail is in the rigscore docs.

Configuration details reflect a production environment at time of writing. Implementation specifics vary based on tooling versions, platform updates, and organizational requirements. Validate approaches against current documentation before deployment.