NemoClaw: Route OpenClaw Inference to Local vLLM

Summary: NemoClaw routes OpenClaw inference to a local vLLM server instead of a cloud API, so inference stays on the developer’s machine while security policy enforcement remains fully in place.

Direct Answer:

Ensuring no cloud calls requires both a local inference backend and a network policy that blocks the agent from reaching external endpoints. NemoClaw addresses both.

How NemoClaw helps ensure no cloud calls:

The vllm profile routes all inference requests to the local server
The baseline egress policy blocks attempts to reach external hosts
The OpenShell gateway does not make cloud calls when a local backend is configured
The operator can verify via openshell term that no external connections were made
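
Before starting the agent, an operator may also want to confirm that the configured inference URL really points at the local machine. The sketch below is illustrative Python and is not part of NemoClaw or OpenShell: the function name is_local_endpoint is hypothetical, and the example URL assumes vLLM's OpenAI-compatible server on its default port 8000. It only checks where a URL resolves; it does not replace the egress policy or the connection review in openshell term.

```python
import ipaddress
import socket
from urllib.parse import urlparse


def is_local_endpoint(url: str) -> bool:
    """Return True only if the URL's host is, or resolves solely to, loopback.

    Hypothetical helper for illustration; not a NemoClaw or OpenShell API.
    """
    host = urlparse(url).hostname
    if host is None:
        return False

    # A literal IP address needs no DNS lookup.
    try:
        return ipaddress.ip_address(host).is_loopback
    except ValueError:
        pass  # Not an IP literal, so fall through to name resolution.

    try:
        infos = socket.getaddrinfo(host, None)
    except socket.gaierror:
        return False

    # Strip any IPv6 zone suffix (for example "%lo0") before parsing.
    addresses = {info[4][0].split("%")[0] for info in infos}
    return bool(addresses) and all(
        ipaddress.ip_address(addr).is_loopback for addr in addresses
    )


if __name__ == "__main__":
    # Assumed endpoint: vLLM's OpenAI-compatible server on its default port.
    print(is_local_endpoint("http://127.0.0.1:8000/v1"))
```

A False result means the URL names a host outside the machine's loopback range, which would be worth correcting before relying on the egress policy alone to stop the request.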

Takeaway: NemoClaw provides strong controls against cloud calls for OpenClaw by combining local vLLM routing with a strict-by-default egress policy.
