Back to browse
GitHub Repository

The Runtime Security Layer for OpenClaw/Hermes-agent, the essential safety harness for PII & sensitive credentials protection.

294 starsRust

ClawShell, Process-Level Isolation for OpenClaw Credentials

by guanlan·Feb 19, 2026·10 points·1 comment

AI Analysis

●●●BangerSolve My ProblemZero to One

Moves credential security from prompt-injection hope to OS process isolation for agents.

Strengths
  • Virtual-to-real API key mapping keeps real credentials out of agent process memory entirely
  • DLP scanning with regex patterns detects PII in request/response bodies before upstream
  • Proven threat model: authors demonstrated prompt injection exfiltration in 2 minutes, built real solution
Weaknesses
  • Unix-focused (/etc/clawshell permissions); Windows support unclear from README
  • Tight coupling to OpenClaw ecosystem; reusability for other agent frameworks unproven
Category
Target Audience

Teams deploying OpenClaw or custom LLM agents with production credentials and sensitive data

Similar To

Teleport (infrastructure privilege isolation) · HashiCorp Vault (secrets management)

Post Description

Hi HN,

I’ve been using OpenClaw daily since it dropped in November. I love the agency it provides, but as I started giving it more production API keys and access to my local filesystem, I realized the threat model was essentially "hope-based."

We ran an experiment to see how resilient a standard OpenClaw setup was to prompt injection. Within 2 mins, we were able to exfiltrate active session tokens and API credentials through the chat interface.

The problem is fundamental: in most agent architectures, the LLM logic and the sensitive credentials live in the same process space. If the agent is tricked, the attacker has everything.

We built and open-sourced a project called ClawShell to move the security boundary from the "prompt" to the "system runtime."

How it works: ClawShell acts as a privileged protection layer. It isolates sensitive operations into a separate process enforced by the OS. The secrets never enter the agent’s memory or process space. When the agent needs to perform an action, it sends a request to the ClawShell wrapper, which validates the intent and executes the call using the protected keys.

If the agent is hijacked via prompt injection, the attacker gets a scoped identifier that contains zero credentials and no lateral access to the sensitive environment.

Key Technical Details: * Structural Boundaries: We assume the LLM is untrusted. Isolation is handled at the OS level, not via "system prompts." * Zero-Trust Tooling: The agent triggers the tool, but the tool execution is handled by a separate, restricted process. * Compatibility: It’s designed to be a drop-in wrapper for existing OpenClaw instances.

We’re launching v0.1 today. I’m curious to hear how others are thinking about the "Lethal Trifecta" (Data + Action + Communication) in the agent space. Is anyone else looking at Sanboxing for this, or is OS-level isolation the right path?

Similar Projects

Security●●Solid

Aquaman keeping your OpenClaw secrets safe

The plugin-proxy split is smart: credentials live in a backend (Keychain/1Password/Vault/etc.) and a separate proxy injects auth headers over a UDS so the agent process never handles raw keys. It autosurveys plugin configs and channels to migrate plaintext secrets and even ships a Docker image and CLI for local setups — very practical for anyone already on OpenClaw, though it’s narrowly focused and adds an extra trusted component that deserves an audit.

Niche GemSolve My Problem
tech4242
104mo ago