Back to browse
GitHub Repository

External execution gateway allowing a mobile LLM app to safely operate a desktop computer

2 stars

Using a mobile LLM app to safely operate a desktop computer

by Ruikhu·Feb 28, 2026·1 point·2 comments

AI Analysis

●●SolidBig BrainBold Bet

Separates LLM cognition from OS execution via discrete actions, not continuous control.

Strengths
  • Architecture genuinely inverts the threat model—phone LLM never touches system credentials or gets OS privileges.
  • Discrete decision problem (skill selection) is more predictable and auditable than continuous UI control.
  • Works with official mobile apps (ChatGPT/Claude), no custom model training needed.
Weaknesses
  • Very early stage—GitHub shows zero stars/forks, minimal documentation, no working demo or usage examples.
  • Practical constraints unclear: how many 'whitelisted primitives' cover real-world tasks? Still may need heavy skill engineering.
Category
Target Audience

Security-conscious automation engineers, AI safety researchers

Similar To

LangChain Agents · Anthropic Computer Use API · Browser automation (Playwright)

Post Description

Hi HN, I’ve been experimenting with a different approach to computer-using AI agents. Most current AI agents control computers using: • cloud APIs with stored credentials • browser automation • screenshot + vision + mouse control I tried something else. Instead of embedding the AI inside the computer, I use the official mobile LLM apps (ChatGPT / Claude) as the intelligence source, and built an external execution gateway that translates model intent into deterministic OS actions. The model never gets system privileges, and the computer never exposes credentials to the model. Architecture: phone LLM app → data link → action gateway → predefined action skills → desktop OS The gateway only executes whitelisted primitives: keyboard sequences window operations command calls The key idea is separating cognition and execution. The model outputs decisions, not motor control. The gateway performs verified actions. This turns computer control from a continuous UI manipulation problem into a discrete decision problem, which makes it more predictable and auditable. Early prototype — I’d really appreciate feedback, especially from people working on agent safety or permission models.

Similar Projects

AI/ML●●Solid

Routiium – self-hosted LLM gateway with a tool-result guard

Guards tool outputs against injection attacks, unlike LiteLLM or Helicone.

Solve My ProblemBig Brain
deadpixel
201mo ago
AI/ML●●Solid

An AI Operating System

VirtualBox sandbox isolates AI agents so mistakes never touch your host machine.

Big BrainBold Bet
juniormpakou
442mo ago