Show HN: OculOS – Give AI agents control of your desktop via MCP

github.com

1 points by stif1337 5 hours ago

Hi HN,

I built OculOS because giving AI agents (like Claude Code or Cursor) control over desktop apps is still surprisingly difficult. Most current solutions rely either on screenshots run through slow OCR or vision models, or on fragile pixel coordinates.

OculOS is a lightweight daemon written in Rust that reads the OS accessibility tree and exposes every button, text field, and menu item through a structured JSON API and an MCP server.

Why this is different:

Semantic Control: No screenshots or coordinates. The agent interacts with actual UI elements (e.g., "Click the 'Play' button").

Rust-powered: A single binary with no runtime dependencies and very low latency.

Universal: Supports Windows (UIA), macOS (AXUIElement), and Linux (AT-SPI2).

Local & Private: Everything runs on your machine; no UI data is sent to the cloud.
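
To make the "Semantic Control" point concrete, here is a rough sketch of how an agent could resolve "the 'Play' button" against a structured element tree instead of a screenshot. The JSON shape, the field names (`role`, `name`, `id`, `children`), and the `find_elements` helper are all assumptions made up for illustration; they are not OculOS's actual schema or API.

```python
import json

# Hypothetical snapshot of an accessibility tree.
# Field names are illustrative only, not the real OculOS schema.
TREE_JSON = """
{
  "role": "window", "name": "Music Player", "id": "w1",
  "children": [
    {"role": "button", "name": "Play", "id": "b1", "children": []},
    {"role": "button", "name": "Next", "id": "b2", "children": []},
    {"role": "group", "name": "Search", "id": "g1", "children": [
      {"role": "text_field", "name": "Query", "id": "t1", "children": []}
    ]}
  ]
}
"""


def find_elements(node, role=None, name=None):
    """Depth-first search yielding nodes that match the given role and/or name."""
    role_ok = role is None or node["role"] == role
    name_ok = name is None or node["name"] == name
    if role_ok and name_ok:
        yield node
    for child in node.get("children", []):
        yield from find_elements(child, role, name)


tree = json.loads(TREE_JSON)

# Resolve "Click the 'Play' button" to a concrete element, no coordinates involved.
play = next(find_elements(tree, role="button", name="Play"))
print(play["id"])
```

The lookup keys on role and label, so it keeps working if the window is moved or resized, which is where coordinate-based approaches tend to break.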

It also includes a built-in dashboard for element inspection and an automation recorder. I’m looking forward to your feedback and technical questions!