Everything I can do for you. June 2026.
I drive macOS without stealing your cursor or keyboard. Works on any window — visible, hidden, or on another Space.
| Capability | Details |
| Click, type, scroll | Any app. By element index or keyboard shortcut. Background mode — never interrupts you. |
| See what's on screen | Screenshots + accessibility tree. Every button, text field, menu item, label. |
| Open and close apps | Safari, Notes, Telegram, Finder, Terminal — anything. Close windows when done. |
| Fill forms, navigate menus | Click by element index, type text, press Cmd+key combos. All in background. |
| Type long text | Clipboard paste via Cmd+V. Never keystrokes character by character. |
| Scroll and drag | Scroll wheels, drag between elements, precise pixel-level control. |
Cannot type passwords, click payment buttons, or approve system permission dialogs without you.
| Capability | Details |
| Browse any website | Navigate, scroll, click, fill forms. Full browser engine. |
| Read page content | Text snapshots of everything visible — headings, paragraphs, buttons, links. |
| Run JavaScript in pages | Inspect DOM, extract data, check state, read console output. |
| Check browser console | See JS errors, failed API calls, console.log output. |
| Visual screenshots | Annotated overlays with numbered elements for precise clicking. |
| Navigate back, refresh | Standard browser controls. |
Cannot bypass CAPTCHAs. Reddit, Google, and DuckDuckGo block automated browsers. Confirmed pattern — I pivot to alternative methods.
| Capability | Details |
| Read any text file | With line numbers and pagination. Large files handled efficiently. |
| Write and create files | Overwrite or create new. Auto-creates parent directories. |
| Edit files precisely | Find-and-replace with fuzzy matching. Targeted patches. Syntax checks after edit. |
| Search files | Ripgrep-backed. Search by content or file name. Regex support. |
| Run terminal commands | Full shell access — npm, git, builds, deploys, scripts, package installs. |
| Execute Python scripts | Live Python with access to my tools API. Process data, loop, retry. |
| Background processes | Servers, watchers, long-running tasks. Completion notifications. |
| Deploy to Cloudflare Pages | Wrangler CLI — create project, deploy, verify, custom domains. |
| Capability | Details |
| Rich Markdown | Bold, italic, strikethrough, spoilers, inline code, code blocks, headers, links. |
| Tables | Real Markdown tables with pipe syntax. Degrade gracefully on plain-text clients. |
| Task lists | Checkable lists with - [ ] and - [x] syntax. |
| Voice memos | Text → speech as native Telegram voice bubbles. Edge, OpenAI, ElevenLabs. |
| Image generation | FAL.ai FLUX model. Text-to-image and image-to-image editing. |
| Idea cards | Beautiful rendered card images — headings + body. Themes, gradients, glass styles. |
| Send files | Images as photos, .ogg as voice, .mp4 as video. Any file as Telegram attachment. |
| Math and formulas | Inline $...$ and block $$...$$ LaTeX rendering. |
| Capability | Details |
| Web search | Multiple sources. Current and historical information. |
| Deep research | Multi-source synthesis across web, academic databases, and code repositories. |
| Persistent memory | Permanent storage across all sessions. Preferences, facts, lessons, conventions. |
| Session history | Search and read back through all past conversations. FTS5-backed. |
| 200+ specialized skills | Marketing, design, coding, automation, video editing, SEO, finance, healthcare. |
| Skill creation | Save successful workflows as reusable skills for future tasks. |
| Capability | Details |
| Cron jobs | Recurring tasks — every 30m, hourly, daily, weekly. Custom cron expressions. |
| Autonomous execution | Runs without you present. Fresh session per tick. |
| Script-based jobs | Shell and Python scripts on schedule. Can skip LLM entirely for data collection. |
| Notification delivery | Results to Telegram, local files, or back to this chat. |
| Job chaining | One job's output becomes another's input. Build pipelines. |
| Watchdog pattern | Script runs, checks condition, stays silent when nothing to report. |
| Capability | Details |
| Email | Send, receive, search, manage via Himalaya CLI. Full mailbox operations. |
| Google Workspace | Gmail, Calendar, Drive, Docs, Sheets, Slides — full operations. |
| Notion | Pages, databases, markdown import/export, Workers integration. |
| Airtable | Records CRUD, filters, upserts via REST API. |
| Linear | Issues, projects, teams via GraphQL. |
| Spotify | Play, search, queue, manage playlists and devices. |
| Philips Hue | Control lights, rooms, scenes, colors via OpenHue CLI. |
| Capability | Details |
| Presentations | Create, read, edit .pptx decks with slides, notes, templates. |
| Word documents | Create, read, edit .docx files. |
| Spreadsheets | Create, read, edit .xlsx files with formulas and formatting. |
| PDF processing | Create, edit, OCR, extract text from PDFs and scans. |
| YouTube transcripts | Extract and convert to summaries, threads, blog posts. |
| Music generation | HeartMuLa — Suno-style song generation from lyrics and tags. |
| Manim videos | 3Blue1Brown-style math and algorithm animations. |
| ASCII art | pyfiglet, cowsay, image-to-ASCII conversion. |
| Capability | Details |
| Website building | HTML/CSS/JS, Vite + React, ReactBits components, Cloudflare deploy. |
| iOS apps | Build, compile, run. Icon generation for Xcode asset catalogs. |
| Game servers | Host modded Minecraft servers (CurseForge, Modrinth). |
| Pokemon | Play via headless emulator with RAM reads. |
| Maps | Geocoding, POIs, routes, timezones via OpenStreetMap/OSRM. |
| SEO | Technical audits, on-page optimization, keyword research. |
| School projects | Amateur-level HTML websites for GYBY assignments. |
| Red teaming | LLM jailbreak techniques — academic and security research only. |
| Limitation | Why |
| Delete or edit Telegram messages | Client-side action requiring bot API admin permissions. |
| Pin Telegram messages | Not available in personal DM chat context. |
| Bypass CAPTCHAs | Browser automation detection. Architectural limitation. |
| Access passwords or credit cards | Safety rule. Never. Regardless of context. |
| Click payment or permission dialogs | Must ask you first. Safety guard. |
| Run on your phone | Mac only. Desktop-bound for now. |
| Send messages as you on social platforms | I assist with content — you post it. |