Agent OS — Working Notes

Using AI is not enough.
You need AI systems.

A virtual, hosted, POSIX-like operating system for AI agents. Built on WebAssembly + isolates. Everything is files. Every service is a mount point. Every agent is a process.

$ ls /integrations/gmail/inbox/

meeting_invite.eml investor_update.eml receipt.eml

$ mv draft.eml /integrations/gmail/outbox/

✓ Email sent.

$ agents/run job-finder --role "ML Engineer"

⟳ Scanning 45 career pages… 12 matches found.

01 — Narrative & Copy

10 Posts

Each works as a marketing site section, a LinkedIn post, or a short blog entry.

Post 01

You Don't Need a Better AI. You Need a System.

You can ask ChatGPT to write your emails, summarize your docs, and draft your LinkedIn posts. And it'll do all of those things. Once. One at a time. With you babysitting every step.

That's not a system. That's a very smart intern you have to micromanage.

A system is what happens when your AI can find new job postings, tailor your CV to each one, draft the cover letter in your voice, log the application, and follow up in 5 days — without you touching it. Not because the model got smarter. Because someone built the plumbing: the scheduler, the data flow, the state management, the retry logic.

The model is the brain. The system is the body. Right now, most people have a brain floating in a jar.

Agent OS gives the brain a body. A filesystem to store state. Processes to run tasks. Permissions to stay safe. A kernel to keep everything from crashing.

→ What would your AI system look like?

Post 02

Why Files and Folders Are the Future of AI

Contrarian take: the best abstraction for AI agents is the one we've had since 1969. Files and folders.

Every agent needs to read data, write output, and coordinate with other agents. That's a filesystem.

‣ Your agents are executables in /agents/.

‣ Your data lives in /data/ — structured, versioned, browsable.

‣ Your integrations are mounted directories: /integrations/gmail/inbox/ is literally your inbox. Creating a file in drafts/ creates a draft. Moving it to outbox/ sends it.

‣ Your automations are scripts in /scripts/ — cron jobs, triggers, workflows.

60 years of OS design already solved permissions, isolation, process management, IPC. We're not reinventing the wheel. We're putting AI on top of a wheel that already rolls.

→ This is what Agent OS is for.

Post 03

Your Company Already Has an OS. It's Made of Duct Tape.

Every business has a person — usually several — whose entire job is copying data between tools. "When a deal closes in the CRM, update the spreadsheet, notify the team in Slack, create the onboarding doc, and schedule the kickoff."

That's not knowledge work. That's being a human API.

On Agent OS:

/integrations/crm/deals/closed/ → watch for new files

/scripts/on_deal_close.ts → trigger: create doc, notify, schedule

/agents/onboarding-agent/ → runs the full new-client flow

The CRM, email, Slack, and calendar are all mounted directories. The automation is a script. The agent handles the nuance. No Zapier. No if-this-then-that chains that break when someone renames a field.

A 3-person startup on Agent OS operates like a 30-person company.

→ What would you build if your business had its own OS?

Post 04

The $400 Mistake That Proves You Need an OS

A developer spun up a multi-agent pipeline: research, write, review. Left it running overnight.

The research agent hit a loop. Kept re-querying the same API, burning tokens on duplicates. By morning: $400 bill. 47 versions of the same paragraph. Garbage output.

The fix wasn't a better model. It was resource limits.

In an operating system, you set cgroups — CPU limits, memory limits, I/O limits. A runaway process gets killed, not rewarded with more compute.

Agent OS has this built in. Every agent runs in a WebAssembly isolate with token budgets, API rate limits, and timeout policies. When an agent misbehaves, the OS kills it, logs the failure, and notifies you.

You wouldn't run a server without ulimit. Why are you running agents without one?

→ This is what Agent OS is for.

Post 05

Agents Don't Need More Intelligence. They Need Better Handoffs.

I've debugged multi-agent pipelines where every individual agent produced perfect output — and the final result was garbage.

The bug wasn't in any agent. It was in the handoff. Agent A serialized some context, passed it to Agent B, and B hallucinated because it was missing the thirty tokens that actually mattered.

This is an inter-process communication problem. Operating systems solved it decades ago: pipes, message queues, shared memory, sockets. POSIX standardized it.

The agent ecosystem is still passing JSON blobs and hoping for the best.

Agent OS has a real IPC layer. Agents communicate through typed channels. Context is tracked, logged, and validated at every handoff. When something breaks, you trace the failure to the exact point where information was lost — like strace for agents.

→ What would your AI system look like if the plumbing actually worked?

Post 06

The Job Search Agent That Applied While I Slept

Imagine waking up to:

[Agent OS — Morning Report]

━━━━━━━━━━━━━━━━━━━━━━━━━━━━

Jobs found: 34 new matches

Applied: 12 (auto-qualified)

Pending review: 8 (need your input)

Interviews: 2 scheduled

━━━━━━━━━━━━━━━━━━━━━━━━━━━━

This is already real. A dev built this with Claude Code — evaluated 740+ roles, landed Head of Applied AI. Open-source, 26k+ likes on X.

But it's fragile. One bad API call and the whole thing breaks with no recovery.

On Agent OS, each agent runs in its own isolate. If the job finder crashes, the applier keeps running on previously found jobs. Failures are logged, retried, reported. The system degrades gracefully instead of exploding.

That's the difference between a script and a system.

→ What would your job search look like as an operating system?

Post 07

User Mode & Kernel Mode

Every "AI tool" forces you to choose: simple enough for non-devs (and limited), or powerful enough for devs (and unusable by everyone else).

Agent OS doesn't choose. It layers.

User mode is a clean, ChatGPT-like interface. Widgets, dashboards, buttons. Your marketing manager runs campaigns. Your sales rep manages leads. They never see a terminal.

Kernel mode is full OS-level access. Your CTO builds agents, modifies the filesystem, installs extensions. It's a terminal. It's powerful. It requires elevated permissions.

The marketing manager and the CTO use the same system. They just see different views.

→ What would your team build on a shared AI operating system?

Post 08

The Knowledge Base That Thinks

Your company wiki is a graveyard. Thousands of pages, half outdated, none connected.

Now imagine a knowledge base that's an active agent system:

/agents/wiki-gardener/ → daily: find stale docs, flag contradictions

/agents/onboarding-guide/ → personalized reading paths for new hires

/integrations/slack/channels/ → watches conversations, extracts decisions

The wiki-gardener runs every night. Reads every doc, checks for stale information, cross-references with Slack conversations. Creates a report: "These 4 docs contradict each other. This process doc references a tool we stopped using."

Karpathy is already doing this — building personal knowledge bases where the LLM maintains the entire wiki, runs health checks, fills gaps. ~100 articles, ~400K words, queryable like a research engine.

That's not a better search bar. It's a living system.

→ What would your company know if its knowledge base could think?

Post 09

Why Every Solo Founder Needs an OS

Solo founders don't fail because they're not smart. They fail because they're doing 11 jobs and none of them well.

One builder on X runs 4 businesses with 23 agents across 5 departments. $0 payroll vs $500K+ equivalent. 1,847 hours reclaimed in one quarter.

On Agent OS:

‣ /agents/content-writer/ — 3 posts/week, in your voice

‣ /agents/lead-qualifier/ — watches inbound, scores leads, drafts responses

‣ /agents/invoice-tracker/ — monitors Stripe, chases late payments

‣ /agents/customer-support/ — tier-1 from your knowledge base

Each agent runs autonomously with clear boundaries. You review in User Mode — approve, reject, redirect. Never touch the filesystem unless you want to.

This isn't "AI tools for founders." It's an operating system for a one-person company.

→ What would you build if your business had its own OS?

Post 10

The Thesis

The gap between what's possible with AI and what most people actually do with it is enormous.

Developers build full agentic systems — multi-step pipelines that research, write, apply, follow up, learn, iterate. Meanwhile, 90% of AI users open ChatGPT, type a question, copy the answer, close the tab.

That's not using AI. That's using a search engine with better grammar.

The problem isn't access to models. Everyone has access. The problem is that building a system requires you to be a developer.

Agent OS closes this gap:

‣ GenUI — interfaces generated for your specific workflows

‣ Permissions — your intern can't delete production data

‣ Isolation — one bad agent can't corrupt your workspace

‣ Persistence — state across sessions, days, months

‣ Composability — agents combine like Unix pipes

→ Using AI is not enough. You need AI systems. This is what Agent OS is for.

Using AI is not enough.
You need AI systems.

You Don't Need a Better AI. You Need a System.

Why Files and Folders Are the Future of AI

Your Company Already Has an OS. It's Made of Duct Tape.

The $400 Mistake That Proves You Need an OS

Agents Don't Need More Intelligence. They Need Better Handoffs.

The Job Search Agent That Applied While I Slept

User Mode & Kernel Mode

The Knowledge Base That Thinks

Why Every Solo Founder Needs an OS

The Thesis

1. Claude Code Job Hunter

2. Karpathy Knowledge OS

3. Pal — Personal Agent that Learns

4. AI Marketing Skills

5. Continuous-Claude

6. 23-Agent Solo Operator

7. Leadbird Outbound System

8. ShellBot

Marcus Chen

Concerns

Sharp Questions

Diana Okafor

Concerns

Sharp Questions

Raj Patel

Concerns

Sharp Questions

Sarah Kim

Concerns

Sharp Questions

James Morrison

Concerns

Sharp Questions

Elena Vasquez

Concerns

Sharp Questions

David Okonkwo

Concerns

Sharp Questions