MemoryAgent

What is MemoryAgent?

A verifiable execution system, not a chatbot It actually does things. And proves it

Three pillars that make MemoryAgent fundamentally different from AI assistants that just say "done." Most AI tools just talk. MemoryAgent acts on your behalf and keeps a receipt for everything it does.

GoalGraph Execution Engine Smart Step-by-Step Plans

Your intent is compiled into a GoalGraph: a directed acyclic graph that enables parallel operations, explicit dependencies, and partial failure recovery. Not a chain of prompts. A structured execution plan the system can validate before running. When you ask MemoryAgent to do something, it builds a step-by-step plan, checks that it's safe, then runs it. If something goes wrong partway through, it knows exactly where and can pick up from there.

Verification-First Trust Proof, Not Promises

Every action produces machine-checkable evidence. File operations are verified by hash. UI actions are verified by accessibility snapshots. Browser actions are verified by DOM state. When MemoryAgent says "done," you can prove it. Every time MemoryAgent does something, it takes a receipt: a screenshot, a file fingerprint, a before-and-after. So when it says "done," you can actually check.

On-Device Foundation Models Runs Entirely on Your Mac

Built on Apple's SystemLanguageModel with guided generation for structured, reliable outputs. All processing happens locally on your Mac. No cloud, no uploads, no telemetry. Available only on devices with Apple Intelligence enabled. MemoryAgent uses the AI built right into your Mac. Your data never leaves your computer. No accounts, no cloud services, no sign-ups. Just your Mac, working smarter.

See It in Action

How MemoryAgent works

From observation to verified execution. A pipeline built for trust.

1

It observes your screen

MemoryAgent privately captures your active windows, apps, and content using macOS accessibility and vision APIs. Locally, encrypted, and governed by your privacy mode. MemoryAgent quietly watches what's on your screen: which apps are open, what files you're working on, what pages you're reading. Everything stays private on your Mac.

2

struct Context {
  app: "Xcode",
  file: "Model.swift"
}

It understands your context

Foundation Models extract structured signals: which app, which file, what action, building semantic understanding of your work. It figures out what you're actually doing: which app, which document, what kind of work, and remembers it all.

3

It compiles your intent It makes a plan

When you make a request, MemoryAgent builds a GoalGraph: a validated execution plan with explicit steps, dependencies, and verification specs. When you ask it to do something, it breaks the job into steps, checks that each one is safe, and maps out the order.

4

verify {
  fileExists: true,
  hash: "a3f2..."
}

It executes with evidence It does the work and saves proof

Each step runs with pre/post state capture. Every action produces artifacts: screenshots, file hashes, command outputs, UI snapshots. It carries out each step and takes a snapshot before and after, so you always have proof of what changed.

5

Structured responses you can trust You ask, it answers with receipts

MemoryAgent responds with typed conversation cards: approvals, drafts, status updates, resume prompts, and routed follow-ups. Review, edit, or approve before anything ships. Interrupt or resume at any point. When you need proof, it's there. Ask what you worked on, tell it to do something, or both at once. It sorts the request, shows the right kind of response, and keeps the proof attached so you can review before anything important goes out.

Execution Modes

Natural language first. Structured modes when you need them.

MemoryAgent understands intent from plain English. You don't type commands. You describe what you want, and the system infers whether to query, execute, explain, or combine multiple operations. A single request like "find my notes from yesterday's meeting and move them to the Project folder" decomposes into search, then action, handled safely. Just talk to it like a person. Say "find my notes from yesterday's meeting and move them to the Project folder" and it figures out the rest: search first, then move, done safely.

Execution modes like /plan, /query, and /learn are optional controls: explicit constraints for when you want to limit behavior, inspect the system, or teach it new skills. If you want more control, there are special commands you can use. But most of the time, you just talk normally.

Core Operations

/learn

Teach MemoryAgent a workflow or skill. Describe what you want it to learn and it acquires it for future use. Teach it something new. Show it how you do a task and it remembers for next time.

/plan

Preview the GoalGraph without running it. See exactly what would happen before committing. See what it would do before it does it. A dry run you can review.

/query

Force read-only mode. Side effects rejected at the gate, guaranteed safe. Just look, don't touch. It answers your question without changing anything.

Introspection

/explain

Get evidence-anchored explanation of any past run. See exactly what happened and why. Ask it to explain what it did and why. It shows you the receipts.

/debug

AI-powered failure analysis with ranked hypotheses and evidence citations. Something went wrong? It figures out what happened and gives you its best guesses why.

/recollect

Search your work history explicitly. Find that file, that page, that conversation. Search your memory. Find that file, that page, that conversation from last week.

Automation

/monitor

Set condition-triggered automations. "When X happens, do Y." Set up automatic reactions. "When a new file appears in Downloads, move it to Documents."

/schedule

Time-triggered execution. "Tomorrow at 9am, summarize my calendar." Schedule tasks for later. "Tomorrow morning, show me what I was working on."

Technical Architecture

Built for verification, not just generation How it works under the hood

A pipeline designed for trust, transparency, and reproducibility. The pieces that make MemoryAgent reliable, transparent, and trustworthy.

GoalGraph Compiler Plan Builder

Requests are compiled into a directed acyclic graph with typed nodes, explicit edges, and verification specs. Validated before execution. Cycles rejected, dependencies checked, safety gates enforced. Your request gets broken into a step-by-step plan. Each step is checked for safety before anything runs.

Verification Engine Receipt Checker

Comprehensive verification: file existence, content hashes, UI element presence, DOM selectors, clipboard state, app running status. Evidence captured, not assertions accepted. It checks everything: did the file actually get created? Is it the right one? Is the app running? Every action gets a receipt so nothing slips through.

RunArtifactBundle Your Receipt Folder

Every execution produces a complete bundle: the original GoalGraph, execution logs, verification reports, and captured artifacts. Inspect, replay, or debug any run. Everything MemoryAgent does gets saved in a neat folder: the plan, what happened, the proof, and any files it touched. You can look back anytime.

Resume & Replay Pick Up Where You Left Off

Execution can be interrupted and resumed from any checkpoint. Runs can be replayed deterministically from persisted bundles. Branch decisions and approval gates are honored. Interrupt a task and come back later. It picks up right where it stopped. You can also replay past tasks to see exactly what happened.

Use Cases

Built for how you actually work

Whether you're coding, leading, or creating, MemoryAgent adapts to your workflow.

For Developers For Your Projects

Never lose the why behind your code. Never lose track of what you were doing.

Never lose context when switching between files, debugging sessions, or documentation. MemoryAgent records what happened, why it happened, and what evidence proves it. Switching between tasks all day? MemoryAgent remembers what you were working on, what changed, and why, so you never have to retrace your steps.

"What files did I edit for the auth feature?"
→ Verified list with timestamps and hashes
"Show me what that build failure was about"
→ Logs and UI state captured at failure time
"Monitor for new crash logs and summarize them"
→ Automatic detection with evidence-backed summaries
"Find that Stack Overflow page I saw last week"
→ Full history with DOM snapshots, not guesses

For Teams & Founders

Operating context without the recap tax.

Keep track of decisions, conversations, follow-ups, and the evolution of your company's thinking, with sources and open loops attached.

"What did I decide in that meeting last Tuesday?"
→ Notes, attendees, and supporting documents linked
"Explain how that proposal evolved"
→ Source drafts, feedback, and decision checkpoints
"When the board deck lands in Downloads, move it to Presentations"
→ Verified file action with audit trail
"What is still waiting on me?"
→ Pending replies, deferred tasks, and unresolved threads surfaced in one place

For Deep Work & Daily Continuity For Staying in the Zone

Your ambient companion that remembers across sessions. It remembers where you left off, even days later.

MemoryAgent runs as a bounded daily-use companion. The Companion Inbox surfaces active context, resumable work, project memory, and open loops. Interactive controls let you quiet, defer, accept, dismiss, or tune initiative on your terms. Come back to your Mac after a weekend and pick up exactly where you left off. MemoryAgent keeps your current context, loose ends, and next steps in one place, then stays quiet when you need to focus.

"Where was I on the research paper?"
→ Files, sources, notes, and reasoning context restored across days
"What's still waiting on me?"
→ Open-loop tracking: pending replies, deferred tasks, unresolved threads
"Tomorrow at 9am, show me my context for Project X"
→ Scheduled briefing from real past activity with project handoff
"Quiet down until I finish this"
→ Initiative tuning: suppress, defer, or adjust companion suggestions

Developer Integration

First-class MCP server. Your tools, connected.

MemoryAgent exposes a local Model Context Protocol server so tools like Codex, Claude Code, and other MCP clients can inspect context, query operating state, and delegate bounded execution without taking ownership away from the runtime. Instead of rebuilding your desktop state from chat history, they can ask MemoryAgent for the local truth directly.

Context Plane

Read-only access to capabilities, live context, and recent perception via memoryagent:// resources. Your agent can gather the local truth without side effects, instead of spending tokens reconstructing state from pasted context.

Operating-State Plane

Query active runs, resumable checkpoints, project memory, schedules, monitors, and communication queues in real time. That means external agents can stay efficient and state-aware without polling blindly or asking you to restate what is already known locally.

Delegated Execution

Trigger tasks, resume runs, and receive structured results through MCP tools. Clarification receipts, approval boundaries, verification artifacts, and resume flows keep the caller in control while MemoryAgent handles the local execution surface.

Shared Local Backend

Unix-socket broker with client leases. Staged escalation from read-only context to full-parity runtime lets familiar agentic tools stay lightweight while MemoryAgent provides the stable local backend and verification layer.

Privacy & Security

Engineered for privacy.
Not just promised.

Your data stays on your Mac

MemoryAgent is built from the ground up with privacy as a core principle. All screen analysis, memory storage, and AI processing happen locally on your device. We never see your data, and neither does anyone else.

100% on-device processing

No cloud uploads, your data stays local and external actions are privacy gated

On-device models by default, no cloud models

Encrypted local storage

No telemetry or analytics

You own your memory

Privacy Modes

Choose your privacy level based on your needs:

Strict — Maximum privacy. No trace persistence, no external research, no passive learning.

Balanced — Full functionality with automatic redaction of URLs, emails, and paths in logs.

Permissive — Complete evidence retention for maximum introspection and debugging.

Safety Gates

Defense in depth: risky operations are blocked before they run, not after.

ReadOnlyPlanGate — In /query mode, side-effecting operations are rejected before execution.

ApprovalGate — High-risk operations halt for user approval. If the plan changes, re-approval required.

Roadmap

What's shipping and what's next

Proof-backed today, with active work focused on unfamiliar-task understanding, bounded research, and safer action.

Shipping Today

Core product surfaces that are already real in the current repo.

Shipping

Verifiable Execution

GoalGraph planning, approval gates, machine-checkable verification, and bounded multi-step execution are live. It can plan work, carry it out carefully, and prove what it did.

Replayable Evidence

Run bundles, MCP run resources, and replayable artifact receipts make outcomes inspectable after the fact. Every run keeps a receipt you can inspect later instead of asking you to trust a black box.

Companion Memory

The companion inbox, project memory, resumable work, initiative controls, and cross-day continuity are in daily-use shape. It can keep track of active work, remember what matters, and help you pick things back up later.

Browser Actions

A bounded transactional browser lane now handles form fill, validation, table flows, upload/download flows, and async SPA navigation. It can handle a focused set of browser tasks safely instead of just talking about them.

Task Understanding Task Understanding

A first-class semantic task model, stronger clarification, route-fit reasoning, and bounded task-success modeling are now wired through planning and persistence. It is getting better at understanding what kind of task you mean, what is missing, and what it can honestly do next.

Evidence-Grounded Research Research With Receipts

Bounded research now preserves source roles, evidence hierarchy, freshness and conflict posture, and reusable synthesis artifacts. It can show which sources mattered and how strong a conclusion really is.

Safe Action Controls Careful Action Controls

Explicit action eligibility, fail-closed missing-signal handling, checkpoint-aware pause/resume, and operational lineage are live. Before it acts, it is stricter about when it should stop, ask, or wait.

Developer Integration

Local MCP backend with read-only context, operating-state surfaces, delegated execution, and shared runtime ownership.

In Progress

The current frontier is less about adding features and more about widening understanding safely.

In Progress

Stronger Novel Task Understanding Better At Unfamiliar Tasks

Improving semantic task understanding across unfamiliar requests, surfaces, and mixed-context workflows. Getting better at understanding unfamiliar tasks without pretending it understands everything.

Broader Bounded Research Stronger Research Reasoning

Expanding bounded research from single-lane retrieval into stronger corroboration, conflict handling, and reusable evidence chains. Improving how it compares sources, handles uncertainty, and explains why an answer is trustworthy.

Safer Action On Unfamiliar Surfaces Safer On New Screens

Extending fail-closed action policy, observation requirements, and checkpoint behavior across more novel surfaces. Making it safer and more reliable when work lands on screens or states it has seen less often.

Planned

Product expansion that still matters, but is not the main current bottleneck.

Planned

Cross-Device Encrypted Sync

Securely carry MemoryAgent context across multiple Macs without weakening privacy or operator control.

Our Vision

"MemoryAgent exists so your Mac can remember what happened, help with what is next, and show its work."

We believe the next useful computer does more than answer questions. It should keep context, carry out real work carefully, and make its actions inspectable.

MemoryAgent is our version of that: a macOS-native execution layer built around operator control, replayable evidence, and trust.

Built in Massachusetts

Get Started

Request a private demo

See MemoryAgent in action. We'll show you how it can transform your workflow.

Name

Email

Role

Message (optional)

Or email us directly at hello@memoryagent.dev

Edited SwiftUI ViewEdited budget spreadsheet

Researched API docsLooked up flight options

Slack conversationMessages conversation

A verifiable execution system, not a chatbot It actually does things. And proves it

GoalGraph Execution Engine Smart Step-by-Step Plans

Verification-First Trust Proof, Not Promises

On-Device Foundation Models Runs Entirely on Your Mac

How MemoryAgent works

It observes your screen

It understands your context

It compiles your intent It makes a plan

It executes with evidence It does the work and saves proof

Structured responses you can trust You ask, it answers with receipts

Natural language first. Structured modes when you need them.

Core Operations

/learn

/plan

/query

Introspection

/explain

/debug

/recollect

Automation

/monitor

/schedule

Built for verification, not just generation How it works under the hood

GoalGraph Compiler Plan Builder

Verification Engine Receipt Checker

RunArtifactBundle Your Receipt Folder

Resume & Replay Pick Up Where You Left Off

Built for how you actually work

For Developers For Your Projects

For Teams & Founders

For Deep Work & Daily Continuity For Staying in the Zone

First-class MCP server. Your tools, connected.

Context Plane

Operating-State Plane

Delegated Execution

Shared Local Backend

Engineered for privacy.Not just promised.

Your data stays on your Mac

Privacy Modes

Safety Gates

What's shipping and what's next

Shipping Today

Verifiable Execution

Replayable Evidence

Companion Memory

Browser Actions

Task Understanding Task Understanding

Evidence-Grounded Research Research With Receipts

Safe Action Controls Careful Action Controls

Developer Integration

In Progress

Stronger Novel Task Understanding Better At Unfamiliar Tasks

Broader Bounded Research Stronger Research Reasoning

Safer Action On Unfamiliar Surfaces Safer On New Screens

Planned

Cross-Device Encrypted Sync

Request a private demo

Engineered for privacy.
Not just promised.