The memory base
The memory base is where Agivar keeps, long-term, "the things you taught it". Teach Mode writes to it; Task Mode reads from it.
Structure: platform / topic
Every memory lives under a platform/topic path:
- platform — the name of an app, website, or system, e.g.
bilibili,vscode,company-expense-system. Cross-platform know-how, or anything with no clear owner, goes undergeneral. - topic — a specific operation, workflow, rule, or piece of know-how, e.g.
coins,submit-invoice,tool-call-preference.
Examples:
bilibili/coins
bilibili/triple-action
company-expense-system/submit-invoice
general/expense-rules
general/tool-call-preference
What a memory entry looks like
Two kinds, by content:
Operating workflows (the most common)
When you teach the workflow of an app/website/system, the entry contains:
- Overview — the goal, applicable scenario, and end result of the operation.
- Initial state — what state things should be in before executing (browser open? logged in? which page? what key elements should be visible?).
- Steps — split into several abstract steps; each has its concrete operations underneath, ending with a "stage result". On-screen targets are written unambiguously.
- Related demo recording (if there is one) — recording ID, summary, and key clips relevant to this entry.
(See the full example at Teach by recording → Step 6.)
Rules / preferences / concepts
E.g. "expense invoices no more than ¥500 don't need an attached note", "I like tool calls merged into one actions array", "what we call a 'frozen order' means…" — organized freely by content, but kept clearly structured and reusable.
How the AI maintains it
- Look at the index before deciding new vs. update — the Teach/Task agent has the current memory-base index on hand from the start (one line per entry:
platform/topic + a one-line summary), so it knows what entries exist. To change one, it reads the original text first, then makes a precise local edit — rather than creating a duplicate. - Organize, don't copy verbatim — it turns what you said into something clear, executable, and reusable.
- Latest wins on conflict — when old and new information conflict, your latest explicit statement wins.
- When unsure, mark it or ask — it won't invent steps you didn't teach.
- Not everything gets stored — casual chat, one-off instructions, and information with no reuse value are not filed.
Viewing / editing
- View: in a Teach-mode (or Task-mode) conversation, just ask it to "list the memory entries related to X", or "read the
bilibili/coinsentry to me". - Edit / correct: in a Teach-mode conversation, point out what's wrong or what to add, and it updates that entry — see Teach by chat → Correcting or supplementing an existing memory.
- Delete: ask it to delete a memory you no longer need.
How Task Mode uses it
Before running a GUI task, Task Mode consults the memory base: as long as an entry might have something useful for the current task, it reads it; entirely unrelated ones are skipped. If an entry it reads references a demo recording (like Recording ID: 00046), and that recording is useful for the concrete steps, it asks targeted questions about details in it.
So — the more you teach in Teach Mode, and the more accurately, the easier and more reliable Task Mode becomes.
Multiple devices / accounts
The memory base lives in the cloud, tied to your account. Sign in with the same account on another computer and it's still there; different accounts have independent memory bases. (A recording's raw frames stay local on the computer that recorded it; the cloud keeps the processed keyframes and text results.)