System Architecture
Deprex operates on a multi-layered architecture modeled on human cognitive processes. It is designed not as an “AI tool that responds to instructions” but as a cognitive ecosystem that thinks, plans, executes, and learns autonomously, working backward from goals, memory, and observations.
Design Philosophy — AI Reimplementation of Palantir FDE
Section titled “Design Philosophy — AI Reimplementation of Palantir FDE”Deprex’s architecture starts from the question of how to reimplement Palantir’s Forward Deployed Engineer model with AI. Human FDEs deployed on-site at customers’ offices, absorbing context directly and reflecting it into software, achieve a quality of implementation that external proposals cannot reach.
Deprex recreates this “deployment → observation → internal problem framing” model with AI agents. The AI FDE is embedded in the customer’s chat, email, and document management systems, learning the context of operations while autonomously executing routine work.
This vision is the starting point for every layered design decision in Deprex.
5-Layer Architecture
Section titled “5-Layer Architecture”Deprex separates responsibilities into 5 layers. Each layer is loosely coupled, so changes in upper layers do not affect lower layers.
| Layer | Responsibility | PM Engagement |
|---|---|---|
| L5 Customer Adaptation Engine | Tenant-specific business rules, onboarding, pattern library management | High (provides domain knowledge) |
| L4 Autonomous Execution Engine | Cognitive loop control, resident scheduler, mid/long-term strategy | Medium (task approval & review) |
| L3 Knowledge Accumulation Engine | Wiki management, knowledge query and curation | Low (FDE manages autonomously) |
| L2 Data Collection Layer | Differential collection from chat, email, documents, etc. | Low (initial connection only) |
| L1 Foundation Infrastructure | Agent runtime, authentication, daemon residency | None (operated by 0ai) |
PMs interact with the FDE primarily through L5. L4 and below — cognition, execution, data collection — are driven autonomously by the FDE.
Adding a new data source affects only the L2 connector; changing tenant-specific rules affects only L5. The blast radius of any change is structurally localized.
Reactive Mind and Proactive Mind
Section titled “Reactive Mind and Proactive Mind”Following the brain’s processing hierarchy, Deprex has both a reactive mind triggered by inputs and a proactive mind driven internally.
+--------------------------------------------------+| Sensory || - Receive, resolve metadata, fetch files || - No reasoning |+--------------------------+-----------------------+ |+--------------------------v-----------------------+| Reflexive || - Triage input (instant / light / complex) || - High-confidence patterns get instant replies || - Complex items escalate to the cognitive layer |+--------------------------+-----------------------+ |+--------------------------v-----------------------+| Cognitive || - Run the full cognitive cycle || - Also self-activates on internal timers |+--------------------------------------------------+This unifies in a single system both “a reactive bot that responds instantly to @mentions” and “a proactive agent that continues to observe and learn on its own.”
Reflexive responses return at sub-second latency without harming the PM’s perceived speed. Tasks requiring complex judgment or external side effects are handed to the cognitive layer and executed only after a full cognitive cycle.
Cognitive Cycle — Bidirectional Network
Section titled “Cognitive Cycle — Bidirectional Network”In the cognitive layer, every complex task is processed through a cognitive cycle. This is not a linear pipeline but a network where each phase can feed back bidirectionally to others.
| Phase | Role |
|---|---|
| PERCEIVE | Observe the environment |
| REMEMBER | Reference relevant memory |
| REASON | Inference and judgment |
| PLAN | Plan formulation |
| ACT | Execution and external effects |
| EVALUATE | Result evaluation |
| LEARN | Knowledge formation and memory update |
Phases are dynamically reactivated based on the situation, not controlled in fixed order. This mirrors how humans naturally “observe again” or “revise the plan” while solving problems.
Cycles iterate recursively until acceptance criteria are met, completing only when quality has converged.
Structural Bias Removal
Section titled “Structural Bias Removal”When LLMs make judgments alone, confirmation bias takes hold and reinforces wrong directions. Errors in the reasoning phase are especially damaging because they propagate through every downstream phase.
Deprex structurally embeds multi-layer guardrails in reasoning. Each conclusion is challenged by an independent context separate from the one that produced it. Higher-risk decisions are audited from more perspectives, and a negative judgment halts further progress at that point.
These guardrails increase reasoning cost, but the gain is substantial compared to the cost of a flawed decision contaminating everything downstream.
AC-Driven Autonomous Development
Section titled “AC-Driven Autonomous Development”The completion of a Deprex cognitive cycle is judged not by a vague sense of “done” but by whether predefined Acceptance Criteria have been mechanically satisfied.
Requirements Implementation Verification----------------------- ----------------- ----------------Objective + AC -> Spec (with -> Independent evidence type and verification verification steps) subagent | PASS / FAIL | FAIL -> fix -> reverify | Repeated failure -> escalate to humanEach AC declares an evidence type that ties it to a verification procedure suited to its nature: test execution, semantic LLM judgment, state inspection, command output review, or human review.
Verification is delegated to a verification subagent running in an independent context, not to the agent that did the implementation. The subagent receives only the target and the criteria — implementation process and prior attempts are deliberately hidden. This structurally removes the confirmation bias of “evaluating one’s own output.”
Autonomous Execution Without Approval Gates
Section titled “Autonomous Execution Without Approval Gates”Deprex executes every phase of the cognitive cycle end-to-end autonomously. There is no human approval gate between phases.
This is not about overlooking dangerous operations. It is the explicit design judgment that human approval speed is thousands of times slower than AI processing speed, and gates trap AI in the role of an “expensive assistant.”
In place of gates, the following structural safeguards apply:
- Predefined escalation conditions: Irreversible operations, externally impactful actions, and threshold breaches are handed to humans before execution
- Quantitative limits: Batch caps, parallel-work caps, and cost budgets prevent runaways
- Append-only judgment ledger: Every cognitive module decision is recorded; quality is ensured by retrospective review
- AC verification loop: Code and artifact correctness is verified structurally
Designing “what humans must judge” leaves the rest to AI. This is Deprex’s autonomy philosophy.
3-Axis Self-Improvement Loop
Section titled “3-Axis Self-Improvement Loop”Deprex autonomously improves its own harness (skills, rules, cognitive modules) from three independent signal sources.
| Signal Source | Trigger | Behavior |
|---|---|---|
| Failure signal | Task failure, user correction, friction | Real-time causal analysis and fix |
| Ground-truth signal | Quality gap against human artifacts | Convergence loop optimizing the producer of the artifact |
| Experiment signal | Improvement room with only criteria defined | Exploration that monotonically increases quality with criteria fixed |
The three axes share the same optimization target; only the signal source differs. A continuously running daemon monitors each signal and synchronously drives improvement upon detection.
The experiment signal in particular adopts a pattern where, if you define only the criteria and walk away, many improvements are tried and verified in the background. Only confirmed improvements are kept; the rest are discarded — a behavior that guarantees monotonic quality increase.
Agent-First Execution — Concentrating Judgment in AI
Section titled “Agent-First Execution — Concentrating Judgment in AI”The responsibility boundary between scripts and AI in Deprex is clearly separated.
| Responsibility | Implementation |
|---|---|
| I/O pipelines (collection, storage, formatting) | Scripts (deterministic) |
| Database and file operations | Scripts |
| Relevance judgment, summarization, context selection, document composition | AI (requires natural language understanding and semantic judgment) |
Operations involving judgment, interpretation, or inference have a quality ceiling when implemented as heuristics. Delegating them to AI keeps the structure where AI model improvements directly translate into system-wide quality improvements.
Daemon-Driven — Two Always-On Processes
Section titled “Daemon-Driven — Two Always-On Processes”Deprex runs two daemon processes continuously.
Reactive Daemon
Section titled “Reactive Daemon”Receives @mentions in chat and routes them through three-stage triage.
- Reflexive response: Greetings, simple Q&A → instant reply
- Light tasks: Summarization, search → handled in place
- Complex tasks: Report generation, analysis → fed into the cognitive cycle
Proactive Daemon
Section titled “Proactive Daemon”Beyond cron schedules, performs internally driven activity.
- Memory curation: Detect staleness in accumulated memory, propose merges and refinements
- Harness self-improvement: Continuous execution of the 3-axis (failure / ground-truth / experiment) loop
- Cognitive scheduling: Dynamically optimize execution order using each task’s history (success rate, duration, dependencies)
- Operational observation: Detect new task patterns from chat and meeting transcripts and propose them as skill candidates
It is designed not as a command to be invoked but as a residency that keeps running.
Autonomous Task Displacement Pipeline
Section titled “Autonomous Task Displacement Pipeline”Deprex’s most important capability is autonomously displacing human tasks. This is realized through a 6-stage pipeline from observation to production cutover.
1. Operational flow observation FDE detects task patterns from chat and meeting transcripts
2. Task flow recording Detected patterns are recorded as structured I/O ground truth
3. Skill auto-generation Skills are auto-generated from task flows
4. Shadow comparison FDE output is run in parallel with human output and compared
5. Production displacement After PM approval, FDE switches to autonomous execution
6. Continuous improvement Skills self-improve via the 3-axis loop on execution feedbackThe PM receives notifications at each stage and approves them. The essence of this structure is self-propagating automation — FDE finds its own tasks, builds its own skills, and improves itself.
Multi-Tenant Isolation
Section titled “Multi-Tenant Isolation”To deploy FDEs simultaneously to multiple client organizations, tenants are completely isolated.
app/orgs/ acme/ # Acme's environment skills/ # Skills auto-generated for Acme memory/ # Memory learned from Acme's operations config/ # Acme-specific authentication and config beta-inc/ # Beta Inc's environment skills/ memory/ config/The directory structure itself is the tenant definition; no separate tenant declaration file is required. Adding a new tenant is just creating a directory.
Each tenant’s memory, skills, and configuration are not visible across tenants. Patterns observed in Acme’s chat will never contaminate Beta Inc’s skills.
Differences from Other AI Agent Systems
Section titled “Differences from Other AI Agent Systems”Multiple systems claim to “act autonomously.” To clarify Deprex’s positioning, here is a comparison against general-purpose OSS agent runtimes.
| Aspect | General OSS agent runtime | Deprex |
|---|---|---|
| Central metaphor | Personal learning agent | FDE deployed in the customer organization |
| Memory abstraction | Personal notebook | Organizational wiki (per-tenant knowledge graph) |
| Cognitive loop | Single agent loop | Bidirectional network + multi-layer guardrails |
| Bias removal | Mostly relies on user review | Structural context separation via independent subagents |
| Self-improvement | Per-PR review | Multi-axis daemons running continuously |
| Multi-tenant | Not central to the design | Central to the design (tenant isolation as a first-class concept) |
| Completion check | Trial-and-error test pass | Structural verification with AC × evidence type |
| Approval | Halts on dangerous ops | No approval gates + structural safeguards |
| Target user | Individual developers / SaaS automation | B2B customers requiring an embedded operator |
General OSS chooses “broad and shallow”; Deprex chooses “narrow and deep.” The two are not competitors — they solve different problems. Deprex commits deeply to challenges intrinsic to embedded operation (tenant isolation, long-term knowledge accumulation, judgment-quality assurance, autonomy throughput), achieving a quality that general-purpose OSS cannot reach.
What This Means for the PM
Section titled “What This Means for the PM”PMs do not need to understand the architectural details. The four points that matter:
- The FDE is always on — observation and learning continue even without requests
- Automation progresses in stages — observe → propose → approve → execute, with PM control at each step
- Quality improves autonomously — the FDE itself improves outputs through the 3-axis loop
- Judgment quality is structurally protected — multi-layer guardrails and AC verification break chains of bad decisions
Deprex is designed not as “an AI tool the PM operates” but as a resident agent that continuously surfaces decisions to the PM for approval against the organization.