Active, goal-oriented initiatives with deadlines and specific outcomes. The fire of the system.
Ongoing responsibilities without end dates. Long-term maintenance, recurring duties, domain ownership.
Reference materials and topical knowledge. Information retained but not actively being worked on.
Completed or inactive items. No longer relevant but preserved for historical context and audits.
| Entity | Key Attributes | Relationships | PARA Mapping |
|---|---|---|---|
| Project | name, status, dates, success_criteria | has_task, depends_on, owned_by | Projects |
| Decision | description, rationale, alternatives, outcome | affects, supersedes, contested_by | Projects |
| Task | description, status, priority, due_date | part_of, blocked_by, assigned_to | Projects |
| Person | name, role, contact_info, capabilities | works_on, manages, collaborates_with | Areas |
| Agent | name, role, capabilities, status, memory | specializes_in, reports_to, delegates_to | Areas |
| Lesson | context, what_happened, why, what_to_do | learned_from, applies_to, related_to | Resources |
Natural language queries answered via graph traversal:
2560-dimensional vectors from Qwen3-4B. Cosine similarity search finds conceptually related content even when keywords don't match.
Full-text index on content and metadata. Title matches boosted 3× over body. Exact phrase matching and boolean operators.
Entity relationships stored as typed edges. Supports BFS/DFS traversal. Answers "what depends on this" and "what does this affect" queries.
Modes:
hybrid semantic keyword graphFilters:
para_category entity_type date_range agent_idAny source — chat message, document, decision, URL. Captured by front-desk agent (GWEN).
System suggests PARA location based on content type, urgency, and time horizon. User confirms or redirects.
System proposes entity types and relationship candidates. Edges created: affects, part_of, blocked_by, etc.
Content written to file (authoritative source), then indexed in three layers: Qdrant vector, BM25 full-text, and ontology graph edges.
Vector index is derived data. Rebuild from authoritative PARA files on disk — no data is lost.
Qdrant and BM25 store the content payload. Restore files from backup, then re-hydrate metadata.
Files are primary. Indices are derived. Recovery always starts from disk.
No raw dumps. Everything categorized immediately into PARA. No orphaned, unorganized data ever enters the system.
Semantic (meaning), keyword (exact match), and graph (relationships). One query, three complementary retrieval paths.
Ask "what does this affect?" or "what depends on this?" The ontology graph answers dependency questions that vectors cannot.
PARA categories are first-class citizens. Filtering by active project vs archived reference is native, not bolted on.
Every index can be rebuilt from PARA files. Complete audit trail of decisions, their rationale, and their cascading impacts.
Every search refines embeddings. Every correction improves ontology. The system gets smarter with each interaction.
Implement PARA-structured ingestion workflow with automatic ontology typing and multi-modal indexing in the Nexus pipeline.