step 4 complete: intelligent routing with single-agent and pipeline modes

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-03-02 13:03:54 +01:00
parent 864dfdc4e6
commit 68955d2fc2
3 changed files with 735 additions and 42 deletions
--- a/BACKEND_PLAN.md
+++ b/BACKEND_PLAN.md
@@ -2,8 +2,8 @@

 > **Separate repository.** This document defines the FastAPI backend that the Electron app communicates with.
 >
-> The backend owns: orchestration logic, chat agent intelligence, prompt IP, auth, billing, and backup blob storage.
-> The backend NEVER persists user data. It receives context in requests, uses it for orchestration, and discards it.
+> The backend owns: orchestration logic, chat agent intelligence, prompt IP, auth, billing, E2E backup blob storage, cloud storage (encrypted blobs), cloud vector store, and plugin marketplace.
+> The backend NEVER persists user data in plaintext. Cloud storage blobs are E2E encrypted before upload — the backend only verifies integrity, never decrypts.

 ---

@@ -20,7 +20,7 @@ adiuva-api/
 │   │   ├── orchestrator.py        # LLM-based intent router
 │   │   ├── execution_plan.py      # Plan builder + cache
 │   │   └── plugin_loader.py       # Dynamic agent loading
-│   ├── agents/
+│   ├── agents/                    # Chat agents (proprietary logic + prompts)
 │   │   ├── __init__.py            # Auto-registers all agents
 │   │   ├── task_agent.py
 │   │   ├── calendar_agent.py
@@ -32,7 +32,10 @@ adiuva-api/
 │   │   │   ├── __init__.py
 │   │   │   ├── chat.py            # POST /chat + WS /chat/stream
 │   │   │   ├── plans.py           # GET /plans/playbook
+│   │   │   ├── storage.py         # CRUD cloud storage (E2E encrypted blobs)
+│   │   │   ├── vectors.py         # Upsert/search cloud vector store
 │   │   │   ├── backup.py          # PUT/GET /backup
+│   │   │   ├── plugins.py         # Plugin marketplace
 │   │   │   ├── auth.py            # Register/login/refresh
 │   │   │   └── billing.py         # Checkout/webhook/subscription
 │   │   └── middleware/
@@ -40,6 +43,16 @@ adiuva-api/
 │   │       ├── auth.py            # JWT validation
 │   │       ├── rate_limit.py      # Tier-aware rate limiting
 │   │       └── sanitizer.py       # Strip prompt metadata from responses
+│   ├── storage/
+│   │   ├── __init__.py
+│   │   ├── blob_store.py          # S3 for E2E encrypted blobs
+│   │   ├── vector_store.py        # Cloud vector store (Pinecone/Qdrant)
+│   │   └── encryption.py          # Integrity verification only — NO decryption
+│   ├── marketplace/
+│   │   ├── __init__.py
+│   │   ├── plugin_registry.py     # Plugin catalog (metadata, versions, ratings)
+│   │   ├── plugin_review.py       # Review queue + approval workflow
+│   │   └── revenue_share.py       # 70/30 split tracking with Stripe Connect
 │   ├── billing/
 │   │   ├── __init__.py
 │   │   ├── stripe_service.py      # Stripe checkout + webhooks
@@ -53,8 +66,10 @@ adiuva-api/
 │   ├── test_orchestrator.py
 │   ├── test_agents.py
 │   ├── test_auth.py
-│   └── test_backup.py
-├── alembic/                       # DB migrations (auth/billing tables only)
+│   ├── test_backup.py
+│   ├── test_storage.py
+│   └── test_plugins.py
+├── alembic/                       # DB migrations (auth/billing/marketplace tables only)
 │   ├── alembic.ini
 │   └── versions/
 ├── requirements.txt
@@ -92,7 +107,7 @@ adiuva-api/
  pytest-asyncio>=0.24.0
  ```
 - [x] Write `app/main.py`: FastAPI app with CORS (allow `app://`, `http://localhost:*`), lifespan (init DB pool, init agent registry), include all routers under `/api/v1`
- [x] Write `app/config/settings.py`: `Settings(BaseSettings)` with fields: `DATABASE_URL`, `JWT_SECRET`, `JWT_ALGORITHM` (default HS256), `STRIPE_SECRET_KEY`, `STRIPE_WEBHOOK_SECRET`, `S3_BUCKET`, `S3_REGION`, `AWS_ACCESS_KEY_ID`, `AWS_SECRET_ACCESS_KEY`, `OPENAI_API_KEY`, `CORS_ORIGINS`, `ENV` (dev/prod)
+- [x] Write `app/config/settings.py`: `Settings(BaseSettings)` with fields: `DATABASE_URL`, `JWT_SECRET`, `JWT_ALGORITHM` (default HS256), `STRIPE_SECRET_KEY`, `STRIPE_WEBHOOK_SECRET`, `S3_BUCKET`, `S3_REGION`, `AWS_ACCESS_KEY_ID`, `AWS_SECRET_ACCESS_KEY`, `OPENAI_API_KEY`, `CORS_ORIGINS`, `ENV` (dev/prod), `PINECONE_API_KEY`, `PINECONE_INDEX`, `QDRANT_URL`, `QDRANT_API_KEY`
 - [x] Write `Dockerfile`: Python 3.12 slim, multi-stage (builder + runtime), non-root user
 - [x] Write `docker-compose.yml`: app, postgres:16, optional redis
 - [x] Write `.env.example`
@@ -103,13 +118,24 @@ adiuva-api/
  - `ChatRequest`: `message: str`, `context: ChatContext`, `execution_mode: Literal['direct', 'plan']`
  - `ChatContext`: `user_profile: dict`, `relevant_documents: list[str]`, `recent_tasks: list[dict]`, `conversation_history: list[dict]`
  - `ChatResponse`: `response: str`, `actions: list[PlanAction]`
-  - `PlanAction`: `type: Literal['create_record', 'update_record', 'delete_record', 'index_document', 'send_notification']`, `table: str | None`, `data: dict | None`
+  - `PlanAction`: `type: Literal['create_record', 'update_record', 'delete_record', 'index_document', 'send_notification', 'call_agent']`, `table: str | None`, `data: dict | None`, `agent: str | None`
  - `ExecutionPlan`: `agent: str`, `steps: list[PlanStep]`
  - `PlanStep`: `action: str`, `prompt_template: str | None`, `variables: dict | None`, `data_from_step: int | None`
  - `BackupMetadata`: `version: int`, `timestamp: int`, `checksum: str`, `chunk_count: int`
  - `BillingTier`: `Literal['free', 'pro', 'power', 'team']`
  - `AuthTokens`: `access_token: str`, `refresh_token: str`, `expires_at: int`
  - `UserProfile`: `id: str`, `email: str`, `tier: BillingTier`
+  - `StorageRecord`: `id: str`, `user_id: str`, `table: str`, `blob: bytes`, `checksum: str`, `created_at: int`, `updated_at: int` — blob is always E2E encrypted by client
+  - `StorageRecordCreate`: `table: str`, `blob: bytes`, `checksum: str`
+  - `StorageRecordUpdate`: `blob: bytes`, `checksum: str`
+  - `VectorUpsertRequest`: `vectors: list[VectorItem]`
+  - `VectorItem`: `id: str`, `blob: bytes`, `checksum: str` — vector + metadata encrypted by client
+  - `VectorSearchRequest`: `query_blob: bytes`, `top_k: int = 10`
+  - `VectorSearchResponse`: `results: list[VectorSearchResult]`
+  - `VectorSearchResult`: `id: str`, `score: float`, `blob: bytes`
+  - `PluginManifest`: `id: str`, `name: str`, `description: str`, `version: str`, `author: str`, `permissions: list[str]`, `category: str`, `price_cents: int = 0`
+  - `PluginListResponse`: `plugins: list[PluginManifest]`, `total: int`, `page: int`
+  - `PluginInstallRequest`: `plugin_id: str`
 - **Outcome:** All request/response models defined and validated.

 ### Step 3 — Agent Registry + base classes ✅
@@ -130,8 +156,8 @@ adiuva-api/
 - [x] Unit tests: register, get, list, call_agent with mock
 - **Outcome:** Pluggable agent framework.

-### Step 4 — Orchestrator
- [ ] `app/core/orchestrator.py`:
+### Step 4 — Orchestrator ✅
+- [x] `app/core/orchestrator.py`:
  - `async classify_intent(message, context, registry) -> str`:
    - System prompt: "You are an intent classifier. Given the user message and context, decide which agent to route to. Available agents: {registry.list_agents()}. Respond with just the agent name."
    - Uses gpt-4o-mini via LangChain for low latency
@@ -146,12 +172,13 @@ adiuva-api/
    - Final synthesis via LLM: "Summarize these agent results into a coherent response"
  - `async orchestrate(request: ChatRequest) -> ChatResponse | ExecutionPlan`:
    - Main entry point
+    - Context is transparent to orchestrator — data may originate from local or cloud storage on the client side
    - Classifies intent
    - If `execution_mode == 'direct'`: route + return response
    - If `execution_mode == 'plan'`: route + return execution plan with template IDs
  - `async orchestrate_stream(request: ChatRequest) -> AsyncGenerator[str, None]`:
    - Same as orchestrate but yields tokens for WebSocket streaming
- [ ] Integration tests with mocked LLM and mocked agents
+- [x] Integration tests with mocked LLM and mocked agents
 - **Outcome:** Intelligent routing with single-agent and pipeline modes.

 ### Step 5 — Execution Plan generator
@@ -174,6 +201,7 @@ adiuva-api/
  - Tools: `create_task(title, description, priority, due_date)`, `update_task(id, updates)`, `list_tasks(filters)`, `suggest_tasks(notes_context)`
  - System prompt: PM-oriented, validates task structure, infers priority from context
  - `handle()`: LLM + tool loop via `_tool_loop()`, returns response text + list of actions performed
+  - Accepts flexible context: mandatory fields `user_profile` + `message`, all other fields (from batch/plugin output) are optional
 - [ ] `app/agents/calendar_agent.py` — `@registry.register`:
  - Description: "Calendar management: events, conflicts, scheduling"
  - Tools: `list_events(date_range)`, `detect_conflicts(events)`, `suggest_reschedule(conflict)`
@@ -190,9 +218,32 @@ adiuva-api/
 - [ ] Unit tests per agent with mocked LLM
 - **Outcome:** Four specialized agents, all registered and tested.

-### Step 7 — API Routes
+### Step 7 — Storage Layer
+- [ ] `app/storage/blob_store.py`:
+  - `BlobStore`:
+    - `async upload(user_id, table, record_id, blob: bytes, checksum: str) -> str` — returns S3 key
+    - `async download(user_id, s3_key) -> bytes`
+    - `async delete(user_id, s3_key) -> None`
+    - `async list_keys(user_id, table) -> list[str]`
+  - Keys structured as `{user_id}/{table}/{record_id}` — backend never inspects blob content
+  - Uses boto3 S3 with server-side encryption at rest (SSE-S3) as extra layer
+- [ ] `app/storage/vector_store.py`:
+  - `VectorStore`:
+    - `async upsert(user_id, vectors: list[VectorItem]) -> None` — vectors are pre-encrypted blobs
+    - `async search(user_id, query_blob: bytes, top_k: int) -> list[VectorSearchResult]`
+    - `async delete(user_id, vector_ids: list[str]) -> None`
+  - Wraps Pinecone (default) or Qdrant — configurable via settings
+  - Namespace per `user_id` for isolation
+  - Note: because vectors are E2E encrypted by client, ANN search is on the encrypted representation — semantic search accuracy is a known trade-off when users choose cloud vectors
+- [ ] `app/storage/encryption.py`:
+  - `verify_checksum(blob: bytes, checksum: str) -> bool` — SHA-256 HMAC integrity check only
+  - `reject_if_tampered(blob, checksum)` — raises `400` if mismatch
+  - Backend NEVER holds decryption keys — all crypto is client-side
+- **Outcome:** Cloud storage layer that handles E2E encrypted blobs without ever accessing plaintext.

-#### 7a — Chat endpoint
+### Step 8 — API Routes
+
+#### 8a — Chat endpoint
 - [ ] `app/api/routes/chat.py`:
  - `POST /api/v1/chat`:
    - Request: `ChatRequest`
@@ -204,48 +255,93 @@ adiuva-api/
    - Final frame: JSON `ChatResponse` with `{"done": true, "response": "...", "actions": [...]}`
    - Heartbeat ping every 30s to keep connection alive

-#### 7b — Plans endpoint
+#### 8b — Plans endpoint
 - [ ] `app/api/routes/plans.py`:
  - `GET /api/v1/plans/playbook`: Returns all playbooks available for the user's tier
  - `GET /api/v1/plans/playbook/{plan_id}`: Returns a specific plan

-#### 7c — Backup endpoint
+#### 8c — Storage endpoint (cloud records)
+- [ ] `app/api/routes/storage.py`:
+  - `POST /api/v1/storage/records`: Create encrypted record
+    - Request: `StorageRecordCreate`
+    - Verifies checksum, stores blob in S3, inserts metadata row in PostgreSQL
+    - Response: `{id: str, created_at: int}`
+  - `GET /api/v1/storage/records`: List record metadata (no blobs)
+    - Query params: `table: str`, `page: int`, `limit: int`
+    - Response: `list[{id, table, checksum, created_at, updated_at}]`
+  - `GET /api/v1/storage/records/{id}`: Download encrypted blob
+    - Response: blob bytes + `X-Checksum` header
+  - `PUT /api/v1/storage/records/{id}`: Update encrypted blob
+    - Request: `StorageRecordUpdate`
+  - `DELETE /api/v1/storage/records/{id}`: Delete record + S3 blob
+  - All routes enforce tier cloud_storage_gb quota via `TierManager.check_quota(user_id)`
+
+#### 8d — Vectors endpoint (cloud vector store)
+- [ ] `app/api/routes/vectors.py`:
+  - `POST /api/v1/storage/vectors/upsert`:
+    - Request: `VectorUpsertRequest`
+    - Verifies checksums, delegates to `VectorStore.upsert()`
+    - Response: `{upserted: int}`
+  - `POST /api/v1/storage/vectors/search`:
+    - Request: `VectorSearchRequest`
+    - Delegates to `VectorStore.search()`
+    - Response: `VectorSearchResponse`
+  - `DELETE /api/v1/storage/vectors`:
+    - Request: `{ids: list[str]}`
+
+#### 8e — Backup endpoint
 - [ ] `app/api/routes/backup.py`:
  - `PUT /api/v1/backup`: Accepts binary blob + metadata headers (`X-Backup-Version`, `X-Backup-Timestamp`, `X-Backup-Checksum`). Stores in S3 keyed by `{user_id}/{timestamp}`. Enforces tier limits:
    - Free: 0 (no backup)
    - Pro: 5 GB
-    - Power: 50 GB
+    - Power: 25 GB
    - Team: unlimited
  - `GET /api/v1/backup`: Returns latest blob for authenticated user. Supports `If-Modified-Since`.
  - `GET /api/v1/backup/history`: Returns list of `BackupMetadata` (no blobs).
  - `DELETE /api/v1/backup/{backup_id}`: Delete specific backup.

-#### 7d — Auth endpoint
+#### 8f — Plugins endpoint
+- [ ] `app/api/routes/plugins.py`:
+  - `GET /api/v1/plugins`:
+    - Query params: `category: str | None`, `q: str | None`, `page: int`, `sort: Literal['rating', 'installs', 'newest']`
+    - Response: `PluginListResponse`
+    - Available from Power tier and above
+  - `GET /api/v1/plugins/{id}`:
+    - Response: `PluginManifest` + ratings + install count
+  - `POST /api/v1/plugins/{id}/install`:
+    - Request: `PluginInstallRequest`
+    - Records installation for the user (billing tracking, analytics)
+    - If plugin is paid: triggers Stripe Connect charge + revenue split (70% developer, 30% platform)
+    - Response: `{ok: true, download_url: str}` — signed S3 URL for plugin package
+  - `DELETE /api/v1/plugins/{id}/install`:
+    - Unregisters installation
+
+#### 8g — Auth endpoint
 - [ ] `app/api/routes/auth.py`:
  - `POST /api/v1/auth/register`: `{email, password}` → bcrypt hash → insert user → return `AuthTokens`
  - `POST /api/v1/auth/login`: Validate credentials → return `AuthTokens`
  - `POST /api/v1/auth/refresh`: Rotate refresh token → return new `AuthTokens`
  - `GET /api/v1/auth/me`: Return `UserProfile` for current JWT

-#### 7e — Billing endpoint
+#### 8h — Billing endpoint
 - [ ] `app/api/routes/billing.py`:
  - `POST /api/v1/billing/checkout`: Creates Stripe checkout session → returns URL
  - `POST /api/v1/billing/webhook`: Handles Stripe webhooks (subscription lifecycle)
  - `GET /api/v1/billing/subscription`: Returns current subscription info
  - `DELETE /api/v1/billing/subscription`: Cancels subscription

- **Outcome:** Complete REST + WebSocket API.
+- **Outcome:** Complete REST + WebSocket API covering orchestration, storage, vectors, backup, marketplace.

-### Step 8 — Middleware
+### Step 9 — Middleware

-#### 8a — Auth middleware
+#### 9a — Auth middleware
 - [ ] `app/api/middleware/auth.py`:
  - FastAPI dependency: `get_current_user(token: str = Depends(oauth2_scheme)) -> UserProfile`
  - Validates JWT signature, expiry, extracts `user_id` and `tier`
  - Raises `401` on invalid/expired token
  - Exempt routes: `/api/v1/auth/register`, `/api/v1/auth/login`, `/api/v1/billing/webhook`

-#### 8b — Rate limiter
+#### 9b — Rate limiter
 - [ ] `app/api/middleware/rate_limit.py`:
  - Uses `slowapi` with `Limiter(key_func=get_user_id_from_jwt)`
  - Tier-based limits:
@@ -255,7 +351,7 @@ adiuva-api/
    - Team: 200 req/seat/min
  - Custom 429 response with `Retry-After` header

-#### 8c — Sanitizer
+#### 9c — Sanitizer
 - [ ] `app/api/middleware/sanitizer.py`:
  - Response middleware that scans response bodies
  - Strips: system prompt fragments, agent internal reasoning, tool schemas, routing metadata
@@ -264,7 +360,27 @@ adiuva-api/

 - **Outcome:** Secure, rate-limited API with prompt IP protection.

-### Step 9 — Billing & Tier management
+### Step 10 — Plugin Marketplace
+- [ ] `app/marketplace/plugin_registry.py`:
+  - `PluginRegistry`:
+    - `async list_plugins(category, query, page, sort) -> PluginListResponse`
+    - `async get_plugin(plugin_id) -> PluginManifest | None`
+    - `async submit_plugin(manifest: PluginManifest, package_s3_key: str) -> str` — returns plugin_id, sets status = 'pending_review'
+    - `async approve_plugin(plugin_id) -> None` — admin only, sets status = 'approved'
+    - `async reject_plugin(plugin_id, reason: str) -> None`
+- [ ] `app/marketplace/plugin_review.py`:
+  - `ReviewQueue`:
+    - `async get_pending() -> list[dict]`
+    - `async submit_review(plugin_id, reviewer_id, decision, notes) -> None`
+  - Security checklist enforced before approval: manifest schema valid, permissions are from allowed set, no binary blobs in manifest
+- [ ] `app/marketplace/revenue_share.py`:
+  - `RevenueShare`:
+    - `async record_install(plugin_id, user_id, amount_cents) -> None`
+    - `async payout_developer(plugin_id, period) -> None` — Stripe Connect transfer: 70% to developer
+    - `async get_earnings(developer_id, period) -> dict`
+- **Outcome:** Plugin marketplace with catalog, review workflow, and revenue split.
+
+### Step 11 — Billing & Tier management
 - [ ] `app/billing/stripe_service.py`:
  - `create_checkout_session(user_id, tier) -> str`
  - `handle_webhook(payload, sig_header) -> None`: processes `checkout.session.completed`, `customer.subscription.updated`, `customer.subscription.deleted`, `invoice.payment_failed`
@@ -275,33 +391,77 @@ adiuva-api/
    - Feature matrix:
      ```python
      FEATURES = {
-          'free':  {'agents': 3, 'batch': False, 'providers': 1, 'backup_gb': 0},
-          'pro':   {'agents': -1, 'batch': True, 'providers': -1, 'backup_gb': 5},
-          'power': {'agents': -1, 'batch': True, 'providers': -1, 'backup_gb': 50, 'byok': True},
-          'team':  {'agents': -1, 'batch': True, 'providers': -1, 'backup_gb': -1, 'sso': True},
+          'free':  {
+              'agents': 3,
+              'batch_active': 2,
+              'cloud_storage_gb': 0,
+              'backup_gb': 0,
+              'providers': 1,
+              'batch_builder': False,
+              'plugin_marketplace': False,
+              'sso': False,
+          },
+          'pro':   {
+              'agents': -1,          # unlimited
+              'batch_active': 10,
+              'cloud_storage_gb': 5,
+              'backup_gb': 5,
+              'providers': -1,
+              'batch_builder': False,
+              'plugin_marketplace': False,
+              'sso': False,
+          },
+          'power': {
+              'agents': -1,
+              'batch_active': -1,    # unlimited
+              'cloud_storage_gb': 25,
+              'backup_gb': 25,
+              'providers': -1,
+              'batch_builder': True,
+              'plugin_marketplace': True,
+              'sso': False,
+          },
+          'team':  {
+              'agents': -1,
+              'batch_active': -1,
+              'cloud_storage_gb': -1,
+              'backup_gb': -1,
+              'providers': -1,
+              'batch_builder': True,
+              'plugin_marketplace': True,
+              'sso': True,
+          },
      }
      ```
    - `get_tier(user_id) -> BillingTier`
    - `check_feature(user_id, feature) -> bool`
    - `get_rate_limit(tier) -> int`
- **Outcome:** Stripe integration with tier-based feature gating.
+    - `check_quota(user_id) -> bool` — checks cloud_storage_gb current usage vs limit
+- **Outcome:** Stripe integration with tier-based feature gating matching Free/Pro(15€)/Power(29€)/Team(49€/seat).

-### Step 10 — Database (auth/billing only)
+### Step 12 — Database (auth/billing/marketplace only)
 - [ ] PostgreSQL schema via Alembic:
  - `users`: `id UUID PK`, `email UNIQUE`, `password_hash`, `tier` (default 'free'), `stripe_customer_id`, `created_at`, `updated_at`
  - `refresh_tokens`: `id UUID PK`, `user_id FK`, `token_hash`, `expires_at`, `created_at`
  - `subscriptions`: `id UUID PK`, `user_id FK`, `stripe_subscription_id`, `tier`, `status`, `current_period_end`, `created_at`
  - `backup_metadata`: `id UUID PK`, `user_id FK`, `s3_key`, `version`, `timestamp`, `checksum`, `size_bytes`, `created_at`
+  - `storage_records`: `id UUID PK`, `user_id FK`, `table_name VARCHAR`, `s3_key`, `checksum`, `size_bytes`, `created_at`, `updated_at` — metadata only, no plaintext
+  - `plugins`: `id UUID PK`, `name`, `description`, `version`, `author_id FK`, `category`, `status` (pending_review/approved/rejected), `price_cents`, `s3_package_key`, `install_count`, `avg_rating`, `created_at`
+  - `plugin_installations`: `id UUID PK`, `plugin_id FK`, `user_id FK`, `installed_at`
+  - `plugin_reviews`: `id UUID PK`, `plugin_id FK`, `reviewer_id FK`, `decision`, `notes`, `reviewed_at`
+  - `revenue_events`: `id UUID PK`, `plugin_id FK`, `user_id FK`, `amount_cents`, `developer_share_cents`, `stripe_transfer_id`, `created_at`
 - [ ] Initial Alembic migration
 - [ ] SQLAlchemy models in `app/models.py`
- **Outcome:** Auth and billing persistence. Zero user data stored.
+- **Outcome:** Auth, billing, storage metadata, and marketplace persistence. Zero user data in plaintext.

-### Step 11 — Testing & deployment
- [ ] `tests/conftest.py`: TestClient fixture, mock LLM fixture (`AsyncMock` returning canned responses), mock agent fixture, test DB (SQLite in-memory for speed)
+### Step 13 — Testing & deployment
+- [ ] `tests/conftest.py`: TestClient fixture, mock LLM fixture (`AsyncMock` returning canned responses), mock agent fixture, test DB (SQLite in-memory for speed), mock S3 (moto), mock Pinecone
 - [ ] `tests/test_orchestrator.py`: classify_intent routing, single agent, pipeline, plan mode
 - [ ] `tests/test_agents.py`: each agent with mocked tools
 - [ ] `tests/test_auth.py`: register → login → access protected → refresh → expired token
 - [ ] `tests/test_backup.py`: upload → download → history → delete, tier limit enforcement
+- [ ] `tests/test_storage.py`: create record → list → download → update → delete, checksum rejection, quota enforcement
+- [ ] `tests/test_plugins.py`: list plugins, install, uninstall, revenue event creation, tier gate (free user blocked)
 - [ ] `Dockerfile` optimized for production (gunicorn + uvicorn workers)
 - [ ] GitHub Actions CI: lint (ruff), test (pytest), build Docker image
 - **Outcome:** Fully tested, deployable backend.
@@ -320,10 +480,22 @@ adiuva-api/
 | WS | `/api/v1/chat/stream` | JWT | `ChatRequest` (first frame) | Token stream + final JSON |
 | GET | `/api/v1/plans/playbook` | JWT | — | `ExecutionPlan[]` |
 | GET | `/api/v1/plans/playbook/:id` | JWT | — | `ExecutionPlan` |
+| POST | `/api/v1/storage/records` | JWT | `StorageRecordCreate` | `{id, created_at}` |
+| GET | `/api/v1/storage/records` | JWT | `?table&page&limit` | `RecordMeta[]` |
+| GET | `/api/v1/storage/records/:id` | JWT | — | Binary blob |
+| PUT | `/api/v1/storage/records/:id` | JWT | `StorageRecordUpdate` | `{ok: true}` |
+| DELETE | `/api/v1/storage/records/:id` | JWT | — | `{ok: true}` |
+| POST | `/api/v1/storage/vectors/upsert` | JWT | `VectorUpsertRequest` | `{upserted: int}` |
+| POST | `/api/v1/storage/vectors/search` | JWT | `VectorSearchRequest` | `VectorSearchResponse` |
+| DELETE | `/api/v1/storage/vectors` | JWT | `{ids: list[str]}` | `{ok: true}` |
 | PUT | `/api/v1/backup` | JWT | Binary blob + headers | `{ok: true}` |
 | GET | `/api/v1/backup` | JWT | — | Binary blob |
 | GET | `/api/v1/backup/history` | JWT | — | `BackupMetadata[]` |
 | DELETE | `/api/v1/backup/:id` | JWT | — | `{ok: true}` |
+| GET | `/api/v1/plugins` | JWT | `?category&q&page&sort` | `PluginListResponse` |
+| GET | `/api/v1/plugins/:id` | JWT | — | `PluginManifest` + stats |
+| POST | `/api/v1/plugins/:id/install` | JWT | `PluginInstallRequest` | `{ok, download_url}` |
+| DELETE | `/api/v1/plugins/:id/install` | JWT | — | `{ok: true}` |
 | POST | `/api/v1/billing/checkout` | JWT | `{tier}` | `{checkout_url}` |
 | POST | `/api/v1/billing/webhook` | Stripe sig | Stripe event | `{ok: true}` |
 | GET | `/api/v1/billing/subscription` | JWT | — | Subscription info |
@@ -339,21 +511,24 @@ adiuva-api/
 | Framework | FastAPI + Uvicorn |
 | LLM | LangChain + langchain-openai |
 | Auth | PyJWT + bcrypt + OAuth2 |
-| Billing | stripe-python |
-| Storage | boto3 (S3) |
+| Billing | stripe-python + Stripe Connect |
+| Blob storage | boto3 (S3) |
+| Vector store | Pinecone or Qdrant (configurable) |
 | Database | PostgreSQL + SQLAlchemy + Alembic |
 | Rate limiting | slowapi |
-| Testing | pytest + pytest-asyncio + httpx |
+| Testing | pytest + pytest-asyncio + httpx + moto (S3 mock) |
 | Deployment | Docker → fly.io / Railway / AWS ECS |

 ---

 ## Development Rules

-1. **NEVER persist user data.** The DB stores only auth, billing, and backup metadata. User context arrives in requests and is discarded after processing.
-2. **NEVER expose prompts.** System prompts are composed server-side from fragments. Responses are sanitized before sending.
-3. **Stateless request handling.** No server-side session state. All context comes from the client + JWT.
-4. **Type hints everywhere.** All functions have full type annotations.
-5. **Test every agent.** Each chat agent has unit tests with mocked LLM responses.
-6. **Structured logging.** JSON logs with request ID correlation.
-7. **One step at a time.** Implement one numbered step per session. When the step is fully done, mark all its checkboxes as `[x]` in this file and commit with message `step N complete: <outcome line>`.
+1. **NEVER persist user data in plaintext.** The DB stores only auth, billing, storage metadata, and marketplace data. User context arrives in requests and is discarded. Cloud blobs are E2E encrypted client-side — backend only stores opaque bytes.
+2. **NEVER expose prompts.** System prompts are composed server-side from fragments. Responses are sanitized before sending. In plan mode, `prompt_template` fields are reference IDs only.
+3. **NEVER decrypt user blobs.** `app/storage/encryption.py` only verifies checksums. No decryption key ever reaches the backend.
+4. **Stateless request handling.** No server-side session state. All context comes from the client + JWT.
+5. **Type hints everywhere.** All functions have full type annotations.
+6. **Test every agent.** Each chat agent has unit tests with mocked LLM responses.
+7. **Structured logging.** JSON logs with request ID correlation.
+8. **Tier gates are enforced server-side.** Never trust client-reported tier. Always fetch from DB via `TierManager.get_tier(user_id)`.
+9. **One step at a time.** Implement one numbered step per session. When the step is fully done, mark all its checkboxes as `[x]` in this file and commit with message `step N complete: <outcome line>`.