[P2][Phase 2][Platform] Multi-Tenant Architecture — phased SaaS isolation

## Overview

Enable multiple organizations to use a **single hosted EngineX deployment** with complete isolation of data, agents, credentials, and users.

Example: Acme Corp and Beta Inc both use `app.enginex.com`. Acme cannot access Beta's sessions, credentials, or reports.

**Assignee:** @vidorc
**Timeline:** Phase 2 — after [#2 Hourly Tracking](https://github.com/EngineXV/engineX/issues/2) v1 pilot (PR #9 shipped).
**Not required for:** early pilots with **dedicated EngineX install per client** in their VPC (recommended OSS model today).

---

## Ticket metadata

| Field | Value |
|-------|-------|
| **Phase** | Phase 2 — Platform scale (post-pilot) |
| **Priority** | **P2** — Medium |
| **Type** | Platform / infrastructure |
| **Depends on** | #2 pilot, #10 docs |
| **Related** | [#5 Desktop](https://github.com/EngineXV/engineX/issues/5) — local clients register to this backend |

---

## Why this matters

- Enables SaaS deployment (recurring revenue, faster onboarding)
- Single infrastructure stack serves multiple customers
- Required for enterprise self-service signup
- Tenant isolation failures are security incidents — must be tested rigorously

---

## Current state

EngineX is **single-tenant** today:

| Area | Today | Required change |
|------|-------|-----------------|
| HTTP API | `./engine serve` — no authentication | Auth + tenant context on every request |
| Sessions | In-memory `SessionManager` | Scoped by `tenant_id`, persisted |
| Storage | Files on disk (`~/.engine/` JSON — **no database**) | Tenant-prefixed paths or database |
| Credentials | Global `~/.engine/credentials/` | Per-tenant encrypted vault |
| Agents | Shared repo templates | Per-tenant agent catalog |
| Users | None | Organization, user, roles |
| Frontend | Single workspace | Login + tenant-scoped API |

**Key files:** `core/engine/server/`, `core/engine/storage/`, `core/engine/credentials/`, `core/frontend/`

---

## Personas

| Persona | Access |
|---------|--------|
| **Tenant admin** | Manage users, credentials, agents for their organization |
| **Operator / approver** | Run workflows, approve in browser |
| **Platform admin** | Create tenants, monitor health |
| **Builder** | Deploy agent packages into tenant workspace |

Primary user interface: **web dashboard** (not desktop — see #5).

---

## Phased delivery

### MVP — Tenant isolation core

Goal: Two test tenants; Tenant A cannot read Tenant B data.

- [ ] Data model: `Organization`, `User`, `Membership`, `Role`
- [ ] Login (email/password or OIDC for MVP)
- [ ] Tenant middleware on all API routes
- [ ] Scope sessions, credentials, storage by `tenant_id`
- [ ] Frontend login + tenant-scoped API calls
- [ ] Cross-tenant security tests (required)

### Phase 2 — Tenant workspaces

- [ ] Per-tenant agent catalog
- [ ] Per-tenant LLM/model settings
- [ ] Per-tenant MCP / skills config
- [ ] Audit log (runs, credential access, approvals)

### Phase 3 — Enterprise

- [ ] SSO/SAML, SCIM provisioning
- [ ] Billing / usage quotas
- [ ] GDPR export/delete, data residency

---

## Non-goals (MVP)

- Billing, multi-region, custom domains per tenant
- Replacing per-customer dedicated deployment (valid GTM before this ships)

---

## System architecture

### Request flow (auth + tenant context)

Every API call carries tenant identity from login through to storage:

```mermaid
sequenceDiagram
 participant U as User browser
 participant A as Auth
 participant M as Tenant middleware
 participant API as EngineX API
 participant S as Tenant-scoped storage

 U->>A: Login
 A->>U: Session / JWT with tenant_id
 U->>M: HTTP request + credentials
 M->>M: Resolve and validate tenant_id
 M->>API: Request with tenant context
 API->>S: Read / write scoped data only
 S-->>U: Response — never cross-tenant
```

### Data isolation (tenant boundaries)

```mermaid
flowchart TB
 subgraph deploy [Single EngineX deployment]
 API[API + SessionManager + CredentialStore]
 end

 subgraph acme [Tenant A — Acme Corp]
 direction TB
 UA[Users]
 SA[Sessions / checkpoints]
 CA[Encrypted credentials]
 AA[Agent catalog]
 end

 subgraph beta [Tenant B — Beta Inc]
 direction TB
 UB[Users]
 SB[Sessions / checkpoints]
 CB[Encrypted credentials]
 AB[Agent catalog]
 end

 API --> acme
 API --> beta
```

> **Rule:** `tenant_id` on every session, credential, and storage path. Cross-tenant access attempts must fail and be covered by automated tests.

### Reference diagram

<img width="941" height="866" alt="Multi-tenant architecture" src="https://github.com/user-attachments/assets/dcd649ea-3307-4402-9777-876554f75775" />

---

## Definition of done (MVP)

- [ ] Two or more tenants operate independently on one deployment
- [ ] Users access only their tenant's data
- [ ] All APIs enforce tenant isolation — verified by automated tests
- [ ] Frontend requires authentication
- [ ] `docs/MULTI_TENANT.md` — data model, auth flow, operations runbook
- [ ] Security review completed before merge

---

## Open questions

1. Postgres vs filesystem for tenant/user storage in MVP?
2. Auth provider: built-in vs Auth0 / Clerk / Keycloak?
3. Agent delivery: git import, zip upload, or registry?
4. One supervisor per tenant or shared platform supervisor?



Field	Value
Phase	Phase 2 — Platform scale (post-pilot)
Priority	P2 — Medium
Type	Platform / infrastructure
Depends on	#2 pilot, #10 docs
Related	#5 Desktop — local clients register to this backend

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[P2][Phase 2][Platform] Multi-Tenant Architecture — phased SaaS isolation #4

Overview

Ticket metadata

Why this matters

Current state

Personas

Phased delivery

MVP — Tenant isolation core

Phase 2 — Tenant workspaces

Phase 3 — Enterprise

Non-goals (MVP)

System architecture

Request flow (auth + tenant context)

Data isolation (tenant boundaries)

Reference diagram

Definition of done (MVP)

Open questions

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Area	Today	Required change
HTTP API	`./engine serve` — no authentication	Auth + tenant context on every request
Sessions	In-memory `SessionManager`	Scoped by `tenant_id`, persisted
Storage	Files on disk (`~/.engine/` JSON — no database)	Tenant-prefixed paths or database
Credentials	Global `~/.engine/credentials/`	Per-tenant encrypted vault
Agents	Shared repo templates	Per-tenant agent catalog
Users	None	Organization, user, roles
Frontend	Single workspace	Login + tenant-scoped API

Persona	Access
Tenant admin	Manage users, credentials, agents for their organization
Operator / approver	Run workflows, approve in browser
Platform admin	Create tenants, monitor health
Builder	Deploy agent packages into tenant workspace

Uh oh!

[P2][Phase 2][Platform] Multi-Tenant Architecture — phased SaaS isolation #4

Description

Overview

Ticket metadata

Why this matters

Current state

Personas

Phased delivery

MVP — Tenant isolation core

Phase 2 — Tenant workspaces

Phase 3 — Enterprise

Non-goals (MVP)

System architecture

Request flow (auth + tenant context)

Data isolation (tenant boundaries)

Reference diagram

Definition of done (MVP)

Open questions

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions