Atlas Metis: RAG Engine
Platform Status Brief
March 29, 2026
Confidential
01 — Overview
Platform Status

Atlas Metis is a production-deployed RAG-as-a-Service platform. Backend API and dashboard are live on Vercel with the first client tenant (SparkBot.App) operational. The full pipeline — document ingestion, hybrid search with Cohere reranking, GPT-4o-mini generation with source citations, and faithfulness scoring — is verified end-to-end. Client onboarding is invite-only with admin-controlled provisioning. All original P1 issues are resolved. The platform is ready for client acquisition.

62
API Routes
63
Automated Tests
1
Live Client Tenant
86%
Target Margin

System Status

02 — Operations
How a Client Gets Onboarded
Current onboarding time: < 5 minutes from tenant creation to first query.
03 — Current State
Operational Systems
Single File Ingestion
Upload PDF, DOCX, TXT, CSV, audio, images (with Gemini) — auto-process to searchable vectors
Hybrid Search
Semantic + keyword search combined via Reciprocal Rank Fusion
Cohere Reranking
Cross-encoder reranking boosts raw scores from ~0.016 to ~0.97
LLM Generation
GPT-4o-mini generates grounded answers with source citations
Self-RAG Validation
Validates chunk relevance before generating — catches hallucinations
SSE Streaming
Real-time streaming responses via Server-Sent Events
Multi-Tenant Isolation
Verified — Tenant A cannot see Tenant B’s data
Dual API Key Auth
Tenant keys (scoped) + Admin keys (full access) with bcrypt
Job Tracking
Every ingestion tracked with status, timing, and error capture
Health Diagnostics
7 automated checks per tenant with alert creation and resolution
Admin Dashboard
Fleet overview, alert center, metrics bar, diagnostics trigger
Usage Tracking
Queries, tokens, rerank units tracked per tenant per period
Cost Tracking
Real pricing from OpenAI ($0.13/1M), Cohere, and Gemini ($0.15/1M tokens)
Rate Limiting
Sliding window rate limiter enforced per API key
File Size Limits
100MB upload cap prevents memory exhaustion on large files
Gemini Embeddings
Multimodal: images, video, audio natively embedded via Gemini Embeddings 2
04 — Issues
Issues by Priority
RESOLVED Completed This Sprint (March 29)
FeatureResolution
Backend Deployment Deployed to Vercel Python (atlas-metis-api.vercel.app)
OAuth Token Refresh Auto-refresh on 401 for Google Drive + Dropbox, persists new tokens
Gemini E2E Test Full pipeline verified — ingest, search, rerank, generate (score 0.952)
HyDE Fallback Implemented in retrieval.py, per-tenant toggle via use_hyde config
Test Suite 63 automated tests — chunking, context, auth, cache, rate limiting
Client Onboarding Request Access flow + admin approve + auto-provisioning
API Key Security Masked in settings UI, proxy is server-side only
Middleware Hardening Public routes skip Supabase auth — prevents 504 on outage
P2 Remaining (Non-Blocking)
FeatureStatus
Additional Connectors 8 of 11 types not yet implemented (SharePoint, S3, Notion, etc.)
Per-Tenant Portal Customization Branding, persona, suggested questions — planned
Stripe Billing Integration Plans defined, integration not yet built
P&L Admin Dashboard Revenue vs cost tracking per tenant — planned
All P0s and P1s resolved. Zero blocking issues remain. The platform is client-ready for managed onboarding. P2 items are feature enhancements for scale.
05 — Product Architecture
Three-Tier Dashboard Architecture

Atlas Metis serves three distinct user tiers, each with a purpose-built interface, authentication model, and API layer.

Access Levels

LayerAccessAuthURL Pattern
Master Admin Atlas Minds team Admin API key /admin
Org Admin Client admins Supabase Auth /dashboard
End User Anyone with link None (public) /portal/{org-slug}
Master Admin (Atlas Minds)
Fleet Health
Cross-Tenant Analytics
Revenue & Cost Tracking
Org Comparison
Diagnostics & Alerts
Org Admin (Client)
Document Management
Collection Management
Query Playground
Usage & Billing
Settings & API Keys
End User Portal
Chat Interface
Streaming Responses
Source Citations
Per-Org Branding
No Login Required

Data Flow by Tier

Public
End User
Portal API
Public
No auth required
Backend
Supabase
Authenticated
Org Admin
Customer API
Auth’d
Supabase Auth
Backend
Supabase
Admin
Master Admin
Admin API
Admin Key
Full access
Backend
Supabase
06 — Infrastructure
Technical Architecture
External
Client App
Gateway
FastAPI
55 routes • async
Security
Auth Layer
Dual key • bcrypt
Ingestion Pipeline
Parse → Chunk → Embed → Store
Retrieval Pipeline
Search → Rerank → Validate → Generate
Diagnostics
7 Health Checks → Alerts → Auto-Resolve
Database
Supabase
pgvector • RLS
Embeddings + LLM
OpenAI + Gemini
Multi-provider • 3072d • Multimodal
Reranking
Cohere
Rerank v3.5
Task Queue
Redis / Celery
Not running
07 — Roadmap
Path to Production
A
Fix P0s — Critical Dead Code
✓ Completed 2026-03-05
  • URL ingestion, batch upload, connector sync — all wired
  • Celery sync fallbacks added for all dependent endpoints
B
Fix P1s + Multi-Provider Embeddings
✓ Completed 2026-03-11
  • Cost tracking, collection counters, concept clustering, auth caching, query cache
  • Multi-provider embeddings (OpenAI + Gemini) with multimodal support
  • Rate limiting, file size limits, faithfulness checks, auto-slug
C
Three-Tier Dashboard + Analytics
✓ Completed 2026-03-11
  • End User Portal — public chat interface per organization with streaming and citations
  • Master Admin analytics — cross-tenant usage, cost trends, org comparison
  • Org Admin dashboard — document/collection management, query playground, billing
D
Production Deployment + Client Onboarding
✓ Completed 2026-03-29
  • Backend deployed to Vercel Python (atlas-metis-api.vercel.app)
  • OAuth token refresh, HyDE search, 63 automated tests, middleware hardening
  • Invite-only onboarding: Request Access → Admin Approve → auto-provision
  • First client tenant live (SparkBot.App) with verified E2E pipeline
E
Scale & Monetize
In Progress
  • Per-tenant portal customization (branding, persona, suggested questions)
  • Stripe billing integration with tiered plans
  • P&L admin dashboard (revenue vs cost per tenant)
  • Local GPU hosting option (hybrid cloud/local failover)
  • Additional connectors (SharePoint, S3, Notion)
Platform is production-ready and serving clients
Built by Atlas Minds
atlas-minds.com
March 29, 2026