100% On-Premises • Audit-Ready • No External AI APIs

Your Enterprise
AI Assistant

The private AI platform for IT, compliance, operations, and engineering teams. Ship secure internal AI with Active Directory SSO, governed automation, and full request-level traceability. Your data stays inside your network.

Organization AI Dashboard

🔒
Zero Data Leakage
🏢
Active Directory SSO
200+ Concurrent Users
💻
Governed Automation

👤Employee asks a questionYour Server✓ LOCAL
🤖LLM processes on-premisesOllama Engine✓ LOCAL
🗃RAG retrieves from Knowledge BasePostgreSQL✓ LOCAL
📡Response streamed to userIntranet Only✓ LOCAL
☁️External cloud / third-party APIInternet✗ BLOCKED

Your Data, Your Network.
Period.

🔒

Air-Gapped Architecture

The entire stack — LLM, database, API, frontend — runs within your corporate firewall. No external API calls, no telemetry, no cloud dependencies.

🏢

Active Directory Single Sign-On

Authenticate via your existing Windows domain credentials (LDAP/LDAPS). No separate accounts needed — employees use the same login they already know.

📑

Full Audit Trail

Every action is logged: logins, messages, admin changes, exports. Meet compliance requirements with immutable, searchable audit records.

🛠

Role-Based Access Control

Granular permissions for users, departments, and admin groups. Control who can access which agents, knowledge bases, and admin functions.

Built To Launch Internal AI Programs Fast

From pilot to production without sending sensitive data outside your network. Everything is self-hosted, governed, and ready for enterprise rollout.

💬

Smart Conversations

Real-time streaming chat with context-aware responses, conversation history, search, tagging, archiving, and keyboard shortcuts.

🤖

AI Agents

10+ specialized personas — Code Reviewer, Data Analyst, HR Policy, IT Helpdesk, Document Writer — each with custom system prompts and behavior.

📚

RAG Knowledge Base

Upload company documents (PDF, DOCX, PPTX, XLSX), auto-chunk and embed, then query them in natural language with source citations.

Skills Engine

Pre-built multi-step AI workflows — API Docs Generator, Bug Report Writer, SQL Builder, Meeting Minutes, SWOT Analysis, and more.

📄

Prompt Templates

17+ system templates (Writing, Coding, Analysis, Productivity) — one-click reuse for Professional Email, Code Review, Report Writer, etc.

🧠

AI Memory

Persistent memory across conversations — user preferences, facts, and context. Auto-extract or manually add memories per user, department, or org.

📎

File Attachments

Attach images and documents directly to chat. AI analyzes uploaded files (PDF, Word, Excel, PowerPoint, HTML) with full text extraction.

📦

Export (.md / .pdf)

Export individual conversations as styled PDF or Markdown. Bulk export all conversations as a timestamped ZIP archive for compliance.

🔗

Conversation Sharing

Share conversations with colleagues via secure internal links. Bookmark important messages, tag conversations for easy retrieval.

🛠

Admin Panel (15 Tabs)

Comprehensive control plane for system operations, content governance, and AI automation. Feature-flag aware UI ensures clean, reliable admin workflows.

📊

Health Dashboard

Real-time system health monitoring — database, LLM service, uptime, active users, total conversations, and message counts at a glance.

🗃

Database Management

Full backup/restore across mapped tables including organization mappings, connection stats, table row counts, and per-table clear controls.

🎓

Model Management

Pull, switch, and delete Ollama models from the admin UI. Configure temperature, max tokens, context window, GPU layers per model.

📅

Scheduled Tasks

Background job scheduler with cron expressions — automated reports, data cleanup, periodic notifications. Full execution history and error logs.

🔔

Notifications & Announcements

In-app notification center and organization-wide announcements. Push alerts from scheduled tasks, admin broadcasts, and system events.

🔍

Eval Dashboard & Trace Timeline

Inspect request phases, model routing, retrieval activity, retries, and completion latency to improve quality and reduce operational blind spots.

Approval-Gated Actions

Idempotent action requests with approve/reject/execute controls for safer automation in regulated environments and change-managed operations.

45+
Enterprise Features
146+
API Endpoints
31
Database Tables
15
Admin Tabs

Recent Product Highlights

The platform now includes deeper observability and safer automation controls for enterprise rollout and governance.

Backend

Request Trace Timeline

Every request can be inspected phase-by-phase (prompt, model route, retrieval, retries, completion) for faster debugging and QA.

Frontend

Eval Dashboard

Admins can monitor latency, quality flags, model usage patterns, and trace sessions from one operations-focused control surface.

Data

Approval-Gated Execution

High-impact actions now support idempotency keys and approval gates to reduce duplication risk and improve operational safety.

Built for Real Enterprise Use

Every screen is designed for productivity — dark theme, responsive layout, keyboard shortcuts, and zero learning curve.

Chat Screen
Core

Chat Interface

Real-time streaming with model selector, agent picker, file attachments, Skills, Memory, Share, and export (.md/.pdf).

Login Page
Core

Secure Login

Active Directory SSO with domain credentials. "100% on-premise. No data leaves your organization."

AI Agents
AI

AI Agents

10 specialized personas — Code Review, Compliance, Data Analyst, Document Writer, Email Composer, HR Policy, IT Helpdesk, and more.

Knowledge Base
AI

RAG Knowledge Base

8 department-scoped knowledge bases — Company Policies, IT Documentation, Legal, Sales, Finance, Engineering, and more.

Skills Engine
AI

Skills Engine

Categorized AI workflows: Development, Communication, Productivity, Analysis — API Docs, Bug Reports, SQL Builder, Meeting Minutes, SWOT.

Prompt Templates
Admin

Prompt Templates

17 system templates across Writing, Coding, Analysis, and Productivity categories. One-click reuse with usage tracking.

Health Dashboard
Admin

Health Dashboard

System, Database, and LLM Service health indicators with Total Users, Active Today, Conversations, and Messages Today metrics.

Admin Settings
Admin

Admin Settings

Full system configuration — Application, Active Directory / LDAP, LLM Engine, Session, Rate Limiting, and more. Live .env persistence.

Database Management
Data

Database Management

Connection configuration, PostgreSQL version, table statistics with row counts, and per-table clear/export/import controls.

Fully Air-Gapped Design

Every component runs within your corporate network. No external API calls, no telemetry, no cloud dependencies whatsoever.

💻
FrontendReact 18 + TypeScript + Vite • Tailwind CSS • Dark ModeInternal
🔐
AuthenticationActive Directory / LDAP • JWT Tokens • Session ManagementSecured
⚙️
API LayerFastAPI 0.135 • 146+ REST Endpoints • Async • Rate LimitingInternal
🔥 Corporate Firewall — No Outbound Connections
🤖
AI EngineOllama • Llama 3.2, Gemma 3 • GPU / CPU inferenceInternal
🗃
DatabasePostgreSQL • 31 Tables • Migrations • Full Backup/RestoreInternal

Deploy Fast, Scale Confidently

Pick your preferred deployment path. Start in minutes on a laptop and scale to enterprise infrastructure without changing core architecture.

Windows (PowerShell)
# Clone the repository
git clone https://github.com/sagarsorathiya/Organization_AI.git
cd Organization_AI

# Run the setup wizard (detects hardware, installs everything)
powershell -ExecutionPolicy Bypass -File setup.ps1

Linux / macOS
# Clone the repository
git clone https://github.com/sagarsorathiya/Organization_AI.git
cd Organization_AI

# Run the setup wizard
chmod +x setup.sh
./setup.sh
Docker Compose
# Clone and configure
git clone https://github.com/sagarsorathiya/Organization_AI.git
cd Organization_AI
cp .env.example backend/.env
# Edit backend/.env with your settings

# CPU-only (works on any server)
docker compose up -d

# With NVIDIA GPU acceleration
docker compose -f docker-compose.yml -f docker-compose.gpu.yml up -d
Manual Setup
# 1. Backend
cd backend
python -m venv venv
source venv/bin/activate      # Linux
venv\Scripts\activate        # Windows
pip install -r requirements.txt
cp ../.env.example .env       # Configure .env
alembic upgrade head
uvicorn app.main:app --port 8000

# 2. Frontend
cd ../frontend
npm install
npm run dev

# 3. Pull an AI model
ollama pull llama3.2:3b

Built for Enterprise Throughput

Scale from a single developer laptop to multi-GPU rack servers. Zero changes to code — just add hardware.

DeploymentHardwareModelParallel Users
Developer Laptop16 GB RAM, 6-core CPULlama 3.2 3B / Gemma 3 4B1 – 3
Small Team Server32 GB RAM, RTX 3060 12 GBLlama 3.1 8B10 – 25
Department Server64 GB RAM, RTX 4090 24 GBLlama 3.1 8B (quantized)25 – 80
Enterprise Rack128+ GB RAM, 2× A100 80 GBLlama 3.1 70B80 – 200+

Launch Private AI With Governance Built In

Run secure, auditable, high-performance AI inside your own network with zero cloud exposure and full operational control.