ContextuAI Solo — Documentation

Installation

Download the installer for your platform and run it — no build tools, no terminal commands, no dependencies to install.

Step 1: Download

Go to the GitHub Releases page and download the latest version for your OS:

Platform	File	Notes
Windows	`.msi` or `.exe`	Double-click to install. You may need to click "More info" → "Run anyway" on the SmartScreen prompt.
macOS	`.dmg`	Open the DMG, drag ContextuAI Solo to Applications. On first launch, right-click → Open to bypass Gatekeeper.
Linux	`.AppImage` or `.deb`	For AppImage: `chmod +x` then run. For .deb: `sudo dpkg -i contextuai-solo.deb`

Step 2: Launch

Open ContextuAI Solo from your applications. On first launch, the Setup Wizard guides you through choosing an AI provider and configuring your profile. That's it — you're ready to go.

No API key? No problem. Select Ollama as your provider in the wizard — Solo will download a free local model and you can start chatting with zero cost, fully offline.

Building from Source (Advanced)

If you prefer to build from source (contributors, developers):

# Prerequisites: Node.js 18+, Python 3.11+, Rust (rustup.rs) git clone https://github.com/contextuai/contextuai-solo.git cd contextuai-solo # Frontend cd frontend && npm install # Backend cd ../backend && pip install -r requirements.txt # Run (two terminals) # Terminal 1: cd backend && CONTEXTUAI_MODE=desktop ENVIRONMENT=desktop python -m uvicorn desktop_app:app --host 127.0.0.1 --port 18741 # Terminal 2: cd frontend && npm run dev # Open http://localhost:1420

Setup Wizard

On first launch, a 3-step wizard walks you through configuration:

Profile — Enter your name, business name, and select your industry (16 options: SaaS, E-commerce, Healthcare, Finance, Education, Marketing, Consulting, Real Estate, Legal, Manufacturing, Non-Profit, Freelancing, and more)

AI Provider — Choose how you want to run AI:

Built-in Local AI Free · No Setup

Solo ships with a built-in Model Hub — 41 prebuilt GGUF models ready to download, plus access to 140,000+ GGUF models on Hugging Face. No API keys, no external tools, no Ollama, no Python — just pick a model, click download, and start chatting. Everything runs on your CPU, fully offline.

This is the fastest way to get started. The wizard lets you pick a recommended model based on your RAM:

Your RAM	Recommended Model	Download Size
4-8 GB	Qwen 3.5 0.8B or Gemma 3 1B	~700 MB
8-16 GB	Qwen 3 8B or DeepSeek R1 7B	~4-5 GB
16-32 GB	Gemma 4 12B or Qwen 3 14B	~7-9 GB
32+ GB	Gemma 4 27B or DeepSeek R1 32B	~16-20 GB

New: Gemma 4 — Google's latest model family (April 2025). Gemma 4 12B rivals models twice its size on reasoning and instruction-following. Gemma 4 27B is one of the strongest open models available. Both run fully on CPU inside Solo — our top recommendation for 16GB+ machines.

Cloud Providers BYOK

If you have API keys from cloud providers, you can add them for access to the latest frontier models:

Anthropic Claude — Sonnet 4, Opus, Haiku (Get API key)
OpenAI — GPT-4o, GPT-4o Mini, GPT-4 Turbo, o1-preview (Get API key)
Google Gemini — 2.0 Flash, 2.0 Pro, 1.5 Flash (Get API key)
AWS Bedrock — Claude and Titan models
Ollama — If you already run Ollama locally, Solo can connect to it as an additional provider

You can use local + cloud together — switch between them anytime from the model dropdown in chat.

Brand Voice — Define your target audience, content topics, and brand tone

Zero-cost path: Pick a built-in local model in the wizard and you're chatting in minutes — no accounts, no API keys, no subscriptions. Add cloud providers later from Settings if you want.

Your First Chat

After the wizard, you land on the Chat page:

Type a message in the input box at the bottom
Select an AI model from the dropdown (shows provider badges: Local/Cloud)
Optionally pick a persona to give the AI specialized context
Press Enter to send — the AI streams its response in real-time

Conversations are saved as sessions in the left sidebar, grouped by date with message counts.

The main chat interface with model selector, persona picker, and session sidebar

AI Chat

The chat interface is the heart of Solo:

Multi-model support — Switch between providers and models mid-conversation using the dropdown
Real-time streaming — Responses appear token-by-token; click the stop button to abort
Session management — Create, rename, archive, delete, and search sessions from the sidebar
Persona selection — Attach a persona to give the AI access to your database, API, or custom instructions
Markdown rendering — Code blocks with syntax highlighting, tables, lists, and rich formatting
Thinking mode — Models that support reasoning (Qwen 3, DeepSeek R1) show a collapsible thinking section
Dark/Light mode — Toggle from the top bar; applies across all pages

Shortcut: Press Ctrl+N (or Cmd+N) to start a new chat instantly. Use Shift+Enter for a new line without sending.

Model Hub

Configure cloud AI providers and download local models from Settings → AI Providers.

Browse and configure AI models — cloud providers and local GGUF models

Cloud Providers

Enter your API key for any supported provider. Click Test Connection to verify it works. Toggle the active provider to set your default model.

Local Models

Download GGUF models with one click. Solo auto-detects your RAM and recommends compatible models. Progress is shown in real-time during download.

Manage installed local models — sync status, storage usage, ready to chat

AI Mode Toggle

Switch between Local and Cloud mode. In local mode, all inference runs on your CPU — no internet required after model download.

Personas

Personas are AI identities that connect to your real systems. Solo includes 12 persona types:

Persona Type	What It Does
Nexus Agent	General-purpose AI with custom system prompts
Web Researcher	Search the web and scrape pages
PostgreSQL	Query PostgreSQL in natural language — AI writes the SQL
MySQL	Query MySQL databases with auto-generated SQL
MSSQL	Connect to Microsoft SQL Server
Snowflake	Query your Snowflake data warehouse
MongoDB	Query MongoDB document databases
GitHub	Browse repos, issues, and pull requests
GitLab	Access repos and CI/CD pipelines
API Connector	Call any REST API endpoint
File Operations	Read, write, and parse local files
Slack	Send and read messages in Slack channels

Agent library showing all agent types and kinds

Creating a Persona

Solo uses a 2-step wizard:

Choose Type — Select from the persona type dropdown (each type shows an icon and description)
Configure Details — Enter name, category, description, connection credentials, and system prompt

For database types, use the Test Connection button to verify credentials before saving.

Create agent — select type, configure settings, and customize

Privacy: All credentials are stored locally in SQLite. Nothing leaves your machine.

Agent Library

Solo ships with 96 prebuilt business agents across 14 departments, plus you can create unlimited custom agents. Each has a specialized system prompt, recommended model, and tool configurations.

Browse 96 prebuilt agents across 14 departments — search, filter by role, view details

Agents organized by kind — Prompt, Database, Web, MCP, API, and File

Department	Example Agents
C-Suite	CEO Strategic Advisor, CFO Financial Strategist, COO Operations Optimizer, CTO Technology Advisor
Marketing & Sales	Content Strategist, SEO Specialist, Social Media Manager, Brand Voice Designer, Email Campaign Builder
Finance & Operations	Financial Analyst, Budget Planner, Invoice Processor, Tax Advisor, Revenue Forecaster
Legal & Compliance	Contract Reviewer, Compliance Checker, IP Advisor, Privacy Policy Drafter
HR & People	Recruiter Assistant, Job Description Writer, Performance Review Helper
Design & UX	UI/UX Advisor, Brand Identity Designer, Presentation Builder, Color Palette Generator
Data & Analytics	Data Analyst, SQL Query Builder, Dashboard Designer, Statistical Modeler
IT & Security	DevOps Assistant, Security Auditor, Infrastructure Planner, Incident Response Helper
Product Management	Product Manager, Feature Prioritizer, User Story Writer, Roadmap Planner
Startup & Venture	Pitch Deck Builder, Business Model Canvas Creator, Go-to-Market Strategist
Operations	Process Optimizer, Supply Chain Analyst, Quality Assurance Planner
Specialized	Industry-specific agents and custom roles

Using Agents

Browse by department or filter by role: Researcher, Writer, Analyst, Designer, Developer, Reviewer, Planner, or Custom. Click any agent card to view its full system prompt, tools, and model override.

Creating a Custom Agent

Click + Create Agent and configure: name, role, description, system prompt, tool access, model preference, category, and public/private visibility.

Create a custom agent with role, system prompt, and tool configuration

Crews

Crews are multi-agent teams that collaborate on complex tasks. The crew system uses a 7-step creation wizard.

Crew dashboard with stats (Total, Running, Completed, Failed) and crew list

Solo — Crew Wizard Step 2: Execution Mode

Crew Builder Step 2 — choose execution mode: Sequential, Parallel, Pipeline, or Autonomous

Crew Builder Step 3 — add agents from the library, organized by kind

7-Step Crew Wizard

1 Details & Blueprint

Name your crew and write a clear description of its purpose. Optionally select a blueprint template to pre-fill the remaining steps with proven configurations. Choose the AI model that will power your crew — local GGUF models for privacy, or cloud providers for maximum capability.

Start from a blueprint when possible — you can always customize after. Blueprints encode best practices from real workflows.

2 Execution Mode

Define how your agents collaborate. This is the most important architectural decision for your crew:

Mode	How It Works	Best For	Example
Sequential	Agents run one after another. Each agent's output becomes the next agent's input.	Multi-step processes with clear handoffs	Research → Write → Edit → Publish
Parallel	All agents run simultaneously on the same input. Results are collected together.	Speed-critical tasks needing multiple perspectives	SEO + Social + Email all at once
Pipeline	Staged processing with checkpoints between stages. Pause and verify before the next stage proceeds.	High-stakes workflows needing quality gates	Draft → Review → Approve → Publish
Autonomous	A coordinator agent dynamically decides which agents to invoke, in what order, based on the input.	Open-ended or unpredictable tasks	Customer inquiry triage & resolution

3 Agent Team

Build your team by selecting specialists from the 96-agent library or your custom agents. Agents are organized by kind for fast discovery:

Kind	What They Do	Examples
Prompt	Text generation, analysis, and reasoning	Content Writer, Strategist, Analyst
Database	Query and transform structured data	SQL Analyst, Data Mapper
Web	Scrape, search, and monitor web content	Web Researcher, SEO Auditor
MCP	Connect to external tools via Model Context Protocol	GitHub Agent, Slack Agent
API	Call REST/GraphQL endpoints	CRM Sync, Payment Processor
File	Read, write, and transform local files	CSV Processor, Report Generator

Drag to reorder agents for sequential and pipeline modes — order determines the processing chain.

4 Channels

Bind your crew to distribution channels so it can receive and send messages across platforms. For each connected channel, set the direction:

Direction	Behavior
Inbound	Crew listens for incoming messages and reacts to triggers
Outbound	Crew publishes its output to this channel
Both	Full two-way communication — listen and respond

Available channels: WhatsApp, Telegram, Slack, Discord, Email (SMTP/IMAP), SMS (Twilio), Webchat, WordPress, Zapier, and Webhooks.

5 Triggers

Define what activates your crew. Combine reactive and scheduled triggers for full automation:

Type	How It Works	Example
Keywords	Crew activates when specific words appear in incoming messages	"pricing", "refund", "cancel"
Hashtags	Trigger on hashtag mentions across social channels	#support, #feedback
@Mentions	Activate when your bot or brand is mentioned	@yourbot in Slack or Discord
Regex	Pattern-based matching for complex conditions	Order IDs like `ORD-\d{6}`
Cron Schedule	Run on a recurring schedule	Every Monday at 9am, daily at midnight
One-shot	Run once at a specific date and time	Campaign launch on June 1st

Triggers support cooldown periods to prevent spam — set a minimum gap between activations so your crew doesn't fire repeatedly on rapid messages.

6 Approval

Enable human-in-the-loop review to maintain quality and brand safety. When enabled, every outbound message lands in your Approval Queue before being sent.

You can approve, edit, or reject each message. Nothing reaches your audience without your explicit sign-off. This is especially important for:

Customer-facing communications (support replies, sales outreach)
Social media posts that represent your brand
Financial or legal content requiring compliance review
Any crew you're still tuning and testing

Start with approval enabled for new crews. Once you trust the output quality, you can disable it for fully autonomous operation.

7 Review & Create

The final step shows a complete summary of your crew configuration — agents, execution mode, channels, triggers, and approval settings. Confirm everything looks right and click Create Crew. Your crew is immediately ready to run manually or will activate automatically based on your configured triggers.

Running a Crew

Click Run on any crew. Enter your input text and watch real-time progress: status bar, duration, token count, cost, step timeline, and per-agent metrics. You can cancel a run mid-execution.

The Runs tab shows execution history with status, duration, and cost for every past run.

Blueprints

Solo ships with 10 pre-built crew templates across 5 categories to jumpstart your workflows.

Browse blueprint templates — search, filter by category, preview content

10 Blueprints

Reusable crew configurations across 5 categories (Strategy, Content, Marketing, Product, Research):

Strategy — OKR Planning, SWOT Analysis, Lean Canvas Brainstorm
Content — Blog Content Pipeline, Social Media Strategy
Marketing — Campaign Planning
Product — Feature Prioritization, User Story Workshop
Research — Competitor Analysis

Each blueprint pre-selects agents, execution mode, and goals. Clone and customize to fit your specific workflow.

Creating Custom Blueprints

Click + Create Blueprint and define: name, description, category, tags, and markdown content. Your custom blueprints appear alongside the built-in ones.

Create a custom blueprint with markdown content and tags

Knowledge Base

Solo includes a fully local RAG (Retrieval-Augmented Generation) system. Upload documents or map folders from your machine — Solo chunks, embeds, and indexes everything locally. Ask questions in chat and get answers with source citations.

Adding Documents

Two ways to add content to a Knowledge Base:

File Upload — Drag and drop PDF, DOCX, TXT, or Markdown files (up to 10 MB each)
Folder Mapping — Point at any directory on your machine (Obsidian vault, Notes folder, code repo). Solo indexes all supported files recursively up to depth 10.

How Indexing Works

Step	Details
Chunking	~500 tokens per chunk, 50-token overlap, page-tracked for PDFs
Embedding	all-MiniLM-L6-v2 ONNX model (384-dim, unit-normalized) — bundled with Solo, no download needed
Storage	Embeddings + chunks + metadata stored in SQLite (`kb_documents`, `kb_chunks`)
Retrieval	Cosine similarity via numpy dot-product, top-k results injected into prompt

Folder Mapping Schedules

Auto-sync options: Manual, Every 1 hour, Every 6 hours, or Every 24 hours. Solo tracks file changes (new, modified, deleted) using path + size + modification time. A friction guardrail pauses indexing if a folder contains more than 1,000 files — you confirm before proceeding.

Using a Knowledge Base in Chat

Select a KB from the dropdown above the chat input
Ask your question — Solo retrieves relevant chunks and passes them to the AI model
The AI responds with [1], [2] citations pointing back to source documents

Crew Binding: Crews and workspace agents can bind to specific Knowledge Bases. The KB is queried automatically on every turn, so crew agents always have relevant context.

Limits

Max file size: 10 MB per file
Max files per folder mapping: 5,000
Max folder depth: 10 levels
Works with both local GGUF models and cloud providers

Automations

Automations are quick, one-off agent tasks using natural-language @mention syntax. No wizard needed — just type and run.

Automation list with @agent-mention prompts, triggers, and output actions

How It Works

Write a prompt with @agent-handle mentions (e.g., @content-strategist write a LinkedIn post about our Q2 launch)
Solo's parser detects the agents and infers the execution mode (sequential if ordered, parallel if independent)
Agents execute and stream results in real-time

Output Options

Output	Description
Chat	Results displayed in the automation view
PDF	Formatted document exported via ReportLab
PPTX	Presentation slides via python-pptx
Markdown	Raw markdown output
Distribution Channel	Publish directly to LinkedIn, Twitter, Email, Telegram, or any connected channel

Promote to Crew

Any automation can be promoted to a scheduled crew with one click. The agent selections, output action, and context carry over — just add triggers and approval settings.

Quick tip: Use automations for ad-hoc work and promote to crews when you want the same task to run on a schedule.

Automation builder — name, @agent prompt, trigger configuration, and output action picker

Connections

Connect Solo to external messaging platforms for automated content publishing and AI-powered responses.

Platform connection cards with status badges and direction toggles

Platform	Auth Method	Direction	Status
Telegram Bot	Bot token	Inbound + Outbound	Live
Discord Bot	Bot token + Public Key + App ID	Inbound + Outbound	Live
Reddit	Script app credentials	Inbound + Outbound	Live
LinkedIn	OAuth 2.0 (Client ID/Secret)	Outbound	Live
Twitter/X	API Key + Secret + Access Tokens	Outbound	Live
Instagram	OAuth 2.0 (Meta)	Outbound	Live
Facebook	OAuth 2.0 (Meta)	Outbound	Live
Blog	API key (Ghost / WordPress)	Outbound	Live
Email	API key (SendGrid / SES) or SMTP	Outbound	Live
Slack	Webhook URL	Outbound	Live

Setting Up Telegram

Open Telegram, search for @BotFather
Send /newbot and follow the prompts to get your bot token
In Solo, go to Connections → Telegram → paste the token
Toggle Inbound/Outbound as needed and click Connect
For inbound messages, set up a webhook with ngrok: ngrok http 18741

Setting Up Discord

Go to the Discord Developer Portal
Create a New Application → go to Bot → copy the token
Enable Privileged Gateway Intents (Message Content, Server Members)
In Solo, paste the Bot Token, Public Key, and Application ID

Setting Up LinkedIn

Create an app at LinkedIn Developers
Request "Share on LinkedIn" access for your app
In Solo, enter Client ID and Client Secret
Click Connect — an OAuth browser window will open for authorization

Connected platforms can be bound to crews in Step 4 of the crew wizard, enabling automated content distribution with human-in-the-loop approval.

Best models for connections: Qwen 3 8B or 14B gives the best results for auto-reply and content generation.

Approval Queue

When crews or agents auto-reply to incoming messages, every response lands in your approval queue first. Nothing gets sent without your explicit approval.

Review AI-drafted replies before they are sent — approve, edit, or reject

Set up triggers — Link a channel (Telegram, Discord) to a crew or agent with "require approval" enabled
AI drafts a reply — Incoming messages are processed by your AI; the draft appears in the queue
Review & send — Edit the response if needed, then approve to send or reject to discard

Triggers support cooldown periods to prevent spam, and you can link a crew instead of a single agent for more sophisticated multi-step responses.

Coder Mode

Toggle into Coder Mode (Ctrl+Shift+M / Cmd+Shift+M) for a local development workspace with AI-powered multi-agent coding workflows.

Mode Toggle

A pill toggle at the top center of the app switches between Solo (business assistant) and Coder (development workspace). Your mode preference is saved and persists across sessions.

Coder Sidebar

Projects — Create, list, and run coding projects
Running — Active executions with stdout streaming and kill button
Templates — 4 starter scaffolds (Web App, Telegram Bot, CLI Tool, Static Site)
Models — Model selector for coding tasks
Settings — Provider keys and coder-specific configuration

Project Templates

Template	Stack	What You Get
Web App	React + Vite	Starter app with localhost preview
Telegram Bot	Bot framework	Bot scaffold with token config
CLI Tool	Python/Node	CLI scaffold with argument parsing
Static Site	HTML/CSS/JS	Basic site ready to customize

Multi-Agent Workflows

Define agent roles (planner, coder, reviewer) and run them in 4 workflow modes:

Mode	How It Works
Solo	One agent (usually "coder") runs alone
Sequential	Agents run in order, each receiving chat history + all prior outputs
Parallel	"Coder" runs first; all other agents run concurrently after
Custom	Agents in user-defined order

5 Coder-Companion Agents

Visible only in Coder Mode:

Code Reviewer — PR review, quality feedback
Bug Analyzer — Error diagnosis, stack trace interpretation
Test Writer — Unit test and integration test generation
Doc Generator — Docstring and README generation
Refactor Advisor — Code cleanup, performance optimization

Sandboxed Shell

Run dev servers, build commands, and tests in a restricted shell. Tauri capabilities restrict file system access to the project directory only. Each project has a trust state that controls what commands can execute.

Cross-Mode Handoffs

Crews can include a "Run Coder Project" step to execute a coding project headlessly. Automations can also invoke Coder workflows. This lets you chain business logic (Solo) with coding tasks (Coder) in a single pipeline.

Index Codebase as Knowledge Base

Coder projects can be indexed as a Knowledge Base (folder-mapped RAG), making the entire codebase searchable by AI agents during code review or bug analysis.

Shortcut: Press Ctrl+Shift+M (or Cmd+Shift+M) to toggle between Solo and Coder mode instantly.

BYOK — Bring Your Own Key

Solo supports 5 AI providers. Configure in Settings → AI Providers:

Provider	Models	Cost
Anthropic Claude	Sonnet 4, Opus, Haiku	BYOK
OpenAI	GPT-4o, GPT-4o Mini, GPT-4 Turbo, o1-preview	BYOK
Google Gemini	2.0 Flash, 2.0 Pro, 1.5 Flash	BYOK
AWS Bedrock	Claude, Titan	BYOK
Built-in Model Hub	41 prebuilt + 140K+ Hugging Face GGUF	Free

API keys are stored in your browser's localStorage. They are never sent to any server other than the respective AI provider.

Local Models

Solo includes a Model Hub with 41 prebuilt GGUF models across 9 families, plus access to 140,000+ GGUF models on Hugging Face. One-click download — no API key, no internet after download, no data leaving your machine.

RAM Requirements

Your RAM	Models You Can Run
4 GB	Qwen 3.5 0.8B, Gemma 3 1B, Llama 3.2 1B
8 GB	Qwen 3 8B, DeepSeek R1 7B, Mistral 7B, Qwen 2.5 Coder 7B
16 GB	Qwen 3 14B, Gemma 4 12B, Phi-4 14B, DeepSeek R1 14B
32 GB	Qwen 3 32B, Gemma 4 27B, DeepSeek R1 32B, Gemma 3 27B
48+ GB	Llama 3.1 70B, DeepSeek R1 70B

9 Model Families (41 Models)

Gemma 4 New · Recommended — Google's latest (April 2025). 12B and 27B variants. Best-in-class instruction following, reasoning, and multilingual support at its size. Our top pick for 16GB+ machines.
Qwen 3.5 / 3 / 2.5 — Alibaba's versatile family. General chat, reasoning, and coding (Qwen 2.5 Coder is excellent for IDE integration). Wide range from 0.8B to 32B.
DeepSeek R1 — Advanced reasoning with visible thinking mode. 7B to 70B. Great for analysis, math, and complex problem-solving.
Gemma 3 — Google's previous generation. Still solid for lightweight tasks (1B to 27B).
Llama 3 — Meta's open models. 1B to 70B. Strong general-purpose performance.
Mistral — Fast and lightweight (7B). Good for quick responses on lower-spec machines.
Phi-4 — Microsoft's 14B model. Punches above its weight on reasoning tasks.

Inference powered by llama-cpp-python — CPU-only, no GPU required. Models stored in ~/.contextuai-solo/models/.

Built-in Coding Server

Solo exposes an OpenAI-compatible API at localhost:18741/v1/chat/completions. Point your IDE at it for free, offline code completion.

IDE Setup

IDE	How to Connect
VS Code	Continue or Copilot extension → set base URL to `http://localhost:18741/v1`
Cursor	Settings → Models → Add custom model endpoint
Windsurf	OpenAI-compatible provider → set base URL
Aider	`aider --openai-api-base http://localhost:18741/v1`
Any tool	Any tool that speaks the OpenAI API format — just change the base URL

Best coding models: Download Qwen 2.5 Coder (7B for 8GB RAM, 14B for 16GB, 32B for 32GB) or DeepSeek R1 for reasoning-heavy tasks.

Settings

Access settings from the sidebar. 5 tabs:

Settings page with AI Providers, Brand Voice, Appearance, Data & Export, and About tabs

AI Providers tab — configure cloud provider API keys (Anthropic, OpenAI, Google, Bedrock, Ollama)

AI Providers — Configure API keys, download local models, test connections, set active provider
Brand Voice — Business name, industry, brand prompt, target audience, content topics
Appearance — Theme (Light, Dark, System) and font size (Small, Medium, Large)
Data & Export — Export/import database as JSON, clear all data
About — App version, check for updates, technology stack links

Brand Voice

Configure your brand identity so AI-generated content matches your tone. Settings → Brand Voice:

Business Name — Used in agent prompts and content generation
Industry — 12 options (Tech, Marketing, Finance, Healthcare, etc.) to inform domain knowledge
Brand Voice Prompt — Free-form instructions: tone, style, personality, things to avoid
Target Audience — Who your content is aimed at
Content Topics — Key subjects the AI should focus on

A dynamic preview shows how your brand voice configuration looks before saving.

Data & Export

All your data lives locally. From Settings → Data & Export:

Export Data — Download a full JSON backup of all chats, personas, agents, crews, and settings
Import Data — Restore from a previous backup file
Clear All Data — Permanently delete everything (use with caution)

Important: Solo has no cloud sync. Back up regularly via export if your data matters to you. Exports go to your Downloads folder.

Keyboard Shortcuts

Shortcut	Action
`Ctrl+N` / `Cmd+N`	New chat session
`Enter`	Send message
`Shift+Enter`	New line in message
`Ctrl+Shift+M` / `Cmd+Shift+M`	Toggle Solo / Coder mode
`Escape`	Stop AI generation

Data Storage

Everything is stored locally on your machine:

What	Where
Database (chats, personas, agents, crews)	`~/.contextuai-solo/data/contextuai.db`
Local AI models (GGUF files)	`~/.contextuai-solo/models/`
API keys & provider config	Browser localStorage
Data exports	Downloads folder

No telemetry, no cloud calls (except to your chosen AI provider), no data collection.

Tech Stack

Layer	Technology
Desktop Shell	Tauri v2 (Rust) — lightweight, secure, cross-platform
Frontend	React 19 + Vite + TypeScript 5.9
Styling	Tailwind CSS + Framer Motion
Backend	FastAPI (Python 3.11+)
Database	SQLite via async adapter
Local Inference	llama-cpp-python (CPU-only)
AI Providers	Anthropic, OpenAI, Google, AWS Bedrock, Ollama
Agent Framework	Strands Agents SDK
Icons	Lucide React

Troubleshooting

Backend won't start

Verify Python 3.11+: python --version
Install dependencies: cd backend && pip install -r requirements.txt
Check port 18741 is free: netstat -ano | findstr 18741 (Windows) or lsof -i :18741 (Mac/Linux)

Frontend won't start

Verify Node.js 18+: node --version
Clean install: rm -rf node_modules && npm install
Windows rollup error: npm install @rollup/rollup-win32-x64-msvc

Chat returns errors

Verify API key in Settings → AI Providers → Test Connection
If using Ollama, make sure it's running: ollama serve
Check the backend terminal for error details

Local model download fails

Check internet connection (download only — inference is fully offline)
Ensure enough disk space in ~/.contextuai-solo/models/
Try a smaller model first (Gemma 3 1B is ~700MB)

Telegram/Discord webhook not receiving messages

Ensure ngrok is running: ngrok http 18741
Set the webhook URL in the platform's settings to your ngrok HTTPS URL
Check Discord privileged intents are enabled (Message Content, Server Members)

Database reset

Export your data first from Settings → Data & Export
Delete ~/.contextuai-solo/data/contextuai.db
Restart the backend — it recreates the database with defaults

ContextuAI Solo User Guide

Installation

Step 1: Download

Step 2: Launch

Building from Source (Advanced)

Setup Wizard

Built-in Local AI Free · No Setup

Cloud Providers BYOK

Your First Chat

AI Chat

Model Hub

Cloud Providers

Local Models

AI Mode Toggle

Personas

Creating a Persona

Agent Library

Using Agents

Creating a Custom Agent

Crews

7-Step Crew Wizard

Running a Crew

Blueprints

10 Blueprints

Creating Custom Blueprints

Knowledge Base

Adding Documents

How Indexing Works

Folder Mapping Schedules

Using a Knowledge Base in Chat

Limits

Automations

How It Works

Output Options

Promote to Crew

Connections

Setting Up Telegram

Setting Up Discord

Setting Up LinkedIn

Approval Queue

Coder Mode

Mode Toggle

Coder Sidebar

Project Templates

Multi-Agent Workflows

5 Coder-Companion Agents

Sandboxed Shell

Cross-Mode Handoffs

Index Codebase as Knowledge Base

BYOK — Bring Your Own Key

Local Models

RAM Requirements

9 Model Families (41 Models)

Built-in Coding Server

IDE Setup

Settings

Brand Voice

Data & Export

Keyboard Shortcuts

Data Storage

Tech Stack

Troubleshooting

Backend won't start

Frontend won't start

Chat returns errors

Local model download fails

Telegram/Discord webhook not receiving messages

Database reset

Get in Touch

Message Sent!