
Off Grid
A private AI that runs on the hardware you already own.
Chat, images, vision, voice, documents - all on your phone, all offline, nothing sent anywhere. And it is becoming something bigger: one intelligence layer across your phone and your laptop, ambient and proactive, that never leaves your hands.
Play Store, App Store & GitHub
Over 100,000 people already run AI on their own phone with Off Grid. No account, no subscription for the core, no cloud. The phone in your pocket has enough compute to run a capable model offline, at real speed - Off Grid makes it do exactly that.
Start with the app
| Capability | Details |
|---|---|
| Text generation | Llama, Qwen 3, Gemma 3, Phi-4, Mistral and any GGUF model - 15–30 tok/s on flagship devices |
| Image generation | On-device Stable Diffusion - 5–10s on NPU (Snapdragon), Core ML on iOS. 20+ models |
| Vision AI | Point your camera at anything and ask questions. SmolVLM, Qwen3-VL, Gemma 3n |
| Voice input | On-device Whisper speech-to-text. Hold to record, auto-transcribe. No audio leaves your phone |
| Tool calling | Web search, calculator, date/time, device info. Automatic tool loop |
| Document analysis | Attach PDFs, CSVs, code files. Native PDF text extraction on both platforms |
| Remote servers | Connect to Ollama, LM Studio, LocalAI on your home network |
| Works offline | Airplane mode, restricted networks, anywhere |
Where this is going
The app is the first piece. The whole is a Personal AI OS: a private intelligence layer that lives across your phone and your laptop, learns your day in the background, and gets ahead of you the way a chief of staff would.
Your phone knows your life. Your laptop knows your work. Today neither has the full picture. Off Grid unifies them into one working model of who you are and what you are doing. It syncs over your own network, never a cloud relay. It does not wait to be opened - it briefs you on the day, surfaces the item you left open, and drafts the reply before you remember you owe it.
Nothing is sent anywhere, because there is no server to send it to. It is open source, so you can check.
Why local AI matters
When you run a query on a cloud AI service - ChatGPT, Gemini, Claude - it’s logged on a server. Your prompt, the response, the time, your account. Stored indefinitely. Used to train future models. Subject to law enforcement requests. Readable by employees.
With Off Grid, none of that applies. The model runs in your phone’s memory. Inference happens on your CPU and GPU. Nothing is sent anywhere. Ever.
Privacy here isn’t a setting or a promise. It’s the default output of the architecture. The system has no mechanism to do otherwise - and because the code is open, anyone can verify it.
Get started
Guides
LLMs
- How to Run LLMs Locally on Your Android Phone in 2026
- How to Run LLMs Locally on Your iPhone in 2026
Image Generation
Vision, Voice and Documents
- Vision AI - Analyse Images and Documents On-Device
- Voice Input - On-Device Speech-to-Text with Whisper
- Document Analysis and Attachments
- Knowledge Base and RAG
Tools and Intelligence
Remote Servers
- Remote Servers - Connect Ollama, LM Studio, and LocalAI
- How to Use Ollama From Your Android Phone in 2026
- How to Use LM Studio From Your Android Phone in 2026
Community
Questions, feedback, and feature requests - join the Slack community.
Source code is open - star the repo on GitHub.