
Run Local LLMs on iPhone and Mac

Run GGUF models on-device with no API keys and no cloud round trip for inference. Private, offline-first chat.

On-device LLM chat without a subscription

Endemic is for people who want local inference on iPhone, iPad, and Mac: download a quantized model, keep prompts and completions on your hardware, and use the app without wiring up cloud API keys.

It is not a replacement for ChatGPT or Claude in the browser. It is an offline-first option when you want the model running on the device you already own.


Built by Peter. Bootstrapper in Beaverton.