Local LLM Setup

Running a local LLM means your conversations never leave your machine. No API keys, no usage costs, and full offline support.

Ollama

Ollama is a lightweight tool for running open-source LLMs locally. Available on macOS, Linux, and Windows.

macOS

brew install ollama

Linux

curl -fsSL https://ollama.ai/install.sh | sh

Windows

Download the installer from ollama.ai and run it.

Download a model before you can use it:

ollama pull llama3.2

Other options worth trying: mistral, phi3, codellama.

ollama serve

This starts the Ollama API on http://localhost:11434.

Open Settings (gear icon)
Navigate to the Character tab
Enable the LLM toggle, then select Ollama from the provider dropdown
Select a model from the model dropdown (click the refresh icon to reload the list from your server)
Start chatting

LM Studio provides a GUI for downloading and running local models. Good option if you prefer not to use the terminal.

Download from lmstudio.ai and install it.

Open LM Studio and browse the built-in model catalog. Search for a model, click download, and wait for it to finish.

This starts an OpenAI-compatible API on http://localhost:1234.

Open Settings (gear icon)
Navigate to the Character tab
Enable the LLM toggle, then select LM Studio from the provider dropdown
Select a model from the model dropdown (click the refresh icon to reload the list from your server)
Start chatting

Model	Size	Best For	RAM Required
Llama 3.2 (3B)	~2GB	General chat, fast responses	8GB
Llama 3.1 (8B)	~4.7GB	Better quality responses	16GB
Mistral (7B)	~4.1GB	Good balance of speed and quality	16GB
Phi-3 (3.8B)	~2.3GB	Lightweight, efficient	8GB

Start with Llama 3.2 (3B) if you’re unsure. It runs well on most hardware and gives solid results for conversational use.

If you’re running the LLM server on a different machine or non-default port, enter the full URL in the provider settings. For example:

The LLM server isn’t running. Start it:

The port doesn’t match. Default ports:

Provider	Port
Ollama	11434
LM Studio	1234

Make sure the URL in Aikeya matches the port your server is using.

If you’re running Aikeya in a browser and getting CORS errors with Ollama, set the origins environment variable before starting the server:

OLLAMA_ORIGINS=* ollama serve

This isn’t needed when using the desktop app.