Custom Providers
Chat through your own endpoint and key.
Adding a provider
Add any OpenAI-compatible endpoint with a name, base URL, and API key. The provider and its key are stored only in your local database.
Custom models appear in the model picker alongside catalog models; a conversation can switch between them freely.
Models
Fetch the provider's model list automatically or add model IDs by hand. Each model is typed as text or image; image-typed models feed the in-chat image generator and do not appear in the chat dropdown.
Model entries carry their own settings, including the tokenizer used for budgeting.
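As a rough illustration of fetching a model list, the sketch below builds the request for the OpenAI-style model list endpoint (GET {baseUrl}/models, which returns a body with a data array of objects carrying an id) and turns the ids into typed entries. The names CustomModel, buildModelListRequest, parseModelIds, and toModelEntries are hypothetical, as is the choice of "auto" as the starting tokenizer; the app's actual schema and defaults are not documented here.

```typescript
// Hypothetical shapes for illustration; not the app's real schema.
type ModelKind = "text" | "image";

interface CustomModel {
  id: string;
  kind: ModelKind;   // "image" models feed the image generator, not the chat dropdown
  tokenizer: string; // a preset name such as "auto", or an "hf:" reference
}

interface ModelListRequest {
  url: string;
  headers: { [name: string]: string };
}

// Build the request for the OpenAI-style model list endpoint: GET {baseUrl}/models.
function buildModelListRequest(baseUrl: string, apiKey: string): ModelListRequest {
  const root = baseUrl.replace(/\/+$/, ""); // drop trailing slashes
  return { url: root + "/models", headers: { Authorization: "Bearer " + apiKey } };
}

// Pull string ids out of a { data: [{ id: "..." }, ...] } response body.
// Malformed items are skipped instead of throwing.
function parseModelIds(body: unknown): string[] {
  const data = (body as { data?: unknown } | null | undefined)?.data;
  if (!Array.isArray(data)) return [];
  const ids: string[] = [];
  for (const item of data) {
    const id = (item as { id?: unknown } | null | undefined)?.id;
    if (typeof id === "string" && id.length > 0) ids.push(id);
  }
  return ids;
}

// Turn ids into entries; ids listed in imageIds are typed as image models.
// Assumption: new entries start on the "auto" tokenizer preset.
function toModelEntries(ids: string[], imageIds: string[] = []): CustomModel[] {
  return ids.map((id): CustomModel => ({
    id,
    kind: imageIds.indexOf(id) !== -1 ? "image" : "text",
    tokenizer: "auto",
  }));
}

// In a browser this might be used as:
//   const req = buildModelListRequest(baseUrl, apiKey);
//   const res = await fetch(req.url, { headers: req.headers });
//   const models = toModelEntries(parseModelIds(await res.json()));
```

Keeping the request builder and the parser free of network calls makes both easy to test, and the same parser can back the add-by-hand path by passing the typed ids straight to toModelEntries.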
Per-model tokenizers
Token counting drives history trimming and lorebook budgets, so the right tokenizer matters for long chats. Every custom model picks from 13 presets:
auto cl100k o200k claude glm5 glm4 deepseek deepseek-v4 llama3 gemma qwen mistral cohere
Or point at any HuggingFace tokenizer by slug or URL:
hf:openai-community/gpt2
hf:https://huggingface.co/Qwen/Qwen2.5-7B/resolve/main/tokenizer.json
HuggingFace tokenizers download once and cache locally. Counting never blocks a send: if a tokenizer fails to load, an approximate fallback counts instead.
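The tokenizer setting and the fallback behaviour can be sketched roughly as follows. This is not the app's implementation: parseTokenizerSpec, countTokens, and approximateCount are hypothetical names, the slug-to-URL rule is an assumption modelled on the URL example above (tokenizer.json on the main branch), and the four-characters-per-token estimate is a common heuristic, not necessarily the fallback the app uses.

```typescript
// The 13 presets listed above.
const PRESETS = [
  "auto", "cl100k", "o200k", "claude", "glm5", "glm4", "deepseek",
  "deepseek-v4", "llama3", "gemma", "qwen", "mistral", "cohere",
];

type TokenizerSpec =
  | { kind: "preset"; name: string }
  | { kind: "hf"; url: string };

// Parse a tokenizer setting into a preset or a HuggingFace tokenizer URL.
// Returns null for anything that is neither.
function parseTokenizerSpec(spec: string): TokenizerSpec | null {
  const s = spec.trim();
  if (PRESETS.indexOf(s) !== -1) return { kind: "preset", name: s };

  const match = /^hf:(.+)$/.exec(s);
  if (!match) return null;
  const rest = match[1];

  // Full URL: use it as given.
  if (/^https?:\/\//.test(rest)) return { kind: "hf", url: rest };

  // Assumption: an "owner/repo" slug resolves to tokenizer.json on main.
  if (/^[\w.-]+\/[\w.-]+$/.test(rest)) {
    return { kind: "hf", url: "https://huggingface.co/" + rest + "/resolve/main/tokenizer.json" };
  }
  return null;
}

type Counter = (text: string) => number;

// Assumption: roughly four characters per token as a stand-in estimate.
const approximateCount: Counter = (text) => Math.ceil(text.length / 4);

// Count with the loaded tokenizer when there is one; if it is missing or
// throws, use the approximation so a send is never blocked on counting.
function countTokens(text: string, loaded: Counter | null): number {
  if (loaded) {
    try {
      return loaded(text);
    } catch (_err) {
      // fall through to the approximation
    }
  }
  return approximateCount(text);
}
```

For example, parseTokenizerSpec("hf:openai-community/gpt2") yields a URL under that repository, and countTokens(text, null) returns an estimate immediately while a tokenizer is still downloading or has failed to load.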
Billing group
Catalog models can route through a billing group that seeds new chats. Custom-provider models bypass routing entirely, so the group has no effect on them.
Privacy
Requests to custom providers go straight from your browser to your endpoint. UnoRouter servers never see the request, the reply, or your provider key.
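To make the direct path concrete, here is a minimal sketch of how a browser client might build a chat request addressed to the provider's own endpoint. It assumes the standard OpenAI-compatible route POST {baseUrl}/chat/completions with a Bearer key; CustomProvider, ChatMessage, and buildChatRequest are hypothetical names, not the app's actual code.

```typescript
// Hypothetical provider record, mirroring the name, base URL, and API key
// entered when adding a provider.
interface CustomProvider {
  name: string;
  baseUrl: string;
  apiKey: string;
}

interface ChatMessage {
  role: "system" | "user" | "assistant";
  content: string;
}

interface DirectRequest {
  url: string;
  init: {
    method: string;
    headers: { [name: string]: string };
    body: string;
  };
}

// Build a request whose URL is derived only from the provider's base URL,
// so the key and the messages are sent to that endpoint and nowhere else.
function buildChatRequest(
  provider: CustomProvider,
  model: string,
  messages: ChatMessage[],
): DirectRequest {
  const root = provider.baseUrl.replace(/\/+$/, ""); // drop trailing slashes
  return {
    url: root + "/chat/completions",
    init: {
      method: "POST",
      headers: {
        "Content-Type": "application/json",
        Authorization: "Bearer " + provider.apiKey,
      },
      body: JSON.stringify({ model, messages }),
    },
  };
}

// In a browser this might be sent as:
//   const req = buildChatRequest(provider, "my-model", messages);
//   const res = await fetch(req.url, req.init);
```

Because the URL comes straight from the stored base URL, there is no intermediate hop in this sketch where a third party could observe the request, the reply, or the key.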