For Hermes Agent users

Your agents don't need
a centralized API.

Anthropic cut off third-party harnesses. But Hermes Agent supports OpenAI-compatible backends — and Teale gives you one for free. Tool use, chains, memory — everything keeps working.

3-step migration guide View source on GitHub

Switch Hermes Agent to Teale in 30 seconds

Install Teale, update your provider settings, resume your agent workflows.

Hermes Agent + Teale setup
1. Install Teale (one command)
$ curl -fsSL https://teale.com/install.sh | sh
# Installs the Teale app and starts a local API server
# Look for the brain icon in your menu bar
Teale installed. API running at http://localhost:11434
2. Update Hermes Agent settings
# In Hermes Agent → Settings → Provider:

# Provider: OpenAI-compatible
# Base URL: http://localhost:11434/v1
# API Key: teale (any non-empty string works locally)
# Model: llama-3.1-8b (or any model Teale has loaded)
3. Resume your agent workflows
# All Hermes features work with Teale's API:
# Tool use / function calling
# Chain-of-thought reasoning
# Persistent memory
# Multi-step agent loops

Ready. Agent connected to local Teale instance.
Running complex agent workflows that need larger models?
Teale's network serves 70B+ parameter models for $0.05 per million tokens.
See the network quickstart →

Everything Hermes needs, Teale provides

Teale's API supports the full OpenAI chat completions spec that Hermes Agent relies on.

🔧

Tool use

Full function calling support. Your agent's tools, APIs, and integrations work unchanged.

🔄

Streaming

Server-sent events streaming for real-time token output. Same format as OpenAI.

🧠

System prompts

Full system message support. Your agent personas, instructions, and constraints carry over.

🔗

Multi-turn context

Conversation history handled natively. Agent memory and chain-of-thought just work.

🌐

Network for big models

Need 70B+ models? Teale's P2P network pools compute from thousands of Macs for pennies.

🔒

Private by design

Local inference never leaves your Mac. Network traffic is end-to-end encrypted with Noise protocol.

Why agents shouldn't depend on a single provider

Hermes + Claude subscription

Single point of failure

  • Provider revokes access with no notice
  • $20/mo minimum, rate-limited
  • Agent workflows break when API changes
  • Your agent data flows through their servers
  • No control over model availability
Hermes + Teale

Resilient infrastructure

  • No one can revoke your access
  • $0 locally, $0.05/M tokens on network
  • Standard OpenAI API — stable, portable
  • E2E encrypted, your data stays yours
  • Multiple models, switch freely

Keep your agents running. Own the backend.

One install. One config change. Your Hermes agents are back online, on infrastructure that can't be taken away.

Install Teale View Source on GitHub

macOS 14+ · Apple Silicon · 100% free · 100% open source