I'm not great today. The week's been heavy.
Take your time. I'm here.
Heavy session. The week has been rough on her.
Don't bring up work for the next few days.
She left the call heavy. The agent flags work as a tender topic for the next few days.
Companions, coaches, tutors, eldercare, hardware that lives in users’ homes. Voice agents that need to remember, evolve, and stay in character for months. Bring your STT, LLM, and TTS — Hyponema holds the relationship.
How it actually works
Most voice agents start every conversation from zero. Yours pick up where the last one ended, with the right tone, the open thread surfaced, and the topics that should rest.
Be gentle. Last call was hard.
Ask about the weekend, not work.
Hi. How has the rest of the week landed?
Work stays out of the conversation. It is resting until Friday.
The agent opens differently. Because it knows.
her baseline lifted across 11 conversations
The agent walked with her.
Across eleven conversations, the agent moved with her. The relationship has weight.
Relational memory
Hyponema holds the relationship itself: the tone of last week’s call, the promise from a month ago, the topic that should rest until Friday.
How the engine works
Memory scoring: 0.4·recency + 0.3·frequency + 0.3·relevance. Health, family, and identity facts carry floors, so they never decay below 0.5 even after months of silence.
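As a sketch, the scoring rule above could be written like this (function, field, and category names are illustrative, not Hyponema's actual API):

```python
# Weighted memory score with floors for protected categories.
# Weights and the 0.5 floor come from the text; names are assumptions.
PROTECTED = {"health", "family", "identity"}
FLOOR = 0.5

def memory_score(recency: float, frequency: float, relevance: float,
                 category: str) -> float:
    """Each input is a normalized signal in [0, 1]."""
    score = 0.4 * recency + 0.3 * frequency + 0.3 * relevance
    if category in PROTECTED:
        # Protected facts never decay below the floor, however stale.
        score = max(score, FLOOR)
    return score
```

A health fact untouched for months still scores at least 0.5, while an ordinary fact with the same signals decays toward zero.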
Contradictions create explicit linked records with reasons. Old facts stay queryable. Consolidation merges near-duplicates at cosine ≥ 0.75.
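A minimal sketch of consolidation at cosine ≥ 0.75 (all names assumed; the real engine additionally keeps old facts queryable rather than discarding them):

```python
import math

def cosine(a: list[float], b: list[float]) -> float:
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb)

def consolidate(facts: list[dict], threshold: float = 0.75) -> list[dict]:
    """Greedily merge facts whose embeddings are near-duplicates."""
    merged: list[dict] = []
    for fact in facts:
        dup = next((m for m in merged
                    if cosine(m["emb"], fact["emb"]) >= threshold), None)
        if dup:
            # Near-duplicate: fold into the existing record, keep provenance.
            dup["sources"].append(fact["text"])
        else:
            merged.append({**fact, "sources": [fact["text"]]})
    return merged
```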
Longitudinal storylines are detected post-call (cosine ≥ 0.65). Lifecycle: ACTIVE → DORMANT after 60 days of silence → RECURRING.
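The lifecycle can be sketched as a small state transition (state and field names are illustrative, not the actual schema):

```python
from datetime import datetime, timedelta

DORMANCY_DAYS = 60  # from the text: storylines go dormant after 60 days

def next_state(state: str, last_mentioned: datetime,
               now: datetime, mentioned_today: bool) -> str:
    if mentioned_today:
        # A storyline that resurfaces after dormancy is flagged RECURRING.
        return "RECURRING" if state in ("DORMANT", "RECURRING") else "ACTIVE"
    if state == "ACTIVE" and now - last_mentioned > timedelta(days=DORMANCY_DAYS):
        return "DORMANT"
    return state
```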
Emotional drift compares a 14-day rolling window against a 60-day baseline. Pure tabular computation, no LLM call. Topic cooldowns auto-suppress sensitive topics for 3 days after a NEGATIVE / CONCERNED episode.
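The cooldown rule reduces to a simple filter. A sketch, assuming episodes are (topic, tone, ended_at) records (the shape is an assumption):

```python
from datetime import datetime, timedelta

COOLDOWN = timedelta(days=3)           # from the text
SENSITIVE_TONES = {"NEGATIVE", "CONCERNED"}

def active_cooldowns(episodes: list[tuple], now: datetime) -> set[str]:
    """Topics still suppressed: touched by a sensitive episode < 3 days ago."""
    return {
        topic
        for topic, tone, ended_at in episodes
        if tone in SENSITIVE_TONES and now - ended_at < COOLDOWN
    }
```

Anything in the returned set stays out of the next few conversations automatically.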
"Where did we leave off" snapshot — last episode tone, top open question, outstanding promises, recommended resumption tone. Injected at priority 0 of the system prompt.
The drift detector runs before each turn is emitted. Three verdicts: CONSISTENT (≥ 0.85 same-kind similarity) passes through, DRIFT (0.7–0.85) triggers a regeneration, and NEW becomes a new persona fact.
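The three-verdict split is just a threshold ladder. A sketch, using the thresholds from the text (the function name and similarity input are assumptions):

```python
def drift_verdict(similarity: float) -> str:
    """Classify a candidate reply's persona similarity before emitting it."""
    if similarity >= 0.85:
        return "CONSISTENT"   # let the turn through unchanged
    if similarity >= 0.7:
        return "DRIFT"        # regenerate the turn
    return "NEW"              # record a new persona fact
```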
Christina M.
User · 87 sessions
User memory
Latest reflection
Welcome back. Last time we left on derivatives. Want to keep going?
Yes, picking up where we left off.
Great. I'll keep the pace gentle since last session ended a bit late.
All six, on every conversation. Live in the dashboard.
Bring your own stack
Vendor-neutral by construction. The provider registry is the only switch — adding Sarvam, Azure, or any new model is a config addition, not an architectural change. Update one line of voice_stack JSON and the next session picks it up. No redeploy, no downtime.
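To illustrate the one-line switch (the voice_stack keys and model identifiers below are assumptions, not the actual schema):

```python
import json

# A hypothetical voice_stack: one entry per layer of the pipeline.
voice_stack = json.loads("""
{
  "stt": {"provider": "deepgram",  "model": "nova-3"},
  "llm": {"provider": "anthropic", "model": "claude-haiku-4-5"},
  "tts": {"provider": "cartesia",  "model": "sonic"}
}
""")

def swap_provider(stack: dict, layer: str, provider: str, model: str) -> dict:
    """Swapping a vendor is a config edit; the next session picks it up."""
    stack[layer] = {"provider": provider, "model": model}
    return stack

# Switch transcribers without touching the LLM or TTS layers.
swap_provider(voice_stack, "stt", "assemblyai", "universal-streaming")
```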
Join the waitlist
Pick the transcriber that fits your language and budget. Streaming on Deepgram and AssemblyAI; Whisper for batch. Cascading is independent of the LLM and TTS layers, with up to three retries per turn.
Pick the brain your agent actually needs — Anthropic Haiku 4.5 for fast, Sonnet 4.5 for balanced, Gemini for million-token context. If your primary has a bad minute, the next one in line takes the turn and the failover lands in the error_event log.
Pick the voice that holds the persona. Cartesia Sonic for speed, ElevenLabs Turbo for naturalness, OpenAI TTS for the cheap path. Pronunciation dictionaries (IPA / CMU) override the TTS layer per voice when needed.
From the dashboard, or from your code
Two ways to ship. A no-code Persona Builder for solo founders, clinicians, copywriters, and product teams. A TypeScript and Python SDK for engineers who want it inside their app. Same controls, same versioning, same guardrails.
See what Hyponema agents do in production
Common use cases people ship on Hyponema
Daily check-ins with patients in chronic care or post-op recovery. The agent remembers symptoms, medication, and last week’s setbacks.
Coaches that pick up after a 14-day pause without rebuilding rapport.
Sessions that resume from the last open question, not a blank slate.
Always-on devices in the Friend / Plaud / Limitless / Bee category that stay on-character across long conversations.
See it in motion
Every turn pre-emit-checked. Tool calls audited. Memory updated server-side at the end of the turn.
The Hyponema MCP server exposes the relational memory engine to any MCP-aware client — Claude Desktop, ElevenLabs, Vapi, Retell, custom agents. Its tools (retrieve_context, session_resume, narrative_arcs, emotional_trajectory, save_observation, forget_user) run over stdio or HTTP/SSE. Bring your own LLM key so reflection and consolidation run on your bill.
Reliable infrastructure
Postgres RLS for isolation. AES-256-GCM envelope encryption for provider credentials. Append-only audit log. The same controls run on every plan, including Free.
Tenant isolation lives in Postgres, not in our API. Every table carries tenant_id with a row-level security policy bound to the session — a bug in our app code can't leak rows.
Provider keys are AES-256-GCM envelope-encrypted: a per-credential DEK wrapped by a KMS-managed KEK. Plaintext lives in memory for the seconds a call lasts, then it's gone. Hyponema operators never see them.
Every persona edit, credential rotation, agent deploy, and DSAR lands in an append-only log with actor, IP, diff, and timestamp. JSON export through one endpoint — your tenant, your retention policy.