Hyponema
Voice AI for relationships

Voice agents built for years.

Companions, coaches, tutors, eldercare, hardware that lives in users’ homes. Voice agents that need to remember, evolve, and stay in character for months. Bring your STT, LLM, and TTS — Hyponema holds the relationship.

No spam. Early access invites only.

How it actually works

Tuesday's call. Thursday's call. The same agent, only better.

Most voice agents start every conversation from zero. Yours pick up where the last one ended, with the right tone, the open thread surfaced, and the topics that should rest.

Tue, Apr 15
session.live
User

I'm not great today. The week's been heavy.

Agent

Take your time. I'm here.

session note

Heavy session. The week has been rough on her.

topic at rest

Don't bring up work for the next few days.

She left the call heavy. The agent flags work as a tender topic for the next few days.

Thu, Apr 17
session.start
how to open

Be gentle. Last call was hard.

open thread

Ask about the weekend, not work.

Agent

Hi. How has the rest of the week landed?

Work stays out of the conversation. It is resting until Friday.

The agent opens differently. Because it knows.

Six weeks later
emotional baseline
tone drift−0.4 → −0.1
day 1week 6

her baseline lifted across 11 conversations

The agent walked with them.

Across eleven conversations, the agent moved with her. The relationship has weight.

Relational memory

Memory built for relationships. The shape of the conversation, kept across months.

Hyponema holds the relationship itself: the tone of last week’s call, the promise from a month ago, the topic that should rest until Friday.

How the engine works
  • 01

    It remembers what matters

    Importance scoring

    0.4·recency + 0.3·frequency + 0.3·relevance. Health, family, identity carry floors so they never decay below 0.5 even after months of silence.

  • 02

    Nothing silently overwritten

    Supersede chains

    Contradictions create explicit linked records with reasons. Old facts stay queryable. Consolidation merges near-duplicates at cosine ≥ 0.75.

  • 03

    It tracks the storyline

    Narrative arcs

    Longitudinal storylines detected post-call (cosine ≥ 0.65). Lifecycle: ACTIVE → DORMANT (after 60 d silence) → RECURRING.

  • 04

    It feels the shift before you do

    Emotional trajectory

    14-day rolling vs 60-day baseline drift. Pure tabular computation, no LLM call. Topic cooldowns auto-suppress sensitive users for 3 days after a NEGATIVE / CONCERNED episode.

  • 05

    Every call picks up where the last ended

    Session resume

    "Where did we leave off" snapshot — last episode tone, top open question, outstanding promises, recommended resumption tone. Injected at priority 0 of the system prompt.

  • 06

    The agent never contradicts itself

    Persona consistency

    Drift detector pre-emit. Three verdicts: CONSISTENT (≥ 0.85 same-kind) lets it through, DRIFT (0.7–0.85) regenerates, NEW becomes a new persona fact.

Christina M.

User · 87 sessions

User memory

Last topicLimits
Cooldowns2 active
Open threadWalk through derivatives

Latest reflection

Christina is a software developer building reliable distributed systems. Engaged · curious. Last call ended on a promised next step about derivatives.
Aria · AI coachAgent

Welcome back. Last time we left on derivatives. Want to keep going?

ChristinaUser

Yes, picking up where we left off.

Aria · AI coachAgent

Great. I'll keep the pace gentle since last session ended a bit late.

All six, on every conversation. Live in the dashboard.

Bring your own stack

Pick the providers you trust. Swap any of them per session.

Vendor-neutral by construction. The provider registry is the only switch — adding Sarvam, Azure, or any new model is a config addition, not an architectural change. Update one line of voice_stack JSON and the next session picks it up. No redeploy, no downtime.

Join the waitlist
session.live
00:24
[user] Hey Aria, how was your run?
deepgramstreaming · 16kHz
01

Listen

Pick the transcriber that fits your language and budget. Streaming on Deepgram and AssemblyAI; Whisper for batch. Cascading is independent of the LLM and TTS layers, with up to three retries per turn.

  • Deepgram
  • AssemblyAI
  • Whisper
cascade.llm3-deep
  • anthropicclaude-sonnet-4-5
    primary420ms
  • openaigpt-4o
    fallbackidle
  • groqllama-3.1-70b
    fallbackidle
turn_4f2astreaming
02

Think

Pick the brain your agent actually needs — Anthropic Haiku 4.5 for fast, Sonnet 4.6 for balanced, Gemini for million-token context. If your primary has a bad minute, the next one in line takes the turn and the failover lands in the error_event log.

  • Anthropic
  • OpenAI
  • Gemini
tts.elevenlabsturn_4f2a
queued12ms
synthesizing180ms
streamed380ms
200TTFT 180ms · 24kHz · pcm
03

Speak

Pick the voice that holds the persona. Cartesia sonic for speed, ElevenLabs turbo for naturalness, OpenAI TTS for the cheap path. Pronunciation dictionaries (IPA / CMU) override the TTS layer per voice when needed.

  • Cartesia
  • ElevenLabs
  • OpenAI

From the dashboard, or from your code

Configure once. Talk to people anywhere.

  • Shape the agent in the dashboard, or push it from the SDK. Tone, voice, guardrails, quiet hours. Every save is a version you can roll back in one click.01
  • Your provider keys, encrypted with your name on them. Used only for the seconds a call lasts. Free tier runs on your bills, not ours.02
  • Ship to phone, web, SDK, or hardware. Embeddable widget under 50 KB. New channels coming on the same primitives.03

Two ways to ship. A no-code Persona Builder for solo founders, clinicians, copywriters, and product teams. A TypeScript and Python SDK for engineers who want it inside their app. Same controls, same versioning, same guardrails.

See what Hyponema agents do in production

Common use cases people ship on Hyponema

Health journey companions

Daily check-ins with patients in chronic care or post-op recovery. The agent remembers symptoms, medication, and last week’s setbacks.

  • 80% of users come back next week
  • Quiet hours by time zone

Wellness coaching

Coaches that pick up after a 14-day pause without rebuilding rapport.

  • Tone of the relationship tracked
  • Sensitive topics rest after a hard call

AI tutors

Sessions that resume from the last open question, not a blank slate.

  • Last open thread surfaced
  • Promised next step always remembered

Hardware companions

Always-on devices in the Friend / Plaud / Limitless / Bee category that stay on-character across long conversations.

  • The agent never contradicts itself
  • Every change auditable

See it in motion

The dashboard, the engine, and the agent, all in one frame.

aria · v3 · production
session_4f2a · live

Every turn pre-emit-checked. Tool calls audited. Memory updated server-side at the end of the turn.

Already on ElevenLabs, Vapi, or Retell?

Plug the memory engine in. Keep your stack.

The Hyponema MCP server exposes the relational memory engine to any MCP-aware client — Claude Desktop, ElevenLabs, Vapi, Retell, custom agents. Seven tools (retrieve_context, session_resume, narrative_arcs, emotional_trajectory, save_observation, forget_user) over stdio or HTTP/SSE. Bring your own LLM key so reflection and consolidation run on your bill.

Reliable infrastructure

Designed for scale

Postgres RLS for isolation. AES-256-GCM envelope encryption for provider credentials. Append-only audit log. The same controls run on every plan, including Free.

memory.factsRLS · enforced
SELECT * FROM memory.facts
WHERE tenant_id = 'acme-7f3a'
AND category IN ('health', 'family')
ERRORcross-tenant access denied
current_setting('memory.current_tenant')

RLS-enforced isolation

Tenant isolation lives in Postgres, not in our API. Every table carries tenant_id with a row-level security policy bound to the session — a bug in our app code can't leak rows.

credentials.openaiacme-7f3a
  • DEKper tenant
    wrapped
  • KEKper host
    wrapped
  • KMScloud-managed
    wrapped
in-memory onlycached 60s · operator-blind

Operator-blind credentials

Provider keys are AES-256-GCM envelope-encrypted: a per-credential DEK wrapped by a KMS-managed KEK. Plaintext lives in memory for the seconds a call lasts, then it's gone. Hyponema operators never see them.

audit_log4 events · 1m
  • 14:22:01alice@acmepersona.updatearia/v3
  • 14:22:18alice@acmecredential.rotateopenai
  • 14:23:04bob@acmeagent.createsupport-bot
  • 14:24:12alice@acmepersona.deployaria → prod
diff · IP · actorretained per tenant policy

Append-only audit log

Every persona edit, credential rotation, agent deploy, and DSAR lands in an append-only log with actor, IP, diff, and timestamp. JSON export through one endpoint — your tenant, your retention policy.

Voice agents built for years, not minutes.

No spam. Early access invites only.