Introduction

AvatarLayer is a pluggable TypeScript SDK for realtime conversational avatars. It provides a clean provider model for LLM, TTS, STT, and avatar rendering, supporting local 3D avatars (VRM, Live2D), remote video avatar services (LemonSlice, Atlas, HeyGen), voice input, persistent memory, character cards, and on-device ML, all behind a single unified interface.

The pipeline

Every conversational turn follows the same flow:

User text / voice → LLM stream → sentence split → TTS → renderer.speak()

When voice input is enabled, the pipeline extends to:

Mic → RealtimeSTT → transcript → (barge-in or sendMessage) → LLM → TTS → renderer

AvatarSession handles streaming, sentence segmentation, interruption, voice activity detection, memory recall, and state transitions automatically. You plug in the providers you want and the SDK does the rest.
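The sentence-split stage is what lets TTS start before the LLM finishes responding. The following is a minimal standalone sketch of that idea, not the SDK's internal implementation: buffer streamed tokens and emit each complete sentence as soon as a boundary appears.

```typescript
// Sketch of the sentence-split stage (illustration only, not SDK code):
// scan the buffer for sentence boundaries and return complete sentences
// plus the unfinished remainder to carry into the next chunk.
function splitSentences(buffer: string): { sentences: string[]; rest: string } {
  const sentences: string[] = [];
  // A sentence ends with ., !, or ? followed by whitespace (or end of buffer).
  const re = /[^.!?]*[.!?]+(?:\s+|$)/g;
  let lastIndex = 0;
  let m: RegExpExecArray | null;
  while ((m = re.exec(buffer)) !== null) {
    sentences.push(m[0].trim());
    lastIndex = re.lastIndex;
  }
  return { sentences, rest: buffer.slice(lastIndex) };
}

// Simulate an LLM response arriving in streamed chunks.
const chunks = ["Hi! I am an ava", "tar. Ask me any", "thing."];
let pending = "";
const spoken: string[] = [];
for (const chunk of chunks) {
  pending += chunk;
  const { sentences, rest } = splitSentences(pending);
  spoken.push(...sentences); // each complete sentence would go to TTS here
  pending = rest;
}
```

Each entry in `spoken` can be handed to TTS immediately, which is why the avatar starts talking mid-stream instead of waiting for the full response.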

Install

One-line install

AvatarLayer is published on npm and works with any Node.js package manager.

npm install avatarlayer

Quick start

import {
  AvatarSession,
  OpenAIAdapter,
  ElevenLabsAdapter,
  VRMLocalRenderer,
} from "avatarlayer";

const session = new AvatarSession({
  llm: new OpenAIAdapter({ apiKey: "sk-...", model: "gpt-5.4-mini" }),
  tts: new ElevenLabsAdapter({ apiKey: "...", voiceId: "21m00Tcm4TlvDq8ikWAM" }),
  renderer: new VRMLocalRenderer({ modelUrl: "/models/avatar.vrm" }),
  systemPrompt: "You are a helpful avatar assistant.",
});

await session.start(document.getElementById("avatar-container")!);
await session.sendMessage("Hello! Tell me about yourself.");

Key features

13+ LLM adapters

OpenAI, Anthropic, Gemini, Groq, DeepSeek, Mistral, xAI, OpenRouter, Together, Fireworks, Azure OpenAI, Ollama, and Chrome Prompt API.

Multiple avatar backends

Local 3D (VRM, Live2D), remote video (LemonSlice, Atlas, HeyGen). One AvatarRenderer interface, swap at runtime.

Voice input

Realtime STT with barge-in, VAD, and mic capture. Deepgram, ElevenLabs, Azure Speech, Amazon Transcribe, WebSpeech, or local Whisper.
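Enabling voice means adding an STT provider to the session config from the quick start. The `stt` option key, the `DeepgramAdapter` export, and its constructor options below are assumptions for illustration; check the adapter reference for the exact names.

```typescript
import {
  AvatarSession,
  OpenAIAdapter,
  ElevenLabsAdapter,
  VRMLocalRenderer,
} from "avatarlayer";
// Hypothetical STT adapter import -- consult the adapter reference for the real export.
import { DeepgramAdapter } from "avatarlayer";

const session = new AvatarSession({
  llm: new OpenAIAdapter({ apiKey: "sk-...", model: "gpt-5.4-mini" }),
  tts: new ElevenLabsAdapter({ apiKey: "...", voiceId: "21m00Tcm4TlvDq8ikWAM" }),
  stt: new DeepgramAdapter({ apiKey: "..." }), // hypothetical option shape
  renderer: new VRMLocalRenderer({ modelUrl: "/models/avatar.vrm" }),
  systemPrompt: "You are a helpful avatar assistant.",
});
```

With an STT provider in place, mic transcripts flow into the same pipeline as typed messages, and speech during avatar playback triggers barge-in.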

Memory and threads

Persist conversations across sessions with pluggable thread providers. Semantic recall via vector embeddings for long-term context.
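Conceptually, semantic recall ranks stored messages by embedding similarity to the current query and injects the best matches into the LLM context. The sketch below illustrates the ranking step with plain cosine similarity; it is not the SDK's implementation.

```typescript
// Illustration of semantic recall (not SDK code): rank stored memories by
// cosine similarity between embedding vectors, return the top-K texts.
function cosine(a: number[], b: number[]): number {
  let dot = 0, na = 0, nb = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    na += a[i] * a[i];
    nb += b[i] * b[i];
  }
  return dot / (Math.sqrt(na) * Math.sqrt(nb));
}

function recall(
  query: number[],
  memories: { text: string; embedding: number[] }[],
  topK: number,
): string[] {
  return [...memories]
    .sort((x, y) => cosine(query, y.embedding) - cosine(query, x.embedding))
    .slice(0, topK)
    .map((m) => m.text);
}
```

In the real pipeline the embeddings come from an embedding provider (local or remote), and the recalled texts are prepended to the conversation before the LLM call.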

Character cards

Load V3 character cards from PNG or JSON. Lorebook, personality, scenario, and message examples — all structured.

Emotions

Inline emotion markers in LLM output. Automatic expression mapping for VRM and Live2D renderers.
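To make the marker idea concrete, here is a standalone sketch of extracting emotion tags from LLM output. The `[emotion]` bracket syntax is an assumed format for illustration; see the emotions guide for the exact marker grammar the SDK uses.

```typescript
// Sketch of emotion-marker extraction (assumed [emotion] syntax, not SDK code):
// strip markers from the text destined for TTS and collect them so the
// renderer can map each one to a VRM/Live2D expression.
function extractEmotions(text: string): { clean: string; emotions: string[] } {
  const emotions: string[] = [];
  const clean = text
    .replace(/\[(\w+)\]/g, (_match, name: string) => {
      emotions.push(name);
      return "";
    })
    .replace(/\s+/g, " ")
    .trim();
  return { clean, emotions };
}
```

Separating the spoken text from the markers matters: TTS should never read "happy" aloud, while the renderer needs the marker at the right point in the utterance.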

React bindings

AvatarProvider, useAvatarSession, AvatarView, and useMic — drop an avatar into any React app in minutes.

Interruptible pipeline

Cancel LLM streaming, TTS synthesis, and avatar speech at any point with a single interrupt() call.
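One way to see how a single call can stop every stage is the shared-signal pattern: all pipeline stages observe one AbortSignal, so aborting the controller cancels them together. This is a conceptual sketch of that pattern, not the SDK's internals.

```typescript
// Conceptual sketch (not SDK code): every stage checks one shared AbortSignal,
// so a single abort() stops LLM streaming, TTS, and playback together.
async function runPipeline(tokens: string[], signal: AbortSignal): Promise<string[]> {
  const spoken: string[] = [];
  for (const token of tokens) {
    if (signal.aborted) break; // each stage bails out as soon as the signal fires
    spoken.push(token);        // stand-in for TTS synthesis + renderer.speak()
    await new Promise((resolve) => setTimeout(resolve, 5));
  }
  return spoken;
}

const controller = new AbortController();
// Elsewhere (e.g. on barge-in), one call interrupts everything:
// controller.abort();
```

In the SDK, `interrupt()` plays the role of `controller.abort()` here: it is the single point that tears down all in-flight work.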

Local ML

Run TTS (Kokoro, Kitten), STT (Whisper), VAD (Silero), and embeddings entirely on-device via WebGPU / WASM.

Vision

Video input with periodic vision workloads: screen interpretation, OCR, and UI-automation context injected into the LLM.


Custom adapters

Implement any provider interface to add new LLMs, TTS engines, STT services, or renderers.
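As a shape for what a custom adapter looks like, here is a minimal sketch of a do-nothing TTS adapter. The interface (`synthesize()` returning audio bytes) is an assumption for illustration; match the actual TTS provider interface exported by the SDK.

```typescript
// Sketch only: an assumed minimal TTS interface, not the SDK's real one.
interface TTSLike {
  synthesize(text: string): Promise<ArrayBuffer>;
}

// A silent adapter: returns empty audio. Useful as a test double or a
// "muted" mode while the rest of the pipeline runs normally.
class SilentTTS implements TTSLike {
  async synthesize(_text: string): Promise<ArrayBuffer> {
    return new ArrayBuffer(0);
  }
}
```

The same pattern applies to LLM, STT, and renderer providers: implement the interface, then pass an instance into the `AvatarSession` config in place of a built-in adapter.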

Avatar control schema

Fine-grained control over face, emotion, body, and scene via the avatar-runtime v0.2 contract.

Next steps