Listening interfaces from the ongoing research. Pick a session below.
→Blind A/B — 32 voices
All current candidates rendering the same JP passage. Engines · blind reveal · pipeline tabs.
→Emotion sweep — instruct mode
Qwen3-TTS vs VoxCPM2 across 5 emotions (calm / sad / happy / angry / anxious).
jparler_* — japanese-parler-tts-mini (Apache-2.0, description-steered).
outetts_en_female — OuteTTS v1.0 1B llama.cpp Metal, cross-lingual EN→JP.
voxcpm_instruct + voxcpm_8bit_clone — previously missing VoxCPM2 variants now in.
chatterbox_* — Resemble AI MIT.
sarashina22_clone — SB Intuitions/SoftBank NC, JP-first.