___ ___ ________ ___ ___ ________ ___ ___ _______ ________ ________ ________
|\ \ / /||\ __ \ |\ \ / /|\ ____\|\ \|\ \|\ ___ \ |\ __ \|\ __ \|\ __ \
\ \ \ / / /\ \ \|\ \ \ \ \/ / | \ \___|\ \ \\\ \ \ __/|\ \ \|\ \ \ \|\ \ \ \|\ \
\ \ \/ / / \ \ \\\ \ \ \ / / \ \_____ \ \ __ \ \ \_|/_\ \ _ _\ \ ____\ \ __ \
\ \ / / \ \ \\\ \ / \/ \|____|\ \ \ \ \ \ \ \_|\ \ \ \\ \\ \ \___|\ \ \ \ \
\ \__/ / \ \_______\/ /\ \ ____\_\ \ \__\ \__\ \_______\ \__\\ _\\ \__\ \ \__\ \__\
\|__|/ \|_______/__/ /\ __\ |\_________\|__|\|__|\|_______|\|__|\|__|\|__| \|__|\|__|
|__|/ \|__| \|_________|
Offline Neural TTS · Android · No Cloud · No Limits
VoxSherpa TTS runs studio-quality neural text-to-speech entirely on your Android device.
Powered by Sherpa-ONNX — supports Kokoro-82M, Piper, and VITS engines.
Hindi, English, Japanese, Chinese and 50+ languages — zero internet required.
[whisper] [angry] [happy] tags. 100+ Kokoro voices.82 million parameter neural model. Same architecture powering top commercial TTS — running fully offline on your phone.
Lightweight and fast. Generates natural speech in seconds on any Android device — perfect for daily use.
| Device Tier | Kokoro-82M | Piper / VITS |
|---|---|---|
| ● Flagship (SD 8 Gen 3) | ~20–40 sec / min audio | ~5 sec / min audio |
| ● Mid-range (8-core) | ~60–90 sec / min audio | ~10 sec / min audio |
| ● Budget (6-core) | ~2–3 min / min audio | ~20 sec / min audio |
Found a bug? → Open an Issue