Bring Cloud-Standard Voice AI to Microcontrollers.
Cloud-standard voice features — now offline on MCU: ASR, TTS, and Small LM.
ASR (Automatic Speech Recognition) — IP block
- Available: English.
- Customization: new languages, domain-specific vocabularies, constrained grammars.
TTS (Text-to-Speech) — IP block
- Available: English.
- Customization: new languages, custom voices (tone, style, brand identity).
Small LM (Lightweight Language Models)
- Ready IP: TinyStories (EN) — pre-trained, fully offline storytelling.
- Custom Small LMs — tailored for intents, Q&A, dialogs, or new languages.
Customization Options
- Add new languages and domain vocabularies.
- Design voice flows with clarifications and confirmations.
- Create custom TTS voices.
- Optimize for RAM, Flash, and latency on your platform.
Platforms We Support
- ESP32 (incl. ESP32-S3) → fast PoC and cost-sensitive devices.
- Arm® Cortex™-M + Arm® Ethos™-U55 / other AI SoCs → high performance, energy-efficient production.
- Migration path: start on ESP32 → scale to Arm® Ethos™-U55.
Applications
Smart appliances
Natural dialogs instead of rigid commands:
User: “Start cooking.” → Device: “For how many minutes?” → User: “Ten.” → Device (TTS): “Cooking started, 10 minutes remaining.”
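The dialog above can be modeled as a small slot-filling state machine. The sketch below is illustrative only: the intent names, the `dialog_t` type, the `dialog_step` function, and the fallback replies are hypothetical and not part of any shipped API. It assumes an upstream ASR + intent stage has already turned speech into an intent and, where present, a number.

```c
#include <stdio.h>

/* Hypothetical intents, assumed to come from an ASR + intent stage. */
typedef enum { INTENT_START_COOKING, INTENT_NUMBER, INTENT_UNKNOWN } intent_t;

typedef enum { STATE_IDLE, STATE_AWAIT_MINUTES, STATE_COOKING } state_t;

typedef struct {
    state_t state;
    int minutes;
} dialog_t;

/* Advance the dialog by one user turn and write the reply to speak
 * (for example via TTS) into `reply`. `value` is only read for
 * INTENT_NUMBER. */
void dialog_step(dialog_t *d, intent_t intent, int value,
                 char *reply, size_t reply_len)
{
    switch (d->state) {
    case STATE_IDLE:
        if (intent == INTENT_START_COOKING) {
            d->state = STATE_AWAIT_MINUTES;
            snprintf(reply, reply_len, "For how many minutes?");
            return;
        }
        break;
    case STATE_AWAIT_MINUTES:
        if (intent == INTENT_NUMBER && value > 0) {
            d->minutes = value;
            d->state = STATE_COOKING;
            snprintf(reply, reply_len,
                     "Cooking started, %d minutes remaining.", value);
            return;
        }
        /* Clarification: stay in the same state and ask again. */
        snprintf(reply, reply_len, "Sorry, for how many minutes?");
        return;
    case STATE_COOKING:
        break;
    }
    /* Anything not handled above gets a generic fallback reply. */
    snprintf(reply, reply_len, "Sorry, I did not understand.");
}
```

Keeping the flow in an explicit state variable means a clarification is simply a turn that leaves the state unchanged, which keeps memory use small and predictable on an MCU.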
Toys & EdTech
Interactive learning and safe offline storytelling (AI Teacher, TinyStories).
Access control
ASR used for offline spoken numeric PIN entry (VoicePIN).
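One way to picture spoken PIN entry is as a digit-only vocabulary whose recognized words are mapped to digits and compared with an expected PIN. The sketch below is an assumption about how such a check could look, not a description of VoicePIN itself: `DIGIT_WORDS`, `word_to_digit`, and `voice_pin_matches` are hypothetical names, and a real product would likely keep the PIN in protected or hashed form rather than as plain digits.

```c
#include <stddef.h>
#include <string.h>

/* Digit vocabulary assumed for a constrained ASR grammar (hypothetical). */
static const char *const DIGIT_WORDS[10] = {
    "zero", "one", "two", "three", "four",
    "five", "six", "seven", "eight", "nine"
};

/* Map one recognized word to its digit, or -1 if it is not a digit word. */
int word_to_digit(const char *word)
{
    for (int i = 0; i < 10; i++) {
        if (strcmp(word, DIGIT_WORDS[i]) == 0) {
            return i;
        }
    }
    return -1;
}

/* Return 1 if the recognized words spell the expected PIN, else 0.
 * Mismatches are accumulated instead of returned early, so the loop
 * does not stop at the first wrong digit. */
int voice_pin_matches(const char *const words[], size_t n_words,
                      const unsigned char *pin, size_t pin_len)
{
    if (n_words != pin_len) {
        return 0;
    }
    unsigned diff = 0;
    for (size_t i = 0; i < n_words; i++) {
        int digit = word_to_digit(words[i]);
        diff |= (unsigned)(digit < 0);       /* word outside the vocabulary */
        diff |= ((unsigned)digit ^ pin[i]);  /* wrong digit */
    }
    return diff == 0;
}
```

Restricting the recognizer to ten digit words is what makes this practical offline: any word outside the vocabulary is rejected rather than guessed.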
OEM Benefits
- Faster time-to-market — PoC in weeks, not months.
- Lower OPEX — no recurring cloud costs, fewer complaints.
- Minimal BOM increase — add offline voice with negligible hardware cost.
- Pro SKU differentiation — premium devices with offline voice features.
- De-risked adoption — tested by the DIY community before reaching OEM scale.
DIY Proof (Social Proof)

The VoxControl Kit is our DIY product: three apps showing how one ASR + Intent stack enables multiple use cases.
For OEMs, it is proof that one stack can power very different devices, from toys to appliances to security systems.
How We Work
Evaluate → Pilot (PoC) → License & Integrate
- Evaluate — define target sounds, access scenarios, hardware.
- PoC — 2–3 weeks, fixed scope, measurable detection rates and latency.
- License & Integrate — adapt IP for your product; PoC fee credited to license.
Deliverables
- ASR & TTS (EN) IP binaries, APIs, integration guides.
- TinyStories (EN) ready-to-use IP.
- Optional: custom Small LMs, language packs, TTS voices.
- Metrics pack (latency, energy, memory).
- Demo video on your device.
Licensing
- PoC license — quick validation.
- Project license — tuned models, integration support.
- Volume license — mass deployment.
No lock-in: portable IP, stable APIs, and integration code that stays with the OEM.
Ready to add offline dialogs to your devices?
Start with TinyStories or request a PoC tailored to your product.