Bring GPT-4o-Like Multimodal AI to Microcontrollers.

Vision + sound, fully offline — private, battery-friendly, OEM-ready. 

VWW (Visual Wake Word) — IP block

  • Detects human or object presence locally on-device.
  • Ideal for event-driven activation of cameras and devices.

SoundWW (Sound Wake Word) — IP block

  • Offline always-on detection of keywords or sound events.
  • Enables low-power audio triggers.

Fusion logic — custom integration

  • Combine VWW + SoundWW with AND/OR/priority logic.
  • Reduce false alerts, improve safety, save energy.
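
To make the AND/OR/priority idea concrete, here is a minimal host-side sketch of how two detector outputs could be combined. It is an illustration only, not the shipped IP: the names `FusionMode` and `fuse` are hypothetical, and the "priority" rule shown (vision decides, sound is used only when vision has no result) is one assumed interpretation among several possible ones.

```python
from enum import Enum


class FusionMode(Enum):
    AND = "and"                          # both detectors must fire
    OR = "or"                            # either detector may fire
    VISION_PRIORITY = "vision_priority"  # assumed rule: vision decides when available


def fuse(vww, soundww, mode):
    """Combine two detector outputs into one trigger decision.

    vww and soundww are True (fired), False (did not fire),
    or None (sensor off or no result yet).
    """
    if mode is FusionMode.AND:
        return vww is True and soundww is True
    if mode is FusionMode.OR:
        return vww is True or soundww is True
    if mode is FusionMode.VISION_PRIORITY:
        # Fall back to sound only when vision produced no result at all.
        return bool(vww) if vww is not None else bool(soundww)
    raise ValueError(f"unknown fusion mode: {mode!r}")
```

AND mode is the one that cuts false alerts, since a single noisy sensor cannot trigger alone; OR mode favors sensitivity; the priority mode lets one sensor sleep while the other is authoritative.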

Customization Options

Platforms We Support

Applications

Smart cameras & doorbells

Record and notify only when both vision and sound triggers agree.

Robots (AMRs, service, consumer)

Act on commands only when operator presence is confirmed visually.

Access systems

Verify presence with VWW and validate offline credentials (VoicePIN or ALPR).
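
In applications like the smart camera case, the two triggers rarely fire at the same instant, so "agree" usually means "fire within a short time of each other". The sketch below shows one way that check could look. The function name `agreeing_events` and the 2-second default window are assumptions chosen for illustration, not product parameters.

```python
def agreeing_events(vision_ts, sound_ts, window_s=2.0):
    """Pair vision and sound triggers that occur within window_s seconds.

    Both inputs are lists of timestamps in seconds, sorted ascending.
    Returns a list of (vision_time, sound_time) tuples.
    """
    pairs = []
    j = 0
    for v in vision_ts:
        # Skip sound events too old to match this or any later vision event.
        while j < len(sound_ts) and sound_ts[j] < v - window_s:
            j += 1
        # The first remaining sound event matches if it is close enough.
        if j < len(sound_ts) and abs(sound_ts[j] - v) <= window_s:
            pairs.append((v, sound_ts[j]))
    return pairs


events = agreeing_events([1.0, 10.0, 20.0], [2.5, 19.0, 30.0])
# events == [(1.0, 2.5), (20.0, 19.0)]
```

Only the paired events would start a recording or send a notification; the vision trigger at 10.0 s has no nearby sound event and is dropped.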

OEM Benefits

DIY

Battery-Powered Edge AI Module (Himax HX6538)

A hardware platform for multimodal PoCs.

For OEMs, these demos show how multimodal triggers reduce false alerts and enable premium offline devices.

How We Work

  1. Evaluate — define target sounds, access scenarios, hardware.
  2. PoC — 2–3 weeks, fixed scope, measurable results.
  3. License & Integrate — adapt IP for production; PoC fee credited.

Deliverables

Licensing

No lock-in: portable IP, stable APIs, integration code stays with OEM.

Ready to make your devices both see and hear?

Launch multimodal PoCs in weeks and unlock Pro features offline.