Listen
Mic array. 16 kHz stream. VAD-gated.
Wearable, on-device, bilingual
Sound, made visible.
Live captions, severity-tiered alerts, and wrist haptics. All inference runs locally on the device.
The gap
people live with hearing loss worldwide (WHO, 2024).
Hearing aids amplify. They don't translate.
Pipeline
Mic array. 16 kHz stream. VAD-gated.
ASR, sound classes, prosody, reply intents.
Captions, alerts, haptic patterns.
What it does
Real-time speech in EN and PL.
527 classes across three severity tiers.
Three tap-to-speak suggestions per turn.
Hears your name in any inflection.
Knows when a room is laughing.
One model, two languages, auto-detect.
The stack
Every model in the pipeline is open-source or replicable from a published paper.
Hardware
All ML runs locally. No cloud, ever.
Raspberry Pi CM5 (8 GB), quad-core Cortex-A76. Active cooling for continuous inference.
MEMS mic array over I2S, RP2040 USB Audio Class bridge. 32 kHz capture, decimated to 16 kHz in software.
ESP32-S3 over BLE. Four LRA via DRV2605L. Five severity patterns, under 150 ms target.
INMO Air3 over 5 GHz WiFi. Captions in the field of view. The system works without them too.
18650 lithium-ion cell. ≈211 g without glasses. About 1 to 1.3 h in demo mode.
No cloud connection for the core features. Audio never leaves the device.
Numbers
Bench numbers from a fixed reference-audio suite. Full user-testing is the next stage.
Our commitment
These are commitments and intentions. Where work hasn't happened yet, we say so plainly.
We don't claim to fix deafness. HearSense translates sound into sight and touch.
User-testing with deaf and hard-of-hearing participants comes before we finalize the tech, not after.
All inference on-device. No cloud, no audio leaves the body.
WCAG 2.2 AA across the device and this site, from the start.
Impact
Closing the information gap between hearing and deaf participants in the same room.