 |
| |
| Audio and Speech Technologies |
| |
Sensory offers world-class speech technologies on both hardware and software platforms. The technologies listed below can be implemented on the platforms depicted by the product symbols here:
|
| |
|
|
| |
| Technology Demo Videos Available:
NLP-5x Demo-Text-to-Speech, NLP-5x Demo-Math Flash Card, NLP-5x Microwave Oven,
Beat Prediction, BlueGenie VUI, LipSync, NanoLock w/Voice Password,
Real-Time LipSync, SonicNet, SoundSource, Natural TimeSet
|
| |
|
| Speech Recognition: |
|
|
| Natural Language Interface |
  |
Flexible Grammars!
Sensory's Natural Language Interface for the NLP-5x understands context-specific commands spoken the way the user would naturally say them. Order independence allows flexibility in how commands are phrased, and speech prompts can request any missing information (form filling). Revolutionary flexible grammars let the user say multiple commands in a single phrase, in any order. The result is the most natural use of speech recognition!
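The order independence and form filling described above can be illustrated with a small slot-filling loop. This is only a sketch under stated assumptions: it works on already-recognized text, and the slot names, keyword patterns, and the `parse_command` and `missing_prompt` helpers are hypothetical. None of it is Sensory's API or grammar format.

```python
import re

# Hypothetical slots for a microwave-style product (illustration only).
SLOT_PATTERNS = {
    "action": re.compile(r"\b(cook|defrost|reheat)\b"),
    "power": re.compile(r"\b(high|medium|low)\b"),
    "minutes": re.compile(r"\b(\d+) minutes?\b"),
}

def parse_command(text, filled=None):
    """Fill whichever slots appear in the phrase, regardless of word order."""
    slots = dict(filled or {})
    lowered = text.lower()
    for name, pattern in SLOT_PATTERNS.items():
        match = pattern.search(lowered)
        if match:
            slots[name] = match.group(1)
    return slots

def missing_prompt(slots):
    """Return a prompt for the first unfilled slot, or None when complete."""
    for name in SLOT_PATTERNS:
        if name not in slots:
            return f"Please say the {name}."
    return None

# The user gives two slots in one phrase; the device asks for the third.
slots = parse_command("on high for 3 minutes")
print(slots, missing_prompt(slots))
slots = parse_command("defrost", slots)
print(slots, missing_prompt(slots))
```

Because each slot is matched independently, "defrost on high for 3 minutes" and "3 minutes on high, defrost" fill the same form.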
|
|
| |
|
| |
|
| |
| Speaker Verification |
 |
Biometric security through voice
While similar to Speaker-Dependent recognition, Speaker Verification can determine whether a password is spoken by the individual who originally trained it. The user trains 1-4 passwords (the more passwords, the better the security) that can provide voice access to any product. Equal error rates (where the probability of an incorrect acceptance equals that of an incorrect rejection) range from 0.01% to 7%, depending on the number of words and whether the passwords are known to the impostor. On the NLP-5x, up to 10 SV templates can be stored on-chip. The RSC-4x can store 5 SV templates on-chip. With external memory, the number of unique sets for both chips is limited only by programmable memory capacity.
NanoLock SV Demo
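The equal error rate mentioned above is the operating point where false acceptances and false rejections are equally likely. A minimal sketch of how it can be estimated from match scores follows. The score lists are made-up illustration data, not Sensory measurements, and the function names are hypothetical.

```python
def error_rates(genuine, impostor, threshold):
    """Return (false_accept_rate, false_reject_rate) at a score threshold."""
    far = sum(s >= threshold for s in impostor) / len(impostor)
    frr = sum(s < threshold for s in genuine) / len(genuine)
    return far, frr

def equal_error_rate(genuine, impostor):
    """Sweep candidate thresholds; return the one where FAR and FRR are closest."""
    best = None
    for threshold in sorted(set(genuine) | set(impostor)):
        far, frr = error_rates(genuine, impostor, threshold)
        gap = abs(far - frr)
        if best is None or gap < best[0]:
            best = (gap, threshold, (far + frr) / 2)
    return best[1], best[2]

# Made-up scores: higher means a closer match to the trained password.
genuine = [0.91, 0.85, 0.78, 0.95, 0.60, 0.88, 0.93, 0.81, 0.74, 0.97]
impostor = [0.20, 0.35, 0.10, 0.55, 0.42, 0.65, 0.30, 0.15, 0.48, 0.25]

threshold, eer = equal_error_rate(genuine, impostor)
print(f"threshold={threshold:.2f} EER={eer:.1%}")
```

Raising the threshold above this point trades fewer false acceptances for more false rejections, which is the usual tuning decision in a security product.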
|
|
| |
| Speaker Dependent Speech Recognition |
 |
Flexible vocabulary, any language, any accent
Speaker dependent (SD) recognition is desirable where user-specific or language-specific vocabularies are required. Each recognition word is trained just once by the user to create voice "templates", each of which requires up to 200 bytes of memory (which can be on-chip or external). Vocabularies in excess of 100 words are possible, although there are often practical reasons for keeping recognition sets under 50 words. The NLP-5x can store up to 10 SD templates in on-chip SRAM. The RSC-4128 can store up to 7 SD templates in on-chip SRAM. With proper design, Sensory's SD technology can yield highly accurate recognition for any user, regardless of language or accent. |
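As a rough worked example of the memory figures above, the template budget can be estimated from the stated upper bound of 200 bytes per template. The 64 KiB memory size below is a hypothetical part chosen for illustration, and the helper names are not from any Sensory toolkit.

```python
MAX_TEMPLATE_BYTES = 200  # upper bound per SD template, from the text above

def template_memory_bytes(vocabulary_size, bytes_per_template=MAX_TEMPLATE_BYTES):
    """Worst-case storage for one trained template per word."""
    return vocabulary_size * bytes_per_template

def max_vocabulary(memory_bytes, bytes_per_template=MAX_TEMPLATE_BYTES):
    """How many worst-case templates fit in the given memory."""
    return memory_bytes // bytes_per_template

for words in (10, 50, 100):
    print(words, "words ->", template_memory_bytes(words), "bytes")

# Hypothetical 64 KiB external memory.
print(max_vocabulary(64 * 1024), "templates fit in 64 KiB")
```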
|
| |
|
| |
|
| Audio |
|
|
| |
| Stereo MP3 Decoder |
 |
Hi-Fidelity Stereo MP3 Decoder with all standard bitrates and a 5-band equalizer.
|
|
| |
|
| |
|
| |
| Record and Playback |
 |
Store messages and play back--voice messaging capability
Compressed digital sound reproduction
Sensory's RSC-4x and NLP-5x processors can record audio to off-chip RAM or Flash at data rates of under 30 kbits per second for custom greetings, phones and answering machines, voice pitch changers, and hand-held recording devices. On-chip compression levels can be varied depending on the quantity and quality of playback desired. Automatic silence removal can also be used to reduce memory requirements. The NLP-5x offers 8k and 16k samples per second, while the RSC-4x family offers 8k samples per second. The NLP-5x signal processing provides superior voice quality.
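The data rate quoted above translates directly into a memory budget. The sketch below uses the 30 kbits per second upper bound from the text. The `silence_fraction` parameter is a hypothetical stand-in for the effect of silence removal, and the function names are for illustration only.

```python
def storage_bytes(seconds, bits_per_second=30_000):
    """Memory needed to store a recording at a constant data rate."""
    return seconds * bits_per_second / 8

def recording_seconds(memory_bytes, bits_per_second=30_000, silence_fraction=0.0):
    """Input time that fits in memory.

    silence_fraction is the (assumed) share of the input that is dropped
    as silence before storage, so it costs no memory.
    """
    if not 0.0 <= silence_fraction < 1.0:
        raise ValueError("silence_fraction must be in [0, 1)")
    stored_seconds = memory_bytes * 8 / bits_per_second
    return stored_seconds / (1.0 - silence_fraction)

print(storage_bytes(60), "bytes for a 60 s message")
print(recording_seconds(225_000), "s fit in 225,000 bytes")
print(recording_seconds(225_000, silence_fraction=0.25), "s if 25% is silence")
```

Lower compression rates or more aggressive silence removal extend recording time at the cost of playback quality, which is the trade-off the text describes.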
|
|
| |
|
| Interactive / Robotic |
|
| LCD Control |
 |
LCD control logic and drive - up to 104 icons or pixels. SPI for large array driver interfaces.
|
|
| |
| Motor Control |
 |
Motor control logic - up to 3 bi-directional motors.
|
|
| |
| Silent SonicNet |
 |
Silent SonicNet communicates data via encoded sound at 14 kHz or 18 kHz in short bursts on the NLP-5x. These high frequencies make the short bursts essentially inaudible in practical applications. Silent SonicNet can run coincident with SX or T2SI, allowing data transmission during VR dialogues. Products with integrated speech that already include an NLP-5x, microphone and speaker can implement this at no additional cost, and can interact with each other, potentially doubling demand.
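The page does not describe how Silent SonicNet encodes data, so the following is not its protocol. It is only a generic sketch of one building block any tone-based link needs: deciding whether a short burst at a known frequency is present, here using the standard Goertzel algorithm. The 48 kHz sample rate, the 10 ms burst length, and the detection threshold are all assumptions made for illustration.

```python
import math

def tone_power(samples, sample_rate, frequency):
    """Goertzel algorithm: squared magnitude of the signal at one frequency."""
    coeff = 2.0 * math.cos(2.0 * math.pi * frequency / sample_rate)
    s_prev, s_prev2 = 0.0, 0.0
    for x in samples:
        s = x + coeff * s_prev - s_prev2
        s_prev2, s_prev = s_prev, s
    return s_prev ** 2 + s_prev2 ** 2 - coeff * s_prev * s_prev2

def burst_present(samples, sample_rate, frequency, threshold):
    """True when the power at `frequency` reaches the threshold."""
    return tone_power(samples, sample_rate, frequency) >= threshold

SAMPLE_RATE = 48_000  # assumed; must exceed twice the tone frequency
N = 480               # assumed 10 ms burst
THRESHOLD = 0.25 * (N / 2) ** 2  # a quarter of a full-scale tone's power

# Synthetic full-scale 18 kHz burst standing in for microphone input.
burst = [math.sin(2.0 * math.pi * 18_000 * n / SAMPLE_RATE) for n in range(N)]

print("18 kHz present:", burst_present(burst, SAMPLE_RATE, 18_000, THRESHOLD))
print("14 kHz present:", burst_present(burst, SAMPLE_RATE, 14_000, THRESHOLD))
```

Goertzel suits small processors because it evaluates a single frequency with one multiply-accumulate per sample, rather than computing a full spectrum.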
|
|
| |
| SonicNet |
 |
SonicNet communicates data at 8 kHz via encoded sound in short bursts on the RSC-4x. SonicNet can run coincident with SX to partially mask the sonic tones. Products with integrated speech that already include an RSC-4x, microphone and speaker can implement this at no additional cost, and can interact with each other, potentially doubling demand.
SonicNet Demo
Interactive Multimedia Windows Media demo - requires RSC-4x Demo/Eval Board |
|
| |
| System Communications |
 |
USB1.1, SPI, UART-Lite, I2S and infrared (IR) interfaces combine with voice user interface capabilities, enabling man-machine interface solutions with an unprecedented combination of power and cost-effectiveness.
|
|
| |
|
| |
|
| |
|
| |
| Peak Detection |
 |
Peak Detection picks up the amplitude of sounds in the room as they occur and reacts to them with a movement or display function.
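A minimal sketch of the idea, assuming the audio arrives as normalized samples in the range -1.0 to 1.0: take the peak amplitude of each short frame and report an event when a frame first rises above a threshold. The frame size, threshold, and function names are hypothetical, and a product would trigger its movement or display function where this sketch records an index.

```python
def frame_peaks(samples, frame_size):
    """Peak absolute amplitude of each consecutive frame."""
    return [max(abs(s) for s in samples[i:i + frame_size])
            for i in range(0, len(samples), frame_size)]

def detect_events(samples, frame_size=4, threshold=0.5):
    """Return indices of frames where the peak first rises above the threshold."""
    events = []
    was_loud = False
    for index, peak in enumerate(frame_peaks(samples, frame_size)):
        is_loud = peak >= threshold
        if is_loud and not was_loud:
            events.append(index)  # rising edge: react here
        was_loud = is_loud
    return events

samples = [0.0, 0.1, -0.05, 0.02,   # quiet
           0.3, 0.9, -0.7, 0.2,     # loud sound starts
           0.6, -0.4, 0.1, 0.0,     # still loud, no new event
           0.05, 0.0, -0.02, 0.01,  # quiet again
           -0.8, 0.2, 0.1, 0.0]     # second loud sound
print(detect_events(samples))
```

Reporting only the rising edge keeps a single sustained sound from triggering the reaction repeatedly.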
|
|
| |
| Pitch Detection |
 |
The RSC processor can analyze a pitched human voice to determine the notes being sung. |
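The page does not say which method the RSC uses. One common textbook approach is autocorrelation, sketched below for a single frame of audio: the lag at which the signal best matches a shifted copy of itself gives the pitch period. The sample rate, search range, and synthetic test tone are assumptions for illustration.

```python
import math

def estimate_pitch_hz(samples, sample_rate, min_hz=80.0, max_hz=500.0):
    """Estimate pitch from the lag with the strongest autocorrelation."""
    min_lag = int(sample_rate / max_hz)
    max_lag = int(sample_rate / min_hz)
    best_lag, best_score = None, float("-inf")
    for lag in range(min_lag, max_lag + 1):
        score = sum(samples[n] * samples[n + lag]
                    for n in range(len(samples) - lag))
        if score > best_score:
            best_lag, best_score = lag, score
    return sample_rate / best_lag

SAMPLE_RATE = 8_000  # assumed sampling rate
# Synthetic 200 Hz tone standing in for a sung note (50 ms frame).
frame = [math.sin(2.0 * math.pi * 200.0 * n / SAMPLE_RATE) for n in range(400)]

print(round(estimate_pitch_hz(frame, SAMPLE_RATE), 1), "Hz")
```

Running this on successive frames would give the sequence of pitches over time. Real voices are noisier than a pure tone, so a practical detector would also need voicing checks and smoothing.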
|
| |
|
| |
|
| |
| Talk Back |
 |
In response to the user's speech or questions, the RSC can produce speech that sounds like conversation from a non-human creature. |
|
| |
| |
| Natural Radio Tuning |
 |
Set radio stations using natural phrases on the NLP-5x.
|
|
| |
|
| |
| Low Power Audio Wakeup |
 |
Wake from low power mode capability
One challenge for hands-free, battery-operated products is that if they are always on and always listening, the batteries drain rapidly. Sensory has created a low-power technology on the RSC-4x that listens for audio (whistles or claps), wakes from low-power mode, and then begins listening for speech recognition commands. This technology can extend the life of battery-operated products from weeks to years.
|
|
| |
| Sensor Interfacing |
 |
Sensory and third-party developers provide support for presence detection, touch and position sensors, gesture and motion analysis, and more. USB1.1, SPI, UART-Lite, I2S and infrared (IR) interfaces combine with voice user interface capabilities, enabling man-machine interface solutions with an unprecedented combination of power and cost-effectiveness.
|
|
| |
|
| Voice Recognition for Bluetooth Products |
|
| BlueGenie™ Voice Interface |
 |
Speech Recognition and TTS for Headsets, Music Players, Hands-Free Kits & More
Sensory’s BlueGenie Voice Interface software suite runs on CSR's BC-5 MM Kalimba DSP, and enables manufacturers of Bluetooth products to integrate full voice control and synthetic speech output without the need for visual displays or complex user interfacing. It frees designers to pack functionality onto small form factor Bluetooth devices and answers consumer demand for a truly hands-free experience. TTS allows Caller ID announcement and SMS message playback with speech.
BlueGenie Voice User Interface Demo
|
|
|
| |
| Technology Matrix |
| | NLP-5x Natural Language Processor | RSC-4x Family | SC-6x Family | FluentSoft Speech Recognition | BlueGenie™ Voice Interface |
| Natural Language Interface | Yes | - | - | Yes | coming soon |
| Phrase Spotting | Yes | - | - | Yes | Yes |
| Speaker Independent with T2SI™ | Yes | Yes | - | Yes | Yes |
| Speaker Verification | Yes | Yes | - | coming soon | - |
| Speaker Dependent | Yes | Yes | - | coming soon | - |
| Continuous Digits | Yes | Yes | - | Yes | Yes |
| Text-to-Speech Synthesis (TTS) with Voice Morphing | Yes | - | - | Yes | Yes |
| Stereo MP3 decoder | Yes | - | - | - | - |
| Mono or Stereo Music | Yes | Yes | Yes | - | - |
| Speech Synthesis | Yes | Yes | Yes | Yes | Yes |
| Record and Playback | Yes | Yes | - | - | - |
| LCD Control | Yes | - | - | - | - |
| Motor Control | Yes | - | - | - | - |
| Silent SonicNet | Yes | - | - | - | - |
| SonicNet | - | Yes | - | - | - |
| System Communications | Yes | - | - | - | - |
| RealTime LipSync | Yes | Yes | - | - | - |
| Beat Prediction | Yes | Yes | - | - | - |
| LipSync | Yes | Yes | - | - | - |
| Peak Detection | Yes | Yes | - | - | - |
| Pitch Detection | Yes | Yes | - | - | - |
| Sing Back | Yes | Yes | - | - | - |
| Sound Sourcing | Yes | Yes | - | - | - |
| Talk Back | Yes | Yes | - | - | - |
| Natural Radio Tuning | Yes | - | - | - | - |
| Natural TimeSet | Yes | Yes | - | - | - |
| Low Power Audio Wakeup | - | Yes | - | - | - |
| Sensor Interfacing | Yes | - | - | - | - |
| Bluetooth Support | - | - | - | - | Yes |