Intel Speech Enabling Developer Kit Works with Alexa Voice Service, Raspberry Pi 3 Board

We’ve known for several months that Intel has been working on the Quark S1000 “Sue Creek” processor for voice recognition. The S1000 SoC is based on two Tensilica LX6 cores with HiFi3 DSP, some speech recognition accelerators, and up to 8x microphone interfaces, which allow it to perform speech recognition locally. The solution can also be hooked to an application processor via SPI, I2S and USB (optional) when cloud-based voice recognition is needed. Intel has recently introduced their Speech Enabling Developer Kit working with Amazon Alexa Voice Service (AVS), featuring a “dual DSP with inference engine” – which must be Quark S1000 – and an 8-mic array. The kit also includes a 40-pin cable to connect to the Raspberry Pi 3 board. Intel only provided basic specifications for the kit: Intel’s dual DSP with inference engine; Intel 8-mic circular array; high-performance algorithms for acoustic echo cancellation, noise reduction, beamforming and custom wake word […]

AMBE+2 Vocoder Promises High Voice Quality at Low (2.0 to 9.6 Kbps) Data Rates

Opus 1.2 open source audio codec was released a few months ago with the ability to deliver high-quality, low-bitrate audio for speech, with bitrates as low as 12 Kbps. Digital Voice Systems (DVSI) claims to have gone even lower thanks to their AMBE+2 vocoder (Advanced MultiBand Excitation), which provides high-quality speech at data rates from 2.0 to 9.6 kilobits per second. The AMBE+2 vocoder is said to outperform the company’s previous generation AMBE+ vocoder as well as the G.729 and G.726 vocoders, while operating at only 4.0 Kbps. The vocoder is suitable for mobile radio, secure voice, satellite communication, computer telephony, digital voice and storage applications. The solution can be integrated into products either using software licensing, or through vocoder chips, and the company lists the following key benefits: maintains speech intelligibility and speaker recognition at rates as low as 2.0 kbps; resistant to background noise and channel bit errors […]
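To put those data rates in perspective, here is a minimal Python sketch that converts a codec bitrate into the storage needed for one minute of speech. It treats the quoted rates as kilobits per second (Kbps, as in the post title), assumes 1 Kbps = 1,000 bits per second, and ignores any framing or error-correction overhead. The function name `bytes_per_minute` is made up for this illustration and is not part of any DVSI or Opus API.

```python
def bytes_per_minute(kbps: float) -> float:
    """Payload size in bytes for 60 seconds of audio encoded at `kbps` kilobits/s.

    Assumption: 1 kbps = 1000 bits per second, no container or FEC overhead.
    """
    bits = kbps * 1000 * 60  # total bits in one minute
    return bits / 8          # 8 bits per byte


if __name__ == "__main__":
    # 2.0, 4.0 and 9.6 kbps are the AMBE+2 rates quoted above; 12 kbps is the Opus figure.
    for rate in (2.0, 4.0, 9.6, 12.0):
        print(f"{rate:>4} kbps -> {bytes_per_minute(rate) / 1000:.1f} kB per minute")
```

Under these assumptions, a minute of speech at 2.0 Kbps takes 15 kB, against 90 kB at 12 Kbps.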

Amlogic A111, A112 & A113 Processors are Designed for Audio Applications, Smart Speakers

Amlogic processors are mostly found in TVs and TV boxes, but the company is now apparently entering a new market with A111, A112, and A113 audio processors. I was first made aware of those new processors through the Buildroot OpenLinux Release Notes V20170831.pdf document posted on their Open Linux website, where two boards with Amlogic A113D and A113X are shown. First, the S400 board with the following key features/specifications: SoC – Amlogic A113D CPU; System Memory – 1GB DDR3; Storage – 512MB SLC NAND flash; Display I/F – MIPI interface; Connectivity – Gigabit Ethernet, SDIO WiFi/BT (AP6356S); Audio – SPDIF_IN/SPDIF_OUT, LINE_IN/LINE_OUT, 2x audio headers (MIC_Connector & SPK_Connector); USB – 1x USB 2.0 OTG; Expansion – 2x PCIe ports; Misc – 6x ADC keys, IR_IN/IR_OUT, UART interface (RS232). The second board, S420, is based on the A113X SoC, and comes with fewer features (no display, no Ethernet, no PCIe…) and less memory: SoC – Amlogic A113X […]

Google Assistant News – AIY Voice Kit For Sale, Offline Support, 3rd Party Smart Speakers Announced

There’s been a lot of development related to Google Assistant in the last few days. First, Google provided an update for AIY Projects, with their AIY Projects Voice Kit now available for pre-order on Micro Center for $35 including a Raspberry Pi 3 board, making the kit virtually free, although you may also purchase it. Note that Micro Center blocks traffic originating from some countries, so I had to use Zend2 to access the site. [Update 10/09/2017: You can also get it from Seeed Studio for worldwide shipping] Google also announced the Speech Commands Dataset with 65,000 one-second-long utterances of 30 short words, which they are in the process of integrating with the next release of the Voice Kit, and which will allow the devices to respond to voice commands without the need for an Internet connection. So if you lose your Internet connection, or want to isolate your Voice […]

Those Charts Show The Benefits of Microphone Arrays for Hot Word Detection

Since I started looking more into smart speakers, including DIY ones such as the one I made with an Orange Pi Zero board + Google Assistant with a single microphone, I have been told about the importance of microphone arrays, but so far, I had not seen any clear study or data about that. That changed today, as I came across a review of mic arrays by the makers of Snips Voice Platform. They tested five arrays connected to a Raspberry Pi 3 with the system, and also added a generic USB microphone to the mix. The results speak for themselves… In that experiment, they measured the rate at which a hot word was successfully detected while incrementally increasing the distance from 0.5 meters to 5 meters (16 ft), and for each distance, repeating the hot word 25 times at 3-second intervals using a pre-recording to keep the voice level constant, and the […]
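The metric described above, a detection rate out of 25 repetitions per distance, can be sketched in a few lines of Python. This is only an illustration of the arithmetic, not the Snips test code. The counts in `hypothetical_counts` are invented placeholders and NOT measured results, and the function name `detection_rate` is likewise an assumption.

```python
TRIALS_PER_DISTANCE = 25  # from the excerpt: hot word repeated 25 times per distance


def detection_rate(detected: int, trials: int = TRIALS_PER_DISTANCE) -> float:
    """Fraction of hot word playbacks that were detected at one distance."""
    if trials <= 0:
        raise ValueError("trials must be positive")
    if not 0 <= detected <= trials:
        raise ValueError("detected must be between 0 and trials")
    return detected / trials


if __name__ == "__main__":
    # Placeholder numbers for illustration only (distance in meters -> detections).
    hypothetical_counts = {0.5: 25, 2.0: 20, 5.0: 10}
    for distance_m, detected in sorted(hypothetical_counts.items()):
        print(f"{distance_m:>4} m: {detection_rate(detected):.0%} detected")
```

With only 25 trials per distance, each missed detection moves the rate by 4 percentage points, so small differences between arrays should be read with caution.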

X-Powers AC108 is a Quad-Channel ADC Chip for Microphone Arrays

X-Powers, a company better known for supplying PMIC “companion” chips for Allwinner processors, also makes some audio chips, including AC108, a chip specifically designed for microphone arrays with support for 4 microphones, and an I2C + I2S output interface to the host processor. Microphone arrays are particularly useful for smart speakers, and especially hot word detection (voice activity detection), as single microphone setups like the one I use with Orange Pi Zero may have trouble detecting hot words like “OK Google” in noisy environments (music playing, alarm ringing…). X-Powers AC108 specifications: 108 dB dynamic range (A-weighted) @ 0 dB boost gain; -90 dB THD+N (total harmonic distortion plus noise) @ 0 dB boost gain; 4x programmable boost amplifiers with 0dB to 45dB in 3dB steps; ADC sample rates supported – 8kHz, 12kHz, 16kHz, 22.05kHz, 24kHz, 32kHz, 44.1kHz, 48kHz, 96kHz; analog mixer and digital mixer in record data path; 4x fully differential microphone inputs: MIC1P/N […]
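The boost amplifier range quoted above (0 dB to 45 dB in 3 dB steps) works out to 16 settings. The Python sketch below enumerates them and converts each to a linear voltage ratio with the standard 20·log10 relation. It is plain arithmetic on the listed figures and does not model the AC108 register map, which is not described here. Both function names are made up for illustration.

```python
def boost_gain_steps(min_db: int = 0, max_db: int = 45, step_db: int = 3) -> list[int]:
    """All boost gain settings in dB, inclusive of both ends of the range."""
    return list(range(min_db, max_db + 1, step_db))


def db_to_voltage_ratio(gain_db: float) -> float:
    """Convert a gain in dB to a linear voltage (amplitude) ratio: 10^(dB/20)."""
    return 10 ** (gain_db / 20)


if __name__ == "__main__":
    steps = boost_gain_steps()
    print(f"{len(steps)} gain settings")
    for gain_db in steps:
        print(f"{gain_db:>2} dB -> x{db_to_voltage_ratio(gain_db):.1f}")
```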

Sony Spritzer is an Arduino Compatible Board with Built-in GPS, Audio Codec

Look who is joining the maker community! Sony showcased their Arduino compatible Spritzer board during Maker Faire Tokyo on August 5-6. Despite lacking on-board network connectivity, the board is said to have been designed for IoT applications with features such as an integrated GPS and an advanced digital audio codec and amplifier. Sony Spritzer specifications: MCU – Sony CXD5602 ARM Cortex-M4F ×6 micro-controller clocked at up to 156 MHz with 1.5MB SRAM; Storage – 8MB flash memory, micro SD card; GNSS – GPS, GLONASS supported; Audio – 3.5mm audio jack; Expansion I/Os: Digital I/O Pins – SPI, I2C, UART, PWM ×4 (3.3V); Analog Pins – 6ch (3.3V range); Audio I/O – 8ch digital MICs or 4ch analog MICs, stereo speaker, I2S, CXD5247 audio codec with 192 kHz/24bit High-Resolution audio; 2x camera interfaces; USB – 1x micro USB port for programming; Power Supply – Via power barrel and Vin […]

Qualcomm Snapdragon 212 Boards – Intrinsyc Open-Q 212 and Kaynes Technology SKATE-212

Qualcomm Snapdragon 212 (APQ8009) quad core Cortex A7 processor is used in entry-level smartphones, but it’s also one of the processors which the company expects to use in their Smart Speaker Platform leveraging Google Assistant, Amazon Alexa, and other A.I. voice services. Two companies have designed single board computers that can be used for this purpose: Intrinsyc Open-Q 212 and Kaynes Technology SKATE-212. Intrinsyc Open-Q 212 SBC Development Board: Contrary to some other Open-Q boards, but not all, Open-Q 212 is not comprised of a baseboard and a system-on-module, as everything is soldered on a single PCB. Open-Q 212 specifications: SoC – Qualcomm Snapdragon 212 (APQ8009) quad core ARM Cortex A7 processor @ 1.267GHz with Adreno 304 GPU, QDSP6 DSP; System Memory – 1GB LPDDR3; Storage – 8GB eMMC (non-POP) flash and micro SD card socket; Connectivity – Ethernet, pre-scanned Wi-Fi 802.11n 2.4GHz (WCN3610) with chip and U.FL antennas, Bluetooth 4.1 […]
