$8 LU-ASR01 offline speech recognition board features “TW-ASR ONE” chip

LU-ASR01 offline speech recognition module

LU-ASR01 is a board capable of offline speech recognition with a built-in microphone, a speaker connector, twelve through holes for GPIOs and a temperature sensor interface for DHT11/DS18B20,  plus a USB Type-C port for power and programming. At first, I thought it might be based on the Unisound US516P6 microcontroller which we’ve seen on some inexpensive offline speech recognition modules, but the chip looks completely different, with the marking TW-ASR ONE. So let’s investigate… LU-ASR01 board specifications took some effort, but here’s what I’ve managed: MCU – TW-ASR ONE (aka TWen ASR ONE) microcontroller with 4MB flash, 512KB RAM, and a BNPU for voice processing; package: QFN48L (6x6x0.85mm) Audio I/O Built-in microphone 2-pin speaker header plus 3W power amplifier for 4Ω/3W speaker Voice recognition Up to 10 meters wake-up range 98% ultra-high recognition rate Customizable to 5 wake-up words and 200 recognition words USB – 1x USB Type-C port for […]

SmartCow Apollo – A Jetson Xavier NX devkit for conversational AI, computer vision

SmartCow Apollo Devkit

SmartCow Apollo is an audio/video AI engineering kit based on NVIDIA Jetson Xavier NX computer module designed for applications with conversational AI capabilities, such as speaker recognition and sentiment analysis. But considering a camera is included, computer vision applications should also be possible. The development kit comes with a 128GB NVMe SSD, four microphones, two speaker terminals, two 3.5mm phone jacks, an 8MP camera module, and a 2.08-inch OLED display with everything housed in a frame that keeps the module and accessories like that camera upright. SmartCow Apollo specifications: NVIDIA Jetson Xavier NX system-on-module CPU – 6-core NVIDIA Carmel ARMv8.2 64-bit CPU with 6MB L2 and 4MB L3 cache GPU – NVIDIA Volta architecture with 384 NVIDIA CUDA cores and 48 Tensor cores Memory – 8 GB or 16GB 128-bit LPDDR4x Storage – 16 GB eMMC 5.1 flash Display 1x Mini DP port 7-pin SPI header for OLED display (included) […]

MECOOL HomePlus – A 2-in-1 smart speaker and 4K TV box with DVB Tuner

MECOOL KA1

We’ve reviewed MECOOL devices in the past, such as their hybrid set-top boxes with DVB-T2 and/or DVB-S2 tuners. But MECOOL HomePlus (aka MECOOL KA1) packs even more features as it serves as both a 4K Android set-top box with a DVB tuner and a smart speaker compatible with Google Assitant. Based on Amlogic S905X4 Arm Cortex-A55 processor, and equipped with 4GB RAM and 32GB storage, MECOOL HomePlus runs a Google-certified Android TV 11 OS with far-field voice and hands-free TV control. MECOOL HomePlus (KA1) specifications: SoC – Amlogic S905X4 quad-core Arm Cortex-A55 processor with Arm G31 MP2 GPU RAM – 4GB LPDDR4 Storage – 32GB eMMC flash Video Output – HDMI 2.1 port with HDCP 2.2 Video Codes – 4K AV1, VP9, H.265, H.264 Audio Built-in two linear microphone array 2x 5W RMS speakers Tuner – RF antenna input for DVB T/T2/C tuner (optional) Connectivity 10/100M Ethernet RJ45 port Dual-band […]

PicoVoice offline Voice AI engine gets free tier for up to 3 users

PicoVoice Console Custom Wake Word

PicoVoice offline Voice AI engine has now a free tier that allows people to create custom wake words and voice commands easily for up to three users on any hardware including Raspberry Pi and Arduino boards. I first learned about PicoVoice about a year ago when the offline voice AI engine was showcased on a Raspberry Pi fitted with ReSpeaker 4-mic array to showcase the company’s Porcupine custom wake word engine, and Rhino Speech-to-Intent engine. The demo would support 9 wake words with Alexa, Bumblebee, Computer, Hey Google, Hey Siri, Jarvis, Picovoice, Porcupine, and Terminator. More importantly, the solution allows you to easily create your own custom words in minutes from a web interface by simply typing the selected wake word, with no need for hundreds of voice samples or waiting weeks to get it done. So I tried “Hey You” first, but I was told it was too short, […]

Espressif introduces ESP32-S3-BOX AI development kit for online and offline voice applications

ESP32-S3-Box

Espressif Systems has very recently introduced the ESP32-S3-BOX AI voice devkit designed for the development of applications with offline and online voice assistants, and whose design I find similar to the M5Stack Core2 devkit, but the applications will be different. The ESP32-S3-BOX features the latest ESP32-S3 processor with WiFi and BLE connectivity, AI capabilities, as well as a 2.4-inch capacitive touchscreen display, a 2-mic microphone array, a speaker, and I/O connectors with everything housed in a plastic enclosure with a stand. ESP32-S3-BOX specifications: WiSoC – ESP32-S3 dual-core Tensilica LX7 up to 240 MHz with Wi-Fi & Bluetooth 5, AI instructions, 512KB SRAM Memory and Storage – 8MB octal PSRAM and 16MB QSPI flash Display – 2.4-inch capacitive touchscreen display with 320×240 resolution Audio – Dual microphone, speaker USB – 1x USB Type-C port for power and debugging (JTAG/serial) Expansion – 2x Pmod-compatible headers for up to 16x GPIOs Misc Power […]

Picovoice Cobra Voice Activity Detection Engine shown to outperform Google WebRTC VAD

PicoVoice Cobra VAD

Picovoice Cobra Voice Activity Detection (VAD) engine has just been publicly released with support for Raspberry Pi, BeagleBone, NVIDIA Jetson Nano, Linux 64-bit, macOS 64-bit, Windows 64-bit, Android, iOS, and web browsers that support WebAssembly. Support for other Cortex-M and Cortex-A based SoCs can also be made available but only to enterprise customers. Picovoice already offered custom wake word detection with an easy and quick web-based training and offline voice recognition for Raspberry Pi, and even later ported their voice engine to Arduino. Cobra VAD is a new release, and, like other VADs, aims to detect the presence of a human voice within an audio stream. Picovoice Cobra can be found on Github, but note this is not an open-source solution, and instead, libpv_cobra.so dynamic library is provided for various targets, together with header files and demos in C, Python, Rust, and WebAssembly, as well as demo apps for iOS […]

Raspberry Pi smart audio devkit features AISonic IA8201 DSP, microphone array

AISonic-Raspberry Pi Development Kit

Knowles AISonic IA8201 Raspberry Pi development kit is designed to bring voice, audio edge processing, and machine learning (ML) listening capabilities to various systems, and can be used to evaluate the company’s AISonic IA8201 DSP that was introduced about two years ago. The kit is comprised of three boards with an adapter board with three buttons connecting to the Raspberry Pi, as well as the AISonic IA8210 DSP board itself connected via a flat cable to a microphone array. Knowles AISonic Raspberry Pi development kit Knowles did not provide the full details for the development but says it enables wake-on-voice processing for low latency voice UI, noise reduction, context awareness, and accelerated machine learning inferencing for edge processing of sensor inputs. Some of the use cases include Low Power Voice Wake to listen for specific OEM keywords to wake the host processor, Proximity Detection when combined with an ultrasonic capable […]

Offline speech recognition MCU module comes with speaker, microphone, and UART connectors

offline voice recognition mcu module

We found out about Unisound US516P6 RISC microcontroller inside an offline voice assistant module last May. The module offers offline speech recognition for just $2 to $4, with good performance, and excellent privacy since no cloud service nor Internet connection is needed. That module requires some soldering, but if you’d prefer something easier to connect the “SU-10A” offline speech recognition MCU module comes with connectors for a speaker, a microphone, as well as UART connectivity to a host MCU if needed. “SU-10A” module specifications: MCU – Unisound US516P6 RISC microcontroller @ 240 MHz with FPU, DSP instruction, FFT accelerator, 242KB SRAM, 2MB flash Audio Built-in 3W mono Class AB power amplifier. 2mm pitch connector for speaker (4 Ohms up to 2.9W,  8 Ohms up to 1.8W) 2mm pitch connector for electret microphone Debugging/programming – UART port for serial console (5V or 3.3V supported) Host interface – 2mm pitch 4-pin connector […]

Memfault IoT and embedded debugging platform