smart speaker News - Page 2 of 10 - CNX Software

January 13, 2025 by Debashis Das - 6 Comments

Fully enclosed ESP32-S3 board features 1.8-inch AMOLED, microphone & speaker for AI audio applications

ESP32 S3 1.8inch AMOLED display development board

Waveshare ESP32-S3-Touch-AMOLED-1.8 is an ESP32-S3 development board with an AMOLED display and AI audio support fully housed in a plastic enclosure. The most interesting feature of this devkit is its 1.8-inch AMOLED display with a 100000:1 contrast ratio and a wide 178° viewing angle, plus support for AI speech using its built-in microphone and speaker, and a built-in battery for IoT and AI applications. Other features include a QMI8658 6-axis IMU for motion detection, a PCF85063 RTC for time, and an ES8311 audio codec for high-quality audio. The ESP32-S3 provides Bluetooth and Wi-Fi connectivity and the board also features a USB-C port for power and programming. The AXP2101 power management IC enables battery charging and optimization, while GPIO, I2C, and UART pads allow expansion. Waveshare ESP32-S3-Touch-AMOLED-1.8 specifications Wireless MCU – Espressif Systems ESP32-S3R8 CPU – Dual-core Tensilica LX7 @ up to 240 MHz with vector instructions for AI acceleration. Memory – […]

December 20, 2024December 19, 2024 by Jean-Luc Aufranc (CNXSoft) - 3 Comments

$59 Voice “Preview Edition” adds an offline smart speaker to your Home Assistant server

Nabu Casa has just launched the Home Assistant Voice Preview Edition, a little ESP32 device with an XMOS XU316 audio processor, a dual-microphone array, an internal speaker, and a 3.5mm audio jack, that adds offline smart speaker functions to your Home Assistant server through WiFi. If your Home Assistant server is powerful enough, voice processing will be done directly on your local hardware using Home Assistant Voice software, but with lower-end hardware like a Raspberry Pi 4, audio processing can be done via a privacy-focused cloud instead. The solution also supports expansion thanks to a Grove connector on the bottom of the device. Voice Preview Edition specifications: SoC – Espressif ESP32-S3 dual-core Xtensa LX7 @ up to 240 MHz with vector extension for ML acceleration, 2.4 GHz WiFi & Bluetooth 5.0 LE connectivity Memory- 8 MB octal PSRAM Storage – 16 MB flash Audio DSP/Processor – XMOS XU316 with 16 […]

November 5, 2024November 5, 2024 by Tomisin Olujinmi - 1 Comment

M5Stack releases AX630C-powered offline “Module LLM” for local smart home and AI applications

The M5Stack Module LLM is yet another box-shaped device from the company that provides artificially intelligent control without internet access. It is described as an “integrated offline Large Language Model (LLM) inference module” which can be used to implement local LLM-based solutions in smart homes, voice assistants, and industrial control. Module LLM is powered by the AX630C SoC, equipped with 4GB LPDDR4 memory, 32GB storage, and a 3.2 TOPS (INT8) or 12.8 TOPS (INT4) NPU. M5Stack says the main chip has an average runtime power consumption of 1.5W, making it suitable for long-term operation. It has a built-in microphone, speaker, microSD card slot, and USB OTG. The USB port can connect peripherals such as cameras and debuggers, and the microSD card slot supports cold and hot firmware updates. The M5Stack Module LLM joins the list of other offline, on-device LLM-based solutions, such as the SenseCAP Watcher, Useful Sensors’ AI in […]

May 29, 2024May 29, 2024 by Jean-Luc Aufranc (CNXSoft) - 2 Comments

picoLLM is a cross-platform, on-device LLM inference engine

Large Language Models (LLMs) can run locally on mini PCs or single board computers like the Raspberry Pi 5 but with limited performance due to high memory usage and bandwidth requirements. That’s why Picovoice has developed the picoLLM Inference Engine cross-platform SDK optimized for running compressed large language models on systems running Linux (x86_64), macOS (arm64, x86_64), and Windows (x86_64), Raspberry Pi OS on Pi 5 and 4, Android and iOS mobile operating systems, as well as web browsers such as Chrome, Safari, Edge, and Firefox. Alireza Kenarsari, Picovoice CEO, told CNX Software that “picoLLM is a joint effort of Picovoice deep learning researchers who developed the X-bit quantization algorithm and engineers who built the cross-platform LLM inference engine to bring any LLM to any device and control back to enterprises”. The company says picoLLM delivers better accuracy than GPTQ when using Llama-3.8B MMLU (Massive Multitask Language Understanding) as a […]

May 14, 2024May 14, 2024 by Jean-Luc Aufranc (CNXSoft) - 9 Comments

Rockchip RK2118G/RK2118M dual-core Star-SE Armv8-M microcontrollers target smart audio applications

Rockchip RK2118G microcontroller block diagram

Rockchip RK2118G and RK2118M smart audio microcontrollers based on a dual-core Star-SE Armv8-M processor, an NPU for smart AI audio processor, three DSPs, 1024KB SRAM, optional DDR memory in package, and a range of peripherals. I first noticed the RK2118M in slides from the Rockchip Developer Conference 2024 last March, but I did not have enough information for an article at the time. Things have now changed since I’ve just received a bunch of datasheets including the one for the RK2118G and RK2118G microcontrollers, which look identical except for the DDR interface and optional built-in 64MB RAM for the RK2118G. The datasheets have only one reference to Arm with the string “Arm-V8M” and nothing else, and Cortex is not mentioned at all. But the slide above reveals the STAR-SE core looks to be an Arm Cortex-M33 core. We also learn the top frequencies for the “STAR-M33″/”STAR-SE” core (300MHz) and the […]

September 1, 2023 by Jean-Luc Aufranc (CNXSoft) - 6 Comments

ESP32-S3-BOX-3 devkit comes with 2.4-inch display, dual microphone, PCIe expansion connector

Espressif Systems has launched an update to their ESP32-S3-Box development kit for online and offline voice assistants with the ESP32-S3-BOX-3 devkit that still features a 2.4-inch capacitive touchscreen display with 320×240 resolution, two microphones, a built-in speaker, and a USB-C port, but replaces the PMOD connector by a PCIe connector for various expansion modules. The open-source ESP32-S3 development kit is powered by the ESP32-S3 SoC with AI extensions and can be used to implement all sorts of solutions using the company’s ESP-SR, ESP RainMaker, and Matter solutions such as an offline voice assistant, a chatbot powered by ChatGPT, a handheld gaming console, a tiny robot, a Matter-compatible Smart Home hub, and more. ESP32-S3-BOX-3 specifications: WiSoC – ESP32-S3 dual-core Tensilica LX7 up to 240 MHz with Wi-Fi 4 & Bluetooth 5, AI instructions, 512KB SRAM Memory and Storage – 16MB octal PSRAM and 16MB QSPI flash Display – 2.4-inch capacitive touchscreen […]

July 17, 2023 by Jean-Luc Aufranc (CNXSoft) - 3 Comments

Espressif ESP-SR enables on-device speech recognition framework on ESP32-S3 and ESP32 WiSoCs

ESP SR ESP32 on device speech recognition AFE

Espressif ESP-SR is a speech recognition framework enabling on-device speech recognition on ESP32 and ESP32-S3 wireless microcontrollers with the latter being recommended due to its vector extension for AI acceleration and larger, high-speech octal SPI PSRAM. The ESP-SR framework was first released on December 17, 2021 with version 1.0, before the v1.20 update was introduced in March of this year, but I only found out about ESP-SR offline speech recognition solution through a tweet by John Lee showing an ESP-SR demo video by @ThatProject. Comrades of the world, liberate your hands from the chains of typing and touching germy switches! Embrace the revolutionary power of speech recognition with ESP32-S3 + ESP-SR. Let your words flow freely, for the proletariat shall not be silenced by keyboards or bourgeois input… pic.twitter.com/bm3udteB3o — John Lee (@EspressifSystem) July 15, 2023 I initially was confused since ESP32 boards have supported speech recognition for years using […]

April 21, 2023April 21, 2023 by Jean-Luc Aufranc (CNXSoft) - 4 Comments

Offline voice recognition module supports Arduino programming, custom voice commands

We’ve already covered inexpensive offline voice recognition modules based on US516P6 or TW-ASR ONE microcontrollers that allow people to add smarts to their projects without a network connection for improved privacy and lower latency. Those are great in theory, but at the time (April 2022) documentation was lacking or only in Chinese, and they were fairly hard to use based on some of the comments in my earlier posts. But today, I’ve noticed DFRobot is now selling the “Gravity: Offline Voice Recognition Sensor – I2C & UART” module with support for Arduino programming, and it looks fairly easy to customize as we’ll see further below. Gravity Voice Recognition DF2301QG module specifications: Voice recognition module – WS-2520-TR module with MCU – TBD 121 commonly used fixed voice commands, one-fixed wake word Support for 1 learned wake-word, 17 user-defined commands Audio Output – Built-in speaker and external speaker interface Input – Dual […]