Status of Embedded GPU Ecosystem – Linux/Mesa Upstream Support (ELC 2018 Video)

Tweet The Embedded Linux Confernce is on-going, and the Linux Foundation has been uploading videos about talks in a timely manner on YouTube. I checked out at RISC-V keynote yesterday, but today I’ve watched a talk by Robert Foss (his real name, not related to FOSS) from Collabora entitled “Progress in the Embedded GPU Ecosystem”, where he discusses open source software support in Linux/Mesa from companies and reverse-engineering support. The first part deals with the history of embedded GPU support, especially when it comes to company support. Intel was the first and offers very good support for their drivers, following by AMD who also is a good citizen. NVIDIA has the Nouveau driver but they did not really backed it up, and Tegra support is apparently sponsored by an aircraft supplier. Other companies have been slower to help, but Qualcomm has made progress since 2015 and now support all their hardware, Broadcom has a “one man team” handling VideoCore IV/V,  …

Vulkan 1.1 and SPIR-V 1.3 Specifications Released

Tweet The Khronos Group released Vulkan 1.0 specifications in 2015 as a successor of OpenGL ES, compatible with OpenGL ES 3.1 or greater capable GPU, and taking less CPU resources thank to – for instance – better use of multi-core processors with support for multiple command buffers that can be created in parallel. A year later, we saw Vulkan efficiency in a demo, since then most vendors have implemented a Vulkan driver for their compatible hardware across multiple operating systems, including Imagination Technologies which recently released Vulkan drivers for Linux. The Khronos Group has now released Vulkan 1.1 and the associated SPIR-V 1.3 language specifications. New functionalities in Vulkan 1.1: Protected Content – Restrict access or copying from resources used for rendering and display, secure playback and display of protected multimedia content Subgroup Operations – Efficient mechanisms that enable parallel shader invocations to communicate, wide variety of parallel computation models supported Some Vulkan 1.0 extensions are now part of Vulkan …

Arm Introduces Mali-G52 & Mali-G31 GPUs, Mali-D51 Display Processor, and Mali-V52 Video Processor for Mainstream Devices

Tweet Arm has just announced four multimedia Mali IP blocks to be found in SoC for mainstream devices: Mali-G52 GPU with 30% faster performance over Mali-G51, and 3.6x better machine learning performance Mali-G31 GPU that’s 20% smaller, and 20% more efficient than Mali-G51, supports OpenGL ES 3.2 and Vulkan APIs Mali-D51 display processor 30% power saving, 50% lower latency compared to Mali-DP650 Mali-V52 video processor supporting 4K60/4K120 content Mali-G52 GPU Arm may have introduced Project Trillium for object detection and machine learning a few weeks ago, but the solution is better suited to premium devices, so the company’s Mali-G52 bitfrost mainstream GPU aims to fill the void for mid-range devices with up to 3.6 times faster machine learning capability over Mali-G51. Based on the first illustration, Mali-G52 will probably be coupled with DynamIQ Cortex A75/A55 processors. Other benefits of the new GPU include 30% more performance density, and 15% better energy efficiency. Mali-G52 will be clocked up to 850 MHz delivering …

Embedded Linux Conference & IoT Summit 2018 Schedule

Tweet The Embedded Linux Conference 2018 and the OpenIoT Summit 2018 will jointly take place next month, on March 12 – 14, 2018 in Portland, Oregon, USA. The former is a “vendor-neutral technical conference for companies and developers using Linux in embedded products”, while the latter is a “technical conference for the developers and architects working on industrial IoT”. The Linux Foundation has already published the schedule, and it’s always useful to learn what will be discussed about even for people who won’t attend. With that in mind, here’s my own virtual schedule with some of the talks I find interesting / relevant to this blog. Monday, March 12 10:50 – 11:40 – Progress in the Embedded GPU Ecosystem by Robert Foss, Collabora Ltd. Ten years ago no one would have expected the embedded GPU ecosystem in Linux to be what it is now. Today, a large number of GPUs have Open Source support and for those that aren’t supported …

T-bao Tbook X8S Pro Apollo Lake Laptop is Equipped with an NVIDIA Geforce GPU

Tweet Most products based on Intel Apollo Lake processor do so to leverage the low cost and low power of the chip that also embeds Intel HD graphics removing the need for an external graphics card. But T-bao Tbook X8S Pro laptop powered by Intel Celeron J3455 Apollo Lake “Desktop” processor also comes with an NVIDIA GeForce 920M GPU which should boost graphics performance. Tbook X8S Pro specifications: SoC – Intel Celeron J3455 quad core Apollo Lake processor @ 1.50 / 2.30 GHz with (unused) 12EU Intel HD Graphics 500; 10W TDP GPU – NVIDIA GeForce 920M @ 954 MHz with 2GB RAM System Memory – 6GB DDR3 Storage – 128GB eMMC flash or M.2 SSD (unclear), micro SD card slot up to 128 GB Display – 15.6″ IPS screen with 1920×1080 resolution Video Output – HDMI output Audio – Built-in microphone, and stereo speakers; 3.5mm audio jack Connectivity – Gigabit Ethernet, dual band 802.11b/g/n/ac WiFi, Bluetooth 4.0 USB – …

Imagination PowerVR Series8XT GT8540 GPU Can Drive up to Six 4K Screens, Supports Hardware Virtualization

Tweet Imagination Technologies introduces PowerVR Furian architecture last year with improved performance, power and density, as well as dual cluster PowerVR Series8XT GT8525 GPU based on the new architecture, and targeting high-end smartphones, virtual reality and automotive products. The company has now introduced a quad cluster Furian GPU called PowerVR Series8XT GT8540 that can simultaneously drive up to six 4K screens at 60fps thanks to an 80% fillrate density improvement, and supports virtualization providing separation of services and applications. The new GPU mostly targets the automotive market with some new cars now requiring multiple screen support with high resolution displays for cluster, Head-Up Display (HUD) and infotainment. Hardware virtualization is equally important for automotive application, as you’ll want to separate safety-critical code, from infotainment applications for example, so if the latter crashes, the safety-critical code can still run unhindered. Each would run on separate shaders processing unit, with up to 8 different  operating systems running in their own protected space: …

Allwinner SoCs with Mali GPU Get Mainline Linux OpenGL ES Support

Tweet OpenGL ES support in Linux for ARM SoC is usually pretty hard to get because of closed source binary blobs coupled with the manufacturers focus on Android. Workarounds include open driver projects such as Freedreno for Qualcomm Adreno GPU, Nouveau for Tegra, or Etnaviv for Vivante GPUs, as well as libhybris library that converts Linux calls into Android calls in order to leverage existing Android GPU binary blobs. Allwinner processors relies on either PoverVR or ARM Mali GPU, and the former does not have any open source project, while some work is still being going for the latter with Lima project, but it’s not ready yet. That means so far, you’re only option was to use libhybris for either GPU family. The good news is that Free Electrons engineers have been working on OpenGL ES support for ARM Mali GPU for Allwinner processor, and have been allowed to release the userspace binary blobs. Not quite as exciting as an …

Imagination Announces PowerVR Series2NX Neural Network Accelerator (NNA), and PowerVR Series9XE and 9XM GPUs

Tweet Imagination Technologies has just made two announcements: one for their PowerVR Series2NX neural network accelerator, and the other for the new high-end GPU families: PowerVR Series9XE and 9XM. PowerVR Series2NX neural network accelerator The companies claims 2NX can deliver twice the performance and half the bandwidth of nearest competitor, and it’s the first dedicated hardware solution with flexible bit-depth support from 16-bit down to 4-bit. Key benefits of their solution (based on market data available in August 2017 from a variety of sources) include: Highest inference/mW IP cores to deliver the lowest power consumption Highest inference/mm2 IP cores to enable the most cost-effective solutions Lowest bandwidth solution with support for fully flexible bit depth for weights and data including low bandwidth modes down to 4-bit 2048 MACs/cycle in a single core, with the ability to go to higher levels with multi core The PowerVR 2NX NNA is expected to be found in smartphone and other mobile devices leveraging Tensorflow …