Archive

Posts Tagged ‘gpu’

Imagination PowerVR GT7900 GPU is Made for Game Consoles and Notebooks

February 27th, 2015 3 comments

Imaginatino Technologies has just introduced the most powerful GPU of their new PowerVR Series7 GPU family with GT7900 featuring 16 unified shading clusters and 512 ALU cores for game consoles, micro-servers, and notebooks.

PowerVR_Series_7XT_GPUPowerVR GT7900 is still an embedded GPU with support for OpenGL ES 3.1 + AEP (Android Extension Pack) compatibility, full hardware support for tessellation and ASTC LDR and HDR texture compression standards. The new GPU delivers up to 800 GFLOPS (FP32) or 1.6 TFLOPS (FP16) graphics and compute performance at 800 MHz (using 16 nm FinFET+ manufacturing process), which compares to about 550 GFLOPS (FP32) for an Nvidia GeForce GT730M at comparable frequency. This is even more powerful than the Maxwell GPU found in Tegra X1 which delivers 1+ TFLOPS (FP16), although the Nvidia processor may have a lower power envelope.

FT7900 Block Diagram

GT7900 Block Diagram (Click to Enlarge)

Combined with PowerVR Series5 decode and encode video processors and PowerVR V2500 camera processor, GT7900 can be integrated into SoC that support 4K video output, playback (10-bit @ 60 fps) and recording.

Imagination Technologies did not disclose information regarding availability, but they may provide further details at the Game Developers Conference (GDC) 2015 and Mobile World Congress (MWC) 2015.

Digg This
Reddit This
Stumble Now!
Buzz This
Vote on DZone
Share on Facebook
Bookmark this on Delicious
Kick It on DotNetKicks.com
Share on LinkedIn
Bookmark this on Technorati
Post on Twitter

Categories: Graphics Tags: games, gpu, imagination

ARM Releases Kernel Drivers for Mali-T880 / T860 GPUs, User Space Drivers for Mali-T76x GPUs

February 23rd, 2015 16 comments

ARM Mali GPU drivers includes both open source kernel drivers, and binary userspace drivers supporting framebuffer and/ior X11 implementation. The former is rarely an issue and is quickly released, but the latter requires porting and testing for a specific hardware platform, as well legal work, which greatly delay the releases.

ARM_Mali_GPU_Drivers

Release r5p0-06rel0 for User Space Binary Drivers

Mali-T880 GPU was announced at the beginning of the month together with ARM Cortex A72, and on February 17, 2015, ARM released an update to their Mali-T600 series, Mali-T700 series & Mali-T860/T880 GPU kernel device drivers with revision r5p1-00rel0 that adds supports to Mali-T860 and Mali-T880 GPU. These open source drivers are available for Android and Linux, and also support early Mali-T700 and T600 GPUs.

Separately, the company has also released Mali-T76X GPU drivers for Firefly board powered by Rockchip RK3288 quad core Cortex A17 processor featuring a Mali-T764 GPU. The first release only supports the framebuffer driver, but ARM is expecting to be able to release the X11 version in the next release (r5p1) planned at the end of March, which means some Linux desktop graphics accelerated will soon be available on Rockchip RK3288, and not only some OpenGL ES 3.0 demos on the framebuffer. The latest release (r5p0-06rel0) also supports Exynos powered Arndale Octa board, Samsung Chromebook 2, Arndale board, and Samsung Chromebook. According to an ARM representative, Rockchip also plans to release their own Linux GPU drivers targeting “TopMetal” hardware platform (should probably read PopMetal).

Digg This
Reddit This
Stumble Now!
Buzz This
Vote on DZone
Share on Facebook
Bookmark this on Delicious
Kick It on DotNetKicks.com
Share on LinkedIn
Bookmark this on Technorati
Post on Twitter

ARM Unveils Cortex A72 Processor and Mali-T880 GPU

February 4th, 2015 4 comments

ARM has just announced their new Cortex A72 ARMv8 core with 3.5 times the performance of Cortex A15 ARMv7 cores, together with Mali-T880 GPU delivering 1.8 times the performance of  Mali-T760, and CoreLink CCI-500 Cache Coherent Interconnect to link the new CPU, GPU and I/Os together.

ARM Cortex A72

Cortex-A72Some of the key features of the new core include:

  • Architecture – ARMv8-A
  • Multicore – 1-4x SMP within a single processor cluster, and multiple coherent SMP processor clusters through AMBA 5 CHI or AMBA 4 ACE technology
  • ISA Support
    • AArch32 for full backward compatibility with ARMv7
    • AArch64 for 64-bit support and new architectural features
    • TrustZone security technology
    • NEON Advanced SIMD
    • DSP & SIMD extensions
    • VFPv4 Floating point
    • Hardware virtualization support
  • Cache – 48KB I-cache, 32KB D-cache, and 512 KB to 2MB L2 cache with ECC
  • Debug & Trace – CoreSight DK-A57
  • Process – 16nm FinFET

A cluster can support up to 4 Cortex A72 cores clocked up to 2.5 GHz in mobile devices and higher frequencies for larger form factor devices such as servers. Cortex A72 cores may also be combined with low power Cortex A53 cores in big.LITTLE configuration for power efficient SoCs.

Cortex_A72_Performance

Relative Performance – Cortex A15 vs Cortex A57 vs Cortex A72

ARM also claims 75% power usage reduction in typical mobile workloads thanks to the new 16nm FinFET manufacturing processor. Expected applications include premium smartphones, larger screen mobile devices, enterprise networking, servers, wireless infrastructure, digital TV, and automotive ADAS/IVAI.

ARM Mali-T880 GPU

Mali-T880_GPUTechnical specifications for Mali-T880 GPU:

  • Anti-Aliasing – 4x MSAA, 8x MSAA, 16x MSAA
  • API Support
    • OpenGL ES 1.1, 1.2, 2.0, 3.0, 3.1
    • OpenCL 1.1, 1.2
    • DirectX 11 FL11_2
    • RenderScript
  • Bus Interface – AMBA4, ACE-LITE
  • L2 Cache – 256 to 2048KB (256 to 512KB for every 4 shader cores
  • Memory System – Virtual Memory with built-in Memory Management Unit (MMU)
  • Multi-Core Scaling – 1 to 16 cores
  • Adaptive Scalable Texture Compression (ATSC) – Low dynamic range (LDR) and high dynamic range (HDR) for 2D and 3D images
  • ARM Frame Buffer Compression (AFBC) – 4×4 pixel block size (lossless image compression format)
  • Transaction Elimination – 16×16 pixel block size (Only performs a partial update to the frame buffer with the changed pixel blocks)
  • Smart Composition – 16×16 pixel block size (Identical pixel blocks of input surfaces are not read, not processed for composition and not written to final frame buffer)

The GPU is also manufactured with 16nm FinFET process. Mali-T880MP16 can be clocked up to 850 MHz, and outputs up to 1700 million triangles per second, and 13.6 gigapixels per second. That’s 1.8 times better performance than Mali-T760, and ARM also claims 40% more energy efficiency.

ARM Cortex A75 processor, Mali-T880 GPU, CoreLink-C500 cache coherent interconnect, Mali-DP550 display processor unit, and  Mali-V550 video processor can be combined to create SoCs support 4K video output, and decoding up to 120 fps.

Ten partners have already become licensees including Rockchip, Mediatek, and HiSilicon, and products are expected in (early) 2016. Further details may be found on ARM’s Cortex A72 and Mali-T880 product pages.

Digg This
Reddit This
Stumble Now!
Buzz This
Vote on DZone
Share on Facebook
Bookmark this on Delicious
Kick It on DotNetKicks.com
Share on LinkedIn
Bookmark this on Technorati
Post on Twitter

FOSDEM 2015 Schedule – January 31 – February 1 2015

January 29th, 2015 8 comments

FOSDEM (Free and Open Source Software Developers’ European Meeting) takes place every year during the first week-end of February. This year the developer-oriented event expects to bring over 5000 geeks to share ideas and collaborate on open source projects. Contrary to most other events, it’s free to attend, and you don’t even need to register, just show up. FOSDEM 2015 will take place on January 31- February 1 in Brussels.

Fosdem_2015There will be 551 sessions divided into 5 keynotes, 40 lightning talks, 6 certification exams, and with the bulk being developer rooms and main tracks,  divided into 7 main tracks this year: Languages, Performance, Time, Typesetting, Hardware, Security and Miscellaneous.

I’m not going to attend, but it’s still interested to see what will be talked about, and I’ve concocted my own little virtual program out of the main tracks and developers’ rooms. There’s a few minutes overlap between some talks on Sunday.. Oh well.

If you won’t be able to attend, you should be able to watch the video and access the slides in a few weeks, as most sessions will be recorded.

What is the current status of Allwinner support in upstream u-boot and the kernel, which SoCs are supported, and which features (sound, video, etc.) are supported ?

The linux-sunxi community has been slowly but steadily working on getting Allwinner SoCs like the A10 supported in upstream u-boot and the kernel.

This talk will present the current status of Allwinner support upstream. Which SoCs are supported and which ones are not (yet) supported ? Which blocks if the supported SoCs are supported, and which are not ? Why are some SoCs / blocks not supported, and what are the plans to get them supported ? This are some of the questions this talk tries to answer.

Not all free operating systems are feature-full POSIX systems. FreeRTOS is a minimal operating system which is designed to run on micro-controllers, and provide real-time scheduling. It is used in industrial automation and automotive.

A brief introduction to FreeRTOS, depending on audience preference, will be followed by either a hands-on workshop using PCs, or a demonstration on a board. The workshop includes how to get started, what can be done with it, and what type of features and pitfalls to expect from FreeRTOS.

As ADAS and infotainment require more electronics, using an hypervisor is a solution to gather multiple boards into one. Xvisor is an open source lightweight hypervisor for embedded systems that perfectly fits the needs of the automative industry. It is a complete monolithic type-1 hypervisor with full virtualization and paravirtualisation support, showing better performances than KVM. We, OpenWide and the Institute for Technological Research SystemX, are working on its port on i.MX6 boards.

F*watch is an infinitely hackable GPS watch with many sensors based on a 100% Free design. Everything is Free, from the PCB and watch housing design to the software stack. Moreover, only Free software tools have been used during the development.

F*watch. Why should your watch be different?

The talk describes the development process and shows a first prototype, along with performance measurements and future plans.

The lowRISC project was established in the summer of 2014 with the aim of producing a complete open-source System-on-Chip in volume, with low-cost development boards. Alex Bradbury, one of the co-founders of the project will discuss the progress to date and the path to the first test chip. lowRISC implements the open RISC-V instruction set architecture and is exploring ideas on improving security via tagged memory and increasing flexibility through the addition of RISC-V ‘Minion’ cores to implement soft peripherals. This talk will discuss the potential benefits of a fully open-source hardware ecosystem, the challenges of getting to first silicon, and how the open source community at large can help.

Digital cameras provide almost every feature you could want. But if they don’t, you are forced to upgrade or go without. CHDK is a project which allows you to program new functionality to the majority of Canon cameras, in either C, Lua, or Basic. The talk features background on the project, code, tools, and the methods of compiling and introducing a new firmware into the camera.

Over the course of 1 hour, Steven Goodwin will guide the audience through the entire process of taking a normal (proprietary) camera and converting it into an open source version by installing custom firmware on it. He will then cover some of the features available (such as the on-device scripting language) and continue by explaining how to build and debug your own functionality. Starting with simple grids, continuing with games, and time-lapse code. And ending with a fully recompiled firmware running on the device.

The video4linux kernel subsystem reports which colorspace the captured video uses. But what does that really mean, and what do you have to do to correctly reproduce those colors? This talk will dive into the crazy world of colorspaces and give you a practical guide to colorspace handling. I will also demonstrate colorspace handling, both right and wrong.

Kernel profiling tools status on ARM and ARM64: – perf status, – ARM and ARM64 support, – callchain unwinding mechanisms and support, – patches status: merged, pending, in development, – links to discussions (LKML) and patches.

The profiling tools in the kernel are changing at a fast pace. This talk is about the support for ARM and ARM64 architecture and the development of features for these architectures, namely the callchain unwinding. The presentation goes over: – the detailed description of the feature, – the methods used to do the callchain unwinding (fp, exidx, dwarf etc.), – the status of the on-going patches, – the remaining work to be done, – the links to patches, discussions on the mailing lists, – -if needed and if time allows- a demo of the feature.

Building a medical device requires to follow certain rules specially when health care depend on it. The presentation will explain how Yocto help us in Kaptalia to solve this issue. In particular we will focus on fast boot, update with unskilled user base, Bluetooth Low Energy, security and data privacy.

During this event we will show how our team succeeded to build our first OS, start from a company with medical expert only and no prior expertise on embedded systems. At the end, a live demonstration for using the the monitor and sensor will be held.

LAVA is a python service created by Linaro for testing software on hardware which accepts test jobs to perform on selected hardware to provide a black box to continuous integration tests. Bisecting is a technique for finding commit in version control system that broke the software. Git provides the powerful “git bisect” subcommand for this purposes. In this talk we give and introduction to LAVA and explain howto combine LAVA and git bisect to automatically find offending commits in the Linux kernel.

Prospero Technologies has made a Linux based Digital Video Recorder which constantly records all UK broadcast TV so that the consumer no longer needs to schedule recordings. This will be a talk on the technologies used to achieve this, the open source software on the consumer device and how you can build your own 30 channel DVR.

The final version of the DVR uses a Freescale i.MX6 CPU with a video processing unit running a Linux built with Yocto. The talk will cover how well this is supported by gstreamer and how we built a QT application to display our HTML5 interface.

More and more embedded projects require support for advance connectivity. With it, comes the requirement to enforce a better security as well as private data protection. Using the layer model of Yocto, we show how we can extract from a complex project such as Tizen, advance connectivity and security and apply it to any embedded project.

The Internet of Things (IoT) is growing fast and opens large opportunities to embedded Linux. Unfortunately traditional embedded Linux has been weak when it comes to security and complex connectivity enabling. Tizen which has been developed as a Linux base OS for connected object (phone, TV, car) is on the other side very well equipped in that area. We will start by explaining what is Tizen architecture and how it provides Security and Connectivity facilities on top of a base Linux. We will then show how Yocto and Tizen-meta can be used to create embedded devices which benefit from several years of work done by the Tizen community. In particular we will review : – the mandatory access control enabling in an embedded device – the enforcement of good behavior by applications – resource access control – connectivity layers – HTML5 App enabling. – multi user mode enabling.

The ARM LLVM backend has been around for many years and generates high quality code, yet there are still standard benchmarks where GCC is generating more efficient code than LLVM. The goal of this talk is to get a better understanding of why the GCC-generated code for those benchmarks is executing more efficiently and also about finding out what we need to do on the LLVM side to address those code generation deficiencies. This talk presents current performance numbers for the SPEC CPU benchmark suites on ARM, comparing the performance of LLVM and GCC, with the main focus on the SPEC CPU integer benchmarks. To dive a little bit deeper, we will also have a closer look at the generated assembly code of selected benchmarks where LLVM is performing worse than GCC and use the results of this performance analysis to point out potential code generation opportunities for LLVM.

Connectivity is crucial for Internet of Things concept. For moving devices like position data loggers is typical solution GSM network. I will show you how you can use different types of GSM network for your IoT projects.

GSM network is easy way how to connect almost any device to internet. There are lot of GSM modules on market from different vendors but all devices has one thing in common – AT commands. There is standardized AT commands set for GSM networks. Using AT command you can send text messages, read phone number from list on SIM card, connect to internet and much more. I will show you basic command set for HTTP communication using basic GSM module SIM900 and Arduino.

This talk will give an overview over the Linux backports project and how to use it. The Linux backports project makes it possible to use a driver from a recent Linux mainline kernel with an older kernel version.

When you have a vendor board support package which does not use a bleeding edge mainline kernel, like it is the case most times, but you want to use some driver from a bleeding edge Linux kernel you can use backports. Backports “automatically” generates a tar with many drivers from a specific Linux mainline kernel which can be used with older kernel versions.

In this talk I will describe how the backports project, with its compatibility layer, the spatches and the normal patches. For practical usage I will show how to use backports with your own kernel in addition I will give a brief overview on how to add a new driver to backports.

Patchwork is a toolkit for connecting various devices into a network of things or, in a more broad case – Internet of Things (IoT). The main goal of creating this toolkit is to have a lightweight set of components that can help to quickly integrate different devices (i.e. Arduino, Raspberry Pi, Plugwise, etc) into a smart environment and expose specific devices’ capabilities as RESTful/SOAP/CoAP/MQTT/etc services and data streams.

The key features of patchwork include:

  • Lightweight (no RAM-consuming sliced pie of Java and OSGi, only bare necessities)
  • Cross-platform (can be deployed on OSX/Linux/Windows, tested on Raspberry Pi and BeagleBone Black boards)
  • Language-agnostic (device agents can be written in any programming language, APIs can be consumed by app written in any programming language)
  • Easily deployable (no JARs, no Eggs or Wheels for the core components, just a single native binary with statically linked dependencies)
  • Easily extendable (integrate new devices without modification of the core components, drop in solution)
  • Interchangeable (not happy with current existing Device Gateway or Catalog? replace it with another implementation without breaking the infrastructure)
  • Not re-inventing the wheel (we re-use as many existing technologies and components as possible)

libcurl is the world’s most used and most popular Internet transfer library, already used in every imaginable sort of embedded device out there. How did this happen and how do you use libcurl to transfer data to or from your device?

Embedded devices are very often network connected these days. Network connected embedded devices often need to transfer data to and from them as clients, using one or more of the popular internet protocols.

Daniel once founded the project and is still lead developer and maintainer of the curl project, making curl and libcurl. He is also active within IETF and maintain several other open source projects. Daniel is employed by Mozilla.

This presentation will reveal the process of porting Tizen:Common to open source hardware developer boards with SoC manufactured by Allwinner, Rockchip or Intel such as OLinuXino, Radxa Rock, Minnowboard. The following topics will be covered:

  • Building Tizen ARMv7 and x86 images from scratch
  • Adapting the Linux kernel, bootloader and Tizen:Common to popular single board computers
  • Do it yourself (DIY) open-source hardware Tizen tablet or laptop
  • Sharing knowledge and experience of the community.
The presentation will also provide information about U-Boot, Yocto project, the Linux-Sunxi and Linux-Rockchip, Minnowboard communities.

Although Tegra K1 uses the same Kepler architecture as NVIDIA desktop cards that Nouveau already supports, there are other challenges that need to be addressed before Nouveau can drive K1’s graphic acceleration: the fact that the GPU does not reside on the PCI bus requires architectural changes in the Nouveau core. The absence of dedicated GPU memory directly interferes with the way Nouveau is used to do memory management and leads to potentially sub-optimal behavior. Also, in a system where all devices share the same system memory, PRIME support is mandatory to perform any useful work and the relevance of a driver-agnostic memory allocator becomes perceptible.

This talk will discuss these challenges, and in particular the consequences of using a unified memory architecture, in the hope of triggering discussions that will help improving the general support of GPU architectures for new mobile platforms.

A brief look at the past, present, and future of the KiCad project. The discussion will be primarily on what near and long term future development is planned for the project as well as discussing the potential for collaboration with other EDA projects.

Yocto has an alleged steep learning curve. It can be a challenge for modules and evaluation board manufacturers to add support for their devices in Yocto as they don’t necessarily have a software background. This talk will highlight the steps required, techniques and good practices to create a well integrated machine configuration allowing to build images using the Yocto Linux build system. The Crystalfontz support from meta-fsl-arm-extra will be used to illustrate the talk.

The bitbox console is a small open hardware & open source game console. I will present the rationale behind it and the current status of the project, detail the hardware conception and particularly video signal generation from a cortex-m4 chip with no video subsystem. I will then proceed to show the different elements of the software stack : kernel, video engines, the boot loader and, finally, current programs and games, including a Gameboy emulator and a full motion video player.

If you want to build your own schedule before going, you can check the full list of events by subjects, but an easier way to organize your day is to check the sessions in chronological order, by checking out Saturday and Sunday schedules.

Digg This
Reddit This
Stumble Now!
Buzz This
Vote on DZone
Share on Facebook
Bookmark this on Delicious
Kick It on DotNetKicks.com
Share on LinkedIn
Bookmark this on Technorati
Post on Twitter

Linux 3D GPU Acceleration Demo on Rockchip RK3288 based Firefly Board

January 26th, 2015 17 comments

Several devices and development boards based on Rockchip RK3288 processor already support Linux, usually with images based on Ubuntu, including Ugoos UM3 / UT3, Open Hour Chameleon, and Firefly-RK3288 among others. What these images lack however, is support for 2D and 3D graphics acceleration with the GPU, and hardware video decoding/encoding with the VPU. But Jas-hacks has made some progress with 3D graphics support.

Rockchip_RK3288_3D_GPU_AccelerationHe has managed to add GPU acceleration via EGL/OpenGL ES bringing 3D acceleration to the platform, but 2D support is not there yet, meaning X11 still heavily relies on software rendering. But the implementation is still good enough to run the usual es2gears and glmark2-es2 benchmarks. as well as some WebGL demos in Chromium. Performance is currently underwhelming, with 50 points in glmark-es2, even a bit lower than the score (54 points) I got with ODROID-X board powered by Exynos 4412 SoC with a Mali-400MP4 GPU, and running Unity desktop in Ubuntu Linaro 12.11. The desktop environment used also appears to greatly affect glmark-es2 performance, with Unity being much slower than LXDE for example. But let’s get to the demo.

Since the image used for the demo is a minimal image for testing, it has not been released, and you can’t try by yourself just yet. But I’m sure GPU acceleration will eventually make it to Ubuntu for Firefly-RK3288, and possibly other RK3288 hardware platforms where the manufacturer or/and the community are involved with Linux development.

Digg This
Reddit This
Stumble Now!
Buzz This
Vote on DZone
Share on Facebook
Bookmark this on Delicious
Kick It on DotNetKicks.com
Share on LinkedIn
Bookmark this on Technorati
Post on Twitter

Nvidia Announces Tegra X1 64-bit Octa-core Processor with 256-Core Maxwell GPU

January 5th, 2015 15 comments

CES 2015 has started, and the first company to present their innovations is Nvidia. Last year their introduced Tegra K1, the first mobile processor with a desktop GPU with support for OpenGL. They’ve now raised the bar with Tegra X1, an octa-core 64-bit ARM processor with a 256-core Maxwell GPU delivering 1.5x to 2x the performance of Tegra K1, and supporting 4K60 video playaback with H.265 and VP9 codecs.

Tegra_X1Some geekbench benchmark comparing X1, K1, and Apple A8x.

Tegra_X1_BenchmarkNvidia CEO also claiemd Tegra X1 is the first mobie SoC delivering over 1 Teraflops, something that was achieved around year 2,000 with 20,000 Pentium pro processor consuming 1 millions Watts, while X1 SoC only consumes about 10 Watts.

Nvidia_Tegra_X1_TeraflopsPower efficiency is said to have improved too, as Maxwell GPU is said to delivers more performance with similar power efficiency.

Nvidia_Tegra_X1_power_efficiency

They also introced Drie CX mini computer for automotive application featuring Tegra X1. It can handle 16.6M pixel, or 2x 2K displays @ 60 Hz, and runs QNX, Linux, or Android.
Nvidia_Drive_CX_mini_ComputerThe company also showcase a Cockpit demo during the presentation.

And they’ve also announced Drive PX board with two Tegra X1 processors, 12-camera inputs for autonomous cars that drive themselves.

Nvidia_Drive_PXThe board uses a new technology called Deep Neural Networks for better object detection.

More information should be published soon on Nvidia Tegra page.

Update. Some specs:

Tegra X1 supports all major graphics standards, including Unreal Engine 4, DirectX 12, OpenGL 4.5, CUDA®, OpenGL ES 3.1 and the Android Extension Pack, making it easier for developers to bring PC games to mobile.
Tegra X1’s technical specifications include:

  • 256-core Maxwell GPU
  • 8 CPU cores (4x ARM Cortex A57 + 4x ARM Cortex A53)
  • 60 fps 4K video (H.265, H.264, VP9)
  • 1.3 gigapixel of camera throughput
  • 20nm process
Digg This
Reddit This
Stumble Now!
Buzz This
Vote on DZone
Share on Facebook
Bookmark this on Delicious
Kick It on DotNetKicks.com
Share on LinkedIn
Bookmark this on Technorati
Post on Twitter

3D Graphics Acceleration in Linux on Allwinner A80 based Cubieboard4

January 2nd, 2015 11 comments

Allwinner A80 is a powerful octa-core processor found in development boards and TV boxes such as Cubieboard4 or Tronsmart Draco AW80. Some early Ubuntu images and instructions had already been released for A80 Optimusboard and Draco AW80, but none of these featured GPU drivers for 3D acceleration, which to be honest, has limited advantages in Linux desktop distributions since desktop environments and most apps require full OpenGL support, i.e. not only OpenGL ES, and the only ARM SoC that can provide OpenGL support without external graphics card is Nvidia Tegra K1 SoC. Having said that GPU drivers would pave the way for smooth OpenELEC / Kodi user interface support in Allwinner A80 Linux distributions. That’s only one part of the puzzle, since the GPU normally handles the user interface, while the VPU takes care of video decoding.

Cubieboar4_Linux_GPU_DriversThe good news is that CubieTech release updates images for their Cubieboard4 (CC-A80) development with PowerVR GC6200 GPU support, which you can download on Baidu:

  • linaro-cb4-emmc-vga-v0.3.img.7z is the eMMC flash image with VGA output
  • linaro-cb4-emmc-hdmi-v0.3.img.7z.md5 is the eMMC flash image with HDMI output

The company also provided instructions they followed to create the image, using two files they got from Allwinner rogue_km.tar.gz , and discimage-release-1.4-fix_buffer_ideas_20141216_no_gl.tar.gz, available on Cubieboard server.

The steps below have been completed in an Ubuntu 14.04 computer’s terminal window, and may need some corrections, since I’ve mostly edited them from an email but not tried myself:

  1. Build drivers
    tar -zxpf rogue_km.tar.gz
    cp -rf rogue_km linux-3.4/modules
    cd rogue_km/build/linux/sunxi_linux && make -j4

    The two drivers files dc_drmfbdev.ko and pvrsrvkm.ko can be found in linux-3.4/modules/rogue_km/binary_sunxi_linux_xorg_release/target_armhf directory

  2. Copy drivers to SD card
    You’ll need to download and extract Linaro Ubuntu 14.04 rootfs, and copy the drivers to a bootable sdcard in the rootfs partition

    mkdir -p <path_to_rootfs>/lib/modules/3.4.3/extra
    cp dc_drmfbdev.ko pvrsrvkm.ko <path_to_rootfs>/lib/modules/3.4.39/extra/
    sync 
  3. Copy libraries to rootfs
    sudo tar -zxpf discimage-release-1.4-fix_buffer_ideas_20141216_no_gl.tar.gz
    cd <path_to_rootfs>/usr/local/pvr
    cp -rf etc/ <path_to_rootfs>/
    cp -rfd include share <path_to_rootfs>/usr
    cp -rfd lib/xorg <path_to_rootfs>/usr/lib
    cp -rfd lib/* <path_to_rootfs>/usr/lib/arm-linux-gnueabihf
    cp -rf ssl <path_to_rootfs>/etc
    cp -rf ssl/misc/ <path_to_rootfs>/usr/lib/ssl
    cd discimage/usr/
    cp -rfd lib/* <path_to_rootfs>/usr/lib/arm-linux-gnueabihf
    cp -rf local/bin <path_to_rootfs>/usr/local
    sync
  4. Insert the SD card into Cubieboard4 board, boot, and complete the steps as follows:
    mkdir -pv /usr/local/pvr/lib/dri/
     cd /usr/local/pvr/lib/dri/
    ln -s /usr/lib/arm-linux-gnueabihf/dri/pvr_dri.so
    ln -s /usr/lib/arm-linux-gnueabihcdf/dri/swrast_dri.so
    rm /usr/lib/arm-linux-gnueabihf/mesa-egl -rf
    cd /usr/lib/arm-linux-gnueabihf
    ln -s libGLESv2.so.1.4.3064661 libGLESv2.so.2
    

    Edit  /etc/modules to add the two lines:

    dc_drmfbdev
    pvrsrvkm

    Insert the modules, run depmod, and reboot to complete the installation:

    insmod /lib/modules/3.4.39/extra/dc_drmfbdev.ko
    insmod /lib/modules/3.4.39/extra/
    depmod
    reboot

    Done!

Now you can test 3D graphics acceleration works with glmark2-es2 or es2gears:

sudo apt-get install glmark2-es2
sudo glmark2-es2 

Thanks to Ovidiu for the info.

Digg This
Reddit This
Stumble Now!
Buzz This
Vote on DZone
Share on Facebook
Bookmark this on Delicious
Kick It on DotNetKicks.com
Share on LinkedIn
Bookmark this on Technorati
Post on Twitter

TyGL OpenGL ES 2.0 Backend for WebKit Speeds Up Web Rendering by Up to 11 Times

December 23rd, 2014 3 comments

ARM, Szeged University in Hungary, and Samsung Research UK have been working on TyGL, a new backend for WebKit accelerated with OpenGL ES2.0, and developed and tested on ARM Mali-T628 GPU found in Samsung ARM Chromebook. It will typically provide 1.5 to 4.5 times higher performance, but in the best cases, it can achieve up to eleven times the performance of a CPU-only rendered page.

TyGL_ScreenshotThe key features of TyGL include:

  • Web rendering accelerated by GPU - Batching of draw calls delivers better results on GPUs. TyGL groups commands together to avoid frequent state changes while calling the Graphics Context API.
  • Automatic shader generationTyGL generates complex shaders from multiple shader fragments, and ensures the batches fit into the shader cache of the GPU.
  • Trapezoid based path rendering – Work in progress. It will leverage GPU capabilities such as the Pixel Local Storage extension for OpenGL ES.
  • No software fallback – Complete GPU-based hardware accelerated solution with no dependency on legacy software.

You can get more technical details about the implementation on TyGL: Hardware Accelerated Web Rendering blog post on ARM community.

They have now officially published benchmark results, but I found some benchmark results on Webkit mailing list:

Since EFL supports cairo, we compared EFL-TyGL and EFL-Cairo

The other good news is that TyGL is now open source, with the code available on github, and you can build it and give it a try on ARM Mali-T62X development boards such as Arndale Octa or ODROID-XU3 (Lite) running Ubuntu Linaro 14.04, or other Linux based distributions. The complete build is said to last about 10 hours, but this will obviously depend on your machine. TyGL should also work on other mobile GPU supporting OpenGL ES 2.0, but I understand this has not been tested yet.

Digg This
Reddit This
Stumble Now!
Buzz This
Vote on DZone
Share on Facebook
Bookmark this on Delicious
Kick It on DotNetKicks.com
Share on LinkedIn
Bookmark this on Technorati
Post on Twitter