UP Xtreme ARL AI Dev Kit review – Benchmarks and AI workloads on an Intel Core Ultra 5 225H Arrow Lake SBC

That’s the last part of my review of three Intel-based UP AI development kits, and after testing the UP TWL AI Dev Kit with an Intel N150 CPU and the UP Squared Pro TWL AI Dev Kit with an Intel N150 CPU coupled with an Hailo-8L M.2 AI accelerator, I’ll now report my experience with the high-end UP Xtreme ARL AI Dev Kit with a 14-core Intel Core Ultra 5 225H “Arrow Lake” single board computer with Intel Arc 130T graphics delivering up to a combined 83 TOPS of AI performance.

I’ve followed the same procedure as with the previous models, using the pre-installed Ubuntu 24.04 Pro operating system to report system information, run some benchmarks, and go through AI workloads using Nx Meta and the AAEON UP AI Toolkit system. It just ran additional benchmarks and tests since it’s my first Intel Arrow Lake platform.

UP Xtreme ARL AI Dev Kit review

UP Xtreme ARL system information on Ubuntu 24.04

The UP Xtreme ARL SBC (AAEON UPX-ARL01) comes preloaded with Ubuntu 24.04.3 LTS installed on a 256.1 GB NVMe SSD, and the system is powered by a 14-core Intel Core Ultra 225H processor with 16 GB of RAM, as advertised.

AAEON UPX-ARL01 Intel Core Ultra 5 225H About Ubuntu 24.04

The inxi utility provides more insights into the system:


Everything appears to be detected properly, including the Gigabit Ethernet port, the 2.5 Gbps Ethernet port, the Sunplus USB FHD camera (UP USB camera), and the Kingston OM8PGP4256Q-A0 SSD. One oddity is the max CPU frequency reported by the system: 6300:6100:2500 for the P-cores, E-cores, and  LPE-cores. The P-cores and E-cores are supposed to max out at 4.9 GHz and 4.3 GHz, respectively, so that probably means the OPP tables are wrongly configured, and all benchmarks and utilities will report the wrong frequencies as above.

UP Xtreme ARL benchmarks

Let’s start the benchmarks with Thomas Kaiser’s sbc-bench.sh:


The 7-Zip benchmark’s first run achieved 50422 MIPS, and dropped to 44,687 and 44,986 for the second and third runs once the power limits kicked in. We can better understand what may have happened when monitoring the CPU frequency and temperature.  We can see a clear spike in the CPU frequency up to 4.9 GHz and the CPU temperature up to 100°C when a short burst of power is needed.Intel Core Ultra 5 225H SBC CPU Temperature spikes

But after the initial spike and longer multi-core tests, the CPU frequency stabilizes to around 3 GHz, and the temperature is at a manageable 64- 65°C.

Intel Core Ultra 5 225H SBC multi-core workload CPU Temperature stableLet’s check the power limits:


PL1, PL2, and PL4 power limits are set to 28W, 60W, and 120W, while Intel lists the PBP at 28W and MTP at 115W.

Contrary to what we experienced with the Intel N150 SBCs, there aren’t any options to change the power limits in the Core Ultra 5 225H SBC. It might be possible to use userspace for those who need to adjust the power limits up or down.

Intel Core Ultra 5 225H Arrow Lake CPU Configuration Aptio Setup AMI BIOS

We can test the CPU performance by running Geekbench 6.5.0 single-core and multi-core benchmarks.

AAEON UP Xtreme ARL Geekbench 6.5.0

The Intel Core i5 225H processor achieved 2,837 points for the single-core test and 13,205 points for the multicore test.

While we’ll test AI workloads in more detail a little below, I also tried to run Geekbench AI to evaluate the performance for artificial intelligence tasks. But the benchmark defaults to the TensorFlow Lite framework on the CPU by default… So it’s not representative of the actual performance of the Intel SoC.

AAEON UPX-ARL01 Geekbench AI score CPU

In that case, GeekBench AI reports 2,086 points for the single-precision score, 2073 points for the half-precision score, and 1534 points for the quantized score.

Let’s see if we can use the GPU or NPU (Intel AI Boost) too:


The NPU is not listed, but we can run the benchmark again using the OpenVINO framework on the Intel Arc 130T Graphics iGPU:


 

AAEON UPX-ARL01 Geekbench AI score Intel Arc GPU

This makes a huge difference, and now GeekBench AI reports 8,585 points for the single-precision score, 17,122 points for the half-precision score, and 21,702 points for the quantized score.

Let’s now test GPU performance with Unigine Heaven Benchmark 4.0, and the Intel Core Ultra 5 225H SBC achieved a score of 1,608 points while rendering the scene at 63.8 FPS on average at the standard 1920×1080 resolution using the built-in Intel Arc 130T Graphics.

Intel Core Ultra 5 225H Unigine Heaven Benchmark 4.0

I also tested several 4K or 8K YouTube videos in Firefox and Chrome.

Intel Core Ultra 5 225H 4Kp60 YouTube VP9 Firefox Ubuntu

For the first test, I searched for an 8K 60 FPS video on YouTube and played it in Firefox. The maximum resolution was 2160p60, so no 4K options for this VP9 video. It was somewhat watchable, but I still had 827 dropped frames out of 11193. (7.4% loss)

Intel Core Ultra 5 225H 4Kp60 YouTube VP9 Chrome Ubuntu

Since I could not select 8K, I decided to try it in Chrome. But I had the same limitation. Playback felt quite smoother, and the “Stats for Nerds” overlay window confirmed it with 106 frames dropped out of 9743 (1.1% loss).

Intel Core Ultra 5 225H 8Kp60 YouTube AV1 Firefox Ubuntu

I suspected getting a video with VP9 may have been the culprit here. So I browsed YouTube to find an 8K AV1 video (it took a few tries), and I could select 4320p60 in Firefox. It reported 1,234 frames dropped out of 22,541, or a 5.4% loss, better than the VP9 video at 4K 60 FPS.

Intel Core Ultra 5 225H 8Kp60 YouTube AV1 Chrome Ubuntu

Switching to Chrome works too, and I had even fewer dropped frames (222 out of 12,426) for a loss of about 1.8% of frames.

Intel Core Ultra 5 225H 4Kp60 YouTube AV1 Chrome Ubuntu

Finally, I also used that video to test 4Kp60 AV1 playback in Chrome for a longer 40+ minutes test. Results: 517 frames dropped out of 163,523, or 0.3% loss. So we’d recommend AV1 video whenever possible, and Chrome worked better than Firefox with the videos we tried.

I ran Speedometer 2.0 in Firefox to test the web browser performance.

Speedometer 2.0 Intel Core Ultra 5 225H CPU

375 runs per minute must be the best score I’ve ever had. Speedometer 2.0 is deprecated, and I still use it for comparison with older results. Let’s also run the more recent Speedometer 3.0.

Speedometer 3.0 Intel Core Ultra 5 225H CPU

That would be 23.9 points, also a pretty high web browsing score, with the caveat that web browser scores tend to increase over time as the software gets optimized for speed.

UP Xtreme ARL benchmarks compared to Intel/AMD mini PCs

Since the UP Xtreme ARL is the first Intel Core Ultra (Series 2) Arrow Lake I’ve reviewed, I’ll also compare the Ubuntu benchmarks results against fairly powerful Intel and AMD mini PCs I tested in Ubuntu: the GEEKOM GT1 Mega (Intel Core Ultra 9 185H), the GEEKOM A8 (AND Ryzen 9 8945HS), and the Khadas Mind 2 AI Maker Kit (Intel Core Ultra 7 258V).

Here’s a quick summary of the specifications of all four platforms:

UP Xtreme ARLGEEKOM GT1 MegaGEEKOM A8Khadas Mind 2 AI Maker Kit
SoCIntel Core Ultra 5 225HIntel Core Ultra 9 185HAMD Ryzen 9 8945HSIntel Core Ultra 7 285V
CPU14-core (4P+8E+2LPE) Arrow Lake processor up to 4.9 GHz (P-Cores), up to 4.3 GHz (E-Cores), up to 2.5 GHz (LPE-Cores)16-core/22-thread (6P+8E+2LP) Meteor Lake processor up to 5.1 GHz (P-cores), up to 3.8 GHz (E-cores), up to 2.5 GHz (LP-cores) 8-core/16-thread processor up to 5.2 GHz 8-core (4P+4E) Lunar Lake processor up to 4.8 GHz (P-Cores) and 3.7 GHz (E-Cores)
GPU7 Xe cores Intel Arc 130T GPU (63 TOPS)8 Xe cores Intel Arc GraphicsAMD Radeon 780M Graphics8 Xe cores Intel Arc 140V GPU
NPUIntel AI Boost (13 TOPS)Intel AI Boost (34 TOPS)Ryzen AI (16 TOPS)Intel AI Boost (47 TOPS)
Memory16GB LPDDR5-640032GB DDR5-560032GB DDR5-560032GB LPDDR5-8533
Storage256GB NVMe SSD2TB NVMe SSD2TB NVMe SSD1TB NVMe SSD
Tested OSUbuntu 24.04.3Ubuntu 24.10Ubuntu 24.04Ubuntu 24.10

Benchmark results comparison table:

UP Xtreme ARLGEEKOM GT1 MegaGEEKOM A8Khadas Mind 2 AI Maker Kit
sbc-bench.sh
- memcpy22,606.1 MB/s (P-core)21,364.6 MB/s (P-core)20,318.5 MB/s25,504.6 MB/s (P-Core)
- memset19,278.7 MB/s (P-core)36,928.3 MB/s (P-Core)62,156.7 MB/s65157.0 MB/s (P-Core)
- 7-zip (average)46,70067,96068,79019,980
31,480 (adjusted PL)
- 7-zip (top result)50,42271,62369,29722,093
32,768 (adjusted PL)
- OpenSSL AES-256 16K1,734,825.30k (P-Core)1,698,239.83k (P-Core)1,422,136.66k1,665,362.60k (P-Core)
Geekbench 6 Single2,8372,6052,6612,829
Geekbench 6 Multi13,20513,72813,2757,014
9,618 (adjusted PL)
Unigine Heaven score1,6081,9561,9721,057
1,698 (adjusted PL)
Speedometer 2.0 (Firefox)375
278298295
Speedometer 3.0 (Firefox)23.919.519.118.4

First, there’s an outlier in the memset dataset, as the bandwidth for the UP Xtreme ARL SBC looks rather low here, while the memcpy test is in the same range as the three other systems. The  Intel Core Ultra 5 225H shines in single-core benchmarks (OpenSSL, Geekbench 6.x single, and Speedometer), but struggles more with multi-core workloads. It can easily be explained since it comes with 14 cores, while the Intel Core Ultra 9 185H and AMD Ryzen 9 8945HS are 16-core or 16-thread processors with a higher PL1 value (45W vs 28W). In that regard, it’s much faster than the 30W 8-core Intel Core Ultra 7 285V found in the Khadas Mind 2 AI Maker kit. As a mid-range Core Ultra 5, the integrated GPU is also slightly weaker than all three other systems in Unigine Heaven 3D graphics benchmarks.

It would have been nice to compare the AI performance of all four systems since they all come with AI accelerators, but AI Linux benchmark tools were not quite ready when I tested the three older systems.

Features testing

I’ve also checked the key hardware features of the UP Xtreme ARL SBC:

  • HDMI 1 – Video OK, Audio OK
  • HDMI 2 – Video OK, Audio OK
  • DisplayPort – Video OK, No audio
  • Storage – NVMe SSD – OK: 3.45 GB/s sequential read speed, 1.78 GB/s sequential write speed
  • Ethernet
    • LAN1 – Gigabit Ethernet (top) – OK (iperf3 DL: 937 Mbps, UL: 930 Mbps, full-duplex:  929/925 Mbps)
    • LAN2 – 2.5 Gbps Ethernet – OK (iperf3 DL: 2.35 Gbps, UL: 2.34 Gbps, full-duplex: 2.35/2.33 Gbps)
  • USB ports tested with an ORICO NVMe SSD enclosure (EXT-4 partition), a USB HDD (for USB 2.0 port), an RF dongle for a wireless keyboard/mouse combo, and the USB camera part of the kit
    • USB 3.0 combo jack
      • Top – 10 Gbps; tested up to 992 MB/s with iozone3
      • Bottom – 10 Gbps; tested up to 1,001 MB/s with iozone3
    • USB 2.0 on HDMI combo jack – 480 Mbps; tested up to about 42 MB/s with iozone3
  • RTC – OK
  • GPIO – OK – Also check the 40-pin GPIO header layout

Here’s a photo of a triple display setup with the DP output connected to a KTC A32Q8 32-inch 4K monitor, and the HDMI outputs to a 10.1-inch Raspberry Pi All-in-One display (left) and a CrowView 14-inch portable monitor (right).

UP Xtreme ARL triple display setup

I have five sound outputs detected, including three “HDMI / DisplayPort – Built-in Audio” devices, but I only managed to get audio through HDMI, and not the DisplayPort.

UP Xtreme ARL Ubuntu 24.04 Sound Output

That means everything worked as expected for me, except for the DisplayPort Audio output, where the output was detected, but I didn’t receive any sound for whatever reason. Note this worked fine with the UP Squared Pro TWL using the same cable and monitor.

AI testing on the UP Xtreme ARL SBC

It’s now time to run the same AI workloads as I did on the UP TWL and UP Sqaured Pro TWL AI Dev Kits, namely Network Optix Nx Meta and the AAEON UP AI toolkit

Network Optix Nx Meta

Let’s install the Nx AI Certification Test:


It’s the same process as the other boards, except I selected OpenVino:


Once the installation is complete, we can run the script to test everything:


46 benchmarks completed, like on the UP TWL using the CPU/GPU:


We also had two stability tests that passed successfully:


You can check the full log for reference.

Let’s compare the UP Xtreme ARL results with a subset of the results we got on the UP TWL (Intel N150) and UP Squared Pro TWL (Intel N150 + Hailo-8L).

UP TWL
UP Squared Pro TWL
UP Ztreme ARL
Nx CPU
OpenVino
Hailo-8L
OpenVino
80-classes-object-detector[640x640]
3.91 FPS
3.73 FPS
38.04 FPS
26.72 FPS
80-classes-object-detector[320x320]
15.24 FPS
14.73 FPS
90.31 FPS
96.56 FPS
postprocessor-python-example
14.98 FPS
14.73 FPS
88.40 FPS
94.69 FPS
Postprocessor-python-image-example
15.73 FPS
15.17 FPS
90.55 FPS
97.14 FPS
postprocessor-c-image-example
14.77 FPS
14.75 FPS
83.18 FPS
99.24 FPS
postprocessor-c-example
14.75 FPS
14.76 FPS
89.15 FPS
102.14 FPS
Model-Yolov9-e-[640x640]
0.60 FPS
0.59 FPS
Failed
4.03 FPS
Model-Yolov9-e-converted-[640x640]
0.32 FPS
0.31 FPS
Failed
Failed
Model-Yolov4-[320x320]
15.25 FPS
14.68 FPS
Failed
98.54 FPS
Model-Mobilenet-V3
48.45 FPS
56.08 FPS
Failed
320.15 FPS
Total Benchmarks
46
46
6
46
Stability Tests
6/6
6/6
3/3
6/6

As expected, the UP Xtreme ARL delivers the highest performance in all tests, except the 80-classes-object-detector[640×640], where it performed object detection at 26.72 FPS and 38.04 FPS.  I was surprised that one test in the list above failed on the UP Xtreme ARL (OpenVino) but not on the UP Squared Pro TWL (OpenVino):


It looks like all OpenVino/Nx CPU tests fail a few benchmarks with a similar error, but still end up with 46 passed tests. So overall, the UP Xtreme ARL delivers the best performance and offers optimal compatibility compared to other platforms, but it’s obviously more expensive. The UP Squared Pro TWL with Hailo-8L offers the best price/performance ratio, but requires a bit more work as Network Optix didn’t implement most tests for the Hailo-8L accelerator. The UP TWL AI Dev Kit does not come with any AI accelerator at all, and everything runs on the CPU/GPU, so it will only be suitable for lightweight AI workloads or when a high inference rate is not needed.

AAEON UP AI toolkit demos

We are not done yet, as I still have to run the UP AI toolkit examples available on GitHub.

I followed the same steps to install and launch the AAEON UP AI toolkit as I did for the other boards:


Like in the previous reviews, I had an HTTP error once while running the prepare.sh script, but no issue with storage capacity since the 256GB is plenty enough. The script will launch the UP Edge AI Sizing Tool in Firefox.

UP Xtreme ARL Edge AI Sizing Tool

The Intel Arc Graphics (GPU) is detected, but I don’t see any NPU for the Intel AI Boost. Nevertheless, I could add Computer Vision demos (Object Detection) using the USB camera on the CPU or the GPU.

UP Edge AI Sizing Tool Add Demo Intel Arc Graphics iGPU

I got 13.73 FPS inference on the CPU…

UP Xtreme ARL UP Edge AI Sizing Tool CPU Object Detection

… and 29.99 FPS on the GPU, which did the job fine.

UP Xtreme ARL UP Edge AI Sizing Tool GPU Object Detection
Since the UP Xtreme ARL is a more powerful platform and ships with a 256 GB NVMe SSD, it’s easier to install more models without quickly running out of space, as I experienced on the 64GB eMMC flash found on UP TWL or UP Squared Pro TWL (unless we install an SSD on the latter).

Next up was a text generation model (Mistral-7B-Instruct) running on the GPU. I asked it about OpenVino, and it replied in 12.91s at 8.03 tokens/s.

UP Edge AI Sizing Tool Text Generation

I also added an “automatic speech recognition demo” running on the GPU. There’s a default audio file saying “How are you doing today?” and the transcription was completed in 0.40 seconds.

UP Xtreme ARL Automatic Speech Recognition
I decided to load another audio file – a short ~30s excerpt from a financial interview – and the transcription was done accurately in 1.70 seconds.

UP Edge AI Sizing Tool Automatic Speed Recognition English audio transcription

There’s also a Translate option. I found a 32-minute 54-second interview about philosophy in French, and asked the ASR demo to transcribe and translate it into English.

UP Edge AI Sizing Tool Automatic Speed Recognition English French transcription

It did so in 86.50s. Not all sentences make perfect sense, but in most parts, it’s close to the original meaning. As somebody who did some video/audio transcription and translation work in the past, the speed is scary. Transcribing one hour of audio typically takes 3 to 5 hours for a human, and translators can usually handle around 3,000 words per day. A thirty-second clip with an estimated 4,000 words would take 10 to 12 hours of work. That’s 400 to 500 times faster than a human. Human editing is likely taking longer on the AI-transcribed version, but is needed in either case.

Since last time around, I had trouble with the 64GB eMMC flash’s limited capacity, so I checked the space used on the SSD: 90GB. This includes the OS itself, benchmarking tools, AI tools, and models.

Power Consumption

I also measured the power consumption of the Arrow Lake AI development kit using a wall power meter:

  • Power off – 2.7 – 2.8 Watts
  • Idle – 15.4 – 16.8 Watts (fan active at all times)
  • Stress test (stress -c 14)
    • First ~30 seconds – 62.5- 64.0 Watts
    • Longer runs –  41.7 – 44.5 Watts
  • Object detection – Camera + GPU – 38.4 – 40.1 Watts

The board was connected to an HDMI monitor, 2.5GbE, a USB RF dongle for a wireless keyboard/mouse combo, and a USB camera. Idle power consumption looks to be on the high side; even if I disconnect all peripherals, leaving only the power cable, it drops to 13.2 – 14.4 Watts.

Conclusion

The AAEON UP Xtreme ARL AI Dev Kit features a powerful Arrow Lake SBC powered by an Intel Core 5 Ultra 225H SoC with a 69 TOPS Intel Arc 130T graphics delivering 69 TOPS and a 13 TOPS Intel AI Boost for a combined 83 TOPS of AI performance when also taking the CPU into account.

Most standard features I tested worked fine, and it’s a powerful and versatile AI platform with most AI tests running as expected. The only real issues I encountered are that DisplayPort audio did not work with my monitor (HDMI OK), and it looks like the Intel AI Boost is not used in the tools, or at least not explicitly.

Now that I have tested all three platforms from the UP AI Dev Kit ecosystem for close to 50 hours, I can leave final comments on all three:

  • UP TWL AI Dev Kit – It’s a low-power, entry-level Intel N150 platform with no AI accelerator. Most tests will work, but the CPU-only processing will limit practical applications. I found the 64GB eMMC flash to fill up quickly. Price: $279 with power supply and camera.
  • UP Squared Pro AI Dev Kit – Also based on Intel N150, but it ships with a 13 TOPS Hailo-8L M.2 AI accelerator that is almost as powerful as the UP Xtreme ARL kit. But fewer tests have been implemented in Nx Meta, and I found the SDK to be a pain to work with, having experienced frequent issues with version mismatches between Python, the Hailo drivers, and Hailo libraries.  It suffers from the same 64GB eMMC flash limitation as the UP TWL AI Dev Kit, but an M.2 socket allows the user to easily add NVMe storage if needed, although the heatsink must be removed from the Hailo-8L module for installation. It offers a good price/performance ratio, but most work may be needed on the Intel-only kits. Price: $469.00 with power supply, USB camera, and Hailo-8L AI accelerator.
  • UP Xtreme ARL Ai Dev Kit – It’s clearly the most powerful platform of the three, thanks to the Intel Core Ultra 5 225H SoC. Everything worked out of the box without headaches (like I had with the UP Squared Pro variant), and it can run more AI workloads. It’s the devkit of choice if the price is not an issue. Price: $899.00 with power supply and USB camera

I’d like to thank AAEON for sending the three Intel AI Dev Kits for review. You find all three on for purchased on the UP Shop

Share this:

Support CNX Software! Donate via cryptocurrencies, become a Patron on Patreon, or purchase goods on Amazon or Aliexpress. We also use affiliate links in articles to earn commissions if you make a purchase after clicking on those links.

Radxa Orion O6 Armv9 mini-ITX motherboard
Subscribe
Notify of
guest
The comment form collects your name, email and content to allow us keep track of the comments placed on the website. Please read and accept our website Terms and Privacy Policy to post a comment.
2 Comments
oldest
newest
Boardcon MINI1126B-P AI vision system-on-module wit Rockchip RV1126B-P SoC