H3 cedrus video acceleration, device tree problem?

schunckt · August 10, 2025

Hi all!

On NanoPi Duo2 I'm trying to use the builtin video hw processor.

ffmpeg already works with -hwaccel v4l2request but throws errors:

Press [q] to stop, [?] for help
[h264 @ 0x11645f0] Using V4L2 media driver cedrus (6.12.35) for S264
[V4L2RequestContext @ 0xae5f0db0] Failed to create buffer of type 1: Cannot allocate memory (12)
[h264 @ 0x11645f0] Failed setup for format drm_prime: hwaccel initialisation returned error.

dmesg

[ 8906.864389] cma: __cma_alloc: reserved: alloc failed, req-size: 3038 pages, ret: -12
[ 8906.872255] cma: number of available pages: 42@86+128@384+34@3550+34@6622+34@9694+34@12766+34@15838+34@18910+34@21982+1570@25054=> 1978 free of 26624 total pages
[ 8906.886783] cedrus 1c0e000.video-codec: dma alloc of size 12443648 failed
root@nanopiduo2:~#
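
The numbers in this dmesg output can be checked by hand. The following is a minimal sketch, assuming 4 KiB pages (the log does not state the page size): it converts the failed 12443648-byte request into pages and compares it with the largest contiguous free run in the CMA list.

```shell
#!/bin/sh
# Free CMA runs copied from the dmesg line above, in "pages@offset" form.
ranges='42@86+128@384+34@3550+34@6622+34@9694+34@12766+34@15838+34@18910+34@21982+1570@25054'
req_bytes=12443648   # from "dma alloc of size 12443648 failed"
page_size=4096       # assumption: 4 KiB pages

# Round the request up to whole pages.
req_pages=$(( (req_bytes + page_size - 1) / page_size ))

largest=0
total=0
for r in $(printf '%s\n' "$ranges" | tr '+' ' '); do
    n=${r%@*}                       # strip "@offset", keep the page count
    total=$(( total + n ))
    if [ "$n" -gt "$largest" ]; then largest=$n; fi
done

echo "requested=${req_pages} pages, largest free run=${largest}, total free=${total}"
```

Under that assumption the request is 3038 pages (matching "req-size: 3038 pages"), while the largest free run is 1570 pages and only 1978 pages are free in total, so the pool as configured cannot satisfy it regardless of fragmentation.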

I already played around with the cma value in extraargs in armbianEnv.txt, but without success.

I found a link talking about VPU device tree dma limitations https://git.sec.in.tum.de/croemheld/linux/-/blob/v5.1-rc5/Documentation/devicetree/bindings/media/cedrus.txt

Quote

Device-tree bindings for the VPU found in Allwinner SoCs, referred to as the Video Engine (VE) in Allwinner literature. The VPU can only access the first 256 MiB of DRAM, that are DMA-mapped starting from the DRAM base. This requires specific memory allocation and handling.

I already decompiled the DT and verified that there is no such "reserved-memory" section. Is this the root cause?

Maybe someone can provide some hints or ideas confirming that I'm on the right track? If yes I'd give it a try adjusting the DT.

T.

laibsch · August 10, 2025

are you sure this is not a genuine running-out-of-memory situation?

going · August 10, 2025

3 hours ago, schunckt said:

I found a link talking about VPU device tree dma limitations https://git.sec.in.tum.de/croemheld/linux/-/blob/v5.1-rc5/Documentation/devicetree/bindings/media/cedrus.txt

v5.1-rc5

3 hours ago, schunckt said:
[h264 @ 0x11645f0] Using V4L2 media driver cedrus (6.12.35)

v6.12.35

Please use the current documentation for the CURRENT kernel.

sun4i-a10-video-engine.yaml

sun8i-h3-deinterlace.yaml

Documentation/arch/arm/sunxi.rst

arch/arm/boot/dts/allwinner/sun8i-h3-nanopi-duo2.dts

arch/arm/boot/dts/allwinner/sun8i-h3.dtsi

P.S.

Please read this.

repository-for-v4l2request-hardware-video-decoding-rockchip-allwinner

Edited August 10, 2025 by going
Add P.S.

schunckt · August 11, 2025

Great, thanks for the links!

Meanwhile it works partially. The DT tweaking was not needed. I had made a mistake when specifying the Armbian extraargs: I added a second line to armbianEnv.txt, but all arguments must be on one line. I also had to increase the pool to cma=256M (yes, really; I tested all lower values). Then it works, BUT ...
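
A quick way to catch this kind of mistake is sketched below. It assumes the mistake was a second extraargs= line and that the file lives at /boot/armbianEnv.txt (the usual Armbian location, not stated in this thread); the function name count_extraargs is made up for illustration.

```shell
#!/bin/sh
# Count extraargs= lines in an armbianEnv-style file. More than one suggests
# the arguments were split across lines instead of being kept on one line.
count_extraargs() {
    grep -c '^extraargs=' "$1"
}

env_file=/boot/armbianEnv.txt   # assumption: default Armbian location
if [ -r "$env_file" ]; then
    n=$(count_extraargs "$env_file")
    if [ "$n" -gt 1 ]; then
        echo "warning: $n extraargs= lines found; merge them into one"
    fi
    # Show what the running kernel actually received after the last boot.
    tr ' ' '\n' < /proc/cmdline | grep '^cma=' || echo "cma= is not on the kernel command line"
fi
```

A single line such as extraargs=cma=256M followed by any further arguments, separated by spaces, is the form that worked here.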

Fun fact, will further investigate:

Decoding with "-hwaccel drm" results in lower fps (about 6..8) whereas software decoder goes up to 10 😀

Decoding has been verified with htop. CPU only => 4 cores at 100%. With hwaccel, one core at about 20%, which is likely the YUV-to-RGB conversion and scaling.

Maybe this is still an issue caused by DT, at least when reading https://gregdavill.com/posts/allwinner-s3-videoencoders/

It specifies

memory-region = <&cma_pool>;

which is missing in my decompiled DT, as is the referenced reserved-memory section.

Maybe that's not needed if it's coded inside the driver or specified elsewhere.

T.

robertoj · August 11, 2025

On 8/10/2025 at 7:34 AM, going said:

v6.12.35

Please use the current documentation for the CURRENT kernel.

If it doesn't work, compile your own Armbian with EDGE linux (what worked for me).

Stay away from Trixie at this time (its mpv doesn't work as well as Bookworm's)

going · August 11, 2025

4 minutes ago, robertoj said:

compile your own Armbian with EDGE linux

Which OS works well for you?

robertoj · August 11, 2025

1 minute ago, going said:

Which OS works well for you?

Self compiled Armbian Bookworm + XFCE with Linux Edge 6.15.x

Then follow all the instructions in https://forum.armbian.com/topic/32449-repository-for-v4l2request-hardware-video-decoding-rockchip-allwinner/#findComment-176981

Then add extraargs=cma=256M to armbianEnv.txt

Ryzer · August 11, 2025

Some fixes are kernel specific. If I understand correctly, the "memory-region" is only necessary when using the legacy cedar driver with a more recent kernel. It is supported up to kernel 6.1. You can confirm CMA allocation by running "sudo dmesg | grep CMA" or by running "cat /proc/meminfo | grep Cma"
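
The /proc/meminfo check can be wrapped so that it reports the pool size in MiB. This is only a sketch: the function name cma_report is made up here, and it assumes the kernel exposes the CmaTotal and CmaFree fields in kB, as kernels built with CMA do.

```shell
#!/bin/sh
# Print CmaTotal/CmaFree from a meminfo-style file in MiB.
# With no argument it reads /proc/meminfo.
cma_report() {
    awk '/^CmaTotal:/ {t=$2} /^CmaFree:/ {f=$2}
         END {
             if (t == "") { print "no CMA fields found"; exit }
             printf "CmaTotal=%d MiB CmaFree=%d MiB\n", t/1024, f/1024
         }' "${1:-/proc/meminfo}"
}

if [ -r /proc/meminfo ]; then
    cma_report
fi
```

After booting with cma=256M the total should read 256 MiB; a smaller value would suggest the kernel argument did not take effect.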

That's interesting, although cedrus only acts as the video decoding engine while the display engine is responsible for the actual rendering.

schunckt · August 12, 2025

@robertoj right, that worked for me as well

Quote

Then add extraargs=cma=256M to armbianEnv.txt

But the remaining issue is the slowness. Meanwhile I also tested ffmpeg unscaled and without RGB conversion, with output to /dev/null, but it is still slow.

Maybe there is some pre/postprocessing going on which could be tuned further. If I remember right, there are some v4l2* features which may impact the processing, but I'm not sure whether this was camera-capture related.

Another idea: it seems the VPU clock source is configurable (inside the DT); maybe that's not quite right?

T.

robertoj · August 12, 2025

Did you compile your armbian OS with linux edge 6.15.x, bookworm, xfce?

https://forum.armbian.com/topic/32449-repository-for-v4l2request-hardware-video-decoding-rockchip-allwinner/#findComment-216587

Edited August 12, 2025 by robertoj

schunckt · August 12, 2025

No, I did not compile this time. I used the downloaded image (need to double check exactly which one).

Before trying this path (have to update my build env first😀) I'd try to get a better understanding about the root cause of the slowness.

I think next I'll play around with mpv instead of ffmpeg.

I'd prefer ffmpeg for other reasons, but mpv is worth spending some time on.

T.

robertoj · August 12, 2025

My main theory is that linux 6.12 doesn't have the v4l2 improvements needed for hw acceleration, that you can only get with linux 6.13....

The link I posted explains that.

schunckt · August 13, 2025

Thanks for your feedback.

But as I wrote, it looks like hw accel works in general when checking the much lower CPU load vs. the fps.

CPU 4x100% => ~10fps

HW 1x20% => ~6fps

That's why I think the VPU really gets used, but not optimally.

By the way, I have not yet got mpv to work with the framebuffer.

T.

Ryzer · August 15, 2025

Those specific patches only apply to the H61X SOCs.

Very strange that hardware decoding is apparently slower. Out of interest, could you run something like glxgears to see what the reported screen refresh rate is?

It is not impossible that it is the VPU, but I suspect the dma-buf transfers are the more likely bottleneck. Could you provide a more detailed log by adding --log-file=test1.txt?

When working with the framebuffer, try drm-copy instead of drm.

robertoj · August 15, 2025

Can you try if any of the H3 images from libre-elec would get you video acceleration?

https://libreelec.tv/downloads/allwinner/

I once tried the Orange Pi PC image on my Orange Pi Zero LTS (H3) and it worked.

schunckt · August 18, 2025

@Ryzer

There is no "ffmpeg -hwaccel drm-copy" option. It looks like this is mpv only, which, as said, doesn't work with the framebuffer.

glxgears also can't work because I have no OpenGL. But in general that's a good hint. I'll create some small videos with different resolutions and compressions and measure the fps. Maybe that helps to nail down the issue.
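
The fps measurement across several test clips could be scripted roughly as follows. This is a sketch, not a tested benchmark: the helper name last_fps and the clip file names are placeholders, and it relies on ffmpeg printing its usual "frame= ... fps= ..." status line to stderr.

```shell
#!/bin/sh
# Extract the last "fps=" value from a saved ffmpeg stderr log.
# ffmpeg rewrites its status line using carriage returns, so those are
# turned into newlines before matching.
last_fps() {
    tr '\r' '\n' < "$1" | sed -n 's/.*fps= *\([0-9.][0-9.]*\).*/\1/p' | tail -n 1
}

# Usage sketch (clip names are placeholders):
#   for f in clip_480p.mp4 clip_720p.mp4; do
#       ffmpeg -hwaccel drm -i "$f" -f null - 2> "$f.log"
#       echo "$f: $(last_fps "$f.log") fps"
#   done
```

For a throughput comparison the -re option should be left out, because it makes ffmpeg read the input at its native frame rate and therefore caps the reported fps.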

btw. I've not yet tested VLC.

I need to figure out whether this is an ffmpeg issue or something else inside the kernel, i.e. v4l2request related ...

@robertoj Trying another image doesn't make much sense for me for several other reasons.

T.

Ryzer · August 18, 2025

@schunckt Yes, I should have clarified that drm-copy is an argument for mpv, which according to the guide mentioned above should allow the framebuffer to be accessed directly. It is worth noting that mpv makes use of ffmpeg under the hood. I tried ffplay once but did not have much luck with it. Last I checked, VLC is not supported other than via the legacy VAAPI. Please see: https://linux-sunxi.org/Sunxi-Cedrus

You can use the sample media from linaro: https://samplemedia.linaro.org/

Just to check are you using the ffmpeg-v4l2-request?

schunckt · August 29, 2025

Hi there!

I did some further research. It seems I was wrong in assuming the hw GPU/VPU is used. The reduced CPU load is more likely a result of the lower fps when using -hwaccel drm (aka v4l2request), with the v4l2 layer falling back to soft decoding.

I also tried mpv now, but just with null output (as it can't output to fbdev), and captured the debug trace like so

mpv Big_Buck_Bunny_720_10s_10MB.mp4  -vo=null --msg-level=all=trace
[vd] Codec list:
[vd]     h264 - H.264 / AVC / MPEG-4 AVC / MPEG-4 part 10
[vd]     h264_v4l2m2m (h264) - V4L2 mem2mem H.264 decoder wrapper
[vd] Opening decoder h264
[vd] Looking at hwdec h264-drm...
[vd] Could not create device.
[vd] No hardware decoding available for this codec.
[vd] Using software decoding.

no hw accel used!

I also captured a ffmpeg trace.

ffmpeg -v trace -re -hwaccel drm -i Big_Buck_Bunny_720_10s_10MB.mp4 -f null -

Maybe it is in fact a Duo2-only, device-tree-related issue?

Could someone try the same on any other H3 and provide the results?

If it really works with hwaccel and the original 30fps, I would like to have the trace logs, ideally the ffmpeg* one or both.

lsmod | grep ced
sunxi_cedrus           40960  0
v4l2_mem2mem           16384  1 sunxi_cedrus
videobuf2_dma_contig    16384  1 sunxi_cedrus
videobuf2_v4l2         16384  2 sunxi_cedrus,v4l2_mem2mem
videobuf2_common       45056  5 sunxi_cedrus,videobuf2_dma_contig,videobuf2_memops,v4l2_mem2mem,videobuf2_v4l2
videodev              188416  3 sunxi_cedrus,v4l2_mem2mem,videobuf2_v4l2
mc                     36864  5 sunxi_cedrus,videobuf2_common,videodev,v4l2_mem2mem,videobuf2_v4l2

*ffmpeg was installed from the repository below, as the compiled 8.x build throws errors with -hwaccel.

# Install the precompiled
# ffmpeg version 5.1.6-0+deb12u1 Copyright (c) 2000-2024 the FFmpeg developers
# built with gcc 12 (Debian 12.2.0-14)
# https://forum.armbian.com/topic/32449-repository-for-v4l2request-hardware-video-decoding-rockchip-allwinner/

# As root, no sudo
# APT REPOSITORY SETUP
wget http://apt.undo.it:7242/apt.undo.it.asc -O /etc/apt/trusted.gpg.d/apt.undo.it.asc
. /etc/os-release && echo "deb http://apt.undo.it:7242 $VERSION_CODENAME main" | sudo tee /etc/apt/sources.list.d/apt.undo.it.list
echo -e "Package: *\nPin: release o=apt.undo.it\nPin-Priority: 600" | sudo tee /etc/apt/preferences.d/apt-undo-it

# We only install ffmpeg. mpv doesn't work anyway with framebuffer
# INSTALL FFMPEG AND MPV PACKAGES
apt install ffmpeg

robertoj · August 29, 2025

You are still using an old Linux. You need Linux 6.13 or newer.

You need to build your own Armbian OS.

Also, don't forget the cma=256M kernel argument

schunckt · September 2, 2025

Hi and thanks!

Tested now but no luck. Compiled edge 6.15.4

Linux nanopiduo2 6.15.4-edge-sunxi #1 SMP Fri Jun 27 10:13:43 UTC 2025 armv7l GNU/Linux

now i get

Device creation failed: -14.
[h264 @ 0x11f9b40] No device available for decoder: device type drm needed for codec h264.
Stream mapping:
  Stream #0:0 -> #0:0 (h264 (native) -> wrapped_avframe (native))
Device setup failed for decoder on input stream #0:0 : Bad address
[AVIOContext @ 0x11ac030] Statistics: 498846 bytes read, 0 seeks

command as follows:

ffmpeg -v trace -re -hwaccel drm -i Big_Buck_Bunny_720_10s_10MB.mp4 -f null -

also tried fbdev output but same result.

ffmpeg version is still

ffmpeg -v

ffmpeg version 5.1.7-0+deb12u1 Copyright (c) 2000-2025 the FFmpeg developers
  built with gcc 12 (Debian 12.2.0-14+deb12u1)
  configuration: --prefix=/usr --extra-version=0+deb12u1 --toolchain=hardened --libdir=/usr/lib/arm-linux-gnueabihf --incdir=/usr/include/arm-linux-gnueabihf --arch=arm --enable-gpl --disable-stripping --enable-gnutls --enable-ladspa --enable-libaom --enable-libass --enable-libbluray --enable-libbs2b --enable-libcaca --enable-libcdio --enable-libcodec2 --enable-libdav1d --enable-libflite --enable-libfontconfig --enable-libfreetype --enable-libfribidi --enable-libglslang --enable-libgme --enable-libgsm --enable-libjack --enable-libmp3lame --enable-libmysofa --enable-libopenjpeg --enable-libopenmpt --enable-libopus --enable-libpulse --enable-librabbitmq --enable-librist --enable-librubberband --enable-libshine --enable-libsnappy --enable-libsoxr --enable-libspeex --enable-libsrt --enable-libssh --enable-libsvtav1 --enable-libtheora --enable-libtwolame --enable-libvidstab --enable-libvorbis --enable-libvpx --enable-libwebp --enable-libx265 --enable-libxml2 --enable-libxvid --enable-libzimg --enable-libzmq --enable-libzvbi --enable-lv2 --enable-omx --enable-openal --enable-opencl --enable-opengl --enable-sdl2 --disable-sndio --enable-libjxl --enable-pocketsphinx --enable-librsvg --enable-libdc1394 --enable-libdrm --enable-libiec61883 --enable-chromaprint --enable-frei0r --enable-libx264 --enable-libplacebo --enable-librav1e --enable-shared
  libavutil      57. 28.100 / 57. 28.100
  libavcodec     59. 37.100 / 59. 37.100
  libavformat    59. 27.100 / 59. 27.100
  libavdevice    59.  7.100 / 59.  7.100
  libavfilter     8. 44.100 /  8. 44.100
  libswscale      6.  7.100 /  6.  7.100
  libswresample   4.  7.100 /  4.  7.100
  libpostproc    56.  6.100 / 56.  6.100

Unfortunately there are countless forks of ffmpeg. I used the one from here https://forum.armbian.com/topic/32449-repository-for-v4l2request-hardware-video-decoding-rockchip-allwinner/

As of now I haven't found anything about a current kernel and ffmpeg working on any H3, only stuff back in 2019 and/or kernel 4.x 🙂

I also tried several extraargs=cma=### values up to 256M, but all give the same error.

I really would love to get some more tech background on "how stuff works", which may help me troubleshoot.

=> Can we enable debug output from the cedrus driver? How?

=> Can we enable debug output from the v4l2 layer? How?

=> Which libraries are involved, e.g. v4l2request, other v4l2* which may need additional install/update?

=> Do I need v4l2-util / v4l2-ctl to configure some stuff? If yes, what exactly? (Remember, it's currently for file playback. I know about it for cam capture.)

=> Could the v4l2loopback help?

=> Can anybody confirm it really works on H3 maybe on an OrangePi board?

If yes I can pull the Opi device tree and compare for possible differences.

T.

robertoj · September 2, 2025

Share your displaying configuration by running "neofetch" and post it here.

If your X11 is running on top of the framebuffer instead of DRM, it won't work.

I also would like to get more debug from cedrus, linux's v4l2... but at least you can add -v to mpv to get more debug

You only need the ffmpeg plugins offered by the original poster of the v4l2-request thread

This is tested with mpv player only.

v4l2-util and -ctl are only useful for webcams and video capture devices.

v4l2loopback is not involved here. I don't have it

I have an Orange Pi Zero LTS, but I haven't tested it there (I would use an SPI LCD, since it doesn't have an HDMI port)

Edited September 2, 2025 by robertoj

schunckt · September 3, 2025

Hi again!

Quick update, added after I wrote the part below. I found something very promising here

https://codesandbox.io/p/github/NathanJohnNJ/BananaPi-Camcorder/master

This is a different approach. Behind the scenes the cedar_ve driver gets used which also supports encoding accel.

I think I'll give it a try but that may take some time ...

(btw. this is where some confusion exists, at several places - there are two drivers: cedrus aka. sunxi-cedrus and cedar_ve)

--------------------------------------------------------------------------------

I'm not using X11; I am using the framebuffer directly. But I'm quite sure the output is not the root cause, as output to null also doesn't work.

So for testing there is no need for an SPI LCD.

ffmpeg -hwaccel drm -i Big_Buck_Bunny_720_10s_10MB.mp4 -f null -

@robertoj Maybe you can test this above and watch the framerate.

if that achieves the 30fps run again with trace

ffmpeg -v trace -hwaccel drm -i Big_Buck_Bunny_720_10s_10MB.mp4 -f null -

and provide me the console output?

also tested again mpv just with the input file and debug output (but this time not the specific v4l request thread)

mpv  Big_Buck_Bunny_720_10s_10MB.mp4 --msg-level=vd=v,vo=v,vo/gpu/vaapi-egl=trace
...
...
[vd] No hardware decoding requested.
[vd] Using software decoding.
[vd] Detected 4 logical cores.
[vd] Requesting 5 threads for decoding.
[vd] Selected codec: h264 (H.264 / AVC / MPEG-4 AVC / MPEG-4 part 10)
[vd] DR failed - disabling.
[vd] Using software decoding.
[vd] Decoder format: 1280x720 yuv420p auto/auto/auto/auto/auto CL=mpeg2/4/h264

Of course I tested different parameters, but mpv always falls back to soft.

 mpv  Big_Buck_Bunny_720_10s_10MB.mp4 --msg-level=vd=v,vo=v,vo/gpu/vaapi-egl=trace --no-config --hwdec=yes


[vd] Trying hardware decoding via h264_v4l2m2m-v4l2m2m-copy.
[vd] Using underlying hw-decoder 'h264_v4l2m2m'
[ffmpeg/video] h264_v4l2m2m: Could not find a valid device
[ffmpeg/video] h264_v4l2m2m: can't configure decoder
Could not open codec.

Edited September 3, 2025 by schunckt

Ryzer · September 3, 2025

9 hours ago, schunckt said:

Quick update after i wrote that below. I found something very promising here

https://codesandbox.io/p/github/NathanJohnNJ/BananaPi-Camcorder/master

This is a different approach. Behind the scenes the cedar_ve driver gets used which also supports encoding accel.

This is built on an old kernel 5.10 and would probably need a fair bit of work to get up to date. Last repo I came across with support for the old cedar driver was kernel 6.1 but I have not found anything more recent than this. Have you not created the configuration file as mentioned:

It still looks to me like the packaged version of ffmpeg is being installed rather than from the custom repo. Notice that there is no option listed for v4l2-request. I had problems connecting and had to temporarily switch from my wifi to mobile in order to install the packages.
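
The check described here, looking for a v4l2-request option in the build configuration, can be sketched as a small script. The flag name --enable-v4l2-request is taken from the configuration line of the custom build posted later in this thread; the function name has_v4l2_request is made up for illustration.

```shell
#!/bin/sh
# Succeed if the given "ffmpeg -version" text shows a v4l2-request build.
has_v4l2_request() {
    printf '%s\n' "$1" | grep -q -e '--enable-v4l2-request'
}

if command -v ffmpeg >/dev/null 2>&1; then
    if has_v4l2_request "$(ffmpeg -version 2>/dev/null)"; then
        echo "ffmpeg: built with --enable-v4l2-request"
    else
        echo "ffmpeg: no --enable-v4l2-request in the configuration (likely the stock package)"
    fi
fi
```

The stock Debian configuration quoted earlier in the thread contains --enable-libdrm but no v4l2-request option, which is what this check would flag.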

laibsch · September 3, 2025

1 hour ago, Ryzer said:

It still looks to me like the packaged version of ffmpeg is being installed rather than from the custom repo.

"apt policy ffmpeg" FTW ;-) apt also has a command-line option to force the installation of a particular package version. To automate this, you can put a file under /etc/apt/preferences.d/

robertoj · September 3, 2025

3 hours ago, Ryzer said:

It still looks to me like the packaged version of ffmpeg is being installed rather than from the custom repo

That will definitely make you not get hardware accelerated video

Make sure to follow the 3 commands to install the custom repo, its certificate and higher priority:

sudo wget http://apt.undo.it:7242/apt.undo.it.asc -O /etc/apt/trusted.gpg.d/apt.undo.it.asc

. /etc/os-release && echo "deb http://apt.undo.it:7242 $VERSION_CODENAME main" | sudo tee /etc/apt/sources.list.d/apt.undo.it.list

echo -e "Package: *\nPin: release o=apt.undo.it\nPin-Priority: 600" | sudo tee /etc/apt/preferences.d/apt-undo-it

sudo apt update

Then check that the new custom dpkgs are available for install/upgrade

apt list ffmpeg-whatever

Here's how I force install a specific version:

sudo apt install packagename=nn.nn.n
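
To confirm which ffmpeg package actually ended up installed, the "apt policy ffmpeg" output mentioned above can be parsed. This is a sketch only: the helper name policy_field is made up, and the check for a "v4l2request" suffix assumes the custom packages keep the version naming seen elsewhere in this thread.

```shell
#!/bin/sh
# Read "apt-cache policy <pkg>" output on stdin and print one field,
# for example Installed or Candidate.
policy_field() {
    sed -n "s/^ *$1: *//p" | head -n 1
}

if command -v apt-cache >/dev/null 2>&1; then
    inst=$(apt-cache policy ffmpeg 2>/dev/null | policy_field Installed)
    case "$inst" in
        *v4l2request*) echo "installed ffmpeg is the custom build: $inst" ;;
        *)             echo "installed ffmpeg is not the custom build: ${inst:-none}" ;;
    esac
fi
```

If the installed version lacks the suffix while the custom repository is reachable, the pin file or a forced install of the specific version, as described above, is the next thing to check.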

schunckt · September 7, 2025

Hi!

On 9/3/2025 at 6:51 PM, Ryzer said:

Have you not created the configuration file as mentioned:

I already tried this without luck. Based on the version ffmpeg showed, it installed the right one.

I also uninstalled the previous ffmpeg and verified it's really gone.

I still suspect there is something wrong specific to the Duo2.

On 9/3/2025 at 6:51 PM, Ryzer said:

It still looks to me like the packaged version

Could you please take a look at what exact version your ffmpeg shows?

T.

Ryzer · September 7, 2025

Firstly, can you actually connect to the repo without issues? In my case I was getting timeouts; I suspect it was being blocked by my ISP for whatever reason. As a workaround I had to tether my SBC to my phone for internet via mobile data and install the packages that way.

This is my current ffmpeg:

ffmpeg -version
ffmpeg version 5.1.4-0+deb12u1.v4l2request Copyright (c) 2000-2023 the FFmpeg developers
built with gcc 12 (Debian 12.2.0-14)
configuration: --prefix=/usr --extra-version=0+deb12u1.v4l2request --toolchain=hardened --libdir=/usr/lib/arm-linux-gnueabihf --incdir=/usr/include/arm-linux-gnueabihf --arch=arm --enable-gpl --disable-stripping --enable-gnutls --enable-ladspa --enable-libaom --enable-libass --enable-libbluray --enable-libbs2b --enable-libcaca --enable-libcdio --enable-libcodec2 --enable-libdav1d --enable-libflite --enable-libfontconfig --enable-libfreetype --enable-libfribidi --enable-libglslang --enable-libgme --enable-libgsm --enable-libjack --enable-libmp3lame --enable-libmysofa --enable-libopenjpeg --enable-libopenmpt --enable-libopus --enable-libpulse --enable-librabbitmq --enable-librist --enable-librubberband --enable-libshine --enable-libsnappy --enable-libsoxr --enable-libspeex --enable-libsrt --enable-libssh --enable-libsvtav1 --enable-libtheora --enable-libtwolame --enable-libv4l2 --enable-v4l2-request --enable-v4l2-m2m --enable-libvidstab --enable-libvorbis --enable-libvpx --enable-libwebp --enable-libudev --enable-libx265 --enable-libxml2 --enable-libxvid --enable-libzimg --enable-libzmq --enable-libzvbi --enable-lv2 --enable-omx --enable-openal --enable-opencl --enable-opengl --enable-sdl2 --disable-sndio --enable-libjxl --enable-pocketsphinx --enable-librsvg --enable-libdc1394 --enable-libdrm --enable-libiec61883 --enable-chromaprint --enable-frei0r --enable-libx264 --enable-libplacebo --enable-librav1e --enable-share

Admittedly not the most up to date, as it has been a while since I last attempted this. From what I can tell from the latest pages, using drm* has more recently been replaced with v4l2request.

I doubt it is specifically related to the Duo2; more likely it is at the SoC dtsi definitions level. I encountered many of the same problems you did earlier, but it is the older A10/A20 VE that is limited to physically accessing only the first 256 MB of memory. The H3 does not have the same constraint.

@robertoj Are you using a more modern setup? What is your currently reported ffmpeg version?

robertoj · September 8, 2025

I have the same ffmpeg as you: 5.1.4-xxx-v4l2request, installed over Debian Bookworm, with Linux 6.15 (OS built by myself).

When I tried the ffmpeg-7.x.x-v4l2request for Trixie, I could not get hardware acceleration.
