Johnny on the couch Posted Wednesday at 06:41 PM

Kernel crash on a freshly installed image for the LonganPi 3H. Image downloaded from: https://www.armbian.com/longanpi-3h/ (Ubuntu 24.04 Noble). Sometimes it crashes a few seconds after boot, before logging in or doing anything. I didn't install or change anything; I only ran apt-get update and upgrade.

logs: https://paste.next.armbian.com/ocofepenup

[ 1511.118664] Internal error: Oops: 0000000096000044 [#1] SMP
[ 1511.124251] Modules linked in: sunrpc rtl8xxxu mac80211 cfg80211 sunxi_cedrus(C) rfkill libarc4 v4l2_mem2mem videobuf2_dma_contig videobuf2_memops videobuf2_v4l2 videodev videobuf2_common polyval_ce sun50i_h6_prcm_ppu mc polyval_generic dump_reg cpufreq_dt zram zsmalloc binfmt_misc sch_fq_codel fuse dm_mod realtek sun6i_rtc_ccu dwmac_sun8i mdio_mux
[ 1511.155156] CPU: 1 UID: 0 PID: 2658 Comm: sshd Tainted: G WC 6.12.23-current-sunxi64 #1
[ 1511.164459] Tainted: [W]=WARN, [C]=CRAP
[ 1511.168292] Hardware name: Sipeed Longan Pi 3H (DT)
[ 1511.173166] pstate: 600000c5 (nZCv daIF -PAN -UAO -TCO -DIT -SSBS BTYPE=--)
[ 1511.180124] pc : enqueue_timer+0x3c/0x150
[ 1511.184141] lr : __mod_timer+0x30c/0x370
[ 1511.188063] sp : ffff800085a13920
[ 1511.191376] x29: ffff800085a13920 x28: 00000000000005a8 x27: 0000000000000000
[ 1511.198514] x26: ffff8000821f6000 x25: 0000000000000001 x24: 0000000100049ec4
[ 1511.205652] x23: 000000000000000c x22: ffff0000ff760000 x21: 0000000100049ecc
[ 1511.212789] x20: ffff0000ff760000 x19: 0000000100049ec4 x18: 0000000000000042
[ 1511.219926] x17: 85a0e7cbaf550a08 x16: 01010000d9864102 x15: 1880ada177e5cf73
[ 1511.227064] x14: 54c33ab316000103 x13: 6a5d85a0e7cbaf55 x12: 0a0801010000d986
[ 1511.234201] x11: 41021880ada177e5 x10: cf7354c33ab31600 x9 : ffff800080125244
[ 1511.241341] x8 : ffff800080b33870 x7 : ffff0000ff7600d0 x6 : ffff800082201980
[ 1511.248479] x5 : 000000000000000c x4 : 000000000000000c x3 : 0000000100049ecc
[ 1511.255617] x2 : ffff0000ff760070 x1 : 0000000000000000 x0 : ffff0000ff760000
[ 1511.262756] Call trace:
[ 1511.265204] enqueue_timer+0x3c/0x150
[ 1511.268868] __mod_timer+0x30c/0x370
[ 1511.272444] mod_timer+0x1c/0x30
[ 1511.275673] sk_reset_timer+0x28/0x98
[ 1511.279339] tcp_schedule_loss_probe.part.0+0x13c/0x270
[ 1511.284567] tcp_write_xmit+0x320/0x1390
[ 1511.288492] __tcp_push_pending_frames+0x44/0x108
[ 1511.293197] tcp_push+0xbc/0x168
[ 1511.296431] tcp_sendmsg_locked+0xa14/0xc38
[ 1511.300616] tcp_sendmsg+0x40/0x70
[ 1511.304021] inet6_sendmsg+0x4c/0x78
[ 1511.307599] __sock_sendmsg+0x64/0xc0
[ 1511.311265] sock_write_iter+0xa8/0x118
[ 1511.315102] vfs_write+0x334/0x3b8
[ 1511.318508] ksys_write+0xf8/0x120
[ 1511.321911] __arm64_sys_write+0x24/0x38
[ 1511.325835] invoke_syscall+0x50/0x120
[ 1511.329590] el0_svc_common.constprop.0+0x48/0xf0
[ 1511.334296] do_el0_svc+0x24/0x38
[ 1511.337615] el0_svc+0x30/0xd0
[ 1511.340675] el0t_64_sync_handler+0x120/0x130
[ 1511.345034] el0t_64_sync+0x190/0x198
[ 1511.348703] Code: a9025bf5 aa0003f6 aa0303f5 f8657841 (f9000261)
[ 1511.354794] ---[ end trace 0000000000000000 ]---
[ 1511.359410] note: sshd[2658] exited with irqs disabled
[ 1571.130516] rcu: INFO: rcu_sched detected stalls on CPUs/tasks:
[ 1571.136471] rcu: 1-...0: (7 GPs behind) idle=ad6c/1/0x4000000000000000 softirq=10876/10876 fqs=6851
[ 1571.145603] rcu: (detected by 2, t=15004 jiffies, g=19153, q=1042 ncpus=4)
[ 1571.152564] Sending NMI from CPU 2 to CPUs 1:
[ 1581.152989] rcu: rcu_sched kthread starved for 555 jiffies! g19153 f0x0 RCU_GP_DOING_FQS(6) ->state=0x0 ->cpu=0
[ 1581.167420] rcu: Unless rcu_sched kthread gets sufficient CPU time, OOM is now expected behavior.
[ 1581.176371] rcu: RCU grace-period kthread stack dump:
[ 1581.181418] task:rcu_sched state:R running task stack:0 pid:17 tgid:17 ppid:2 flags:0x00000008
[ 1581.192290] Call trace:
[ 1581.194738] __switch_to+0xe8/0x150
[ 1581.198235] 0x0
[ 1581.200080] rcu: Stack dump where RCU GP kthread last ran:
[ 1581.205561] Sending NMI from CPU 2 to CPUs 0:

laibsch Posted Thursday at 10:57 AM

Looks like a really cool machine. Unfortunately, it has no maintainer in Armbian, and I am not aware of any of the regular developers having that SBC. It seems to be based on the H618, which it shares with the Orange Pi Zero3, and that board is reported working with the latest Armbian release. Did this board ever work with an older release? Is this a regression? I see somebody else mentioning that they have issues with the board booting, apparently due to issues with thermal management. And your crash apparently started in pstate.

Johnny on the couch (Author) Posted yesterday at 09:19 AM

Hi, I am not aware of an older Longan image. I used the board for a few months with the OrangePi Zero3 image without kernel panics. I've collected a few more kernel crashes on the same image (latest Ubuntu Noble, freshly installed on SD card):

During apt update over USB WiFi, with LAN still connected and DKMS aic8800 built, on a freshly reinstalled image:

20% [7 Packages store 0 B] [8 Contents-arm64 1,210 kB/85.3 MB 1%]
[ 577.526353] Unable to handle kernel NULL pointer dereference at virtual address 000000000000000d
Mem abort info:
ESR = 0x0000000096000004
EC = 0x25: DABT (current EL), IL = 32 bits
SET = 0, FnV = 0
EA = 0, S1PTW = 0
FSC = 0x04: level 0 translation fault
Data abort info:
ISV = 0, ISS = 0x00000004, ISS2 = 0x00000000
CM = 0, WnR = 0, TnD = 0, TagAccess = 0
GCS = 0, Overlay = 0, DirtyBit = 0, Xs = 0
user pgtable: 4k pages, 48-bit VAs, pgdp=000000011aeef000
[000000000000000d] pgd=0000000000000000, p4d=0000000000000000
Internal error: Oops: 0000000096000004 [#1] SMP
Modules linked in: sunrpc rtl8xxxu mac80211 cfg80211 rfkill libarc4 polyval_ce sunxi_cedrus(C) v4l2_mem2mem sun50i_h6_prcm_ppu polyval_generic videobuf2_dma_contig videobuf2_memops videobuf2_v4l2 dump_reg videodev videobuf2_common mc cpufreq_dt zram zsmalloc binfmt_misc sch_fq_codel fuse dm_mod realtek sun6i_rtc_ccu dwmac_sun8i mdio_mux
CPU: 2 UID: 42 PID: 4102 Comm: store Tainted: G C 6.12.23-current-sunxi64 #1
Tainted: [C]=CRAP
Hardware name: Sipeed Longan Pi 3H (DT)
pstate: 80000005 (Nzcv daif -PAN -UAO -TCO -DIT -SSBS BTYPE=--)
pc : ext4_rename2+0x58/0x120
lr : vfs_rename+0x484/0xa60
sp : ffff800086fd3bb0
x29: ffff800086fd3bb0 x28: 0000000000000000 x27: ffff800086fd3d38
x26: 0000000000000000 x25: ffff0000f83a5bc8 x24: ffff8000822e52b8
x23: ffff0000f902b540 x22: ffff0000c070f240 x21: 0000000000000001
x20: 0000000000000000 x19: ffff0000e091f270 x18: 0000000000000000
x17: 0000000000000000 x16: 0000000000000000 x15: 0000000000000000
x14: 0000000000000000 x13: ffff0000c2294033 x12: ffff800086fd3bc4
x11: 00000007e2a04d4a x10: 0000000000000002 x9 : ffff80008038de34
x8 : ffff800086fd3c58 x7 : 0000000000000001 x6 : ffff8000804aba28
x5 : 0000000000000000 x4 : ffff0000f902b540 x3 : 0000000000000001
x2 : ffff0000c070f240 x1 : ffff0000e091f270 x0 : 0000000000000000
Call trace:
ext4_rename2+0x58/0x120
vfs_rename+0x484/0xa60
do_renameat2+0x4a4/0x518
__arm64_sys_renameat+0x58/0x78
invoke_syscall+0x50/0x120
el0_svc_common.constprop.0+0x48/0xf0
do_el0_svc+0x24/0x38
el0_svc+0x30/0xd0
el0t_64_sync_handler+0x120/0x130
el0t_64_sync+0x190/0x198
Code: b9400c20 aa0303f5 aa0403f7 37700220 (b9400c60)
---[ end trace 0000000000000000 ]---

While doing apt install zfs-dkms:

Unable to handle kernel paging request at virtual address fffffdfec1630210
Mem abort info:
ESR = 0x0000000096000045
EC = 0x25: DABT (current EL), IL = 32 bits
SET = 0, FnV = 0
EA = 0, S1PTW = 0
FSC = 0x05: level 1 translation fault
Data abort info:
ISV = 0, ISS = 0x00000045, ISS2 = 0x00000000
CM = 0, WnR = 1, TnD = 0, TagAccess = 0
GCS = 0, Overlay = 0, DirtyBit = 0, Xs = 0
swapper pgtable: 4k pages, 48-bit VAs, pgdp=0000000041db4000
[fffffdfec1630210] pgd=100000013f7bd003, p4d=100000013f7bd003, pud=0000000000000000
Internal error: Oops: 0000000096000045 [#1] SMP
Modules linked in: sunrpc rtl8xxxu mac80211 cfg80211 rfkill libarc4 sunxi_cedrus(C) v4l2_mem2mem polyval_ce polyval_generic videobuf2_dma_contig videobuf2_memops sun50i_h6_prcm_ppu videobuf2_v4l2 videodev videobuf2_common mc dump_reg cpufreq_dt zram zsmalloc binfmt_misc sch_fq_codel fuse dm_mod realtek sun6i_rtc_ccu dwmac_sun8i mdio_mux
CPU: 2 UID: 0 PID: 47314 Comm: cc1 Tainted: G C 6.12.23-current-sunxi64 #1
Tainted: [C]=CRAP
Hardware name: Sipeed Longan Pi 3H (DT)
pstate: 200000c5 (nzCv daIF -PAN -UAO -TCO -DIT -SSBS BTYPE=--)
pc : __rmqueue_pcplist+0x33c/0xb58
lr : __rmqueue_pcplist+0x444/0xb58
sp : ffff80008f73b890
x29: ffff80008f73b910 x28: 0000000000000010 x27: fffffdffc15f1c00
x26: 0000000000000110 x25: 0000000000000001 x24: 0000000000000000
x23: ffff0000ff7cfd00 x22: 000000000000037d x21: ffff0000ff794ff0
x20: dead000000000040 x19: 0000000000000001 x18: ffffffffffffffff
x17: 00000000f0000000 x16: 0000000000000010 x15: 0000000000000068
x14: 0000000000000100 x13: 1fffe00018aa1fa1 x12: 0000000000000001
x11: fffffdffc15f1c00 x10: fffffdfec1630208 x9 : ffff800080307834
x8 : fffffdffc122e1c8 x7 : ffff80007da5e000 x6 : ffff80008f73b890
x5 : ffff0000ff7cff48 x4 : 0000000000000003 x3 : 0000000000000003
x2 : fffffdffc15f1c08 x1 : 0000000000000003 x0 : 0000000000000008
Call trace:
__rmqueue_pcplist+0x33c/0xb58
get_page_from_freelist+0x67c/0x1698
__alloc_pages_noprof+0x15c/0xe00
alloc_pages_mpol_noprof+0x98/0x218
folio_alloc_mpol_noprof+0x1c/0x70
vma_alloc_folio_noprof+0x88/0xe8
vma_alloc_zeroed_movable_folio+0x40/0x58
__handle_mm_fault+0xcb0/0x1208
handle_mm_fault+0x134/0x258
do_page_fault+0x13c/0x520
do_translation_fault+0xa4/0xc0
do_mem_abort+0x4c/0xa8
el0_da+0x2c/0xa0
el0t_64_sync_handler+0xb4/0x130
el0t_64_sync+0x190/0x198
Code: 2a0103e3 1ac12000 d503201f a940144a (f9000545)
---[ end trace 0000000000000000 ]---
note: cc1[47314] exited with irqs disabled

panic3:

[ 8838.754724] Unable to handle kernel NULL pointer dereference at virtual address 0000000000000030
[ 8838.763594] Mem abort info:
[ 8838.766400] ESR = 0x0000000096000004
[ 8838.770155] EC = 0x25: DABT (current EL), IL = 32 bits
[ 8838.775472] SET = 0, FnV = 0
[ 8838.778532] EA = 0, S1PTW = 0
[ 8838.781678] FSC = 0x04: level 0 translation fault
[ 8838.786559] Data abort info:
[ 8838.789445] ISV = 0, ISS = 0x00000004, ISS2 = 0x00000000
[ 8838.794932] CM = 0, WnR = 0, TnD = 0, TagAccess = 0
[ 8838.799987] GCS = 0, Overlay = 0, DirtyBit = 0, Xs = 0
[ 8838.805303] user pgtable: 4k pages, 48-bit VAs, pgdp=000000010387c000
[ 8838.811747] [0000000000000030] pgd=0000000000000000, p4d=0000000000000000
[ 8838.818567] Internal error: Oops: 0000000096000004 [#1] SMP
[ 8838.824143] Modules linked in: sunrpc rtl8xxxu mac80211 cfg80211 sunxi_cedrus(C) rfkill libarc4 v4l2_mem2mem videobuf2_dma_contig videobuf2_memops polyval_ce videobuf2_v4l2 sun50i_h6_prcm_ppu videodev polyval_generic videobuf2_common mc dump_reg cpufreq_dt zram zsmalloc binfmt_misc sch_fq_codel fuse dm_mod sun6i_rtc_ccu realtek dwmac_sun8i mdio_mux
[ 8838.855106] CPU: 0 UID: 0 PID: 0 Comm: swapper/0 Tainted: G WC 6.12.23-current-sunxi64 #1
[ 8838.864588] Tainted: [W]=WARN, [C]=CRAP
[ 8838.868422] Hardware name: Sipeed Longan Pi 3H (DT)
[ 8838.873297] pstate: 80000005 (Nzcv daif -PAN -UAO -TCO -DIT -SSBS BTYPE=--)
[ 8838.880260] pc : tcp_schedule_loss_probe.part.0+0x20/0x270
[ 8838.885759] lr : tcp_write_xmit+0x320/0x1390
[ 8838.890034] sp : ffff800080003d60
[ 8838.893348] x29: ffff800080003d80 x28: 00000000000005a8 x27: 0000000000000000
[ 8838.900493] x26: 00000000000005a8 x25: 0000000000000001 x24: ffff0000c2828b90
[ 8838.907638] x23: 000000000000002d x22: ffff0000c2828a00 x21: ffff0000c149ba28
[ 8838.914784] x20: ffff8000821f6000 x19: 0000000000000000 x18: 0000000000000042
[ 8838.921929] x17: 9989833584a40a08 x16: 010100002988f401 x15: 188078bc986c43d2
[ 8838.929074] x14: b302168c16000103 x13: 96629989833584a4 x12: 0a08010100002988
[ 8838.936219] x11: f401188078bc986c x10: 43d2b302168c1600 x9 : ffff800080f479b8
[ 8838.943363] x8 : ffff800080b33870 x7 : ffff0000c2828b88 x6 : ffff800082201980
[ 8838.950508] x5 : 0000000002b3d1bb x4 : 0000000002b3d3df x3 : 000000000000000a
[ 8838.957652] x2 : 00000000002092dc x1 : 0000000000010020 x0 : 0000000000000000
[ 8838.964797] Call trace:
[ 8838.967245] tcp_schedule_loss_probe.part.0+0x20/0x270
[ 8838.972388] tcp_write_xmit+0x320/0x1390
[ 8838.976316] tcp_tsq_write.part.0+0x70/0xc8
[ 8838.980506] tcp_tsq_handler+0x100/0x148
[ 8838.984433] tcp_tasklet_func+0xcc/0x140
[ 8838.988362] tasklet_action_common+0xf0/0x288
[ 8838.992724] tasklet_action+0x38/0x50
[ 8838.996391] handle_softirqs+0x11c/0x2f8
[ 8839.000318] __do_softirq+0x1c/0x28
[ 8839.003813] ____do_softirq+0x18/0x30
[ 8839.007482] call_on_irq_stack+0x24/0x58
[ 8839.011411] do_softirq_own_stack+0x24/0x38
[ 8839.015600] irq_exit_rcu+0x88/0xc8
[ 8839.019091] el1_interrupt+0x38/0x58
[ 8839.022678] el1h_64_irq_handler+0x18/0x28
[ 8839.026784] el1h_64_irq+0x64/0x68
[ 8839.030190] default_idle_call+0x38/0xd8
[ 8839.034118] do_idle+0x204/0x268
[ 8839.037358] cpu_startup_entry+0x3c/0x50
[ 8839.041291] rest_init+0xc4/0xd0
[ 8839.044524] start_kernel+0x52c/0x7a8
[ 8839.048193] __primary_switched+0x80/0x90
[ 8839.052221] Code: a9027bfd 910083fd a90353f3 aa0003f3 (f9401800)
[ 8839.058311] ---[ end trace 0000000000000000 ]---
[ 8839.062929] Kernel panic - not syncing: Oops: Fatal exception in interrupt
[ 8839.069801] SMP: stopping secondary CPUs
[ 8839.073731] Kernel Offset: disabled
[ 8839.077219] CPU features: 0x00,00000080,00200000,0200420b
[ 8839.082610] Memory Limit: none
[ 8839.085669] ---[ end Kernel panic - not syncing: Oops: Fatal exception in interrupt ]---

going Posted yesterday at 12:15 PM

2 hours ago, Johnny on the couch said:
I've collected a few more kernel crashes on the same image (latest Ubuntu Noble freshly installed on SD card):

When you start the device on a new operating system image, does the kernel panic appear? After what action does the kernel panic?

Johnny on the couch (Author) Posted yesterday at 12:34 PM

Sometimes the panic occurs right after a reboot, sometimes after a few hours of uptime. Panic 1 from the previous post happened while doing apt update, the 2nd while compiling zfs-dkms, and the 3rd I don't remember. The same Noble image works on eMMC without panics, for now. I'll try to reproduce it with another SD card.

going Posted yesterday at 01:05 PM

3 hours ago, Johnny on the couch said:
panic3:

All the kernel panics that you have shown here occur on v6.12.23, and they all happen randomly. I suspect that this may be due to broken (faulty) blocks on the storage device. This is easy to check if you connect the SD card via an adapter to a Linux computer:

sudo fsck.ext4 /dev/sdX1
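
The check suggested above can be sketched as a small script. This is only a sketch under stated assumptions: `/dev/sdX` is the placeholder from the post and must be replaced with the real device node (a USB adapter usually shows up as `sdX`; a built-in reader that exposes `mmcblk0` names its partitions differently), the helper names `run` and `check_card` are hypothetical, and the `badblocks` surface scan is an addition that was not part of the original suggestion. By default the script only prints the commands, so nothing touches the card until `RUN=1` is set.

```shell
#!/bin/sh
# Sketch of a read-only SD card check, run from a Linux PC with the card in an adapter.
# Dry run by default: the commands are only printed. Set RUN=1 to execute them.

# run CMD...: print the command, or execute it when RUN=1. (Hypothetical helper.)
run() {
    if [ "${RUN:-0}" = "1" ]; then
        "$@"
    else
        echo "$*"
    fi
}

# check_card DEVICE: DEVICE is the whole-disk node (the /dev/sdX placeholder from the post).
check_card() {
    dev="$1"
    part="${dev}1"    # first partition, i.e. /dev/sdX1 in the post

    # The filesystem must not be mounted while it is being checked.
    run sudo umount "$part"
    # -f forces a full check even if the filesystem is marked clean;
    # -n opens it read-only and answers "no" to every repair prompt.
    run sudo fsck.ext4 -f -n "$part"
    # Assumption beyond the post: a read-only surface scan of the whole card.
    run sudo badblocks -sv "$dev"
}
```

Calling `check_card /dev/sdX` prints the three commands for review; `RUN=1 check_card /dev/sdX` (with the real device name) executes them. Both checks are read-only, so a clean result does not rule out a card that only fails on writes.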