Jump to content

Recommended Posts

Posted

Kernel crash on freshly installed image for LonganPi 3H. Image downloaded from: https://www.armbian.com/longanpi-3h/ Ubuntu 24.04 Noble.
Sometimes it will crash few seconds after boot, before logging or doing anything.
Didn't install or do anything, just apt-get update and upgrade.

 

logs: https://paste.next.armbian.com/ocofepenup
 

[ 1511.118664] Internal error: Oops: 0000000096000044 [#1] SMP
[ 1511.124251] Modules linked in: sunrpc rtl8xxxu mac80211 cfg80211 sunxi_cedrus(C) rfkill libarc4 v4l2_mem2mem videobuf2_dma_contig videobuf2_memops videobuf2_v4l2 videodev videobuf2_common polyval_ce sun50i_h6_prcm_ppu mc polyval_generic dump_reg cpufreq_dt zram zsmalloc binfmt_misc sch_fq_codel fuse dm_mod realtek sun6i_rtc_ccu dwmac_sun8i mdio_mux
[ 1511.155156] CPU: 1 UID: 0 PID: 2658 Comm: sshd Tainted: G        WC         6.12.23-current-sunxi64 #1
[ 1511.164459] Tainted: [W]=WARN, [C]=CRAP
[ 1511.168292] Hardware name: Sipeed Longan Pi 3H (DT)
[ 1511.173166] pstate: 600000c5 (nZCv daIF -PAN -UAO -TCO -DIT -SSBS BTYPE=--)
[ 1511.180124] pc : enqueue_timer+0x3c/0x150
[ 1511.184141] lr : __mod_timer+0x30c/0x370
[ 1511.188063] sp : ffff800085a13920
[ 1511.191376] x29: ffff800085a13920 x28: 00000000000005a8 x27: 0000000000000000
[ 1511.198514] x26: ffff8000821f6000 x25: 0000000000000001 x24: 0000000100049ec4
[ 1511.205652] x23: 000000000000000c x22: ffff0000ff760000 x21: 0000000100049ecc
[ 1511.212789] x20: ffff0000ff760000 x19: 0000000100049ec4 x18: 0000000000000042
[ 1511.219926] x17: 85a0e7cbaf550a08 x16: 01010000d9864102 x15: 1880ada177e5cf73
[ 1511.227064] x14: 54c33ab316000103 x13: 6a5d85a0e7cbaf55 x12: 0a0801010000d986
[ 1511.234201] x11: 41021880ada177e5 x10: cf7354c33ab31600 x9 : ffff800080125244
[ 1511.241341] x8 : ffff800080b33870 x7 : ffff0000ff7600d0 x6 : ffff800082201980
[ 1511.248479] x5 : 000000000000000c x4 : 000000000000000c x3 : 0000000100049ecc
[ 1511.255617] x2 : ffff0000ff760070 x1 : 0000000000000000 x0 : ffff0000ff760000
[ 1511.262756] Call trace:
[ 1511.265204]  enqueue_timer+0x3c/0x150
[ 1511.268868]  __mod_timer+0x30c/0x370
[ 1511.272444]  mod_timer+0x1c/0x30
[ 1511.275673]  sk_reset_timer+0x28/0x98
[ 1511.279339]  tcp_schedule_loss_probe.part.0+0x13c/0x270
[ 1511.284567]  tcp_write_xmit+0x320/0x1390
[ 1511.288492]  __tcp_push_pending_frames+0x44/0x108
[ 1511.293197]  tcp_push+0xbc/0x168
[ 1511.296431]  tcp_sendmsg_locked+0xa14/0xc38
[ 1511.300616]  tcp_sendmsg+0x40/0x70
[ 1511.304021]  inet6_sendmsg+0x4c/0x78
[ 1511.307599]  __sock_sendmsg+0x64/0xc0
[ 1511.311265]  sock_write_iter+0xa8/0x118
[ 1511.315102]  vfs_write+0x334/0x3b8
[ 1511.318508]  ksys_write+0xf8/0x120
[ 1511.321911]  __arm64_sys_write+0x24/0x38
[ 1511.325835]  invoke_syscall+0x50/0x120
[ 1511.329590]  el0_svc_common.constprop.0+0x48/0xf0
[ 1511.334296]  do_el0_svc+0x24/0x38
[ 1511.337615]  el0_svc+0x30/0xd0
[ 1511.340675]  el0t_64_sync_handler+0x120/0x130
[ 1511.345034]  el0t_64_sync+0x190/0x198
[ 1511.348703] Code: a9025bf5 aa0003f6 aa0303f5 f8657841 (f9000261)
[ 1511.354794] ---[ end trace 0000000000000000 ]---
[ 1511.359410] note: sshd[2658] exited with irqs disabled
[ 1571.130516] rcu: INFO: rcu_sched detected stalls on CPUs/tasks:
[ 1571.136471] rcu:     1-...0: (7 GPs behind) idle=ad6c/1/0x4000000000000000 softirq=10876/10876 fqs=6851
[ 1571.145603] rcu:     (detected by 2, t=15004 jiffies, g=19153, q=1042 ncpus=4)
[ 1571.152564] Sending NMI from CPU 2 to CPUs 1:
[ 1581.152989] rcu: rcu_sched kthread starved for 555 jiffies! g19153 f0x0 RCU_GP_DOING_FQS(6) ->state=0x0 ->cpu=0
[ 1581.167420] rcu:     Unless rcu_sched kthread gets sufficient CPU time, OOM is now expected behavior.
[ 1581.176371] rcu: RCU grace-period kthread stack dump:
[ 1581.181418] task:rcu_sched       state:R  running task     stack:0     pid:17    tgid:17    ppid:2      flags:0x00000008
[ 1581.192290] Call trace:
[ 1581.194738]  __switch_to+0xe8/0x150
[ 1581.198235]  0x0
[ 1581.200080] rcu: Stack dump where RCU GP kthread last ran:
[ 1581.205561] Sending NMI from CPU 2 to CPUs 0:

 

Posted

looks like a really cool machine.  unfortunately, no maintainer in armbian and I am not aware if any of the regular developpers has that SBC.  It seems to be based on the H618 which it shares with the Orange Pi Zero3 which is reported working with the latest Armbian release.

 

Did this board ever work with an older release? Is this a regression?  I see somebody else mentioning they have issues with the board booting, , apparently due to issues with thermal management.  And your crash apparently started in pstate.

Posted

Hi, I am not aware of the older Longan image. I've used it for a few months with OrangePi Zero3 image without kernel panics.

 

I've collected few more kernel crashes on the same image (latest Ubuntu Noble freshly installed on SD card):

 

apt update on USB WiFi, LAN still connected, DKMS aic8800 built, freshly reinstalled image
 

Spoiler


20% [7 Packages store 0 B] [8 Contents-arm64 1,210 kB/85.3 MB 1%]
[  577.526353] Unable to handle kernel NULL pointer dereference at virtual address 000000000000000d
 Mem abort info:
   ESR = 0x0000000096000004
   EC = 0x25: DABT (current EL), IL = 32 bits
   SET = 0, FnV = 0
   EA = 0, S1PTW = 0
   FSC = 0x04: level 0 translation fault
 Data abort info:
   ISV = 0, ISS = 0x00000004, ISS2 = 0x00000000
   CM = 0, WnR = 0, TnD = 0, TagAccess = 0
   GCS = 0, Overlay = 0, DirtyBit = 0, Xs = 0
 user pgtable: 4k pages, 48-bit VAs, pgdp=000000011aeef000
 [000000000000000d] pgd=0000000000000000, p4d=0000000000000000
 Internal error: Oops: 0000000096000004 [#1] SMP
 Modules linked in: sunrpc rtl8xxxu mac80211 cfg80211 rfkill libarc4 polyval_ce sunxi_cedrus(C) v4l2_mem2mem sun50i_h6_prcm_ppu polyval_generic videobuf2_dma_contig videobuf2_memops videobuf2_v4l2 dump_reg videodev videobuf2_common mc cpufreq_dt zram zsmalloc binfmt_misc sch_fq_codel fuse dm_mod realtek sun6i_rtc_ccu dwmac_sun8i mdio_mux
 CPU: 2 UID: 42 PID: 4102 Comm: store Tainted: G         C         6.12.23-current-sunxi64 #1
 Tainted: [C]=CRAP
 Hardware name: Sipeed Longan Pi 3H (DT)
 pstate: 80000005 (Nzcv daif -PAN -UAO -TCO -DIT -SSBS BTYPE=--)
 pc : ext4_rename2+0x58/0x120
 lr : vfs_rename+0x484/0xa60
 sp : ffff800086fd3bb0
 x29: ffff800086fd3bb0 x28: 0000000000000000 x27: ffff800086fd3d38
 x26: 0000000000000000 x25: ffff0000f83a5bc8 x24: ffff8000822e52b8
 x23: ffff0000f902b540 x22: ffff0000c070f240 x21: 0000000000000001
 x20: 0000000000000000 x19: ffff0000e091f270 x18: 0000000000000000
 x17: 0000000000000000 x16: 0000000000000000 x15: 0000000000000000
 x14: 0000000000000000 x13: ffff0000c2294033 x12: ffff800086fd3bc4
 x11: 00000007e2a04d4a x10: 0000000000000002 x9 : ffff80008038de34
 x8 : ffff800086fd3c58 x7 : 0000000000000001 x6 : ffff8000804aba28
 x5 : 0000000000000000 x4 : ffff0000f902b540 x3 : 0000000000000001
 x2 : ffff0000c070f240 x1 : ffff0000e091f270 x0 : 0000000000000000
 Call trace:
  ext4_rename2+0x58/0x120
  vfs_rename+0x484/0xa60
  do_renameat2+0x4a4/0x518
  __arm64_sys_renameat+0x58/0x78
  invoke_syscall+0x50/0x120
  el0_svc_common.constprop.0+0x48/0xf0
  do_el0_svc+0x24/0x38
  el0_svc+0x30/0xd0
  el0t_64_sync_handler+0x120/0x130
  el0t_64_sync+0x190/0x198
 Code: b9400c20 aa0303f5 aa0403f7 37700220 (b9400c60)
 ---[ end trace 0000000000000000 ]---


 

While doing apt install zfs-dkms

Spoiler
 Unable to handle kernel paging request at virtual address fffffdfec1630210
 Mem abort info:
   ESR = 0x0000000096000045
   EC = 0x25: DABT (current EL), IL = 32 bits
   SET = 0, FnV = 0
   EA = 0, S1PTW = 0
   FSC = 0x05: level 1 translation fault
 Data abort info:
   ISV = 0, ISS = 0x00000045, ISS2 = 0x00000000
   CM = 0, WnR = 1, TnD = 0, TagAccess = 0
   GCS = 0, Overlay = 0, DirtyBit = 0, Xs = 0
 swapper pgtable: 4k pages, 48-bit VAs, pgdp=0000000041db4000
 [fffffdfec1630210] pgd=100000013f7bd003, p4d=100000013f7bd003, pud=0000000000000000
 Internal error: Oops: 0000000096000045 [#1] SMP
 Modules linked in: sunrpc rtl8xxxu mac80211 cfg80211 rfkill libarc4 sunxi_cedrus(C) v4l2_mem2mem polyval_ce polyval_generic videobuf2_dma_contig videobuf2_memops sun50i_h6_prcm_ppu videobuf2_v4l2 videodev videobuf2_common mc dump_reg cpufreq_dt zram zsmalloc binfmt_misc sch_fq_codel fuse dm_mod realtek sun6i_rtc_ccu dwmac_sun8i mdio_mux
 CPU: 2 UID: 0 PID: 47314 Comm: cc1 Tainted: G         C         6.12.23-current-sunxi64 #1
 Tainted: [C]=CRAP
 Hardware name: Sipeed Longan Pi 3H (DT)
 pstate: 200000c5 (nzCv daIF -PAN -UAO -TCO -DIT -SSBS BTYPE=--)
 pc : __rmqueue_pcplist+0x33c/0xb58
 lr : __rmqueue_pcplist+0x444/0xb58
 sp : ffff80008f73b890
 x29: ffff80008f73b910 x28: 0000000000000010 x27: fffffdffc15f1c00
 x26: 0000000000000110 x25: 0000000000000001 x24: 0000000000000000
 x23: ffff0000ff7cfd00 x22: 000000000000037d x21: ffff0000ff794ff0
 x20: dead000000000040 x19: 0000000000000001 x18: ffffffffffffffff
 x17: 00000000f0000000 x16: 0000000000000010 x15: 0000000000000068
 x14: 0000000000000100 x13: 1fffe00018aa1fa1 x12: 0000000000000001
 x11: fffffdffc15f1c00 x10: fffffdfec1630208 x9 : ffff800080307834
 x8 : fffffdffc122e1c8 x7 : ffff80007da5e000 x6 : ffff80008f73b890
 x5 : ffff0000ff7cff48 x4 : 0000000000000003 x3 : 0000000000000003
 x2 : fffffdffc15f1c08 x1 : 0000000000000003 x0 : 0000000000000008
 Call trace:
  __rmqueue_pcplist+0x33c/0xb58
  get_page_from_freelist+0x67c/0x1698
  __alloc_pages_noprof+0x15c/0xe00
  alloc_pages_mpol_noprof+0x98/0x218
  folio_alloc_mpol_noprof+0x1c/0x70
  vma_alloc_folio_noprof+0x88/0xe8
  vma_alloc_zeroed_movable_folio+0x40/0x58
  __handle_mm_fault+0xcb0/0x1208
  handle_mm_fault+0x134/0x258
  do_page_fault+0x13c/0x520
  do_translation_fault+0xa4/0xc0
  do_mem_abort+0x4c/0xa8
  el0_da+0x2c/0xa0
  el0t_64_sync_handler+0xb4/0x130
  el0t_64_sync+0x190/0x198
 Code: 2a0103e3 1ac12000 d503201f a940144a (f9000545)
 ---[ end trace 0000000000000000 ]---
 note: cc1[47314] exited with irqs disabled

 

 

panic3:

Spoiler
[ 8838.754724] Unable to handle kernel NULL pointer dereference at virtual address 0000000000000030
[ 8838.763594] Mem abort info:
[ 8838.766400]   ESR = 0x0000000096000004
[ 8838.770155]   EC = 0x25: DABT (current EL), IL = 32 bits
[ 8838.775472]   SET = 0, FnV = 0
[ 8838.778532]   EA = 0, S1PTW = 0
[ 8838.781678]   FSC = 0x04: level 0 translation fault
[ 8838.786559] Data abort info:
[ 8838.789445]   ISV = 0, ISS = 0x00000004, ISS2 = 0x00000000
[ 8838.794932]   CM = 0, WnR = 0, TnD = 0, TagAccess = 0
[ 8838.799987]   GCS = 0, Overlay = 0, DirtyBit = 0, Xs = 0
[ 8838.805303] user pgtable: 4k pages, 48-bit VAs, pgdp=000000010387c000
[ 8838.811747] [0000000000000030] pgd=0000000000000000, p4d=0000000000000000
[ 8838.818567] Internal error: Oops: 0000000096000004 [#1] SMP
[ 8838.824143] Modules linked in: sunrpc rtl8xxxu mac80211 cfg80211 sunxi_cedrus(C) rfkill libarc4 v4l2_mem2mem videobuf2_dma_contig videobuf2_memops polyval_ce videobuf2_v4l2 sun50i_h6_prcm_ppu videodev polyval_generic videobuf2_common mc dump_reg cpufreq_dt zram zsmalloc binfmt_misc sch_fq_codel fuse dm_mod sun6i_rtc_ccu realtek dwmac_sun8i mdio_mux
[ 8838.855106] CPU: 0 UID: 0 PID: 0 Comm: swapper/0 Tainted: G        WC         6.12.23-current-sunxi64 #1
[ 8838.864588] Tainted: [W]=WARN, [C]=CRAP
[ 8838.868422] Hardware name: Sipeed Longan Pi 3H (DT)
[ 8838.873297] pstate: 80000005 (Nzcv daif -PAN -UAO -TCO -DIT -SSBS BTYPE=--)
[ 8838.880260] pc : tcp_schedule_loss_probe.part.0+0x20/0x270
[ 8838.885759] lr : tcp_write_xmit+0x320/0x1390
[ 8838.890034] sp : ffff800080003d60
[ 8838.893348] x29: ffff800080003d80 x28: 00000000000005a8 x27: 0000000000000000
[ 8838.900493] x26: 00000000000005a8 x25: 0000000000000001 x24: ffff0000c2828b90
[ 8838.907638] x23: 000000000000002d x22: ffff0000c2828a00 x21: ffff0000c149ba28
[ 8838.914784] x20: ffff8000821f6000 x19: 0000000000000000 x18: 0000000000000042
[ 8838.921929] x17: 9989833584a40a08 x16: 010100002988f401 x15: 188078bc986c43d2
[ 8838.929074] x14: b302168c16000103 x13: 96629989833584a4 x12: 0a08010100002988
[ 8838.936219] x11: f401188078bc986c x10: 43d2b302168c1600 x9 : ffff800080f479b8
[ 8838.943363] x8 : ffff800080b33870 x7 : ffff0000c2828b88 x6 : ffff800082201980
[ 8838.950508] x5 : 0000000002b3d1bb x4 : 0000000002b3d3df x3 : 000000000000000a
[ 8838.957652] x2 : 00000000002092dc x1 : 0000000000010020 x0 : 0000000000000000
[ 8838.964797] Call trace:
[ 8838.967245]  tcp_schedule_loss_probe.part.0+0x20/0x270
[ 8838.972388]  tcp_write_xmit+0x320/0x1390
[ 8838.976316]  tcp_tsq_write.part.0+0x70/0xc8
[ 8838.980506]  tcp_tsq_handler+0x100/0x148
[ 8838.984433]  tcp_tasklet_func+0xcc/0x140
[ 8838.988362]  tasklet_action_common+0xf0/0x288
[ 8838.992724]  tasklet_action+0x38/0x50
[ 8838.996391]  handle_softirqs+0x11c/0x2f8
[ 8839.000318]  __do_softirq+0x1c/0x28
[ 8839.003813]  ____do_softirq+0x18/0x30
[ 8839.007482]  call_on_irq_stack+0x24/0x58
[ 8839.011411]  do_softirq_own_stack+0x24/0x38
[ 8839.015600]  irq_exit_rcu+0x88/0xc8
[ 8839.019091]  el1_interrupt+0x38/0x58
[ 8839.022678]  el1h_64_irq_handler+0x18/0x28
[ 8839.026784]  el1h_64_irq+0x64/0x68
[ 8839.030190]  default_idle_call+0x38/0xd8
[ 8839.034118]  do_idle+0x204/0x268
[ 8839.037358]  cpu_startup_entry+0x3c/0x50
[ 8839.041291]  rest_init+0xc4/0xd0
[ 8839.044524]  start_kernel+0x52c/0x7a8
[ 8839.048193]  __primary_switched+0x80/0x90
[ 8839.052221] Code: a9027bfd 910083fd a90353f3 aa0003f3 (f9401800)
[ 8839.058311] ---[ end trace 0000000000000000 ]---
[ 8839.062929] Kernel panic - not syncing: Oops: Fatal exception in interrupt
[ 8839.069801] SMP: stopping secondary CPUs
[ 8839.073731] Kernel Offset: disabled
[ 8839.077219] CPU features: 0x00,00000080,00200000,0200420b
[ 8839.082610] Memory Limit: none
[ 8839.085669] ---[ end Kernel panic - not syncing: Oops: Fatal exception in interrupt ]---

 

 

Posted
2 часа назад, Johnny on the couch сказал:

've collected few more kernel crashes on the same image (latest Ubuntu Noble freshly installed on SD card):

When you have started the device on a new operating system image, does the kernel panic appear?

 

After what action does the core panic?

Posted

Sometimes panic is after reboot, sometimes is after few hours of uptime. Panic1 from the previous post is while doing apt update. 2nd was while compiling zfs-dkms and 3rd I don't rember.

 

The same Noble image works on eMMC, without panic, for now. I'll try to reproduce it with another SD card.

Posted
3 часа назад, Johnny on the couch сказал:

panic3:

All the kernel panic options that you have shown here occur on v6.12.23.

It all happens randomly.

I suspect that this may be due to the presence of broken (faulty) blocks on the memory device.
It's easy to check this.If you connect the SD card via an adapter to a Linux computer:

sudo fsck.ext4 /dev/sdX1

 

Join the conversation

You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.

Guest
Reply to this topic...

×   Pasted as rich text.   Restore formatting

  Only 75 emoji are allowed.

×   Your link has been automatically embedded.   Display as a link instead

×   Your previous content has been restored.   Clear editor

×   You cannot paste images directly. Upload or insert images from URL.

Loading...
×
×
  • Create New...

Important Information

Terms of Use - Privacy Policy - Guidelines