jbergler Posted October 21, 2020 Posted October 21, 2020 @aprayoga if you still need it here's a full boot log of the crash (the actual stacktrace of the crash is inconsistent for me) Spoiler DDR Version 1.24 20191016 In A: 0x1804 SecureInit read PBA: 0x1c04 SecureInit ret = 0, SecureMode = 0 atags_set_bootdev: ret:(0) GPT 0x3380ec0 signature is wrong recovery gpt... GPT 0x3380ec0 signature is wrong recovery gpt fail! LoadTrust Addr:0x4000 No find bl30.bin No find bl32.bin Load uboot, ReadLba = 2000 Load OK, addr=0x200000, size=0xdcfd8 RunBL31 0x40000 NOTICE: BL31: v1.3(debug):42583b6 NOTICE: BL31: Built : 07:55:13, Oct 15 2019 NOTICE: BL31: Rockchip release version: v1.1 INFO: GICv3 with legacy support detected. ARM GICV3 driver initialized in EL3 INFO: Using opteed sec cpu_context! INFO: boot cpu mask: 0 INFO: plat_rockchip_pmu_init(1190): pd status 3e INFO: BL31: Initializing runtime services WARNING: No OPTEE provided by BL2 boot loader, Booting device without OPTEE initialization. SMC`s destined for OPTEE will return SMC_UNK ERROR: Error initializing runtime service opteed_fast INFO: BL31: Preparing for EL3 exit to normal world INFO: Entry point address = 0x200000 INFO: SPSR = 0x3c9 U-Boot 2020.07-armbian (Oct 13 2020 - 16:49:59 +0200) SoC: Rockchip rk3399 Reset cause: POR DRAM: 3.9 GiB PMIC: RK808 SF: Detected w25q128 with page size 256 Bytes, erase size 4 KiB, total 16 MiB MMC: mmc@fe320000: 1, sdhci@fe330000: 0 Loading Environment from MMC... scanning bus for devices... Hit any key to stop autoboot: 0 Card did not respond to voltage select! switch to partitions #0, OK mmc0(part 0) is current device Scanning mmc 0:1... Found U-Boot script /boot/boot.scr 27236864 bytes read in 2646 ms (9.8 MiB/s) Starting kernel ... [ 16.291853] systemd[1]: Failed to start Import ZFS pools by cache file. [ 16.768236] systemd[1]: Failed to start Set console font and keymap. ckchipdrm analogix_dp dw_hdmi dw_mipi_dsi drm_kms_helper cec rc_core drm adc_keys drm_panel_orientation_quirks [ 23.935692] CPU: 4 PID: 0 Comm: swapper/4 Tainted: G C 5.8.16-rockchip64 #20.08.14 [ 23.936474] Hardware name: Helios64 (DT) [ 23.936827] pstate: 80000085 (Nzcv daIf -PAN -UAO BTYPE=--) [ 23.937336] pc : __update_load_avg_se+0x64/0x308 [ 23.937751] lr : update_load_avg+0x278/0x5e0 [ 23.938131] sp : ffff800011abbd80 [ 23.938428] x29: ffff800011abbd80 x28: 0000000000000001 [ 23.938901] x27: ffff0000f6ea5700 x26: ffff800011abc000 [ 23.939373] x25: ffff8000114f1d20 x24: 0000000000000000 [ 23.939846] x23: ffff0000f77a7b40 x22: 0000000000000005 [ 23.940317] x21: 000000058fae126f x20: ffff0000f61c80c0 [ 23.940789] x19: ffff0000f61c80c0 x18: 0000000000000000 [ 23.941261] x17: 0000000000000000 x16: 0000000000000000 [ 23.941733] x15: 0000000000000000 x14: 0000000000000000 [ 23.942205] x13: 0000000000000332 x12: 0000000000000339 [ 23.942676] x11: 0000000000000001 x10: 0000000000000a20 [ 23.943148] x9 : ffff800011c53e70 x8 : 0000000000000000 [ 23.943620] x7 : 0000000000000000 x6 : 0000000019d0a216 [ 23.944092] x5 : 0000000000000000 x4 : 000000000000005e [ 23.944563] x3 : 0000000000000000 x2 : 000000058fae1000 [ 23.945034] x1 : ffff0000f77a7bc0 x0 : 000000000000005e [ 23.945508] Call trace: [ 23.945734] __update_load_avg_se+0x64/0x308 [ 23.946118] update_load_avg+0x278/0x5e0 [ 23.946472] enqueue_entity+0x68/0x228 [ 23.946809] enqueue_task_fair+0x78/0x4d8 [ 23.947171] ttwu_do_activate+0x84/0xe8 [ 23.947516] sched_ttwu_pending+0xc0/0x168 [ 23.947884] flush_smp_call_function_queue+0xec/0x258 [ 23.948334] generic_smp_call_function_single_interrupt+0x14/0x20 [ 23.948879] handle_IPI+0x258/0x3e8 [ 23.949195] gic_handle_irq+0x154/0x158 [ 23.949539] el1_irq+0xb8/0x180 [ 23.949824] arch_cpu_idle+0x28/0x218 [ 23.950158] default_idle_call+0x1c/0x44 [ 23.950510] do_idle+0x210/0x288 [ 23.950801] cpu_startup_entry+0x24/0x68 [ 23.951155] secondary_start_kernel+0x140/0x178 [ 23.951564] Code: 2a0403e0 8b030042 f9402023 f9006262 (b9401cc1) [ 23.952116] ---[ end trace fd422e59e1cb21cd ]--- [ 23.952527] Kernel panic - not syncing: Fatal exception in interrupt [ 23.953092] SMP: stopping secondary CPUs [ 23.953448] Kernel Offset: disabled [ 23.953763] CPU features: 0x240022,2000600c [ 23.954135] Memory Limit: none [ 23.954422] ---[ end Kernel panic - not syncing: Fatal exception in interrupt ]--- Spoiler channel 0 CS = 0 MR0=0x18 MR4=0x1 MR5=0x1 MR8=0x10 MR12=0x72 MR14=0x72 MR18=0x0 MR19=0x0 MR24=0x8 MR25=0x0 channel 1 CS = 0 MR0=0x18 MR4=0x1 MR5=0x1 MR8=0x10 MR12=0x72 MR14=0x72 MR18=0x0 MR19=0x0 MR24=0x8 MR25=0x0 channel 0 training pass! channel 1 training pass! change freq to 416MHz 0,1 Channel 0: LPDDR4,416MHz Bus Width=32 Col=10 Bank=8 Row=16 CS=1 Die Bus-Width=16 Size=2048MB Channel 1: LPDDR4,416MHz Bus Width=32 Col=10 Bank=8 Row=16 CS=1 Die Bus-Width=16 Size=2048MB 256B stride channel 0 CS = 0 MR0=0x18 MR4=0x1 MR5=0x1 MR8=0x10 MR12=0x72 MR14=0x72 MR18=0x0 MR19=0x0 MR24=0x8 MR25=0x0 channel 1 CS = 0 MR0=0x18 MR4=0x1 MR5=0x1 MR8=0x10 MR12=0x72 MR14=0x72 MR18=0x0 MR19=0x0 MR24=0x8 MR25=0x0 channel 0 training pass! channel 1 training pass! channel 0, cs 0, advanced training done channel 1, cs 0, advanced training done change freq to 856MHz 1,0 ch 0 ddrconfig = 0x101, ddrsize = 0x40 ch 1 ddrconfig = 0x101, ddrsize = 0x40 pmugrf_os_reg[2] = 0x32C1F2C1, stride = 0xD ddr_set_rate to 328MHZ ddr_set_rate to 666MHZ ddr_set_rate to 928MHZ channel 0, cs 0, advanced training done ll return SMC_UNK ERROR: Error initializing runtime service opteed_fast INFO: BL31: Preparing for EL3 exit to normal world INFO: Entry point address = 0x200000 INFO: SPSR = 0x3c9 U-Boot 2020.07-armbian (Oct 13 2020 - 16:49:59 +0200) SoC: Rockchip rk3399 Reset cause: POR DRAM: 3.9 GiB PMIC: RK808 eth0: ethernet@fe300000 scanning bus for devices... Hit any key to stop autoboot: 0 Card did not respond to voltage select! Scanning mmc 0:1... Found U-Boot script /boot/boot.scr 3185 bytes read in 18 ms (171.9 KiB/s) ## Executing script at 00500000 Boot script loaded from mmc 0 166 bytes read in 12 ms (12.7 KiB/s) 16311106 bytes read in 1601 ms (9.7 MiB/s) 27236864 bytes read in 2645 ms (9.8 MiB/s) OK ## Flattened Device Tree blob at 01f00000 Booting using the fdt blob at 0x1f00000 Loading Ramdisk to f4f58000, end f5ee6302 ... OK Loading Device Tree to 00000000f4edc000, end 00000000f4f57fff ... OK [ 15.396135] OF: graph: no port node found in /syscon@ff770000/usb2-phy@e450/otg-port [ 16.304499] systemd[1]: Failed to start Import ZFS pools by cache file. [ 16.784363] systemd[1]: Failed to start Set console font and keymap. [ 21.126776] kernel BUG at arch/arm64/kernel/traps.c:470! [ 21.127260] Internal error: Oops - BUG: 0 [#1] PREEMPT SMP [ 21.127750] Modules linked in: rfkill governor_performance zstd zram r8152 snd_soc_hdmi_codec leds_pwm gpio_charger pwm_fan snd_soc_rockchip_i2s snd_soc_core snd_pcm_dmaengine panfrost snd_pcm snd_timer snd soundcore gpu_sched sg rockchip_rga hantro_vpu(C) rockchip_vdec(C) videobuf2_dma_sg v4l2_h264 videobuf2_dma_contig videobuf2_vmalloc v4l2_mem2mem videobuf2_memopsx5 : 00000000d5300000 x4 : ffff800011806118 [ 21.142542] x3 : 00000000d6500000 x2 : 0000000000000002 [ 21.143014] x1 : ffff0000f6ea6580 x0 : 0000000040000085 [ 21.143488] Call trace: [ 21.143717] do_undefinstr+0x2ec/0x310 [ 21.144061] el1_sync_handler+0x88/0x110 [ 21.144414] el1_sync+0x7c/0x100 [ 21.144713] check_preemption_disabled+0x48/0x108 [ 21.145143] irq_exit+0x70/0xe0 [ 21.145434] handle_IPI+0x25c/0x3e8 [ 21.145758] gic_handle_irq+0x154/0x158 [ 21.146103] el1_irq+0xb8/0x180 [ 21.146386] arch_cpu_idle+0x28/0x218 [ 21.146721] default_idle_call+0x1c/0x44 [ 21.147082] do_idle+0x210/0x288 [ 21.147373] cpu_startup_entry+0x28/0x68 [ 21.147725] secondary_start_kernel+0x140/0x178 [ 21.148137] Code: f9401bf7 17ffff7d a9025bf5 f9001bf7 (d4210000) [ 21.148691] ---[ end trace 9f0e999f37a83ba9 ]--- [ 21.149105] Kernel panic - not syncing: Attempted to kill the idle task! [ 21.149704] SMP: stopping secondary CPUs [ 21.150064] Kernel Offset: disabled [ 21.150378] CPU features: 0x240022,2000600c [ 21.150750] Memory Limit: none [ 21.151036] ---[ end Kernel panic - not syncing: Attempted to kill the idle task! ]--- 1
flower Posted October 21, 2020 Posted October 21, 2020 Sadly buster.13 is still unstable. Got this after a few hours without much load. [ 8251.657645] ------------[ cut here ]------------ [ 8251.658075] kernel BUG at arch/arm64/kernel/traps.c:470! [ 8251.658551] Internal error: Oops - BUG: 0 [#1] PREEMPT SMP [ 8251.659038] Modules linked in: iptable_nat iptable_filter bpfilter wireguard libchacha20poly1305 poly1305_neon ip6_udp_tunnel udp_tunnel libblake2s libcurve25519_generic libblake2s_generic veth xt_nat xt_tcpudp xt_conntrack xt_MASQUERADE nf_conntrack_netlink xfrm_user xfrm_algo nft_counter xt_addrtype nft_compat nft_chain_nat nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 nf_tables nfnetlink br_netfilter bridge governor_performance zstd zram r8152 snd_soc_hdmi_codec snd_soc_rockchip_i2s leds_pwm snd_soc_core panfrost pwm_fan gpio_charger hantro_vpu(C) gpu_sched snd_pcm_dmaengine snd_pcm rockchip_vdec(C) rockchip_rga v4l2_h264 snd_timer videobuf2_dma_sg videobuf2_dma_contig snd rockchipdrm v4l2_mem2mem dw_mipi_dsi videobuf2_vmalloc videobuf2_memops dw_hdmi videobuf2_v4l2 soundcore analogix_dp videobuf2_common drm_kms_helper cec rc_core videodev fusb30x(C) gpio_beeper mc drm drm_panel_orientation_quirks sg cpufreq_dt lm75 ip_tables x_tables autofs4 raid456 async_raid6_recov async_memcpy [ 8251.659153] async_pq async_xor async_tx raid1 multipath linear raid10 raid0 md_mod realtek dwmac_rk stmmac_platform stmmac mdio_xpcs adc_keys [ 8251.667928] CPU: 4 PID: 0 Comm: swapper/4 Tainted: G C 5.8.16-rockchip64 #20.08.13 [ 8251.668710] Hardware name: Helios64 (DT) [ 8251.669062] pstate: 00000085 (nzcv daIf -PAN -UAO BTYPE=--) [ 8251.669564] pc : do_undefinstr+0x2ec/0x310 [ 8251.669931] lr : do_undefinstr+0x1e0/0x310 [ 8251.670296] sp : ffff800011abbd50 [ 8251.670593] x29: ffff800011abbd50 x28: ffff0000f6ea5700 [ 8251.671067] x27: ffff0000f6ea5700 x26: ffff800011abc000 [ 8251.671540] x25: ffff8000114f1d20 x24: 0000000000000000 [ 8251.672012] x23: 0000000040000085 x22: ffff800010df6d90 [ 8251.672484] x21: ffff800011abbf00 x20: ffff0000f6ea5700 [ 8251.672956] x19: ffff800011abbdc0 x18: 0000000000000000 [ 8251.673428] x17: 0000000000000000 x16: 0000000000000000 [ 8251.673900] x15: 0000000000000006 x14: 00000d554264da40 [ 8251.674372] x13: 0000000000000322 x12: 000000000000036f [ 8251.674844] x11: 0000000000000001 x10: 0000000000000004 [ 8251.675316] x9 : ffff0000f77a8590 x8 : ffff0000f77a7bc0 [ 8251.675788] x7 : ffff0000f6211dc0 x6 : ffff800011abbda8 [ 8251.676260] x5 : 00000000d5300000 x4 : ffff800011806118 [ 8251.676731] x3 : 0000000034000000 x2 : 0000000000000002 [ 8251.677203] x1 : ffff0000f6ea5700 x0 : 0000000040000085 [ 8251.677676] Call trace: [ 8251.677900] do_undefinstr+0x2ec/0x310 [ 8251.678240] el1_sync_handler+0x88/0x110 [ 8251.678591] el1_sync+0x7c/0x100 [ 8251.678886] check_preemption_disabled+0x30/0x108 [ 8251.679305] __this_cpu_preempt_check+0x1c/0x34 [ 8251.679710] irq_exit+0x70/0xe0 [ 8251.679996] handle_IPI+0x25c/0x3e8 [ 8251.680312] gic_handle_irq+0x154/0x158 [ 8251.680655] el1_irq+0xb8/0x180 [ 8251.680938] arch_cpu_idle+0x28/0x218 [ 8251.681269] default_idle_call+0x1c/0x44 [ 8251.681622] do_idle+0x210/0x288 [ 8251.681912] cpu_startup_entry+0x28/0x68 [ 8251.682265] secondary_start_kernel+0x140/0x178 [ 8251.682673] Code: f9401bf7 17ffff7d a9025bf5 f9001bf7 (d4210000) [ 8251.683223] ---[ end trace 47af1c5026823974 ]--- [ 8251.683636] Kernel panic - not syncing: Attempted to kill the idle task! [ 8251.684232] SMP: stopping secondary CPUs [ 8251.684588] Kernel Offset: disabled [ 8251.684903] CPU features: 0x240022,2000600c [ 8251.685275] Memory Limit: none [ 8251.685562] ---[ end Kernel panic - not syncing: Attempted to kill the idle task! ]--- 1
registr123 Posted October 21, 2020 Posted October 21, 2020 6 hours ago, jbergler said: If you, like myself, installed on eMMC and are experiencing the crashes on 20.08.14 - I booted up via a 20.08.10 sdcard and fixed the environment on emmc @aprayoga It's probably unrelated, but while working through the above I noticed that I ran out of space on /boot. I installed to eMMC the first version that was working, if that helps. I chose f2fs when I installed on eMMAC and this is the resulting partition layout mmcblk1 179:32 0 14.6G 0 disk ├─mmcblk1p1 179:33 0 96M 0 part └─mmcblk1p2 179:34 0 14.3G 0 part mmcblk1boot0 179:64 0 4M 1 disk mmcblk1boot1 179:96 0 4M 1 disk Sadly I didn't grab enough info from what was in the boot partition before I nuked it and reinstalled the appropriate packages. I choose ext4 resulting in this : mmcblk1 179:32 0 14.6G 0 disk └─mmcblk1p1 179:33 0 14.4G 0 part mmcblk1boot0 179:64 0 4M 1 disk mmcblk1boot1 179:96 0 4M 1 disk
flower Posted October 21, 2020 Posted October 21, 2020 i got this kernel exception while running iperf3 speed tests. helios64 did not crash though (i did a clean reboot) cpu load is at 100% while sending or receiving (this is expected because of no tx offloading) i have seen some connection drops with very long living connections (mutliple hours) maybe thats related. one additional want to have: as i am not at home over weekends i'd love to see a feature that helios64 would auto restart on crash. i see the red led blinking. that way my wireguard/pihole would continue working [12403.586397] Hardware name: Helios64 (DT) [12403.586763] pstate: 00000005 (nzcv daif -PAN -UAO BTYPE=--) [12403.587273] pc : dev_watchdog+0x39c/0x3a8 [12403.587646] lr : dev_watchdog+0x39c/0x3a8 [12403.588014] sp : ffff800011abbd30 [12403.588321] x29: ffff800011abbd30 x28: ffff0000e7543e80 [12403.588808] x27: 0000000000000004 x26: 0000000000000140 [12403.589296] x25: 00000000ffffffff x24: 0000000000000002 [12403.589783] x23: ffff0000e6d703dc x22: ffff0000e6d70000 [12403.590270] x21: ffff0000e6d70480 x20: ffff800011807000 [12403.590757] x19: 0000000000000000 x18: 0000000000000000 [12403.591244] x17: 0000000000000000 x16: 0000000000000000 [12403.591730] x15: ffff80001182e000 x14: ffff800011a10242 [12403.592217] x13: 0000000000000000 x12: ffff800011a0f000 [12403.592705] x11: ffff80001182e000 x10: ffff800011a0f888 [12403.593192] x9 : 0000000000000000 x8 : 0000000000000006 [12403.593679] x7 : 0000000000000364 x6 : 0000000000000003 [12403.594166] x5 : 0000000000000000 x4 : 0000000000000000 [12403.594652] x3 : 0000000000000100 x2 : 0000000000000103 [12403.595138] x1 : b97470e46a7cb200 x0 : 0000000000000000 [12403.595626] Call trace: [12403.595861] dev_watchdog+0x39c/0x3a8 [12403.596206] call_timer_fn+0x30/0x1e0 [12403.596548] run_timer_softirq+0x1e0/0x5b0 [12403.596930] efi_header_end+0x16c/0x400 [12403.597287] irq_exit+0xc8/0xe0 [12403.597583] __handle_domain_irq+0x98/0x108 [12403.597971] gic_handle_irq+0x60/0x158 [12403.598319] el1_irq+0xb8/0x180 [12403.598615] arch_cpu_idle+0x28/0x218 [12403.598961] default_idle_call+0x1c/0x44 [12403.599326] do_idle+0x210/0x288 [12403.599629] cpu_startup_entry+0x24/0x68 [12403.599995] secondary_start_kernel+0x140/0x178 [12403.600410] ---[ end trace ad1687ba894eaefe ]--- [12403.600896] r8152 2-1.4:1.0 eth1: Tx timeout [12403.604757] xhci-hcd xhci-hcd.0.auto: bad transfer trb length 16754908 in event trb [12403.605617] r8152 2-1.4:1.0 eth1: Tx status -2 [12403.606197] xhci-hcd xhci-hcd.0.auto: bad transfer trb length 16754908 in event trb [12403.606964] r8152 2-1.4:1.0 eth1: Tx status -2 [12403.607640] r8152 2-1.4:1.0 eth1: Tx status -2 [12403.608300] r8152 2-1.4:1.0 eth1: Tx status -2
gprovost Posted October 22, 2020 Author Posted October 22, 2020 @flower On which kernel are you running ? Did you revert to LK 5.8.14 ? 12 hours ago, flower said: one additional want to have: as i am not at home over weekends i'd love to see a feature that helios64 would auto restart on crash. i see the red led blinking. that way my wireguard/pihole would continue working You can use watchdog service for that.
barnumbirr Posted October 22, 2020 Posted October 22, 2020 Sadly, 20.03.13 / 5.8.16 completely broke my system. This is the error I see over serial: ### ERROR ### Please RESET the board ### Here's the full output: Spoiler DDR Version 1.24 20191016 In channel 0 CS = 0 MR0=0x18 MR4=0x1 MR5=0x1 MR8=0x10 MR12=0x72 MR14=0x72 MR18=0x0 MR19=0x0 MR24=0x8 MR25=0x0 channel 1 CS = 0 MR0=0x18 MR4=0x1 MR5=0x1 MR8=0x10 MR12=0x72 MR14=0x72 MR18=0x0 MR19=0x0 MR24=0x8 MR25=0x0 channel 0 training pass! channel 1 training pass! change freq to 416MHz 0,1 Channel 0: LPDDR4,416MHz Bus Width=32 Col=10 Bank=8 Row=16 CS=1 Die Bus-Width=16 Size=2048MB Channel 1: LPDDR4,416MHz Bus Width=32 Col=10 Bank=8 Row=16 CS=1 Die Bus-Width=16 Size=2048MB 256B stride channel 0 CS = 0 MR0=0x18 MR4=0x1 MR5=0x1 MR8=0x10 MR12=0x72 MR14=0x72 MR18=0x0 MR19=0x0 MR24=0x8 MR25=0x0 channel 1 CS = 0 MR0=0x18 MR4=0x1 MR5=0x1 MR8=0x10 MR12=0x72 MR14=0x72 MR18=0x0 MR19=0x0 MR24=0x8 MR25=0x0 channel 0 training pass! channel 1 training pass! channel 0, cs 0, advanced training done channel 1, cs 0, advanced training done change freq to 856MHz 1,0 ch 0 ddrconfig = 0x101, ddrsize = 0x40 ch 1 ddrconfig = 0x101, ddrsize = 0x40 pmugrf_os_reg[2] = 0x32C1F2C1, stride = 0xD ddr_set_rate to 328MHZ ddr_set_rate to 666MHZ ddr_set_rate to 928MHZ channel 0, cs 0, advanced training done channel 1, cs 0, advanced training done ddr_set_rate to 416MHZ, ctl_index 0 ddr_set_rate to 856MHZ, ctl_index 1 support 416 856 328 666 928 MHz, current 856MHz OUT Boot1: 2019-03-14, version: 1.19 CPUId = 0x0 ChipType = 0x10, 253 SdmmcInit=2 0 BootCapSize=100000 UserCapSize=14910MB FwPartOffset=2000 , 100000 mmc0:cmd5,20 SdmmcInit=0 0 BootCapSize=0 UserCapSize=30436MB FwPartOffset=2000 , 0 StorageInit ok = 79105 SecureMode = 0 SecureInit read PBA: 0x4 SecureInit read PBA: 0x404 SecureInit read PBA: 0x804 SecureInit read PBA: 0xc04 SecureInit read PBA: 0x1004 SecureInit read PBA: 0x1404 SecureInit read PBA: 0x1804 SecureInit read PBA: 0x1c04 SecureInit ret = 0, SecureMode = 0 atags_set_bootdev: ret:(0) GPT 0x3380ec0 signature is wrong recovery gpt... GPT 0x3380ec0 signature is wrong recovery gpt fail! LoadTrust Addr:0x4000 No find bl30.bin No find bl32.bin Load uboot, ReadLba = 2000 Load OK, addr=0x200000, size=0xded90 RunBL31 0x40000 NOTICE: BL31: v1.3(debug):42583b6 NOTICE: BL31: Built : 07:55:13, Oct 15 2019 NOTICE: BL31: Rockchip release version: v1.1 INFO: GICv3 with legacy support detected. ARM GICV3 driver initialized in EL3 INFO: Using opteed sec cpu_context! INFO: boot cpu mask: 0 INFO: plat_rockchip_pmu_init(1190): pd status 3e INFO: BL31: Initializing runtime services WARNING: No OPTEE provided by BL2 boot loader, Booting device without OPTEE initialization. SMC`s destined for OPTEE will return SMC_UNK ERROR: Error initializing runtime service opteed_fast INFO: BL31: Preparing for EL3 exit to normal world INFO: Entry point address = 0x200000 INFO: SPSR = 0x3c9 U-Boot 2020.07-armbian (Oct 05 2020 - 15:57:22 +0200) SoC: Rockchip rk3399 Reset cause: POR DRAM: 3.9 GiB PMIC: RK808 SF: Detected w25q128 with page size 256 Bytes, erase size 4 KiB, total 16 MiB MMC: mmc@fe320000: 1, sdhci@fe330000: 0 Loading Environment from MMC... mmc fail to send stop cmd *** Warning - !read failed, using default environment In: serial Out: serial Err: serial Model: Helios64 Revision: 1.2 - 4GB non ECC Net: eth0: ethernet@fe300000 scanning bus for devices... Hit any key to stop autoboot: 0 switch to partitions #0, OK mmc1 is current device Scanning mmc 1:1... Found U-Boot script /boot/boot.scr 3185 bytes read in 6 ms (517.6 KiB/s) ## Executing script at 00500000 Boot script loaded from mmc 1 166 bytes read in 5 ms (32.2 KiB/s) 16005149 bytes read in 682 ms (22.4 MiB/s) 27236864 bytes read in 1155 ms (22.5 MiB/s) libfdt fdt_check_header(): FDT_ERR_BADMAGIC No FDT memory address configured. Please configure the FDT address via "fdt addr <address>" command. Aborting! ## Loading init Ramdisk from Legacy Image at 06000000 ... Image Name: uInitrd Image Type: AArch64 Linux RAMDisk Image (gzip compressed) Data Size: 16005085 Bytes = 15.3 MiB Load Address: 00000000 Entry Point: 00000000 Verifying Checksum ... OK ERROR: Did not find a cmdline Flattened Device Tree Loading Ramdisk to f4fa1000, end f5ee47dd ... OK FDT and ATAGS support not compiled in - hanging ### ERROR ### Please RESET the board ###
ebin-dev Posted October 22, 2020 Posted October 22, 2020 2 hours ago, barnumbirr said: Sadly, 20.03.13 / 5.8.16 completely broke my system. You could try to boot from SD: although the boot order officially is SPI, EMMC, SD - Helios64 seems to boot from an inserted SD card although there is a bootable system installed on emmc. btw: I have downgraded to kernel 5.8.14.
barnumbirr Posted October 22, 2020 Posted October 22, 2020 1 minute ago, ebin-dev said: You could try to boot from SD: although the boot order officially is SPI, EMMC, SD - Helios64 boots from an inserted SD card although there is a bootable system installed on emmc (at least mine). btw: I have downgraded to kernel 5.8.14. I've been booting from the SD ever since I setup my Helios64. From the logs, it seems I'm missing the .dtb file so I'll try and get my hand on it.
piter75 Posted October 22, 2020 Posted October 22, 2020 44 minutes ago, barnumbirr said: I've been booting from the SD ever since I setup my Helios64. From the logs, it seems I'm missing the .dtb file so I'll try and get my hand on it. If you can mount the SD with some ext4 capable system... Add the following stanza to boot/armbianEnv.txt in the mounted volume and you should be good to go. fdtfile=rockchip/rk3399-kobol-helios64.dtb There was a device tree file rename in the meantime and your u-boot does not know that
barnumbirr Posted October 22, 2020 Posted October 22, 2020 (edited) 1 hour ago, piter75 said: If you can mount the SD with some ext4 capable system... Add the following stanza to boot/armbianEnv.txt in the mounted volume and you should be good to go. fdtfile=rockchip/rk3399-kobol-helios64.dtb There was a device tree file rename in the meantime and your u-boot does not know that Managed to boot again. Adding fdtfile=rockchip/rk3399-kobol-helios64.dtb to boot/armbianEnv.txt wasn't enough as for some reason the dtb-5.8.16-rockchip64 directory in /boot had vanished from my system. I've still got some issues as fancontrol fails with the following error: Spoiler Oct 22 15:16:29 helios64 fancontrol[1184]: Error: file /dev/fan-p6/pwm1 doesn't exist Oct 22 15:16:29 helios64 fancontrol[1184]: Error: file /dev/fan-p7/pwm1 doesn't exist The system gets quite hot as a result, for now I've temporarely hard coded some values in /sys/class/hwmon/hwmon*/pwm1. I also get the following logs in the serial console: Spoiler Starting kernel ... [ 65.817320] Internal error: Oops: 86000004 [#1] PREEMPT SMP [ 65.817825] Modules linked in: snd_soc_hdmi_codec hantro_vpu(C) rockchip_vdec(C) rockchipdrm rockchip_rga snd_soc_rockchip_i2s v4l2_h264 videobuf2_dma_contig dw_mipi_dsi snd_soc_core dw_hdmi v4l2_mem2mem videobuf2_vmalloc videobuf2_dma_sg snd_pcm_dmaengine r8152 analogix_dp snd_pcm videobuf2_memops videobuf2_v4l2 drm_kms_helper panfrost snd_timer videobuf2_common cec videodev snd rc_core gpu_sched sg drm mc gpio_charger soundcore leds_pwm fusb30x(C) pwm_fan drm_panel_orientation_quirks gpio_beeper cpufreq_dt nfsd auth_rpcgss nfs_acl dm_mod lockd grace sunrpc lm75 ip_tables x_tables autofs4 raid10 raid1 raid0 multipath linear raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx realtek md_mod dwmac_rk stmmac_platform stmmac mdio_xpcs adc_keys [ 65.823764] CPU: 4 PID: 534 Comm: kworker/4:3 Tainted: G C 5.8.16-rockchip64 #20.08.14 [ 65.824577] Hardware name: Helios64 (DT) [ 65.824943] Workqueue: events dbs_work_handler [ 65.825343] pstate: 80000085 (Nzcv daIf -PAN -UAO BTYPE=--) [ 65.825839] pc : 0x800010010320 [ 65.826128] lr : return_address+0x58/0x90 [ 65.826485] sp : ffff800012663500 [ 65.826782] x29: ffff800012663530 x28: 00000000000000fa [ 65.827255] x27: ffff0000f5a66cf8 x26: ffff0000e4f85700 [ 65.827728] x25: ffff8000119cf3a8 x24: 0000000001000001 [ 65.828200] x23: ffff0000f5a66cf0 x22: ffff800012663728 [ 65.828672] x21: ffff0000f5a66cf8 x20: ffff800010dff2d0 [ 65.829144] x19: ffff0000e4f85700 x18: 0000000000000000 [ 65.829616] x17: 0000000000000000 x16: 0000000000000000 [ 65.830087] x15: 0000000000000000 x14: 0000000000000000 [ 65.830559] x13: 00000000000002c4 x12: 00000000000002cd [ 65.831030] x11: 0000000000000001 x10: 0000000000000a20 [ 65.831502] x9 : ffff800012663460 x8 : ffff0000e4f86180 [ 65.831974] x7 : 0000000000000001 x6 : 0000000000000001 [ 65.832446] x5 : 0000000000000002 x4 : fff6800010014610 [ 65.832917] x3 : ffff800012663520 x2 : fff68000100145d8 [ 65.833395] x1 : ffff800010dff640 x0 : ffff0000e4f85700 [ 65.833873] Call trace: [ 65.834097] 0x800010010320 [ 65.834350] return_address+0x0/0x90 [ 65.834676] Code: bad PC value [ 65.834953] ---[ end trace de998757a1cc1983 ]--- [ 65.835366] note: kworker/4:3[534] exited with preempt_count 1 [ 712.382286] Unable to handle kernel paging request at virtual address 0001000000000027 [ 712.383017] Mem abort info: [ 712.383278] ESR = 0x96000004 [ 712.383563] EC = 0x25: DABT (current EL), IL = 32 bits [ 712.384045] SET = 0, FnV = 0 [ 712.384328] EA = 0, S1PTW = 0 [ 712.384617] Data abort info: [ 712.384884] ISV = 0, ISS = 0x00000004 [ 712.385235] CM = 0, WnR = 0 [ 712.385511] [0001000000000027] address between user and kernel address ranges [ 712.386158] Internal error: Oops: 96000004 [#2] PREEMPT SMP [ 712.386664] Modules linked in: snd_soc_hdmi_codec hantro_vpu(C) rockchip_vdec(C) rockchipdrm rockchip_rga snd_soc_rockchip_i2s v4l2_h264 videobuf2_dma_contig dw_mipi_dsi snd_soc_core dw_hdmi v4l2_mem2mem videobuf2_vmalloc videobuf2_dma_sg snd_pcm_dmaengine r8152 analogix_dp snd_pcm videobuf2_memops videobuf2_v4l2 drm_kms_helper panfrost snd_timer videobuf2_common cec videodev snd rc_core gpu_sched sg drm mc gpio_charger soundcore leds_pwm fusb30x(C) pwm_fan drm_panel_orientation_quirks gpio_beeper cpufreq_dt nfsd auth_rpcgss nfs_acl dm_mod lockd grace sunrpc lm75 ip_tables x_tables autofs4 raid10 raid1 raid0 multipath linear raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx realtek md_mod dwmac_rk stmmac_platform stmmac mdio_xpcs adc_keys [ 712.392782] CPU: 1 PID: 1051 Comm: kworker/1:3 Tainted: G D C 5.8.16-rockchip64 #20.08.14 [ 712.393618] Hardware name: Helios64 (DT) [ 712.393998] Workqueue: events_power_efficient sync_hw_clock [ 712.394511] pstate: 20000085 (nzCv daIf -PAN -UAO BTYPE=--) [ 712.395023] pc : rt_mutex_enqueue_pi+0x58/0xc8 [ 712.395433] lr : task_blocks_on_rt_mutex+0x1d4/0x258 [ 712.395884] sp : ffff800012a238d0 [ 712.396190] x29: ffff800012a238d0 x28: 0000000000000001 [ 712.396677] x27: 0000000000000001 x26: ffff0000e4f85e34 [ 712.397163] x25: ffff0000e4f85700 x24: 0000000000000000 [ 712.397649] x23: ffff0000f6106580 x22: ffff800012a239a8 [ 712.398135] x21: ffff800012a239c0 x20: ffff800012a239a8 [ 712.398622] x19: ffff0000f5a668a8 x18: 0000000000000000 [ 712.399107] x17: 0000000000000000 x16: 0000000000000000 [ 712.399593] x15: 0000000000000000 x14: 0000000000000000 [ 712.400079] x13: 0000000000000078 x12: 0000000000000124 [ 712.400564] x11: 0000000000000064 x10: 0000000000000014 [ 712.401051] x9 : ffff800010f6cf48 x8 : ffff0000e4f85700 [ 712.401537] x7 : ffff800012a239a8 x6 : 0000000000000000 [ 712.402022] x5 : 0000000000000078 x4 : 0001000000000007 [ 712.402508] x3 : 0000ffffffffffff x2 : 0000ffffffffffff [ 712.402994] x1 : ffff0000e4f85e40 x0 : ffff800012a239c0 [ 712.403480] Call trace: [ 712.403713] rt_mutex_enqueue_pi+0x58/0xc8 [ 712.404092] task_blocks_on_rt_mutex+0x1d4/0x258 [ 712.404521] rt_mutex_slowlock+0x94/0x198 [ 712.404893] rt_mutex_lock+0x5c/0x68 [ 712.405231] i2c_adapter_lock_bus+0x14/0x20 [ 712.405616] i2c_transfer+0x50/0x128 [ 712.405948] i2c_transfer_buffer_flags+0x5c/0x88 [ 712.406374] regmap_i2c_write+0x20/0x58 [ 712.406731] _regmap_raw_write_impl+0x6f8/0x8c0 [ 712.407149] _regmap_bus_raw_write+0x68/0x88 [ 712.407543] _regmap_write+0x6c/0x160 [ 712.407884] _regmap_update_bits+0xf8/0x110 [ 712.408271] regmap_update_bits_base+0x64/0x98 [ 712.408682] rk808_rtc_set_time+0xc4/0x1a8 [ 712.409061] rtc_set_time+0x88/0x200 [ 712.409396] rtc_set_ntp_time+0x110/0x130 [ 712.409769] sync_hw_clock+0x90/0x150 [ 712.410112] process_one_work+0x1c4/0x470 [ 712.410485] worker_thread+0x4c/0x420 [ 712.410826] kthread+0x118/0x150 [ 712.411130] ret_from_fork+0x10/0x34 [ 712.411468] Code: 52800006 b4000123 aa0303e2 91002044 (b9402843) [ 712.412023] ---[ end trace de998757a1cc1984 ]--- [ 712.412448] note: kworker/1:3[1051] exited with preempt_count 2 Does anyone have an idea how to fix this? EDIT: I've also attached the console log files from boot: Spoiler DDR Version 1.24 20191016 In channel 0 CS = 0 MR0=0x18 MR4=0x1 MR5=0x1 MR8=0x10 MR12=0x72 MR14=0x72 MR18=0x0 MR19=0x0 MR24=0x8 MR25=0x0 channel 1 CS = 0 MR0=0x18 MR4=0x1 MR5=0x1 MR8=0x10 MR12=0x72 MR14=0x72 MR18=0x0 MR19=0x0 MR24=0x8 MR25=0x0 channel 0 training pass! channel 1 training pass! change freq to 416MHz 0,1 Channel 0: LPDDR4,416MHz Bus Width=32 Col=10 Bank=8 Row=16 CS=1 Die Bus-Width=16 Size=2048MB Channel 1: LPDDR4,416MHz Bus Width=32 Col=10 Bank=8 Row=16 CS=1 Die Bus-Width=16 Size=2048MB 256B stride channel 0 CS = 0 MR0=0x18 MR4=0x1 MR5=0x1 MR8=0x10 MR12=0x72 MR14=0x72 MR18=0x0 MR19=0x0 MR24=0x8 MR25=0x0 channel 1 CS = 0 MR0=0x18 MR4=0x1 MR5=0x1 MR8=0x10 MR12=0x72 MR14=0x72 MR18=0x0 MR19=0x0 MR24=0x8 MR25=0x0 channel 0 training pass! channel 1 training pass! channel 0, cs 0, advanced training done channel 1, cs 0, advanced training done change freq to 856MHz 1,0 ch 0 ddrconfig = 0x101, ddrsize = 0x40 ch 1 ddrconfig = 0x101, ddrsize = 0x40 pmugrf_os_reg[2] = 0x32C1F2C1, stride = 0xD ddr_set_rate to 328MHZ ddr_set_rate to 666MHZ ddr_set_rate to 928MHZ channel 0, cs 0, advanced training done channel 1, cs 0, advanced training done ddr_set_rate to 416MHZ, ctl_index 0 ddr_set_rate to 856MHZ, ctl_index 1 support 416 856 328 666 928 MHz, current 856MHz OUT Boot1: 2019-03-14, version: 1.19 CPUId = 0x0 ChipType = 0x10, 254 SdmmcInit=2 0 BootCapSize=100000 UserCapSize=14910MB FwPartOffset=2000 , 100000 mmc0:cmd5,20 SdmmcInit=0 0 BootCapSize=0 UserCapSize=30436MB FwPartOffset=2000 , 0 StorageInit ok = 79625 SecureMode = 0 SecureInit read PBA: 0x4 SecureInit read PBA: 0x404 SecureInit read PBA: 0x804 SecureInit read PBA: 0xc04 SecureInit read PBA: 0x1004 SecureInit read PBA: 0x1404 SecureInit read PBA: 0x1804 SecureInit read PBA: 0x1c04 SecureInit ret = 0, SecureMode = 0 atags_set_bootdev: ret:(0) GPT 0x3380ec0 signature is wrong recovery gpt... GPT 0x3380ec0 signature is wrong recovery gpt fail! LoadTrust Addr:0x4000 No find bl30.bin No find bl32.bin Load uboot, ReadLba = 2000 Load OK, addr=0x200000, size=0xded90 RunBL31 0x40000 NOTICE: BL31: v1.3(debug):42583b6 NOTICE: BL31: Built : 07:55:13, Oct 15 2019 NOTICE: BL31: Rockchip release version: v1.1 INFO: GICv3 with legacy support detected. ARM GICV3 driver initialized in EL3 INFO: Using opteed sec cpu_context! INFO: boot cpu mask: 0 INFO: plat_rockchip_pmu_init(1190): pd status 3e INFO: BL31: Initializing runtime services WARNING: No OPTEE provided by BL2 boot loader, Booting device without OPTEE initialization. SMC`s destined for OPTEE will return SMC_UNK ERROR: Error initializing runtime service opteed_fast INFO: BL31: Preparing for EL3 exit to normal world INFO: Entry point address = 0x200000 INFO: SPSR = 0x3c9 U-Boot 2020.07-armbian (Oct 05 2020 - 15:57:22 +0200) SoC: Rockchip rk3399 Reset cause: POR DRAM: 3.9 GiB PMIC: RK808 SF: Detected w25q128 with page size 256 Bytes, erase size 4 KiB, total 16 MiB MMC: mmc@fe320000: 1, sdhci@fe330000: 0 Loading Environment from MMC... mmc fail to send stop cmd *** Warning - !read failed, using default environment In: serial Out: serial Err: serial Model: Helios64 Revision: 1.2 - 4GB non ECC Net: eth0: ethernet@fe300000 scanning bus for devices... Hit any key to stop autoboot: 0 switch to partitions #0, OK mmc1 is current device Scanning mmc 1:1... Found U-Boot script /boot/boot.scr 3185 bytes read in 6 ms (517.6 KiB/s) ## Executing script at 00500000 Boot script loaded from mmc 1 208 bytes read in 5 ms (40 KiB/s) 16005149 bytes read in 682 ms (22.4 MiB/s) 27236864 bytes read in 1155 ms (22.5 MiB/s) 79946 bytes read in 13 ms (5.9 MiB/s) 2698 bytes read in 12 ms (218.8 KiB/s) Applying kernel provided DT fixup script (rockchip-fixup.scr) ## Executing script at 09000000 ## Loading init Ramdisk from Legacy Image at 06000000 ... Image Name: uInitrd Image Type: AArch64 Linux RAMDisk Image (gzip compressed) Data Size: 16005085 Bytes = 15.3 MiB Load Address: 00000000 Entry Point: 00000000 Verifying Checksum ... OK ## Flattened Device Tree blob at 01f00000 Booting using the fdt blob at 0x1f00000 Loading Ramdisk to f4fa1000, end f5ee47dd ... OK Loading Device Tree to 00000000f4f25000, end 00000000f4fa0fff ... OK Starting kernel ... Edited October 22, 2020 by barnumbirr Added more info for context
tekrantz Posted October 22, 2020 Posted October 22, 2020 17 minutes ago, barnumbirr said: fdtfile=rockchip/rk3399-kobol-helios64.dtb Managed to boot again. Adding to boot/armbianEnv.txt wasn't enough as for some reason the dtb-5.8.16-rockchip64 directory in /boot had vanished from my system. I've still got some issues as fancontrol fails with the following error: Hide contents Oct 22 15:16:29 helios64 fancontrol[1184]: Error: file /dev/fan-p6/pwm1 doesn't exist Oct 22 15:16:29 helios64 fancontrol[1184]: Error: file /dev/fan-p7/pwm1 doesn't exist The system gets quite hot as a result, for now I've temporarely hard coded some values in /sys/class/hwmon/hwmon*/pwm1. I also get the following logs in the serial console: Reveal hidden contents Starting kernel ... [ 65.817320] Internal error: Oops: 86000004 [#1] PREEMPT SMP [ 65.817825] Modules linked in: snd_soc_hdmi_codec hantro_vpu(C) rockchip_vdec(C) rockchipdrm rockchip_rga snd_soc_rockchip_i2s v4l2_h264 videobuf2_dma_contig dw_mipi_dsi snd_soc_core dw_hdmi v4l2_mem2mem videobuf2_vmalloc videobuf2_dma_sg snd_pcm_dmaengine r8152 analogix_dp snd_pcm videobuf2_memops videobuf2_v4l2 drm_kms_helper panfrost snd_timer videobuf2_common cec videodev snd rc_core gpu_sched sg drm mc gpio_charger soundcore leds_pwm fusb30x(C) pwm_fan drm_panel_orientation_quirks gpio_beeper cpufreq_dt nfsd auth_rpcgss nfs_acl dm_mod lockd grace sunrpc lm75 ip_tables x_tables autofs4 raid10 raid1 raid0 multipath linear raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx realtek md_mod dwmac_rk stmmac_platform stmmac mdio_xpcs adc_keys [ 65.823764] CPU: 4 PID: 534 Comm: kworker/4:3 Tainted: G C 5.8.16-rockchip64 #20.08.14 [ 65.824577] Hardware name: Helios64 (DT) [ 65.824943] Workqueue: events dbs_work_handler [ 65.825343] pstate: 80000085 (Nzcv daIf -PAN -UAO BTYPE=--) [ 65.825839] pc : 0x800010010320 [ 65.826128] lr : return_address+0x58/0x90 [ 65.826485] sp : ffff800012663500 [ 65.826782] x29: ffff800012663530 x28: 00000000000000fa [ 65.827255] x27: ffff0000f5a66cf8 x26: ffff0000e4f85700 [ 65.827728] x25: ffff8000119cf3a8 x24: 0000000001000001 [ 65.828200] x23: ffff0000f5a66cf0 x22: ffff800012663728 [ 65.828672] x21: ffff0000f5a66cf8 x20: ffff800010dff2d0 [ 65.829144] x19: ffff0000e4f85700 x18: 0000000000000000 [ 65.829616] x17: 0000000000000000 x16: 0000000000000000 [ 65.830087] x15: 0000000000000000 x14: 0000000000000000 [ 65.830559] x13: 00000000000002c4 x12: 00000000000002cd [ 65.831030] x11: 0000000000000001 x10: 0000000000000a20 [ 65.831502] x9 : ffff800012663460 x8 : ffff0000e4f86180 [ 65.831974] x7 : 0000000000000001 x6 : 0000000000000001 [ 65.832446] x5 : 0000000000000002 x4 : fff6800010014610 [ 65.832917] x3 : ffff800012663520 x2 : fff68000100145d8 [ 65.833395] x1 : ffff800010dff640 x0 : ffff0000e4f85700 [ 65.833873] Call trace: [ 65.834097] 0x800010010320 [ 65.834350] return_address+0x0/0x90 [ 65.834676] Code: bad PC value [ 65.834953] ---[ end trace de998757a1cc1983 ]--- [ 65.835366] note: kworker/4:3[534] exited with preempt_count 1 [ 712.382286] Unable to handle kernel paging request at virtual address 0001000000000027 [ 712.383017] Mem abort info: [ 712.383278] ESR = 0x96000004 [ 712.383563] EC = 0x25: DABT (current EL), IL = 32 bits [ 712.384045] SET = 0, FnV = 0 [ 712.384328] EA = 0, S1PTW = 0 [ 712.384617] Data abort info: [ 712.384884] ISV = 0, ISS = 0x00000004 [ 712.385235] CM = 0, WnR = 0 [ 712.385511] [0001000000000027] address between user and kernel address ranges [ 712.386158] Internal error: Oops: 96000004 [#2] PREEMPT SMP [ 712.386664] Modules linked in: snd_soc_hdmi_codec hantro_vpu(C) rockchip_vdec(C) rockchipdrm rockchip_rga snd_soc_rockchip_i2s v4l2_h264 videobuf2_dma_contig dw_mipi_dsi snd_soc_core dw_hdmi v4l2_mem2mem videobuf2_vmalloc videobuf2_dma_sg snd_pcm_dmaengine r8152 analogix_dp snd_pcm videobuf2_memops videobuf2_v4l2 drm_kms_helper panfrost snd_timer videobuf2_common cec videodev snd rc_core gpu_sched sg drm mc gpio_charger soundcore leds_pwm fusb30x(C) pwm_fan drm_panel_orientation_quirks gpio_beeper cpufreq_dt nfsd auth_rpcgss nfs_acl dm_mod lockd grace sunrpc lm75 ip_tables x_tables autofs4 raid10 raid1 raid0 multipath linear raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx realtek md_mod dwmac_rk stmmac_platform stmmac mdio_xpcs adc_keys [ 712.392782] CPU: 1 PID: 1051 Comm: kworker/1:3 Tainted: G D C 5.8.16-rockchip64 #20.08.14 [ 712.393618] Hardware name: Helios64 (DT) [ 712.393998] Workqueue: events_power_efficient sync_hw_clock [ 712.394511] pstate: 20000085 (nzCv daIf -PAN -UAO BTYPE=--) [ 712.395023] pc : rt_mutex_enqueue_pi+0x58/0xc8 [ 712.395433] lr : task_blocks_on_rt_mutex+0x1d4/0x258 [ 712.395884] sp : ffff800012a238d0 [ 712.396190] x29: ffff800012a238d0 x28: 0000000000000001 [ 712.396677] x27: 0000000000000001 x26: ffff0000e4f85e34 [ 712.397163] x25: ffff0000e4f85700 x24: 0000000000000000 [ 712.397649] x23: ffff0000f6106580 x22: ffff800012a239a8 [ 712.398135] x21: ffff800012a239c0 x20: ffff800012a239a8 [ 712.398622] x19: ffff0000f5a668a8 x18: 0000000000000000 [ 712.399107] x17: 0000000000000000 x16: 0000000000000000 [ 712.399593] x15: 0000000000000000 x14: 0000000000000000 [ 712.400079] x13: 0000000000000078 x12: 0000000000000124 [ 712.400564] x11: 0000000000000064 x10: 0000000000000014 [ 712.401051] x9 : ffff800010f6cf48 x8 : ffff0000e4f85700 [ 712.401537] x7 : ffff800012a239a8 x6 : 0000000000000000 [ 712.402022] x5 : 0000000000000078 x4 : 0001000000000007 [ 712.402508] x3 : 0000ffffffffffff x2 : 0000ffffffffffff [ 712.402994] x1 : ffff0000e4f85e40 x0 : ffff800012a239c0 [ 712.403480] Call trace: [ 712.403713] rt_mutex_enqueue_pi+0x58/0xc8 [ 712.404092] task_blocks_on_rt_mutex+0x1d4/0x258 [ 712.404521] rt_mutex_slowlock+0x94/0x198 [ 712.404893] rt_mutex_lock+0x5c/0x68 [ 712.405231] i2c_adapter_lock_bus+0x14/0x20 [ 712.405616] i2c_transfer+0x50/0x128 [ 712.405948] i2c_transfer_buffer_flags+0x5c/0x88 [ 712.406374] regmap_i2c_write+0x20/0x58 [ 712.406731] _regmap_raw_write_impl+0x6f8/0x8c0 [ 712.407149] _regmap_bus_raw_write+0x68/0x88 [ 712.407543] _regmap_write+0x6c/0x160 [ 712.407884] _regmap_update_bits+0xf8/0x110 [ 712.408271] regmap_update_bits_base+0x64/0x98 [ 712.408682] rk808_rtc_set_time+0xc4/0x1a8 [ 712.409061] rtc_set_time+0x88/0x200 [ 712.409396] rtc_set_ntp_time+0x110/0x130 [ 712.409769] sync_hw_clock+0x90/0x150 [ 712.410112] process_one_work+0x1c4/0x470 [ 712.410485] worker_thread+0x4c/0x420 [ 712.410826] kthread+0x118/0x150 [ 712.411130] ret_from_fork+0x10/0x34 [ 712.411468] Code: 52800006 b4000123 aa0303e2 91002044 (b9402843) [ 712.412023] ---[ end trace de998757a1cc1984 ]--- [ 712.412448] note: kworker/1:3[1051] exited with preempt_count 2 Does anyone have an idea how to fix this? The /dev/fan* entries should be created by the included udev rules. In particular: cat 90-helios64-hwmon.rules # Helios64 persistent hwmon ACTION=="remove", GOTO="helios64_hwmon_end" # KERNELS=="p6-fan", SUBSYSTEMS=="platform", ENV{_HELIOS64_FAN_}="p6", ENV{_IS_HELIOS64_FAN_}="1", ENV{IS_HELIOS64_HWMON}="1" KERNELS=="p7-fan", SUBSYSTEMS=="platform", ENV{_HELIOS64_FAN_}="p7", ENV{_IS_HELIOS64_FAN_}="1", ENV{IS_HELIOS64_HWMON}="1" KERNELS=="2-004c", SUBSYSTEMS=="i2c", DRIVERS=="lm75", ENV{IS_HELIOS64_HWMON}="1" SUBSYSTEM!="hwmon", GOTO="helios64_hwmon_end" ENV{HWMON_PATH}="/sys%p" # ATTR{name}=="cpu", ENV{IS_HELIOS64_HWMON}="1", ENV{HELIOS64_SYMLINK}="/dev/thermal-cpu" # ENV{IS_HELIOS64_HWMON}=="1", ATTR{name}=="lm75", ENV{HELIOS64_SYMLINK}="/dev/thermal-board" ENV{_IS_HELIOS64_FAN_}=="1", ENV{HELIOS64_SYMLINK}="/dev/fan-$env{_HELIOS64_FAN_}" # ENV{IS_HELIOS64_HWMON}=="1", RUN+="/bin/ln -sf $env{HWMON_PATH} $env{HELIOS64_SYMLINK}" LABEL="helios64_hwmon_end"
barnumbirr Posted October 22, 2020 Posted October 22, 2020 22 minutes ago, tekrantz said: The /dev/fan* entries should be created by the included udev rules. In particular: cat 90-helios64-hwmon.rules # Helios64 persistent hwmon ACTION=="remove", GOTO="helios64_hwmon_end" # KERNELS=="p6-fan", SUBSYSTEMS=="platform", ENV{_HELIOS64_FAN_}="p6", ENV{_IS_HELIOS64_FAN_}="1", ENV{IS_HELIOS64_HWMON}="1" KERNELS=="p7-fan", SUBSYSTEMS=="platform", ENV{_HELIOS64_FAN_}="p7", ENV{_IS_HELIOS64_FAN_}="1", ENV{IS_HELIOS64_HWMON}="1" KERNELS=="2-004c", SUBSYSTEMS=="i2c", DRIVERS=="lm75", ENV{IS_HELIOS64_HWMON}="1" SUBSYSTEM!="hwmon", GOTO="helios64_hwmon_end" ENV{HWMON_PATH}="/sys%p" # ATTR{name}=="cpu", ENV{IS_HELIOS64_HWMON}="1", ENV{HELIOS64_SYMLINK}="/dev/thermal-cpu" # ENV{IS_HELIOS64_HWMON}=="1", ATTR{name}=="lm75", ENV{HELIOS64_SYMLINK}="/dev/thermal-board" ENV{_IS_HELIOS64_FAN_}=="1", ENV{HELIOS64_SYMLINK}="/dev/fan-$env{_HELIOS64_FAN_}" # ENV{IS_HELIOS64_HWMON}=="1", RUN+="/bin/ln -sf $env{HWMON_PATH} $env{HELIOS64_SYMLINK}" LABEL="helios64_hwmon_end" Ouch, this is what I currently have: root@helios64:~# ls -la /etc/udev/rules.d/ total 32 drwxr-xr-x 2 root root 4096 Oct 21 15:43 . drwxr-xr-x 4 root root 4096 Aug 30 20:58 .. -rw-r--r-- 1 root root 8821 Sep 14 18:07 99-openmediavault-dev-disk-by-id.rules -rw-r--r-- 1 root root 1348 Jan 7 2020 99-openmediavault-md-raid.rules -rw-r--r-- 1 root root 1647 Jan 7 2020 99-openmediavault-nonrot.rules -rw-r--r-- 1 root root 1378 Jan 7 2020 99-openmediavault-scheduler.rules I can't downgrade using armbian-config because it exists as soon as I confirm I'm okay with the "risk". Is there a way for me to downgrade using apt? ( don't know the list op packages that need to be specified). If a downgrade isn't possible, can I reconfigure the ones currently installed? (again, don't know the package list) EDIT: after about 30 minutes, my Helios now reboots on it's own because of Kernel panic with the following logs: Spoiler 1849.330507] Unable to handle kernel paging request at virtual address ffff8000141a4100 [ 1849.331220] Mem abort info: [ 1849.331473] ESR = 0x96000007 [ 1849.331803] EC = 0x25: DABT (current EL), IL = 32 bits [ 1849.332276] SET = 0, FnV = 0 [ 1849.332551] EA = 0, S1PTW = 0 [ 1849.332834] Data abort info: [ 1849.333094] ISV = 0, ISS = 0x00000007 [ 1849.333436] CM = 0, WnR = 0 [ 1849.333705] swapper pgtable: 4k pages, 48-bit VAs, pgdp=00000000035dc000 [ 1849.334298] [ffff8000141a4100] pgd=00000000f7fff003, p4d=00000000f7fff003, pud=00000000f7ffe003, pmd=000000003ba6c003, pte=0000000000000000 [ 1849.335409] Internal error: Oops: 96000007 [#4] PREEMPT SMP [ 1849.335904] Modules linked in: snd_soc_hdmi_codec hantro_vpu(C) rockchip_vdec(C) snd_soc_rockchip_i2s rockchip_rga rockchipdrm v4l2_h264 snd_soc_core videobuf2_dma_contig dw_mipi_dsi videobuf2_vmalloc videobuf2_dma_sg v4l2_mem2mem snd_pcm_dmaengine dw_hdmi snd_pcm videobuf2_memops videobuf2_v4l2 r8152 analogix_dp snd_timer videobuf2_common drm_kms_helper snd videodev cec panfrost soundcore rc_core mc gpu_sched gpio_charger sg drm pwm_fan leds_pwm fusb30x(C) drm_panel_orientation_quirks gpio_beeper cpufreq_dt nfsd auth_rpcgss dm_mod nfs_acl lockd grace lm75 sunrpc ip_tables x_tables autofs4 raid10 raid1 raid0 multipath linear raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx realtek md_mod dwmac_rk stmmac_platform stmmac mdio_xpcs adc_keys [ 1849.341840] CPU: 4 PID: 1 Comm: systemd Tainted: G D WC 5.8.16-rockchip64 #20.08.14 [ 1849.342606] Hardware name: Helios64 (DT) [ 1849.342959] pstate: 60000005 (nZCv daif -PAN -UAO BTYPE=--) [ 1849.343464] pc : kmem_cache_alloc_node+0x190/0x3a0 [ 1849.343893] lr : kmem_cache_alloc_node+0x17c/0x3a0 [ 1849.344318] sp : ffff800011acbbd0 [ 1849.344615] x29: ffff800011acbbd0 x28: ffff0000f6e48400 [ 1849.345088] x27: 0000000000008204 x26: ffff80001007c5f0 [ 1849.345560] x25: ffff0000f6e88000 x24: ffff8000119f9000 [ 1849.346032] x23: 00000000ffffffff x22: 0000000000000cc0 [ 1849.346504] x21: ffff80001007c5f0 x20: ffff0000f6e48400 [ 1849.346976] x19: ffff8000141a39c0 x18: 0000000000000000 [ 1849.347448] x17: 0000000000000000 x16: 0000000000000000 [ 1849.347919] x15: 0000000000000000 x14: 0000000000000000 [ 1849.348391] x13: 0000000000000000 x12: 0000000000000000 [ 1849.348863] x11: 0000000000000000 x10: 0000000000000000 [ 1849.349334] x9 : 0000000000000000 x8 : 0000000000000000 [ 1849.349806] x7 : ffff0000f6e88000 x6 : ffff0000f6e88000 [ 1849.350278] x5 : 0000000000000001 x4 : ffff0000f6e88000 [ 1849.350749] x3 : 0000000000008204 x2 : ffff8000114f9650 [ 1849.351221] x1 : 0000000000000740 x0 : 0000000000000001 [ 1849.351694] Call trace: [ 1849.351918] kmem_cache_alloc_node+0x190/0x3a0 [ 1849.352318] copy_process+0x1d0/0x1540 [ 1849.352654] _do_fork+0x80/0x448 [ 1849.352945] __do_sys_clone+0x6c/0x98 [ 1849.353274] __arm64_sys_clone+0x20/0x30 [ 1849.353626] el0_svc_common.constprop.0+0x70/0x188 [ 1849.354052] do_el0_svc+0x24/0x90 [ 1849.354353] el0_sync_handler+0x90/0x198 [ 1849.354705] el0_sync+0x158/0x180 [ 1849.355009] Code: 17ffffbd b9402281 b94010e0 11000400 (f8616a62) [ 1849.355557] ---[ end trace d25e6a4621bdedc7 ]--- [ 1849.356003] Kernel panic - not syncing: Attempted to kill init! exitcode=0x0000000b [ 1849.356695] SMP: stopping secondary CPUs [ 1849.357051] Kernel Offset: disabled [ 1849.357366] CPU features: 0x240022,2000600c [ 1849.357738] Memory Limit: none [ 1849.358026] Rebooting in 90 seconds.. Even though subsequent serial console logs say that the kernel is starting, I can't access the device over SSH or serial anymore and have hard reset the device using the physical button.
tekrantz Posted October 22, 2020 Posted October 22, 2020 4 hours ago, barnumbirr said: Ouch, this is what I currently have: root@helios64:~# ls -la /etc/udev/rules.d/ total 32 drwxr-xr-x 2 root root 4096 Oct 21 15:43 . drwxr-xr-x 4 root root 4096 Aug 30 20:58 .. -rw-r--r-- 1 root root 8821 Sep 14 18:07 99-openmediavault-dev-disk-by-id.rules -rw-r--r-- 1 root root 1348 Jan 7 2020 99-openmediavault-md-raid.rules -rw-r--r-- 1 root root 1647 Jan 7 2020 99-openmediavault-nonrot.rules -rw-r--r-- 1 root root 1378 Jan 7 2020 99-openmediavault-scheduler.rules I can't downgrade using armbian-config because it exists as soon as I confirm I'm okay with the "risk". Is there a way for me to downgrade using apt? ( don't know the list op packages that need to be specified). If a downgrade isn't possible, can I reconfigure the ones currently installed? (again, don't know the package list) EDIT: after about 30 minutes, my Helios now reboots on it's own because of Kernel panic with the following logs: Reveal hidden contents 1849.330507] Unable to handle kernel paging request at virtual address ffff8000141a4100 [ 1849.331220] Mem abort info: [ 1849.331473] ESR = 0x96000007 [ 1849.331803] EC = 0x25: DABT (current EL), IL = 32 bits [ 1849.332276] SET = 0, FnV = 0 [ 1849.332551] EA = 0, S1PTW = 0 [ 1849.332834] Data abort info: [ 1849.333094] ISV = 0, ISS = 0x00000007 [ 1849.333436] CM = 0, WnR = 0 [ 1849.333705] swapper pgtable: 4k pages, 48-bit VAs, pgdp=00000000035dc000 [ 1849.334298] [ffff8000141a4100] pgd=00000000f7fff003, p4d=00000000f7fff003, pud=00000000f7ffe003, pmd=000000003ba6c003, pte=0000000000000000 [ 1849.335409] Internal error: Oops: 96000007 [#4] PREEMPT SMP [ 1849.335904] Modules linked in: snd_soc_hdmi_codec hantro_vpu(C) rockchip_vdec(C) snd_soc_rockchip_i2s rockchip_rga rockchipdrm v4l2_h264 snd_soc_core videobuf2_dma_contig dw_mipi_dsi videobuf2_vmalloc videobuf2_dma_sg v4l2_mem2mem snd_pcm_dmaengine dw_hdmi snd_pcm videobuf2_memops videobuf2_v4l2 r8152 analogix_dp snd_timer videobuf2_common drm_kms_helper snd videodev cec panfrost soundcore rc_core mc gpu_sched gpio_charger sg drm pwm_fan leds_pwm fusb30x(C) drm_panel_orientation_quirks gpio_beeper cpufreq_dt nfsd auth_rpcgss dm_mod nfs_acl lockd grace lm75 sunrpc ip_tables x_tables autofs4 raid10 raid1 raid0 multipath linear raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx realtek md_mod dwmac_rk stmmac_platform stmmac mdio_xpcs adc_keys [ 1849.341840] CPU: 4 PID: 1 Comm: systemd Tainted: G D WC 5.8.16-rockchip64 #20.08.14 [ 1849.342606] Hardware name: Helios64 (DT) [ 1849.342959] pstate: 60000005 (nZCv daif -PAN -UAO BTYPE=--) [ 1849.343464] pc : kmem_cache_alloc_node+0x190/0x3a0 [ 1849.343893] lr : kmem_cache_alloc_node+0x17c/0x3a0 [ 1849.344318] sp : ffff800011acbbd0 [ 1849.344615] x29: ffff800011acbbd0 x28: ffff0000f6e48400 [ 1849.345088] x27: 0000000000008204 x26: ffff80001007c5f0 [ 1849.345560] x25: ffff0000f6e88000 x24: ffff8000119f9000 [ 1849.346032] x23: 00000000ffffffff x22: 0000000000000cc0 [ 1849.346504] x21: ffff80001007c5f0 x20: ffff0000f6e48400 [ 1849.346976] x19: ffff8000141a39c0 x18: 0000000000000000 [ 1849.347448] x17: 0000000000000000 x16: 0000000000000000 [ 1849.347919] x15: 0000000000000000 x14: 0000000000000000 [ 1849.348391] x13: 0000000000000000 x12: 0000000000000000 [ 1849.348863] x11: 0000000000000000 x10: 0000000000000000 [ 1849.349334] x9 : 0000000000000000 x8 : 0000000000000000 [ 1849.349806] x7 : ffff0000f6e88000 x6 : ffff0000f6e88000 [ 1849.350278] x5 : 0000000000000001 x4 : ffff0000f6e88000 [ 1849.350749] x3 : 0000000000008204 x2 : ffff8000114f9650 [ 1849.351221] x1 : 0000000000000740 x0 : 0000000000000001 [ 1849.351694] Call trace: [ 1849.351918] kmem_cache_alloc_node+0x190/0x3a0 [ 1849.352318] copy_process+0x1d0/0x1540 [ 1849.352654] _do_fork+0x80/0x448 [ 1849.352945] __do_sys_clone+0x6c/0x98 [ 1849.353274] __arm64_sys_clone+0x20/0x30 [ 1849.353626] el0_svc_common.constprop.0+0x70/0x188 [ 1849.354052] do_el0_svc+0x24/0x90 [ 1849.354353] el0_sync_handler+0x90/0x198 [ 1849.354705] el0_sync+0x158/0x180 [ 1849.355009] Code: 17ffffbd b9402281 b94010e0 11000400 (f8616a62) [ 1849.355557] ---[ end trace d25e6a4621bdedc7 ]--- [ 1849.356003] Kernel panic - not syncing: Attempted to kill init! exitcode=0x0000000b [ 1849.356695] SMP: stopping secondary CPUs [ 1849.357051] Kernel Offset: disabled [ 1849.357366] CPU features: 0x240022,2000600c [ 1849.357738] Memory Limit: none [ 1849.358026] Rebooting in 90 seconds.. Even though subsequent serial console logs say that the kernel is starting, I can't access the device over SSH or serial anymore and have hard reset the device using the physical button. You should be able to just create a file 90-helios64-hwmon.rules in the /etc/udev/rules.d/ directory with the above text and reboot and it will create the /dev/fan* entries for you. That worked for me.
barnumbirr Posted October 22, 2020 Posted October 22, 2020 2 hours ago, tekrantz said: You should be able to just create a file 90-helios64-hwmon.rules in the /etc/udev/rules.d/ directory with the above text and reboot and it will create the /dev/fan* entries for you. That worked for me. Managed to run armbian-config to downgrade to kernel version 5.8.14. Thought that would be good enough to fix it but now the device just hangs at "Starting kernel ..." Can't access it via SSH either. Here's the output of the serial console: Spoiler DDR Version 1.24 20191016 In channel 0 CS = 0 MR0=0x18 MR4=0x1 MR5=0x1 MR8=0x10 MR12=0x72 MR14=0x72 MR18=0x0 MR19=0x0 MR24=0x8 MR25=0x0 channel 1 CS = 0 MR0=0x18 MR4=0x1 MR5=0x1 MR8=0x10 MR12=0x72 MR14=0x72 MR18=0x0 MR19=0x0 MR24=0x8 MR25=0x0 channel 0 training pass! channel 1 training pass! change freq to 416MHz 0,1 Channel 0: LPDDR4,416MHz Bus Width=32 Col=10 Bank=8 Row=16 CS=1 Die Bus-Width=16 Size=2048MB Channel 1: LPDDR4,416MHz Bus Width=32 Col=10 Bank=8 Row=16 CS=1 Die Bus-Width=16 Size=2048MB 256B stride channel 0 CS = 0 MR0=0x18 MR4=0x1 MR5=0x1 MR8=0x10 MR12=0x72 MR14=0x72 MR18=0x0 MR19=0x0 MR24=0x8 MR25=0x0 channel 1 CS = 0 MR0=0x18 MR4=0x1 MR5=0x1 MR8=0x10 MR12=0x72 MR14=0x72 MR18=0x0 MR19=0x0 MR24=0x8 MR25=0x0 channel 0 training pass! channel 1 training pass! channel 0, cs 0, advanced training done channel 1, cs 0, advanced training done change freq to 856MHz 1,0 ch 0 ddrconfig = 0x101, ddrsize = 0x40 ch 1 ddrconfig = 0x101, ddrsize = 0x40 pmugrf_os_reg[2] = 0x32C1F2C1, stride = 0xD ddr_set_rate to 328MHZ ddr_set_rate to 666MHZ ddr_set_rate to 928MHZ channel 0, cs 0, advanced training done channel 1, cs 0, advanced training done ddr_set_rate to 416MHZ, ctl_index 0 ddr_set_rate to 856MHZ, ctl_index 1 support 416 856 328 666 928 MHz, current 856MHz OUT Boot1: 2019-03-14, version: 1.19 CPUId = 0x0 ChipType = 0x10, 256 SdmmcInit=2 0 BootCapSize=100000 UserCapSize=14910MB FwPartOffset=2000 , 100000 mmc0:cmd5,20 SdmmcInit=0 0 BootCapSize=0 UserCapSize=30436MB FwPartOffset=2000 , 0 StorageInit ok = 79326 SecureMode = 0 SecureInit read PBA: 0x4 SecureInit read PBA: 0x404 SecureInit read PBA: 0x804 SecureInit read PBA: 0xc04 SecureInit read PBA: 0x1004 SecureInit read PBA: 0x1404 SecureInit read PBA: 0x1804 SecureInit read PBA: 0x1c04 SecureInit ret = 0, SecureMode = 0 atags_set_bootdev: ret:(0) GPT 0x3380ec0 signature is wrong recovery gpt... GPT 0x3380ec0 signature is wrong recovery gpt fail! LoadTrust Addr:0x4000 No find bl30.bin No find bl32.bin Load uboot, ReadLba = 2000 Load OK, addr=0x200000, size=0xdd6b0 RunBL31 0x40000 NOTICE: BL31: v1.3(debug):42583b6 NOTICE: BL31: Built : 07:55:13, Oct 15 2019 NOTICE: BL31: Rockchip release version: v1.1 INFO: GICv3 with legacy support detected. ARM GICV3 driver initialized in EL3 INFO: Using opteed sec cpu_context! INFO: boot cpu mask: 0 INFO: plat_rockchip_pmu_init(1190): pd status 3e INFO: BL31: Initializing runtime services WARNING: No OPTEE provided by BL2 boot loader, Booting device without OPTEE initialization. SMC`s destined for OPTEE will return SMC_UNK ERROR: Error initializing runtime service opteed_fast INFO: BL31: Preparing for EL3 exit to normal world INFO: Entry point address = 0x200000 INFO: SPSR = 0x3c9 U-Boot 2020.07-armbian (Oct 19 2020 - 08:25:23 +0200) SoC: Rockchip rk3399 Reset cause: POR DRAM: 3.9 GiB PMIC: RK808 SF: Detected w25q128 with page size 256 Bytes, erase size 4 KiB, total 16 MiB MMC: mmc@fe320000: 1, sdhci@fe330000: 0 Loading Environment from MMC... *** Warning - bad CRC, using default environment In: serial Out: serial Err: serial Model: Helios64 Revision: 1.2 - 4GB non ECC Net: eth0: ethernet@fe300000 scanning bus for devices... Hit any key to stop autoboot: 0 switch to partitions #0, OK mmc1 is current device Scanning mmc 1:1... Found U-Boot script /boot/boot.scr 3185 bytes read in 6 ms (517.6 KiB/s) ## Executing script at 00500000 Boot script loaded from mmc 1 208 bytes read in 6 ms (33.2 KiB/s) 16004897 bytes read in 682 ms (22.4 MiB/s) 27275776 bytes read in 1158 ms (22.5 MiB/s) 79946 bytes read in 14 ms (5.4 MiB/s) 2698 bytes read in 10 ms (262.7 KiB/s) Applying kernel provided DT fixup script (rockchip-fixup.scr) ## Executing script at 09000000 ## Loading init Ramdisk from Legacy Image at 06000000 ... Image Name: uInitrd Image Type: AArch64 Linux RAMDisk Image (gzip compressed) Data Size: 16004833 Bytes = 15.3 MiB Load Address: 00000000 Entry Point: 00000000 Verifying Checksum ... OK ## Flattened Device Tree blob at 01f00000 Booting using the fdt blob at 0x1f00000 Loading Ramdisk to f4fa3000, end f5ee66e1 ... OK Loading Device Tree to 00000000f4f27000, end 00000000f4fa2fff ... OK Starting kernel ... EDIT: tried to fix the system via chroot, downgraded to kernel 5.8.13 but that's even worse as it just kernel panics on boot: Spoiler DDR Version 1.24 20191016 In channel 0 CS = 0 MR0=0x18 MR4=0x1 MR5=0x1 MR8=0x10 MR12=0x72 MR14=0x72 MR18=0x0 MR19=0x0 MR24=0x8 MR25=0x0 channel 1 CS = 0 MR0=0x18 MR4=0x1 MR5=0x1 MR8=0x10 MR12=0x72 MR14=0x72 MR18=0x0 MR19=0x0 MR24=0x8 MR25=0x0 channel 0 training pass! channel 1 training pass! change freq to 416MHz 0,1 Channel 0: LPDDR4,416MHz Bus Width=32 Col=10 Bank=8 Row=16 CS=1 Die Bus-Width=16 Size=2048MB Channel 1: LPDDR4,416MHz Bus Width=32 Col=10 Bank=8 Row=16 CS=1 Die Bus-Width=16 Size=2048MB 256B stride channel 0 CS = 0 MR0=0x18 MR4=0x1 MR5=0x1 MR8=0x10 MR12=0x72 MR14=0x72 MR18=0x0 MR19=0x0 MR24=0x8 MR25=0x0 channel 1 CS = 0 MR0=0x18 MR4=0x1 MR5=0x1 MR8=0x10 MR12=0x72 MR14=0x72 MR18=0x0 MR19=0x0 MR24=0x8 MR25=0x0 channel 0 training pass! channel 1 training pass! channel 0, cs 0, advanced training done channel 1, cs 0, advanced training done change freq to 856MHz 1,0 ch 0 ddrconfig = 0x101, ddrsize = 0x40 ch 1 ddrconfig = 0x101, ddrsize = 0x40 pmugrf_os_reg[2] = 0x32C1F2C1, stride = 0xD ddr_set_rate to 328MHZ ddr_set_rate to 666MHZ ddr_set_rate to 928MHZ channel 0, cs 0, advanced training done channel 1, cs 0, advanced training done ddr_set_rate to 416MHZ, ctl_index 0 ddr_set_rate to 856MHZ, ctl_index 1 support 416 856 328 666 928 MHz, current 856MHz OUT Boot1: 2019-03-14, version: 1.19 CPUId = 0x0 ChipType = 0x10, 253 SdmmcInit=2 0 BootCapSize=100000 UserCapSize=14910MB FwPartOffset=2000 , 100000 mmc0:cmd5,20 SdmmcInit=0 0 BootCapSize=0 UserCapSize=30436MB FwPartOffset=2000 , 0 StorageInit ok = 79300 SecureMode = 0 SecureInit read PBA: 0x4 SecureInit read PBA: 0x404 SecureInit read PBA: 0x804 SecureInit read PBA: 0xc04 SecureInit read PBA: 0x1004 SecureInit read PBA: 0x1404 SecureInit read PBA: 0x1804 SecureInit read PBA: 0x1c04 SecureInit ret = 0, SecureMode = 0 atags_set_bootdev: ret:(0) GPT 0x3380ec0 signature is wrong recovery gpt... GPT 0x3380ec0 signature is wrong recovery gpt fail! LoadTrust Addr:0x4000 No find bl30.bin No find bl32.bin Load uboot, ReadLba = 2000 Load OK, addr=0x200000, size=0xdd6b0 RunBL31 0x40000 NOTICE: BL31: v1.3(debug):42583b6 NOTICE: BL31: Built : 07:55:13, Oct 15 2019 NOTICE: BL31: Rockchip release version: v1.1 INFO: GICv3 with legacy support detected. ARM GICV3 driver initialized in EL3 INFO: Using opteed sec cpu_context! INFO: boot cpu mask: 0 INFO: plat_rockchip_pmu_init(1190): pd status 3e INFO: BL31: Initializing runtime services WARNING: No OPTEE provided by BL2 boot loader, Booting device without OPTEE initialization. SMC`s destined for OPTEE will return SMC_UNK ERROR: Error initializing runtime service opteed_fast INFO: BL31: Preparing for EL3 exit to normal world INFO: Entry point address = 0x200000 INFO: SPSR = 0x3c9 U-Boot 2020.07-armbian (Oct 19 2020 - 08:25:23 +0200) SoC: Rockchip rk3399 Reset cause: POR DRAM: 3.9 GiB PMIC: RK808 SF: Detected w25q128 with page size 256 Bytes, erase size 4 KiB, total 16 MiB MMC: mmc@fe320000: 1, sdhci@fe330000: 0 Loading Environment from MMC... *** Warning - bad CRC, using default environment In: serial Out: serial Err: serial Model: Helios64 Revision: 1.2 - 4GB non ECC Net: eth0: ethernet@fe300000 scanning bus for devices... Hit any key to stop autoboot: 0 switch to partitions #0, OK mmc1 is current device Scanning mmc 1:1... Found U-Boot script /boot/boot.scr 3185 bytes read in 5 ms (622.1 KiB/s) ## Executing script at 00500000 Boot script loaded from mmc 1 208 bytes read in 5 ms (40 KiB/s) 16002766 bytes read in 681 ms (22.4 MiB/s) 27275776 bytes read in 1161 ms (22.4 MiB/s) 79946 bytes read in 13 ms (5.9 MiB/s) 2698 bytes read in 11 ms (239.3 KiB/s) Applying kernel provided DT fixup script (rockchip-fixup.scr) ## Executing script at 09000000 ## Loading init Ramdisk from Legacy Image at 06000000 ... Image Name: uInitrd Image Type: AArch64 Linux RAMDisk Image (gzip compressed) Data Size: 16002702 Bytes = 15.3 MiB Load Address: 00000000 Entry Point: 00000000 Verifying Checksum ... OK ## Flattened Device Tree blob at 01f00000 Booting using the fdt blob at 0x1f00000 Loading Ramdisk to f4fa3000, end f5ee5e8e ... OK Loading Device Tree to 00000000f4f27000, end 00000000f4fa2fff ... OK Starting kernel ... [ 35.795879] Internal error: Oops: 96000004 [#1] PREEMPT SMP [ 35.796384] Modules linked in: snd_soc_hdmi_codec hantro_vpu(C) rockchip_vdec(C) rockchipdrm rockchip_rga snd_soc_rockchip_i2s v4l2_h264 dw_mipi_dsi videobuf2_dma_contig snd_soc_core dw_hdmi v4l2_mem2mem videobuf2_vmalloc videobuf2_dma_sg r8152 analogix_dp snd_pcm_dmaengine videobuf2_memops videobuf2_v4l2 snd_pcm drm_kms_helper panfrost cec videobuf2_common snd_timer videodev rc_core gpu_sched sg snd drm mc soundcore gpio_charger drm_panel_orientation_quirks fusb30x(C) leds_pwm pwm_fan gpio_beeper cpufreq_dt nfsd auth_rpcgss nfs_acl lockd grace dm_mod sunrpc lm75 ip_tables x_tables autofs4 raid10 raid1 raid0 multipath linear raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx realtek md_mod dwmac_rk stmmac_platform stmmac mdio_xpcs adc_keys [ 35.802325] CPU: 5 PID: 0 Comm: swapper/5 Tainted: G C 5.8.13-rockchip64 #20.08.8 [ 35.803100] Hardware name: Helios64 (DT) [ 35.803455] pstate: 00000085 (nzcv daIf -PAN -UAO BTYPE=--) [ 35.803959] pc : __ww_mutex_lock.isra.0+0x148/0x808 [ 35.804397] lr : rcu_note_context_switch+0xb8/0x550 [ 35.804829] sp : ffff800011c63e20 [ 35.805128] x29: ffff800011c63e20 x28: ffff0000f6ea6580 [ 35.805601] x27: ffff800011a02000 x26: 0000000000000000 [ 35.806075] x25: 000000000000ff8e x24: ffff800011809980 [ 35.806548] x23: 0000000000000000 x22: ffff0000f6ea6590 [ 35.807021] x21: ffff800011269a78 x20: ffff8000115087c0 [ 35.807494] x19: ffff8000115087d0 x18: 0000000000000000 [ 35.807966] x17: 0000000000000000 x16: 0000000000000000 [ 35.808438] x15: 0000000000000000 x14: 0000063c0d107ae6 [ 35.808910] x13: 0000000000000081 x12: 0000000000000089 [ 35.809382] x11: 0000000000000001 x10: 0000000000000a20 [ 35.809855] x9 : ffff800011c63e70 x8 : ffff0000f6ea7000 [ 35.810327] x7 : 00000000ffffffff x6 : ffff800011265a52 [ 35.810799] x5 : 00ffffffffffffff x4 : ffff0000f77c9898 [ 35.811271] x3 : ffff8000114ee018 x2 : ffff0000f6ea6590 [ 35.811743] x1 : 000000000000002a x0 : 0000000000000028 [ 35.812216] Call trace: [ 35.812442] __ww_mutex_lock.isra.0+0x148/0x808 [ 35.812848] rcu_note_context_switch+0x78/0x550 [ 35.813253] __schedule+0xac/0x808 [ 35.813559] schedule_idle+0x28/0x48 [ 35.813882] do_idle+0x184/0x288 [ 35.814173] cpu_startup_entry+0x24/0x68 [ 35.814528] secondary_start_kernel+0x140/0x178 [ 35.814937] Code: 79402f79 aa1603e2 eb1602ff 540002c0 (f9400ee0) [ 35.815487] ---[ end trace 0f1e6a4a6bca601e ]--- [ 35.815899] Kernel panic - not syncing: Attempted to kill the idle task! [ 35.816495] SMP: stopping secondary CPUs [ 35.816856] Kernel Offset: disabled [ 35.817171] CPU features: 0x240022,2000600c [ 35.817543] Memory Limit: none [ 35.817828] Rebooting in 90 seconds..
aldweb Posted October 22, 2020 Posted October 22, 2020 On 10/18/2020 at 5:26 PM, gprovost said: It might be that one side of front panel (the side with red LEDs) touch a bit the metal opening shorting the LED therefore lighting them up. Could you trip to loosen a bit the 2x screws holding the front panel, then push a bit back the PCB, then tighten again the screw. Otherwise putting a piece of tape on the PCB side that touch the metal opening, to isolate the LED, could help. For next batch we will have to increase more the gap because mass production doesn't seem to meet exactly our tolerance requirement :-/ Thanks, the trick of putting a piece of tape on the PCB side that touches the metal opening solved the issue for me too. It should be added to the wiki...
gprovost Posted October 23, 2020 Author Posted October 23, 2020 @barnumbirr Not really sure what happened to your setup and it's going to be hard to help will all those sequence of event you posted. I would suggest you just do a fresh install. You can find previous build here : https://archive.armbian.com/helios64/archive/ 1
barnumbirr Posted October 23, 2020 Posted October 23, 2020 8 hours ago, gprovost said: @barnumbirr Not really sure what happened to your setup and it's going to be hard to help will all those sequence of event you posted. I would suggest you just do a fresh install. You can find previous build here : https://archive.armbian.com/helios64/archive/ @gprovostI'll do just that: I have a full SD Card backup of 5.8.13 around so I'll give that a spin. Quick question: as I'm not running the stock fans, is there a way for /etc/fancontrol not to be overwritten at every update?
Salamandar Posted October 23, 2020 Posted October 23, 2020 (edited) On 10/18/2020 at 8:56 PM, Bethlehem said: My only concern now is the CPU temperature. It sometimes runs up to around 65-70 C. I got a spare 2-pin cpu fan that I bought for my Raspberry Pi 4 and I think it could be installed onto the Helios64's heatsink. I just have to figure out where should the two power pins be put onto. That's easy : do NOT follow the instructions from Kobol. Turn the fans the other way so that they PUSH air in the enclosure. I'm around 43°C in idle AND compilation (that's still a lot for idle though, but way more acceptable). Edited October 23, 2020 by Salamandar
barnumbirr Posted October 23, 2020 Posted October 23, 2020 16 minutes ago, Salamandar said: That's easy : do NOT follow the instructions from Kobol. Turn the fans the other way so that they PUSH air in the enclosure. I'm around 43°C in idle AND compilation (that's still a lot for idle though, but way more acceptable). Wait, this can't be right. That's a 30°C delta over the standard fan configuration :O
axeleroy Posted October 23, 2020 Posted October 23, 2020 (edited) Hello, After I had issues with DNS resolution on my Helios64, I flashed a brand new Armbian image and now I cannot seem to connect to any devices on the network. I tried pinging my computer, router and 8.8.8.8 without success (which I can from my computers or router). I literally just I set root's password. I literally have no idea why this is happening. Every other computer in the network are working fine and the Helios64 has no problem the first few days I used it. Big thanks to anyone willing to help me! --- Version: Armbian 20.08.13 Debian 10 Buster (Kernel 5.8.16) DHCP config: Default gateway: 192.168.0.1 (my router's IP) Primary DNS: 80.67.169.12 (DNS from https://en.wikipedia.org/wiki/French_Data_Network) Secondary DNS: 192.168.1.1 (my ISP's router) My router's DHCP is set to give 192.168.0.110 to my Helio64's MAC address. Edited October 23, 2020 by axeleroy Clarifying that I tried pinging from other devices
axeleroy Posted October 23, 2020 Posted October 23, 2020 Forgot to add ifconfig output, it might be useful root@helios64:~# ifconfig eth0: flags=4163<UP,BROADCAST,RUNNING,MULTICAST> mtu 1500 inet 192.168.0.110 netmask 255.255.255.0 broadcast 192.168.0.255 inet6 fe80::5550:854d:6ad0:b6d7 prefixlen 64 scopeid 0x20<link> ether 64:62:66:d0:03:7c txqueuelen 1000 (Ethernet) RX packets 390 bytes 60863 (59.4 KiB) RX errors 0 dropped 1 overruns 0 frame 0 TX packets 384 bytes 35588 (34.7 KiB) TX errors 0 dropped 0 overruns 0 carrier 0 collisions 0 device interrupt 27 eth1: flags=4099<UP,BROADCAST,MULTICAST> mtu 1500 ether 64:62:66:d0:03:7d txqueuelen 1000 (Ethernet) RX packets 0 bytes 0 (0.0 B) RX errors 0 dropped 0 overruns 0 frame 0 TX packets 0 bytes 0 (0.0 B) TX errors 0 dropped 0 overruns 0 carrier 0 collisions 0 lo: flags=73<UP,LOOPBACK,RUNNING> mtu 65536 inet 127.0.0.1 netmask 255.0.0.0 inet6 ::1 prefixlen 128 scopeid 0x10<host> loop txqueuelen 1000 (Local Loopback) RX packets 32 bytes 2640 (2.5 KiB) RX errors 0 dropped 0 overruns 0 frame 0 TX packets 32 bytes 2640 (2.5 KiB) TX errors 0 dropped 0 overruns 0 carrier 0 collisions 0 I also did further testing: When pinging from my router, I get a timeout (set to 2000ms) When pinging my desktop and laptop computers from the Helios64, I get a response. When pinging its IP (192.168.0.110) from the Helios64, I get a response. When pinging the Helios from my desktop and laptop, I get a response.
flower Posted October 23, 2020 Posted October 23, 2020 Forgot to add ifconfig output, it might be useful root@helios64:~# ifconfigeth0: flags=4163 mtu 1500 inet 192.168.0.110 netmask 255.255.255.0 broadcast 192.168.0.255 inet6 fe80::5550:854d:6ad0:b6d7 prefixlen 64 scopeid 0x20 ether 64:62:66:d0:03:7c txqueuelen 1000 (Ethernet) RX packets 390 bytes 60863 (59.4 KiB) RX errors 0 dropped 1 overruns 0 frame 0 TX packets 384 bytes 35588 (34.7 KiB) TX errors 0 dropped 0 overruns 0 carrier 0 collisions 0 device interrupt 27 eth1: flags=4099 mtu 1500 ether 64:62:66:d0:03:7d txqueuelen 1000 (Ethernet) RX packets 0 bytes 0 (0.0 RX errors 0 dropped 0 overruns 0 frame 0 TX packets 0 bytes 0 (0.0 TX errors 0 dropped 0 overruns 0 carrier 0 collisions 0lo: flags=73 mtu 65536 inet 127.0.0.1 netmask 255.0.0.0 inet6 ::1 prefixlen 128 scopeid 0x10 loop txqueuelen 1000 (Local Loopback) RX packets 32 bytes 2640 (2.5 KiB) RX errors 0 dropped 0 overruns 0 frame 0 TX packets 32 bytes 2640 (2.5 KiB) TX errors 0 dropped 0 overruns 0 carrier 0 collisions 0 I also did further testing: When pinging from my router, I get a timeout (set to 2000ms) When pinging my desktop and laptop computers from the Helios64, I get a response. When pinging its IP (192.168.0.110) from the Helios64, I get a response. When pinging the Helios from my desktop and laptop, I get a response. Maybe its an ip conflict?apt-get install arp-scanarp-scan --localnet -lI didnt have any net problems with eth0 with any version. Because of stability issues i am on kernel 5.8 armbian release .10 thoughGesendet von meinem CLT-L29 mit Tapatalk
axeleroy Posted October 24, 2020 Posted October 24, 2020 (edited) 19 hours ago, flower said: Maybe its an ip conflict? apt-get install arp-scan arp-scan --localnet -l I switched to eth1 to install it, then switched back to eth0 and ran arp-scan. There are lots of reference to my decommissioned Raspberry Pi on 192.168.0.110. Turns out I forgot to delete the ARP binding on my router to make sure my old Pi was given 192.168.0.110. PEBKAC… Thanks a lot anyway! Edited October 24, 2020 by axeleroy
dancgn Posted October 25, 2020 Posted October 25, 2020 So, there was the new 20.08.16 today... Someone test it? Same anouying bug that kill the server? I'm back to 20.08.10, that was the one that works for me... 1
registr123 Posted October 25, 2020 Posted October 25, 2020 1 hour ago, dancgn said: So, there was the new 20.08.16 today... Someone test it? Same anouying bug that kill the server? I'm back to 20.08.10, that was the one that works for me... didn't work for me also. rolled back to 20.08.10
Gareth Halfacree Posted October 25, 2020 Posted October 25, 2020 Received my Helios64 and installed Armbian Focal. All seems well for me, except: $ lscpu Architecture: aarch64 CPU op-mode(s): 32-bit, 64-bit Byte Order: Little Endian CPU(s): 6 On-line CPU(s) list: 0-5 Thread(s) per core: 1 Core(s) per socket: 3 Socket(s): 2 NUMA node(s): 1 Vendor ID: ARM Model: 4 Model name: Cortex-A53 Stepping: r0p4 CPU max MHz: 1800.0000 CPU min MHz: 408.0000 BogoMIPS: 48.00 NUMA node0 CPU(s): 0-5 Vulnerability Itlb multihit: Not affected Vulnerability L1tf: Not affected Vulnerability Mds: Not affected Vulnerability Meltdown: Not affected Vulnerability Spec store bypass: Vulnerable Vulnerability Spectre v1: Mitigation; __user pointer sanitization Vulnerability Spectre v2: Vulnerable Vulnerability Srbds: Not affected Vulnerability Tsx async abort: Not affected Flags: fp asimd evtstrm aes pmull sha1 sha2 crc32 cpuid Are there mitigations in the works for those speculative store bypass and SPECTRE v2 vulnerabilities? Also, is there any way to monitor the UPS battery's charge status? I've found gpio-charger/status, which gives me "Not charging" when it's fully topped up and "Charging" when it's charging or (confusingly) discharging, but I can't find anything else of use in there.
flower Posted October 25, 2020 Posted October 25, 2020 1 hour ago, Gareth Halfacree said: Are there mitigations in the works for those speculative store bypass and SPECTRE v2 vulnerabilities? not sure about arm but afaik they are already mitigated in linux. you may need to compile an own kernel. i wouldnt though: as long as you dont plan to use your helios64 as a cloud hosting platform for customers or run untrusted code there is no need to do so. its not worth the performance impact.
Borromini Posted October 26, 2020 Posted October 26, 2020 (edited) 11 hours ago, flower said: not sure about arm but afaik they are already mitigated in linux. you may need to compile an own kernel. I'd reckon Armbian maintainers turn these on by default, you can grep for them (if you know the name of the mitigation symbols in the kernel) in /boot/config-*. E.g. the CONFIG_CPU_SPECTRE symbol on my Exynos-based Odroid XU4: # grep -i spectre /boot/config-5.4.72-odroidxu4 CONFIG_CPU_SPECTRE=y Apparently 5.10 comes with rewritten Spectre mitigation code for ARM64. Edited October 26, 2020 by Borromini Extra background information.
flower Posted October 26, 2020 Posted October 26, 2020 Ok, folks its over this unit was sold as a "high quality nas". i was expecting a little tweaking and some flaws but not those instabilites. they are just inaccable for a nas. a nas is about data integrity! and there is no filesystem which likes regular kernel locks. if kobol would sit in germany i would send it back. i have migrated all disks and docker containers to an old i6900k i had lying around. i will have a look again in a few months. btw: i even advertised and sold this unit to friends and would have continued to do so. but: no official communication - and even your install page is broken since weeks - thats just too much. sad
registr123 Posted October 26, 2020 Posted October 26, 2020 2 hours ago, flower said: if kobol would sit in germany i would send it back. I'm pretty sure you will be able to sell it plenty of people in germany would like to get it of your hands. 1
Recommended Posts