fri.K

  • Posts

    6
  • Joined

  • Last visited

fri.K's Achievements

  1. Problem looks very similar like on Odroid N2 SBC with the same SoC RK3399 [ 3.946307] kernel: meson8b-dwmac ff3f0000.ethernet: IRQ eth_wake_irq not found [ 3.947735] kernel: meson8b-dwmac ff3f0000.ethernet: IRQ eth_lpi not found [ 3.959955] kernel: meson8b-dwmac ff3f0000.ethernet: PTP uses main clock [ 3.960322] kernel: meson8b-dwmac ff3f0000.ethernet: no reset control found [ 3.987067] kernel: meson8b-dwmac ff3f0000.ethernet: User ID: 0x11, Synopsys ID: 0x37 [ 3.988569] kernel: meson8b-dwmac ff3f0000.ethernet: DWMAC1000 [ 3.993813] kernel: meson8b-dwmac ff3f0000.ethernet: DMA HW capability register supported [ 4.008458] kernel: meson8b-dwmac ff3f0000.ethernet: RX Checksum Offload Engine supported [ 4.017824] kernel: meson8b-dwmac ff3f0000.ethernet: COE Type 2 [ 4.038408] kernel: meson8b-dwmac ff3f0000.ethernet: TX Checksum insertion supported [ 4.045460] kernel: meson8b-dwmac ff3f0000.ethernet: Wake-Up On Lan supported [ 4.056526] kernel: meson8b-dwmac ff3f0000.ethernet: Normal descriptors [ 4.070716] kernel: meson8b-dwmac ff3f0000.ethernet: Ring mode enabled [ 4.076492] kernel: meson8b-dwmac ff3f0000.ethernet: Enable RX Mitigation via HW Watchdog Timer [ 4.076496] kernel: meson8b-dwmac ff3f0000.ethernet: device MAC address 00:11:22:33:44:FF [ 97.207023] kernel: meson8b-dwmac ff3f0000.ethernet eth0: PHY [0.0:00] driver [RTL8211F Gigabit Ethernet] (irq=42) [ 97.209362] kernel: meson8b-dwmac ff3f0000.ethernet eth0: No Safety Features support found [ 97.209373] kernel: meson8b-dwmac ff3f0000.ethernet eth0: PTP not supported by HW [ 97.209582] kernel: meson8b-dwmac ff3f0000.ethernet eth0: configuring for phy/rgmii link mode [ 100.670345] kernel: meson8b-dwmac ff3f0000.ethernet eth0: Link is Up - 1Gbps/Full - flow control off [ 100.677458] kernel: meson8b-dwmac ff3f0000.ethernet eth0: Link is Down [ 100.819033] kernel: meson8b-dwmac ff3f0000.ethernet eth0: PHY [0.0:00] driver [RTL8211F Gigabit Ethernet] (irq=42) [ 100.861699] kernel: meson8b-dwmac ff3f0000.ethernet eth0: No Safety Features support found [ 100.861714] kernel: meson8b-dwmac ff3f0000.ethernet eth0: PTP not supported by HW [ 100.861727] kernel: meson8b-dwmac ff3f0000.ethernet eth0: configuring for phy/rgmii link mode If I manually try to activate a bit different USB card on Helios64 I got this: # nmcli conn up Team0 Connection successfully activated (master waiting for slaves) (D-Bus active path: /org/freedesktop/NetworkManager/ActiveConnection/8) root@helios64:~# [179508.201299] Internal error: Oops - BUG: 0 [#1] PREEMPT SMP [179508.201798] Modules linked in: macvlan veth nf_conntrack_netlink xfrm_user xfrm_algo br_netfilter bridge aufs team_mode_loadbalance team governor_performance rfkill cdc_ether usbnet xt_conntrack nft_counter nft_chain_nat xt_nat xt_tcpudp zram xt_MASQUERADE nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 xt_addrtype nft_compat nf_tables nfnetlink r8152 snd_soc_hdmi_codec ftdi_sio usbserial snd_soc_rockchip_i2s snd_soc_core gpio_charger rockchip_vdec(C) snd_pcm_dmaengine hantro_vpu(C) rockchipdrm snd_pcm leds_pwm pwm_fan dw_mipi_dsi snd_timer panfrost rockchip_rga v4l2_h264 dw_hdmi snd videobuf2_dma_contig analogix_dp soundcore videobuf2_dma_sg gpu_sched v4l2_mem2mem fusb302 drm_kms_helper videobuf2_vmalloc videobuf2_memops videobuf2_v4l2 tcpm videobuf2_common cec videodev rc_core typec mc drm sg drm_panel_orientation_quirks gpio_beeper cpufreq_dt nfsd auth_rpcgss nfs_acl lockd grace ledtrig_netdev lm75 sunrpc ip_tables x_tables autofs4 raid10 raid1 raid0 multipath linear raid456 [179508.201952] async_raid6_recov async_memcpy async_pq async_xor async_tx md_mod uas realtek dwmac_rk stmmac_platform stmmac pcs_xpcs adc_keys [179508.210617] CPU: 4 PID: 1029 Comm: kworker/4:42 Tainted: G C 5.10.63-rockchip64 #21.08.2 [179508.211445] Hardware name: Helios64 (DT) [179508.211809] Workqueue: usb_hub_wq hub_event [179508.212187] pstate: 00000005 (nzcv daif -PAN -UAO -TCO BTYPE=--) [179508.212732] pc : rtl8152_post_reset+0x1d4/0x1e0 [r8152] [179508.213201] lr : rtl8152_post_reset+0x3c/0x1e0 [r8152] [179508.213658] sp : ffff80001958bbd0 [179508.213959] x29: ffff80001958bbd0 x28: ffff00000754f800 [179508.214435] x27: 0000000000000010 x26: 0000000000000000 [179508.214911] x25: ffff000007610b20 x24: 00000000ffffffed [179508.215387] x23: 0000000000000000 x22: ffff000007885000 [179508.215863] x21: ffff000007464800 x20: ffff8000118b9948 [179508.216339] x19: ffff000007885980 x18: ffff8000118dee10 [179508.216815] x17: 0000000000000000 x16: 0000000000000000 [179508.217291] x15: 0000000000000428 x14: ffff80001958b790 [179508.217765] x13: 00000000ffffffea x12: ffff80001194ee48 [179508.218241] x11: 0000000000000003 x10: ffff800011936e08 [179508.218716] x9 : ffff800011936e60 x8 : 0000000000017fe8 [179508.219192] x7 : c0000000ffffefff x6 : 0000000000000001 [179508.219668] x5 : 0000000000000001 x4 : 0000000000000000 [179508.220144] x3 : 0000000000000001 x2 : b9eafe8d9839b500 [179508.220619] x1 : 0000000000000000 x0 : 0000000000000010 [179508.221096] Call trace: [179508.221324] rtl8152_post_reset+0x1d4/0x1e0 [r8152] [179508.221761] usb_reset_device+0x128/0x258 [179508.222122] hub_event+0xe94/0x1618 [179508.222439] process_one_work+0x1ec/0x4d0 [179508.222801] worker_thread+0x48/0x478 [179508.223133] kthread+0x140/0x150 [179508.223428] ret_from_fork+0x10/0x34 [179508.223752] Code: d2800100 910042a3 14009623 17ffffdc (d4210000) [179508.224297] ---[ end trace 087e94c7a84ef6dd ]--- [179508.224711] note: kworker/4:42[1029] exited with preempt_count 1 [179508.225383] ------------[ cut here ]------------ [179508.225812] WARNING: CPU: 4 PID: 0 at kernel/rcu/tree.c:624 rcu_eqs_enter.isra.63+0x138/0x140 [179508.226564] Modules linked in: macvlan veth nf_conntrack_netlink xfrm_user xfrm_algo br_netfilter bridge aufs team_mode_loadbalance team governor_performance rfkill cdc_ether usbnet xt_conntrack nft_counter nft_chain_nat xt_nat xt_tcpudp zram xt_MASQUERADE nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 xt_addrtype nft_compat nf_tables nfnetlink r8152 snd_soc_hdmi_codec ftdi_sio usbserial snd_soc_rockchip_i2s snd_soc_core gpio_charger rockchip_vdec(C) snd_pcm_dmaengine hantro_vpu(C) rockchipdrm snd_pcm leds_pwm pwm_fan dw_mipi_dsi snd_timer panfrost rockchip_rga v4l2_h264 dw_hdmi snd videobuf2_dma_contig analogix_dp soundcore videobuf2_dma_sg gpu_sched v4l2_mem2mem fusb302 drm_kms_helper videobuf2_vmalloc videobuf2_memops videobuf2_v4l2 tcpm videobuf2_common cec videodev rc_core typec mc drm sg drm_panel_orientation_quirks gpio_beeper cpufreq_dt nfsd auth_rpcgss nfs_acl lockd grace ledtrig_netdev lm75 sunrpc ip_tables x_tables autofs4 raid10 raid1 raid0 multipath linear raid456 [179508.226715] async_raid6_recov async_memcpy async_pq async_xor async_tx md_mod uas realtek dwmac_rk stmmac_platform stmmac pcs_xpcs adc_keys [179508.235375] CPU: 4 PID: 0 Comm: swapper/4 Tainted: G D C 5.10.63-rockchip64 #21.08.2 [179508.236156] Hardware name: Helios64 (DT) [179508.236511] pstate: 20000085 (nzCv daIf -PAN -UAO -TCO BTYPE=--) [179508.237046] pc : rcu_eqs_enter.isra.63+0x138/0x140 [179508.237474] lr : rcu_eqs_enter.isra.63+0x1c/0x140 [179508.237894] sp : ffff800011cebf10 [179508.238194] x29: ffff800011cebf10 x28: 0000000000000000 [179508.238671] x27: 0000000000000000 x26: ffff000000710e80 [179508.239147] x25: 0000000000000000 x24: ffff800011307b40 [179508.239623] x23: ffff80001157e978 x22: ffff8000118b9948 [179508.240097] x21: ffff8000118ba2e8 x20: ffff8000118b99c8 [179508.240573] x19: ffff800011580a40 x18: 0000000000000004 [179508.241049] x17: 0000000000000001 x16: 0000000000000019 [179508.241524] x15: ffff8000118da498 x14: 000000000000015a [179508.242000] x13: 0000000000000000 x12: 0000000000000001 [179508.242476] x11: 0000000000000040 x10: ffff8000118d9c98 [179508.242952] x9 : ffff8000118d9c90 x8 : ffff000000800028 [179508.243428] x7 : 0000000000000000 x6 : 000003eb30cab0ae [179508.243904] x5 : 00ffffffffffffff x4 : ffff8000e6226000 [179508.244380] x3 : 0000000000000001 x2 : 4000000000000000 [179508.244854] x1 : 4000000000000002 x0 : ffff0000f77a6a40 [179508.245330] Call trace: [179508.245556] rcu_eqs_enter.isra.63+0x138/0x140 [179508.245956] rcu_idle_enter+0x10/0x20 [179508.246286] default_idle_call+0x40/0x1bc [179508.246646] do_idle+0x204/0x278 [179508.246939] cpu_startup_entry+0x24/0x60 [179508.247295] secondary_start_kernel+0x168/0x178 [179508.247700] ---[ end trace 087e94c7a84ef6de ]---
  2. H, I'm also facing issues teaming and bonding both NICs in Helios64. Similar configuration works on Helios4 (second NIC added vie USB 3.0 port). Once 1Gb link sets the speed it crashes and can't be set up: $ dmesg [ 3097.357372] rk_gmac-dwmac fe300000.ethernet eth0: Link is Up - 1Gbps/Full - flow control off [ 3097.357423] IPv6: ADDRCONF(NETDEV_CHANGE): eth0: link becomes ready [ 3097.372171] rk_gmac-dwmac fe300000.ethernet eth0: Link is Down [ 3097.379293] bond0: (slave eth0): Error: Device is in use and cannot be enslaved [ 3097.412528] rk_gmac-dwmac fe300000.ethernet eth0: PHY [stmmac-0:00] driver [RTL8211F Gigabit Ethernet] (irq=POLL) [ 3097.625230] rk_gmac-dwmac fe300000.ethernet: Failed to reset the dma [ 3097.625246] rk_gmac-dwmac fe300000.ethernet eth0: stmmac_hw_setup: DMA engine initialization failed [ 3097.625257] rk_gmac-dwmac fe300000.ethernet eth0: stmmac_open: Hw setup failed [ 3097.676541] rk_gmac-dwmac fe300000.ethernet eth0: PHY [stmmac-0:00] driver [RTL8211F Gigabit Ethernet] (irq=POLL) [ 3097.885517] rk_gmac-dwmac fe300000.ethernet: Failed to reset the dma [ 3097.885538] rk_gmac-dwmac fe300000.ethernet eth0: stmmac_hw_setup: DMA engine initialization failed [ 3097.885554] rk_gmac-dwmac fe300000.ethernet eth0: stmmac_open: Hw setup failed [ 3097.944560] rk_gmac-dwmac fe300000.ethernet eth0: PHY [stmmac-0:00] driver [RTL8211F Gigabit Ethernet] (irq=POLL) [ 3098.156236] rk_gmac-dwmac fe300000.ethernet: Failed to reset the dma [ 3098.156253] rk_gmac-dwmac fe300000.ethernet eth0: stmmac_hw_setup: DMA engine initialization failed [ 3098.156264] rk_gmac-dwmac fe300000.ethernet eth0: stmmac_open: Hw setup failed [ 3098.204543] rk_gmac-dwmac fe300000.ethernet eth0: PHY [stmmac-0:00] driver [RTL8211F Gigabit Ethernet] (irq=POLL) [ 3098.410564] rk_gmac-dwmac fe300000.ethernet: Failed to reset the dma [ 3098.410581] rk_gmac-dwmac fe300000.ethernet eth0: stmmac_hw_setup: DMA engine initialization failed [ 3098.410592] rk_gmac-dwmac fe300000.ethernet eth0: stmmac_open: Hw setup failed [ 3098.556619] rk_gmac-dwmac fe300000.ethernet eth0: PHY [stmmac-0:00] driver [RTL8211F Gigabit Ethernet] (irq=POLL) [ 3098.768443] rk_gmac-dwmac fe300000.ethernet: Failed to reset the dma [ 3098.768459] rk_gmac-dwmac fe300000.ethernet eth0: stmmac_hw_setup: DMA engine initialization failed [ 3098.768465] rk_gmac-dwmac fe300000.ethernet eth0: stmmac_open: Hw setup failed Looking at https://wiki.kobol.io/helios64/ethernet $ ethtool -i eth0 driver: st_gmac version: Jan_2016 firmware-version: expansion-rom-version: bus-info: supports-statistics: yes supports-test: no supports-eeprom-access: no supports-register-dump: yes supports-priv-flags: no $ lshw *-network:0 description: Ethernet interface physical id: 7 logical name: eth0 serial: 64:62:FF:FF:FF:FF capacity: 1Gbit/s capabilities: ethernet physical tp mii 10bt 10bt-fd 100bt 100bt-fd 1000bt-fd autonegotiation configuration: autonegotiation=on broadcast=yes driver=st_gmac driverversion=Jan_2016 link=no multicast=yes port=twisted pair $ ethtool eth0 Settings for eth0: Supported ports: [ TP MII ] Supported link modes: 10baseT/Half 10baseT/Full 100baseT/Half 100baseT/Full 1000baseT/Full Supported pause frame use: Symmetric Receive-only Supports auto-negotiation: Yes Supported FEC modes: Not reported Advertised link modes: 10baseT/Half 10baseT/Full 100baseT/Half 100baseT/Full 1000baseT/Full Advertised pause frame use: Symmetric Receive-only Advertised auto-negotiation: Yes Advertised FEC modes: Not reported Speed: Unknown! Duplex: Unknown! (255) Port: Twisted Pair PHYAD: 0 Transceiver: internal Auto-negotiation: on MDI-X: Unknown Cannot get wake-on-lan settings: Operation not permitted Current message level: 0x0000003f (63) drv probe link timer ifdown ifup Link detected: no $ uname -a Linux nas 5.10.63-rockchip64 #21.08.2 SMP PREEMPT Wed Sep 8 10:57:23 UTC 2021 aarch64 GNU/Linux Is it a kernel bug? If yes, how to debug it further?
  3. @Mangix, try 5.4.66-mvebu #20.08.3 , I also had random reboots after some updates on heavy NFS loads again, previously I had this problem, but after reinstalling system from scratch to spare sd card system was rock solid. I inserted spare card again and it's stable again, but I have no time to test where the problem is now thought.
  4. Hi, as of my problem with Helios4 rebooting on heavy NFS load I couldn't make any progress, even worse it even stopped booting. Disabling watchdog did not help as serial also freezes and I was unable to do anything, there was no even blinking LED as I remember. That forced me to use another SD card with fresh official Debian 10 installation image and configure it once again. There is only basic configuration, only SSSD for LDAP and NFS server, no docker and no http services but NAS runs rock solid for 6 days now even it's some cheep 4GB uSD card. So I still don't know if it's previous Samsung Evo 32GB sd card, or specific configuration, or some issues related to OS upgrades but as there is no spontaneous reboots any more I'm happy that it's not electronics ailing. Hope my new Helios64 will also work even more stable If I manage to debug more on my ex problem I'll write, but I treat it as super low important from now. My advice for people with spontaneous reboots on Helios4: try to use another SD card with fresh OS, do minimal configuration and try to stress it for few days.
  5. Hi, I'm struggling with Helios4 rebooting on hight SATA load. My Helios4 works with 4x2TB HGST SATA3 disk running in raid5 and Samsung Evo SD card Since beginning there were some problems with stability, but quite rare ~1per 3 months, but I didn't find it as a big problem. But when I upgraded OS to $ cat /etc/os-release PRETTY_NAME="Debian GNU/Linux 10 (buster)" NAME="Debian GNU/Linux" VERSION_ID="10" then issue came to be a problem because it can handle about a 1 minute or less 40-50MB/s transfer, then all LEDs stops blinking, and after short while it reboots. I connected via serial console, but tail -f /var/log/syslog didn't show anything at the moment of freeze, serial seems to freeze also. How can I debug it further, any ideas? There is no difference if I use current or legacy kernel, system is updated. I do not use SPI nor flash. My PSU was replaced about a month ago because original one died like many here. Unfortunately I had to buy used PSU as there was no other fast solution in my country at that time, should I consider it as an potential problem? I made some more tests and I don't think that PSU is to blame. I ran stress in many configurations, stress -c 2 -i 2 -m 2 -d 4 but Helios4 was stable for 20 minutes, then I ran iperf3 -s and stressed it with client for 20 minutes, then all together and no problem could be observed. Then I downloaded 16GB file over http and there was no problem, what is more speed was quite good ~75MB/s. At the end I started download of the same file over NFS with speed ~35MB/s and after 30 seconds it freezed. So problem is with NFS nfs-kernel-server 1:1.3.4-2.5+deb10u1 armhf Any idea what can be wrong with it? $ cat /etc/exports (...) /path/Data 192.168.19.0/24(fsid=10,rw,subtree_check,crossmnt) /path 192.168.19.0/24(ro,fsid=0,root_squash,no_subtree_check,hide) (...) both standard PCs (x86_64) and Odroid N2 (aarch64) are clients to this NFS server