************************ crashinfo ************************* /exports/testreports/38859/testresults/racer-special7-zfs-DNE-centos7_x86_64-centos7_x86_64/oleg234-client-timeout-core (3.10.0-7.9-debug) +==========================+ | *** Crashinfo v1.3.7 *** | +==========================+ +++WARNING+++ PARTIAL DUMP with size(vmcore) < 25% size(RAM) KERNEL: /tmp/crash-anaysis.aRXe4/vmlinux [TAINTED] DUMPFILE: /exports/testreports/38859/testresults/racer-special7-zfs-DNE-centos7_x86_64-centos7_x86_64/oleg234-client-timeout-core [PARTIAL DUMP] CPUS: 4 DATE: Fri Jan 19 13:00:46 EST 2024 UPTIME: 01:07:04 LOAD AVERAGE: 24.00, 24.01, 26.14 TASKS: 181 NODENAME: oleg234-client.virtnet RELEASE: 3.10.0-7.9-debug VERSION: #1 SMP Sat Mar 26 23:28:42 EDT 2022 MACHINE: x86_64 (2399 Mhz) MEMORY: 4 GB PANIC: "" +--------------------------+ >------------------------| Per-cpu Stacks ('bt -a') |------------------------< +--------------------------+ -- CPU#0 -- PID=0 CPU=0 CMD=swapper/0 #-1 native_safe_halt+0xb, 449 bytes of data #0 default_idle+0x1e #1 default_enter_idle+0x45 #2 cpuidle_enter_state+0x40 #3 cpuidle_idle_call+0xd8 #4 arch_cpu_idle+0xe #5 cpu_startup_entry+0x14a #6 rest_init+0x8e #7 start_kernel+0x456 #8 x86_64_start_reservations+0x2a #9 x86_64_start_kernel+0x152 #10 start_cpu+0x5 -- CPU#1 -- PID=0 CPU=1 CMD=swapper/1 #-1 native_safe_halt+0xb, 449 bytes of data #0 default_idle+0x1e #1 default_enter_idle+0x45 #2 cpuidle_enter_state+0x40 #3 cpuidle_idle_call+0xd8 #4 arch_cpu_idle+0xe #5 cpu_startup_entry+0x14a #6 start_secondary+0x1eb #7 start_cpu+0x5 -- CPU#2 -- PID=0 CPU=2 CMD=swapper/2 #-1 native_safe_halt+0xb, 449 bytes of data #0 default_idle+0x1e #1 default_enter_idle+0x45 #2 cpuidle_enter_state+0x40 #3 cpuidle_idle_call+0xd8 #4 arch_cpu_idle+0xe #5 cpu_startup_entry+0x14a #6 start_secondary+0x1eb #7 start_cpu+0x5 -- CPU#3 -- PID=0 CPU=3 CMD=swapper/3 #-1 native_safe_halt+0xb, 449 bytes of data #0 default_idle+0x1e #1 default_enter_idle+0x45 #2 cpuidle_enter_state+0x40 #3 cpuidle_idle_call+0xd8 #4 arch_cpu_idle+0xe #5 cpu_startup_entry+0x14a #6 start_secondary+0x1eb #7 start_cpu+0x5 +--------------------------------+ >---------------------| How This Dump Has Been Created |---------------------< +--------------------------------+ Cannot identify the specific condition that triggered vmcore +---------------+ >------------------------------| Tasks Summary |------------------------------< +---------------+ Number of Threads That Ran Recently ----------------------------------- last second 11 last 5s 32 last 60s 37 ----- Total Numbers of Threads per State ------ TASK_INTERRUPTIBLE 153 TASK_RUNNING 1 TASK_UNINTERRUPTIBLE 24 +++WARNING+++ There are 3 threads running in their own namespaces Use 'taskinfo --ns' to get more details +-----------------------+ >--------------------------| 5 Most Recent Threads |--------------------------< +-----------------------+ PID CMD Age ARGS ----- -------------- ------ ---------------------------- 20 rcuos/1 0 ms (no user stack) 13 watchdog/0 0 ms (no user stack) 1860 lnet_discovery 0 ms (no user stack) 5360 ldlm_bl_02 47 ms (no user stack) 48 kworker/2:1 62 ms (no user stack) +------------------------+ >-------------------------| Memory Usage (kmem -i) |-------------------------< +------------------------+ PAGES TOTAL PERCENTAGE TOTAL MEM 955079 3.6 GB ---- FREE 788887 3 GB 82% of TOTAL MEM USED 166192 649.2 MB 17% of TOTAL MEM SHARED 13637 53.3 MB 1% of TOTAL MEM BUFFERS 5146 20.1 MB 0% of TOTAL MEM CACHED 51365 200.6 MB 5% of TOTAL MEM SLAB 68842 268.9 MB 7% of TOTAL MEM TOTAL HUGE 0 0 ---- HUGE FREE 0 0 0% of TOTAL HUGE TOTAL SWAP 262143 1024 MB ---- SWAP USED 0 0 0% of TOTAL SWAP SWAP FREE 262143 1024 MB 100% of TOTAL SWAP COMMIT LIMIT 739682 2.8 GB ---- COMMITTED 68046 265.8 MB 9% of TOTAL LIMIT +++three oldest UNINTERRUPTIBLE threads ... ran 3491s ago PID=9836 CPU=2 CMD=mrename #0 __schedule+0x2e2 #1 schedule_preempt_disabled+0x39 #2 __mutex_lock_slowpath+0x13a #3 mutex_lock+0x2d #4 lock_rename+0xc0 #5 SYSC_renameat2+0x22f #6 sys_renameat2+0xe #7 sys_rename+0x1e #8 system_call_fastpath+0x1f, 477 bytes of data ... ran 3491s ago PID=9838 CPU=0 CMD=lfs #0 __schedule+0x2e2 #1 schedule_preempt_disabled+0x39 #2 __mutex_lock_slowpath+0x13a #3 mutex_lock+0x2d #4 do_last+0x296 #5 path_openat+0xcd #6 do_filp_open+0x4d #7 do_sys_open+0x124 #8 sys_open+0x1e #9 system_call_fastpath+0x1f, 477 bytes of data ... ran 3491s ago PID=9617 CPU=0 CMD=mrename #0 __schedule+0x2e2 #1 schedule_preempt_disabled+0x39 #2 __mutex_lock_slowpath+0x13a #3 mutex_lock+0x2d #4 lock_rename+0x31 #5 SYSC_renameat2+0x22f #6 sys_renameat2+0xe #7 sys_rename+0x1e #8 system_call_fastpath+0x1f, 477 bytes of data +-------------------------------+ >----------------------| Scheduler Runqueues (per CPU) |----------------------< +-------------------------------+ ---+ CPU=0 ---- | CURRENT TASK , CMD=swapper/0 ---+ CPU=1 ---- | CURRENT TASK , CMD=swapper/1 ---+ CPU=2 ---- | CURRENT TASK , CMD=swapper/2 ---+ CPU=3 ---- | CURRENT TASK , CMD=swapper/3 +------------------------+ >-------------------------| Network Status Summary |-------------------------< +------------------------+ TCP Connection Info ------------------- ESTABLISHED 6 LISTEN 3 NAGLE disabled (TCP_NODELAY): 5 user_data set (NFS etc.): 4 UDP Connection Info ------------------- 2 UDP sockets, 0 in ESTABLISHED Unix Connection Info ------------------------ ESTABLISHED 26 CLOSE 18 LISTEN 8 Raw sockets info -------------------- CLOSE 1 Interfaces Info --------------- How long ago (in seconds) interfaces transmitted/received? Name RX TX ---- ---------- --------- lo n/a 4021.9 eth0 n/a 2.4 RSS_TOTAL=96260 pages, %mem= 1.4 +++WARNING+++ Possible hang +++WARNING+++ Run 'hanginfo' to get more details +------------+ >-------------------------------| Mounted FS |-------------------------------< +------------+ MOUNT SUPERBLK TYPE DEVNAME DIRNAME ffff880138cca000 ffff880139940800 rootfs rootfs / ffff88012aaaa000 ffff88012aa70000 sysfs sysfs /sys ffff88012aaaa1c0 ffff880139944000 proc proc /proc ffff88012aaaa380 ffff880137668000 devtmpfs devtmpfs /dev ffff88012aaaa540 ffff8800b6cf1000 securityfs securityfs /sys/kernel/security ffff88012aaaa700 ffff88012aa70800 tmpfs tmpfs /dev/shm ffff88012aaaa8c0 ffff88013730d000 devpts devpts /dev/pts ffff88012aaaaa80 ffff88012aa71000 tmpfs tmpfs /run ffff88012aaaac40 ffff88012aa71800 tmpfs tmpfs /sys/fs/cgroup ffff88012aaaae00 ffff88012aa72000 cgroup cgroup /sys/fs/cgroup/systemd ffff88012aaaafc0 ffff88012aa72800 pstore pstore /sys/fs/pstore ffff88012aaab180 ffff88012aa74800 cgroup cgroup /sys/fs/cgroup/hugetlb ffff88012aaab340 ffff88012aa74000 cgroup cgroup /sys/fs/cgroup/net_cls,net_prio ffff88012aaab500 ffff88012aa73800 cgroup cgroup /sys/fs/cgroup/cpuset ffff88012aaab6c0 ffff88012aa73000 cgroup cgroup /sys/fs/cgroup/blkio ffff88012aaab880 ffff88012aa75000 cgroup cgroup /sys/fs/cgroup/devices ffff88012aaaba40 ffff88012aa75800 cgroup cgroup /sys/fs/cgroup/cpu,cpuacct ffff88012aaabc00 ffff88012aa76000 cgroup cgroup /sys/fs/cgroup/perf_event ffff88012aaabdc0 ffff88012aa76800 cgroup cgroup /sys/fs/cgroup/memory ffff88012aac6000 ffff88012aa77000 cgroup cgroup /sys/fs/cgroup/freezer ffff88012aac61c0 ffff88012aa77800 cgroup cgroup /sys/fs/cgroup/pids ffff880137652540 ffff880137005000 configfs configfs /sys/kernel/config ffff88012a42c000 ffff8800b6eb0000 ext4 /dev/nbd0 / ffff880138ccb880 ffff8800b6d24800 rpc_pipefs rpc_pipefs /var/lib/nfs/rpc_pipefs ffff880138ccb500 ffff8800b6e9f000 autofs systemd-1 /proc/sys/fs/binfmt_misc ffff880137652700 ffff880139947800 debugfs debugfs /sys/kernel/debug ffff88012a42c1c0 ffff8800b6eb3800 hugetlbfs hugetlbfs /dev/hugepages ffff88012aac6a80 ffff88013730e000 mqueue mqueue /dev/mqueue ffff8801376528c0 ffff8800b6e28800 binfmt_misc binfmt_misc /proc/sys/fs/binfmt_misc/ ffff88012a42c540 ffff8800b6eb4000 ramfs none /mnt ffff88012a42c700 ffff8800b6eb6000 tmpfs none /var/lib/stateless/writable ffff88012aac6c40 ffff88012a57d000 squashfs /dev/vda /home/green/git/lustre-release ffff880137652a80 ffff8800b6eb6000 tmpfs none /var/cache/man ffff880138ccba40 ffff8800b6eb6000 tmpfs none /var/log ffff880137652c40 ffff8800b6eb6000 tmpfs none /var/lib/dbus ffff88012a42c8c0 ffff8800b6eb6000 tmpfs none /tmp ffff880137652e00 ffff8800b6eb6000 tmpfs none /var/lib/dhclient ffff880138ccbc00 ffff8800b6eb6000 tmpfs none /var/tmp ffff880137652fc0 ffff8800b6eb6000 tmpfs none /var/lib/NetworkManager ffff88012a42ca80 ffff8800b6eb6000 tmpfs none /var/lib/systemd/random-seed ffff88012aac6e00 ffff8800b6eb6000 tmpfs none /var/spool ffff880137653180 ffff8800b6eb6000 tmpfs none /var/lib/nfs ffff880138ccbdc0 ffff8800b6eb6000 tmpfs none /var/lib/gssproxy ffff88012a42cc40 ffff8800b6eb6000 tmpfs none /var/lib/logrotate ffff88012aac6fc0 ffff8800b6eb6000 tmpfs none /etc ffff88012aac7180 ffff8800b6eb6000 tmpfs none /var/lib/rsyslog ffff88012aac7340 ffff8800b6eb6000 tmpfs none /var/lib/dhclient/var/lib/dhclient ffff88012aac7500 ffff88012a57c000 nfs4 192.168.200.253:/exports/state/oleg234-client.virtnet /var/lib/stateless/state ffff88012aac7a40 ffff88012a57c000 nfs4 192.168.200.253:/exports/state/oleg234-client.virtnet /boot ffff88012aac7c00 ffff88012a57c000 nfs4 192.168.200.253:/exports/state/oleg234-client.virtnet /etc/etc/kdump.conf ffff88012aac7dc0 ffff8800b6d24800 rpc_pipefs sunrpc /var/lib/nfs/var/lib/nfs/rpc_pipefs ffff8800b17541c0 ffff8800b60f6000 nfs4 192.168.200.253://exports/testreports/38859/testresults/racer-special7-zfs-DNE-centos7_x86_64-centos7_x86_64 /tmp/tmp/testlogs ffff8800b1754000 ffff8800b7b0e800 tmpfs tmpfs /run/user/0 ffff8800aad8a000 ffff88012a57d000 squashfs /dev/vda /usr/sbin/mount.lustre ffff8800aad8a8c0 ffff8800b6eb4800 lustre 192.168.202.134@tcp:/lustre /mnt/lustre ffff8800aad8aa80 ffff8800aadf7800 lustre 192.168.202.134@tcp:/lustre /mnt/lustre2 +-------------------------------+ >----------------------| Last 40 lines of dmesg buffer |----------------------< +-------------------------------+ [ 721.172598] [] do_last+0x296/0x1280 [ 721.174817] [] ? link_path_walk+0x27e/0x8c0 [ 721.177644] [] path_openat+0xcd/0x5b0 [ 721.180604] [] ? __handle_mm_fault+0x31d/0x720 [ 721.182741] [] do_filp_open+0x4d/0xb0 [ 721.184572] [] ? _raw_spin_unlock+0xe/0x20 [ 721.186743] [] ? __alloc_fd+0xc3/0x170 [ 721.188631] [] do_sys_open+0x124/0x220 [ 721.190012] [] ? system_call_after_swapgs+0x96/0x13a [ 721.192542] [] SyS_open+0x1e/0x20 [ 721.194283] [] system_call_fastpath+0x1f/0x24 [ 721.196414] [] ? system_call_after_swapgs+0xa2/0x13a [ 721.198666] INFO: task lfs:9838 blocked for more than 120 seconds. [ 721.200902] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message. [ 721.203773] lfs D ffff8800a956ec58 12928 9838 1 0x00000004 [ 721.206290] Call Trace: [ 721.208273] [] schedule_preempt_disabled+0x39/0x90 [ 721.210294] [] __mutex_lock_slowpath+0x13a/0x340 [ 721.213374] [] mutex_lock+0x2d/0x40 [ 721.215043] [] do_last+0x296/0x1280 [ 721.217127] [] ? link_path_walk+0x27e/0x8c0 [ 721.219098] [] path_openat+0xcd/0x5b0 [ 721.221248] [] ? __handle_mm_fault+0x31d/0x720 [ 721.223716] [] do_filp_open+0x4d/0xb0 [ 721.225493] [] ? _raw_spin_unlock+0xe/0x20 [ 721.227409] [] ? __alloc_fd+0xc3/0x170 [ 721.230023] [] do_sys_open+0x124/0x220 [ 721.231568] [] ? system_call_after_swapgs+0x96/0x13a [ 721.234023] [] SyS_open+0x1e/0x20 [ 721.235907] [] system_call_fastpath+0x1f/0x24 [ 721.239178] [] ? system_call_after_swapgs+0xa2/0x13a [ 732.653148] LustreError: 11-0: lustre-MDT0000-mdc-ffff8800b6eb4800: operation ldlm_enqueue to node 192.168.202.134@tcp failed: rc = -107 [ 732.657649] LustreError: Skipped 7 previous similar messages [ 732.658245] Lustre: lustre-MDT0000-mdc-ffff8800b6eb4800: Connection to lustre-MDT0000 (at 192.168.202.134@tcp) was lost; in progress operations using this service will wait for recovery to complete [ 732.660738] LustreError: 167-0: lustre-MDT0000-mdc-ffff8800b6eb4800: This client was evicted by lustre-MDT0000; in progress operations using this service will fail. [ 732.663027] LustreError: 30435:0:(file.c:5393:ll_inode_revalidate_fini()) lustre: revalidate FID [0x240000402:0x45bb:0x0] error: rc = -5 [ 732.663029] LustreError: 30435:0:(file.c:5393:ll_inode_revalidate_fini()) Skipped 407 previous similar messages [ 732.675088] LustreError: 30806:0:(file.c:246:ll_close_inode_openhandle()) lustre-clilmv-ffff8800b6eb4800: inode [0x200000403:0x498b:0x0] mdc close failed: rc = -108 [ 732.680084] LustreError: 30806:0:(file.c:246:ll_close_inode_openhandle()) Skipped 15 previous similar messages [ 732.716708] Lustre: lustre-MDT0000-mdc-ffff8800b6eb4800: Connection restored to (at 192.168.202.134@tcp) ****************************************************************************** ************************ A Summary Of Problems Found ************************* ****************************************************************************** -------------------- A list of all +++WARNING+++ messages -------------------- PARTIAL DUMP with size(vmcore) < 25% size(RAM) There are 3 threads running in their own namespaces Use 'taskinfo --ns' to get more details Possible hang Run 'hanginfo' to get more details ------------------------------------------------------------------------------ ** Execution took 11.96s (real) 6.04s (CPU), Child processes: 5.87s