************************ crashinfo ************************* /exports/testreports/38859/testresults/racer-ldiskfs-DNE-centos7_x86_64-centos7_x86_64-retry1/oleg343-client-timeout-core (3.10.0-7.9-debug) +==========================+ | *** Crashinfo v1.3.7 *** | +==========================+ +++WARNING+++ PARTIAL DUMP with size(vmcore) < 25% size(RAM) KERNEL: /tmp/crash-anaysis.zUatq/vmlinux [TAINTED] DUMPFILE: /exports/testreports/38859/testresults/racer-ldiskfs-DNE-centos7_x86_64-centos7_x86_64-retry1/oleg343-client-timeout-core [PARTIAL DUMP] CPUS: 4 DATE: Fri Jan 19 12:05:42 EST 2024 UPTIME: 00:11:58 LOAD AVERAGE: 76.70, 154.96, 113.70 TASKS: 267 NODENAME: oleg343-client.virtnet RELEASE: 3.10.0-7.9-debug VERSION: #1 SMP Sat Mar 26 23:28:42 EDT 2022 MACHINE: x86_64 (2399 Mhz) MEMORY: 4 GB PANIC: "" +--------------------------+ >------------------------| Per-cpu Stacks ('bt -a') |------------------------< +--------------------------+ -- CPU#0 -- PID=0 CPU=0 CMD=swapper/0 #-1 native_safe_halt+0xb, 449 bytes of data #0 default_idle+0x1e #1 default_enter_idle+0x45 #2 cpuidle_enter_state+0x40 #3 cpuidle_idle_call+0xd8 #4 arch_cpu_idle+0xe #5 cpu_startup_entry+0x14a #6 rest_init+0x8e #7 start_kernel+0x456 #8 x86_64_start_reservations+0x2a #9 x86_64_start_kernel+0x152 #10 start_cpu+0x5 -- CPU#1 -- PID=0 CPU=1 CMD=swapper/1 #-1 native_safe_halt+0xb, 449 bytes of data #0 default_idle+0x1e #1 default_enter_idle+0x45 #2 cpuidle_enter_state+0x40 #3 cpuidle_idle_call+0xd8 #4 arch_cpu_idle+0xe #5 cpu_startup_entry+0x14a #6 start_secondary+0x1eb #7 start_cpu+0x5 -- CPU#2 -- PID=0 CPU=2 CMD=swapper/2 #-1 native_safe_halt+0xb, 449 bytes of data #0 default_idle+0x1e #1 default_enter_idle+0x45 #2 cpuidle_enter_state+0x40 #3 cpuidle_idle_call+0xd8 #4 arch_cpu_idle+0xe #5 cpu_startup_entry+0x14a #6 start_secondary+0x1eb #7 start_cpu+0x5 -- CPU#3 -- PID=0 CPU=3 CMD=swapper/3 #-1 native_safe_halt+0xb, 449 bytes of data #0 default_idle+0x1e #1 default_enter_idle+0x45 #2 cpuidle_enter_state+0x40 #3 cpuidle_idle_call+0xd8 #4 arch_cpu_idle+0xe #5 cpu_startup_entry+0x14a #6 start_secondary+0x1eb #7 start_cpu+0x5 +--------------------------------+ >---------------------| How This Dump Has Been Created |---------------------< +--------------------------------+ Cannot identify the specific condition that triggered vmcore +++WARNING+++ High Load Averages: 76.70, 154.96, 113.70 +---------------+ >------------------------------| Tasks Summary |------------------------------< +---------------+ Number of Threads That Ran Recently ----------------------------------- last second 18 last 5s 27 last 60s 37 ----- Total Numbers of Threads per State ------ TASK_INTERRUPTIBLE 189 TASK_RUNNING 1 TASK_UNINTERRUPTIBLE 74 +++WARNING+++ There are 3 threads running in their own namespaces Use 'taskinfo --ns' to get more details +-----------------------+ >--------------------------| 5 Most Recent Threads |--------------------------< +-----------------------+ PID CMD Age ARGS ----- -------------- ------ ---------------------------- 533 kworker/2:2 0 ms (no user stack) 1846 monitor_thread 0 ms (no user stack) 1852 ptlrpcd_01_01 0 ms (no user stack) 5217 ldlm_bl_01 54 ms (no user stack) 1847 lnet_discovery 60 ms (no user stack) +------------------------+ >-------------------------| Memory Usage (kmem -i) |-------------------------< +------------------------+ PAGES TOTAL PERCENTAGE TOTAL MEM 955079 3.6 GB ---- FREE 820070 3.1 GB 85% of TOTAL MEM USED 135009 527.4 MB 14% of TOTAL MEM SHARED 26908 105.1 MB 2% of TOTAL MEM BUFFERS 5145 20.1 MB 0% of TOTAL MEM CACHED 64193 250.8 MB 6% of TOTAL MEM SLAB 22406 87.5 MB 2% of TOTAL MEM TOTAL HUGE 0 0 ---- HUGE FREE 0 0 0% of TOTAL HUGE TOTAL SWAP 262143 1024 MB ---- SWAP USED 0 0 0% of TOTAL SWAP SWAP FREE 262143 1024 MB 100% of TOTAL SWAP COMMIT LIMIT 739682 2.8 GB ---- COMMITTED 107471 419.8 MB 14% of TOTAL LIMIT +++three oldest UNINTERRUPTIBLE threads ... ran 515s ago PID=25185 CPU=2 CMD=fallocate #0 __schedule+0x2e2 #1 schedule_preempt_disabled+0x39 #2 __mutex_lock_slowpath+0x13a #3 mutex_lock+0x2d #4 ima_file_check+0xa6 #5 do_last+0x587 #6 path_openat+0xcd #7 do_filp_open+0x4d #8 do_sys_open+0x124 #9 sys_open+0x1e #10 system_call_fastpath+0x1f, 477 bytes of data ... ran 515s ago PID=24784 CPU=0 CMD=file_concat.sh #0 __schedule+0x2e2 #1 schedule_preempt_disabled+0x39 #2 __mutex_lock_slowpath+0x13a #3 mutex_lock+0x2d #4 ima_file_check+0xa6 #5 do_last+0x587 #6 path_openat+0xcd #7 do_filp_open+0x4d #8 do_sys_open+0x124 #9 sys_open+0x1e #10 system_call_fastpath+0x1f, 477 bytes of data ... ran 515s ago PID=25209 CPU=0 CMD=lfs #0 __schedule+0x2e2 #1 schedule+0x29 #2 schedule_timeout+0x209 #3 wait_for_completion+0xfc #4 osc_io_data_version_end+0x59 #5 cl_io_end+0x5c #6 lov_io_end_wrapper+0xeb #7 lov_io_data_version_end+0x78 #8 cl_io_end+0x5c #9 cl_io_loop+0xad #10 ll_ioc_data_version+0x20b #11 ll_data_version+0x3c #12 ll_migrate+0xa12 #13 ll_dir_ioctl+0x6a0b #14 do_vfs_ioctl+0x40d #15 sys_ioctl+0xa1 #16 system_call_fastpath+0x1f, 477 bytes of data +-------------------------------+ >----------------------| Scheduler Runqueues (per CPU) |----------------------< +-------------------------------+ ---+ CPU=0 ---- | CURRENT TASK , CMD=swapper/0 ---+ CPU=1 ---- | CURRENT TASK , CMD=swapper/1 ---+ CPU=2 ---- | CURRENT TASK , CMD=swapper/2 ---+ CPU=3 ---- | CURRENT TASK , CMD=swapper/3 +------------------------+ >-------------------------| Network Status Summary |-------------------------< +------------------------+ TCP Connection Info ------------------- ESTABLISHED 6 LISTEN 3 NAGLE disabled (TCP_NODELAY): 5 user_data set (NFS etc.): 4 UDP Connection Info ------------------- 2 UDP sockets, 0 in ESTABLISHED Unix Connection Info ------------------------ ESTABLISHED 26 CLOSE 18 LISTEN 8 Raw sockets info -------------------- ESTABLISHED 1 Interfaces Info --------------- How long ago (in seconds) interfaces transmitted/received? Name RX TX ---- ---------- --------- lo n/a 716.4 eth0 n/a 0.7 RSS_TOTAL=228808 pages, %mem= 3.1 +++WARNING+++ Possible hang +++WARNING+++ Run 'hanginfo' to get more details +------------+ >-------------------------------| Mounted FS |-------------------------------< +------------+ MOUNT SUPERBLK TYPE DEVNAME DIRNAME ffff880138cca000 ffff880139940800 rootfs rootfs / ffff88012aaa2000 ffff88012aaa8000 sysfs sysfs /sys ffff88012aaa21c0 ffff880139944000 proc proc /proc ffff88012aaa2380 ffff880137668000 devtmpfs devtmpfs /dev ffff88012aaa2540 ffff8800b6c9c800 securityfs securityfs /sys/kernel/security ffff88012aaa2700 ffff88012aaa8800 tmpfs tmpfs /dev/shm ffff88012aaa28c0 ffff880137315000 devpts devpts /dev/pts ffff88012aaa2a80 ffff88012aaa9000 tmpfs tmpfs /run ffff88012aaa2c40 ffff88012aaa9800 tmpfs tmpfs /sys/fs/cgroup ffff88012aaa2e00 ffff88012aaaa000 cgroup cgroup /sys/fs/cgroup/systemd ffff88012aaa2fc0 ffff88012aaaa800 pstore pstore /sys/fs/pstore ffff88012aaa3180 ffff88012aaac800 cgroup cgroup /sys/fs/cgroup/memory ffff88012aaa3340 ffff88012aaac000 cgroup cgroup /sys/fs/cgroup/hugetlb ffff88012aaa3500 ffff88012aaab800 cgroup cgroup /sys/fs/cgroup/pids ffff88012aaa36c0 ffff88012aaab000 cgroup cgroup /sys/fs/cgroup/blkio ffff88012aaa3880 ffff88012aaad000 cgroup cgroup /sys/fs/cgroup/net_cls,net_prio ffff88012aaa3a40 ffff88012aaad800 cgroup cgroup /sys/fs/cgroup/perf_event ffff88012aaa3c00 ffff88012aaae000 cgroup cgroup /sys/fs/cgroup/freezer ffff88012aaa3dc0 ffff88012aaae800 cgroup cgroup /sys/fs/cgroup/cpu,cpuacct ffff88012aab2000 ffff88012aaaf000 cgroup cgroup /sys/fs/cgroup/cpuset ffff88012aab21c0 ffff88012aaaf800 cgroup cgroup /sys/fs/cgroup/devices ffff8801377de380 ffff8801377f2800 configfs configfs /sys/kernel/config ffff8801377de540 ffff8801377f4800 ext4 /dev/nbd0 / ffff880137652380 ffff8801377f6800 rpc_pipefs rpc_pipefs /var/lib/nfs/rpc_pipefs ffff88012aab28c0 ffff8800b63e9800 autofs systemd-1 /proc/sys/fs/binfmt_misc ffff880137652540 ffff880139947800 debugfs debugfs /sys/kernel/debug ffff8801377de700 ffff8800b6ff6000 hugetlbfs hugetlbfs /dev/hugepages ffff8801377de8c0 ffff880137316000 mqueue mqueue /dev/mqueue ffff8801377dea80 ffff8800b6ff4800 binfmt_misc binfmt_misc /proc/sys/fs/binfmt_misc/ ffff880137652700 ffff8800b58b1000 ramfs none /mnt ffff8801376528c0 ffff8800b58b1800 tmpfs none /var/lib/stateless/writable ffff880137652a80 ffff8800b63eb000 squashfs /dev/vda /home/green/git/lustre-release ffff880137652c40 ffff8800b58b1800 tmpfs none /var/cache/man ffff8801377dec40 ffff8800b58b1800 tmpfs none /var/log ffff880138ccba40 ffff8800b58b1800 tmpfs none /var/lib/dbus ffff880137652e00 ffff8800b58b1800 tmpfs none /tmp ffff880137652fc0 ffff8800b58b1800 tmpfs none /var/lib/dhclient ffff880137653180 ffff8800b58b1800 tmpfs none /var/tmp ffff880137653340 ffff8800b58b1800 tmpfs none /var/lib/NetworkManager ffff880137653500 ffff8800b58b1800 tmpfs none /var/lib/systemd/random-seed ffff88012aab2a80 ffff8800b58b1800 tmpfs none /var/spool ffff8801377dee00 ffff8800b58b1800 tmpfs none /var/lib/nfs ffff88012aab2c40 ffff8800b58b1800 tmpfs none /var/lib/gssproxy ffff8801377defc0 ffff8800b58b1800 tmpfs none /var/lib/logrotate ffff88012aab2e00 ffff8800b58b1800 tmpfs none /etc ffff8801376536c0 ffff8800b58b1800 tmpfs none /var/lib/rsyslog ffff8801377df180 ffff8800b58b1800 tmpfs none /var/lib/dhclient/var/lib/dhclient ffff88012aab2fc0 ffff8800b6381000 nfs4 192.168.200.253:/exports/state/oleg343-client.virtnet /var/lib/stateless/state ffff880137653880 ffff8800b6381000 nfs4 192.168.200.253:/exports/state/oleg343-client.virtnet /boot ffff8801377df340 ffff8800b6381000 nfs4 192.168.200.253:/exports/state/oleg343-client.virtnet /etc/etc/kdump.conf ffff88012aab36c0 ffff8801377f6800 rpc_pipefs sunrpc /var/lib/nfs/var/lib/nfs/rpc_pipefs ffff8800b5bfafc0 ffff8800b61d5800 nfs4 192.168.200.253://exports/testreports/38859/testresults/racer-ldiskfs-DNE-centos7_x86_64-centos7_x86_64-retry1 /tmp/tmp/testlogs ffff8800b43e4a80 ffff8800b61d1800 tmpfs tmpfs /run/user/0 ffff8800ab0fa000 ffff8800b63eb000 squashfs /dev/vda /usr/sbin/mount.lustre ffff880138ccbc00 ffff8800b63ed000 lustre 192.168.203.143@tcp:/lustre /mnt/lustre ffff8800b5bfb500 ffff8800aa5d2800 lustre 192.168.203.143@tcp:/lustre /mnt/lustre2 +-------------------------------+ >----------------------| Last 40 lines of dmesg buffer |----------------------< +-------------------------------+ [ 361.148759] INFO: task ls:24553 blocked for more than 120 seconds. [ 361.151002] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message. [ 361.153904] ls D ffff8800a6d50000 11840 24553 9475 0x00000000 [ 361.156657] Call Trace: [ 361.157296] [] schedule_preempt_disabled+0x39/0x90 [ 361.159942] [] __mutex_lock_slowpath+0x13a/0x340 [ 361.161929] [] mutex_lock+0x2d/0x40 [ 361.163923] [] do_last+0x296/0x1280 [ 361.165353] [] ? link_path_walk+0x27e/0x8c0 [ 361.167480] [] path_openat+0xcd/0x5b0 [ 361.170459] [] do_filp_open+0x4d/0xb0 [ 361.171830] [] ? _raw_spin_unlock+0xe/0x20 [ 361.173060] [] ? __alloc_fd+0xc3/0x170 [ 361.174347] [] do_sys_open+0x124/0x220 [ 361.176336] [] ? system_call_after_swapgs+0x96/0x13a [ 361.179820] [] SyS_openat+0x14/0x20 [ 361.183500] [] system_call_fastpath+0x1f/0x24 [ 361.185522] [] ? system_call_after_swapgs+0xa2/0x13a [ 361.188928] INFO: task ls:24554 blocked for more than 120 seconds. [ 361.190420] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message. [ 361.192253] ls D ffff8800a9290d80 11976 24554 9475 0x00000000 [ 361.195246] Call Trace: [ 361.196058] [] schedule_preempt_disabled+0x39/0x90 [ 361.197738] [] __mutex_lock_slowpath+0x13a/0x340 [ 361.200029] [] mutex_lock+0x2d/0x40 [ 361.201903] [] do_last+0x296/0x1280 [ 361.204713] [] ? link_path_walk+0x27e/0x8c0 [ 361.208578] [] path_openat+0xcd/0x5b0 [ 361.211651] [] do_filp_open+0x4d/0xb0 [ 361.215114] [] ? _raw_spin_unlock+0xe/0x20 [ 361.216421] [] ? __alloc_fd+0xc3/0x170 [ 361.218431] [] do_sys_open+0x124/0x220 [ 361.220268] [] ? system_call_after_swapgs+0x96/0x13a [ 361.222448] [] SyS_openat+0x14/0x20 [ 361.224525] [] system_call_fastpath+0x1f/0x24 [ 361.226603] [] ? system_call_after_swapgs+0xa2/0x13a [ 420.909219] LustreError: 11-0: lustre-MDT0000-mdc-ffff8800aa5d2800: operation mds_close to node 192.168.203.143@tcp failed: rc = -107 [ 420.913069] Lustre: lustre-MDT0000-mdc-ffff8800aa5d2800: Connection to lustre-MDT0000 (at 192.168.203.143@tcp) was lost; in progress operations using this service will wait for recovery to complete [ 420.918582] LustreError: 167-0: lustre-MDT0000-mdc-ffff8800aa5d2800: This client was evicted by lustre-MDT0000; in progress operations using this service will fail. [ 420.981187] Lustre: lustre-MDT0000-mdc-ffff8800aa5d2800: Connection restored to (at 192.168.203.143@tcp) ****************************************************************************** ************************ A Summary Of Problems Found ************************* ****************************************************************************** -------------------- A list of all +++WARNING+++ messages -------------------- PARTIAL DUMP with size(vmcore) < 25% size(RAM) High Load Averages: 76.70, 154.96, 113.70 There are 3 threads running in their own namespaces Use 'taskinfo --ns' to get more details Possible hang Run 'hanginfo' to get more details ------------------------------------------------------------------------------ ** Execution took 10.33s (real) 5.29s (CPU), Child processes: 5.03s