************************ crashinfo *************************
/exports/testreports/38859/testresults/racer-special2-zfs-DNE-centos7_x86_64-centos7_x86_64/oleg348-client-timeout-core (3.10.0-7.9-debug)

                         +==========================+
                         | *** Crashinfo v1.3.7 *** |
                         +==========================+
+++WARNING+++ PARTIAL DUMP with size(vmcore) < 25% size(RAM)
      KERNEL: /tmp/crash-anaysis.FBUOS/vmlinux  [TAINTED]
    DUMPFILE: /exports/testreports/38859/testresults/racer-special2-zfs-DNE-centos7_x86_64-centos7_x86_64/oleg348-client-timeout-core  [PARTIAL DUMP]
        CPUS: 4
        DATE: Fri Jan 19 13:00:55 EST 2024
      UPTIME: 01:07:12
LOAD AVERAGE: 24.00, 24.01, 26.27
       TASKS: 182
    NODENAME: oleg348-client.virtnet
     RELEASE: 3.10.0-7.9-debug
     VERSION: #1 SMP Sat Mar 26 23:28:42 EDT 2022
     MACHINE: x86_64  (2399 Mhz)
      MEMORY: 4 GB
       PANIC: ""

                         +--------------------------+
>------------------------| Per-cpu Stacks ('bt -a') |------------------------<
                         +--------------------------+

      -- CPU#0 --
PID=0  CPU=0 CMD=swapper/0
  #-1 native_safe_halt+0xb, 449 bytes of data
  #0  default_idle+0x1e
  #1  default_enter_idle+0x45
  #2  cpuidle_enter_state+0x40
  #3  cpuidle_idle_call+0xd8
  #4  arch_cpu_idle+0xe
  #5  cpu_startup_entry+0x14a
  #6  rest_init+0x8e
  #7  start_kernel+0x456
  #8  x86_64_start_reservations+0x2a
  #9  x86_64_start_kernel+0x152
  #10 start_cpu+0x5

      -- CPU#1 --
PID=0  CPU=1 CMD=swapper/1
  #-1 native_safe_halt+0xb, 449 bytes of data
  #0  default_idle+0x1e
  #1  default_enter_idle+0x45
  #2  cpuidle_enter_state+0x40
  #3  cpuidle_idle_call+0xd8
  #4  arch_cpu_idle+0xe
  #5  cpu_startup_entry+0x14a
  #6  start_secondary+0x1eb
  #7  start_cpu+0x5

      -- CPU#2 --
PID=0  CPU=2 CMD=swapper/2
  #-1 native_safe_halt+0xb, 449 bytes of data
  #0  default_idle+0x1e
  #1  default_enter_idle+0x45
  #2  cpuidle_enter_state+0x40
  #3  cpuidle_idle_call+0xd8
  #4  arch_cpu_idle+0xe
  #5  cpu_startup_entry+0x14a
  #6  start_secondary+0x1eb
  #7  start_cpu+0x5

      -- CPU#3 --
PID=0  CPU=3 CMD=swapper/3
  #-1 native_safe_halt+0xb, 449 bytes of data
  #0  default_idle+0x1e
  #1  default_enter_idle+0x45
  #2  cpuidle_enter_state+0x40
  #3  cpuidle_idle_call+0xd8
  #4  arch_cpu_idle+0xe
  #5  cpu_startup_entry+0x14a
  #6  start_secondary+0x1eb
  #7  start_cpu+0x5

                      +--------------------------------+
>---------------------| How This Dump Has Been Created |---------------------<
                      +--------------------------------+

Cannot identify the specific condition that triggered vmcore

                               +---------------+
>------------------------------| Tasks Summary |------------------------------<
                               +---------------+

Number of Threads That Ran Recently
-----------------------------------
   last second      14
   last     5s      31
   last    60s      38

 ----- Total Numbers of Threads per State ------
  TASK_INTERRUPTIBLE                         154
  TASK_RUNNING                                 1
  TASK_UNINTERRUPTIBLE                        24

+++WARNING+++ There are 3 threads running in their own namespaces
        Use 'taskinfo --ns' to get more details

                           +-----------------------+
>--------------------------| 5 Most Recent Threads |--------------------------<
                           +-----------------------+

  PID  CMD              Age     ARGS
-----  --------------   ------  ----------------------------
 1856  socknal_sd01_01    0 ms  (no user stack)
   47  kworker/1:1        0 ms  (no user stack)
 1852  socknal_reaper     0 ms  (no user stack)
   13  watchdog/0         0 ms  (no user stack)
 5354  ldlm_bl_01        55 ms  (no user stack)

                          +------------------------+
>-------------------------| Memory Usage (kmem -i) |-------------------------<
                          +------------------------+

                 PAGES        TOTAL      PERCENTAGE
    TOTAL MEM   955079       3.6 GB         ----
         FREE   795425         3 GB   83% of TOTAL MEM
         USED   159654     623.6 MB   16% of TOTAL MEM
       SHARED    18109      70.7 MB    1% of TOTAL MEM
      BUFFERS     5146      20.1 MB    0% of TOTAL MEM
       CACHED    55906     218.4 MB    5% of TOTAL MEM
         SLAB    60281     235.5 MB    6% of TOTAL MEM

   TOTAL HUGE        0            0         ----
    HUGE FREE        0            0    0% of TOTAL HUGE

   TOTAL SWAP   262143      1024 MB         ----
    SWAP USED        0            0    0% of TOTAL SWAP
    SWAP FREE   262143      1024 MB  100% of TOTAL SWAP

 COMMIT LIMIT   739682       2.8 GB         ----
    COMMITTED    67648     264.2 MB    9% of TOTAL LIMIT

+++three oldest UNINTERRUPTIBLE threads
   ... ran 3464s ago
PID=16189  CPU=2 CMD=mrename
  #0  __schedule+0x2e2
  #1  schedule_preempt_disabled+0x39
  #2  __mutex_lock_slowpath+0x13a
  #3  mutex_lock+0x2d
  #4  lock_rename+0x31
  #5  SYSC_renameat2+0x22f
  #6  sys_renameat2+0xe
  #7  sys_rename+0x1e
  #8  system_call_fastpath+0x1f, 477 bytes of data
   ... ran 3464s ago
PID=16184  CPU=2 CMD=mrename
  #0  __schedule+0x2e2
  #1  schedule_preempt_disabled+0x39
  #2  __mutex_lock_slowpath+0x13a
  #3  mutex_lock+0x2d
  #4  lock_rename+0x31
  #5  SYSC_renameat2+0x22f
  #6  sys_renameat2+0xe
  #7  sys_rename+0x1e
  #8  system_call_fastpath+0x1f, 477 bytes of data
   ... ran 3464s ago
PID=16198  CPU=0 CMD=mrename
  #0  __schedule+0x2e2
  #1  schedule_preempt_disabled+0x39
  #2  __mutex_lock_slowpath+0x13a
  #3  mutex_lock+0x2d
  #4  lock_rename+0x31
  #5  SYSC_renameat2+0x22f
  #6  sys_renameat2+0xe
  #7  sys_rename+0x1e
  #8  system_call_fastpath+0x1f, 477 bytes of data

                       +-------------------------------+
>----------------------| Scheduler Runqueues (per CPU) |----------------------<
                       +-------------------------------+

  ---+ CPU=0 ----
     | CURRENT TASK , CMD=swapper/0
  ---+ CPU=1 ----
     | CURRENT TASK , CMD=swapper/1
  ---+ CPU=2 ----
     | CURRENT TASK , CMD=swapper/2
  ---+ CPU=3 ----
     | CURRENT TASK , CMD=swapper/3

                          +------------------------+
>-------------------------| Network Status Summary |-------------------------<
                          +------------------------+

TCP Connection Info
-------------------
        ESTABLISHED      6
             LISTEN      3
                        NAGLE disabled (TCP_NODELAY):     5
                        user_data set (NFS etc.):         4

UDP Connection Info
-------------------
  2 UDP sockets, 0 in ESTABLISHED

Unix Connection Info
------------------------
        ESTABLISHED     26
              CLOSE     18
             LISTEN      8

Raw sockets info
--------------------
              CLOSE      1

Interfaces Info
---------------
How long ago (in seconds) interfaces transmitted/received?
  Name        RX          TX
  ----    ----------   ---------
  lo         n/a         4029.6
  eth0       n/a            1.9

RSS_TOTAL=100512 pages, %mem=    1.4
+++WARNING+++ Possible hang
+++WARNING+++     Run 'hanginfo' to get more details

                                +------------+
>-------------------------------| Mounted FS |-------------------------------<
                                +------------+

     MOUNT           SUPERBLK     TYPE        DEVNAME     DIRNAME
ffff880138cca000 ffff880139940800 rootfs      rootfs      /
ffff88012aa9e000 ffff88012aa48000 sysfs       sysfs       /sys
ffff88012aa9e1c0 ffff880139944000 proc        proc        /proc
ffff88012aa9e380 ffff880137668000 devtmpfs    devtmpfs    /dev
ffff88012aa9e540 ffff88012a9e2000 securityfs  securityfs  /sys/kernel/security
ffff88012aa9e700 ffff88012aa48800 tmpfs       tmpfs       /dev/shm
ffff88012aa9e8c0 ffff88013717f000 devpts      devpts      /dev/pts
ffff88012aa9ea80 ffff88012aa49000 tmpfs       tmpfs       /run
ffff88012aa9ec40 ffff88012aa49800 tmpfs       tmpfs       /sys/fs/cgroup
ffff88012aa9ee00 ffff88012aa4a000 cgroup      cgroup      /sys/fs/cgroup/systemd
ffff88012aa9efc0 ffff88012aa4a800 pstore      pstore      /sys/fs/pstore
ffff88012aa9f180 ffff88012aa4c800 cgroup      cgroup      /sys/fs/cgroup/cpuset
ffff88012aa9f340 ffff88012aa4c000 cgroup      cgroup      /sys/fs/cgroup/blkio
ffff88012aa9f500 ffff88012aa4b800 cgroup      cgroup      /sys/fs/cgroup/freezer
ffff88012aa9f6c0 ffff88012aa4b000 cgroup      cgroup      /sys/fs/cgroup/net_cls,net_prio
ffff88012aa9f880 ffff88012aa4d000 cgroup      cgroup      /sys/fs/cgroup/devices
ffff88012aa9fa40 ffff88012aa4d800 cgroup      cgroup      /sys/fs/cgroup/perf_event
ffff88012aa9fc00 ffff88012aa4e000 cgroup      cgroup      /sys/fs/cgroup/cpu,cpuacct
ffff88012aa9fdc0 ffff88012aa4e800 cgroup      cgroup      /sys/fs/cgroup/pids
ffff88012aa22000 ffff88012aa4f000 cgroup      cgroup      /sys/fs/cgroup/hugetlb
ffff88012aa221c0 ffff88012aa4f800 cgroup      cgroup      /sys/fs/cgroup/memory
ffff88012aa22540 ffff8800b6d2c000 configfs    configfs    /sys/kernel/config
ffff88012aa22700 ffff8800b6d2f000 ext4        /dev/nbd0   /
ffff88012aa228c0 ffff8800b6d29000 rpc_pipefs  rpc_pipefs  /var/lib/nfs/rpc_pipefs
ffff8801376528c0 ffff88012a519000 autofs      systemd-1   /proc/sys/fs/binfmt_misc
ffff8800b6d92000 ffff88012a4b1000 hugetlbfs   hugetlbfs   /dev/hugepages
ffff8800b6d921c0 ffff88012b268000 mqueue      mqueue      /dev/mqueue
ffff88012aa22a80 ffff880139947800 debugfs     debugfs     /sys/kernel/debug
ffff88012aa22c40 ffff8800b60d5800 binfmt_misc binfmt_misc /proc/sys/fs/binfmt_misc/
ffff88012aa22fc0 ffff88012abfd800 ramfs       none        /mnt
ffff88012aa23180 ffff88012a51c000 tmpfs       none        /var/lib/stateless/writable
ffff880137652a80 ffff88012abfa000 squashfs    /dev/vda    /home/green/git/lustre-release
ffff880137652c40 ffff88012a51c000 tmpfs       none        /var/cache/man
ffff88012aa23340 ffff88012a51c000 tmpfs       none        /var/log
ffff8800b6d92380 ffff88012a51c000 tmpfs       none        /var/lib/dbus
ffff8800b6d92540 ffff88012a51c000 tmpfs       none        /tmp
ffff88012aa23500 ffff88012a51c000 tmpfs       none        /var/lib/dhclient
ffff880137652e00 ffff88012a51c000 tmpfs       none        /var/tmp
ffff88012aa236c0 ffff88012a51c000 tmpfs       none        /var/lib/NetworkManager
ffff8800b6d92700 ffff88012a51c000 tmpfs       none        /var/lib/systemd/random-seed
ffff88012aa23880 ffff88012a51c000 tmpfs       none        /var/spool
ffff8800b6d928c0 ffff88012a51c000 tmpfs       none        /var/lib/nfs
ffff8800b6d92a80 ffff88012a51c000 tmpfs       none        /var/lib/gssproxy
ffff880137652fc0 ffff88012a51c000 tmpfs       none        /var/lib/logrotate
ffff88012aa23a40 ffff88012a51c000 tmpfs       none        /etc
ffff880137653180 ffff88012a51c000 tmpfs       none        /var/lib/rsyslog
ffff880138ccb6c0 ffff88012a51c000 tmpfs       none        /var/lib/dhclient/var/lib/dhclient
ffff8800b6d92c40 ffff88012a4b5000 nfs4        192.168.200.253:/exports/state/oleg348-client.virtnet /var/lib/stateless/state
ffff88012aa23c00 ffff88012a4b5000 nfs4        192.168.200.253:/exports/state/oleg348-client.virtnet /boot
ffff880137653340 ffff88012a4b5000 nfs4        192.168.200.253:/exports/state/oleg348-client.virtnet /etc/etc/kdump.conf
ffff880137653500 ffff8800b6d29000 rpc_pipefs  sunrpc      /var/lib/nfs/var/lib/nfs/rpc_pipefs
ffff8800b3eb8c40 ffff88012abff000 nfs4        192.168.200.253://exports/testreports/38859/testresults/racer-special2-zfs-DNE-centos7_x86_64-centos7_x86_64 /tmp/tmp/testlogs
ffff8800b3e20700 ffff8800af198800 tmpfs       tmpfs       /run/user/0
ffff8800b3eb9340 ffff88012abfa000 squashfs    /dev/vda    /usr/sbin/mount.lustre
ffff880138ccba40 ffff8800b6d2f800 lustre      192.168.203.148@tcp:/lustre /mnt/lustre
ffff880138ccbc00 ffff88012d4ce000 lustre      192.168.203.148@tcp:/lustre /mnt/lustre2

                       +-------------------------------+
>----------------------| Last 40 lines of dmesg buffer |----------------------<
                       +-------------------------------+

[ 721.029507] [] user_path_at_empty+0x67/0xc0
[ 721.030566] [] user_path_at+0x11/0x20
[ 721.032029] [] vfs_fstatat+0x63/0xc0
[ 721.033407] [] SYSC_newstat+0x2e/0x60
[ 721.034654] [] ? system_call_after_swapgs+0xa2/0x13a
[ 721.036587] [] ? system_call_after_swapgs+0x96/0x13a
[ 721.038287] [] ? system_call_after_swapgs+0xa2/0x13a
[ 721.040281] [] ? system_call_after_swapgs+0x96/0x13a
[ 721.042265] [] ? system_call_after_swapgs+0xa2/0x13a
[ 721.044189] [] ? system_call_after_swapgs+0x96/0x13a
[ 721.045796] [] ? system_call_after_swapgs+0xa2/0x13a
[ 721.047568] [] ? system_call_after_swapgs+0x96/0x13a
[ 721.049495] [] ? system_call_after_swapgs+0xa2/0x13a
[ 721.051231] [] ? system_call_after_swapgs+0x96/0x13a
[ 721.053048] [] ? system_call_after_swapgs+0xa2/0x13a
[ 721.054966] [] SyS_newstat+0xe/0x10
[ 721.056263] [] system_call_fastpath+0x1f/0x24
[ 721.057813] [] ? system_call_after_swapgs+0xa2/0x13a
[ 721.059344] INFO: task mrename:16377 blocked for more than 120 seconds.
[ 721.060771] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[ 721.062731] mrename D ffff88012f300000 12648 16377 30450 0x00000000
[ 721.065134] Call Trace:
[ 721.065977] [] schedule_preempt_disabled+0x39/0x90
[ 721.067883] [] __mutex_lock_slowpath+0x13a/0x340
[ 721.069436] [] mutex_lock+0x2d/0x40
[ 721.070632] [] lock_rename+0xc0/0xe0
[ 721.072256] [] SYSC_renameat2+0x22f/0x570
[ 721.073907] [] ? handle_mm_fault+0xc2/0x150
[ 721.075316] [] ? system_call_after_swapgs+0xa2/0x13a
[ 721.077037] [] ? system_call_after_swapgs+0x96/0x13a
[ 721.078692] [] ? system_call_after_swapgs+0xa2/0x13a
[ 721.080475] [] ? system_call_after_swapgs+0x96/0x13a
[ 721.081964] [] SyS_renameat2+0xe/0x10
[ 721.083107] [] SyS_rename+0x1e/0x20
[ 721.084023] [] system_call_fastpath+0x1f/0x24
[ 721.085630] [] ? system_call_after_swapgs+0xa2/0x13a
[ 735.679846] LustreError: 11-0: lustre-MDT0001-mdc-ffff88012d4ce000: operation ldlm_enqueue to node 192.168.203.148@tcp failed: rc = -107
[ 735.685211] Lustre: lustre-MDT0001-mdc-ffff88012d4ce000: Connection to lustre-MDT0001 (at 192.168.203.148@tcp) was lost; in progress operations using this service will wait for recovery to complete
[ 735.696673] LustreError: 167-0: lustre-MDT0001-mdc-ffff88012d4ce000: This client was evicted by lustre-MDT0001; in progress operations using this service will fail.
[ 735.759763] Lustre: lustre-MDT0001-mdc-ffff88012d4ce000: Connection restored to (at 192.168.203.148@tcp)

******************************************************************************
************************ A Summary Of Problems Found *************************
******************************************************************************
-------------------- A list of all +++WARNING+++ messages --------------------
    PARTIAL DUMP with size(vmcore) < 25% size(RAM)
    There are 3 threads running in their own namespaces
        Use 'taskinfo --ns' to get more details
    Possible hang
        Run 'hanginfo' to get more details
------------------------------------------------------------------------------

 ** Execution took 11.92s (real) 5.90s (CPU), Child processes: 5.94s