************************ crashinfo ************************* /exports/testreports/38859/testresults/racer-ldiskfs-DNE-centos7_x86_64-centos7_x86_64/oleg328-client-timeout-core (3.10.0-7.9-debug) +==========================+ | *** Crashinfo v1.3.7 *** | +==========================+ +++WARNING+++ PARTIAL DUMP with size(vmcore) < 25% size(RAM) KERNEL: /tmp/crash-anaysis.n6ICU/vmlinux [TAINTED] DUMPFILE: /exports/testreports/38859/testresults/racer-ldiskfs-DNE-centos7_x86_64-centos7_x86_64/oleg328-client-timeout-core [PARTIAL DUMP] CPUS: 4 DATE: Fri Jan 19 12:05:45 EST 2024 UPTIME: 00:12:04 LOAD AVERAGE: 51.13, 125.01, 97.30 TASKS: 328 NODENAME: oleg328-client.virtnet RELEASE: 3.10.0-7.9-debug VERSION: #1 SMP Sat Mar 26 23:28:42 EDT 2022 MACHINE: x86_64 (2399 Mhz) MEMORY: 4 GB PANIC: "" +--------------------------+ >------------------------| Per-cpu Stacks ('bt -a') |------------------------< +--------------------------+ -- CPU#0 -- PID=10009 CPU=0 CMD=file_rename.sh #-1 copy_pte_range+0x1d9, 449 bytes of data #0 copy_page_range+0x40e #1 dup_mm+0x4af #2 copy_process+0x156e #3 do_fork+0x91 #4 sys_clone+0x16 #5 stub_clone+0x44 #6 system_call_fastpath+0x1f, 477 bytes of data -- CPU#1 -- PID=1066 CPU=1 CMD=dd #-1 ldlm_res_hop_keycmp+0x20, 449 bytes of data #0 cfs_hash_bd_lookup_intent+0x57 #1 cfs_hash_bd_lookup_locked+0x16 #2 ldlm_resource_get+0x7a #3 ldlm_lock_match_with_skip+0xbf #4 osc_enqueue_base+0x109 #5 osc_lock_enqueue+0x381 #6 cl_lock_enqueue+0x62 #7 lov_lock_enqueue+0x99 #8 cl_lock_enqueue+0x62 #9 cl_lock_request+0x67 #10 cl_io_lock+0xf6 #11 cl_io_loop+0x8d #12 cl_setattr_ost+0x2af #13 ll_setattr_raw+0x121e #14 ll_setattr+0x56 #15 notify_change+0x309 #16 do_truncate+0x75 #17 do_sys_ftruncate.constprop.17+0x120 #18 sys_ftruncate+0xe #19 system_call_fastpath+0x1f, 477 bytes of data -- CPU#2 -- PID=2085 CPU=2 CMD=dir_remote.sh #-1 copy_strings+0x2ce, 449 bytes of data #0 do_execve_common+0x57a #1 sys_execve+0x29 #2 stub_execve+0x48, 477 bytes of data -- CPU#3 -- PID=2044 CPU=3 CMD=mkdir #-1 format_decode+0xca, 449 bytes of data #0 vsnprintf+0x68 #1 libcfs_debug_msg+0x9ed #2 _debug_req+0x5fb #3 ptlrpc_get_mod_rpc_slot+0xd0 #4 mdc_reint+0x1a #5 mdc_create+0x30d #6 lmv_create+0x555 #7 ll_new_node+0x290 #8 ll_mkdir+0xa9 #9 vfs_mkdir+0x109 #10 sys_mkdirat+0x6f #11 sys_mkdir+0x19 #12 system_call_fastpath+0x1f, 477 bytes of data +--------------------------------+ >---------------------| How This Dump Has Been Created |---------------------< +--------------------------------+ Cannot identify the specific condition that triggered vmcore +++WARNING+++ High Load Averages: 51.13, 125.01, 97.30 +---------------+ >------------------------------| Tasks Summary |------------------------------< +---------------+ Number of Threads That Ran Recently ----------------------------------- last second 176 last 5s 185 last 60s 196 ----- Total Numbers of Threads per State ------ TASK_INTERRUPTIBLE 270 TASK_RUNNING 16 TASK_UNINTERRUPTIBLE 39 +++WARNING+++ There are 3 threads running in their own namespaces Use 'taskinfo --ns' to get more details +-----------------------+ >--------------------------| 5 Most Recent Threads |--------------------------< +-----------------------+ PID CMD Age ARGS ----- -------------- ------ ---------------------------- 2044 mkdir 0 ms mkdir -p /mnt/lustre/racer0/17 17/ 1066 dd 0 ms dd if=/dev/zero of=/mnt/lustre2/racer1/19/14 bs=64k count=0 seek=203 10006 dir_create.sh 0 ms /bin/bash ./dir_create.sh /mnt/lustre/racer0 20 1862 ptlrpcd_01_00 0 ms (no user stack) 10981 ldlm_cb01_002 0 ms (no user stack) +------------------------+ >-------------------------| Memory Usage (kmem -i) |-------------------------< +------------------------+ PAGES TOTAL PERCENTAGE TOTAL MEM 955079 3.6 GB ---- FREE 713393 2.7 GB 74% of TOTAL MEM USED 241686 944.1 MB 25% of TOTAL MEM SHARED 16078 62.8 MB 1% of TOTAL MEM BUFFERS 5145 20.1 MB 0% of TOTAL MEM CACHED 53543 209.2 MB 5% of TOTAL MEM SLAB 128098 500.4 MB 13% of TOTAL MEM TOTAL HUGE 0 0 ---- HUGE FREE 0 0 0% of TOTAL HUGE TOTAL SWAP 262143 1024 MB ---- SWAP USED 0 0 0% of TOTAL SWAP SWAP FREE 262143 1024 MB 100% of TOTAL SWAP COMMIT LIMIT 739682 2.8 GB ---- COMMITTED 92794 362.5 MB 12% of TOTAL LIMIT +-------------------------------+ >----------------------| Scheduler Runqueues (per CPU) |----------------------< +-------------------------------+ ---+ CPU=0 ---- | CURRENT TASK , CMD=file_rename.sh 1904 dd 0.03542 2022 ll_agl_1908 0.02018 2083 dir_create.sh 0.00000 2086 file_rename.sh 0.00000 ---+ CPU=1 ---- | CURRENT TASK , CMD=dd 31966 dd 0.21628 1940 touch 0.05722 2084 dir_remote.sh 0.00000 ---+ CPU=2 ---- | CURRENT TASK , CMD=dir_remote.sh 10030 dir_create.sh 3.69374 2087 file_rename.sh 0.00000 ---+ CPU=3 ---- | CURRENT TASK , CMD=mkdir 9 rcu_sched 6.23466 1958 lfs 0.04005 +------------------------+ >-------------------------| Network Status Summary |-------------------------< +------------------------+ TCP Connection Info ------------------- ESTABLISHED 6 LISTEN 3 NAGLE disabled (TCP_NODELAY): 5 user_data set (NFS etc.): 4 UDP Connection Info ------------------- 2 UDP sockets, 0 in ESTABLISHED Unix Connection Info ------------------------ ESTABLISHED 26 CLOSE 18 LISTEN 8 Raw sockets info -------------------- CLOSE 1 Interfaces Info --------------- How long ago (in seconds) interfaces transmitted/received? Name RX TX ---- ---------- --------- lo n/a 722.4 eth0 n/a 0.0 RSS_TOTAL=292664 pages, %mem= 2.5 +------------+ >-------------------------------| Mounted FS |-------------------------------< +------------+ MOUNT SUPERBLK TYPE DEVNAME DIRNAME ffff880138cca000 ffff880139940800 rootfs rootfs / ffff88012a9d6000 ffff88012aa88000 sysfs sysfs /sys ffff88012a9d61c0 ffff880139944000 proc proc /proc ffff88012a9d6380 ffff880137668000 devtmpfs devtmpfs /dev ffff88012a9d6540 ffff880137668800 securityfs securityfs /sys/kernel/security ffff88012a9d6700 ffff88012aa88800 tmpfs tmpfs /dev/shm ffff88012a9d68c0 ffff880137315000 devpts devpts /dev/pts ffff88012a9d6a80 ffff88012aa89000 tmpfs tmpfs /run ffff88012a9d6c40 ffff88012aa89800 tmpfs tmpfs /sys/fs/cgroup ffff88012a9d6e00 ffff88012aa8a000 cgroup cgroup /sys/fs/cgroup/systemd ffff88012a9d6fc0 ffff88012aa8a800 pstore pstore /sys/fs/pstore ffff88012a9d7180 ffff88012aa8c800 cgroup cgroup /sys/fs/cgroup/memory ffff88012a9d7340 ffff88012aa8c000 cgroup cgroup /sys/fs/cgroup/pids ffff88012a9d7500 ffff88012aa8b800 cgroup cgroup /sys/fs/cgroup/cpu,cpuacct ffff88012a9d76c0 ffff88012aa8b000 cgroup cgroup /sys/fs/cgroup/hugetlb ffff88012a9d7880 ffff88012aa8d000 cgroup cgroup /sys/fs/cgroup/devices ffff88012a9d7a40 ffff88012aa8d800 cgroup cgroup /sys/fs/cgroup/freezer ffff88012a9d7c00 ffff88012aa8e000 cgroup cgroup /sys/fs/cgroup/net_cls,net_prio ffff88012a9d7dc0 ffff88012aa8e800 cgroup cgroup /sys/fs/cgroup/blkio ffff88012aa60000 ffff88012aa8f000 cgroup cgroup /sys/fs/cgroup/cpuset ffff88012aa601c0 ffff88012aa8f800 cgroup cgroup /sys/fs/cgroup/perf_event ffff880137652700 ffff88013766b800 configfs configfs /sys/kernel/config ffff88012aa60380 ffff8800b6d1a800 ext4 /dev/nbd0 / ffff88012aa60540 ffff88013766f800 rpc_pipefs rpc_pipefs /var/lib/nfs/rpc_pipefs ffff880137652c40 ffff8800b6722800 autofs systemd-1 /proc/sys/fs/binfmt_misc ffff88012aa60700 ffff88013700a000 mqueue mqueue /dev/mqueue ffff880138ccb340 ffff8800b6c95800 hugetlbfs hugetlbfs /dev/hugepages ffff880137652e00 ffff880139947800 debugfs debugfs /sys/kernel/debug ffff880137652fc0 ffff8800b6721800 binfmt_misc binfmt_misc /proc/sys/fs/binfmt_misc/ ffff88012aa608c0 ffff8800b5902000 ramfs none /mnt ffff880138ccb500 ffff8800b6c97800 tmpfs none /var/lib/stateless/writable ffff880137653340 ffff8800b6725800 squashfs /dev/vda /home/green/git/lustre-release ffff88012aa60a80 ffff8800b6c97800 tmpfs none /var/cache/man ffff880137653500 ffff8800b6c97800 tmpfs none /var/log ffff8801377e0540 ffff8800b6c97800 tmpfs none /var/lib/dbus ffff88012aa60c40 ffff8800b6c97800 tmpfs none /tmp ffff88012aa60e00 ffff8800b6c97800 tmpfs none /var/lib/dhclient ffff8801377e0700 ffff8800b6c97800 tmpfs none /var/tmp ffff8801376536c0 ffff8800b6c97800 tmpfs none /var/lib/NetworkManager ffff880137653880 ffff8800b6c97800 tmpfs none /var/lib/systemd/random-seed ffff880138ccb6c0 ffff8800b6c97800 tmpfs none /var/spool ffff880137653a40 ffff8800b6c97800 tmpfs none /var/lib/nfs ffff88012aa60fc0 ffff8800b6c97800 tmpfs none /var/lib/gssproxy ffff880137653c00 ffff8800b6c97800 tmpfs none /var/lib/logrotate ffff8801377e08c0 ffff8800b6c97800 tmpfs none /etc ffff88012aa61180 ffff8800b6c97800 tmpfs none /var/lib/rsyslog ffff8801377e0a80 ffff8800b6c97800 tmpfs none /var/lib/dhclient/var/lib/dhclient ffff88012aa61340 ffff8800b675b000 nfs4 192.168.200.253:/exports/state/oleg328-client.virtnet /var/lib/stateless/state ffff88012aa61a40 ffff8800b675b000 nfs4 192.168.200.253:/exports/state/oleg328-client.virtnet /boot ffff88012aa61c00 ffff8800b675b000 nfs4 192.168.200.253:/exports/state/oleg328-client.virtnet /etc/etc/kdump.conf ffff880137653dc0 ffff88013766f800 rpc_pipefs sunrpc /var/lib/nfs/var/lib/nfs/rpc_pipefs ffff8800aecbc1c0 ffff8800b6d1c800 nfs4 192.168.200.253://exports/testreports/38859/testresults/racer-ldiskfs-DNE-centos7_x86_64-centos7_x86_64 /tmp/tmp/testlogs ffff8800b5b3a380 ffff8800b67e7800 tmpfs tmpfs /run/user/0 ffff8800aecbc380 ffff8800b6725800 squashfs /dev/vda /usr/sbin/mount.lustre ffff8800b35741c0 ffff88012a5fa800 lustre 192.168.203.128@tcp:/lustre /mnt/lustre ffff8800aecbd180 ffff8800a99d2000 lustre 192.168.203.128@tcp:/lustre /mnt/lustre2 +-------------------------------+ >----------------------| Last 40 lines of dmesg buffer |----------------------< +-------------------------------+ [ 645.995334] LustreError: 32370:0:(lov_object.c:185:lov_init_sub()) [0x0, 0x50000): { 0x0BD10BD0, 131075, 0x1, 0, 0x10, 1, 65536 } [ 645.997936] LustreError: 32370:0:(lov_object.c:185:lov_init_sub()) header@ffff88006e2ae0c8[0x4, 1, [0x2c0000400:0x71a:0x0] hash]{ [ 645.999965] LustreError: 32370:0:(lov_object.c:185:lov_init_sub()) ....lovsub@ffff88006e2ae148[131072] [ 646.001696] LustreError: 32370:0:(lov_object.c:185:lov_init_sub()) ....osc@ffff8801368884b0id: 0x2c0000400:1818 idx: 1 gen: 0 kms_valid: 0 kms 0 rc: 0 force_sync: 0 min_xid: 0 size: 0 mtime: 1705683634 atime: 1705683634 ctime: 1705683634 blocks: 0 [ 646.005812] LustreError: 32370:0:(lov_object.c:185:lov_init_sub()) } header@ffff88006e2ae0c8 [ 646.007467] LustreError: 32370:0:(lov_object.c:185:lov_init_sub()) [0x50000, 0xffffffffffffffff): { 0x0BD10BD0, 131076, 0x1, 65535, 0x0, 1, 2097152 } [ 646.009865] LustreError: 32370:0:(lov_object.c:185:lov_init_sub()) [ 646.011184] LustreError: 32370:0:(lov_object.c:185:lov_init_sub()) } header@ffff88012c898e10 [ 646.012666] LustreError: 32370:0:(lov_object.c:185:lov_init_sub()) owned. [ 646.014096] LustreError: 32370:0:(lov_object.c:186:lov_init_sub()) header@ffff88008caf1ed8[0x0, 1, [0x200000402:0x37c4:0x0] hash] [ 646.016069] LustreError: 32370:0:(lov_object.c:186:lov_init_sub()) try to own. [ 646.018040] LustreError: 32370:0:(lcommon_cl.c:196:cl_file_inode_init()) lustre: failed to initialize cl_object [0x200000402:0x37c4:0x0]: rc = -5 [ 646.020502] LustreError: 32370:0:(lcommon_cl.c:196:cl_file_inode_init()) Skipped 184 previous similar messages [ 646.022323] LustreError: 32370:0:(llite_lib.c:3629:ll_prep_inode()) new_inode -fatal: rc -5 [ 646.023746] LustreError: 32370:0:(llite_lib.c:3629:ll_prep_inode()) Skipped 184 previous similar messages [ 658.799150] LustreError: 4719:0:(lov_object.c:184:lov_init_sub()) header@ffff88012c898bb8[0x4, 2, [0x280000400:0x712:0x0] hash]{ [ 658.801769] LustreError: 4719:0:(lov_object.c:184:lov_init_sub()) ....lovsub@ffff88012c898c38[0] [ 658.803907] LustreError: 4719:0:(lov_object.c:184:lov_init_sub()) ....osc@ffff880072a00e10id: 0x280000400:1810 idx: 0 gen: 0 kms_valid: 0 kms 0 rc: 0 force_sync: 0 min_xid: 0 size: 0 mtime: 0 atime: 0 ctime: 0 blocks: 0 [ 658.808956] LustreError: 4719:0:(lov_object.c:184:lov_init_sub()) } header@ffff88012c898bb8 [ 658.811289] LustreError: 4719:0:(lov_object.c:184:lov_init_sub()) stripe 0 is already owned. [ 658.813266] LustreError: 4719:0:(lov_object.c:185:lov_init_sub()) header@ffff88012c898e10[0x4, 1, [0x240000404:0xe71:0x0] hash]{ [ 658.816131] LustreError: 4719:0:(lov_object.c:185:lov_init_sub()) ....vvp@ffff88012c898e90(0) inode: ffff88012e7fd8c8 162129603832319601/37748740 100644 1 0 ffff88012c898e90 [0x240000404:0xe71:0x0] [ 658.820780] LustreError: 4719:0:(lov_object.c:185:lov_init_sub()) ....lov@ffff88008280c658entries: 4, valid, lsm{ffff880134dcd080 0x0BD60BD0 1 4}: [ 658.824966] LustreError: 4719:0:(lov_object.c:185:lov_init_sub()) [0x0, 0x50000): { 0x0BD10BD0, 65537, 0x1, 0, 0x10, 1, 65536 } [ 658.828329] LustreError: 4719:0:(lov_object.c:185:lov_init_sub()) header@ffff88012c898bb8[0x4, 2, [0x280000400:0x712:0x0] hash]{ [ 658.831499] LustreError: 4719:0:(lov_object.c:185:lov_init_sub()) ....lovsub@ffff88012c898c38[0] [ 658.833835] LustreError: 4719:0:(lov_object.c:185:lov_init_sub()) ....osc@ffff880072a00e10id: 0x280000400:1810 idx: 0 gen: 0 kms_valid: 0 kms 0 rc: 0 force_sync: 0 min_xid: 0 size: 0 mtime: 0 atime: 0 ctime: 0 blocks: 0 [ 658.838464] LustreError: 4719:0:(lov_object.c:185:lov_init_sub()) } header@ffff88012c898bb8 [ 658.840334] LustreError: 4719:0:(lov_object.c:185:lov_init_sub()) [0x50000, 0xffffffffffffffff): { 0x0BD10BD0, 65538, 0x1, 65535, 0x0, 1, 2097152 } [ 658.843210] LustreError: 4719:0:(lov_object.c:185:lov_init_sub()) [0x0, 0x50000): { 0x0BD10BD0, 131075, 0x1, 0, 0x10, 1, 65536 } [ 658.845733] LustreError: 4719:0:(lov_object.c:185:lov_init_sub()) header@ffff88006e2ae0c8[0x4, 1, [0x2c0000400:0x71a:0x0] hash]{ [ 658.848375] LustreError: 4719:0:(lov_object.c:185:lov_init_sub()) ....lovsub@ffff88006e2ae148[131072] [ 658.850442] LustreError: 4719:0:(lov_object.c:185:lov_init_sub()) ....osc@ffff8801368884b0id: 0x2c0000400:1818 idx: 1 gen: 0 kms_valid: 0 kms 0 rc: 0 force_sync: 0 min_xid: 0 size: 0 mtime: 1705683634 atime: 1705683634 ctime: 1705683634 blocks: 0 [ 658.855831] LustreError: 4719:0:(lov_object.c:185:lov_init_sub()) } header@ffff88006e2ae0c8 [ 658.858445] LustreError: 4719:0:(lov_object.c:185:lov_init_sub()) [0x50000, 0xffffffffffffffff): { 0x0BD10BD0, 131076, 0x1, 65535, 0x0, 1, 2097152 } [ 658.863164] LustreError: 4719:0:(lov_object.c:185:lov_init_sub()) [ 658.864814] LustreError: 4719:0:(lov_object.c:185:lov_init_sub()) } header@ffff88012c898e10 [ 658.867224] LustreError: 4719:0:(lov_object.c:185:lov_init_sub()) owned. [ 658.869310] LustreError: 4719:0:(lov_object.c:186:lov_init_sub()) header@ffff880071fb4bb8[0x0, 1, [0x200000402:0x37c4:0x0] hash] [ 658.872701] LustreError: 4719:0:(lov_object.c:186:lov_init_sub()) try to own. ****************************************************************************** ************************ A Summary Of Problems Found ************************* ****************************************************************************** -------------------- A list of all +++WARNING+++ messages -------------------- PARTIAL DUMP with size(vmcore) < 25% size(RAM) High Load Averages: 51.13, 125.01, 97.30 There are 3 threads running in their own namespaces Use 'taskinfo --ns' to get more details ------------------------------------------------------------------------------ ** Execution took 11.77s (real) 6.29s (CPU), Child processes: 5.47s