************************ crashinfo ************************* /exports/testreports/38859/testresults/racer-special8-ldiskfs-DNE-centos7_x86_64-centos7_x86_64/oleg414-server-vmcore (3.10.0-7.9-debug) +==========================+ | *** Crashinfo v1.3.7 *** | +==========================+ +++WARNING+++ PARTIAL DUMP with size(vmcore) < 25% size(RAM) KERNEL: /tmp/crash-anaysis.fy7kP/vmlinux [TAINTED] DUMPFILE: /exports/testreports/38859/testresults/racer-special8-ldiskfs-DNE-centos7_x86_64-centos7_x86_64/oleg414-server-vmcore [PARTIAL DUMP] CPUS: 4 DATE: Fri Jan 19 12:02:16 EST 2024 UPTIME: 00:08:28 LOAD AVERAGE: 7.02, 3.61, 1.58 TASKS: 430 NODENAME: oleg414-server.virtnet RELEASE: 3.10.0-7.9-debug VERSION: #1 SMP Sat Mar 26 23:28:42 EDT 2022 MACHINE: x86_64 (2399 Mhz) MEMORY: 4 GB PANIC: "Kernel panic - not syncing: LBUG" +--------------------------+ >------------------------| Per-cpu Stacks ('bt -a') |------------------------< +--------------------------+ -- CPU#0 -- PID=0 CPU=0 CMD=swapper/0 #0 crash_nmi_callback+0x31 #1 nmi_handle+0x97 #2 do_nmi+0x12c #3 end_repeat_nmi+0x1e #-1 native_safe_halt+0xb, 507 bytes of data #4 native_safe_halt+0xb #5 default_idle+0x1e #6 default_enter_idle+0x45 #7 cpuidle_enter_state+0x40 #8 cpuidle_idle_call+0xd8 #9 arch_cpu_idle+0xe #10 cpu_startup_entry+0x14a #11 rest_init+0x8e #12 start_kernel+0x456 #13 x86_64_start_reservations+0x2a #14 x86_64_start_kernel+0x152 #15 start_cpu+0x5 -- CPU#1 -- PID=14226 CPU=1 CMD=ll_ost_io00_004 #0 crash_nmi_callback+0x31 #1 nmi_handle+0x97 #2 do_nmi+0x12c #3 end_repeat_nmi+0x1e #-1 native_queued_spin_lock_slowpath+0x1d, 507 bytes of data #4 native_queued_spin_lock_slowpath+0x1d #5 do_raw_spin_lock+0x6d #6 _raw_spin_lock+0x1e #7 cfs_hash_spin_lock+0x9 #8 ldlm_resource_get+0x6a #9 ldlm_resource_prolong+0x6e #10 ofd_prolong_extent_locks+0x12d #11 ofd_punch_hpreq_check+0xb8 #12 ofd_punch_hpreq_fini+0xe #13 ptlrpc_server_hpreq_fini+0x47 #14 ptlrpc_server_finish_active_request+0x88 #15 ptlrpc_server_handle_request+0x424 #16 ptlrpc_main+0xc37 #17 kthread+0xe4 #18 ret_from_fork_nospec_begin+0x7 -- CPU#2 -- PID=14212 CPU=2 CMD=mdt00_011 #0 machine_kexec+0x19e #1 __crash_kexec+0x72 #2 panic+0xf3 #3 lbug_with_loc+0x9b #4 osd_write+0x929 #5 dt_record_write+0x33 #6 llog_osd_write_rec+0xfdb #7 llog_write_rec+0x290 #8 llog_cat_add_rec+0x1d9 #9 llog_add+0x17f #10 osp_sync_add+0x1f9 #11 osp_attr_set+0x3cd #12 lod_sub_attr_set+0x1d7 #13 lod_obj_stripe_attr_set_cb+0x40 #14 lod_obj_for_each_stripe+0x12d #15 lod_attr_set+0x51a #16 lod_layout_change+0x367 #17 dt_layout_change+0x1a140 #18 mdd_layout_change+0x1712 #19 mdt_layout_change+0x2bf #20 mdt_intent_layout+0x910 #21 mdt_intent_opc+0x1c8 #22 mdt_intent_policy+0xfa #23 ldlm_lock_enqueue+0x3b1 #24 ldlm_handle_enqueue+0x359 #25 tgt_enqueue+0x68 #26 tgt_request_handle+0x74e #27 ptlrpc_server_handle_request+0x26e #28 ptlrpc_main+0xc37 #29 kthread+0xe4 #30 ret_from_fork_nospec_begin+0x7 -- CPU#3 -- PID=14223 CPU=3 CMD=ll_ost_io00_003 #0 crash_nmi_callback+0x31 #1 nmi_handle+0x97 #2 do_nmi+0x12c #3 end_repeat_nmi+0x1e #-1 memcmp+0xf, 507 bytes of data #4 memcmp+0xf #5 ldlm_res_hop_keycmp+0x17 #6 cfs_hash_bd_lookup_intent+0x57 #7 cfs_hash_bd_lookup_locked+0x16 #8 ldlm_resource_get+0x7a #9 ldlm_resource_prolong+0x6e #10 ofd_prolong_extent_locks+0x12d #11 ofd_punch_hpreq_check+0xb8 #12 ptlrpc_server_request_add+0x96 #13 ptlrpc_server_handle_req_in+0x667 #14 ptlrpc_main+0xbb0 #15 kthread+0xe4 #16 ret_from_fork_nospec_begin+0x7 +--------------------------------+ >---------------------| How This Dump Has Been Created |---------------------< +--------------------------------+ *** Panic *** BUG: 73 of 73 active objects replaced +---------------+ >------------------------------| Tasks Summary |------------------------------< +---------------+ Number of Threads That Ran Recently ----------------------------------- last second 172 last 5s 261 last 60s 286 ----- Total Numbers of Threads per State ------ TASK_INTERRUPTIBLE 411 TASK_RUNNING 7 TASK_UNINTERRUPTIBLE 9 +++WARNING+++ There are 3 threads running in their own namespaces Use 'taskinfo --ns' to get more details +-----------------------+ >--------------------------| 5 Most Recent Threads |--------------------------< +-----------------------+ PID CMD Age ARGS ----- -------------- ------ ---------------------------- 8813 ll_ost_io00_001 0 ms (no user stack) 15121 ll_ost_io00_013 0 ms (no user stack) 9 rcu_sched 0 ms (no user stack) 15120 ll_ost_io00_012 0 ms (no user stack) 14272 ll_ost_io00_006 0 ms (no user stack) +------------------------+ >-------------------------| Memory Usage (kmem -i) |-------------------------< +------------------------+ PAGES TOTAL PERCENTAGE TOTAL MEM 955067 3.6 GB ---- FREE 663790 2.5 GB 69% of TOTAL MEM USED 291277 1.1 GB 30% of TOTAL MEM SHARED 33865 132.3 MB 3% of TOTAL MEM BUFFERS 31666 123.7 MB 3% of TOTAL MEM CACHED 68366 267.1 MB 7% of TOTAL MEM SLAB 61013 238.3 MB 6% of TOTAL MEM TOTAL HUGE 0 0 ---- HUGE FREE 0 0 0% of TOTAL HUGE TOTAL SWAP 262143 1024 MB ---- SWAP USED 0 0 0% of TOTAL SWAP SWAP FREE 262143 1024 MB 100% of TOTAL SWAP COMMIT LIMIT 739676 2.8 GB ---- COMMITTED 58988 230.4 MB 7% of TOTAL LIMIT +++WARNING+++ 2 processes in UNINTERRUPTIBLE state are committing journal +-------------------------------+ >----------------------| Scheduler Runqueues (per CPU) |----------------------< +-------------------------------+ ---+ CPU=0 ---- | CURRENT TASK , CMD=swapper/0 ---+ CPU=1 ---- | CURRENT TASK , CMD=ll_ost_io00_004 3075 socknal_sd00_02 19.21931 ---+ CPU=2 ---- | CURRENT TASK , CMD=mdt00_011 ---+ CPU=3 ---- | CURRENT TASK , CMD=ll_ost_io00_003 3073 socknal_sd00_00 19.02379 +------------------------+ >-------------------------| Network Status Summary |-------------------------< +------------------------+ TCP Connection Info ------------------- ESTABLISHED 5 LISTEN 3 NAGLE disabled (TCP_NODELAY): 4 user_data set (NFS etc.): 4 UDP Connection Info ------------------- 2 UDP sockets, 0 in ESTABLISHED Unix Connection Info ------------------------ ESTABLISHED 26 CLOSE 17 LISTEN 8 Raw sockets info -------------------- ESTABLISHED 1 Interfaces Info --------------- How long ago (in seconds) interfaces transmitted/received? Name RX TX ---- ---------- --------- lo n/a 506.1 eth0 n/a 0.0 RSS_TOTAL=50292 pages, %mem= 0.8 +------------+ >-------------------------------| Mounted FS |-------------------------------< +------------+ MOUNT SUPERBLK TYPE DEVNAME DIRNAME ffff880138cca000 ffff880139940800 rootfs rootfs / ffff88012a356000 ffff88012a358000 sysfs sysfs /sys ffff88012a3561c0 ffff880139944000 proc proc /proc ffff88012a356380 ffff880137668000 devtmpfs devtmpfs /dev ffff88012a356540 ffff8800b521b800 securityfs securityfs /sys/kernel/security ffff88012a356700 ffff88012a358800 tmpfs tmpfs /dev/shm ffff88012a3568c0 ffff880137023000 devpts devpts /dev/pts ffff88012a356a80 ffff88012a359000 tmpfs tmpfs /run ffff88012a356c40 ffff88012a359800 tmpfs tmpfs /sys/fs/cgroup ffff88012a356e00 ffff88012a35a000 cgroup cgroup /sys/fs/cgroup/systemd ffff88012a356fc0 ffff88012a35a800 pstore pstore /sys/fs/pstore ffff88012a357180 ffff88012a35c800 cgroup cgroup /sys/fs/cgroup/perf_event ffff88012a357340 ffff88012a35c000 cgroup cgroup /sys/fs/cgroup/hugetlb ffff88012a357500 ffff88012a35b800 cgroup cgroup /sys/fs/cgroup/net_cls,net_prio ffff88012a3576c0 ffff88012a35b000 cgroup cgroup /sys/fs/cgroup/cpu,cpuacct ffff88012a357880 ffff88012a35d000 cgroup cgroup /sys/fs/cgroup/devices ffff88012a357a40 ffff88012a35d800 cgroup cgroup /sys/fs/cgroup/freezer ffff88012a357c00 ffff88012a35e000 cgroup cgroup /sys/fs/cgroup/memory ffff88012a357dc0 ffff88012a35e800 cgroup cgroup /sys/fs/cgroup/blkio ffff88012a372000 ffff88012a35f000 cgroup cgroup /sys/fs/cgroup/pids ffff88012a3721c0 ffff88012a35f800 cgroup cgroup /sys/fs/cgroup/cpuset ffff880138ccb180 ffff880129c5a800 configfs configfs /sys/kernel/config ffff880137652e00 ffff88012a315000 ext4 /dev/nbd0 / ffff8800b45b4000 ffff8800b521d800 rpc_pipefs rpc_pipefs /var/lib/nfs/rpc_pipefs ffff880137652fc0 ffff880137024000 mqueue mqueue /dev/mqueue ffff88012a372380 ffff880139947800 debugfs debugfs /sys/kernel/debug ffff880138ccb340 ffff8800b47cf800 hugetlbfs hugetlbfs /dev/hugepages ffff880137653180 ffff8800b40cd800 autofs systemd-1 /proc/sys/fs/binfmt_misc ffff88012a372540 ffff8800b40de800 binfmt_misc binfmt_misc /proc/sys/fs/binfmt_misc/ ffff88012a372700 ffff8800b45fc000 ramfs none /mnt ffff88012a3728c0 ffff8800b40ce800 squashfs /dev/vda /home/green/git/lustre-release ffff8800b45b48c0 ffff880129c5d000 tmpfs none /var/lib/stateless/writable ffff880137653340 ffff880129c5d000 tmpfs none /var/cache/man ffff880138ccb500 ffff880129c5d000 tmpfs none /var/log ffff880137653500 ffff880129c5d000 tmpfs none /var/lib/dbus ffff88012a372a80 ffff880129c5d000 tmpfs none /tmp ffff8801376536c0 ffff880129c5d000 tmpfs none /var/lib/dhclient ffff880138ccb6c0 ffff880129c5d000 tmpfs none /var/tmp ffff880137653880 ffff880129c5d000 tmpfs none /var/lib/NetworkManager ffff880138ccb880 ffff880129c5d000 tmpfs none /var/lib/systemd/random-seed ffff880137653a40 ffff880129c5d000 tmpfs none /var/spool ffff880137653c00 ffff880129c5d000 tmpfs none /var/lib/nfs ffff88012a372c40 ffff880129c5d000 tmpfs none /var/lib/gssproxy ffff88012a372e00 ffff880129c5d000 tmpfs none /var/lib/logrotate ffff8800b45b4a80 ffff880129c5d000 tmpfs none /etc ffff8800b45b4c40 ffff880129c5d000 tmpfs none /var/lib/rsyslog ffff88012a372fc0 ffff880129c5d000 tmpfs none /var/lib/dhclient/var/lib/dhclient ffff8800b45b4e00 ffff8800b5265800 nfs4 192.168.200.253:/exports/state/oleg414-server.virtnet /var/lib/stateless/state ffff880137653dc0 ffff8800b5265800 nfs4 192.168.200.253:/exports/state/oleg414-server.virtnet /boot ffff880138ccba40 ffff8800b5265800 nfs4 192.168.200.253:/exports/state/oleg414-server.virtnet /etc/etc/kdump.conf ffff88012a373340 ffff8800b521d800 rpc_pipefs sunrpc /var/lib/nfs/var/lib/nfs/rpc_pipefs ffff88012a373180 ffff8800b40ce800 squashfs /dev/vda /usr/sbin/mount.lustre ffff8800b206d500 ffff8800a9167800 lustre /dev/mapper/mds1_flakey /mnt/lustre-mds1 ffff8800b206d340 ffff88008d677800 lustre /dev/mapper/mds2_flakey /mnt/lustre-mds2 ffff8800b206ce00 ffff88012b8e8800 lustre /dev/mapper/ost1_flakey /mnt/lustre-ost1 ffff8800b206c000 ffff880131196000 lustre /dev/mapper/ost2_flakey /mnt/lustre-ost2 +-------------------------------+ >----------------------| Last 40 lines of dmesg buffer |----------------------< +-------------------------------+ [ 509.229128] [] osd_write+0x929/0xcc0 [osd_ldiskfs] [ 509.231208] [] dt_record_write+0x33/0x130 [obdclass] [ 509.237797] [] llog_osd_write_rec+0xfdb/0x1c80 [obdclass] [ 509.242424] [] llog_write_rec+0x290/0x590 [obdclass] [ 509.244206] [] llog_cat_add_rec+0x1d9/0xa50 [obdclass] [ 509.246663] [] ? fld_cache_lookup+0xae/0x1e0 [fld] [ 509.249417] [] llog_add+0x17f/0x1f0 [obdclass] [ 509.252224] [] osp_sync_add+0x1f9/0x760 [osp] [ 509.256553] [] osp_attr_set+0x3cd/0x680 [osp] [ 509.261043] [] ? lod_sub_get_thandle+0x2c7/0x450 [lod] [ 509.265331] [] lod_sub_attr_set+0x1d7/0x500 [lod] [ 509.271190] [] ? osd_attr_set+0x287/0xb00 [osd_ldiskfs] [ 509.273565] [] lod_obj_stripe_attr_set_cb+0x40/0x100 [lod] [ 509.277388] [] lod_obj_for_each_stripe+0x12d/0x310 [lod] [ 509.281169] [] lod_attr_set+0x51a/0xb60 [lod] [ 509.283338] [] ? lod_gen_component_id+0x210/0x210 [lod] [ 509.285649] [] lod_layout_change+0x367/0x3f0 [lod] [ 509.288148] [] ? osd_write_lock+0x5f/0xc0 [osd_ldiskfs] [ 509.290380] [] dt_layout_change+0x20/0xc0 [mdd] [ 509.292911] [] mdd_layout_change+0x1712/0x1db0 [mdd] [ 509.295535] [] mdt_layout_change+0x2bf/0x450 [mdt] [ 509.298179] [] mdt_intent_layout+0x910/0xeb0 [mdt] [ 509.300919] [] mdt_intent_opc+0x1c8/0xc50 [mdt] [ 509.307291] [] ? mdt_intent_open+0x480/0x480 [mdt] [ 509.312441] [] mdt_intent_policy+0xfa/0x460 [mdt] [ 509.315591] [] ldlm_lock_enqueue+0x3b1/0xbb0 [ptlrpc] [ 509.321303] [] ? cfs_hash_rw_unlock+0x15/0x20 [libcfs] [ 509.323804] [] ? cfs_hash_add+0xa6/0x180 [libcfs] [ 509.326231] [] ldlm_handle_enqueue+0x359/0x17c0 [ptlrpc] [ 509.333437] [] ? lustre_msg_buf_v2+0x140/0x1f0 [ptlrpc] [ 509.335833] [] tgt_enqueue+0x68/0x240 [ptlrpc] [ 509.338923] [] tgt_request_handle+0x74e/0x19d0 [ptlrpc] [ 509.341139] [] ptlrpc_server_handle_request+0x26e/0xcf0 [ptlrpc] [ 509.343984] [] ptlrpc_main+0xc37/0x16d0 [ptlrpc] [ 509.348026] [] ? __switch_to+0xcd/0x4e0 [ 509.349627] [] ? ptlrpc_wait_event+0x5e0/0x5e0 [ptlrpc] [ 509.351432] [] kthread+0xe4/0xf0 [ 509.353332] [] ? kthread_create_on_node+0x140/0x140 [ 509.355944] [] ret_from_fork_nospec_begin+0x7/0x21 [ 509.358433] [] ? kthread_create_on_node+0x140/0x140 ****************************************************************************** ************************ A Summary Of Problems Found ************************* ****************************************************************************** -------------------- A list of all +++WARNING+++ messages -------------------- PARTIAL DUMP with size(vmcore) < 25% size(RAM) There are 3 threads running in their own namespaces Use 'taskinfo --ns' to get more details 2 processes in UNINTERRUPTIBLE state are committing journal ------------------------------------------------------------------------------ ** Execution took 12.48s (real) 7.28s (CPU), Child processes: 5.20s