************************ crashinfo *************************
/exports/testreports/38859/testresults/racer-ldiskfs-DNE-centos7_x86_64-centos7_x86_64/oleg328-server-timeout-core (3.10.0-7.9-debug)

                         +==========================+
                         | *** Crashinfo v1.3.7 *** |
                         +==========================+

+++WARNING+++ PARTIAL DUMP with size(vmcore) < 25% size(RAM)
      KERNEL: /tmp/crash-anaysis.RQjcA/vmlinux [TAINTED]
    DUMPFILE: /exports/testreports/38859/testresults/racer-ldiskfs-DNE-centos7_x86_64-centos7_x86_64/oleg328-server-timeout-core [PARTIAL DUMP]
        CPUS: 4
        DATE: Fri Jan 19 12:05:46 EST 2024
      UPTIME: 00:11:59
LOAD AVERAGE: 23.06, 10.27, 4.29
       TASKS: 467
    NODENAME: oleg328-server.virtnet
     RELEASE: 3.10.0-7.9-debug
     VERSION: #1 SMP Sat Mar 26 23:28:42 EDT 2022
     MACHINE: x86_64 (2399 Mhz)
      MEMORY: 4 GB
       PANIC: ""

                         +--------------------------+
>------------------------| Per-cpu Stacks ('bt -a') |------------------------<
                         +--------------------------+

      -- CPU#0 --
PID=0 CPU=0 CMD=swapper/0
  #-1 native_safe_halt+0xb, 449 bytes of data
  #0  default_idle+0x1e
  #1  default_enter_idle+0x45
  #2  cpuidle_enter_state+0x40
  #3  cpuidle_idle_call+0xd8
  #4  arch_cpu_idle+0xe
  #5  cpu_startup_entry+0x14a
  #6  rest_init+0x8e
  #7  start_kernel+0x456
  #8  x86_64_start_reservations+0x2a
  #9  x86_64_start_kernel+0x152
  #10 start_cpu+0x5

      -- CPU#1 --
PID=0 CPU=1 CMD=swapper/1
  #-1 native_safe_halt+0xb, 449 bytes of data
  #0  default_idle+0x1e
  #1  default_enter_idle+0x45
  #2  cpuidle_enter_state+0x40
  #3  cpuidle_idle_call+0xd8
  #4  arch_cpu_idle+0xe
  #5  cpu_startup_entry+0x14a
  #6  start_secondary+0x1eb
  #7  start_cpu+0x5

      -- CPU#2 --
PID=0 CPU=2 CMD=swapper/2
  #-1 native_safe_halt+0xb, 449 bytes of data
  #0  default_idle+0x1e
  #1  default_enter_idle+0x45
  #2  cpuidle_enter_state+0x40
  #3  cpuidle_idle_call+0xd8
  #4  arch_cpu_idle+0xe
  #5  cpu_startup_entry+0x14a
  #6  start_secondary+0x1eb
  #7  start_cpu+0x5

      -- CPU#3 --
PID=0 CPU=3 CMD=swapper/3
  #-1 native_safe_halt+0xb, 449 bytes of data
  #0  default_idle+0x1e
  #1  default_enter_idle+0x45
  #2  cpuidle_enter_state+0x40
  #3  cpuidle_idle_call+0xd8
  #4  arch_cpu_idle+0xe
  #5  cpu_startup_entry+0x14a
  #6  start_secondary+0x1eb
  #7  start_cpu+0x5

                      +--------------------------------+
>---------------------| How This Dump Has Been Created |---------------------<
                      +--------------------------------+

Cannot identify the specific condition that triggered vmcore

                               +---------------+
>------------------------------| Tasks Summary |------------------------------<
                               +---------------+

Number of Threads That Ran Recently
-----------------------------------
   last second     176
   last     5s     267
   last    60s     302

 ----- Total Numbers of Threads per State ------
  TASK_INTERRUPTIBLE                         463
  TASK_RUNNING                                 1

+++WARNING+++ There are 3 threads running in their own namespaces
        Use 'taskinfo --ns' to get more details

                           +-----------------------+
>--------------------------| 5 Most Recent Threads |--------------------------<
                           +-----------------------+

  PID  CMD              Age     ARGS
-----  --------------   ------  ----------------------------
 3841  l2arc_feed        0 ms   (no user stack)
    9  rcu_sched         0 ms   (no user stack)
 7687  lquota_wb_lustr   0 ms   (no user stack)
 7686  lquota_wb_lustr  57 ms   (no user stack)
 3074  socknal_reaper   57 ms   (no user stack)

                          +------------------------+
>-------------------------| Memory Usage (kmem -i) |-------------------------<
                          +------------------------+

                 PAGES        TOTAL      PERCENTAGE
    TOTAL MEM   955067       3.6 GB         ----
         FREE   602455       2.3 GB   63% of TOTAL MEM
         USED   352612       1.3 GB   36% of TOTAL MEM
       SHARED    54650     213.5 MB    5% of TOTAL MEM
      BUFFERS    52464     204.9 MB    5% of TOTAL MEM
       CACHED    84973     331.9 MB    8% of TOTAL MEM
         SLAB    81243     317.4 MB    8% of TOTAL MEM

   TOTAL HUGE        0            0         ----
    HUGE FREE        0            0    0% of TOTAL HUGE

   TOTAL SWAP   262143      1024 MB         ----
    SWAP USED        0            0    0% of TOTAL SWAP
    SWAP FREE   262143      1024 MB  100% of TOTAL SWAP

 COMMIT LIMIT   739676       2.8 GB         ----
    COMMITTED    58985     230.4 MB    7% of TOTAL LIMIT

                       +-------------------------------+
>----------------------| Scheduler Runqueues (per CPU) |----------------------<
                       +-------------------------------+

  ---+ CPU=0 ----
     | CURRENT TASK , CMD=swapper/0
  ---+ CPU=1 ----
     | CURRENT TASK , CMD=swapper/1
  ---+ CPU=2 ----
     | CURRENT TASK , CMD=swapper/2
  ---+ CPU=3 ----
     | CURRENT TASK , CMD=swapper/3

                          +------------------------+
>-------------------------| Network Status Summary |-------------------------<
                          +------------------------+

TCP Connection Info
-------------------
        ESTABLISHED      5
             LISTEN      3
                        NAGLE disabled (TCP_NODELAY):     4
                        user_data set (NFS etc.):         4

  Unusual Situations:
    Doing Retransmission:            2 (run xportshow --retrans for details)

UDP Connection Info
-------------------
  2 UDP sockets, 0 in ESTABLISHED

Unix Connection Info
------------------------
        ESTABLISHED     26
              CLOSE     17
             LISTEN      8

Raw sockets info
--------------------
        ESTABLISHED      1

Interfaces Info
---------------
  How long ago (in seconds) interfaces transmitted/received?
          Name        RX          TX
          ----    ----------    ---------
            lo        n/a         716.7
          eth0        n/a           0.0

RSS_TOTAL=50848 pages, %mem= 0.8

                                +------------+
>-------------------------------| Mounted FS |-------------------------------<
                                +------------+

     MOUNT           SUPERBLK     TYPE         DEVNAME                  DIRNAME
ffff880138cca000 ffff880139940800 rootfs       rootfs                   /
ffff88012a9fc000 ffff88012a2a8000 sysfs        sysfs                    /sys
ffff88012a9fc1c0 ffff880139944000 proc         proc                     /proc
ffff88012a9fc380 ffff880137668000 devtmpfs     devtmpfs                 /dev
ffff88012a9fc540 ffff8800b5213800 securityfs   securityfs               /sys/kernel/security
ffff88012a9fc700 ffff88012a2a8800 tmpfs        tmpfs                    /dev/shm
ffff88012a9fc8c0 ffff8801377fa800 devpts       devpts                   /dev/pts
ffff88012a9fca80 ffff88012a2a9000 tmpfs        tmpfs                    /run
ffff88012a9fcc40 ffff88012a2a9800 tmpfs        tmpfs                    /sys/fs/cgroup
ffff88012a9fce00 ffff88012a2aa000 cgroup       cgroup                   /sys/fs/cgroup/systemd
ffff88012a9fcfc0 ffff88012a2aa800 pstore       pstore                   /sys/fs/pstore
ffff88012a9fd180 ffff88012a2ac800 cgroup       cgroup                   /sys/fs/cgroup/pids
ffff88012a9fd340 ffff88012a2ac000 cgroup       cgroup                   /sys/fs/cgroup/devices
ffff88012a9fd500 ffff88012a2ab800 cgroup       cgroup                   /sys/fs/cgroup/cpuset
ffff88012a9fd6c0 ffff88012a2ab000 cgroup       cgroup                   /sys/fs/cgroup/perf_event
ffff88012a9fd880 ffff88012a2ad000 cgroup       cgroup                   /sys/fs/cgroup/freezer
ffff88012a9fda40 ffff88012a2ad800 cgroup       cgroup                   /sys/fs/cgroup/hugetlb
ffff88012a9fdc00 ffff88012a2ae000 cgroup       cgroup                   /sys/fs/cgroup/blkio
ffff88012a9fddc0 ffff88012a2ae800 cgroup       cgroup                   /sys/fs/cgroup/net_cls,net_prio
ffff88012a346000 ffff88012a2af000 cgroup       cgroup                   /sys/fs/cgroup/cpu,cpuacct
ffff88012a3461c0 ffff88012a2af800 cgroup       cgroup                   /sys/fs/cgroup/memory
ffff880137652540 ffff8800b5216000 configfs     configfs                 /sys/kernel/config
ffff8801377e8a80 ffff880129cec000 ext4         /dev/nbd0                /
ffff8801377e8c40 ffff8800b51ef800 rpc_pipefs   rpc_pipefs               /var/lib/nfs/rpc_pipefs
ffff88012a3468c0 ffff8800b4075000 autofs       systemd-1                /proc/sys/fs/binfmt_misc
ffff88012a346a80 ffff880139947800 debugfs      debugfs                  /sys/kernel/debug
ffff880137652700 ffff8801377fb800 mqueue       mqueue                   /dev/mqueue
ffff88012a346c40 ffff8800b4073800 hugetlbfs    hugetlbfs                /dev/hugepages
ffff88012a346e00 ffff8800b4073000 binfmt_misc  binfmt_misc              /proc/sys/fs/binfmt_misc/
ffff8801377e8e00 ffff8800b4a28800 ramfs        none                     /mnt
ffff8801376528c0 ffff88012e12e000 squashfs     /dev/vda                 /home/green/git/lustre-release
ffff88012a347180 ffff8800b4074000 tmpfs        none                     /var/lib/stateless/writable
ffff88012a347340 ffff8800b4074000 tmpfs        none                     /var/cache/man
ffff880137652a80 ffff8800b4074000 tmpfs        none                     /var/log
ffff880137652c40 ffff8800b4074000 tmpfs        none                     /var/lib/dbus
ffff880137652e00 ffff8800b4074000 tmpfs        none                     /tmp
ffff880137652fc0 ffff8800b4074000 tmpfs        none                     /var/lib/dhclient
ffff88012a347500 ffff8800b4074000 tmpfs        none                     /var/tmp
ffff880137653180 ffff8800b4074000 tmpfs        none                     /var/lib/NetworkManager
ffff880138ccae00 ffff8800b4074000 tmpfs        none                     /var/lib/systemd/random-seed
ffff880137653340 ffff8800b4074000 tmpfs        none                     /var/spool
ffff880137653500 ffff8800b4074000 tmpfs        none                     /var/lib/nfs
ffff88012a3476c0 ffff8800b4074000 tmpfs        none                     /var/lib/gssproxy
ffff88012a347880 ffff8800b4074000 tmpfs        none                     /var/lib/logrotate
ffff880138ccafc0 ffff8800b4074000 tmpfs        none                     /etc
ffff88012a347a40 ffff8800b4074000 tmpfs        none                     /var/lib/rsyslog
ffff8801376536c0 ffff8800b4074000 tmpfs        none                     /var/lib/dhclient/var/lib/dhclient
ffff8801377e8fc0 ffff8800b28fc800 nfs4         192.168.200.253:/exports/state/oleg328-server.virtnet /var/lib/stateless/state
ffff88012a347dc0 ffff8800b28fc800 nfs4         192.168.200.253:/exports/state/oleg328-server.virtnet /boot
ffff880137653880 ffff8800b28fc800 nfs4         192.168.200.253:/exports/state/oleg328-server.virtnet /etc/etc/kdump.conf
ffff880137653a40 ffff8800b51ef800 rpc_pipefs   sunrpc                   /var/lib/nfs/var/lib/nfs/rpc_pipefs
ffff8800b29d3500 ffff88012e12e000 squashfs     /dev/vda                 /usr/sbin/mount.lustre
ffff88012c459180 ffff8800b4257800 lustre       /dev/mapper/mds1_flakey  /mnt/lustre-mds1
ffff88012c4588c0 ffff88012e326800 lustre       /dev/mapper/mds2_flakey  /mnt/lustre-mds2
ffff88012c458e00 ffff88012c415000 lustre       /dev/mapper/ost1_flakey  /mnt/lustre-ost1
ffff88012c459880 ffff88009c553000 lustre       /dev/mapper/ost2_flakey  /mnt/lustre-ost2

                       +-------------------------------+
>----------------------| Last 40 lines of dmesg buffer |----------------------<
                       +-------------------------------+

[ 573.721015] Lustre: 14280:0:(mdt_recovery.c:149:mdt_req_from_lrd()) Skipped 2 previous similar messages
[ 578.489352] Lustre: 6531:0:(mdt_recovery.c:149:mdt_req_from_lrd()) @@@ restoring transno req@ffff880123f3a300 x1788538575763008/t4295074921(0) o101->e51af348-1edb-4b2f-8e6a-e53b58788337@192.168.203.28@tcp:45/0 lens 376/45880 e 0 to 0 dl 1705683965 ref 1 fl Interpret:H/202/0 rc 0/0 job:'dd.0' uid:0 gid:0
[ 578.498717] Lustre: 6531:0:(mdt_recovery.c:149:mdt_req_from_lrd()) Skipped 1 previous similar message
[ 583.881880] Lustre: 14301:0:(mdt_recovery.c:149:mdt_req_from_lrd()) @@@ restoring transno req@ffff88008c852300 x1788538578077760/t4295075756(0) o101->e51af348-1edb-4b2f-8e6a-e53b58788337@192.168.203.28@tcp:51/0 lens 376/47104 e 0 to 0 dl 1705683971 ref 1 fl Interpret:H/202/0 rc 0/0 job:'dd.0' uid:0 gid:0
[ 583.893228] Lustre: 14301:0:(mdt_recovery.c:149:mdt_req_from_lrd()) Skipped 2 previous similar messages
[ 585.578084] LustreError: 14262:0:(out_lib.c:1188:out_tx_index_delete_undo()) lustre-MDT0000-osd: Oops, can not rollback index_delete yet: rc = -524
[ 585.583582] LustreError: 14262:0:(out_lib.c:1188:out_tx_index_delete_undo()) Skipped 2 previous similar messages
[ 586.173025] LustreError: 7688:0:(llog_cat.c:737:llog_cat_cancel_arr_rec()) lustre-MDT0000-osp-MDT0001: fail to cancel 1 llog-records: rc = -2
[ 586.177641] LustreError: 14298:0:(mdt_open.c:1280:mdt_cross_open()) lustre-MDT0001: [0x240000404:0x11d4:0x0] doesn't exist!: rc = -14
[ 586.181714] LustreError: 7688:0:(llog_cat.c:737:llog_cat_cancel_arr_rec()) Skipped 5 previous similar messages
[ 586.185887] LustreError: 7688:0:(llog_cat.c:773:llog_cat_cancel_records()) lustre-MDT0000-osp-MDT0001: fail to cancel 1 of 1 llog-records: rc = -2
[ 586.189913] LustreError: 7688:0:(llog_cat.c:773:llog_cat_cancel_records()) Skipped 5 previous similar messages
[ 587.453515] LustreError: 14285:0:(mdt_open.c:1280:mdt_cross_open()) lustre-MDT0001: [0x240000404:0x11d4:0x0] doesn't exist!: rc = -14
[ 587.457610] LustreError: 14285:0:(mdt_open.c:1280:mdt_cross_open()) Skipped 5 previous similar messages
[ 588.662058] LustreError: 14280:0:(mdt_open.c:1280:mdt_cross_open()) lustre-MDT0001: [0x240000404:0x11d4:0x0] doesn't exist!: rc = -14
[ 590.874608] LustreError: 14226:0:(mdt_open.c:1280:mdt_cross_open()) lustre-MDT0001: [0x240000404:0x11d4:0x0] doesn't exist!: rc = -14
[ 590.878848] LustreError: 14226:0:(mdt_open.c:1280:mdt_cross_open()) Skipped 8 previous similar messages
[ 592.531149] Lustre: 14291:0:(mdt_recovery.c:149:mdt_req_from_lrd()) @@@ restoring transno req@ffff88008461f300 x1788538582417472/t4295057863(0) o101->e51af348-1edb-4b2f-8e6a-e53b58788337@192.168.203.28@tcp:59/0 lens 376/46232 e 0 to 0 dl 1705683979 ref 1 fl Interpret:H/202/0 rc 0/0 job:'dd.0' uid:0 gid:0
[ 592.538363] Lustre: 14291:0:(mdt_recovery.c:149:mdt_req_from_lrd()) Skipped 1 previous similar message
[ 595.184517] LustreError: 14286:0:(mdt_open.c:1280:mdt_cross_open()) lustre-MDT0001: [0x240000404:0x11d4:0x0] doesn't exist!: rc = -14
[ 595.187176] LustreError: 14286:0:(mdt_open.c:1280:mdt_cross_open()) Skipped 3 previous similar messages
[ 606.987982] LustreError: 14304:0:(mdt_open.c:1280:mdt_cross_open()) lustre-MDT0001: [0x240000404:0x11d4:0x0] doesn't exist!: rc = -14
[ 606.990779] LustreError: 14304:0:(mdt_open.c:1280:mdt_cross_open()) Skipped 3 previous similar messages
[ 612.441713] Lustre: 14280:0:(mdt_recovery.c:149:mdt_req_from_lrd()) @@@ restoring transno req@ffff88008b316900 x1788538593761600/t4295060624(0) o101->73cef02e-c9b6-4dc4-b0a9-19826a011212@192.168.203.28@tcp:79/0 lens 376/47320 e 0 to 0 dl 1705683999 ref 1 fl Interpret:H/202/0 rc 0/0 job:'dd.0' uid:0 gid:0
[ 612.448319] Lustre: 14280:0:(mdt_recovery.c:149:mdt_req_from_lrd()) Skipped 3 previous similar messages
[ 624.805016] LustreError: 14193:0:(mdt_open.c:1280:mdt_cross_open()) lustre-MDT0001: [0x240000404:0x11d4:0x0] doesn't exist!: rc = -14
[ 624.808965] LustreError: 14193:0:(mdt_open.c:1280:mdt_cross_open()) Skipped 3 previous similar messages
[ 626.815840] LustreError: 7682:0:(out_lib.c:1188:out_tx_index_delete_undo()) lustre-MDT0001-osd: Oops, can not rollback index_delete yet: rc = -524
[ 627.938987] LustreError: 6559:0:(llog_cat.c:737:llog_cat_cancel_arr_rec()) lustre-MDT0001-osp-MDT0000: fail to cancel 1 llog-records: rc = -2
[ 627.943870] LustreError: 6559:0:(llog_cat.c:773:llog_cat_cancel_records()) lustre-MDT0001-osp-MDT0000: fail to cancel 1 of 1 llog-records: rc = -2
[ 647.130108] Lustre: 14279:0:(mdt_recovery.c:149:mdt_req_from_lrd()) @@@ restoring transno req@ffff8800991f0a00 x1788538609213184/t4295087394(0) o101->e51af348-1edb-4b2f-8e6a-e53b58788337@192.168.203.28@tcp:114/0 lens 376/47920 e 0 to 0 dl 1705684034 ref 1 fl Interpret:H/202/0 rc 0/0 job:'dd.0' uid:0 gid:0
[ 647.141539] Lustre: 14279:0:(mdt_recovery.c:149:mdt_req_from_lrd()) Skipped 7 previous similar messages
[ 657.034793] LustreError: 14297:0:(mdt_open.c:1280:mdt_cross_open()) lustre-MDT0001: [0x240000404:0x11d4:0x0] doesn't exist!: rc = -14
[ 657.037304] LustreError: 14297:0:(mdt_open.c:1280:mdt_cross_open()) Skipped 51 previous similar messages
[ 672.033366] Lustre: lustre-OST0001-osc-MDT0001: update sequence from 0x2c0000400 to 0x2c0000402
[ 672.033391] Lustre: lustre-OST0000-osc-MDT0001: update sequence from 0x280000400 to 0x280000402
[ 714.336451] Lustre: lustre-OST0000-osc-MDT0000: update sequence from 0x280000401 to 0x280000403
[ 714.339781] Lustre: lustre-OST0001-osc-MDT0000: update sequence from 0x2c0000401 to 0x2c0000403
[ 714.547028] Lustre: 14277:0:(mdt_recovery.c:149:mdt_req_from_lrd()) @@@ restoring transno req@ffff880086a2ee00 x1788538644122816/t4295080132(0) o101->73cef02e-c9b6-4dc4-b0a9-19826a011212@192.168.203.28@tcp:181/0 lens 376/47320 e 0 to 0 dl 1705684101 ref 1 fl Interpret:H/202/0 rc 0/0 job:'dd.0' uid:0 gid:0
[ 714.553192] Lustre: 14277:0:(mdt_recovery.c:149:mdt_req_from_lrd()) Skipped 13 previous similar messages

******************************************************************************
************************ A Summary Of Problems Found *************************
******************************************************************************
-------------------- A list of all +++WARNING+++ messages --------------------
    PARTIAL DUMP with size(vmcore) < 25% size(RAM)
    There are 3 threads running in their own namespaces
        Use 'taskinfo --ns' to get more details
------------------------------------------------------------------------------

 ** Execution took 11.84s (real) 6.58s (CPU), Child processes: 5.24s