************************ crashinfo ************************* /exports/testreports/38859/testresults/racer-special1-zfs-DNE-centos7_x86_64-centos7_x86_64/oleg231-server-timeout-core (3.10.0-7.9-debug) +==========================+ | *** Crashinfo v1.3.7 *** | +==========================+ +++WARNING+++ PARTIAL DUMP with size(vmcore) < 25% size(RAM) KERNEL: /tmp/crash-anaysis.IV47I/vmlinux [TAINTED] DUMPFILE: /exports/testreports/38859/testresults/racer-special1-zfs-DNE-centos7_x86_64-centos7_x86_64/oleg231-server-timeout-core [PARTIAL DUMP] CPUS: 4 DATE: Fri Jan 19 13:01:48 EST 2024 UPTIME: 01:08:04 LOAD AVERAGE: 0.00, 0.01, 0.47 TASKS: 757 NODENAME: oleg231-server.virtnet RELEASE: 3.10.0-7.9-debug VERSION: #1 SMP Sat Mar 26 23:28:42 EDT 2022 MACHINE: x86_64 (2399 Mhz) MEMORY: 4 GB PANIC: "" +--------------------------+ >------------------------| Per-cpu Stacks ('bt -a') |------------------------< +--------------------------+ -- CPU#0 -- PID=0 CPU=0 CMD=swapper/0 #-1 native_safe_halt+0xb, 449 bytes of data #0 default_idle+0x1e #1 default_enter_idle+0x45 #2 cpuidle_enter_state+0x40 #3 cpuidle_idle_call+0xd8 #4 arch_cpu_idle+0xe #5 cpu_startup_entry+0x14a #6 rest_init+0x8e #7 start_kernel+0x456 #8 x86_64_start_reservations+0x2a #9 x86_64_start_kernel+0x152 #10 start_cpu+0x5 -- CPU#1 -- PID=0 CPU=1 CMD=swapper/1 #-1 native_safe_halt+0xb, 449 bytes of data #0 default_idle+0x1e #1 default_enter_idle+0x45 #2 cpuidle_enter_state+0x40 #3 cpuidle_idle_call+0xd8 #4 arch_cpu_idle+0xe #5 cpu_startup_entry+0x14a #6 start_secondary+0x1eb #7 start_cpu+0x5 -- CPU#2 -- PID=0 CPU=2 CMD=swapper/2 #-1 native_safe_halt+0xb, 449 bytes of data #0 default_idle+0x1e #1 default_enter_idle+0x45 #2 cpuidle_enter_state+0x40 #3 cpuidle_idle_call+0xd8 #4 arch_cpu_idle+0xe #5 cpu_startup_entry+0x14a #6 start_secondary+0x1eb #7 start_cpu+0x5 -- CPU#3 -- PID=0 CPU=3 CMD=swapper/3 #-1 native_safe_halt+0xb, 449 bytes of data #0 default_idle+0x1e #1 default_enter_idle+0x45 #2 cpuidle_enter_state+0x40 #3 cpuidle_idle_call+0xd8 #4 arch_cpu_idle+0xe #5 cpu_startup_entry+0x14a #6 start_secondary+0x1eb #7 start_cpu+0x5 +--------------------------------+ >---------------------| How This Dump Has Been Created |---------------------< +--------------------------------+ Cannot identify the specific condition that triggered vmcore +---------------+ >------------------------------| Tasks Summary |------------------------------< +---------------+ Number of Threads That Ran Recently ----------------------------------- last second 37 last 5s 91 last 60s 158 ----- Total Numbers of Threads per State ------ TASK_INTERRUPTIBLE 753 TASK_RUNNING 1 +++WARNING+++ There are 3 threads running in their own namespaces Use 'taskinfo --ns' to get more details +-----------------------+ >--------------------------| 5 Most Recent Threads |--------------------------< +-----------------------+ PID CMD Age ARGS ----- -------------- ------ ---------------------------- 46 kworker/0:1 0 ms (no user stack) 968 kworker/1:1H 0 ms (no user stack) 690 kworker/0:1H 0 ms (no user stack) 882 in:imjournal 0 ms /usr/sbin/rsyslogd -n 11377 mmp 0 ms (no user stack) +------------------------+ >-------------------------| Memory Usage (kmem -i) |-------------------------< +------------------------+ PAGES TOTAL PERCENTAGE TOTAL MEM 955067 3.6 GB ---- FREE 530557 2 GB 55% of TOTAL MEM USED 424510 1.6 GB 44% of TOTAL MEM SHARED 8797 34.4 MB 0% of TOTAL MEM BUFFERS 5081 19.8 MB 0% of TOTAL MEM CACHED 64476 251.9 MB 6% of TOTAL MEM SLAB 123325 481.7 MB 12% of TOTAL MEM TOTAL HUGE 0 0 ---- HUGE FREE 0 0 0% of TOTAL HUGE TOTAL SWAP 262143 1024 MB ---- SWAP USED 0 0 0% of TOTAL SWAP SWAP FREE 262143 1024 MB 100% of TOTAL SWAP COMMIT LIMIT 739676 2.8 GB ---- COMMITTED 57981 226.5 MB 7% of TOTAL LIMIT +-------------------------------+ >----------------------| Scheduler Runqueues (per CPU) |----------------------< +-------------------------------+ ---+ CPU=0 ---- | CURRENT TASK , CMD=swapper/0 ---+ CPU=1 ---- | CURRENT TASK , CMD=swapper/1 ---+ CPU=2 ---- | CURRENT TASK , CMD=swapper/2 ---+ CPU=3 ---- | CURRENT TASK , CMD=swapper/3 +------------------------+ >-------------------------| Network Status Summary |-------------------------< +------------------------+ TCP Connection Info ------------------- ESTABLISHED 5 LISTEN 3 NAGLE disabled (TCP_NODELAY): 4 user_data set (NFS etc.): 4 UDP Connection Info ------------------- 2 UDP sockets, 0 in ESTABLISHED Unix Connection Info ------------------------ ESTABLISHED 26 CLOSE 17 LISTEN 8 Raw sockets info -------------------- CLOSE 1 Interfaces Info --------------- How long ago (in seconds) interfaces transmitted/received? Name RX TX ---- ---------- --------- lo n/a 4082.3 eth0 n/a 3.0 RSS_TOTAL=50560 pages, %mem= 0.8 +------------+ >-------------------------------| Mounted FS |-------------------------------< +------------+ MOUNT SUPERBLK TYPE DEVNAME DIRNAME ffff880138cca000 ffff880139940800 rootfs rootfs / ffff88012a258000 ffff88012a2e8000 sysfs sysfs /sys ffff88012a2581c0 ffff880139944000 proc proc /proc ffff88012a258380 ffff880137668000 devtmpfs devtmpfs /dev ffff88012a258540 ffff880137668800 securityfs securityfs /sys/kernel/security ffff88012a258700 ffff88012a2e8800 tmpfs tmpfs /dev/shm ffff88012a2588c0 ffff880137315000 devpts devpts /dev/pts ffff88012a258a80 ffff88012a2e9000 tmpfs tmpfs /run ffff88012a258c40 ffff88012a2e9800 tmpfs tmpfs /sys/fs/cgroup ffff88012a258e00 ffff88012a2ea000 cgroup cgroup /sys/fs/cgroup/systemd ffff88012a258fc0 ffff88012a2ea800 pstore pstore /sys/fs/pstore ffff88012a259180 ffff88012a2ec800 cgroup cgroup /sys/fs/cgroup/net_cls,net_prio ffff88012a259340 ffff88012a2ec000 cgroup cgroup /sys/fs/cgroup/cpuset ffff88012a259500 ffff88012a2eb800 cgroup cgroup /sys/fs/cgroup/memory ffff88012a2596c0 ffff88012a2eb000 cgroup cgroup /sys/fs/cgroup/blkio ffff88012a259880 ffff88012a2ed000 cgroup cgroup /sys/fs/cgroup/perf_event ffff88012a259a40 ffff88012a2ed800 cgroup cgroup /sys/fs/cgroup/freezer ffff88012a259c00 ffff88012a2ee000 cgroup cgroup /sys/fs/cgroup/cpu,cpuacct ffff88012a259dc0 ffff88012a2ee800 cgroup cgroup /sys/fs/cgroup/hugetlb ffff88012a27a000 ffff88012a2ef000 cgroup cgroup /sys/fs/cgroup/pids ffff88012a27a1c0 ffff88012a2ef800 cgroup cgroup /sys/fs/cgroup/devices ffff880137652700 ffff88013766f000 configfs configfs /sys/kernel/config ffff8801377e81c0 ffff8800b524e000 ext4 /dev/nbd0 / ffff88012a27a380 ffff88013766f800 rpc_pipefs rpc_pipefs /var/lib/nfs/rpc_pipefs ffff88012a27a540 ffff880129c4b800 autofs systemd-1 /proc/sys/fs/binfmt_misc ffff880137652e00 ffff880139947800 debugfs debugfs /sys/kernel/debug ffff880137652fc0 ffff880137316000 mqueue mqueue /dev/mqueue ffff88012a27a700 ffff880129c4c000 hugetlbfs hugetlbfs /dev/hugepages ffff8801377e8380 ffff88012b4bb800 binfmt_misc binfmt_misc /proc/sys/fs/binfmt_misc/ ffff8801377e8700 ffff88012b4be000 ramfs none /mnt ffff8801377e88c0 ffff8800b4dd6000 squashfs /dev/vda /home/green/git/lustre-release ffff8801377e8a80 ffff8800b4dd2000 tmpfs none /var/lib/stateless/writable ffff880137653180 ffff8800b4dd2000 tmpfs none /var/cache/man ffff880137653340 ffff8800b4dd2000 tmpfs none /var/log ffff880137653500 ffff8800b4dd2000 tmpfs none /var/lib/dbus ffff8801377e8c40 ffff8800b4dd2000 tmpfs none /tmp ffff88012a27a8c0 ffff8800b4dd2000 tmpfs none /var/lib/dhclient ffff8801377e8e00 ffff8800b4dd2000 tmpfs none /var/tmp ffff88012a27aa80 ffff8800b4dd2000 tmpfs none /var/lib/NetworkManager ffff8801377e8fc0 ffff8800b4dd2000 tmpfs none /var/lib/systemd/random-seed ffff88012a27ac40 ffff8800b4dd2000 tmpfs none /var/spool ffff8801377e9180 ffff8800b4dd2000 tmpfs none /var/lib/nfs ffff88012a27ae00 ffff8800b4dd2000 tmpfs none /var/lib/gssproxy ffff8801377e9340 ffff8800b4dd2000 tmpfs none /var/lib/logrotate ffff8801376536c0 ffff8800b4dd2000 tmpfs none /etc ffff880137653880 ffff8800b4dd2000 tmpfs none /var/lib/rsyslog ffff880137653a40 ffff8800b4dd2000 tmpfs none /var/lib/dhclient/var/lib/dhclient ffff8801377e9500 ffff88012b4ba000 nfs4 192.168.200.253:/exports/state/oleg231-server.virtnet /var/lib/stateless/state ffff8801377e9c00 ffff88012b4ba000 nfs4 192.168.200.253:/exports/state/oleg231-server.virtnet /boot ffff88012a27afc0 ffff88012b4ba000 nfs4 192.168.200.253:/exports/state/oleg231-server.virtnet /etc/etc/kdump.conf ffff880137653c00 ffff88013766f800 rpc_pipefs sunrpc /var/lib/nfs/var/lib/nfs/rpc_pipefs ffff8800b4307dc0 ffff8800b4dd6000 squashfs /dev/vda /usr/sbin/mount.lustre ffff8800b1e28e00 ffff88012e1f6800 lustre lustre-mdt1/mdt1 /mnt/lustre-mds1 ffff8800b1f48fc0 ffff880099adc800 lustre lustre-mdt2/mdt2 /mnt/lustre-mds2 ffff8800b1e28a80 ffff880095a7d800 lustre lustre-ost1/ost1 /mnt/lustre-ost1 ffff8800b4307500 ffff88008f4bf800 lustre lustre-ost2/ost2 /mnt/lustre-ost2 +-------------------------------+ >----------------------| Last 40 lines of dmesg buffer |----------------------< +-------------------------------+ [ 274.550765] [<0>] ptlrpc_main+0xc37/0x16d0 [ptlrpc] [ 274.552452] [<0>] kthread+0xe4/0xf0 [ 274.553422] [<0>] ret_from_fork_nospec_begin+0x7/0x21 [ 274.559039] [<0>] 0xfffffffffffffffe [ 275.017281] Lustre: mdt00_006: service thread pid 16606 was inactive for 40.149 seconds. Watchdog stack traces are limited to 3 per 300 seconds, skipping this one. [ 275.021684] Lustre: Skipped 7 previous similar messages [ 334.919863] LustreError: 7345:0:(ldlm_lockd.c:261:expired_lock_main()) ### lock callback timer expired after 101s: evicting client at 192.168.202.31@tcp ns: mdt-lustre-MDT0000_UUID lock: ffff880130f94a00/0x5506c851a94dc989 lrc: 3/0,0 mode: PR/PR res: [0x200000402:0x14b6:0x0].0x0 bits 0x1b/0x0 rrc: 4 type: IBT gid 0 flags: 0x60200400000020 nid: 192.168.202.31@tcp remote: 0xce820b82ab5d83fd expref: 782 pid: 16621 timeout: 333 lvb_type: 0 [ 335.206947] LustreError: 17894:0:(mdd_dir.c:4452:mdd_migrate_cmd_check()) lustre-MDD0001: '10' migration was interrupted, run 'lfs migrate -m 0 -c 2 -H crush 10' to finish migration. [ 335.219949] LustreError: 17894:0:(mdd_dir.c:4452:mdd_migrate_cmd_check()) Skipped 9 previous similar messages [ 340.508305] LustreError: 17928:0:(mdt_handler.c:777:mdt_pack_acl2body()) lustre-MDT0001: unable to read [0x240000403:0x13d1:0x0] ACL: rc = -2 [ 342.954210] Lustre: 7357:0:(lod_lov.c:1429:lod_parse_striping()) lustre-MDT0000-mdtlov: EXTENSION flags=40 set on component[2]=1 of non-SEL file [0x200000403:0x1291:0x0] with magic=0xbd60bd0 [ 342.960368] Lustre: 7357:0:(lod_lov.c:1429:lod_parse_striping()) Skipped 1 previous similar message [ 358.172583] LustreError: 16979:0:(mdd_object.c:3645:mdd_close()) lustre-MDD0000: failed to get lu_attr of [0x200000403:0xfb1:0x0]: rc = -2 [ 359.927240] LustreError: 17942:0:(mdt_reint.c:2443:mdt_reint_migrate()) lustre-MDT0001: migrate [0x240000403:0x166d:0x0]/7 failed: rc = -2 [ 359.933924] LustreError: 17942:0:(mdt_reint.c:2443:mdt_reint_migrate()) Skipped 90 previous similar messages [ 468.551920] LustreError: 7345:0:(ldlm_lockd.c:261:expired_lock_main()) ### lock callback timer expired after 100s: evicting client at 192.168.202.31@tcp ns: mdt-lustre-MDT0000_UUID lock: ffff88009cafa300/0x5506c851a9586057 lrc: 3/0,0 mode: PR/PR res: [0x200000402:0x19fc:0x0].0x0 bits 0x1b/0x0 rrc: 4 type: IBT gid 0 flags: 0x60200400000020 nid: 192.168.202.31@tcp remote: 0xce820b82ab60b177 expref: 333 pid: 16607 timeout: 467 lvb_type: 0 [ 468.565824] LustreError: 17823:0:(ldlm_lockd.c:1495:ldlm_handle_enqueue()) ### lock on destroyed export ffff880133675800 ns: mdt-lustre-MDT0000_UUID lock: ffff8800a6e28f00/0x5506c851a9587ba1 lrc: 3/0,0 mode: PR/PR res: [0x200000402:0x19e2:0x0].0x0 bits 0x13/0x0 rrc: 11 type: IBT gid 0 flags: 0x50200000000000 nid: 192.168.202.31@tcp remote: 0xce820b82ab60b988 expref: 142 pid: 17823 timeout: 0 lvb_type: 0 [ 481.802382] Lustre: DEBUG MARKER: == racer test 2: racer rename: oleg231-client.virtnet DURATION=300 ========================================================== 12:01:45 (1705683705) [ 483.145029] Lustre: 16619:0:(mdt_recovery.c:149:mdt_req_from_lrd()) @@@ restoring transno req@ffff8800896f2800 x1788538542952320/t4295020374(0) o101->45aa66e1-e2ed-4253-b00e-100865308d55@192.168.202.31@tcp:684/0 lens 376/34760 e 0 to 0 dl 1705683849 ref 1 fl Interpret:H/202/0 rc 0/0 job:'dd.0' uid:0 gid:0 [ 487.015928] Lustre: 9035:0:(mdt_recovery.c:149:mdt_req_from_lrd()) @@@ restoring transno req@ffff88007a838f00 x1788538544218816/t4295022720(0) o101->48b532ba-9d1b-4b82-90a8-fe6a5385e4bc@192.168.202.31@tcp:688/0 lens 376/48232 e 0 to 0 dl 1705683853 ref 1 fl Interpret:H/202/0 rc 0/0 job:'dd.0' uid:0 gid:0 [ 491.302619] Lustre: 17870:0:(mdt_recovery.c:149:mdt_req_from_lrd()) @@@ restoring transno req@ffff88007d343700 x1788538545722048/t4295022228(0) o101->45aa66e1-e2ed-4253-b00e-100865308d55@192.168.202.31@tcp:692/0 lens 376/45248 e 0 to 0 dl 1705683857 ref 1 fl Interpret:H/202/0 rc 0/0 job:'dd.0' uid:0 gid:0 [ 505.139629] Lustre: 17870:0:(mdt_recovery.c:149:mdt_req_from_lrd()) @@@ restoring transno req@ffff880121cd1900 x1788538551784000/t4295024966(0) o101->45aa66e1-e2ed-4253-b00e-100865308d55@192.168.202.31@tcp:706/0 lens 376/45248 e 0 to 0 dl 1705683871 ref 1 fl Interpret:H/202/0 rc 0/0 job:'dd.0' uid:0 gid:0 [ 505.151278] Lustre: 17870:0:(mdt_recovery.c:149:mdt_req_from_lrd()) Skipped 2 previous similar messages [ 522.702199] Lustre: 17942:0:(mdt_recovery.c:149:mdt_req_from_lrd()) @@@ restoring transno req@ffff880076a51e00 x1788538557370752/t4295028195(0) o101->48b532ba-9d1b-4b82-90a8-fe6a5385e4bc@192.168.202.31@tcp:724/0 lens 376/48232 e 0 to 0 dl 1705683889 ref 1 fl Interpret:H/202/0 rc 0/0 job:'dd.0' uid:0 gid:0 [ 522.711109] Lustre: 17942:0:(mdt_recovery.c:149:mdt_req_from_lrd()) Skipped 6 previous similar messages [ 555.702112] Lustre: 16621:0:(mdt_recovery.c:149:mdt_req_from_lrd()) @@@ restoring transno req@ffff8800a6f60a00 x1788538568684352/t4295032684(0) o101->45aa66e1-e2ed-4253-b00e-100865308d55@192.168.202.31@tcp:2/0 lens 376/46816 e 0 to 0 dl 1705683922 ref 1 fl Interpret:H/202/0 rc 0/0 job:'dd.0' uid:0 gid:0 [ 555.710917] Lustre: 16621:0:(mdt_recovery.c:149:mdt_req_from_lrd()) Skipped 8 previous similar messages [ 589.858727] LustreError: 16978:0:(lod_lov.c:1459:lod_parse_striping()) lustre-MDT0000-mdtlov: DOM entries with different sizes 0x40000/0x80000 [ 621.869220] Lustre: 7357:0:(mdt_recovery.c:149:mdt_req_from_lrd()) @@@ restoring transno req@ffff880076d60500 x1788538592316288/t4295044622(0) o101->45aa66e1-e2ed-4253-b00e-100865308d55@192.168.202.31@tcp:68/0 lens 376/46816 e 0 to 0 dl 1705683988 ref 1 fl Interpret:H/202/0 rc 0/0 job:'dd.0' uid:0 gid:0 [ 621.877712] Lustre: 7357:0:(mdt_recovery.c:149:mdt_req_from_lrd()) Skipped 23 previous similar messages [ 631.824953] Lustre: 17933:0:(lod_lov.c:1429:lod_parse_striping()) lustre-MDT0000-mdtlov: EXTENSION flags=40 set on component[2]=1 of non-SEL file [0x200000405:0xdaf:0x0] with magic=0xbd60bd0 [ 631.833113] Lustre: 17933:0:(lod_lov.c:1429:lod_parse_striping()) Skipped 57 previous similar messages [ 647.667817] Lustre: lustre-OST0000-osc-MDT0001: update sequence from 0x280000400 to 0x280000402 [ 672.090780] Lustre: lustre-OST0001-osc-MDT0001: update sequence from 0x2c0000400 to 0x2c0000402 [ 718.825262] Lustre: lustre-OST0000-osc-MDT0000: update sequence from 0x280000401 to 0x280000403 [ 727.579917] LustreError: 7345:0:(ldlm_lockd.c:261:expired_lock_main()) ### lock callback timer expired after 100s: evicting client at 192.168.202.31@tcp ns: mdt-lustre-MDT0000_UUID lock: ffff880124955400/0x5506c851a9a213cd lrc: 3/0,0 mode: PR/PR res: [0x200000405:0xe96:0x0].0x0 bits 0x53/0x0 rrc: 5 type: IBT gid 0 flags: 0x60200400000020 nid: 192.168.202.31@tcp remote: 0xce820b82ab7eabca expref: 904 pid: 17823 timeout: 726 lvb_type: 0 [ 727.609285] LustreError: 32052:0:(ldlm_lockd.c:2590:ldlm_cancel_handler()) ldlm_cancel from 192.168.202.31@tcp arrived at 1705683950 with bad export cookie 6126804596143210052 [ 758.425205] Lustre: 17933:0:(mdt_recovery.c:149:mdt_req_from_lrd()) @@@ restoring transno req@ffff880120485f00 x1788538625377472/t4295055283(0) o101->48b532ba-9d1b-4b82-90a8-fe6a5385e4bc@192.168.202.31@tcp:204/0 lens 384/48232 e 0 to 0 dl 1705684124 ref 1 fl Interpret:H/202/0 rc 0/0 job:'lfs.0' uid:0 gid:0 [ 758.434295] Lustre: 17933:0:(mdt_recovery.c:149:mdt_req_from_lrd()) Skipped 34 previous similar messages [ 758.529259] Lustre: lustre-OST0001-osc-MDT0000: update sequence from 0x2c0000401 to 0x2c0000403 ****************************************************************************** ************************ A Summary Of Problems Found ************************* ****************************************************************************** -------------------- A list of all +++WARNING+++ messages -------------------- PARTIAL DUMP with size(vmcore) < 25% size(RAM) There are 3 threads running in their own namespaces Use 'taskinfo --ns' to get more details ------------------------------------------------------------------------------ ** Execution took 13.57s (real) 7.93s (CPU), Child processes: 5.61s