== conf-sanity test 35a: Reconnect to the last active server first ========================================================== 05:12:02 (1743498722) start mds service on oleg617-server /home/green/git/lustre-release/lustre/tests/test-framework.sh: line 1001: echo: write error: Device or resource busy /home/green/git/lustre-release/lustre/tests/test-framework.sh: line 1002: echo: write error: Device or resource busy Loading modules from /home/green/git/lustre-release/lustre detected 4 online CPUs by sysfs MODOPTS_LIBCFS= Force libcfs to create 2 CPU partitions ptlrpc/ptlrpc options: 'lbug_on_grant_miscount=1' quota/lquota options: 'hash_lqs_cur_bits=3' ln: failed to create symbolic link '/sbin/.libs': Read-only file system loading modules on: 'oleg617-server' oleg617-server: oleg617-server.virtnet: executing load_modules_local oleg617-server: Loading modules from /home/green/git/lustre-release/lustre oleg617-server: /home/green/git/lustre-release/lustre/tests/test-framework.sh: line 1001: echo: write error: Device or resource busy oleg617-server: /home/green/git/lustre-release/lustre/tests/test-framework.sh: line 1002: echo: write error: Device or resource busy oleg617-server: detected 4 online CPUs by sysfs oleg617-server: MODOPTS_LIBCFS= oleg617-server: Force libcfs to create 2 CPU partitions oleg617-server: ptlrpc/ptlrpc options: 'lbug_on_grant_miscount=1' oleg617-server: quota/lquota options: 'hash_lqs_cur_bits=3' Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg617-server: oleg617-server.virtnet: executing set_default_debug -1 all pdsh@oleg617-client: oleg617-server: ssh exited with exit code 1 Started lustre-MDT0000 start mds service on oleg617-server Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg617-server: oleg617-server.virtnet: executing set_default_debug -1 all pdsh@oleg617-client: oleg617-server: ssh exited with exit code 1 Started lustre-MDT0001 oleg617-client.virtnet: executing wait_import_state_mount FULL mdc.lustre-MDT0000-mdc-*.mds_server_uuid oleg617-client.virtnet: executing wait_import_state_mount FULL mdc.lustre-MDT0001-mdc-*.mds_server_uuid start ost1 service on oleg617-server Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg617-server: oleg617-server.virtnet: executing set_default_debug -1 all pdsh@oleg617-client: oleg617-server: ssh exited with exit code 1 Started lustre-OST0000 oleg617-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid mount lustre on /mnt/lustre..... Starting client: oleg617-client.virtnet: -o user_xattr,flock 192.168.206.117@tcp:/lustre /mnt/lustre debug=ha Set up a fake failnode for the MDS Wait for RECONNECT_INTERVAL seconds (10s) conf-sanity.sh test_35a 2025-04-01 5h13m56s Stopping the MDT: lustre-MDT0000 stop mds service on oleg617-server Stopping /mnt/lustre-mds1 (opts:-f) on oleg617-server Restarting the MDT: lustre-MDT0000 start mds service on oleg617-server Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg617-server: oleg617-server.virtnet: executing set_default_debug -1 all pdsh@oleg617-client: oleg617-server: ssh exited with exit code 1 Started lustre-MDT0000 Wait for df (34741) ... done debug=trace inode super iotrace malloc cache info ioctl neterror net warning buffs other dentry nettrace page dlmtrace error emerg ha rpctrace vfstrace reada mmap config console quota sec lfsck hsm snapshot layout Debug log: 146 lines, 146 kept, 0 dropped, 0 bad. umount lustre on /mnt/lustre..... Stopping client oleg617-client.virtnet /mnt/lustre (opts:) stop ost1 service on oleg617-server Stopping /mnt/lustre-ost1 (opts:-f) on oleg617-server stop mds service on oleg617-server Stopping /mnt/lustre-mds1 (opts:-f) on oleg617-server stop mds service on oleg617-server Stopping /mnt/lustre-mds2 (opts:-f) on oleg617-server /home/green/git/lustre-release/lustre/scripts/lustre_rmmod: line 114: echo: write error: Device or resource busy /home/green/git/lustre-release/lustre/scripts/lustre_rmmod: line 156: echo: write error: Device or resource busy unloading modules via unload_modules_local on: 'oleg617-server' oleg617-server: oleg617-server.virtnet: executing unload_modules_local oleg617-server: /home/green/git/lustre-release/lustre/scripts/lustre_rmmod: line 114: echo: write error: Device or resource busy oleg617-server: /home/green/git/lustre-release/lustre/scripts/lustre_rmmod: line 156: echo: write error: Device or resource busy modules unloaded. oleg617-server: tunefs.lustre: Unable to mount /dev/mapper/mds1_flakey: No such device oleg617-server: Is the ldiskfs module available? oleg617-server: oleg617-server: tunefs.lustre FATAL: failed to write local files oleg617-server: tunefs.lustre: exiting with 19 (No such device) pdsh@oleg617-client: oleg617-server: ssh exited with exit code 19 checking for existing Lustre data: found Read previous values: Target: lustre-MDT0000 Index: 0 Lustre FS: lustre Mount type: ldiskfs Flags: 0x5 (MDT MGS ) Persistent mount opts: user_xattr,errors=remount-ro Parameters: sys.timeout=20 mdt.identity_upcall=/home/green/git/lustre-release/lustre/utils/l_getidentity Permanent disk data: Target: lustre=MDT0000 Index: 0 Lustre FS: lustre Mount type: ldiskfs Flags: 0x105 (MDT MGS writeconf ) Persistent mount opts: user_xattr,errors=remount-ro Parameters: sys.timeout=20 mdt.identity_upcall=/home/green/git/lustre-release/lustre/utils/l_getidentity oleg617-server: tunefs.lustre: Unable to mount /dev/mapper/mds2_flakey: No such device oleg617-server: Is the ldiskfs module available? oleg617-server: oleg617-server: tunefs.lustre FATAL: failed to write local files oleg617-server: tunefs.lustre: exiting with 19 (No such device) pdsh@oleg617-client: oleg617-server: ssh exited with exit code 19 checking for existing Lustre data: found Read previous values: Target: lustre-MDT0001 Index: 1 Lustre FS: lustre Mount type: ldiskfs Flags: 0x1 (MDT ) Persistent mount opts: user_xattr,errors=remount-ro Parameters: mgsnode=192.168.206.117@tcp sys.timeout=20 mdt.identity_upcall=/home/green/git/lustre-release/lustre/utils/l_getidentity Permanent disk data: Target: lustre=MDT0001 Index: 1 Lustre FS: lustre Mount type: ldiskfs Flags: 0x101 (MDT writeconf ) Persistent mount opts: user_xattr,errors=remount-ro Parameters: mgsnode=192.168.206.117@tcp sys.timeout=20 mdt.identity_upcall=/home/green/git/lustre-release/lustre/utils/l_getidentity oleg617-server: tunefs.lustre: Unable to mount /dev/mapper/ost1_flakey: No such device oleg617-server: Is the ldiskfs module available? oleg617-server: oleg617-server: tunefs.lustre FATAL: failed to write local files oleg617-server: tunefs.lustre: exiting with 19 (No such device) pdsh@oleg617-client: oleg617-server: ssh exited with exit code 19 checking for existing Lustre data: found Read previous values: Target: lustre-OST0000 Index: 0 Lustre FS: lustre Mount type: ldiskfs Flags: 0x2 (OST ) Persistent mount opts: ,errors=remount-ro Parameters: mgsnode=192.168.206.117@tcp sys.timeout=20 Permanent disk data: Target: lustre=OST0000 Index: 0 Lustre FS: lustre Mount type: ldiskfs Flags: 0x102 (OST writeconf ) Persistent mount opts: ,errors=remount-ro Parameters: mgsnode=192.168.206.117@tcp sys.timeout=20 oleg617-server: tunefs.lustre: Unable to mount /dev/mapper/ost2_flakey: No such device oleg617-server: Is the ldiskfs module available? oleg617-server: oleg617-server: tunefs.lustre FATAL: failed to write local files oleg617-server: tunefs.lustre: exiting with 19 (No such device) pdsh@oleg617-client: oleg617-server: ssh exited with exit code 19 checking for existing Lustre data: found Read previous values: Target: lustre-OST0001 Index: 1 Lustre FS: lustre Mount type: ldiskfs Flags: 0x62 (OST first_time update ) Persistent mount opts: ,errors=remount-ro Parameters: mgsnode=192.168.206.117@tcp sys.timeout=20 Permanent disk data: Target: lustre=OST0001 Index: 1 Lustre FS: lustre Mount type: ldiskfs Flags: 0x162 (OST first_time update writeconf ) Persistent mount opts: ,errors=remount-ro Parameters: mgsnode=192.168.206.117@tcp sys.timeout=20 tunefs failed, reformatting instead Stopping clients: oleg617-client.virtnet /mnt/lustre (opts:-f) Stopping clients: oleg617-client.virtnet /mnt/lustre2 (opts:-f) pdsh@oleg617-client: oleg617-server: ssh exited with exit code 2 oleg617-server: oleg617-server.virtnet: executing set_hostid /home/green/git/lustre-release/lustre/tests/test-framework.sh: line 1001: echo: write error: Device or resource busy /home/green/git/lustre-release/lustre/tests/test-framework.sh: line 1002: echo: write error: Device or resource busy Loading modules from /home/green/git/lustre-release/lustre detected 4 online CPUs by sysfs MODOPTS_LIBCFS= Force libcfs to create 2 CPU partitions ../libcfs/libcfs/libcfs options: 'cpu_npartitions=2' ptlrpc/ptlrpc options: 'lbug_on_grant_miscount=1' quota/lquota options: 'hash_lqs_cur_bits=3' ln: failed to create symbolic link '/sbin/.libs': Read-only file system loading modules on: 'oleg617-server' oleg617-server: oleg617-server.virtnet: executing load_modules_local oleg617-server: Loading modules from /home/green/git/lustre-release/lustre oleg617-server: /home/green/git/lustre-release/lustre/tests/test-framework.sh: line 1001: echo: write error: Device or resource busy oleg617-server: /home/green/git/lustre-release/lustre/tests/test-framework.sh: line 1002: echo: write error: Device or resource busy oleg617-server: detected 4 online CPUs by sysfs oleg617-server: MODOPTS_LIBCFS= oleg617-server: Force libcfs to create 2 CPU partitions oleg617-server: ptlrpc/ptlrpc options: 'lbug_on_grant_miscount=1' oleg617-server: quota/lquota options: 'hash_lqs_cur_bits=3' Formatting mgs, mds, osts Format mds1: /dev/mapper/mds1_flakey Format mds2: /dev/mapper/mds2_flakey Format ost1: /dev/mapper/ost1_flakey Format ost2: /dev/mapper/ost2_flakey start mds service on oleg617-server Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg617-server: oleg617-server.virtnet: executing set_default_debug -1 all pdsh@oleg617-client: oleg617-server: ssh exited with exit code 1 Commit the device label on /dev/mapper/mds1_flakey Started lustre-MDT0000 start mds service on oleg617-server Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg617-server: oleg617-server.virtnet: executing set_default_debug -1 all pdsh@oleg617-client: oleg617-server: ssh exited with exit code 1 Commit the device label on /dev/mapper/mds2_flakey Started lustre-MDT0001 oleg617-client.virtnet: executing wait_import_state_mount FULL mdc.lustre-MDT0000-mdc-*.mds_server_uuid oleg617-client.virtnet: executing wait_import_state_mount FULL mdc.lustre-MDT0001-mdc-*.mds_server_uuid start ost1 service on oleg617-server Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg617-server: oleg617-server.virtnet: executing set_default_debug -1 all pdsh@oleg617-client: oleg617-server: ssh exited with exit code 1 Commit the device label on /dev/mapper/ost1_flakey Started lustre-OST0000 oleg617-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid oleg617-server: oleg617-server.virtnet: executing wait_import_state FULL os[cp].lustre-OST0000-osc-MDT0000.ost_server_uuid 50 oleg617-server: os[cp].lustre-OST0000-osc-MDT0000.ost_server_uuid in FULL state after 0 sec oleg617-server: oleg617-server.virtnet: executing wait_import_state FULL os[cp].lustre-OST0000-osc-MDT0001.ost_server_uuid 50 oleg617-server: os[cp].lustre-OST0000-osc-MDT0001.ost_server_uuid in FULL state after 0 sec stop ost1 service on oleg617-server Stopping /mnt/lustre-ost1 (opts:-f) on oleg617-server stop mds service on oleg617-server Stopping /mnt/lustre-mds1 (opts:-f) on oleg617-server stop mds service on oleg617-server Stopping /mnt/lustre-mds2 (opts:-f) on oleg617-server