== conf-sanity test 35b: Continue reconnection retries, if the active server is busy ========================================================== 04:31:33 (1743496293)
start mds service on oleg633-server
Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1
oleg633-server: oleg633-server.virtnet: executing set_default_debug -1 all
pdsh@oleg633-client: oleg633-server: ssh exited with exit code 1
Started lustre-MDT0000
oleg633-client.virtnet: executing wait_import_state_mount FULL mdc.lustre-MDT0000-mdc-*.mds_server_uuid
start ost1 service on oleg633-server
Starting ost1: -o localrecov lustre-ost1/ost1 /mnt/lustre-ost1
seq.cli-lustre-OST0000-super.width=65536
oleg633-server: oleg633-server.virtnet: executing set_default_debug -1 all
pdsh@oleg633-client: oleg633-server: ssh exited with exit code 1
Started lustre-OST0000
oleg633-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid
mount lustre on /mnt/lustre.....
Starting client: oleg633-client.virtnet: -o user_xattr,flock 192.168.206.133@tcp:/lustre /mnt/lustre
debug=ha
conf-sanity.sh test_35b 2025-04-01 4h32m54s
Set up a fake failnode for the MDS
at_max=0
at_max=0
Injecting EBUSY on MDS
fail_loc=0x80000136
mdc.lustre-MDT0000-mdc-ffff9f992b284000.stats=clear
Creating a test file and stat it
File: /mnt/lustre/d35b.conf-sanity/f35b.conf-sanity
Size: 0 Blocks: 1 IO Block: 4194304 regular empty file
Device: 2c54f966h/743766374d Inode: 144115238826934274 Links: 1
Access: (0644/-rw-r--r--) Uid: ( 0/ root) Gid: ( 0/ root)
Access: 2025-04-01 04:34:04.000000000 -0400
Modify: 2025-04-01 04:34:04.000000000 -0400
Change: 2025-04-01 04:34:04.000000000 -0400
Birth: 2025-04-01 04:34:04.000000000 -0400
Stop injecting EBUSY on MDS
fail_loc=0
done
at_max=600
at_max=600
Debug log: 48 lines, 48 kept, 0 dropped, 0 bad.
umount lustre on /mnt/lustre.....
Stopping client oleg633-client.virtnet /mnt/lustre (opts:)
stop ost1 service on oleg633-server
Stopping /mnt/lustre-ost1 (opts:-f) on oleg633-server
stop mds service on oleg633-server
Stopping /mnt/lustre-mds1 (opts:-f) on oleg633-server
/home/green/git/lustre-release/lustre/scripts/lustre_rmmod: line 114: echo: write error: Device or resource busy
/home/green/git/lustre-release/lustre/scripts/lustre_rmmod: line 156: echo: write error: Device or resource busy
unloading modules via unload_modules_local on: 'oleg633-server'
oleg633-server: oleg633-server.virtnet: executing unload_modules_local
oleg633-server: /home/green/git/lustre-release/lustre/scripts/lustre_rmmod: line 114: echo: write error: Device or resource busy
oleg633-server: /home/green/git/lustre-release/lustre/scripts/lustre_rmmod: line 156: echo: write error: Device or resource busy
modules unloaded.
oleg633-server:
oleg633-server: tunefs.lustre FATAL: Device lustre-mdt1/mdt1 has not been formatted with mkfs.lustre
oleg633-server: tunefs.lustre: exiting with 19 (No such device)
pdsh@oleg633-client: oleg633-server: ssh exited with exit code 19
checking for existing Lustre data: not found
oleg633-server:
oleg633-server: tunefs.lustre FATAL: Device lustre-ost1/ost1 has not been formatted with mkfs.lustre
oleg633-server: tunefs.lustre: exiting with 19 (No such device)
pdsh@oleg633-client: oleg633-server: ssh exited with exit code 19
checking for existing Lustre data: not found
oleg633-server:
oleg633-server: tunefs.lustre FATAL: Device lustre-ost2/ost2 has not been formatted with mkfs.lustre
oleg633-server: tunefs.lustre: exiting with 19 (No such device)
pdsh@oleg633-client: oleg633-server: ssh exited with exit code 19
checking for existing Lustre data: not found
tunefs failed, reformatting instead
Stopping clients: oleg633-client.virtnet /mnt/lustre (opts:-f)
Stopping clients: oleg633-client.virtnet /mnt/lustre2 (opts:-f)
pdsh@oleg633-client: oleg633-server: ssh exited with exit code 2
oleg633-server: oleg633-server.virtnet: executing set_hostid
/home/green/git/lustre-release/lustre/tests/test-framework.sh: line 1001: echo: write error: Device or resource busy
/home/green/git/lustre-release/lustre/tests/test-framework.sh: line 1002: echo: write error: Device or resource busy
Loading modules from /home/green/git/lustre-release/lustre
detected 4 online CPUs by sysfs
MODOPTS_LIBCFS=
Force libcfs to create 2 CPU partitions
../libcfs/libcfs/libcfs options: 'cpu_npartitions=2'
ptlrpc/ptlrpc options: 'lbug_on_grant_miscount=1'
quota/lquota options: 'hash_lqs_cur_bits=3'
ln: failed to create symbolic link '/sbin/.libs': Read-only file system
loading modules on: 'oleg633-server'
oleg633-server: oleg633-server.virtnet: executing load_modules_local
oleg633-server: Loading modules from /home/green/git/lustre-release/lustre
oleg633-server: /home/green/git/lustre-release/lustre/tests/test-framework.sh: line 1001: echo: write error: Device or resource busy
oleg633-server: /home/green/git/lustre-release/lustre/tests/test-framework.sh: line 1002: echo: write error: Device or resource busy
oleg633-server: detected 4 online CPUs by sysfs
oleg633-server: MODOPTS_LIBCFS=
oleg633-server: Force libcfs to create 2 CPU partitions
oleg633-server: ptlrpc/ptlrpc options: 'lbug_on_grant_miscount=1'
oleg633-server: quota/lquota options: 'hash_lqs_cur_bits=3'
Formatting mgs, mds, osts
Format mds1: lustre-mdt1/mdt1
Format ost1: lustre-ost1/ost1
Format ost2: lustre-ost2/ost2
start mds service on oleg633-server
Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1
oleg633-server: oleg633-server.virtnet: executing set_default_debug -1 all
pdsh@oleg633-client: oleg633-server: ssh exited with exit code 1
Commit the device label on lustre-mdt1/mdt1
Started lustre-MDT0000
oleg633-client.virtnet: executing wait_import_state_mount FULL mdc.lustre-MDT0000-mdc-*.mds_server_uuid
start ost1 service on oleg633-server
Starting ost1: -o localrecov lustre-ost1/ost1 /mnt/lustre-ost1
seq.cli-lustre-OST0000-super.width=65536
oleg633-server: oleg633-server.virtnet: executing set_default_debug -1 all
pdsh@oleg633-client: oleg633-server: ssh exited with exit code 1
Commit the device label on lustre-ost1/ost1
Started lustre-OST0000
oleg633-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid
oleg633-server: oleg633-server.virtnet: executing wait_import_state FULL os[cp].lustre-OST0000-osc-MDT0000.ost_server_uuid 50
oleg633-server: os[cp].lustre-OST0000-osc-MDT0000.ost_server_uuid in FULL state after 0 sec
stop ost1 service on oleg633-server
Stopping /mnt/lustre-ost1 (opts:-f) on oleg633-server
stop mds service on oleg633-server
Stopping /mnt/lustre-mds1 (opts:-f) on oleg633-server