-----============= acceptance-small: insanity ============----- Mon Mar 16 09:38:50 EDT 2026 mgs: Rocky Linux release 8.10 (Green Obsidian) MGS_OS_ID_LIKE=rhel centos fedora rocky MGS_OS_VERSION_ID=8.10 MGS_OS_ID=rocky MGS_OS_VERSION_CODE=134873088 mds1: Rocky Linux release 8.10 (Green Obsidian) MDS1_OS_VERSION_ID=8.10 MDS1_OS_VERSION_CODE=134873088 MDS1_OS_ID_LIKE=rhel centos fedora rocky MDS1_OS_ID=rocky ost1: Rocky Linux release 8.10 (Green Obsidian) OST1_OS_VERSION_CODE=134873088 OST1_OS_ID_LIKE=rhel centos fedora rocky OST1_OS_VERSION_ID=8.10 OST1_OS_ID=rocky client: Rocky Linux release 8.10 (Green Obsidian) CLIENT_OS_ID=rocky CLIENT_OS_VERSION_CODE=134873088 CLIENT_OS_VERSION_ID=8.10 CLIENT_OS_ID_LIKE=rhel centos fedora rocky oleg123-server: ls: cannot access '/home/green/git/lustre-release/lustre/tests/except/insanity.*ex': No such file or directory excepting tests: === insanity: start setup 09:39:01 (1773668341) === oleg123-client.virtnet: executing check_config_client /mnt/lustre oleg123-client.virtnet: Checking config lustre mounted on /mnt/lustre Checking servers environments Checking clients oleg123-client.virtnet environments Using TIMEOUT=20 osc.lustre-OST0000-osc-ffff96ec85474800.idle_timeout=debug osc.lustre-OST0001-osc-ffff96ec85474800.idle_timeout=debug disable quota as required oleg123-server: oleg123-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all osd-ldiskfs.track_declares_assert=1 === insanity: finish setup 09:39:18 (1773668358) === == insanity test 0: Fail all nodes, independently ======== 09:39:19 (1773668359) Failing mds1 on oleg123-server Stopping /mnt/lustre-mds1 (opts:) on oleg123-server 09:39:22 (1773668362) shut down facet: mds1 facet_host: oleg123-server facet_failover_host: oleg123-server Failover mds1 to oleg123-server mount facets: mds1 Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg123-server: oleg123-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg123-client: oleg123-server: ssh exited with exit code 1 Started lustre-MDT0000 09:39:38 (1773668378) targets are mounted 09:39:38 (1773668378) facet_failover done oleg123-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Failing mds2 on oleg123-server Stopping /mnt/lustre-mds2 (opts:) on oleg123-server 09:39:46 (1773668386) shut down facet: mds2 facet_host: oleg123-server facet_failover_host: oleg123-server Failover mds2 to oleg123-server mount facets: mds2 Start mds2: mount -t lustre -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg123-server: oleg123-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg123-client: oleg123-server: ssh exited with exit code 1 Started lustre-MDT0001 09:40:02 (1773668402) targets are mounted 09:40:02 (1773668402) facet_failover done oleg123-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec Failing ost1 on oleg123-server Stopping /mnt/lustre-ost1 (opts:) on oleg123-server 09:40:10 (1773668410) shut down facet: ost1 facet_host: oleg123-server facet_failover_host: oleg123-server Failover ost1 to oleg123-server mount facets: ost1 Start ost1: mount -t lustre -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg123-server: oleg123-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg123-client: oleg123-server: ssh exited with exit code 1 Started lustre-OST0000 09:40:27 (1773668427) targets are mounted 09:40:27 (1773668427) facet_failover done oleg123-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec Failing ost2 on oleg123-server Stopping /mnt/lustre-ost2 (opts:) on oleg123-server 09:40:34 (1773668434) shut down facet: ost2 facet_host: oleg123-server facet_failover_host: oleg123-server Failover ost2 to oleg123-server mount facets: ost2 Start ost2: mount -t lustre -o localrecov /dev/mapper/ost2_flakey /mnt/lustre-ost2 seq.cli-lustre-OST0001-super.width=65536 oleg123-server: oleg123-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg123-client: oleg123-server: ssh exited with exit code 1 Started lustre-OST0001 09:40:50 (1773668450) targets are mounted 09:40:51 (1773668451) facet_failover done oleg123-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0001-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0001-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec PASS 0 (99s) == insanity test 1: MDS/MDS failure ====================== 09:40:58 (1773668458) Stopping /mnt/lustre-mds1 (opts:) on oleg123-server Failover mds1 to oleg123-server Stopping /mnt/lustre-mds2 (opts:) on oleg123-server Reintegrating MDS2 oleg123-server.virtnet Start mds2: mount -t lustre -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg123-server: oleg123-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg123-client: oleg123-server: ssh exited with exit code 1 Started lustre-MDT0001 oleg123-server.virtnet Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg123-server: oleg123-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg123-client: oleg123-server: ssh exited with exit code 1 Started lustre-MDT0000 Verify reintegration PASS 1 (178s) == insanity test 2: Second Failure Mode: MDS/OST Mon Mar 16 09:43:56 EDT 2026 ========================================================== 09:43:56 (1773668636) Verify Lustre filesystem is up and running Stopping /mnt/lustre-mds1 (opts:) on oleg123-server Failover mds1 to oleg123-server Stopping /mnt/lustre-mds2 (opts:) on oleg123-server Failover mds2 to oleg123-server Stopping /mnt/lustre-ost1 (opts:) on oleg123-server Reintegrating OST oleg123-server.virtnet Start ost1: mount -t lustre -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg123-server: oleg123-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg123-client: oleg123-server: ssh exited with exit code 1 Started lustre-OST0000 oleg123-server.virtnet Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg123-server: oleg123-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg123-client: oleg123-server: ssh exited with exit code 1 Started lustre-MDT0000 oleg123-server.virtnet Start mds2: mount -t lustre -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg123-server: oleg123-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg123-client: oleg123-server: ssh exited with exit code 1 Started lustre-MDT0001 Verify reintegration PASS 2 (197s) == insanity test 3: Third Failure Mode: MDS/CLIENT Mon Mar 16 09:47:13 EDT 2026 ========================================================== 09:47:13 (1773668833) Verify Lustre filesystem is up and running Failing mds1 on oleg123-server Stopping /mnt/lustre-mds1 (opts:) on oleg123-server 09:47:18 (1773668838) shut down facet: mds1 facet_host: oleg123-server facet_failover_host: oleg123-server Failover mds1 to oleg123-server mount facets: mds1 Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg123-server: oleg123-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg123-client: oleg123-server: ssh exited with exit code 1 Started lustre-MDT0000 09:47:37 (1773668857) targets are mounted 09:47:37 (1773668857) facet_failover done oleg123-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Failing mds2 on oleg123-server Stopping /mnt/lustre-mds2 (opts:) on oleg123-server 09:47:48 (1773668868) shut down facet: mds2 facet_host: oleg123-server facet_failover_host: oleg123-server Failover mds2 to oleg123-server mount facets: mds2 Start mds2: mount -t lustre -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg123-server: oleg123-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg123-client: oleg123-server: ssh exited with exit code 1 Started lustre-MDT0001 09:48:09 (1773668889) targets are mounted 09:48:09 (1773668889) facet_failover done oleg123-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec Test Lustre stability after MDS failover Failing 2 CLIENTS Request fail clients: 2, to fail: 0, failed: 0 No clients failed! Test Lustre stability after CLIENT failure Reintegrating CLIENTS PASS 3 (79s) == insanity test 4: Fourth Failure Mode: OST/MDS Mon Mar 16 09:48:32 EDT 2026 ========================================================== 09:48:32 (1773668912) Fourth Failure Mode: OST/MDS Mon Mar 16 09:48:33 EDT 2026 Stopping /mnt/lustre-ost1 (opts:) on oleg123-server Test Lustre stability after OST failure Stopping /mnt/lustre-mds1 (opts:) on oleg123-server Failover mds1 to oleg123-server Stopping /mnt/lustre-mds2 (opts:) on oleg123-server Failover mds2 to oleg123-server Reintegrating OST oleg123-server.virtnet Start ost1: mount -t lustre -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg123-server: oleg123-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg123-client: oleg123-server: ssh exited with exit code 1 Started lustre-OST0000 oleg123-server.virtnet Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg123-server: oleg123-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg123-client: oleg123-server: ssh exited with exit code 1 Started lustre-MDT0000 oleg123-server.virtnet Start mds2: mount -t lustre -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg123-server: oleg123-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg123-client: oleg123-server: ssh exited with exit code 1 Started lustre-MDT0001 Test Lustre stability after MDS failover PASS 4 (195s) == insanity test 5: Fifth Failure Mode: OST/OST Mon Mar 16 09:51:47 EDT 2026 ========================================================== 09:51:48 (1773669108) Fifth Failure Mode: OST/OST Mon Mar 16 09:51:48 EDT 2026 Verify Lustre filesystem is up and running Stopping /mnt/lustre-ost1 (opts:) on oleg123-server Test Lustre stability after OST failure Stopping /mnt/lustre-ost2 (opts:) on oleg123-server Test Lustre stability after OST failure Reintegrating OSTs oleg123-server.virtnet Start ost1: mount -t lustre -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg123-server: oleg123-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg123-client: oleg123-server: ssh exited with exit code 1 Started lustre-OST0000 oleg123-server.virtnet Start ost2: mount -t lustre -o localrecov /dev/mapper/ost2_flakey /mnt/lustre-ost2 seq.cli-lustre-OST0001-super.width=65536 oleg123-server: oleg123-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg123-client: oleg123-server: ssh exited with exit code 1 Started lustre-OST0001 PASS 5 (90s) == insanity test 6: Sixth Failure Mode: OST/CLIENT Mon Mar 16 09:53:17 EDT 2026 ========================================================== 09:53:17 (1773669197) Sixth Failure Mode: OST/CLIENT Mon Mar 16 09:53:20 EDT 2026 Verify Lustre filesystem is up and running Stopping /mnt/lustre-ost1 (opts:) on oleg123-server Test Lustre stability after OST failure DFPIDA=23099 Failing CLIENTs Request fail clients: , to fail: 0, failed: 0 No clients failed! Test Lustre stability after CLIENTs failure DFPIDB=23410 Reintegrating OST/CLIENTs oleg123-server.virtnet Start ost1: mount -t lustre -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg123-server: oleg123-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg123-client: oleg123-server: ssh exited with exit code 1 Started lustre-OST0000 Verifying mount PASS 6 (81s) == insanity test 7: Seventh Failure Mode: CLIENT/MDS Mon Mar 16 09:54:38 EDT 2026 ========================================================== 09:54:38 (1773669278) Seventh Failure Mode: CLIENT/MDS Mon Mar 16 09:54:40 EDT 2026 Verify Lustre filesystem is up and running Part 1: Failing CLIENT Request fail clients: 2, to fail: 0, failed: 0 No clients failed! Test Lustre stability after CLIENTs failure oleg123-client: total 0 oleg123-client: -rw-r--r-- 1 root root 0 Mar 16 09:54 oleg123-client.virtnet_testfile Wait 1 minutes Verify Lustre filesystem is up and running oleg123-client: rm: cannot remove '/mnt/lustre/d0.insanity/oleg123-client.virtnet_testfile': No such file or directory pdsh@oleg123-client: oleg123-client: ssh exited with exit code 1 Failing mds1 on oleg123-server Stopping /mnt/lustre-mds1 (opts:) on oleg123-server 09:55:58 (1773669358) shut down facet: mds1 facet_host: oleg123-server facet_failover_host: oleg123-server Failover mds1 to oleg123-server mount facets: mds1 Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg123-server: oleg123-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg123-client: oleg123-server: ssh exited with exit code 1 Started lustre-MDT0000 09:56:33 (1773669393) targets are mounted 09:56:33 (1773669393) facet_failover done oleg123-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Failing mds2 on oleg123-server Stopping /mnt/lustre-mds2 (opts:) on oleg123-server 09:56:49 (1773669409) shut down facet: mds2 facet_host: oleg123-server facet_failover_host: oleg123-server Failover mds2 to oleg123-server mount facets: mds2 Start mds2: mount -t lustre -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg123-server: oleg123-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg123-client: oleg123-server: ssh exited with exit code 1 Started lustre-MDT0001 09:57:15 (1773669435) targets are mounted 09:57:15 (1773669435) facet_failover done oleg123-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec oleg123-client: total 0 Reintegrating CLIENTs wait 1 minutes PASS 7 (237s) == insanity test 8: Eighth Failure Mode: CLIENT/OST Mon Mar 16 09:58:35 EDT 2026 ========================================================== 09:58:35 (1773669515) Eighth Failure Mode: CLIENT/OST Mon Mar 16 09:58:37 EDT 2026 Verify Lustre filesystem is up and running Failing CLIENTs Request fail clients: 2, to fail: 0, failed: 0 No clients failed! Test Lustre stability after CLIENTs failure oleg123-client: total 0 oleg123-client: -rw-r--r-- 1 root root 0 Mar 16 09:58 oleg123-client.virtnet_testfile Wait 1 minutes Verify Lustre filesystem is up and running Stopping /mnt/lustre-ost1 (opts:) on oleg123-server Test Lustre stability after OST failure Reintegrating CLIENTs/OST oleg123-server.virtnet Start ost1: mount -t lustre -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg123-server: oleg123-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg123-client: oleg123-server: ssh exited with exit code 1 Started lustre-OST0000 Wait 1 minutes PASS 8 (183s) == insanity test 9: Ninth Failure Mode: CLIENT/CLIENT Mon Mar 16 10:01:38 EDT 2026 ========================================================== 10:01:38 (1773669698) Verify Lustre filesystem is up and running Failing CLIENTs Request fail clients: 2, to fail: 0, failed: 0 No clients failed! Test Lustre stability after CLIENTs failure oleg123-client: total 0 oleg123-client: -rw-r--r-- 1 root root 0 Mar 16 10:01 oleg123-client.virtnet_testfile oleg123-client: -rw-r--r-- 1 root root 0 Mar 16 10:00 oleg123-client.virtnet_testfile2 Wait 1 minutes Verify Lustre filesystem is up and running Failing CLIENTs Request fail clients: 2, to fail: 0, failed: 0 No clients failed! Test Lustre stability after CLIENTs failure oleg123-client: total 0 oleg123-client: -rw-r--r-- 1 root root 0 Mar 16 10:02 oleg123-client.virtnet_testfile oleg123-client: -rw-r--r-- 1 root root 0 Mar 16 10:00 oleg123-client.virtnet_testfile2 Reintegrating CLIENTs/CLIENTs Wait 1 minutes PASS 9 (146s) == insanity test 10: Tenth Failure Mode: MDT0/OST/MDT1 Mon Mar 16 10:04:04 EDT 2026 ========================================================== 10:04:04 (1773669844) Stopping /mnt/lustre-mds1 (opts:) on oleg123-server Failover mds1 to oleg123-server Stopping /mnt/lustre-ost1 (opts:) on oleg123-server Reintegrating OST oleg123-server.virtnet Start ost1: mount -t lustre -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg123-server: oleg123-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg123-client: oleg123-server: ssh exited with exit code 1 Started lustre-OST0000 Stopping /mnt/lustre-mds2 (opts:) on oleg123-server Failover mds2 to oleg123-server oleg123-server.virtnet Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg123-server: oleg123-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg123-client: oleg123-server: ssh exited with exit code 1 Started lustre-MDT0000 oleg123-server.virtnet Start mds2: mount -t lustre -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg123-server: oleg123-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg123-client: oleg123-server: ssh exited with exit code 1 Started lustre-MDT0001 Verify reintegration PASS 10 (159s) == insanity test 11: Eleventh Failure Mode: MDS0/CLIENT/MDS1 Mon Mar 16 10:06:43 EDT 2026 ========================================================== 10:06:43 (1773670003) Verify Lustre filesystem is up and running Failing mds1 on oleg123-server Stopping /mnt/lustre-mds1 (opts:) on oleg123-server 10:06:46 (1773670006) shut down facet: mds1 facet_host: oleg123-server facet_failover_host: oleg123-server Failover mds1 to oleg123-server mount facets: mds1 Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg123-server: oleg123-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg123-client: oleg123-server: ssh exited with exit code 1 Started lustre-MDT0000 10:07:02 (1773670022) targets are mounted 10:07:02 (1773670022) facet_failover done oleg123-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Test Lustre stability after MDS failover Failing 2 CLIENTS Request fail clients: 2, to fail: 0, failed: 0 No clients failed! Test Lustre stability after CLIENT failure Reintegrating CLIENTS Failing mds2 on oleg123-server Stopping /mnt/lustre-mds2 (opts:) on oleg123-server 10:07:15 (1773670035) shut down facet: mds2 facet_host: oleg123-server facet_failover_host: oleg123-server Failover mds2 to oleg123-server mount facets: mds2 Start mds2: mount -t lustre -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg123-server: oleg123-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg123-client: oleg123-server: ssh exited with exit code 1 Started lustre-MDT0001 10:07:31 (1773670051) targets are mounted 10:07:31 (1773670051) facet_failover done oleg123-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 11 (60s) == insanity test 12: Twelve Failure Mode: MDS0,MDS1/OST0, OST1/CLIENTS Mon Mar 16 10:07:43 EDT 2026 ========================================================== 10:07:43 (1773670063) Verify Lustre filesystem is up and running Failing mds1 on oleg123-server Stopping /mnt/lustre-mds1 (opts:) on oleg123-server Failing mds2 on oleg123-server Stopping /mnt/lustre-mds2 (opts:) on oleg123-server 10:07:47 (1773670067) shut down facet: mds1 facet_host: oleg123-server facet_failover_host: oleg123-server facet: mds2 facet_host: oleg123-server facet_failover_host: oleg123-server Failover mds1 to oleg123-server mount facets: mds1 Failover mds2 to oleg123-server mount facets: mds2 Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 Start mds2: mount -t lustre -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg123-server: oleg123-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all oleg123-server: oleg123-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg123-client: oleg123-server: ssh exited with exit code 1 pdsh@oleg123-client: oleg123-server: ssh exited with exit code 1 Started lustre-MDT0001 Started lustre-MDT0000 10:08:14 (1773670094) targets are mounted 10:08:14 (1773670094) facet_failover done oleg123-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid,mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec Failing ost1 on oleg123-server Stopping /mnt/lustre-ost1 (opts:) on oleg123-server Failing ost2 on oleg123-server Stopping /mnt/lustre-ost2 (opts:) on oleg123-server 10:08:26 (1773670106) shut down facet: ost1 facet_host: oleg123-server facet_failover_host: oleg123-server facet: ost2 facet_host: oleg123-server facet_failover_host: oleg123-server Failover ost1 to oleg123-server mount facets: ost1 Failover ost2 to oleg123-server mount facets: ost2 Start ost1: mount -t lustre -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 Start ost2: mount -t lustre -o localrecov /dev/mapper/ost2_flakey /mnt/lustre-ost2 seq.cli-lustre-OST0001-super.width=65536 seq.cli-lustre-OST0000-super.width=65536 oleg123-server: oleg123-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all oleg123-server: oleg123-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg123-client: oleg123-server: ssh exited with exit code 1 pdsh@oleg123-client: oleg123-server: ssh exited with exit code 1 Started lustre-OST0001 Started lustre-OST0000 10:08:44 (1773670124) targets are mounted 10:08:44 (1773670124) facet_failover done oleg123-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid,osc.lustre-OST0001-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec osc.lustre-OST0001-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec Failing 2 CLIENTS Request fail clients: 2, to fail: 0, failed: 0 No clients failed! Test Lustre stability after CLIENT failure Reintegrating CLIENTS PASS 12 (77s) == insanity test 13: Thirteen Failure Mode: MDS0,MDS1/CLIENTS/OST0,OST1 Mon Mar 16 10:09:00 EDT 2026 ========================================================== 10:09:00 (1773670140) Verify Lustre filesystem is up and running Failing mds1 on oleg123-server Stopping /mnt/lustre-mds1 (opts:) on oleg123-server Failing mds2 on oleg123-server Stopping /mnt/lustre-mds2 (opts:) on oleg123-server 10:09:05 (1773670145) shut down facet: mds1 facet_host: oleg123-server facet_failover_host: oleg123-server facet: mds2 facet_host: oleg123-server facet_failover_host: oleg123-server Failover mds1 to oleg123-server mount facets: mds1 Failover mds2 to oleg123-server mount facets: mds2 Start mds2: mount -t lustre -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg123-server: oleg123-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all oleg123-server: oleg123-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg123-client: oleg123-server: ssh exited with exit code 1 pdsh@oleg123-client: oleg123-server: ssh exited with exit code 1 Started lustre-MDT0001 Started lustre-MDT0000 10:09:32 (1773670172) targets are mounted 10:09:32 (1773670172) facet_failover done oleg123-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid,mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec Failing 2 CLIENTS Request fail clients: 2, to fail: 0, failed: 0 No clients failed! Test Lustre stability after CLIENT failure Reintegrating CLIENTS Failing ost1 on oleg123-server Stopping /mnt/lustre-ost1 (opts:) on oleg123-server Failing ost2 on oleg123-server Stopping /mnt/lustre-ost2 (opts:) on oleg123-server 10:09:51 (1773670191) shut down facet: ost1 facet_host: oleg123-server facet_failover_host: oleg123-server facet: ost2 facet_host: oleg123-server facet_failover_host: oleg123-server Failover ost1 to oleg123-server mount facets: ost1 Failover ost2 to oleg123-server mount facets: ost2 Start ost2: mount -t lustre -o localrecov /dev/mapper/ost2_flakey /mnt/lustre-ost2 Start ost1: mount -t lustre -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0001-super.width=65536 seq.cli-lustre-OST0000-super.width=65536 oleg123-server: oleg123-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all oleg123-server: oleg123-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg123-client: oleg123-server: ssh exited with exit code 1 pdsh@oleg123-client: oleg123-server: ssh exited with exit code 1 Started lustre-OST0001 Started lustre-OST0000 10:10:09 (1773670209) targets are mounted 10:10:09 (1773670209) facet_failover done oleg123-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid,osc.lustre-OST0001-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec osc.lustre-OST0001-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec PASS 13 (78s) == insanity test 14: Fourteen Failure Mode: OST0,OST1/CLIENTS/MDS0,MDS1 Mon Mar 16 10:10:18 EDT 2026 ========================================================== 10:10:19 (1773670219) Verify Lustre filesystem is up and running Failing ost1 on oleg123-server Stopping /mnt/lustre-ost1 (opts:) on oleg123-server Failing ost2 on oleg123-server Stopping /mnt/lustre-ost2 (opts:) on oleg123-server 10:10:23 (1773670223) shut down facet: ost1 facet_host: oleg123-server facet_failover_host: oleg123-server facet: ost2 facet_host: oleg123-server facet_failover_host: oleg123-server Failover ost1 to oleg123-server mount facets: ost1 Failover ost2 to oleg123-server mount facets: ost2 Start ost1: mount -t lustre -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 Start ost2: mount -t lustre -o localrecov /dev/mapper/ost2_flakey /mnt/lustre-ost2 seq.cli-lustre-OST0001-super.width=65536 seq.cli-lustre-OST0000-super.width=65536 oleg123-server: oleg123-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all oleg123-server: oleg123-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg123-client: oleg123-server: ssh exited with exit code 1 pdsh@oleg123-client: oleg123-server: ssh exited with exit code 1 Started lustre-OST0001 Started lustre-OST0000 10:10:40 (1773670240) targets are mounted 10:10:40 (1773670240) facet_failover done oleg123-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid,osc.lustre-OST0001-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec osc.lustre-OST0001-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec Failing 2 CLIENTS Request fail clients: 2, to fail: 0, failed: 0 No clients failed! Test Lustre stability after CLIENT failure Reintegrating CLIENTS Failing mds1 on oleg123-server Stopping /mnt/lustre-mds1 (opts:) on oleg123-server Failing mds2 on oleg123-server Stopping /mnt/lustre-mds2 (opts:) on oleg123-server 10:10:59 (1773670259) shut down facet: mds1 facet_host: oleg123-server facet_failover_host: oleg123-server facet: mds2 facet_host: oleg123-server facet_failover_host: oleg123-server Failover mds1 to oleg123-server mount facets: mds1 Failover mds2 to oleg123-server mount facets: mds2 Start mds2: mount -t lustre -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg123-server: oleg123-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all oleg123-server: oleg123-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg123-client: oleg123-server: ssh exited with exit code 1 pdsh@oleg123-client: oleg123-server: ssh exited with exit code 1 Started lustre-MDT0001 Started lustre-MDT0000 10:11:25 (1773670285) targets are mounted 10:11:25 (1773670285) facet_failover done oleg123-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid,mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 14 (78s) == insanity test complete, duration 1965 sec ============= 10:11:36 (1773670296) === insanity: start cleanup 10:11:37 (1773670297) === === insanity: finish cleanup 10:11:38 (1773670298) ===