== insanity test 13: Thirteen Failure Mode: MDS0,MDS1/CLIENTS/OST0,OST1 Mon Mar 16 10:09:00 EDT 2026 ========================================================== 10:09:00 (1773670140) Verify Lustre filesystem is up and running Failing mds1 on oleg123-server Stopping /mnt/lustre-mds1 (opts:) on oleg123-server Failing mds2 on oleg123-server Stopping /mnt/lustre-mds2 (opts:) on oleg123-server 10:09:05 (1773670145) shut down facet: mds1 facet_host: oleg123-server facet_failover_host: oleg123-server facet: mds2 facet_host: oleg123-server facet_failover_host: oleg123-server Failover mds1 to oleg123-server mount facets: mds1 Failover mds2 to oleg123-server mount facets: mds2 Start mds2: mount -t lustre -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg123-server: oleg123-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all oleg123-server: oleg123-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg123-client: oleg123-server: ssh exited with exit code 1 pdsh@oleg123-client: oleg123-server: ssh exited with exit code 1 Started lustre-MDT0001 Started lustre-MDT0000 10:09:32 (1773670172) targets are mounted 10:09:32 (1773670172) facet_failover done oleg123-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid,mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec Failing 2 CLIENTS Request fail clients: 2, to fail: 0, failed: 0 No clients failed! Test Lustre stability after CLIENT failure Reintegrating CLIENTS Failing ost1 on oleg123-server Stopping /mnt/lustre-ost1 (opts:) on oleg123-server Failing ost2 on oleg123-server Stopping /mnt/lustre-ost2 (opts:) on oleg123-server 10:09:51 (1773670191) shut down facet: ost1 facet_host: oleg123-server facet_failover_host: oleg123-server facet: ost2 facet_host: oleg123-server facet_failover_host: oleg123-server Failover ost1 to oleg123-server mount facets: ost1 Failover ost2 to oleg123-server mount facets: ost2 Start ost2: mount -t lustre -o localrecov /dev/mapper/ost2_flakey /mnt/lustre-ost2 Start ost1: mount -t lustre -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0001-super.width=65536 seq.cli-lustre-OST0000-super.width=65536 oleg123-server: oleg123-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all oleg123-server: oleg123-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg123-client: oleg123-server: ssh exited with exit code 1 pdsh@oleg123-client: oleg123-server: ssh exited with exit code 1 Started lustre-OST0001 Started lustre-OST0000 10:10:09 (1773670209) targets are mounted 10:10:09 (1773670209) facet_failover done oleg123-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid,osc.lustre-OST0001-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec osc.lustre-OST0001-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec