-----============= acceptance-small: insanity ============----- Mon Mar 16 09:36:53 EDT 2026 mgs: Rocky Linux release 8.10 (Green Obsidian) MGS_OS_ID_LIKE=rhel centos fedora rocky MGS_OS_VERSION_ID=8.10 MGS_OS_ID=rocky MGS_OS_VERSION_CODE=134873088 mds1: Rocky Linux release 8.10 (Green Obsidian) MDS1_OS_VERSION_ID=8.10 MDS1_OS_VERSION_CODE=134873088 MDS1_OS_ID_LIKE=rhel centos fedora rocky MDS1_OS_ID=rocky ost1: Rocky Linux release 8.10 (Green Obsidian) OST1_OS_VERSION_CODE=134873088 OST1_OS_ID_LIKE=rhel centos fedora rocky OST1_OS_VERSION_ID=8.10 OST1_OS_ID=rocky client: Rocky Linux release 8.10 (Green Obsidian) CLIENT_OS_ID=rocky CLIENT_OS_VERSION_CODE=134873088 CLIENT_OS_VERSION_ID=8.10 CLIENT_OS_ID_LIKE=rhel centos fedora rocky oleg333-server: ls: cannot access '/home/green/git/lustre-release/lustre/tests/except/insanity.*ex': No such file or directory excepting tests: === insanity: start setup 09:36:58 (1773668218) === oleg333-client.virtnet: executing check_config_client /mnt/lustre oleg333-client.virtnet: Checking config lustre mounted on /mnt/lustre Checking servers environments Checking clients oleg333-client.virtnet environments Using TIMEOUT=20 osc.lustre-OST0000-osc-ffff8b74513a8800.idle_timeout=debug osc.lustre-OST0001-osc-ffff8b74513a8800.idle_timeout=debug disable quota as required oleg333-server: oleg333-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all === insanity: finish setup 09:37:08 (1773668228) === == insanity test 0: Fail all nodes, independently ======== 09:37:09 (1773668229) Failing mds1 on oleg333-server Stopping /mnt/lustre-mds1 (opts:) on oleg333-server 09:37:11 (1773668231) shut down facet: mds1 facet_host: oleg333-server facet_failover_host: oleg333-server Failover mds1 to oleg333-server mount facets: mds1 Start mds1: mount -t lustre -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg333-server: oleg333-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg333-client: oleg333-server: ssh exited with exit code 1 Started lustre-MDT0000 09:37:24 (1773668244) targets are mounted 09:37:24 (1773668244) facet_failover done oleg333-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Failing ost1 on oleg333-server Stopping /mnt/lustre-ost1 (opts:) on oleg333-server 09:37:30 (1773668250) shut down facet: ost1 facet_host: oleg333-server facet_failover_host: oleg333-server Failover ost1 to oleg333-server mount facets: ost1 Start ost1: mount -t lustre -o localrecov lustre-ost1/ost1 /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg333-server: oleg333-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg333-client: oleg333-server: ssh exited with exit code 1 Started lustre-OST0000 09:37:44 (1773668264) targets are mounted 09:37:44 (1773668264) facet_failover done oleg333-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec Failing ost2 on oleg333-server Stopping /mnt/lustre-ost2 (opts:) on oleg333-server 09:37:50 (1773668270) shut down facet: ost2 facet_host: oleg333-server facet_failover_host: oleg333-server Failover ost2 to oleg333-server mount facets: ost2 Start ost2: mount -t lustre -o localrecov lustre-ost2/ost2 /mnt/lustre-ost2 seq.cli-lustre-OST0001-super.width=65536 oleg333-server: oleg333-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg333-client: oleg333-server: ssh exited with exit code 1 Started lustre-OST0001 09:38:05 (1773668285) targets are mounted 09:38:05 (1773668285) facet_failover done oleg333-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0001-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0001-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec PASS 0 (61s) == insanity test 1: MDS/MDS failure ====================== 09:38:11 (1773668291) SKIP: insanity test_1 needs >= 2 MDTs SKIP 1 (2s) == insanity test 2: Second Failure Mode: MDS/OST Mon Mar 16 09:38:12 EDT 2026 ========================================================== 09:38:12 (1773668292) Verify Lustre filesystem is up and running Stopping /mnt/lustre-mds1 (opts:) on oleg333-server Failover mds1 to oleg333-server Stopping /mnt/lustre-ost1 (opts:) on oleg333-server Reintegrating OST oleg333-server.virtnet Start ost1: mount -t lustre -o localrecov lustre-ost1/ost1 /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg333-server: oleg333-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg333-client: oleg333-server: ssh exited with exit code 1 Started lustre-OST0000 oleg333-server.virtnet Start mds1: mount -t lustre -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg333-server: oleg333-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg333-client: oleg333-server: ssh exited with exit code 1 Started lustre-MDT0000 Verify reintegration PASS 2 (145s) == insanity test 3: Third Failure Mode: MDS/CLIENT Mon Mar 16 09:40:37 EDT 2026 ========================================================== 09:40:38 (1773668438) Verify Lustre filesystem is up and running Failing mds1 on oleg333-server Stopping /mnt/lustre-mds1 (opts:) on oleg333-server 09:40:40 (1773668440) shut down facet: mds1 facet_host: oleg333-server facet_failover_host: oleg333-server Failover mds1 to oleg333-server mount facets: mds1 Start mds1: mount -t lustre -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg333-server: oleg333-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg333-client: oleg333-server: ssh exited with exit code 1 Started lustre-MDT0000 09:40:54 (1773668454) targets are mounted 09:40:54 (1773668454) facet_failover done oleg333-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Test Lustre stability after MDS failover Failing 2 CLIENTS Request fail clients: 2, to fail: 0, failed: 0 No clients failed! Test Lustre stability after CLIENT failure Reintegrating CLIENTS PASS 3 (28s) == insanity test 4: Fourth Failure Mode: OST/MDS Mon Mar 16 09:41:06 EDT 2026 ========================================================== 09:41:07 (1773668467) Fourth Failure Mode: OST/MDS Mon Mar 16 09:41:07 EDT 2026 Stopping /mnt/lustre-ost1 (opts:) on oleg333-server Test Lustre stability after OST failure Stopping /mnt/lustre-mds1 (opts:) on oleg333-server Failover mds1 to oleg333-server Reintegrating OST oleg333-server.virtnet Start ost1: mount -t lustre -o localrecov lustre-ost1/ost1 /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg333-server: oleg333-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg333-client: oleg333-server: ssh exited with exit code 1 Started lustre-OST0000 oleg333-server.virtnet Start mds1: mount -t lustre -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg333-server: oleg333-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg333-client: oleg333-server: ssh exited with exit code 1 Started lustre-MDT0000 Test Lustre stability after MDS failover PASS 4 (155s) == insanity test 5: Fifth Failure Mode: OST/OST Mon Mar 16 09:43:41 EDT 2026 ========================================================== 09:43:41 (1773668621) Fifth Failure Mode: OST/OST Mon Mar 16 09:43:43 EDT 2026 Verify Lustre filesystem is up and running Stopping /mnt/lustre-ost1 (opts:) on oleg333-server Test Lustre stability after OST failure Stopping /mnt/lustre-ost2 (opts:) on oleg333-server Test Lustre stability after OST failure Reintegrating OSTs oleg333-server.virtnet Start ost1: mount -t lustre -o localrecov lustre-ost1/ost1 /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg333-server: oleg333-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg333-client: oleg333-server: ssh exited with exit code 1 Started lustre-OST0000 oleg333-server.virtnet Start ost2: mount -t lustre -o localrecov lustre-ost2/ost2 /mnt/lustre-ost2 seq.cli-lustre-OST0001-super.width=65536 oleg333-server: oleg333-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg333-client: oleg333-server: ssh exited with exit code 1 Started lustre-OST0001 PASS 5 (88s) == insanity test 6: Sixth Failure Mode: OST/CLIENT Mon Mar 16 09:45:09 EDT 2026 ========================================================== 09:45:09 (1773668709) Sixth Failure Mode: OST/CLIENT Mon Mar 16 09:45:10 EDT 2026 Verify Lustre filesystem is up and running Stopping /mnt/lustre-ost1 (opts:) on oleg333-server Test Lustre stability after OST failure DFPIDA=19042 Failing CLIENTs Request fail clients: , to fail: 0, failed: 0 No clients failed! Test Lustre stability after CLIENTs failure DFPIDB=19349 Reintegrating OST/CLIENTs oleg333-server.virtnet Start ost1: mount -t lustre -o localrecov lustre-ost1/ost1 /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg333-server: oleg333-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg333-client: oleg333-server: ssh exited with exit code 1 Started lustre-OST0000 Verifying mount PASS 6 (50s) == insanity test 7: Seventh Failure Mode: CLIENT/MDS Mon Mar 16 09:45:59 EDT 2026 ========================================================== 09:45:59 (1773668759) Seventh Failure Mode: CLIENT/MDS Mon Mar 16 09:46:00 EDT 2026 Verify Lustre filesystem is up and running Part 1: Failing CLIENT Request fail clients: 2, to fail: 0, failed: 0 No clients failed! Test Lustre stability after CLIENTs failure oleg333-client: total 1 oleg333-client: -rw-r--r-- 1 root root 0 Mar 16 09:46 oleg333-client.virtnet_testfile Wait 1 minutes Verify Lustre filesystem is up and running oleg333-client: rm: cannot remove '/mnt/lustre/d0.insanity/oleg333-client.virtnet_testfile': No such file or directory pdsh@oleg333-client: oleg333-client: ssh exited with exit code 1 Failing mds1 on oleg333-server Stopping /mnt/lustre-mds1 (opts:) on oleg333-server 09:47:10 (1773668830) shut down facet: mds1 facet_host: oleg333-server facet_failover_host: oleg333-server Failover mds1 to oleg333-server mount facets: mds1 Start mds1: mount -t lustre -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg333-server: oleg333-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg333-client: oleg333-server: ssh exited with exit code 1 Started lustre-MDT0000 09:47:29 (1773668849) targets are mounted 09:47:29 (1773668849) facet_failover done oleg333-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec oleg333-client: total 0 Reintegrating CLIENTs wait 1 minutes PASS 7 (163s) == insanity test 8: Eighth Failure Mode: CLIENT/OST Mon Mar 16 09:48:42 EDT 2026 ========================================================== 09:48:42 (1773668922) Eighth Failure Mode: CLIENT/OST Mon Mar 16 09:48:43 EDT 2026 Verify Lustre filesystem is up and running Failing CLIENTs Request fail clients: 2, to fail: 0, failed: 0 No clients failed! Test Lustre stability after CLIENTs failure oleg333-client: total 1 oleg333-client: -rw-r--r-- 1 root root 0 Mar 16 09:48 oleg333-client.virtnet_testfile Wait 1 minutes Verify Lustre filesystem is up and running Stopping /mnt/lustre-ost1 (opts:) on oleg333-server Test Lustre stability after OST failure Reintegrating CLIENTs/OST oleg333-server.virtnet Start ost1: mount -t lustre -o localrecov lustre-ost1/ost1 /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg333-server: oleg333-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg333-client: oleg333-server: ssh exited with exit code 1 Started lustre-OST0000 Wait 1 minutes PASS 8 (159s) == insanity test 9: Ninth Failure Mode: CLIENT/CLIENT Mon Mar 16 09:51:21 EDT 2026 ========================================================== 09:51:21 (1773669081) Verify Lustre filesystem is up and running Failing CLIENTs Request fail clients: 2, to fail: 0, failed: 0 No clients failed! Test Lustre stability after CLIENTs failure oleg333-client: total 1 oleg333-client: -rw-r--r-- 1 root root 0 Mar 16 09:51 oleg333-client.virtnet_testfile oleg333-client: -rw-r--r-- 1 root root 0 Mar 16 09:50 oleg333-client.virtnet_testfile2 Wait 1 minutes Verify Lustre filesystem is up and running Failing CLIENTs Request fail clients: 2, to fail: 0, failed: 0 No clients failed! Test Lustre stability after CLIENTs failure oleg333-client: total 1 oleg333-client: -rw-r--r-- 1 root root 0 Mar 16 09:52 oleg333-client.virtnet_testfile oleg333-client: -rw-r--r-- 1 root root 0 Mar 16 09:50 oleg333-client.virtnet_testfile2 Reintegrating CLIENTs/CLIENTs Wait 1 minutes PASS 9 (142s) == insanity test 10: Tenth Failure Mode: MDT0/OST/MDT1 Mon Mar 16 09:53:43 EDT 2026 ========================================================== 09:53:44 (1773669224) SKIP: insanity test_10 needs >= 2 MDTs SKIP 10 (4s) == insanity test 11: Eleventh Failure Mode: MDS0/CLIENT/MDS1 Mon Mar 16 09:53:47 EDT 2026 ========================================================== 09:53:47 (1773669227) SKIP: insanity test_11 needs >= 2 MDTs SKIP 11 (4s) == insanity test 12: Twelve Failure Mode: MDS0,MDS1/OST0, OST1/CLIENTS Mon Mar 16 09:53:51 EDT 2026 ========================================================== 09:53:51 (1773669231) SKIP: insanity test_12 needs >= 2 MDTs SKIP 12 (3s) == insanity test 13: Thirteen Failure Mode: MDS0,MDS1/CLIENTS/OST0,OST1 Mon Mar 16 09:53:54 EDT 2026 ========================================================== 09:53:54 (1773669234) SKIP: insanity test_13 needs >= 2 MDTs SKIP 13 (3s) == insanity test 14: Fourteen Failure Mode: OST0,OST1/CLIENTS/MDS0,MDS1 Mon Mar 16 09:53:57 EDT 2026 ========================================================== 09:53:57 (1773669237) SKIP: insanity test_14 needs >= 2 MDTs SKIP 14 (3s) == insanity test complete, duration 1027 sec ============= 09:54:00 (1773669240) === insanity: start cleanup 09:54:02 (1773669242) === === insanity: finish cleanup 09:54:05 (1773669245) ===