-----============= acceptance-small: replay-single ============----- Tue Apr 1 03:44:15 EDT 2025 mgs: Rocky Linux release 8.10 (Green Obsidian) MGS_OS_ID_LIKE=rhel centos fedora rocky MGS_OS_VERSION_ID=8.10 MGS_OS_ID=rocky MGS_OS_VERSION_CODE=134873088 mds1: Rocky Linux release 8.10 (Green Obsidian) MDS1_OS_VERSION_ID=8.10 MDS1_OS_VERSION_CODE=134873088 MDS1_OS_ID_LIKE=rhel centos fedora rocky MDS1_OS_ID=rocky ost1: Rocky Linux release 8.10 (Green Obsidian) OST1_OS_VERSION_CODE=134873088 OST1_OS_ID_LIKE=rhel centos fedora rocky OST1_OS_VERSION_ID=8.10 OST1_OS_ID=rocky client: Rocky Linux release 8.10 (Green Obsidian) CLIENT_OS_ID=rocky CLIENT_OS_VERSION_CODE=134873088 CLIENT_OS_VERSION_ID=8.10 CLIENT_OS_ID_LIKE=rhel centos fedora rocky oleg631-server: ls: cannot access '/home/green/git/lustre-release/lustre/tests/except/replay-single.*ex': No such file or directory excepting tests: 110f 131b 59 36 === replay-single: start setup 03:45:03 (1743493503) === oleg631-client.virtnet: executing check_config_client /mnt/lustre oleg631-client.virtnet: Checking config lustre mounted on /mnt/lustre Checking servers environments Checking clients oleg631-client.virtnet environments Using TIMEOUT=20 osc.lustre-OST0000-osc-ffff899606cd1000.idle_timeout=debug osc.lustre-OST0001-osc-ffff899606cd1000.idle_timeout=debug disable quota as required oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all === replay-single: finish setup 03:46:12 (1743493572) === == replay-single test 0a: empty replay =================== 03:46:17 (1743493577) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3200 2205440 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing mds1 on oleg631-server Stopping /mnt/lustre-mds1 (opts:) on oleg631-server 03:46:37 (1743493597) shut down facet: mds1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover mds1 to oleg631-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 03:47:34 (1743493654) targets are mounted 03:47:34 (1743493654) facet_failover done oleg631-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 0a (113s) == replay-single test 0b: ensure object created after recover exists. (3284) ========================================================== 03:48:10 (1743493690) Failing ost1 on oleg631-server Stopping /mnt/lustre-ost1 (opts:) on oleg631-server 03:48:24 (1743493704) shut down facet: ost1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover ost1 to oleg631-server mount facets: ost1 Starting ost1: -o localrecov lustre-ost1/ost1 /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-OST0000 03:49:06 (1743493746) targets are mounted 03:49:06 (1743493746) facet_failover done oleg631-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec total: 20 open/close in 0.91 seconds: 21.88 ops/second - unlinked 0 (time 1743493769 ; total 0 ; last 0) total: 20 unlinks in 0 seconds: inf unlinks/second PASS 0b (93s) == replay-single test 0c: check replay-barrier =========== 03:49:43 (1743493783) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3200 2205440 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing mds1 on oleg631-server Stopping /mnt/lustre-mds1 (opts:) on oleg631-server 03:50:07 (1743493807) shut down facet: mds1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover mds1 to oleg631-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 03:51:02 (1743493862) targets are mounted 03:51:02 (1743493862) facet_failover done Starting client: oleg631-client.virtnet: -o user_xattr,flock 192.168.206.131@tcp:/lustre /mnt/lustre rm: cannot remove '/mnt/lustre/f0c.replay-single': No such file or directory PASS 0c (160s) == replay-single test 0d: expired recovery with no clients ========================================================== 03:52:23 (1743493943) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3200 2205440 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing mds1 on oleg631-server Stopping /mnt/lustre-mds1 (opts:) on oleg631-server 03:52:46 (1743493966) shut down facet: mds1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover mds1 to oleg631-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 03:53:40 (1743494020) targets are mounted 03:53:40 (1743494020) facet_failover done Starting client: oleg631-client.virtnet: -o user_xattr,flock 192.168.206.131@tcp:/lustre /mnt/lustre PASS 0d (158s) == replay-single test 1: simple create =================== 03:55:02 (1743494102) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3200 2205440 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing mds1 on oleg631-server Stopping /mnt/lustre-mds1 (opts:) on oleg631-server 03:55:24 (1743494124) shut down facet: mds1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover mds1 to oleg631-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 03:56:20 (1743494180) targets are mounted 03:56:20 (1743494180) facet_failover done oleg631-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/f1.replay-single has type file OK PASS 1 (112s) == replay-single test 2a: touch ========================== 03:56:53 (1743494213) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3200 2205440 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing mds1 on oleg631-server Stopping /mnt/lustre-mds1 (opts:) on oleg631-server 03:57:14 (1743494234) shut down facet: mds1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover mds1 to oleg631-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 03:58:08 (1743494288) targets are mounted 03:58:08 (1743494288) facet_failover done oleg631-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/f2a.replay-single has type file OK PASS 2a (107s) == replay-single test 2b: touch ========================== 03:58:41 (1743494321) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3200 2205440 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing mds1 on oleg631-server Stopping /mnt/lustre-mds1 (opts:) on oleg631-server 03:59:02 (1743494342) shut down facet: mds1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover mds1 to oleg631-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 03:59:51 (1743494391) targets are mounted 03:59:51 (1743494391) facet_failover done oleg631-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/f2b.replay-single has type file OK PASS 2b (101s) == replay-single test 2c: setstripe replay =============== 04:00:22 (1743494422) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3200 2205440 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing mds1 on oleg631-server Stopping /mnt/lustre-mds1 (opts:) on oleg631-server 04:00:44 (1743494444) shut down facet: mds1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover mds1 to oleg631-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 04:01:35 (1743494495) targets are mounted 04:01:35 (1743494495) facet_failover done oleg631-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/f2c.replay-single has type file OK PASS 2c (103s) == replay-single test 2d: setdirstripe replay ============ 04:02:04 (1743494524) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3200 2205440 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing mds1 on oleg631-server Stopping /mnt/lustre-mds1 (opts:) on oleg631-server 04:02:25 (1743494545) shut down facet: mds1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover mds1 to oleg631-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 04:03:15 (1743494595) targets are mounted 04:03:15 (1743494595) facet_failover done oleg631-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/d2d.replay-single has type dir OK PASS 2d (111s) == replay-single test 2e: O_CREAT|O_EXCL create replay === 04:03:56 (1743494636) fail_loc=0x8000013b UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3200 2205440 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing mds1 on oleg631-server Stopping /mnt/lustre-mds1 (opts:) on oleg631-server 04:04:20 (1743494660) shut down facet: mds1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover mds1 to oleg631-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 Succeed in opening file "/mnt/lustre/f2e.replay-single"(flags=O_CREAT) oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 04:05:00 (1743494700) targets are mounted 04:05:00 (1743494700) facet_failover done oleg631-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/f2e.replay-single has type file OK PASS 2e (100s) == replay-single test 3a: replay failed open(O_DIRECTORY) ========================================================== 04:05:35 (1743494735) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3200 2205440 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Error in opening file "/mnt/lustre/f3a.replay-single"(flags=O_DIRECTORY) 20: Not a directory Failing mds1 on oleg631-server Stopping /mnt/lustre-mds1 (opts:) on oleg631-server 04:05:58 (1743494758) shut down facet: mds1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover mds1 to oleg631-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 04:06:37 (1743494797) targets are mounted 04:06:37 (1743494797) facet_failover done oleg631-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/f3a.replay-single has type file OK PASS 3a (91s) == replay-single test 3b: replay failed open -ENOMEM ===== 04:07:06 (1743494826) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3200 2205440 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre fail_loc=0x80000114 touch: cannot touch '/mnt/lustre/f3b.replay-single': Cannot allocate memory fail_loc=0 Failing mds1 on oleg631-server Stopping /mnt/lustre-mds1 (opts:) on oleg631-server 04:07:31 (1743494851) shut down facet: mds1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover mds1 to oleg631-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 04:08:25 (1743494905) targets are mounted 04:08:25 (1743494905) facet_failover done oleg631-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Can't lstat /mnt/lustre/f3b.replay-single: No such file or directory PASS 3b (113s) == replay-single test 3c: replay failed open -ENOMEM ===== 04:08:59 (1743494939) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3200 2205440 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre fail_loc=0x80000128 touch: cannot touch '/mnt/lustre/f3c.replay-single': Cannot allocate memory fail_loc=0 Failing mds1 on oleg631-server Stopping /mnt/lustre-mds1 (opts:) on oleg631-server 04:09:25 (1743494965) shut down facet: mds1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover mds1 to oleg631-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 04:10:18 (1743495018) targets are mounted 04:10:18 (1743495018) facet_failover done oleg631-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Can't lstat /mnt/lustre/f3c.replay-single: No such file or directory PASS 3c (108s) == replay-single test 4a: |x| 10 open(O_CREAT)s ========== 04:10:48 (1743495048) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3200 2205440 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing mds1 on oleg631-server Stopping /mnt/lustre-mds1 (opts:) on oleg631-server 04:11:08 (1743495068) shut down facet: mds1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover mds1 to oleg631-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 04:12:00 (1743495120) targets are mounted 04:12:00 (1743495120) facet_failover done oleg631-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 4a (103s) == replay-single test 4b: |x| rm 10 files ================ 04:12:31 (1743495151) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3328 2205312 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing mds1 on oleg631-server Stopping /mnt/lustre-mds1 (opts:) on oleg631-server 04:12:51 (1743495171) shut down facet: mds1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover mds1 to oleg631-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 04:13:41 (1743495221) targets are mounted 04:13:41 (1743495221) facet_failover done oleg631-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Can't lstat /mnt/lustre/f4b.replay-single-*: No such file or directory PASS 4b (101s) == replay-single test 5: |x| 220 open(O_CREAT) =========== 04:14:11 (1743495251) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3328 2205312 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing mds1 on oleg631-server Stopping /mnt/lustre-mds1 (opts:) on oleg631-server 04:14:46 (1743495286) shut down facet: mds1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover mds1 to oleg631-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 04:15:39 (1743495339) targets are mounted 04:15:39 (1743495339) facet_failover done oleg631-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 5 (154s) == replay-single test 6a: mkdir + contained create ======= 04:16:45 (1743495405) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3328 2205312 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing mds1 on oleg631-server Stopping /mnt/lustre-mds1 (opts:) on oleg631-server 04:17:07 (1743495427) shut down facet: mds1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover mds1 to oleg631-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 04:17:57 (1743495477) targets are mounted 04:17:57 (1743495477) facet_failover done oleg631-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/d6a.replay-single has type dir OK /mnt/lustre/d6a.replay-single/f6a.replay-single has type file OK PASS 6a (102s) == replay-single test 6b: |X| rmdir ====================== 04:18:27 (1743495507) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3328 2205312 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing mds1 on oleg631-server Stopping /mnt/lustre-mds1 (opts:) on oleg631-server 04:18:49 (1743495529) shut down facet: mds1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover mds1 to oleg631-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 04:19:27 (1743495567) targets are mounted 04:19:27 (1743495567) facet_failover done oleg631-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Can't lstat /mnt/lustre/d6b.replay-single: No such file or directory PASS 6b (88s) == replay-single test 7: mkdir |X| contained create ====== 04:19:56 (1743495596) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3328 2205312 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing mds1 on oleg631-server Stopping /mnt/lustre-mds1 (opts:) on oleg631-server 04:20:16 (1743495616) shut down facet: mds1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover mds1 to oleg631-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 04:20:56 (1743495656) targets are mounted 04:20:56 (1743495656) facet_failover done oleg631-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/d7.replay-single has type dir OK /mnt/lustre/d7.replay-single/f7.replay-single has type file OK PASS 7 (87s) == replay-single test 8: creat open |X| close ============ 04:21:24 (1743495684) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3328 2205312 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre multiop /mnt/lustre/f8.replay-single vmo_c TMPPIPE=/tmp/multiop_open_wait_pipe.7367 Failing mds1 on oleg631-server Stopping /mnt/lustre-mds1 (opts:) on oleg631-server 04:21:46 (1743495706) shut down facet: mds1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover mds1 to oleg631-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 04:22:25 (1743495745) targets are mounted 04:22:25 (1743495745) facet_failover done oleg631-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/f8.replay-single /mnt/lustre/f8.replay-single has type file OK PASS 8 (89s) == replay-single test 9: |X| create (same inum/gen) ====== 04:22:53 (1743495773) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3328 2205312 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing mds1 on oleg631-server Stopping /mnt/lustre-mds1 (opts:) on oleg631-server 04:23:14 (1743495794) shut down facet: mds1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover mds1 to oleg631-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 04:23:52 (1743495832) targets are mounted 04:23:52 (1743495832) facet_failover done oleg631-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec old_inum == 144115305935798546, new_inum == 144115305935798546 old_inum and new_inum match PASS 9 (89s) == replay-single test 10: create |X| rename unlink ======= 04:24:23 (1743495863) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3328 2205312 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing mds1 on oleg631-server Stopping /mnt/lustre-mds1 (opts:) on oleg631-server 04:24:45 (1743495885) shut down facet: mds1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover mds1 to oleg631-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 04:25:24 (1743495924) targets are mounted 04:25:24 (1743495924) facet_failover done oleg631-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Can't lstat /mnt/lustre/f10.replay-single: No such file or directory PASS 10 (91s) == replay-single test 11: create open write rename |X| create-old-name read ========================================================== 04:25:54 (1743495954) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3328 2205312 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre new old Failing mds1 on oleg631-server Stopping /mnt/lustre-mds1 (opts:) on oleg631-server 04:26:16 (1743495976) shut down facet: mds1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover mds1 to oleg631-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 04:27:05 (1743496025) targets are mounted 04:27:05 (1743496025) facet_failover done oleg631-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec new old PASS 11 (100s) == replay-single test 12: open, unlink |X| close ========= 04:27:34 (1743496054) multiop /mnt/lustre/f12.replay-single vo_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7367 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3328 2205312 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing mds1 on oleg631-server Stopping /mnt/lustre-mds1 (opts:) on oleg631-server 04:27:54 (1743496074) shut down facet: mds1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover mds1 to oleg631-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 04:28:32 (1743496112) targets are mounted 04:28:32 (1743496112) facet_failover done oleg631-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 12 (84s) == replay-single test 13: open chmod 0 |x| write close === 04:28:59 (1743496139) multiop /mnt/lustre/f13.replay-single vO_wc TMPPIPE=/tmp/multiop_open_wait_pipe.7367 /mnt/lustre/f13.replay-single has perms 00 OK UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210560 3200 2205312 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing mds1 on oleg631-server Stopping /mnt/lustre-mds1 (opts:) on oleg631-server 04:29:18 (1743496158) shut down facet: mds1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover mds1 to oleg631-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 04:29:57 (1743496197) targets are mounted 04:29:57 (1743496197) facet_failover done oleg631-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/f13.replay-single has perms 00 OK /mnt/lustre/f13.replay-single has size 1 OK PASS 13 (93s) == replay-single test 14: open(O_CREAT), unlink |X| close ========================================================== 04:30:32 (1743496232) multiop /mnt/lustre/f14.replay-single vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7367 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210560 3200 2205312 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing mds1 on oleg631-server Stopping /mnt/lustre-mds1 (opts:) on oleg631-server 04:30:54 (1743496254) shut down facet: mds1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover mds1 to oleg631-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 04:31:45 (1743496305) targets are mounted 04:31:45 (1743496305) facet_failover done oleg631-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 14 (102s) == replay-single test 15: open(O_CREAT), unlink |X| touch new, close ========================================================== 04:32:13 (1743496333) multiop /mnt/lustre/f15.replay-single vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7367 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210176 3200 2204928 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing mds1 on oleg631-server Stopping /mnt/lustre-mds1 (opts:) on oleg631-server 04:32:33 (1743496353) shut down facet: mds1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover mds1 to oleg631-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 04:33:13 (1743496393) targets are mounted 04:33:13 (1743496393) facet_failover done oleg631-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 15 (90s) == replay-single test 16: |X| open(O_CREAT), unlink, touch new, unlink new ========================================================== 04:33:43 (1743496423) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210560 3200 2205312 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing mds1 on oleg631-server Stopping /mnt/lustre-mds1 (opts:) on oleg631-server 04:34:04 (1743496444) shut down facet: mds1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover mds1 to oleg631-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 04:34:42 (1743496482) targets are mounted 04:34:42 (1743496482) facet_failover done oleg631-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 16 (87s) == replay-single test 17: |X| open(O_CREAT), |replay| close ========================================================== 04:35:11 (1743496511) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210560 3200 2205312 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre multiop /mnt/lustre/f17.replay-single vO_c TMPPIPE=/tmp/multiop_open_wait_pipe.7367 Failing mds1 on oleg631-server Stopping /mnt/lustre-mds1 (opts:) on oleg631-server 04:35:32 (1743496532) shut down facet: mds1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover mds1 to oleg631-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 04:36:25 (1743496585) targets are mounted 04:36:25 (1743496585) facet_failover done oleg631-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/f17.replay-single has type file OK PASS 17 (106s) == replay-single test 18: open(O_CREAT), unlink, touch new, close, touch, unlink ========================================================== 04:36:56 (1743496616) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210560 3200 2205312 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre multiop /mnt/lustre/f18.replay-single vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7367 pid: 49225 will close Failing mds1 on oleg631-server Stopping /mnt/lustre-mds1 (opts:) on oleg631-server 04:37:15 (1743496635) shut down facet: mds1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover mds1 to oleg631-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 04:37:53 (1743496673) targets are mounted 04:37:53 (1743496673) facet_failover done oleg631-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 18 (86s) == replay-single test 19: mcreate, open, write, rename === 04:38:23 (1743496703) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210560 3200 2205312 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre old Failing mds1 on oleg631-server Stopping /mnt/lustre-mds1 (opts:) on oleg631-server 04:38:44 (1743496724) shut down facet: mds1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover mds1 to oleg631-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 04:39:38 (1743496778) targets are mounted 04:39:38 (1743496778) facet_failover done oleg631-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec old PASS 19 (109s) == replay-single test 20a: |X| open(O_CREAT), unlink, replay, close (test mds_cleanup_orphans) ========================================================== 04:40:12 (1743496812) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210560 3200 2205312 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre multiop /mnt/lustre/f20a.replay-single vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7367 Failing mds1 on oleg631-server Stopping /mnt/lustre-mds1 (opts:) on oleg631-server 04:40:33 (1743496833) shut down facet: mds1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover mds1 to oleg631-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 04:41:24 (1743496884) targets are mounted 04:41:24 (1743496884) facet_failover done oleg631-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 20a (101s) == replay-single test 20b: write, unlink, eviction, replay (test mds_cleanup_orphans) ========================================================== 04:41:53 (1743496913) /mnt/lustre/f20b.replay-single lmm_stripe_count: 1 lmm_stripe_size: 4194304 lmm_pattern: raid0 lmm_layout_gen: 0 lmm_stripe_offset: 0 obdidx objid objid group 0 1090 0x442 0x240000400 dd: error writing '/mnt/lustre/f20b.replay-single': Cannot send after transport endpoint shutdown pdsh@oleg631-client: oleg631-client: ssh exited with exit code 5 3926+0 records in 3925+0 records out 16076800 bytes (16 MB, 15 MiB) copied, 4.96698 s, 3.2 MB/s Failing mds1 on oleg631-server Stopping /mnt/lustre-mds1 (opts:) on oleg631-server 04:42:22 (1743496942) shut down facet: mds1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover mds1 to oleg631-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 04:43:11 (1743496991) targets are mounted 04:43:11 (1743496991) facet_failover done oleg631-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec affected facets: mds1 oleg631-server: oleg631-server.virtnet: executing _wait_recovery_complete *.lustre-MDT0000.recovery_status 1475 oleg631-server: *.lustre-MDT0000.recovery_status status: COMPLETE sleep 5 for ZFS MDS Waiting for MDT destroys to complete before 6144, after 6144 PASS 20b (133s) == replay-single test 20c: check that client eviction does not affect file content ========================================================== 04:44:06 (1743497046) multiop /mnt/lustre/f20c.replay-single vOw_c TMPPIPE=/tmp/multiop_open_wait_pipe.7367 -rw-r--r-- 1 root root 1 Apr 1 04:44 /mnt/lustre/f20c.replay-single PASS 20c (18s) == replay-single test 21: |X| open(O_CREAT), unlink touch new, replay, close (test mds_cleanup_orphans) ========================================================== 04:44:25 (1743497065) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210560 3200 2205312 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre multiop /mnt/lustre/f21.replay-single vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7367 Failing mds1 on oleg631-server Stopping /mnt/lustre-mds1 (opts:) on oleg631-server 04:44:45 (1743497085) shut down facet: mds1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover mds1 to oleg631-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 04:45:23 (1743497123) targets are mounted 04:45:23 (1743497123) facet_failover done oleg631-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 21 (84s) == replay-single test 22: open(O_CREAT), |X| unlink, replay, close (test mds_cleanup_orphans) ========================================================== 04:45:50 (1743497150) multiop /mnt/lustre/f22.replay-single vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7367 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210560 3200 2205312 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing mds1 on oleg631-server Stopping /mnt/lustre-mds1 (opts:) on oleg631-server 04:46:10 (1743497170) shut down facet: mds1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover mds1 to oleg631-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 04:47:00 (1743497220) targets are mounted 04:47:00 (1743497220) facet_failover done oleg631-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 22 (99s) == replay-single test 23: open(O_CREAT), |X| unlink touch new, replay, close (test mds_cleanup_orphans) ========================================================== 04:47:29 (1743497249) multiop /mnt/lustre/f23.replay-single vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7367 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210560 3200 2205312 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing mds1 on oleg631-server Stopping /mnt/lustre-mds1 (opts:) on oleg631-server 04:47:48 (1743497268) shut down facet: mds1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover mds1 to oleg631-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 04:48:37 (1743497317) targets are mounted 04:48:37 (1743497317) facet_failover done oleg631-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 23 (94s) == replay-single test 24: open(O_CREAT), replay, unlink, close (test mds_cleanup_orphans) ========================================================== 04:49:03 (1743497343) multiop /mnt/lustre/f24.replay-single vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7367 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210560 3328 2205184 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing mds1 on oleg631-server Stopping /mnt/lustre-mds1 (opts:) on oleg631-server 04:49:22 (1743497362) shut down facet: mds1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover mds1 to oleg631-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 04:49:59 (1743497399) targets are mounted 04:49:59 (1743497399) facet_failover done oleg631-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 24 (82s) == replay-single test 25: open(O_CREAT), unlink, replay, close (test mds_cleanup_orphans) ========================================================== 04:50:25 (1743497425) multiop /mnt/lustre/f25.replay-single vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7367 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210560 3328 2205184 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing mds1 on oleg631-server Stopping /mnt/lustre-mds1 (opts:) on oleg631-server 04:50:44 (1743497444) shut down facet: mds1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover mds1 to oleg631-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 04:51:23 (1743497483) targets are mounted 04:51:23 (1743497483) facet_failover done oleg631-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 25 (84s) == replay-single test 26: |X| open(O_CREAT), unlink two, close one, replay, close one (test mds_cleanup_orphans) ========================================================== 04:51:49 (1743497509) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210560 3328 2205184 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre multiop /mnt/lustre/f26.replay-single-1 vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7367 multiop /mnt/lustre/f26.replay-single-2 vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7367 Failing mds1 on oleg631-server Stopping /mnt/lustre-mds1 (opts:) on oleg631-server 04:52:06 (1743497526) shut down facet: mds1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover mds1 to oleg631-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 04:52:44 (1743497564) targets are mounted 04:52:44 (1743497564) facet_failover done oleg631-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 26 (80s) == replay-single test 27: |X| open(O_CREAT), unlink two, replay, close two (test mds_cleanup_orphans) ========================================================== 04:53:10 (1743497590) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210560 3328 2205184 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre multiop /mnt/lustre/f27.replay-single-1 vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7367 multiop /mnt/lustre/f27.replay-single-2 vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7367 Failing mds1 on oleg631-server Stopping /mnt/lustre-mds1 (opts:) on oleg631-server 04:53:27 (1743497607) shut down facet: mds1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover mds1 to oleg631-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 04:54:02 (1743497642) targets are mounted 04:54:02 (1743497642) facet_failover done oleg631-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 27 (79s) == replay-single test 28: open(O_CREAT), |X| unlink two, close one, replay, close one (test mds_cleanup_orphans) ========================================================== 04:54:28 (1743497668) multiop /mnt/lustre/f28.replay-single-1 vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7367 multiop /mnt/lustre/f28.replay-single-2 vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7367 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210560 3328 2205184 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing mds1 on oleg631-server Stopping /mnt/lustre-mds1 (opts:) on oleg631-server 04:54:47 (1743497687) shut down facet: mds1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover mds1 to oleg631-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 04:55:24 (1743497724) targets are mounted 04:55:24 (1743497724) facet_failover done oleg631-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 28 (80s) == replay-single test 29: open(O_CREAT), |X| unlink two, replay, close two (test mds_cleanup_orphans) ========================================================== 04:55:48 (1743497748) multiop /mnt/lustre/f29.replay-single-1 vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7367 multiop /mnt/lustre/f29.replay-single-2 vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7367 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210560 3328 2205184 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing mds1 on oleg631-server Stopping /mnt/lustre-mds1 (opts:) on oleg631-server 04:56:03 (1743497763) shut down facet: mds1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover mds1 to oleg631-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 04:56:41 (1743497801) targets are mounted 04:56:41 (1743497801) facet_failover done oleg631-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 29 (78s) == replay-single test 30: open(O_CREAT) two, unlink two, replay, close two (test mds_cleanup_orphans) ========================================================== 04:57:07 (1743497827) multiop /mnt/lustre/f30.replay-single-1 vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7367 multiop /mnt/lustre/f30.replay-single-2 vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7367 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210560 3328 2205184 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing mds1 on oleg631-server Stopping /mnt/lustre-mds1 (opts:) on oleg631-server 04:57:23 (1743497843) shut down facet: mds1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover mds1 to oleg631-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 04:57:58 (1743497878) targets are mounted 04:57:58 (1743497878) facet_failover done oleg631-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 30 (76s) == replay-single test 31: open(O_CREAT) two, unlink one, |X| unlink one, close two (test mds_cleanup_orphans) ========================================================== 04:58:22 (1743497902) multiop /mnt/lustre/f31.replay-single-1 vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7367 multiop /mnt/lustre/f31.replay-single-2 vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7367 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210560 3328 2205184 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing mds1 on oleg631-server Stopping /mnt/lustre-mds1 (opts:) on oleg631-server 04:58:41 (1743497921) shut down facet: mds1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover mds1 to oleg631-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 04:59:15 (1743497955) targets are mounted 04:59:15 (1743497955) facet_failover done oleg631-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 31 (78s) == replay-single test 32: close() notices client eviction; close() after client eviction ========================================================== 04:59:40 (1743497980) multiop /mnt/lustre/f32.replay-single vO_c TMPPIPE=/tmp/multiop_open_wait_pipe.7367 multiop /mnt/lustre/f32.replay-single vO_c TMPPIPE=/tmp/multiop_open_wait_pipe.7367 PASS 32 (15s) == replay-single test 33a: fid seq shouldn't be reused after abort recovery ========================================================== 04:59:56 (1743497996) total: 10 open/close in 0.48 seconds: 20.97 ops/second Replay barrier on lustre-MDT0000 Stopping /mnt/lustre-mds1 (opts:) on oleg631-server Failover mds1 to oleg631-server oleg631-server.virtnet Starting mds1: -o localrecov -o abort_recovery lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 total: 10 open/close in 0.40 seconds: 25.22 ops/second PASS 33a (49s) == replay-single test 33b: test fid seq allocation ======= 05:00:45 (1743498045) fail_loc=0x1311 total: 10 open/close in 0.38 seconds: 26.51 ops/second Replay barrier on lustre-MDT0000 Stopping /mnt/lustre-mds1 (opts:) on oleg631-server Failover mds1 to oleg631-server oleg631-server.virtnet Starting mds1: -o localrecov -o abort_recovery lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 total: 10 open/close in 0.33 seconds: 29.99 ops/second PASS 33b (51s) == replay-single test 34: abort recovery before client does replay (test mds_cleanup_orphans) ========================================================== 05:01:36 (1743498096) multiop /mnt/lustre/f34.replay-single vO_c TMPPIPE=/tmp/multiop_open_wait_pipe.7367 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210560 3456 2205056 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Stopping /mnt/lustre-mds1 (opts:) on oleg631-server Failover mds1 to oleg631-server oleg631-server.virtnet Starting mds1: -o localrecov -o abort_recovery lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 PASS 34 (48s) == replay-single test 35: test recovery from llog for unlink op ========================================================== 05:02:24 (1743498144) fail_loc=0x80000119 Stopping /mnt/lustre-mds1 (opts:) on oleg631-server Failover mds1 to oleg631-server oleg631-server.virtnet Starting mds1: -o localrecov -o abort_recovery lustre-mdt1/mdt1 /mnt/lustre-mds1 rm: cannot remove '/mnt/lustre/f35.replay-single': Input/output error oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 Can't lstat /mnt/lustre/f35.replay-single: No such file or directory PASS 35 (48s) SKIP: replay-single test_36 skipping ALWAYS excluded test 36 == replay-single test 37: abort recovery before client does replay (test mds_cleanup_orphans for directories) ========================================================== 05:03:15 (1743498195) multiop /mnt/lustre/d37.replay-single/f37.replay-single vdD_c TMPPIPE=/tmp/multiop_open_wait_pipe.7367 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210560 3456 2205056 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Stopping /mnt/lustre-mds1 (opts:) on oleg631-server Failover mds1 to oleg631-server oleg631-server.virtnet Starting mds1: -o localrecov -o abort_recovery lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 PASS 37 (50s) == replay-single test 38: test recovery from unlink llog (test llog_gen_rec) ========================================================== 05:04:05 (1743498245) - open/close 310 (time 1743498262.27 total 10.01 last 30.97) - open/close 659 (time 1743498272.27 total 20.01 last 34.90) total: 800 open/close in 24.27 seconds: 32.97 ops/second - unlinked 0 (time 1743498283 ; total 0 ; last 0) total: 400 unlinks in 9 seconds: 44.444443 unlinks/second UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210560 3712 2204800 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing mds1 on oleg631-server Stopping /mnt/lustre-mds1 (opts:) on oleg631-server 05:05:07 (1743498307) shut down facet: mds1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover mds1 to oleg631-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 05:05:45 (1743498345) targets are mounted 05:05:45 (1743498345) facet_failover done oleg631-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec - unlinked 0 (time 1743498364 ; total 0 ; last 0) total: 400 unlinks in 12 seconds: 33.333332 unlinks/second Can't lstat /mnt/lustre/f38.replay-single-*: No such file or directory PASS 38 (145s) == replay-single test 39: test recovery from unlink llog (test llog_gen_rec) ========================================================== 05:06:31 (1743498391) - open/close 347 (time 1743498407.21 total 10.03 last 34.58) - open/close 689 (time 1743498417.21 total 20.03 last 34.20) total: 800 open/close in 23.07 seconds: 34.68 ops/second UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210432 3712 2204672 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre - unlinked 0 (time 1743498432 ; total 0 ; last 0) total: 400 unlinks in 8 seconds: 50.000000 unlinks/second Failing mds1 on oleg631-server Stopping /mnt/lustre-mds1 (opts:) on oleg631-server 05:07:30 (1743498450) shut down facet: mds1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover mds1 to oleg631-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 05:08:08 (1743498488) targets are mounted 05:08:08 (1743498488) facet_failover done oleg631-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec - unlinked 0 (time 1743498507 ; total 0 ; last 0) total: 400 unlinks in 12 seconds: 33.333332 unlinks/second Can't lstat /mnt/lustre/f39.replay-single-*: No such file or directory PASS 39 (142s) == replay-single test 41: read from a valid osc while other oscs are invalid ========================================================== 05:08:54 (1743498534) 1+0 records in 1+0 records out 4096 bytes (4.1 kB, 4.0 KiB) copied, 0.010677 s, 384 kB/s 1+0 records in 1+0 records out 4096 bytes (4.1 kB, 4.0 KiB) copied, 0.0226778 s, 181 kB/s PASS 41 (16s) == replay-single test 42: recovery after ost failure ===== 05:09:10 (1743498550) - open/close 335 (time 1743498566.87 total 10.03 last 33.41) - open/close 673 (time 1743498576.88 total 20.04 last 33.75) total: 800 open/close in 23.93 seconds: 33.43 ops/second UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210432 3712 2204672 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre - unlinked 0 (time 1743498594 ; total 0 ; last 0) total: 400 unlinks in 9 seconds: 44.444443 unlinks/second debug=-1 Failing ost1 on oleg631-server Stopping /mnt/lustre-ost1 (opts:) on oleg631-server 05:10:15 (1743498615) shut down facet: ost1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover ost1 to oleg631-server mount facets: ost1 Starting ost1: -o localrecov lustre-ost1/ost1 /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-OST0000 05:10:46 (1743498646) targets are mounted 05:10:46 (1743498646) facet_failover done wait for MDS to timeout and recover - unlinked 0 (time 1743498693 ; total 0 ; last 0) total: 400 unlinks in 9 seconds: 44.444443 unlinks/second Can't lstat /mnt/lustre/f42.replay-single-*: No such file or directory PASS 42 (163s) == replay-single test 43: mds osc import failure during recovery; don't LBUG ========================================================== 05:11:54 (1743498714) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210432 3584 2204800 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre fail_loc=0x80000204 Failing mds1 on oleg631-server Stopping /mnt/lustre-mds1 (opts:) on oleg631-server 05:12:09 (1743498729) shut down facet: mds1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover mds1 to oleg631-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 05:12:41 (1743498761) targets are mounted 05:12:41 (1743498761) facet_failover done oleg631-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 43 (79s) == replay-single test 44a: race in target handle connect ========================================================== 05:13:13 (1743498793) at_max=40 1 of 10 (1743498800) service : cur 5 worst 5 (at 1743498753, 48s ago) 4 0 0 0 fail_loc=0x80000701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210432 3456 2204928 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre 2 of 10 (1743498808) service : cur 5 worst 5 (at 1743498753, 56s ago) 5 0 0 0 fail_loc=0x80000701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210432 3456 2204928 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre 3 of 10 (1743498815) service : cur 5 worst 5 (at 1743498753, 63s ago) 5 0 0 0 fail_loc=0x80000701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210432 3456 2204928 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre 4 of 10 (1743498823) service : cur 5 worst 5 (at 1743498753, 71s ago) 5 0 0 0 fail_loc=0x80000701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210432 3456 2204928 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre 5 of 10 (1743498831) service : cur 5 worst 5 (at 1743498753, 79s ago) 5 0 0 0 fail_loc=0x80000701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210432 3456 2204928 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre 6 of 10 (1743498839) service : cur 6 worst 6 (at 1743498839, 1s ago) 6 0 0 0 fail_loc=0x80000701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210432 3456 2204928 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre 7 of 10 (1743498846) service : cur 6 worst 6 (at 1743498839, 8s ago) 6 0 0 0 fail_loc=0x80000701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210432 3456 2204928 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre 8 of 10 (1743498854) service : cur 6 worst 6 (at 1743498839, 16s ago) 6 0 0 0 fail_loc=0x80000701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210432 3456 2204928 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre 9 of 10 (1743498861) service : cur 6 worst 6 (at 1743498839, 23s ago) 6 0 0 0 fail_loc=0x80000701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210432 3456 2204928 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre 10 of 10 (1743498869) service : cur 6 worst 6 (at 1743498839, 31s ago) 6 0 0 0 fail_loc=0x80000701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210432 3456 2204928 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre fail_loc=0 at_max=600 PASS 44a (94s) == replay-single test 44b: race in target handle connect ========================================================== 05:14:48 (1743498888) 1 of 10 (1743498890) service : cur 6 worst 6 (at 1743498839, 52s ago) 6 0 0 0 fail_loc=0x80000704 error: recover: Connection timed out df: /mnt/lustre: Operation already in progress 2 of 10 (1743498913) service : cur 6 worst 6 (at 1743498839, 76s ago) 1 6 0 0 fail_loc=0x80000704 error: recover: Connection timed out df: /mnt/lustre: Operation already in progress 3 of 10 (1743498937) service : cur 40 worst 40 (at 1743498933, 5s ago) 40 6 0 0 fail_loc=0x80000704 error: recover: Connection timed out df: /mnt/lustre: Operation already in progress 4 of 10 (1743498960) service : cur 40 worst 40 (at 1743498933, 29s ago) 40 6 0 0 fail_loc=0x80000704 error: recover: Connection timed out df: /mnt/lustre: Operation already in progress 5 of 10 (1743498984) service : cur 40 worst 40 (at 1743498933, 53s ago) 40 6 0 0 fail_loc=0x80000704 error: recover: Connection timed out df: /mnt/lustre: Operation already in progress 6 of 10 (1743499008) service : cur 40 worst 40 (at 1743498933, 77s ago) 40 6 0 0 fail_loc=0x80000704 error: recover: Connection timed out df: /mnt/lustre: Operation already in progress 7 of 10 (1743499031) service : cur 40 worst 40 (at 1743498933, 100s ago) 40 6 0 0 fail_loc=0x80000704 error: recover: Connection timed out df: /mnt/lustre: Operation already in progress 8 of 10 (1743499055) service : cur 40 worst 40 (at 1743498933, 124s ago) 1 40 6 0 fail_loc=0x80000704 error: recover: Connection timed out df: /mnt/lustre: Operation already in progress 9 of 10 (1743499079) service : cur 40 worst 40 (at 1743498933, 148s ago) 40 40 6 0 fail_loc=0x80000704 error: recover: Connection timed out df: /mnt/lustre: Operation already in progress 10 of 10 (1743499103) service : cur 40 worst 40 (at 1743498933, 172s ago) 40 40 6 0 fail_loc=0x80000704 error: recover: Connection timed out df: /mnt/lustre: Operation already in progress PASS 44b (248s) == replay-single test 44c: race in target handle connect ========================================================== 05:18:56 (1743499136) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210432 3456 2204928 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre total: 100 create in 2.00 seconds: 49.91 ops/second fail_loc=0x80000712 Stopping /mnt/lustre-mds1 (opts:) on oleg631-server Failover mds1 to oleg631-server oleg631-server.virtnet Starting mds1: -o localrecov -o abort_recovery lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 unlink(/mnt/lustre/f44c.replay-single-0) error: No such file or directory total: 0 unlinks in 0 seconds: -nan unlinks/second Failing mds1 on oleg631-server Stopping /mnt/lustre-mds1 (opts:) on oleg631-server 05:19:41 (1743499181) shut down facet: mds1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover mds1 to oleg631-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 05:20:08 (1743499208) targets are mounted 05:20:08 (1743499208) facet_failover done oleg631-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec unlink(/mnt/lustre/f44c.replay-single-0) error: No such file or directory total: 0 unlinks in 0 seconds: -nan unlinks/second PASS 44c (90s) == replay-single test 45: Handle failed close ============ 05:20:26 (1743499226) multiop /mnt/lustre/f45.replay-single vO_c TMPPIPE=/tmp/multiop_open_wait_pipe.7367 /mnt/lustre/f45.replay-single has type file OK PASS 45 (11s) == replay-single test 46: Don't leak file handle after open resend (3325) ========================================================== 05:20:37 (1743499237) fail_loc=0x122 fail_loc=0 Failing mds1 on oleg631-server Stopping /mnt/lustre-mds1 (opts:) on oleg631-server 05:21:04 (1743499264) shut down facet: mds1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover mds1 to oleg631-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 05:21:39 (1743499299) targets are mounted 05:21:39 (1743499299) facet_failover done oleg631-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec lfs path2fid: cannot get fid for 'f46.replay-single': No such file or directory PASS 46 (84s) == replay-single test 47: MDS->OSC failure during precreate cleanup (2824) ========================================================== 05:22:01 (1743499321) total: 20 open/close in 0.49 seconds: 40.72 ops/second Failing ost1 on oleg631-server Stopping /mnt/lustre-ost1 (opts:) on oleg631-server 05:22:10 (1743499330) shut down facet: ost1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover ost1 to oleg631-server mount facets: ost1 Starting ost1: -o localrecov lustre-ost1/ost1 /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-OST0000 05:22:37 (1743499357) targets are mounted 05:22:37 (1743499357) facet_failover done oleg631-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec fail_loc=0x80000204 total: 20 open/close in 0.39 seconds: 50.97 ops/second - unlinked 0 (time 1743499432 ; total 0 ; last 0) total: 20 unlinks in 0 seconds: inf unlinks/second PASS 47 (119s) == replay-single test 48: MDS->OSC failure during precreate cleanup (2824) ========================================================== 05:24:00 (1743499440) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210432 3584 2204800 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre total: 20 open/close in 0.52 seconds: 38.58 ops/second Failing mds1 on oleg631-server Stopping /mnt/lustre-mds1 (opts:) on oleg631-server 05:24:13 (1743499453) shut down facet: mds1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover mds1 to oleg631-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 05:24:38 (1743499478) targets are mounted 05:24:38 (1743499478) facet_failover done fail_loc=0x80000216 total: 20 open/close in 0.49 seconds: 40.78 ops/second - unlinked 0 (time 1743499542 ; total 0 ; last 0) total: 40 unlinks in 1 seconds: 40.000000 unlinks/second PASS 48 (110s) == replay-single test 50: Double OSC recovery, don't LASSERT (3812) ========================================================== 05:25:51 (1743499551) PASS 50 (17s) == replay-single test 52: time out lock replay (3764) ==== 05:26:07 (1743499567) multiop /mnt/lustre/f52.replay-single vs_s TMPPIPE=/tmp/multiop_open_wait_pipe.7367 fail_loc=0x80000157 Failing mds1 on oleg631-server Stopping /mnt/lustre-mds1 (opts:) on oleg631-server 05:26:16 (1743499576) shut down facet: mds1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover mds1 to oleg631-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 05:26:47 (1743499607) targets are mounted 05:26:47 (1743499607) facet_failover done oleg631-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec fail_loc=0x0 fail_loc=0x0 PASS 52 (69s) == replay-single test 53a: |X| close request while two MDC requests in flight ========================================================== 05:27:17 (1743499637) fail_loc=0x80000115 fail_loc=0 Replay barrier on lustre-MDT0000 Failing mds1 on oleg631-server Stopping /mnt/lustre-mds1 (opts:) on oleg631-server 05:27:32 (1743499652) shut down facet: mds1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover mds1 to oleg631-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 05:28:04 (1743499684) targets are mounted 05:28:04 (1743499684) facet_failover done oleg631-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/d53a.replay-single-1/f has type file OK /mnt/lustre/d53a.replay-single-2/f has type file OK PASS 53a (65s) == replay-single test 53b: |X| open request while two MDC requests in flight ========================================================== 05:28:21 (1743499701) multiop /mnt/lustre/d53b.replay-single-1/f vO_c TMPPIPE=/tmp/multiop_open_wait_pipe.7367 fail_loc=0x80000107 fail_loc=0 Replay barrier on lustre-MDT0000 Failing mds1 on oleg631-server Stopping /mnt/lustre-mds1 (opts:) on oleg631-server 05:28:35 (1743499715) shut down facet: mds1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover mds1 to oleg631-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 05:29:01 (1743499741) targets are mounted 05:29:01 (1743499741) facet_failover done oleg631-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/d53b.replay-single-1/f has type file OK /mnt/lustre/d53b.replay-single-2/f has type file OK PASS 53b (61s) == replay-single test 53c: |X| open request and close request while two MDC requests in flight ========================================================== 05:29:22 (1743499762) fail_loc=0x80000107 fail_loc=0x80000115 Replay barrier on lustre-MDT0000 Failing mds1 on oleg631-server Stopping /mnt/lustre-mds1 (opts:) on oleg631-server 05:29:37 (1743499777) shut down facet: mds1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover mds1 to oleg631-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 05:30:12 (1743499812) targets are mounted 05:30:12 (1743499812) facet_failover done fail_loc=0 /mnt/lustre/d53c.replay-single-1/f has type file OK /mnt/lustre/d53c.replay-single-2/f has type file OK PASS 53c (60s) == replay-single test 53d: close reply while two MDC requests in flight ========================================================== 05:30:22 (1743499822) fail_loc=0x8000013b fail_loc=0 Failing mds1 on oleg631-server Stopping /mnt/lustre-mds1 (opts:) on oleg631-server 05:30:33 (1743499833) shut down facet: mds1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover mds1 to oleg631-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 05:31:09 (1743499869) targets are mounted 05:31:09 (1743499869) facet_failover done oleg631-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/d53d.replay-single-1/f has type file OK /mnt/lustre/d53d.replay-single-2/f has type file OK PASS 53d (65s) == replay-single test 53e: |X| open reply while two MDC requests in flight ========================================================== 05:31:28 (1743499888) fail_loc=0x119 fail_loc=0 Replay barrier on lustre-MDT0000 Failing mds1 on oleg631-server Stopping /mnt/lustre-mds1 (opts:) on oleg631-server 05:31:43 (1743499903) shut down facet: mds1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover mds1 to oleg631-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 05:32:15 (1743499935) targets are mounted 05:32:15 (1743499935) facet_failover done oleg631-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/d53e.replay-single-1/f has type file OK /mnt/lustre/d53e.replay-single-2/f has type file OK PASS 53e (67s) == replay-single test 53f: |X| open reply and close reply while two MDC requests in flight ========================================================== 05:32:35 (1743499955) fail_loc=0x119 fail_loc=0x8000013b Replay barrier on lustre-MDT0000 Failing mds1 on oleg631-server Stopping /mnt/lustre-mds1 (opts:) on oleg631-server 05:32:50 (1743499970) shut down facet: mds1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover mds1 to oleg631-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 05:33:21 (1743500001) targets are mounted 05:33:21 (1743500001) facet_failover done fail_loc=0 /mnt/lustre/d53f.replay-single-1/f has type file OK /mnt/lustre/d53f.replay-single-2/f has type file OK PASS 53f (56s) == replay-single test 53g: |X| drop open reply and close request while close and open are both in flight ========================================================== 05:33:31 (1743500011) fail_loc=0x119 fail_loc=0x80000115 fail_loc=0 Replay barrier on lustre-MDT0000 Failing mds1 on oleg631-server Stopping /mnt/lustre-mds1 (opts:) on oleg631-server 05:33:48 (1743500028) shut down facet: mds1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover mds1 to oleg631-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 05:34:22 (1743500062) targets are mounted 05:34:22 (1743500062) facet_failover done /mnt/lustre/d53g.replay-single-1/f has type file OK /mnt/lustre/d53g.replay-single-2/f has type file OK PASS 53g (60s) == replay-single test 53h: open request and close reply while two MDC requests in flight ========================================================== 05:34:31 (1743500071) fail_loc=0x80000107 fail_loc=0x8000013b Replay barrier on lustre-MDT0000 Failing mds1 on oleg631-server Stopping /mnt/lustre-mds1 (opts:) on oleg631-server 05:34:47 (1743500087) shut down facet: mds1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover mds1 to oleg631-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 05:35:23 (1743500123) targets are mounted 05:35:23 (1743500123) facet_failover done fail_loc=0 /mnt/lustre/d53h.replay-single-1/f has type file OK /mnt/lustre/d53h.replay-single-2/f has type file OK PASS 53h (62s) == replay-single test 55: let MDS_CHECK_RESENT return the original return code instead of 0 ========================================================== 05:35:33 (1743500133) fail_loc=0x8000012b fail_loc=0x0 touch: cannot touch '/mnt/lustre/f55.replay-single'rm: cannot remove '/mnt/lustre/f55.replay-single': No such file or directory : No such file or directory PASS 55 (27s) == replay-single test 56: don't replay a symlink open request (3440) ========================================================== 05:36:00 (1743500160) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210432 3584 2204800 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing mds1 on oleg631-server Stopping /mnt/lustre-mds1 (opts:) on oleg631-server 05:36:12 (1743500172) shut down facet: mds1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover mds1 to oleg631-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 05:36:47 (1743500207) targets are mounted 05:36:47 (1743500207) facet_failover done oleg631-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 56 (81s) == replay-single test 57: test recovery from llog for setattr op ========================================================== 05:37:21 (1743500241) fail_loc=0x8000012c UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210432 3584 2204800 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing mds1 on oleg631-server Stopping /mnt/lustre-mds1 (opts:) on oleg631-server 05:37:34 (1743500254) shut down facet: mds1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover mds1 to oleg631-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 05:38:08 (1743500288) targets are mounted 05:38:08 (1743500288) facet_failover done oleg631-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec affected facets: mds1 oleg631-server: oleg631-server.virtnet: executing _wait_recovery_complete *.lustre-MDT0000.recovery_status 1475 oleg631-server: *.lustre-MDT0000.recovery_status status: COMPLETE Waiting for orphan cleanup... osp.lustre-OST0000-osc-MDT0000.old_sync_processed osp.lustre-OST0001-osc-MDT0000.old_sync_processed wait 40 secs maximumly for oleg631-server mds-ost sync done. /mnt/lustre/f57.replay-single has type file OK fail_loc=0x0 PASS 57 (73s) == replay-single test 58a: test recovery from llog for setattr op (test llog_gen_rec) ========================================================== 05:38:34 (1743500314) fail_loc=0x8000012c - open/close 578 (time 1743500331.10 total 10.01 last 57.73) - open/close 1179 (time 1743500341.10 total 20.02 last 60.06) - open/close 1758 (time 1743500351.11 total 30.03 last 57.86) - open/close 2393 (time 1743500361.12 total 40.04 last 63.42) total: 2500 open/close in 41.88 seconds: 59.70 ops/second UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210432 4608 2203776 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing mds1 on oleg631-server Stopping /mnt/lustre-mds1 (opts:) on oleg631-server 05:39:36 (1743500376) shut down facet: mds1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover mds1 to oleg631-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 05:40:11 (1743500411) targets are mounted 05:40:11 (1743500411) facet_failover done oleg631-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec fail_loc=0x0 - unlinked 0 (time 1743500444 ; total 0 ; last 0) total: 2500 unlinks in 28 seconds: 89.285713 unlinks/second PASS 58a (168s) == replay-single test 58b: test replay of setxattr op ==== 05:41:22 (1743500482) Starting client: oleg631-client.virtnet: -o user_xattr,flock 192.168.206.131@tcp:/lustre /mnt/lustre2 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210432 3840 2204544 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing mds1 on oleg631-server Stopping /mnt/lustre-mds1 (opts:) on oleg631-server 05:41:35 (1743500495) shut down facet: mds1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover mds1 to oleg631-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 05:42:08 (1743500528) targets are mounted 05:42:09 (1743500529) facet_failover done oleg631-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Stopping client oleg631-client.virtnet /mnt/lustre2 (opts:) oleg631-client.virtnet: executing wait_import_state_mount FULL mgc.*.mgs_server_uuid mgc.*.mgs_server_uuid in FULL state after 0 sec PASS 58b (77s) == replay-single test 58c: resend/reconstruct setxattr op ========================================================== 05:42:39 (1743500559) Starting client: oleg631-client.virtnet: -o user_xattr,flock 192.168.206.131@tcp:/lustre /mnt/lustre2 fail_val=0 fail_loc=0x123 fail_loc=0 fail_loc=0x119 fail_loc=0 Stopping client oleg631-client.virtnet /mnt/lustre2 (opts:) PASS 58c (54s) SKIP: replay-single test_59 skipping ALWAYS excluded test 59 == replay-single test 60: test llog post recovery init vs llog unlink ========================================================== 05:43:36 (1743500616) total: 200 open/close in 3.38 seconds: 59.24 ops/second UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210432 3712 2204672 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre - unlinked 0 (time 1743500630 ; total 0 ; last 0) total: 100 unlinks in 2 seconds: 50.000000 unlinks/second Failing mds1 on oleg631-server Stopping /mnt/lustre-mds1 (opts:) on oleg631-server 05:43:57 (1743500637) shut down facet: mds1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover mds1 to oleg631-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 05:44:22 (1743500662) targets are mounted 05:44:22 (1743500662) facet_failover done oleg631-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec - unlinked 0 (time 1743500672 ; total 0 ; last 0) total: 100 unlinks in 2 seconds: 50.000000 unlinks/second PASS 60 (64s) == replay-single test 61a: test race llog recovery vs llog cleanup ========================================================== 05:44:40 (1743500680) - open/close 718 (time 1743500696.14 total 10.01 last 71.73) total: 800 open/close in 11.31 seconds: 70.76 ops/second UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210432 3968 2204416 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre - unlinked 0 (time 1743500706 ; total 0 ; last 0) total: 800 unlinks in 7 seconds: 114.285713 unlinks/second fail_val=0 fail_loc=0x80000221 Failing ost1 on oleg631-server Stopping /mnt/lustre-ost1 (opts:) on oleg631-server 05:45:22 (1743500722) shut down facet: ost1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover ost1 to oleg631-server mount facets: ost1 Starting ost1: -o localrecov lustre-ost1/ost1 /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-OST0000 05:45:47 (1743500747) targets are mounted 05:45:47 (1743500747) facet_failover done Failing ost1 on oleg631-server Stopping /mnt/lustre-ost1 (opts:) on oleg631-server 05:46:03 (1743500763) shut down facet: ost1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover ost1 to oleg631-server mount facets: ost1 Starting ost1: -o localrecov lustre-ost1/ost1 /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-OST0000 05:46:28 (1743500788) targets are mounted 05:46:28 (1743500788) facet_failover done oleg631-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec fail_val=0 fail_loc=0x0 Can't lstat /mnt/lustre/d61a.replay-single/f61a.replay-single-*: No such file or directory PASS 61a (159s) == replay-single test 61b: test race mds llog sync vs llog cleanup ========================================================== 05:47:19 (1743500839) fail_loc=0x8000013a Failing mds1 on oleg631-server Stopping /mnt/lustre-mds1 (opts:) on oleg631-server 05:47:26 (1743500846) shut down facet: mds1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover mds1 to oleg631-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 05:47:51 (1743500871) targets are mounted 05:47:51 (1743500871) facet_failover done Failing mds1 on oleg631-server Stopping /mnt/lustre-mds1 (opts:) on oleg631-server 05:48:07 (1743500887) shut down facet: mds1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover mds1 to oleg631-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 05:48:39 (1743500919) targets are mounted 05:48:39 (1743500919) facet_failover done oleg631-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec 1+0 records in 1+0 records out 4096 bytes (4.1 kB, 4.0 KiB) copied, 0.00877497 s, 467 kB/s PASS 61b (97s) == replay-single test 61c: test race mds llog sync vs llog cleanup ========================================================== 05:48:56 (1743500936) fail_val=0 fail_loc=0x80000222 Failing ost1 on oleg631-server Stopping /mnt/lustre-ost1 (opts:) on oleg631-server 05:49:16 (1743500956) shut down facet: ost1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover ost1 to oleg631-server mount facets: ost1 Starting ost1: -o localrecov lustre-ost1/ost1 /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-OST0000 05:49:43 (1743500983) targets are mounted 05:49:43 (1743500983) facet_failover done oleg631-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec fail_val=0 fail_loc=0x0 PASS 61c (66s) == replay-single test 61d: error in llog_setup should cleanup the llog context correctly ========================================================== 05:50:02 (1743501002) Stopping /mnt/lustre-mds1 (opts:) on oleg631-server fail_loc=0x80000605 Starting mgs: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: mount.lustre: mount lustre-mdt1/mdt1 at /mnt/lustre-mds1 failed: Operation not supported pdsh@oleg631-client: oleg631-server: ssh exited with exit code 95 Start of lustre-mdt1/mdt1 on mgs failed 95 fail_loc=0 Starting mgs: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 PASS 61d (47s) == replay-single test 62: don't mis-drop resent replay === 05:50:50 (1743501050) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210304 3584 2204672 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre total: 25 open/close in 0.46 seconds: 54.25 ops/second fail_loc=0x80000707 Failing mds1 on oleg631-server Stopping /mnt/lustre-mds1 (opts:) on oleg631-server 05:51:03 (1743501063) shut down facet: mds1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover mds1 to oleg631-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 05:51:28 (1743501088) targets are mounted 05:51:28 (1743501088) facet_failover done oleg631-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec fail_loc=0 - unlinked 0 (time 1743501115 ; total 0 ; last 0) total: 25 unlinks in 0 seconds: inf unlinks/second PASS 62 (73s) == replay-single test 65a: AT: verify early replies ====== 05:52:02 (1743501122) at_history=8 at_history=8 debug=other fail_val=11000 fail_loc=0x8000050a 00000100:00001000:3.0:1743501163.276608:0:120152:0:(client.c:536:ptlrpc_at_recv_early_reply()) @@@ Early reply #1, new deadline in 35s (25s) req@ffff899632ae0040 x1828185020334464/t0(0) o101->lustre-MDT0000-mdc-ffff899607b39000@192.168.206.131@tcp:12/10 lens 664/66320 e 1 to 0 dl 1743501198 ref 2 fl Rpc:PQr/200/ffffffff rc 0/-1 job:'createmany.0' uid:0 gid:0 portal 12 : cur 11 worst 40 (at 1743498960, 2210s ago) 36 5 5 5 portal 23 : cur 5 worst 10 (at 1743494274, 6896s ago) 5 10 5 5 portal 30 : cur 5 worst 5 (at 1743494109, 7061s ago) 5 0 0 0 portal 17 : cur 5 worst 5 (at 1743494434, 6736s ago) 5 5 5 0 portal 24 : cur 5 worst 5 (at 1743495139, 6031s ago) 5 0 0 0 portal 13 : cur 10 worst 10 (at 1743495922, 5248s ago) 10 5 0 0 portal 12 : cur 11 worst 40 (at 1743498960, 2220s ago) 36 5 5 5 portal 23 : cur 5 worst 10 (at 1743494274, 6906s ago) 5 10 5 5 portal 30 : cur 5 worst 5 (at 1743494109, 7071s ago) 5 0 0 0 portal 17 : cur 5 worst 5 (at 1743494434, 6746s ago) 5 5 5 0 portal 24 : cur 5 worst 5 (at 1743495139, 6041s ago) 5 0 0 0 portal 13 : cur 10 worst 10 (at 1743495922, 5258s ago) 10 5 0 0 PASS 65a (64s) == replay-single test 65b: AT: verify early replies on packed reply / bulk ========================================================== 05:53:06 (1743501186) at_history=8 at_history=8 debug=other trace fail_val=11 fail_loc=0x224 fail_loc=0 00000100:00001000:3.0:1743501226.760582:0:2400:0:(client.c:536:ptlrpc_at_recv_early_reply()) @@@ Early reply #1, new deadline in 35s (25s) req@ffff89963900f340 x1828185020353920/t0(0) o4->lustre-OST0000-osc-ffff899607b39000@192.168.206.131@tcp:6/4 lens 4584/448 e 1 to 0 dl 1743501261 ref 2 fl Rpc:Qr/200/ffffffff rc 0/-1 job:'multiop.0' uid:0 gid:0 portal 28 : cur 5 worst 5 (at 1743494083, 7152s ago) 5 5 5 5 portal 7 : cur 5 worst 5 (at 1743494086, 7149s ago) 5 5 5 5 portal 6 : cur 36 worst 36 (at 1743501231, 4s ago) 36 0 5 0 portal 17 : cur 5 worst 5 (at 1743495122, 6113s ago) 5 0 0 0 PASS 65b (55s) == replay-single test 66a: AT: verify MDT service time adjusts with no early replies ========================================================== 05:54:01 (1743501241) at_history=8 at_history=8 portal 12 : cur 5 worst 40 (at 1743498960, 2309s ago) 36 5 5 5 fail_val=5000 fail_loc=0x8000050a portal 12 : cur 5 worst 40 (at 1743498960, 2318s ago) 36 5 5 5 fail_val=10000 fail_loc=0x8000050a portal 12 : cur 36 worst 40 (at 1743498960, 2331s ago) 36 36 5 5 fail_loc=0 portal 12 : cur 5 worst 40 (at 1743498960, 2342s ago) 36 36 5 5 Current MDT timeout 5, worst 40 PASS 66a (69s) == replay-single test 66b: AT: verify net latency adjusts ========================================================== 05:55:10 (1743501310) at_history=8 at_history=8 fail_val=10 fail_loc=0x50c fail_loc=0 network timeout orig 5, cur 11, worst 11 PASS 66b (97s) == replay-single test 67a: AT: verify slow request processing doesn't induce reconnects ========================================================== 05:56:48 (1743501408) at_history=8 at_history=8 fail_val=400 fail_loc=0x50a fail_loc=0 0 osc reconnect attempts on gradual slow PASS 67a (84s) == replay-single test 67b: AT: verify instant slowdown doesn't induce reconnects ========================================================== 05:58:12 (1743501492) at_history=8 at_history=8 Creating to objid 5441 on ost lustre-OST0000... fail_val=20000 fail_loc=0x80000223 total: 17 open/close in 0.28 seconds: 61.40 ops/second Connected clients: oleg631-client.virtnet oleg631-client.virtnet service : cur 5 worst 5 (at 1743493333, 8193s ago) 1 1 0 0 phase 2 1 osc reconnect attempts on instant slow fail_loc=0x80000223 fail_loc=0 Connected clients: oleg631-client.virtnet oleg631-client.virtnet service : cur 5 worst 5 (at 1743493333, 8197s ago) 1 1 1 1 0 osc reconnect attempts on 2nd slow PASS 67b (44s) == replay-single test 68: AT: verify slowing locks ======= 05:58:56 (1743501536) at_history=8 at_history=8 /home/green/git/lustre-release/lustre/tests/replay-single.sh: line 1975: $ldlm_enqueue_min: ambiguous redirect fail_val=19 fail_loc=0x80000312 fail_val=25 fail_loc=0x80000312 fail_loc=0 /home/green/git/lustre-release/lustre/tests/replay-single.sh: line 1990: $ldlm_enqueue_min: ambiguous redirect PASS 68 (86s) Cleaning up AT ... == replay-single test 70a: check multi client t-f ======== 06:00:22 (1743501622) SKIP: replay-single test_70a Need two or more clients, have 1 SKIP 70a (3s) == replay-single test 70b: dbench 1mdts recovery; 1 clients ========================================================== 06:00:26 (1743501626) Starting client oleg631-client.virtnet: -o user_xattr,flock 192.168.206.131@tcp:/lustre /mnt/lustre Started clients oleg631-client.virtnet: 192.168.206.131@tcp:/lustre on /mnt/lustre type lustre (rw,checksum,flock,user_xattr,lruresize,lazystatfs,nouser_fid2path,verbose,encrypt,statfs_project) + MISSING_DBENCH_OK= + PATH=/opt/iozone/bin:/opt/iozone/bin:/home/green/git/lustre-release/lustre/tests/mpi:/home/green/git/lustre-release/lustre/tests/racer:/home/green/git/lustre-release/lustre/../lustre-iokit/sgpdd-survey:/home/green/git/lustre-release/lustre/tests:/home/green/git/lustre-release/lustre/utils/gss:/home/green/git/lustre-release/lustre/utils:/opt/iozone/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin::/home/green/git/lustre-release/lustre/utils:/home/green/git/lustre-release/lustre/tests:/sbin:/usr/sbin:/home/green/git/lustre-release/lustre/utils:/home/green/git/lustre-release/lustre/tests/: + DBENCH_LIB= + TESTSUITE=replay-single + TESTNAME=test_70b + MOUNT=/mnt/lustre ++ hostname + DIR=/mnt/lustre/d70b.replay-single/oleg631-client.virtnet + LCTL=/home/green/git/lustre-release/lustre/utils/lctl + rundbench 1 -t 300 dbench: no process found dbench: no process found Started rundbench load pid=125726 ... UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210304 3840 2204416 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 32768 3689472 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3729408 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 35840 7418880 1% /mnt/lustre test_70b fail mds1 1 times Failing mds1 on oleg631-server Stopping /mnt/lustre-mds1 (opts:) on oleg631-server 06:00:46 (1743501646) shut down facet: mds1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover mds1 to oleg631-server mount facets: mds1 oleg631-client.virtnet: looking for dbench program oleg631-client.virtnet: /usr/bin/dbench oleg631-client.virtnet: creating output directory /mnt/lustre/d70b.replay-single/oleg631-client.virtnet oleg631-client.virtnet: mkdir: created directory '/mnt/lustre/d70b.replay-single' oleg631-client.virtnet: mkdir: created directory '/mnt/lustre/d70b.replay-single/oleg631-client.virtnet' oleg631-client.virtnet: found dbench client file /usr/share/dbench/client.txt oleg631-client.virtnet: '/usr/share/dbench/client.txt' -> 'client.txt' oleg631-client.virtnet: running 'dbench 1 -t 300' on /mnt/lustre/d70b.replay-single/oleg631-client.virtnet at Tue Apr 1 06:00:29 EDT 2025 oleg631-client.virtnet: waiting for dbench pid 125756 oleg631-client.virtnet: dbench version 4.00 - Copyright Andrew Tridgell 1999-2004 oleg631-client.virtnet: oleg631-client.virtnet: Running for 300 seconds with load 'client.txt' and minimum warmup 60 secs oleg631-client.virtnet: failed to create barrier semaphore oleg631-client.virtnet: 0 of 1 processes prepared for launch 0 sec oleg631-client.virtnet: 1 of 1 processes prepared for launch 0 sec oleg631-client.virtnet: releasing clients oleg631-client.virtnet: 1 176 7.06 MB/sec warmup 1 sec latency 91.357 ms oleg631-client.virtnet: 1 306 5.61 MB/sec warmup 2 sec latency 48.090 ms oleg631-client.virtnet: 1 437 5.13 MB/sec warmup 3 sec latency 92.607 ms oleg631-client.virtnet: 1 639 5.26 MB/sec warmup 4 sec latency 70.307 ms oleg631-client.virtnet: 1 721 4.26 MB/sec warmup 5 sec latency 183.179 ms oleg631-client.virtnet: 1 786 3.57 MB/sec warmup 6 sec latency 159.982 ms oleg631-client.virtnet: 1 864 3.08 MB/sec warmup 7 sec latency 236.849 ms oleg631-client.virtnet: 1 957 2.73 MB/sec warmup 8 sec latency 152.762 ms oleg631-client.virtnet: 1 1128 2.55 MB/sec warmup 9 sec latency 163.178 ms oleg631-client.virtnet: 1 1410 2.60 MB/sec warmup 10 sec latency 24.335 ms oleg631-client.virtnet: 1 1560 2.37 MB/sec warmup 11 sec latency 233.276 ms oleg631-client.virtnet: 1 1689 2.18 MB/sec warmup 12 sec latency 127.724 ms oleg631-client.virtnet: 1 1868 2.03 MB/sec warmup 13 sec latency 36.845 ms oleg631-client.virtnet: 1 1984 1.89 MB/sec warmup 14 sec latency 172.754 ms oleg631-client.virtnet: 1 2145 1.78 MB/sec warmup 15 sec latency 336.995 ms oleg631-client.virtnet: 1 2145 1.67 MB/sec warmup 16 sec latency 1337.264 ms oleg631-client.virtnet: 1 2145 1.57 MB/sec warmup 17 sec latency 2337.601 ms oleg631-client.virtnet: 1 2145 1.48 MB/sec warmup 18 sec latency 3337.971 ms oleg631-client.virtnet: 1 2145 1.40 MB/sec warmup 19 sec latency 4338.156 ms oleg631-client.virtnet: 1 2145 1.33 MB/sec warmup 20 sec latency 5338.340 ms oleg631-client.virtnet: 1 2145 1.27 MB/sec warmup 21 sec latency 6338.681 ms oleg631-client.virtnet: 1 2145 1.21 MB/sec warmup 22 sec latency 7338.865 ms oleg631-client.virtnet: 1 2145 1.16 MB/sec warmup 23 sec latency 8339.126 ms oleg631-client.virtnet: 1 2145 1.11 MB/sec warmup 24 sec latency 9339.326 ms oleg631-client.virtnet: 1 2145 1.07 MB/sec warmup 25 sec latency 10339.641 ms oleg631-client.virtnet: 1 2145 1.03 MB/sec warmup 26 sec latency 11339.827 ms oleg631-client.virtnet: 1 2145 0.99 MB/sec warmup 27 sec latency 12340.115 ms oleg631-client.virtnet: 1 2145 0.95 MB/sec warmup 28 sec latency 13340.357 ms oleg631-client.virtnet: 1 2145 0.92 MB/sec warmup 29 sec latency 14340.673 ms oleg631-client.virtnet: 1 2145 0.89 MB/sec warmup 30 sec latency 15341.014 ms oleg631-client.virtnet: 1 2145 0.86 MB/sec warmup 31 sec latency 16341.212 ms oleg631-client.virtnet: 1 2145 0.83 MB/sec warmup 32 sec latency 17341.532Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 06:01:09 (1743501669) targets are mounted 06:01:09 (1743501669) facet_failover done oleg631-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210304 3840 2204416 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 40960 3714048 2% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 13312 3748864 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 54272 7462912 1% /mnt/lustre test_70b fail mds1 2 times Failing mds1 on oleg631-server Stopping /mnt/lustre-mds1 (opts:) on oleg631-server 06:01:45 (1743501705) shut down facet: mds1 facet_host: oleg631-server facet_failover_host: oleg631-server ms oleg631-client.virtnet: 1 2145 0.81 MB/sec warmup 33 sec latency 18341.875 ms oleg631-client.virtnet: 1 2145 0.78 MB/sec warmup 34 sec latency 19342.181 ms oleg631-client.virtnet: 1 2145 0.76 MB/sec warmup 35 sec latency 20342.455 ms oleg631-client.virtnet: 1 2145 0.74 MB/sec warmup 36 sec latency 21342.717 ms oleg631-client.virtnet: 1 2145 0.72 MB/sec warmup 37 sec latency 22343.014 ms oleg631-client.virtnet: 1 2145 0.70 MB/sec warmup 38 sec latency 23343.261 ms oleg631-client.virtnet: 1 2145 0.68 MB/sec warmup 39 sec latency 24343.523 ms oleg631-client.virtnet: 1 2145 0.67 MB/sec warmup 40 sec latency 25343.748 ms oleg631-client.virtnet: 1 2145 0.65 MB/sec warmup 41 sec latency 26344.095 ms oleg631-client.virtnet: 1 2145 0.64 MB/sec warmup 42 sec latency 27344.348 ms oleg631-client.virtnet: 1 2145 0.62 MB/sec warmup 43 sec latency 28344.536 ms oleg631-client.virtnet: 1 2145 0.61 MB/sec warmup 44 sec latency 29344.794 ms oleg631-client.virtnet: 1 2145 0.59 MB/sec warmup 45 sec latency 30345.090 ms oleg631-client.virtnet: 1 2145 0.58 MB/sec warmup 46 sec latency 31345.321 ms oleg631-client.virtnet: 1 2145 0.57 MB/sec warmup 47 sec latency 32345.555 ms oleg631-client.virtnet: 1 2145 0.56 MB/sec warmup 48 sec latency 33345.801 ms oleg631-client.virtnet: 1 2145 0.54 MB/sec warmup 49 sec latency 34346.069 ms oleg631-client.virtnet: 1 2145 0.53 MB/sec warmup 50 sec latency 35346.340 ms oleg631-client.virtnet: 1 2145 0.52 MB/sec warmup 51 sec latency 36347.851 ms oleg631-client.virtnet: 1 2191 0.51 MB/sec warmup 52 sec latency 36781.364 ms oleg631-client.virtnet: 1 2376 0.53 MB/sec warmup 53 sec latency 212.820 ms oleg631-client.virtnet: 1 2477 0.53 MB/sec warmup 54 sec latency 271.767 ms oleg631-client.virtnet: 1 2759 0.58 MB/sec warmup 55 sec latency 204.813 ms oleg631-client.virtnet: 1 2987 0.60 MB/sec warmup 56 sec latency 128.902 ms oleg631-client.virtnet: 1 3212 0.60 MB/sec warmup 57 sec latency 246.556 ms oleg631-client.virtnet: 1 3491 0.66 MB/sec warmup 58 sec latency 92.799 ms oleg631-client.virtnet: 1 3696 0.66 MB/sec warmup 59 sec latency 24.449 ms oleg631-client.virtnet: 1 4010 0.30 MB/sec execute 1 sec latency 44.897 ms oleg631-client.virtnet: 1 4147 0.16 MB/sec execute 2 sec latency 48.819 ms oleg631-client.virtnet: 1 4250 0.18 MB/sec execute 3 sec latency 215.650 ms oleg631-client.virtnet: 1 4333 0.18 MB/sec execute 4 sec latency 168.112 ms oleg631-client.virtnet: 1 4379 0.17 MB/sec execute 5 sec latency 219.541 ms oleg631-client.virtnet: 1 4483 0.19 MB/sec execute 6 sec latency 193.986 ms oleg631-client.virtnet: 1 4552 0.18 MB/sec execute 7 sec latency 256.393 ms oleg631-client.virtnet: 1 4739 0.30 MB/sec execute 8 sec latency 69.753 ms oleg631-client.virtnet: 1 4986 0.60 MB/sec execute 9 sec latency 42.376 ms oleg631-client.virtnet: 1 5123 0.55 MB/sec execute 10 sec latency 114.225 ms oleg631-client.virtnet: 1 5208 0.50 MB/sec execute 11 sec latency 200.477 ms oleg631-client.virtnet: 1 5344 0.47 MB/sec execute 12 sec latency 55.335 ms oleg631-client.virtnet: 1 5453 0.44 MB/sec execute 13 sec latency 165.351 ms oleg631-client.virtnet: 1 5539 0.41 MB/sec execute 14 sec latency 731.088 ms oleg631-client.virtnet: 1 5539 0.39 MB/sec execute 15 sec latency 1731.454 ms oleg631-client.virtnet: 1 5539 0.36 MB/sec execute 16 sec latency 2731.740 ms oleg631-client.virtnet: 1 5539 0.34 MB/sec execute 17 sec latency 3732.016 ms oleg631-client.virtnet: 1 5Failover mds1 to oleg631-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 06:02:08 (1743501728) targets are mounted 06:02:08 (1743501728) facet_failover done oleg631-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210304 3840 2204416 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 40960 3707904 2% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 14336 3741696 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 55296 7449600 1% /mnt/lustre test_70b fail mds1 3 times Failing mds1 on oleg631-server Stopping /mnt/lustre-mds1 (opts:) on oleg631-server 539 0.32 MB/sec execute 18 sec latency 4732.558 ms oleg631-client.virtnet: 1 5539 0.31 MB/sec execute 19 sec latency 5732.875 ms oleg631-client.virtnet: 1 5539 0.29 MB/sec execute 20 sec latency 6733.152 ms oleg631-client.virtnet: 1 5539 0.28 MB/sec execute 21 sec latency 7733.436 ms oleg631-client.virtnet: 1 5539 0.26 MB/sec execute 22 sec latency 8733.685 ms oleg631-client.virtnet: 1 5539 0.25 MB/sec execute 23 sec latency 9734.064 ms oleg631-client.virtnet: 1 5539 0.24 MB/sec execute 24 sec latency 10734.606 ms oleg631-client.virtnet: 1 5539 0.23 MB/sec execute 25 sec latency 11734.845 ms oleg631-client.virtnet: 1 5539 0.22 MB/sec execute 26 sec latency 12735.064 ms oleg631-client.virtnet: 1 5539 0.21 MB/sec execute 27 sec latency 13735.333 ms oleg631-client.virtnet: 1 5539 0.21 MB/sec execute 28 sec latency 14735.954 ms oleg631-client.virtnet: 1 5539 0.20 MB/sec execute 29 sec latency 15736.970 ms oleg631-client.virtnet: 1 5539 0.19 MB/sec execute 30 sec latency 16737.231 ms oleg631-client.virtnet: 1 5539 0.19 MB/sec execute 31 sec latency 17737.576 ms oleg631-client.virtnet: 1 5539 0.18 MB/sec execute 32 sec latency 18738.126 ms oleg631-client.virtnet: 1 5539 0.18 MB/sec execute 33 sec latency 19738.474 ms oleg631-client.virtnet: 1 5547 0.17 MB/sec execute 34 sec latency 20576.399 ms oleg631-client.virtnet: 1 5718 0.17 MB/sec execute 35 sec latency 75.943 ms oleg631-client.virtnet: 1 5907 0.20 MB/sec execute 36 sec latency 207.585 ms oleg631-client.virtnet: 1 5989 0.20 MB/sec execute 37 sec latency 178.967 ms oleg631-client.virtnet: 1 6160 0.22 MB/sec execute 38 sec latency 198.581 ms oleg631-client.virtnet: 1 6490 0.33 MB/sec execute 39 sec latency 192.480 ms oleg631-client.virtnet: 1 6779 0.34 MB/sec execute 40 sec latency 222.114 ms oleg631-client.virtnet: 1 7066 0.43 MB/sec execute 41 sec latency 118.597 ms oleg631-client.virtnet: 1 7260 0.43 MB/sec execute 42 sec latency 25.808 ms oleg631-client.virtnet: 1 7422 0.44 MB/sec execute 43 sec latency 34.326 ms oleg631-client.virtnet: 1 7554 0.44 MB/sec execute 44 sec latency 37.183 ms oleg631-client.virtnet: 1 7669 0.43 MB/sec execute 45 sec latency 56.316 ms oleg631-client.virtnet: 1 7785 0.43 MB/sec execute 46 sec latency 241.776 ms oleg631-client.virtnet: 1 7846 0.42 MB/sec execute 47 sec latency 163.371 ms oleg631-client.virtnet: 1 7960 0.42 MB/sec execute 48 sec latency 149.895 ms oleg631-client.virtnet: 1 8047 0.41 MB/sec execute 49 sec latency 177.725 ms oleg631-client.virtnet: 1 8244 0.43 MB/sec execute 50 sec latency 114.005 ms oleg631-client.virtnet: 1 8514 0.48 MB/sec execute 51 sec latency 61.396 ms oleg631-client.virtnet: 1 8674 0.47 MB/sec execute 52 sec latency 161.350 ms oleg631-client.virtnet: 1 8803 0.47 MB/sec execute 53 sec latency 206.396 ms oleg631-client.virtnet: 1 8936 0.46 MB/sec execute 54 sec latency 36.546 ms oleg631-client.virtnet: 1 9111 0.45 MB/sec execute 55 sec latency 133.685 ms oleg631-client.virtnet: 1 9364 0.45 MB/sec execute 56 sec latency 33.836 ms oleg631-client.virtnet: 1 9505 0.46 MB/sec execute 57 sec latency 117.560 ms oleg631-client.virtnet: 1 9593 0.47 MB/sec execute 58 sec latency 187.960 ms oleg631-client.virtnet: 1 9979 0.52 MB/sec execute 59 sec latency 154.301 ms oleg631-client.virtnet: 1 10106 0.53 MB/sec execute 60 sec latency 131.691 ms oleg631-client.virtnet: 1 10106 0.52 MB/sec execute 61 sec latency 1075.060 ms oleg631-client.virtnet: 1 10106 0.52 MB/sec execut06:02:31 (1743501751) shut down facet: mds1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover mds1 to oleg631-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 06:02:55 (1743501775) targets are mounted 06:02:55 (1743501775) facet_failover done oleg631-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210304 3840 2204416 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 40960 3710976 2% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 13312 3745792 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 54272 7456768 1% /mnt/lustre test_70b fail mds1 4 times Failing mds1 on oleg631-server e 62 sec latency 2075.298 ms oleg631-client.virtnet: 1 10106 0.51 MB/sec execute 63 sec latency 3075.541 ms oleg631-client.virtnet: 1 10106 0.50 MB/sec execute 64 sec latency 4075.765 ms oleg631-client.virtnet: 1 10106 0.49 MB/sec execute 65 sec latency 5075.975 ms oleg631-client.virtnet: 1 10106 0.48 MB/sec execute 66 sec latency 6076.248 ms oleg631-client.virtnet: 1 10106 0.48 MB/sec execute 67 sec latency 7076.454 ms oleg631-client.virtnet: 1 10106 0.47 MB/sec execute 68 sec latency 8076.753 ms oleg631-client.virtnet: 1 10106 0.46 MB/sec execute 69 sec latency 9077.019 ms oleg631-client.virtnet: 1 10106 0.46 MB/sec execute 70 sec latency 10077.331 ms oleg631-client.virtnet: 1 10106 0.45 MB/sec execute 71 sec latency 11077.575 ms oleg631-client.virtnet: 1 10106 0.44 MB/sec execute 72 sec latency 12077.850 ms oleg631-client.virtnet: 1 10106 0.44 MB/sec execute 73 sec latency 13078.067 ms oleg631-client.virtnet: 1 10106 0.43 MB/sec execute 74 sec latency 14078.336 ms oleg631-client.virtnet: 1 10106 0.43 MB/sec execute 75 sec latency 15078.606 ms oleg631-client.virtnet: 1 10106 0.42 MB/sec execute 76 sec latency 16078.904 ms oleg631-client.virtnet: 1 10106 0.42 MB/sec execute 77 sec latency 17079.152 ms oleg631-client.virtnet: 1 10106 0.41 MB/sec execute 78 sec latency 18079.422 ms oleg631-client.virtnet: 1 10106 0.41 MB/sec execute 79 sec latency 19079.710 ms oleg631-client.virtnet: 1 10106 0.40 MB/sec execute 80 sec latency 20079.984 ms oleg631-client.virtnet: 1 10296 0.40 MB/sec execute 81 sec latency 20474.311 ms oleg631-client.virtnet: 1 10471 0.41 MB/sec execute 82 sec latency 255.004 ms oleg631-client.virtnet: 1 10732 0.45 MB/sec execute 83 sec latency 28.996 ms oleg631-client.virtnet: 1 10882 0.46 MB/sec execute 84 sec latency 56.007 ms oleg631-client.virtnet: 1 11038 0.45 MB/sec execute 85 sec latency 28.507 ms oleg631-client.virtnet: 1 11217 0.45 MB/sec execute 86 sec latency 62.504 ms oleg631-client.virtnet: 1 11332 0.45 MB/sec execute 87 sec latency 184.342 ms oleg631-client.virtnet: 1 11393 0.44 MB/sec execute 88 sec latency 158.852 ms oleg631-client.virtnet: 1 11463 0.44 MB/sec execute 89 sec latency 202.814 ms oleg631-client.virtnet: 1 11575 0.44 MB/sec execute 90 sec latency 104.200 ms oleg631-client.virtnet: 1 11719 0.44 MB/sec execute 91 sec latency 137.393 ms oleg631-client.virtnet: 1 11880 0.44 MB/sec execute 92 sec latency 33.005 ms oleg631-client.virtnet: 1 12084 0.47 MB/sec execute 93 sec latency 42.933 ms oleg631-client.virtnet: 1 12213 0.46 MB/sec execute 94 sec latency 240.652 ms oleg631-client.virtnet: 1 12361 0.46 MB/sec execute 95 sec latency 154.434 ms oleg631-client.virtnet: 1 12496 0.46 MB/sec execute 96 sec latency 65.148 ms oleg631-client.virtnet: 1 12682 0.45 MB/sec execute 97 sec latency 146.181 ms oleg631-client.virtnet: 1 12887 0.45 MB/sec execute 98 sec latency 30.824 ms oleg631-client.virtnet: 1 13030 0.46 MB/sec execute 99 sec latency 296.298 ms oleg631-client.virtnet: 1 13117 0.46 MB/sec execute 100 sec latency 230.361 ms oleg631-client.virtnet: 1 13260 0.46 MB/sec execute 101 sec latency 181.186 ms oleg631-client.virtnet: 1 13581 0.50 MB/sec execute 102 sec latency 183.619 ms oleg631-client.virtnet: 1 13926 0.51 MB/sec execute 103 sec latency 105.429 ms oleg631-client.virtnet: 1 14179 0.54 MB/sec execute 104 sec latency 72.282 ms oleg631-client.virtnet: 1 14319 0.54 MB/sec execute 105 sec latency 34.931 ms oleg631-client.virtnet: 1 14448 0.54 MB/sec execute 106 sec latency 57.373 Stopping /mnt/lustre-mds1 (opts:) on oleg631-server 06:03:19 (1743501799) shut down facet: mds1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover mds1 to oleg631-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 06:03:51 (1743501831) targets are mounted 06:03:51 (1743501831) facet_failover done oleg631-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec ms oleg631-client.virtnet: 1 14586 0.54 MB/sec execute 107 sec latency 39.494 ms oleg631-client.virtnet: 1 14622 0.53 MB/sec execute 108 sec latency 702.280 ms oleg631-client.virtnet: 1 14622 0.53 MB/sec execute 109 sec latency 1702.514 ms oleg631-client.virtnet: 1 14622 0.52 MB/sec execute 110 sec latency 2702.824 ms oleg631-client.virtnet: 1 14622 0.52 MB/sec execute 111 sec latency 3703.052 ms oleg631-client.virtnet: 1 14622 0.51 MB/sec execute 112 sec latency 4703.343 ms oleg631-client.virtnet: 1 14622 0.51 MB/sec execute 113 sec latency 5703.579 ms oleg631-client.virtnet: 1 14622 0.51 MB/sec execute 114 sec latency 6703.893 ms oleg631-client.virtnet: 1 14622 0.50 MB/sec execute 115 sec latency 7704.186 ms oleg631-client.virtnet: 1 14622 0.50 MB/sec execute 116 sec latency 8704.515 ms oleg631-client.virtnet: 1 14622 0.49 MB/sec execute 117 sec latency 9704.851 ms oleg631-client.virtnet: 1 14622 0.49 MB/sec execute 118 sec latency 10705.262 ms oleg631-client.virtnet: 1 14622 0.48 MB/sec execute 119 sec latency 11705.552 ms oleg631-client.virtnet: 1 14622 0.48 MB/sec execute 120 sec latency 12705.754 ms oleg631-client.virtnet: 1 14622 0.48 MB/sec execute 121 sec latency 13706.032 ms oleg631-client.virtnet: 1 14622 0.47 MB/sec execute 122 sec latency 14706.309 ms oleg631-client.virtnet: 1 14622 0.47 MB/sec execute 123 sec latency 15706.536 ms oleg631-client.virtnet: 1 14622 0.46 MB/sec execute 124 sec latency 16706.805 ms oleg631-client.virtnet: 1 14622 0.46 MB/sec execute 125 sec latency 17707.065 ms oleg631-client.virtnet: 1 14622 0.46 MB/sec execute 126 sec latency 18707.324 ms oleg631-client.virtnet: 1 14622 0.45 MB/sec execute 127 sec latency 19707.615 ms oleg631-client.virtnet: 1 14622 0.45 MB/sec execute 128 sec latency 20707.870 ms oleg631-client.virtnet: 1 14622 0.45 MB/sec execute 129 sec latency 21708.126 ms oleg631-client.virtnet: 1 14622 0.44 MB/sec execute 130 sec latency 22708.403 ms oleg631-client.virtnet: 1 14622 0.44 MB/sec execute 131 sec latency 23708.650 ms oleg631-client.virtnet: 1 14622 0.44 MB/sec execute 132 sec latency 24708.874 ms oleg631-client.virtnet: 1 14622 0.43 MB/sec execute 133 sec latency 25709.136 ms oleg631-client.virtnet: 1 14622 0.43 MB/sec execute 134 sec latency 26709.552 ms oleg631-client.virtnet: 1 14622 0.43 MB/sec execute 135 sec latency 27709.828 ms oleg631-client.virtnet: 1 14622 0.42 MB/sec execute 136 sec latency 28710.117 ms oleg631-client.virtnet: 1 14622 0.42 MB/sec execute 137 sec latency 29710.313 ms oleg631-client.virtnet: 1 14622 0.42 MB/sec execute 138 sec latency 30711.324 ms oleg631-client.virtnet: 1 14622 0.41 MB/sec execute 139 sec latency 31711.630 ms oleg631-client.virtnet: 1 14622 0.41 MB/sec execute 140 sec latency 32711.898 ms oleg631-client.virtnet: 1 14622 0.41 MB/sec execute 141 sec latency 33712.169 ms oleg631-client.virtnet: 1 14622 0.41 MB/sec execute 142 sec latency 34712.528 ms oleg631-client.virtnet: 1 14622 0.40 MB/sec execute 143 sec latency 35712.808 ms oleg631-client.virtnet: 1 14622 0.40 MB/sec execute 144 sec latency 36712.993 ms oleg631-client.virtnet: 1 14691 0.40 MB/sec execute 145 sec latency 37029.750 ms oleg631-client.virtnet: 1 14802 0.39 MB/sec execute 146 sec latency 55.663 ms oleg631-client.virtnet: 1 14879 0.39 MB/sec execute 147 sec latency 365.585 ms oleg631-client.virtnet: 1 14954 0.39 MB/sec execute 148 sec latency 122.380 ms oleg631-client.virtnet: 1 15005 0.39 MB/sec execute 149 sec latency 244.395 ms oleg631-client.virtnet: 1 15098 0.39 MB/sec execute 150 sec lUUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210304 3840 2204416 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 40960 3712000 2% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 13312 3745792 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 54272 7457792 1% /mnt/lustre test_70b fail mds1 5 times Failing mds1 on oleg631-server Stopping /mnt/lustre-mds1 (opts:) on oleg631-server 06:04:16 (1743501856) shut down facet: mds1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover mds1 to oleg631-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 06:04:39 (1743501879) targets are mounted 06:04:39 (1743501879) facet_failover done oleg631-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid atency 134.907 ms oleg631-client.virtnet: 1 15185 0.39 MB/sec execute 151 sec latency 136.613 ms oleg631-client.virtnet: 1 15400 0.39 MB/sec execute 152 sec latency 180.877 ms oleg631-client.virtnet: 1 15624 0.41 MB/sec execute 153 sec latency 47.463 ms oleg631-client.virtnet: 1 15781 0.41 MB/sec execute 154 sec latency 185.281 ms oleg631-client.virtnet: 1 15892 0.41 MB/sec execute 155 sec latency 219.306 ms oleg631-client.virtnet: 1 16026 0.40 MB/sec execute 156 sec latency 42.815 ms oleg631-client.virtnet: 1 16175 0.40 MB/sec execute 157 sec latency 187.658 ms oleg631-client.virtnet: 1 16348 0.40 MB/sec execute 158 sec latency 28.155 ms oleg631-client.virtnet: 1 16541 0.41 MB/sec execute 159 sec latency 153.702 ms oleg631-client.virtnet: 1 16683 0.41 MB/sec execute 160 sec latency 185.374 ms oleg631-client.virtnet: 1 16963 0.43 MB/sec execute 161 sec latency 120.789 ms oleg631-client.virtnet: 1 17174 0.43 MB/sec execute 162 sec latency 154.673 ms oleg631-client.virtnet: 1 17467 0.44 MB/sec execute 163 sec latency 148.156 ms oleg631-client.virtnet: 1 17727 0.46 MB/sec execute 164 sec latency 85.102 ms oleg631-client.virtnet: 1 17727 0.45 MB/sec execute 165 sec latency 1052.309 ms oleg631-client.virtnet: 1 17727 0.45 MB/sec execute 166 sec latency 2052.996 ms oleg631-client.virtnet: 1 17727 0.45 MB/sec execute 167 sec latency 3053.251 ms oleg631-client.virtnet: 1 17727 0.45 MB/sec execute 168 sec latency 4053.940 ms oleg631-client.virtnet: 1 17727 0.44 MB/sec execute 169 sec latency 5054.382 ms oleg631-client.virtnet: 1 17727 0.44 MB/sec execute 170 sec latency 6054.715 ms oleg631-client.virtnet: 1 17727 0.44 MB/sec execute 171 sec latency 7054.989 ms oleg631-client.virtnet: 1 17727 0.44 MB/sec execute 172 sec latency 8055.269 ms oleg631-client.virtnet: 1 17727 0.43 MB/sec execute 173 sec latency 9055.517 ms oleg631-client.virtnet: 1 17727 0.43 MB/sec execute 174 sec latency 10055.743 ms oleg631-client.virtnet: 1 17727 0.43 MB/sec execute 175 sec latency 11056.001 ms oleg631-client.virtnet: 1 17727 0.43 MB/sec execute 176 sec latency 12056.311 ms oleg631-client.virtnet: 1 17727 0.42 MB/sec execute 177 sec latency 13056.519 ms oleg631-client.virtnet: 1 17727 0.42 MB/sec execute 178 sec latency 14056.983 ms oleg631-client.virtnet: 1 17727 0.42 MB/sec execute 179 sec latency 15057.329 ms oleg631-client.virtnet: 1 17727 0.42 MB/sec execute 180 sec latency 16057.713 ms oleg631-client.virtnet: 1 17727 0.41 MB/sec execute 181 sec latency 17057.951 ms oleg631-client.virtnet: 1 17727 0.41 MB/sec execute 182 sec latency 18058.314 ms oleg631-client.virtnet: 1 17727 0.41 MB/sec execute 183 sec latency 19058.837 ms oleg631-client.virtnet: 1 17727 0.41 MB/sec execute 184 sec latency 20059.723 ms oleg631-client.virtnet: 1 17732 0.41 MB/sec execute 185 sec latency 20888.377 ms oleg631-client.virtnet: 1 17898 0.41 MB/sec execute 186 sec latency 28.119 ms oleg631-client.virtnet: 1 18043 0.41 MB/sec execute 187 sec latency 31.774 ms oleg631-client.virtnet: 1 18170 0.41 MB/sec execute 188 sec latency 42.329 ms oleg631-client.virtnet: 1 18296 0.41 MB/sec execute 189 sec latency 63.188 ms oleg631-client.virtnet: 1 18420 0.41 MB/sec execute 190 sec latency 70.924 ms oleg631-client.virtnet: 1 18476 0.40 MB/sec execute 191 sec latency 298.201 ms oleg631-client.virtnet: 1 18546 0.40 MB/sec execute 192 sec latency 160.468 ms oleg631-client.virtnet: 1 18630 0.40 MB/sec execute 193 sec latency 199.764 ms oleg631-client.virtnet: 1 18733 0.40 MB/sec execute 194 sec latency 127.408 ms oleg63mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210304 3968 2204288 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 40960 3706880 2% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 13312 3745792 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 54272 7452672 1% /mnt/lustre test_70b fail mds1 6 times Failing mds1 on oleg631-server Stopping /mnt/lustre-mds1 (opts:) on oleg631-server 06:05:03 (1743501903) shut down facet: mds1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover mds1 to oleg631-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 1-client.virtnet: 1 18883 0.40 MB/sec execute 195 sec latency 138.597 ms oleg631-client.virtnet: 1 19011 0.40 MB/sec execute 196 sec latency 63.553 ms oleg631-client.virtnet: 1 19232 0.42 MB/sec execute 197 sec latency 40.856 ms oleg631-client.virtnet: 1 19338 0.41 MB/sec execute 198 sec latency 145.933 ms oleg631-client.virtnet: 1 19486 0.41 MB/sec execute 199 sec latency 135.525 ms oleg631-client.virtnet: 1 19637 0.41 MB/sec execute 200 sec latency 124.729 ms oleg631-client.virtnet: 1 19896 0.41 MB/sec execute 201 sec latency 72.569 ms oleg631-client.virtnet: 1 20130 0.41 MB/sec execute 202 sec latency 26.100 ms oleg631-client.virtnet: 1 20201 0.41 MB/sec execute 203 sec latency 196.153 ms oleg631-client.virtnet: 1 20385 0.42 MB/sec execute 204 sec latency 201.809 ms oleg631-client.virtnet: 1 20702 0.44 MB/sec execute 205 sec latency 263.402 ms oleg631-client.virtnet: 1 20989 0.44 MB/sec execute 206 sec latency 202.352 ms oleg631-client.virtnet: 1 21316 0.46 MB/sec execute 207 sec latency 65.902 ms oleg631-client.virtnet: 1 21485 0.45 MB/sec execute 208 sec latency 31.444 ms oleg631-client.virtnet: 1 21633 0.46 MB/sec execute 209 sec latency 43.945 ms oleg631-client.virtnet: 1 21797 0.46 MB/sec execute 210 sec latency 26.056 ms oleg631-client.virtnet: 1 21920 0.45 MB/sec execute 211 sec latency 242.276 ms oleg631-client.virtnet: 1 21920 0.45 MB/sec execute 212 sec latency 1242.540 ms oleg631-client.virtnet: 1 21920 0.45 MB/sec execute 213 sec latency 2242.794 ms oleg631-client.virtnet: 1 21920 0.45 MB/sec execute 214 sec latency 3243.073 ms oleg631-client.virtnet: 1 21920 0.45 MB/sec execute 215 sec latency 4243.313 ms oleg631-client.virtnet: 1 21920 0.44 MB/sec execute 216 sec latency 5243.594 ms oleg631-client.virtnet: 1 21920 0.44 MB/sec execute 217 sec latency 6243.837 ms oleg631-client.virtnet: 1 21920 0.44 MB/sec execute 218 sec latency 7244.108 ms oleg631-client.virtnet: 1 21920 0.44 MB/sec execute 219 sec latency 8244.464 ms oleg631-client.virtnet: 1 21920 0.44 MB/sec execute 220 sec latency 9244.714 ms oleg631-client.virtnet: 1 21920 0.43 MB/sec execute 221 sec latency 10245.085 ms oleg631-client.virtnet: 1 21920 0.43 MB/sec execute 222 sec latency 11245.508 ms oleg631-client.virtnet: 1 21920 0.43 MB/sec execute 223 sec latency 12245.777 ms oleg631-client.virtnet: 1 21920 0.43 MB/sec execute 224 sec latency 13246.112 ms oleg631-client.virtnet: 1 21920 0.43 MB/sec execute 225 sec latency 14246.476 ms oleg631-client.virtnet: 1 21920 0.42 MB/sec execute 226 sec latency 15246.729 ms oleg631-client.virtnet: 1 21920 0.42 MB/sec execute 227 sec latency 16246.931 ms oleg631-client.virtnet: 1 21920 0.42 MB/sec execute 228 sec latency 17247.253 ms oleg631-client.virtnet: 1 21920 0.42 MB/sec execute 229 sec latency 18247.524 ms oleg631-client.virtnet: 1 21920 0.42 MB/sec execute 230 sec latency 19247.786 ms oleg631-client.virtnet: 1 21920 0.42 MB/sec execute 231 sec latency 20248.000 ms oleg631-client.virtnet: 1 21920 0.41 MB/sec execute 232 sec latency 21248.278 ms oleg631-client.virtnet: 1 21920 0.41 MB/sec execute 233 sec latency 22248.737 ms oleg631-client.virtnet: 1 21920 0.41 MB/sec execute 234 sec latency 23248.999 ms oleg631-client.virtnet: 1 21920 0.41 MB/sec execute 235 sec latency 24249.308 ms oleg631-client.virtnet: 1 21920 0.41 MB/sec execute 236 sec latency 25249.646 ms oleg631-client.virtnet: 1 21920 0.40 MB/sec execute 237 sec latency 26249.913 ms oleg631-client.virtnet: 1 21920 0.40 MB/sec execute 238 sec latency 27250.215 ms oleg631-client.violeg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 06:05:34 (1743501934) targets are mounted 06:05:34 (1743501934) facet_failover done oleg631-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec rtnet: 1 21920 0.40 MB/sec execute 239 sec latency 28250.469 ms oleg631-client.virtnet: 1 21920 0.40 MB/sec execute 240 sec latency 29250.728 ms oleg631-client.virtnet: 1 21920 0.40 MB/sec execute 241 sec latency 30251.071 ms oleg631-client.virtnet: 1 21920 0.40 MB/sec execute 242 sec latency 31251.460 ms oleg631-client.virtnet: 1 21920 0.39 MB/sec execute 243 sec latency 32253.412 ms oleg631-client.virtnet: 1 21920 0.39 MB/sec execute 244 sec latency 33253.685 ms oleg631-client.virtnet: 1 21920 0.39 MB/sec execute 245 sec latency 34253.910 ms oleg631-client.virtnet: 1 21920 0.39 MB/sec execute 246 sec latency 35254.176 ms oleg631-client.virtnet: 1 21920 0.39 MB/sec execute 247 sec latency 36254.471 ms oleg631-client.virtnet: 1 21920 0.39 MB/sec execute 248 sec latency 37254.716 ms oleg631-client.virtnet: 1 21920 0.39 MB/sec execute 249 sec latency 38255.676 ms oleg631-client.virtnet: 1 21920 0.38 MB/sec execute 250 sec latency 39256.141 ms oleg631-client.virtnet: 1 21920 0.38 MB/sec execute 251 sec latency 40256.345 ms oleg631-client.virtnet: 1 22001 0.38 MB/sec execute 252 sec latency 40369.871 ms oleg631-client.virtnet: 1 22057 0.38 MB/sec execute 253 sec latency 310.164 ms oleg631-client.virtnet: 1 22125 0.38 MB/sec execute 254 sec latency 190.409 ms oleg631-client.virtnet: 1 22213 0.38 MB/sec execute 255 sec latency 132.941 ms oleg631-client.virtnet: 1 22304 0.38 MB/sec execute 256 sec latency 140.190 ms oleg631-client.virtnet: 1 22479 0.38 MB/sec execute 257 sec latency 108.627 ms oleg631-client.virtnet: 1 22627 0.38 MB/sec execute 258 sec latency 43.759 ms oleg631-client.virtnet: 1 22902 0.39 MB/sec execute 259 sec latency 125.925 ms oleg631-client.virtnet: 1 23043 0.39 MB/sec execute 260 sec latency 122.397 ms oleg631-client.virtnet: 1 23216 0.39 MB/sec execute 261 sec latency 171.885 ms oleg631-client.virtnet: 1 23489 0.39 MB/sec execute 262 sec latency 95.364 ms oleg631-client.virtnet: 1 23726 0.39 MB/sec execute 263 sec latency 128.729 ms oleg631-client.virtnet: 1 23921 0.39 MB/sec execute 264 sec latency 123.323 ms oleg631-client.virtnet: 1 24305 0.41 MB/sec execute 265 sec latency 145.602 ms oleg631-client.virtnet: 1 24662 0.41 MB/sec execute 266 sec latency 112.274 ms oleg631-client.virtnet: 1 25004 0.43 MB/sec execute 267 sec latency 42.269 ms oleg631-client.virtnet: 1 25247 0.43 MB/sec execute 268 sec latency 17.651 ms oleg631-client.virtnet: 1 25432 0.43 MB/sec execute 269 sec latency 36.374 ms oleg631-client.virtnet: 1 25588 0.43 MB/sec execute 270 sec latency 315.451 ms oleg631-client.virtnet: 1 25684 0.43 MB/sec execute 271 sec latency 140.430 ms oleg631-client.virtnet: 1 25838 0.43 MB/sec execute 272 sec latency 103.664 ms oleg631-client.virtnet: 1 26062 0.43 MB/sec execute 273 sec latency 93.167 ms oleg631-client.virtnet: 1 26363 0.44 MB/sec execute 274 sec latency 91.460 ms oleg631-client.virtnet: 1 26541 0.44 MB/sec execute 275 sec latency 158.758 ms oleg631-client.virtnet: 1 26765 0.44 MB/sec execute 276 sec latency 19.056 ms oleg631-client.virtnet: 1 27028 0.44 MB/sec execute 277 sec latency 119.911 ms oleg631-client.virtnet: 1 27252 0.44 MB/sec execute 278 sec latency 196.668 ms oleg631-client.virtnet: 1 27456 0.44 MB/sec execute 279 sec latency 164.839 ms oleg631-client.virtnet: 1 27850 0.46 MB/sec execute 280 sec latency 129.517 ms oleg631-client.virtnet: 1 28187 0.46 MB/sec execute 281 sec latency 113.505 ms oleg631-client.virtnet: 1 28479 0.47 MB/sec execute 282 sec latency 60.997 ms oleg631-client.virtnet: 1 28688 0.47 MB/sec execute 283 sec latency 24.601 ms oleg631-client.virtnet: 1 28890 0.47 MB/sec execute 284 sec latency 25.194 ms oleg631-client.virtnet: 1 29093 0.47 MB/sec execute 285 sec latency 256.765 ms oleg631-client.virtnet: 1 29169 0.47 MB/sec execute 286 sec latency 299.349 ms oleg631-client.virtnet: 1 29300 0.47 MB/sec execute 287 sec latency 174.101 ms oleg631-client.virtnet: 1 29407 0.47 MB/sec execute 288 sec latency 135.860 ms oleg631-client.virtnet: 1 29652 0.47 MB/sec execute 289 sec latency 59.048 ms oleg631-client.virtnet: 1 29910 0.48 MB/sec execute 290 sec latency 159.816 ms oleg631-client.virtnet: 1 30093 0.48 MB/sec execute 291 sec latency 107.937 ms oleg631-client.virtnet: 1 30278 0.48 MB/sec execute 292 sec latency 23.813 ms oleg631-client.virtnet: 1 30463 0.48 MB/sec execute 293 sec latency 134.212 ms oleg631-client.virtnet: 1 30653 0.48 MB/sec execute 294 sec latency 32.369 ms oleg631-client.virtnet: 1 30835 0.48 MB/sec execute 295 sec latency 187.242 ms oleg631-client.virtnet: 1 31029 0.48 MB/sec execute 296 sec latency 175.464 ms oleg631-client.virtnet: 1 31408 0.49 MB/sec execute 297 sec latency 147.968 ms oleg631-client.virtnet: 1 31787 0.50 MB/sec execute 298 sec latency 106.775 ms oleg631-client.virtnet: 1 32113 0.51 MB/sec execute 299 sec latency 18.902 ms oleg631-client.virtnet: 1 cleanup 300 sec oleg631-client.virtnet: 0 cleanup 301 sec oleg631-client.virtnet: oleg631-client.virtnet: Operation Count AvgLat MaxLat oleg631-client.virtnet: ---------------------------------------- oleg631-client.virtnet: NTCreateX 4953 25.717 37029.724 oleg631-client.virtnet: Close 3629 8.691 20888.360 oleg631-client.virtnet: Rename 208 21.696 63.501 oleg631-client.virtnet: Unlink 1002 68.386 40369.853 oleg631-client.virtnet: Qpathinfo 4468 3.901 63.163 oleg631-client.virtnet: Qfileinfo 777 0.428 11.592 oleg631-client.virtnet: Qfsinfo 821 0.296 34.349 oleg631-client.virtnet: Sfileinfo 400 13.999 63.686 oleg631-client.virtnet: Find 1725 1.791 73.972 oleg631-client.virtnet: WriteX 2431 2.041 19.727 oleg631-client.virtnet: ReadX 7713 0.046 5.354 oleg631-client.virtnet: LockX 16 2.852 6.530 oleg631-client.virtnet: UnlockX 16 2.555 4.576 oleg631-client.virtnet: Flush 344 103.608 365.565 oleg631-client.virtnet: oleg631-client.virtnet: Throughput 0.50831 MB/sec 1 clients 1 procs max_latency=40369.871 ms oleg631-client.virtnet: stopping dbench on /mnt/lustre/d70b.replay-single/oleg631-client.virtnet at Tue Apr 1 06:06:30 EDT 2025 with return code 0 oleg631-client.virtnet: clean dbench files on /mnt/lustre/d70b.replay-single/oleg631-client.virtnet oleg631-client.virtnet: /mnt/lustre/d70b.replay-single/oleg631-client.virtnet /mnt/lustre/d70b.replay-single/oleg631-client.virtnet oleg631-client.virtnet: removed directory 'clients/client0/~dmtmp/WORD' oleg631-client.virtnet: removed directory 'clients/client0/~dmtmp/PARADOX' oleg631-client.virtnet: removed directory 'clients/client0/~dmtmp/EXCEL' oleg631-client.virtnet: removed directory 'clients/client0/~dmtmp/ACCESS' oleg631-client.virtnet: removed directory 'clients/client0/~dmtmp/SEED' oleg631-client.virtnet: removed directory 'clients/client0/~dmtmp/PM' oleg631-client.virtnet: removed directory 'clients/client0/~dmtmp/WORDPRO' oleg631-client.virtnet: removed directory 'clients/client0/~dmtmp/PWRPNT' oleg631-client.virtnet: removed directory 'clients/client0/~dmtmp/COREL' oleg631-client.virtnet: removed directory 'clients/client0/~dmtmp' oleg631-client.virtnet: removed directory 'clients/client0' oleg631-client.virtnet: removed directory 'clients' oleg631-client.virtnet: removed 'client.txt' oleg631-client.virtnet: /mnt/lustre/d70b.replay-single/oleg631-client.virtnet oleg631-client.virtnet: dbench successfully finished PASS 70b (372s) == replay-single test 70c: tar 1mdts recovery ============ 06:06:37 (1743501997) Starting client oleg631-client.virtnet: -o user_xattr,flock 192.168.206.131@tcp:/lustre /mnt/lustre Started clients oleg631-client.virtnet: 192.168.206.131@tcp:/lustre on /mnt/lustre type lustre (rw,checksum,flock,user_xattr,lruresize,lazystatfs,nouser_fid2path,verbose,encrypt,statfs_project) Started tar 132229 tar: Removing leading `/' from member names tar: Removing leading `/' from hard link targets tar: Removing leading `/' from member names tar: Removing leading `/' from hard link targets tar: Removing leading `/' from member names tar: Removing leading `/' from hard link targets tar: Removing leading `/' from member names tar: Removing leading `/' from hard link targets UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210176 7168 2200960 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 19456 3393536 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 13312 3426304 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 32768 6819840 1% /mnt/lustre test_70c fail mds1 1 times Failing mds1 on oleg631-server Stopping /mnt/lustre-mds1 (opts:) on oleg631-server 06:09:04 (1743502144) shut down facet: mds1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover mds1 to oleg631-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 06:09:35 (1743502175) targets are mounted 06:09:35 (1743502175) facet_failover done oleg631-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec tar: Removing leading `/' from member names tar: Removing leading `/' from hard link targets tar: Removing leading `/' from member names tar: Removing leading `/' from hard link targets tar: Removing leading `/' from member names tar: Removing leading `/' from hard link targets UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210176 7168 2200960 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 13312 3427328 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 13312 3424256 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 26624 6851584 1% /mnt/lustre tar: Removing leading `/' from member names tar: Removing leading `/' from hard link targets test_70c fail mds1 2 times Failing mds1 on oleg631-server Stopping /mnt/lustre-mds1 (opts:) on oleg631-server 06:12:23 (1743502343) shut down facet: mds1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover mds1 to oleg631-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 06:12:55 (1743502375) targets are mounted 06:12:55 (1743502375) facet_failover done oleg631-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 70c (445s) == replay-single test 70d: mkdir/rmdir striped dir 1mdts recovery ========================================================== 06:14:02 (1743502442) SKIP: replay-single test_70d needs >= 2 MDTs SKIP 70d (3s) == replay-single test 70e: rename cross-MDT with random fails ========================================================== 06:14:06 (1743502446) SKIP: replay-single test_70e needs >= 2 MDTs SKIP 70e (3s) == replay-single test 70f: OSS O_DIRECT recovery with 1 clients ========================================================== 06:14:09 (1743502449) mount clients oleg631-client.virtnet ... Starting client oleg631-client.virtnet: -o user_xattr,flock 192.168.206.131@tcp:/lustre /mnt/lustre Started clients oleg631-client.virtnet: 192.168.206.131@tcp:/lustre on /mnt/lustre type lustre (rw,checksum,flock,user_xattr,lruresize,lazystatfs,nouser_fid2path,verbose,encrypt,statfs_project) Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg631-client.virtnet' ... ldlm.namespaces.MGC192.168.206.131@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff899607b39000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg631-client.virtnet' ... ldlm.namespaces.MGC192.168.206.131@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff899607b39000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg631-client.virtnet' ... ldlm.namespaces.MGC192.168.206.131@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff899607b39000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg631-client.virtnet' ... ldlm.namespaces.MGC192.168.206.131@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff899607b39000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg631-client.virtnet' ... UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210048 3840 2204160 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 9216 3760128 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 7168 3762176 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 16384 7522304 1% /mnt/lustre ldlm.namespaces.MGC192.168.206.131@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff899607b39000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg631-client.virtnet' ... test_70f failing OST 1 times Failing ost1 on oleg631-server Stopping /mnt/lustre-ost1 (opts:) on oleg631-server 06:14:26 (1743502466) shut down facet: ost1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover ost1 to oleg631-server mount facets: ost1 Starting ost1: -o localrecov lustre-ost1/ost1 /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 ldlm.namespaces.MGC192.168.206.131@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff899607b39000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg631-client.virtnet' ... ldlm.namespaces.MGC192.168.206.131@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff899607b39000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg631-client.virtnet' ... ldlm.namespaces.MGC192.168.206.131@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff899607b39000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg631-client.virtnet' ... oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all ldlm.namespaces.MGC192.168.206.131@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff899607b39000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg631-client.virtnet' ... pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 ldlm.namespaces.MGC192.168.206.131@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff899607b39000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg631-client.virtnet' ... Started lustre-OST0000 06:14:51 (1743502491) targets are mounted 06:14:51 (1743502491) facet_failover done ldlm.namespaces.MGC192.168.206.131@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff899607b39000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg631-client.virtnet' ... ldlm.namespaces.MGC192.168.206.131@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff899607b39000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg631-client.virtnet' ... ldlm.namespaces.MGC192.168.206.131@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff899607b39000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg631-client.virtnet' ... ldlm.namespaces.MGC192.168.206.131@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff899607b39000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg631-client.virtnet' ... oleg631-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid ldlm.namespaces.MGC192.168.206.131@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff899607b39000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg631-client.virtnet' ... ldlm.namespaces.MGC192.168.206.131@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff899607b39000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg631-client.virtnet' ... osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec ldlm.namespaces.MGC192.168.206.131@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff899607b39000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg631-client.virtnet' ... ldlm.namespaces.MGC192.168.206.131@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff899607b39000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg631-client.virtnet' ... ldlm.namespaces.MGC192.168.206.131@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff899607b39000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg631-client.virtnet' ... ldlm.namespaces.MGC192.168.206.131@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff899607b39000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg631-client.virtnet' ... ldlm.namespaces.MGC192.168.206.131@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff899607b39000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg631-client.virtnet' ... ldlm.namespaces.MGC192.168.206.131@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff899607b39000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg631-client.virtnet' ... ldlm.namespaces.MGC192.168.206.131@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff899607b39000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg631-client.virtnet' ... ldlm.namespaces.MGC192.168.206.131@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff899607b39000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg631-client.virtnet' ... ldlm.namespaces.MGC192.168.206.131@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff899607b39000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg631-client.virtnet' ... UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210048 3840 2204160 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 9216 3746816 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 7168 3753984 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 16384 7500800 1% /mnt/lustre ldlm.namespaces.MGC192.168.206.131@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff899607b39000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg631-client.virtnet' ... ldlm.namespaces.MGC192.168.206.131@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff899607b39000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg631-client.virtnet' ... test_70f failing OST 2 times Failing ost1 on oleg631-server Stopping /mnt/lustre-ost1 (opts:) on oleg631-server 06:15:22 (1743502522) shut down facet: ost1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover ost1 to oleg631-server mount facets: ost1 Starting ost1: -o localrecov lustre-ost1/ost1 /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 ldlm.namespaces.MGC192.168.206.131@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff899607b39000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg631-client.virtnet' ... ldlm.namespaces.MGC192.168.206.131@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff899607b39000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg631-client.virtnet' ... ldlm.namespaces.MGC192.168.206.131@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff899607b39000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg631-client.virtnet' ... oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all ldlm.namespaces.MGC192.168.206.131@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff899607b39000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg631-client.virtnet' ... ldlm.namespaces.MGC192.168.206.131@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff899607b39000.lru_size=clear pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg631-client.virtnet' ... ldlm.namespaces.MGC192.168.206.131@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff899607b39000.lru_size=clear Started lustre-OST0000 Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg631-client.virtnet' ... 06:15:47 (1743502547) targets are mounted 06:15:47 (1743502547) facet_failover done ldlm.namespaces.MGC192.168.206.131@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff899607b39000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg631-client.virtnet' ... ldlm.namespaces.MGC192.168.206.131@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff899607b39000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg631-client.virtnet' ... ldlm.namespaces.MGC192.168.206.131@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff899607b39000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg631-client.virtnet' ... ldlm.namespaces.MGC192.168.206.131@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff899607b39000.lru_size=clear oleg631-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg631-client.virtnet' ... ldlm.namespaces.MGC192.168.206.131@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff899607b39000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg631-client.virtnet' ... osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec ldlm.namespaces.MGC192.168.206.131@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff899607b39000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg631-client.virtnet' ... ldlm.namespaces.MGC192.168.206.131@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff899607b39000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg631-client.virtnet' ... ldlm.namespaces.MGC192.168.206.131@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff899607b39000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg631-client.virtnet' ... ldlm.namespaces.MGC192.168.206.131@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff899607b39000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg631-client.virtnet' ... ldlm.namespaces.MGC192.168.206.131@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff899607b39000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg631-client.virtnet' ... ldlm.namespaces.MGC192.168.206.131@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff899607b39000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg631-client.virtnet' ... ldlm.namespaces.MGC192.168.206.131@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff899607b39000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg631-client.virtnet' ... ldlm.namespaces.MGC192.168.206.131@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff899607b39000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg631-client.virtnet' ... ldlm.namespaces.MGC192.168.206.131@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff899607b39000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg631-client.virtnet' ... ldlm.namespaces.MGC192.168.206.131@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff899607b39000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg631-client.virtnet' ... UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210048 3840 2204160 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 7168 3752960 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 7168 3762176 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 14336 7515136 1% /mnt/lustre ldlm.namespaces.MGC192.168.206.131@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff899607b39000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg631-client.virtnet' ... test_70f failing OST 3 times Failing ost1 on oleg631-server Stopping /mnt/lustre-ost1 (opts:) on oleg631-server 06:16:15 (1743502575) shut down facet: ost1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover ost1 to oleg631-server mount facets: ost1 Starting ost1: -o localrecov lustre-ost1/ost1 /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 ldlm.namespaces.MGC192.168.206.131@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff899607b39000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg631-client.virtnet' ... ldlm.namespaces.MGC192.168.206.131@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff899607b39000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg631-client.virtnet' ... ldlm.namespaces.MGC192.168.206.131@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff899607b39000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg631-client.virtnet' ... ldlm.namespaces.MGC192.168.206.131@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff899607b39000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg631-client.virtnet' ... oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all ldlm.namespaces.MGC192.168.206.131@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff899607b39000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg631-client.virtnet' ... pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 ldlm.namespaces.MGC192.168.206.131@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff899607b39000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg631-client.virtnet' ... Started lustre-OST0000 06:16:40 (1743502600) targets are mounted 06:16:40 (1743502600) facet_failover done ldlm.namespaces.MGC192.168.206.131@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff899607b39000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg631-client.virtnet' ... ldlm.namespaces.MGC192.168.206.131@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff899607b39000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg631-client.virtnet' ... ldlm.namespaces.MGC192.168.206.131@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff899607b39000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg631-client.virtnet' ... ldlm.namespaces.MGC192.168.206.131@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff899607b39000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg631-client.virtnet' ... oleg631-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid ldlm.namespaces.MGC192.168.206.131@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff899607b39000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg631-client.virtnet' ... ldlm.namespaces.MGC192.168.206.131@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff899607b39000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg631-client.virtnet' ... osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec ldlm.namespaces.MGC192.168.206.131@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff899607b39000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg631-client.virtnet' ... ldlm.namespaces.MGC192.168.206.131@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff899607b39000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg631-client.virtnet' ... ldlm.namespaces.MGC192.168.206.131@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff899607b39000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg631-client.virtnet' ... ldlm.namespaces.MGC192.168.206.131@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff899607b39000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg631-client.virtnet' ... ldlm.namespaces.MGC192.168.206.131@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff899607b39000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg631-client.virtnet' ... ldlm.namespaces.MGC192.168.206.131@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff899607b39000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff899607b39000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg631-client.virtnet' ... PASS 70f (171s) == replay-single test 71a: mkdir/rmdir striped dir with 2 mdts recovery ========================================================== 06:17:00 (1743502620) SKIP: replay-single test_71a needs >= 2 MDTs SKIP 71a (4s) == replay-single test 73a: open(O_CREAT), unlink, replay, reconnect before open replay, close ========================================================== 06:17:04 (1743502624) multiop /mnt/lustre/f73a.replay-single vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7367 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210048 3840 2204160 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre fail_loc=0x80000302 Failing mds1 on oleg631-server Stopping /mnt/lustre-mds1 (opts:) on oleg631-server 06:17:13 (1743502633) shut down facet: mds1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover mds1 to oleg631-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 06:17:34 (1743502654) targets are mounted 06:17:34 (1743502654) facet_failover done oleg631-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 73a (56s) == replay-single test 73b: open(O_CREAT), unlink, replay, reconnect at open_replay reply, close ========================================================== 06:18:00 (1743502680) multiop /mnt/lustre/f73b.replay-single vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7367 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210048 3712 2204288 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 4096 3765248 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 7168 7531520 1% /mnt/lustre fail_loc=0x80000157 Failing mds1 on oleg631-server Stopping /mnt/lustre-mds1 (opts:) on oleg631-server 06:18:09 (1743502689) shut down facet: mds1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover mds1 to oleg631-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 06:18:40 (1743502720) targets are mounted 06:18:40 (1743502720) facet_failover done oleg631-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 73b (66s) == replay-single test 74: Ensure applications don't fail waiting for OST recovery ========================================================== 06:19:06 (1743502746) Stopping clients: oleg631-client.virtnet /mnt/lustre (opts:) Stopping client oleg631-client.virtnet /mnt/lustre opts: Stopping /mnt/lustre-ost1 (opts:) on oleg631-server Failing mds1 on oleg631-server Stopping /mnt/lustre-mds1 (opts:) on oleg631-server 06:19:15 (1743502755) shut down facet: mds1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover mds1 to oleg631-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 06:19:36 (1743502776) targets are mounted 06:19:36 (1743502776) facet_failover done Starting client oleg631-client.virtnet: -o user_xattr,flock 192.168.206.131@tcp:/lustre /mnt/lustre Started clients oleg631-client.virtnet: 192.168.206.131@tcp:/lustre on /mnt/lustre type lustre (rw,checksum,flock,user_xattr,lruresize,lazystatfs,nouser_fid2path,verbose,encrypt,statfs_project) Starting ost1: -o localrecov lustre-ost1/ost1 /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-OST0000 PASS 74 (50s) == replay-single test 80a: DNE: create remote dir, drop update rep from MDT0, fail MDT0 ========================================================== 06:19:56 (1743502796) SKIP: replay-single test_80a needs >= 2 MDTs SKIP 80a (3s) == replay-single test 80b: DNE: create remote dir, drop update rep from MDT0, fail MDT1 ========================================================== 06:19:59 (1743502799) SKIP: replay-single test_80b needs >= 2 MDTs SKIP 80b (3s) == replay-single test 80c: DNE: create remote dir, drop update rep from MDT1, fail MDT[0,1] ========================================================== 06:20:02 (1743502802) SKIP: replay-single test_80c needs >= 2 MDTs SKIP 80c (4s) == replay-single test 80d: DNE: create remote dir, drop update rep from MDT1, fail 2 MDTs ========================================================== 06:20:06 (1743502806) SKIP: replay-single test_80d needs >= 2 MDTs SKIP 80d (4s) == replay-single test 80e: DNE: create remote dir, drop MDT1 rep, fail MDT0 ========================================================== 06:20:10 (1743502810) SKIP: replay-single test_80e needs >= 2 MDTs SKIP 80e (3s) == replay-single test 80f: DNE: create remote dir, drop MDT1 rep, fail MDT1 ========================================================== 06:20:13 (1743502813) SKIP: replay-single test_80f needs >= 2 MDTs SKIP 80f (4s) == replay-single test 80g: DNE: create remote dir, drop MDT1 rep, fail MDT0, then MDT1 ========================================================== 06:20:17 (1743502817) SKIP: replay-single test_80g needs >= 2 MDTs SKIP 80g (3s) == replay-single test 80h: DNE: create remote dir, drop MDT1 rep, fail 2 MDTs ========================================================== 06:20:21 (1743502821) SKIP: replay-single test_80h needs >= 2 MDTs SKIP 80h (4s) == replay-single test 81a: DNE: unlink remote dir, drop MDT0 update rep, fail MDT1 ========================================================== 06:20:24 (1743502824) SKIP: replay-single test_81a needs >= 2 MDTs SKIP 81a (3s) == replay-single test 81b: DNE: unlink remote dir, drop MDT0 update reply, fail MDT0 ========================================================== 06:20:27 (1743502827) SKIP: replay-single test_81b needs >= 2 MDTs SKIP 81b (4s) == replay-single test 81c: DNE: unlink remote dir, drop MDT0 update reply, fail MDT0,MDT1 ========================================================== 06:20:31 (1743502831) SKIP: replay-single test_81c needs >= 2 MDTs SKIP 81c (3s) == replay-single test 81d: DNE: unlink remote dir, drop MDT0 update reply, fail 2 MDTs ========================================================== 06:20:35 (1743502835) SKIP: replay-single test_81d needs >= 2 MDTs SKIP 81d (3s) == replay-single test 81e: DNE: unlink remote dir, drop MDT1 req reply, fail MDT0 ========================================================== 06:20:38 (1743502838) SKIP: replay-single test_81e needs >= 2 MDTs SKIP 81e (3s) == replay-single test 81f: DNE: unlink remote dir, drop MDT1 req reply, fail MDT1 ========================================================== 06:20:42 (1743502842) SKIP: replay-single test_81f needs >= 2 MDTs SKIP 81f (3s) == replay-single test 81g: DNE: unlink remote dir, drop req reply, fail M0, then M1 ========================================================== 06:20:45 (1743502845) SKIP: replay-single test_81g needs >= 2 MDTs SKIP 81g (3s) == replay-single test 81h: DNE: unlink remote dir, drop request reply, fail 2 MDTs ========================================================== 06:20:49 (1743502849) SKIP: replay-single test_81h needs >= 2 MDTs SKIP 81h (3s) == replay-single test 84a: stale open during export disconnect ========================================================== 06:20:52 (1743502852) fail_loc=0x80000144 total: 1 open/close in 0.02 seconds: 51.26 ops/second pdsh@oleg631-client: oleg631-client: ssh exited with exit code 5 PASS 84a (11s) == replay-single test 85a: check the cancellation of unused locks during recovery(IBITS) ========================================================== 06:21:03 (1743502863) before recovery: unused locks count = 201 Failing mds1 on oleg631-server Stopping /mnt/lustre-mds1 (opts:) on oleg631-server 06:21:12 (1743502872) shut down facet: mds1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover mds1 to oleg631-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 06:21:34 (1743502894) targets are mounted 06:21:34 (1743502894) facet_failover done oleg631-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec after recovery: unused locks count = 101 PASS 85a (45s) == replay-single test 85b: check the cancellation of unused locks during recovery(EXTENT) ========================================================== 06:21:48 (1743502908) before recovery: unused locks count = 100 Failing ost1 on oleg631-server Stopping /mnt/lustre-ost1 (opts:) on oleg631-server 06:22:02 (1743502922) shut down facet: ost1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover ost1 to oleg631-server mount facets: ost1 Starting ost1: -o localrecov lustre-ost1/ost1 /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-OST0000 06:22:25 (1743502945) targets are mounted 06:22:25 (1743502945) facet_failover done oleg631-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec after recovery: unused locks count = 0 PASS 85b (51s) == replay-single test 86: umount server after clear nid_stats should not hit LBUG ========================================================== 06:22:39 (1743502959) Stopping clients: oleg631-client.virtnet /mnt/lustre (opts:) Stopping client oleg631-client.virtnet /mnt/lustre opts: mdt.lustre-MDT0000.exports.clear=0 Stopping /mnt/lustre-mds1 (opts:) on oleg631-server Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 Starting client oleg631-client.virtnet: -o user_xattr,flock 192.168.206.131@tcp:/lustre /mnt/lustre Started clients oleg631-client.virtnet: 192.168.206.131@tcp:/lustre on /mnt/lustre type lustre (rw,checksum,flock,user_xattr,lruresize,lazystatfs,nouser_fid2path,verbose,encrypt,statfs_project) PASS 86 (22s) == replay-single test 87a: write replay ================== 06:23:01 (1743502981) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210048 3712 2204288 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 4096 3765248 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 4096 3765248 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 8192 7530496 1% /mnt/lustre 8+0 records in 8+0 records out 8388608 bytes (8.4 MB, 8.0 MiB) copied, 0.337681 s, 24.8 MB/s Failing ost1 on oleg631-server Stopping /mnt/lustre-ost1 (opts:) on oleg631-server 06:23:10 (1743502990) shut down facet: ost1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover ost1 to oleg631-server mount facets: ost1 Starting ost1: -o localrecov lustre-ost1/ost1 /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-OST0000 06:23:33 (1743503013) targets are mounted 06:23:33 (1743503013) facet_failover done oleg631-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec 8+0 records in 8+0 records out 8388608 bytes (8.4 MB, 8.0 MiB) copied, 0.313252 s, 26.8 MB/s PASS 87a (47s) == replay-single test 87b: write replay with changed data (checksum resend) ========================================================== 06:23:48 (1743503028) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210048 3712 2204288 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 12288 3757056 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 4096 3765248 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 16384 7522304 1% /mnt/lustre 8+0 records in 8+0 records out 8388608 bytes (8.4 MB, 8.0 MiB) copied, 0.328691 s, 25.5 MB/s 8+0 records in 8+0 records out 8 bytes copied, 0.00381081 s, 2.1 kB/s Failing ost1 on oleg631-server Stopping /mnt/lustre-ost1 (opts:) on oleg631-server 06:23:59 (1743503039) shut down facet: ost1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover ost1 to oleg631-server mount facets: ost1 Starting ost1: -o localrecov lustre-ost1/ost1 /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-OST0000 06:24:21 (1743503061) targets are mounted 06:24:21 (1743503061) facet_failover done oleg631-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec 0+1 records in 0+1 records out 72 bytes copied, 0.00997206 s, 7.2 kB/s PASS 87b (47s) == replay-single test 88: MDS should not assign same objid to different files ========================================================== 06:24:35 (1743503075) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210048 3712 2204288 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 13312 3756032 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 4096 3765248 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 17408 7521280 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210048 3712 2204288 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 13312 3756032 1% /mnt/lustre[OST:0] R lustre-OST0001_UUID 3771392 4096 3765248 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 17408 7521280 1% /mnt/lustre before test: last_id = 9089, next_id = 9060 Creating to objid 9089 on ost lustre-OST0000... total: 31 open/close in 0.33 seconds: 94.13 ops/second total: 8 open/close in 0.06 seconds: 126.06 ops/second before recovery: last_id = 9089, next_id = 9060 Stopping /mnt/lustre-mds1 (opts:) on oleg631-server Stopping /mnt/lustre-ost1 (opts:) on oleg631-server Failover mds1 to oleg631-server oleg631-server.virtnet Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 Failover ost1 to oleg631-server oleg631-server.virtnet Starting ost1: -o localrecov lustre-ost1/ost1 /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-OST0000 after recovery: last_id = 9121, next_id = 9090 128+0 records in 128+0 records out 524288 bytes (524 kB, 512 KiB) copied, 0.0713802 s, 7.3 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB, 512 KiB) copied, 0.0845284 s, 6.2 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB, 512 KiB) copied, 0.057358 s, 9.1 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB, 512 KiB) copied, 0.0613644 s, 8.5 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB, 512 KiB) copied, 0.0856873 s, 6.1 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB, 512 KiB) copied, 0.0663679 s, 7.9 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB, 512 KiB) copied, 0.0647721 s, 8.1 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB, 512 KiB) copied, 0.0580326 s, 9.0 MB/s -rw-r--r-- 1 root root 0 Apr 1 06:24 /mnt/lustre/d88.replay-single/f-9060 -rw-r--r-- 1 root root 0 Apr 1 06:24 /mnt/lustre/d88.replay-single/f-9061 -rw-r--r-- 1 root root 0 Apr 1 06:24 /mnt/lustre/d88.replay-single/f-9062 -rw-r--r-- 1 root root 0 Apr 1 06:24 /mnt/lustre/d88.replay-single/f-9063 -rw-r--r-- 1 root root 0 Apr 1 06:24 /mnt/lustre/d88.replay-single/f-9064 -rw-r--r-- 1 root root 0 Apr 1 06:24 /mnt/lustre/d88.replay-single/f-9065 -rw-r--r-- 1 root root 0 Apr 1 06:24 /mnt/lustre/d88.replay-single/f-9066 -rw-r--r-- 1 root root 0 Apr 1 06:24 /mnt/lustre/d88.replay-single/f-9067 -rw-r--r-- 1 root root 0 Apr 1 06:24 /mnt/lustre/d88.replay-single/f-9068 -rw-r--r-- 1 root root 0 Apr 1 06:24 /mnt/lustre/d88.replay-single/f-9069 -rw-r--r-- 1 root root 0 Apr 1 06:24 /mnt/lustre/d88.replay-single/f-9070 -rw-r--r-- 1 root root 0 Apr 1 06:24 /mnt/lustre/d88.replay-single/f-9071 -rw-r--r-- 1 root root 0 Apr 1 06:24 /mnt/lustre/d88.replay-single/f-9072 -rw-r--r-- 1 root root 0 Apr 1 06:24 /mnt/lustre/d88.replay-single/f-9073 -rw-r--r-- 1 root root 0 Apr 1 06:24 /mnt/lustre/d88.replay-single/f-9074 -rw-r--r-- 1 root root 0 Apr 1 06:24 /mnt/lustre/d88.replay-single/f-9075 -rw-r--r-- 1 root root 0 Apr 1 06:24 /mnt/lustre/d88.replay-single/f-9076 -rw-r--r-- 1 root root 0 Apr 1 06:24 /mnt/lustre/d88.replay-single/f-9077 -rw-r--r-- 1 root root 0 Apr 1 06:24 /mnt/lustre/d88.replay-single/f-9078 -rw-r--r-- 1 root root 0 Apr 1 06:24 /mnt/lustre/d88.replay-single/f-9079 -rw-r--r-- 1 root root 0 Apr 1 06:24 /mnt/lustre/d88.replay-single/f-9080 -rw-r--r-- 1 root root 0 Apr 1 06:24 /mnt/lustre/d88.replay-single/f-9081 -rw-r--r-- 1 root root 0 Apr 1 06:24 /mnt/lustre/d88.replay-single/f-9082 -rw-r--r-- 1 root root 0 Apr 1 06:24 /mnt/lustre/d88.replay-single/f-9083 -rw-r--r-- 1 root root 0 Apr 1 06:24 /mnt/lustre/d88.replay-single/f-9084 -rw-r--r-- 1 root root 0 Apr 1 06:24 /mnt/lustre/d88.replay-single/f-9085 -rw-r--r-- 1 root root 0 Apr 1 06:24 /mnt/lustre/d88.replay-single/f-9086 -rw-r--r-- 1 root root 0 Apr 1 06:24 /mnt/lustre/d88.replay-single/f-9087 -rw-r--r-- 1 root root 0 Apr 1 06:24 /mnt/lustre/d88.replay-single/f-9088 -rw-r--r-- 1 root root 0 Apr 1 06:24 /mnt/lustre/d88.replay-single/f-9089 -rw-r--r-- 1 root root 0 Apr 1 06:24 /mnt/lustre/d88.replay-single/f-9090 -rw-r--r-- 1 root root 0 Apr 1 06:24 /mnt/lustre/d88.replay-single/f-9091 -rw-r--r-- 1 root root 0 Apr 1 06:24 /mnt/lustre/d88.replay-single/f-9092 -rw-r--r-- 1 root root 0 Apr 1 06:24 /mnt/lustre/d88.replay-single/f-9093 -rw-r--r-- 1 root root 0 Apr 1 06:24 /mnt/lustre/d88.replay-single/f-9094 -rw-r--r-- 1 root root 0 Apr 1 06:24 /mnt/lustre/d88.replay-single/f-9095 -rw-r--r-- 1 root root 0 Apr 1 06:24 /mnt/lustre/d88.replay-single/f-9096 -rw-r--r-- 1 root root 0 Apr 1 06:24 /mnt/lustre/d88.replay-single/f-9097 -rw-r--r-- 1 root root 524288 Apr 1 06:26 /mnt/lustre/d88.replay-single/f-9101 -rw-r--r-- 1 root root 524288 Apr 1 06:26 /mnt/lustre/d88.replay-single/f-9102 -rw-r--r-- 1 root root 524288 Apr 1 06:26 /mnt/lustre/d88.replay-single/f-9103 -rw-r--r-- 1 root root 524288 Apr 1 06:26 /mnt/lustre/d88.replay-single/f-9104 -rw-r--r-- 1 root root 524288 Apr 1 06:26 /mnt/lustre/d88.replay-single/f-9105 -rw-r--r-- 1 root root 524288 Apr 1 06:26 /mnt/lustre/d88.replay-single/f-9106 -rw-r--r-- 1 root root 524288 Apr 1 06:26 /mnt/lustre/d88.replay-single/f-9107 -rw-r--r-- 1 root root 524288 Apr 1 06:26 /mnt/lustre/d88.replay-single/f-9108 128+0 records in 128+0 records out 524288 bytes (524 kB, 512 KiB) copied, 0.0519461 s, 10.1 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB, 512 KiB) copied, 0.0481666 s, 10.9 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB, 512 KiB) copied, 0.0533237 s, 9.8 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB, 512 KiB) copied, 0.0554525 s, 9.5 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB, 512 KiB) copied, 0.0573375 s, 9.1 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB, 512 KiB) copied, 0.0552767 s, 9.5 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB, 512 KiB) copied, 0.0596245 s, 8.8 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB, 512 KiB) copied, 0.0582803 s, 9.0 MB/s PASS 88 (103s) == replay-single test 89: no disk space leak on late ost connection ========================================================== 06:26:19 (1743503179) Waiting for orphan cleanup... osp.lustre-OST0000-osc-MDT0000.old_sync_processed osp.lustre-OST0001-osc-MDT0000.old_sync_processed wait 40 secs maximumly for oleg631-server mds-ost sync done. sleep 5 for ZFS MDS Waiting for MDT destroys to complete 10+0 records in 10+0 records out 41943040 bytes (42 MB, 40 MiB) copied, 0.678977 s, 61.8 MB/s Stopping /mnt/lustre-ost1 (opts:) on oleg631-server Failing mds1 on oleg631-server Stopping /mnt/lustre-mds1 (opts:) on oleg631-server 06:26:40 (1743503200) shut down facet: mds1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover mds1 to oleg631-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 06:27:01 (1743503221) targets are mounted 06:27:01 (1743503221) facet_failover done Starting ost1: -o localrecov lustre-ost1/ost1 /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-OST0000 Starting client: oleg631-client.virtnet: -o user_xattr,flock 192.168.206.131@tcp:/lustre /mnt/lustre osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 59 sec Waiting for orphan cleanup... osp.lustre-OST0000-osc-MDT0000.old_sync_processed osp.lustre-OST0001-osc-MDT0000.old_sync_processed wait 40 secs maximumly for oleg631-server mds-ost sync done. sleep 5 for ZFS MDS Waiting for MDT destroys to complete free_before: 7518208 free_after: 7518208 PASS 89 (147s) == replay-single test 90: lfs find identifies the missing striped file segments ========================================================== 06:28:45 (1743503325) Create the files Fail ost1 lustre-OST0000_UUID, display the list of affected files Stopping /mnt/lustre-ost1 (opts:) on oleg631-server General Query: lfs find /mnt/lustre/d90.replay-single /mnt/lustre/d90.replay-single /mnt/lustre/d90.replay-single/all /mnt/lustre/d90.replay-single/f1 /mnt/lustre/d90.replay-single/f0 Querying files on shutdown ost1: lfs find --obd lustre-OST0000_UUID /mnt/lustre/d90.replay-single/all /mnt/lustre/d90.replay-single/f0 Check getstripe: /home/green/git/lustre-release/lustre/utils/lfs getstripe -r --obd lustre-OST0000_UUID /mnt/lustre/d90.replay-single/all lmm_stripe_count: 2 lmm_stripe_size: 4194304 lmm_pattern: raid0 lmm_layout_gen: 0 lmm_stripe_offset: 1 obdidx objid objid group 0 9122 0x23a2 0x240000400 * /mnt/lustre/d90.replay-single/f0 lmm_stripe_count: 1 lmm_stripe_size: 4194304 lmm_pattern: raid0 lmm_layout_gen: 0 lmm_stripe_offset: 0 obdidx objid objid group 0 9123 0x23a3 0x240000400 * /mnt/lustre/d90.replay-single/all /mnt/lustre/d90.replay-single/f0 Failover ost1 to oleg631-server oleg631-server.virtnet Starting ost1: -o localrecov lustre-ost1/ost1 /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-OST0000 PASS 90 (36s) == replay-single test 93a: replay + reconnect ============ 06:29:21 (1743503361) 1+0 records in 1+0 records out 1024 bytes (1.0 kB, 1.0 KiB) copied, 0.00544026 s, 188 kB/s fail_val=40 fail_loc=0x715 Failing ost1 on oleg631-server Stopping /mnt/lustre-ost1 (opts:) on oleg631-server 06:29:30 (1743503370) shut down facet: ost1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover ost1 to oleg631-server mount facets: ost1 Starting ost1: -o localrecov lustre-ost1/ost1 /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-OST0000 06:29:52 (1743503392) targets are mounted 06:29:52 (1743503392) facet_failover done oleg631-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec PASS 93a (80s) == replay-single test 93b: replay + reconnect on mds ===== 06:30:41 (1743503441) total: 20 open/close in 0.20 seconds: 100.77 ops/second fail_val=80 fail_loc=0x715 Failing mds1 on oleg631-server Stopping /mnt/lustre-mds1 (opts:) on oleg631-server 06:30:48 (1743503448) shut down facet: mds1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover mds1 to oleg631-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 06:31:09 (1743503469) targets are mounted 06:31:09 (1743503469) facet_failover done oleg631-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 93b (119s) == replay-single test 100a: DNE: create striped dir, drop update rep from MDT1, fail MDT1 ========================================================== 06:32:40 (1743503560) SKIP: replay-single test_100a needs >= 2 MDTs SKIP 100a (3s) == replay-single test 100b: DNE: create striped dir, fail MDT0 ========================================================== 06:32:44 (1743503564) SKIP: replay-single test_100b needs >= 2 MDTs SKIP 100b (4s) == replay-single test 100c: DNE: create striped dir, abort_recov_mdt mds2 ========================================================== 06:32:47 (1743503567) SKIP: replay-single test_100c needs >= 2 MDTs SKIP 100c (4s) == replay-single test 100d: DNE: cancel update logs upon recovery abort ========================================================== 06:32:51 (1743503571) SKIP: replay-single test_100d needs > 1 MDTs SKIP 100d (4s) == replay-single test 100e: DNE: create striped dir on MDT0 and MDT1, fail MDT0, MDT1 ========================================================== 06:32:55 (1743503575) SKIP: replay-single test_100e needs >= 2 MDTs SKIP 100e (4s) == replay-single test 101: Shouldn't reassign precreated objs to other files after recovery ========================================================== 06:32:59 (1743503579) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210048 3840 2204160 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 17408 3751936 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 7168 3762176 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 24576 7514112 1% /mnt/lustre Stopping /mnt/lustre-mds1 (opts:) on oleg631-server Failover mds1 to oleg631-server oleg631-server.virtnet Starting mds1: -o localrecov -o abort_recovery lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 PASS 101 (83s) == replay-single test 102a: check resend (request lost) with multiple modify RPCs in flight ========================================================== 06:34:22 (1743503662) creating 7 files ... fail_loc=0x159 launch 7 chmod in parallel (06:34:25) ... fail_loc=0 done (06:34:41) /mnt/lustre/d102a.replay-single/file-1 has perms 0600 OK /mnt/lustre/d102a.replay-single/file-2 has perms 0600 OK /mnt/lustre/d102a.replay-single/file-3 has perms 0600 OK /mnt/lustre/d102a.replay-single/file-4 has perms 0600 OK /mnt/lustre/d102a.replay-single/file-5 has perms 0600 OK /mnt/lustre/d102a.replay-single/file-6 has perms 0600 OK /mnt/lustre/d102a.replay-single/file-7 has perms 0600 OK PASS 102a (25s) == replay-single test 102b: check resend (reply lost) with multiple modify RPCs in flight ========================================================== 06:34:47 (1743503687) creating 7 files ... fail_loc=0x15a launch 7 chmod in parallel (06:34:50) ... fail_loc=0 done (06:35:06) /mnt/lustre/d102b.replay-single/file-1 has perms 0600 OK /mnt/lustre/d102b.replay-single/file-2 has perms 0600 OK /mnt/lustre/d102b.replay-single/file-3 has perms 0600 OK /mnt/lustre/d102b.replay-single/file-4 has perms 0600 OK /mnt/lustre/d102b.replay-single/file-5 has perms 0600 OK /mnt/lustre/d102b.replay-single/file-6 has perms 0600 OK /mnt/lustre/d102b.replay-single/file-7 has perms 0600 OK PASS 102b (25s) == replay-single test 102c: check replay w/o reconstruction with multiple mod RPCs in flight ========================================================== 06:35:13 (1743503713) creating 7 files ... UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210048 3968 2204032 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 18432 3336192 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 7168 3347456 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 25600 6683648 1% /mnt/lustre fail_loc=0x15a launch 7 chmod in parallel (06:35:18) ... fail_loc=0 Failing mds1 on oleg631-server Stopping /mnt/lustre-mds1 (opts:) on oleg631-server 06:35:23 (1743503723) shut down facet: mds1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover mds1 to oleg631-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 06:35:44 (1743503744) targets are mounted 06:35:44 (1743503744) facet_failover done oleg631-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec done (06:35:52) /mnt/lustre/d102c.replay-single/file-1 has perms 0600 OK /mnt/lustre/d102c.replay-single/file-2 has perms 0600 OK /mnt/lustre/d102c.replay-single/file-3 has perms 0600 OK /mnt/lustre/d102c.replay-single/file-4 has perms 0600 OK /mnt/lustre/d102c.replay-single/file-5 has perms 0600 OK /mnt/lustre/d102c.replay-single/file-6 has perms 0600 OK /mnt/lustre/d102c.replay-single/file-7 has perms 0600 OK PASS 102c (46s) == replay-single test 102d: check replay & reconstruction with multiple mod RPCs in flight ========================================================== 06:35:58 (1743503758) creating 7 files ... fail_loc=0x15a launch 7 chmod in parallel (06:36:01) ... fail_loc=0 Failing mds1 on oleg631-server Stopping /mnt/lustre-mds1 (opts:) on oleg631-server 06:36:07 (1743503767) shut down facet: mds1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover mds1 to oleg631-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 06:36:28 (1743503788) targets are mounted 06:36:28 (1743503788) facet_failover done oleg631-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec done (06:36:37) /mnt/lustre/d102d.replay-single/file-1 has perms 0600 OK /mnt/lustre/d102d.replay-single/file-2 has perms 0600 OK /mnt/lustre/d102d.replay-single/file-3 has perms 0600 OK /mnt/lustre/d102d.replay-single/file-4 has perms 0600 OK /mnt/lustre/d102d.replay-single/file-5 has perms 0600 OK /mnt/lustre/d102d.replay-single/file-6 has perms 0600 OK /mnt/lustre/d102d.replay-single/file-7 has perms 0600 OK PASS 102d (44s) == replay-single test 103: Check otr_next_id overflow ==== 06:36:43 (1743503803) fail_loc=0x80000162 total: 30 open/close in 0.32 seconds: 94.78 ops/second Failing mds1 on oleg631-server Stopping /mnt/lustre-mds1 (opts:) on oleg631-server 06:36:49 (1743503809) shut down facet: mds1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover mds1 to oleg631-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 06:37:11 (1743503831) targets are mounted 06:37:11 (1743503831) facet_failover done oleg631-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 103 (42s) == replay-single test 110a: DNE: create striped dir, fail MDT1 ========================================================== 06:37:24 (1743503844) SKIP: replay-single test_110a needs >= 2 MDTs SKIP 110a (4s) == replay-single test 110b: DNE: create striped dir, fail MDT1 and client ========================================================== 06:37:28 (1743503848) SKIP: replay-single test_110b needs >= 2 MDTs SKIP 110b (3s) == replay-single test 110c: DNE: create striped dir, fail MDT2 ========================================================== 06:37:31 (1743503851) SKIP: replay-single test_110c needs >= 2 MDTs SKIP 110c (3s) == replay-single test 110d: DNE: create striped dir, fail MDT2 and client ========================================================== 06:37:35 (1743503855) SKIP: replay-single test_110d needs >= 2 MDTs SKIP 110d (4s) == replay-single test 110e: DNE: create striped dir, uncommit on MDT2, fail client/MDT1/MDT2 ========================================================== 06:37:38 (1743503858) SKIP: replay-single test_110e needs >= 2 MDTs SKIP 110e (3s) SKIP: replay-single test_110f skipping excluded test 110f == replay-single test 110g: DNE: create striped dir, uncommit on MDT1, fail client/MDT1/MDT2 ========================================================== 06:37:43 (1743503863) SKIP: replay-single test_110g needs >= 2 MDTs SKIP 110g (3s) == replay-single test 111a: DNE: unlink striped dir, fail MDT1 ========================================================== 06:37:46 (1743503866) SKIP: replay-single test_111a needs >= 2 MDTs SKIP 111a (4s) == replay-single test 111b: DNE: unlink striped dir, fail MDT2 ========================================================== 06:37:50 (1743503870) SKIP: replay-single test_111b needs >= 2 MDTs SKIP 111b (3s) == replay-single test 111c: DNE: unlink striped dir, uncommit on MDT1, fail client/MDT1/MDT2 ========================================================== 06:37:53 (1743503873) SKIP: replay-single test_111c needs >= 2 MDTs SKIP 111c (4s) == replay-single test 111d: DNE: unlink striped dir, uncommit on MDT2, fail client/MDT1/MDT2 ========================================================== 06:37:57 (1743503877) SKIP: replay-single test_111d needs >= 2 MDTs SKIP 111d (3s) == replay-single test 111e: DNE: unlink striped dir, uncommit on MDT2, fail MDT1/MDT2 ========================================================== 06:38:01 (1743503881) SKIP: replay-single test_111e needs >= 2 MDTs SKIP 111e (3s) == replay-single test 111f: DNE: unlink striped dir, uncommit on MDT1, fail MDT1/MDT2 ========================================================== 06:38:04 (1743503884) SKIP: replay-single test_111f needs >= 2 MDTs SKIP 111f (3s) == replay-single test 111g: DNE: unlink striped dir, fail MDT1/MDT2 ========================================================== 06:38:08 (1743503888) SKIP: replay-single test_111g needs >= 2 MDTs SKIP 111g (4s) == replay-single test 112a: DNE: cross MDT rename, fail MDT1 ========================================================== 06:38:11 (1743503891) SKIP: replay-single test_112a needs >= 4 MDTs SKIP 112a (3s) == replay-single test 112b: DNE: cross MDT rename, fail MDT2 ========================================================== 06:38:15 (1743503895) SKIP: replay-single test_112b needs >= 4 MDTs SKIP 112b (3s) == replay-single test 112c: DNE: cross MDT rename, fail MDT3 ========================================================== 06:38:18 (1743503898) SKIP: replay-single test_112c needs >= 4 MDTs SKIP 112c (3s) == replay-single test 112d: DNE: cross MDT rename, fail MDT4 ========================================================== 06:38:22 (1743503902) SKIP: replay-single test_112d needs >= 4 MDTs SKIP 112d (3s) == replay-single test 112e: DNE: cross MDT rename, fail MDT1 and MDT2 ========================================================== 06:38:25 (1743503905) SKIP: replay-single test_112e needs >= 4 MDTs SKIP 112e (3s) == replay-single test 112f: DNE: cross MDT rename, fail MDT1 and MDT3 ========================================================== 06:38:28 (1743503908) SKIP: replay-single test_112f needs >= 4 MDTs SKIP 112f (4s) == replay-single test 112g: DNE: cross MDT rename, fail MDT1 and MDT4 ========================================================== 06:38:32 (1743503912) SKIP: replay-single test_112g needs >= 4 MDTs SKIP 112g (4s) == replay-single test 112h: DNE: cross MDT rename, fail MDT2 and MDT3 ========================================================== 06:38:36 (1743503916) SKIP: replay-single test_112h needs >= 4 MDTs SKIP 112h (3s) == replay-single test 112i: DNE: cross MDT rename, fail MDT2 and MDT4 ========================================================== 06:38:39 (1743503919) SKIP: replay-single test_112i needs >= 4 MDTs SKIP 112i (3s) == replay-single test 112j: DNE: cross MDT rename, fail MDT3 and MDT4 ========================================================== 06:38:43 (1743503923) SKIP: replay-single test_112j needs >= 4 MDTs SKIP 112j (4s) == replay-single test 112k: DNE: cross MDT rename, fail MDT1,MDT2,MDT3 ========================================================== 06:38:46 (1743503926) SKIP: replay-single test_112k needs >= 4 MDTs SKIP 112k (3s) == replay-single test 112l: DNE: cross MDT rename, fail MDT1,MDT2,MDT4 ========================================================== 06:38:49 (1743503929) SKIP: replay-single test_112l needs >= 4 MDTs SKIP 112l (4s) == replay-single test 112m: DNE: cross MDT rename, fail MDT1,MDT3,MDT4 ========================================================== 06:38:53 (1743503933) SKIP: replay-single test_112m needs >= 4 MDTs SKIP 112m (3s) == replay-single test 112n: DNE: cross MDT rename, fail MDT2,MDT3,MDT4 ========================================================== 06:38:56 (1743503936) SKIP: replay-single test_112n needs >= 4 MDTs SKIP 112n (4s) == replay-single test 115: failover for create/unlink striped directory ========================================================== 06:39:00 (1743503940) SKIP: replay-single test_115 needs >= 2 MDTs SKIP 115 (3s) == replay-single test 116a: large update log master MDT recovery ========================================================== 06:39:03 (1743503943) SKIP: replay-single test_116a needs >= 2 MDTs SKIP 116a (4s) == replay-single test 116b: large update log slave MDT recovery ========================================================== 06:39:07 (1743503947) SKIP: replay-single test_116b needs >= 2 MDTs SKIP 116b (3s) == replay-single test 117: DNE: cross MDT unlink, fail MDT1 and MDT2 ========================================================== 06:39:11 (1743503951) SKIP: replay-single test_117 needs >= 4 MDTs SKIP 117 (3s) == replay-single test 118: invalidate osp update will not cause update log corruption ========================================================== 06:39:14 (1743503954) SKIP: replay-single test_118 needs >= 2 MDTs SKIP 118 (3s) == replay-single test 119: timeout of normal replay does not cause DNE replay fails ========================================================== 06:39:17 (1743503957) SKIP: replay-single test_119 needs >= 2 MDTs SKIP 119 (3s) == replay-single test 120: DNE fail abort should stop both normal and DNE replay ========================================================== 06:39:21 (1743503961) SKIP: replay-single test_120 needs >= 2 MDTs SKIP 120 (3s) == replay-single test 121: lock replay timed out and race ========================================================== 06:39:24 (1743503964) multiop /mnt/lustre/f121.replay-single vs_s TMPPIPE=/tmp/multiop_open_wait_pipe.7367 Stopping /mnt/lustre-mds1 (opts:) on oleg631-server Failover mds1 to oleg631-server oleg631-server.virtnet fail_loc=0x721 fail_val=0 at_max=0 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 fail_loc=0x0 at_max=600 PASS 121 (42s) == replay-single test 130a: DoM file create (setstripe) replay ========================================================== 06:40:06 (1743504006) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2209920 3840 2204032 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 18432 3336192 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 7168 3762176 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 25600 7098368 1% /mnt/lustre Failing mds1 on oleg631-server Stopping /mnt/lustre-mds1 (opts:) on oleg631-server 06:40:14 (1743504014) shut down facet: mds1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover mds1 to oleg631-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 06:40:46 (1743504046) targets are mounted 06:40:46 (1743504046) facet_failover done oleg631-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 130a (53s) == replay-single test 130b: DoM file create (inherited) replay ========================================================== 06:40:59 (1743504059) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2209920 3840 2204032 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 18432 3750912 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 7168 3762176 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 25600 7513088 1% /mnt/lustre Failing mds1 on oleg631-server Stopping /mnt/lustre-mds1 (opts:) on oleg631-server 06:41:07 (1743504067) shut down facet: mds1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover mds1 to oleg631-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 06:41:29 (1743504089) targets are mounted 06:41:29 (1743504089) facet_failover done oleg631-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 130b (45s) == replay-single test 131a: DoM file write lock replay === 06:41:45 (1743504105) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2209920 3840 2204032 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 18432 3750912 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 7168 3762176 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 25600 7513088 1% /mnt/lustre 1+0 records in 1+0 records out 8 bytes copied, 0.00415555 s, 1.9 kB/s Failing mds1 on oleg631-server Stopping /mnt/lustre-mds1 (opts:) on oleg631-server 06:41:54 (1743504114) shut down facet: mds1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover mds1 to oleg631-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 06:42:26 (1743504146) targets are mounted 06:42:26 (1743504146) facet_failover done oleg631-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 131a (66s) SKIP: replay-single test_131b skipping excluded test 131b == replay-single test 132a: PFL new component instantiate replay ========================================================== 06:42:53 (1743504173) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2209920 3840 2204032 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 18432 3750912 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 7168 3762176 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 25600 7513088 1% /mnt/lustre 1+0 records in 1+0 records out 1048576 bytes (1.0 MB, 1.0 MiB) copied, 0.0423309 s, 24.8 MB/s /mnt/lustre/f132a.replay-single lcm_layout_gen: 3 lcm_mirror_count: 1 lcm_entry_count: 2 lcme_id: 1 lcme_mirror_id: 0 lcme_flags: init lcme_extent.e_start: 0 lcme_extent.e_end: 1048576 lmm_stripe_count: 1 lmm_stripe_size: 1048576 lmm_pattern: raid0 lmm_layout_gen: 0 lmm_stripe_offset: 0 lmm_objects: - 0: { l_ost_idx: 0, l_fid: [0x240000400:0x28c2:0x0] } lcme_id: 2 lcme_mirror_id: 0 lcme_flags: init lcme_extent.e_start: 1048576 lcme_extent.e_end: EOF lmm_stripe_count: 2 lmm_stripe_size: 4194304 lmm_pattern: raid0 lmm_layout_gen: 0 lmm_stripe_offset: 1 lmm_objects: - 0: { l_ost_idx: 1, l_fid: [0x280000400:0x2842:0x0] } - 1: { l_ost_idx: 0, l_fid: [0x240000400:0x28c3:0x0] } Failing mds1 on oleg631-server Stopping /mnt/lustre-mds1 (opts:) on oleg631-server 06:43:02 (1743504182) shut down facet: mds1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover mds1 to oleg631-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 06:43:33 (1743504213) targets are mounted 06:43:33 (1743504213) facet_failover done oleg631-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/f132a.replay-single lcm_layout_gen: 4 lcm_mirror_count: 1 lcm_entry_count: 2 lcme_id: 1 lcme_mirror_id: 0 lcme_flags: init lcme_extent.e_start: 0 lcme_extent.e_end: 1048576 lmm_stripe_count: 1 lmm_stripe_size: 1048576 lmm_pattern: raid0 lmm_layout_gen: 65535 lmm_stripe_offset: 0 lmm_objects: - 0: { l_ost_idx: 0, l_fid: [0x240000400:0x28c2:0x0] } lcme_id: 2 lcme_mirror_id: 0 lcme_flags: init lcme_extent.e_start: 1048576 lcme_extent.e_end: EOF lmm_stripe_count: 2 lmm_stripe_size: 4194304 lmm_pattern: raid0 lmm_layout_gen: 65535 lmm_stripe_offset: 1 lmm_objects: - 0: { l_ost_idx: 1, l_fid: [0x280000400:0x2842:0x0] } - 1: { l_ost_idx: 0, l_fid: [0x240000400:0x28c3:0x0] } PASS 132a (57s) == replay-single test 133: check resend of ongoing requests for lwp during failover ========================================================== 06:43:50 (1743504230) SKIP: replay-single test_133 needs >= 2 MDTs SKIP 133 (5s) == replay-single test 134: replay creation of a file created in a pool ========================================================== 06:43:54 (1743504234) Creating new pool pool_134 oleg631-server: Pool lustre.pool_134 created Adding targets to pool oleg631-server: OST lustre-OST0001_UUID added to pool lustre.pool_134 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2209920 3968 2203904 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 18432 3750912 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 8192 3761152 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 26624 7512064 1% /mnt/lustre Failing mds1 on oleg631-server Stopping /mnt/lustre-mds1 (opts:) on oleg631-server 06:44:08 (1743504248) shut down facet: mds1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover mds1 to oleg631-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 06:44:40 (1743504280) targets are mounted 06:44:40 (1743504280) facet_failover done oleg631-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Destroy the created pools: pool_134 lustre.pool_134 oleg631-server: OST lustre-OST0001_UUID removed from pool lustre.pool_134 oleg631-server: Pool lustre.pool_134 destroyed PASS 134 (71s) == replay-single test 135: Server failure in lock replay phase ========================================================== 06:45:05 (1743504305) Failing ost1 on oleg631-server Stopping /mnt/lustre-ost1 (opts:) on oleg631-server 06:45:11 (1743504311) shut down facet: ost1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover ost1 to oleg631-server mount facets: ost1 Starting ost1: -o localrecov lustre-ost1/ost1 /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-OST0000 06:45:34 (1743504334) targets are mounted 06:45:34 (1743504334) facet_failover done oleg631-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2209920 3968 2203904 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 18432 3750912 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 8192 3761152 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 26624 7512064 1% /mnt/lustre ldlm.cancel_unused_locks_before_replay=0 debug_mb=100 debug=+info +ha +dlmtrace Stopping /mnt/lustre-ost1 (opts:) on oleg631-server Failover ost1 to oleg631-server oleg631-server.virtnet oleg631-server: oleg631-server.virtnet: executing load_module ../libcfs/libcfs/libcfs fail_loc=0x32d fail_val=20 debug_mb=100 debug=+info +ha +dlmtrace Starting ost1: -o localrecov lustre-ost1/ost1 /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-OST0000 oleg631-client.virtnet: executing wait_import_state_mount REPLAY_LOCKS osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in REPLAY_LOCKS state after 0 sec Stopping /mnt/lustre-ost1 (opts:) on oleg631-server Failover ost1 to oleg631-server oleg631-server.virtnet oleg631-server: oleg631-server.virtnet: executing load_module ../libcfs/libcfs/libcfs fail_loc=0 Starting ost1: -o localrecov lustre-ost1/ost1 /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 End of sync oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-OST0000 Stopping /mnt/lustre-ost1 (opts:-f) on oleg631-server Stopping /mnt/lustre-ost2 (opts:-f) on oleg631-server Starting ost1: -o localrecov lustre-ost1/ost1 /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-OST0000 Starting ost2: -o localrecov lustre-ost2/ost2 /mnt/lustre-ost2 seq.cli-lustre-OST0001-super.width=65536 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-OST0001 pdsh@oleg631-client: oleg631-client: ssh exited with exit code 5 debug_mb=21 debug=super ioctl neterror warning dlmtrace error emerg ha rpctrace vfstrace config console lfsck ldlm.cancel_unused_locks_before_replay=1 PASS 135 (148s) == replay-single test 136: MDS to disconnect all OSPs first, then cleanup ldlm ========================================================== 06:47:33 (1743504453) SKIP: replay-single test_136 needs > 2 MDTs SKIP 136 (4s) == replay-single test 137a: DNE: create under striped dir, fail MDT1 ========================================================== 06:47:37 (1743504457) SKIP: replay-single test_137a needs >= 2 MDTs SKIP 137a (4s) == replay-single test 137b: DNE: create under striped dir, fail MDT2 ========================================================== 06:47:41 (1743504461) SKIP: replay-single test_137b needs >= 2 MDTs SKIP 137b (3s) == replay-single test 137c: DNE: create under striped dir, fail MDT1/MDT2 ========================================================== 06:47:45 (1743504465) SKIP: replay-single test_137c needs >= 2 MDTs SKIP 137c (3s) == replay-single test 200: Dropping one OBD_PING should not cause disconnect ========================================================== 06:47:49 (1743504469) SKIP: replay-single test_200 Need remote client SKIP 200 (3s) == replay-single test 201: MDT umount cascading disconnects timeouts ========================================================== 06:47:52 (1743504472) SKIP: replay-single test_201 needs >= 2 MDTs SKIP 201 (4s) == replay-single test 202: pfl replay should recovery layout generation ========================================================== 06:47:56 (1743504476) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2209920 3968 2203904 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 18432 3750912 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 8192 3761152 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 26624 7512064 1% /mnt/lustre Failing mds1 on oleg631-server Stopping /mnt/lustre-mds1 (opts:) on oleg631-server 06:48:06 (1743504486) shut down facet: mds1 facet_host: oleg631-server facet_failover_host: oleg631-server Failover mds1 to oleg631-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg631-server: oleg631-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg631-client: oleg631-server: ssh exited with exit code 1 Started lustre-MDT0000 06:48:26 (1743504506) targets are mounted 06:48:26 (1743504506) facet_failover done oleg631-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 202 (46s) == replay-single test complete, duration 11063 sec ======= 06:48:43 (1743504523) === replay-single: start cleanup 06:48:45 (1743504525) === === replay-single: finish cleanup 06:48:52 (1743504532) ===