-----============= acceptance-small: replay-single ============----- Mon May 20 21:06:28 EDT 2024 mgs: CentOS Linux release 7.9.2009 (Core) MGS_OS_VERSION_ID=7 MGS_OS_ID=centos MGS_OS_VERSION_CODE=117440512 MGS_OS_ID_LIKE=rhel fedora centos mds1: CentOS Linux release 7.9.2009 (Core) MDS1_OS_ID_LIKE=rhel fedora centos MDS1_OS_ID=centos MDS1_OS_VERSION_ID=7 MDS1_OS_VERSION_CODE=117440512 ost1: CentOS Linux release 7.9.2009 (Core) OST1_OS_VERSION_CODE=117440512 OST1_OS_VERSION_ID=7 OST1_OS_ID_LIKE=rhel fedora centos OST1_OS_ID=centos client: CentOS Linux release 7.9.2009 (Core) CLIENT_OS_ID=centos CLIENT_OS_ID_LIKE=rhel fedora centos CLIENT_OS_VERSION_ID=7 CLIENT_OS_VERSION_CODE=117440512 excepting tests: 110f 131b 59 36 === replay-single: start setup 21:06:37 (1716253597) === oleg237-client.virtnet: executing check_config_client /mnt/lustre oleg237-client.virtnet: Checking config lustre mounted on /mnt/lustre Checking servers environments Checking clients oleg237-client.virtnet environments Using TIMEOUT=20 osc.lustre-OST0000-osc-ffff88012ab46000.idle_timeout=debug osc.lustre-OST0001-osc-ffff88012ab46000.idle_timeout=debug disable quota as required oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all osd-ldiskfs.track_declares_assert=1 === replay-single: finish setup 21:06:49 (1716253609) === debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 0a: empty replay =================== 21:06:51 (1716253611) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1776 1285912 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1616 1286072 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1524 3605496 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1524 3605496 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3048 7210992 1% /mnt/lustre Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 21:06:58 (1716253618) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 21:07:16 (1716253636) targets are mounted 21:07:16 (1716253636) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 0a (34s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 0b: ensure object created after recover exists. (3284) ========================================================== 21:07:28 (1716253648) Failing ost1 on oleg237-server Stopping /mnt/lustre-ost1 (opts:) on oleg237-server 21:07:31 (1716253651) shut down Failover ost1 to oleg237-server mount facets: ost1 Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-OST0000 21:07:49 (1716253669) targets are mounted 21:07:49 (1716253669) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec total: 20 open/close in 0.26 seconds: 77.99 ops/second - unlinked 0 (time 1716253675 ; total 0 ; last 0) total: 20 unlinks in 0 seconds: inf unlinks/second PASS 0b (29s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 0c: check replay-barrier =========== 21:08:01 (1716253681) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1772 1285916 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1616 1286072 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1540 3605480 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1524 3605496 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3064 7210976 1% /mnt/lustre Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 21:08:06 (1716253686) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 21:08:23 (1716253703) targets are mounted 21:08:23 (1716253703) facet_failover done Starting client: oleg237-client.virtnet: -o user_xattr,flock oleg237-server@tcp:/lustre /mnt/lustre rm: cannot remove '/mnt/lustre/f0c.replay-single': No such file or directory PASS 0c (97s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 0d: expired recovery with no clients ========================================================== 21:09:40 (1716253780) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1772 1285916 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1616 1286072 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1540 3605480 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1524 3605496 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3064 7210976 1% /mnt/lustre Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 21:09:49 (1716253789) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 21:10:08 (1716253808) targets are mounted 21:10:08 (1716253808) facet_failover done Starting client: oleg237-client.virtnet: -o user_xattr,flock oleg237-server@tcp:/lustre /mnt/lustre PASS 0d (101s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 1: simple create =================== 21:11:23 (1716253883) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1772 1285916 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1616 1286072 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1540 3605480 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1524 3605496 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3064 7210976 1% /mnt/lustre Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 21:11:27 (1716253887) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 21:11:40 (1716253900) targets are mounted 21:11:40 (1716253900) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/f1.replay-single has type file OK PASS 1 (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 2a: touch ========================== 21:11:49 (1716253909) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1772 1285916 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1616 1286072 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1540 3605480 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1524 3605496 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3064 7210976 1% /mnt/lustre Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 21:11:52 (1716253912) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 21:12:06 (1716253926) targets are mounted 21:12:06 (1716253926) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/f2a.replay-single has type file OK PASS 2a (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 2b: touch ========================== 21:12:15 (1716253935) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1772 1285916 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1616 1286072 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1540 3605480 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1524 3605496 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3064 7210976 1% /mnt/lustre Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 21:12:18 (1716253938) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 21:12:32 (1716253952) targets are mounted 21:12:32 (1716253952) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/f2b.replay-single has type file OK PASS 2b (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 2c: setstripe replay =============== 21:12:41 (1716253961) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1772 1285916 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1616 1286072 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1540 3605480 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1524 3605496 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3064 7210976 1% /mnt/lustre Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 21:12:45 (1716253965) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 21:13:00 (1716253980) targets are mounted 21:13:00 (1716253980) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/f2c.replay-single has type file OK PASS 2c (26s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 2d: setdirstripe replay ============ 21:13:10 (1716253990) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1772 1285916 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1616 1286072 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1540 3605480 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1524 3605496 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3064 7210976 1% /mnt/lustre Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 21:13:14 (1716253994) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 21:13:29 (1716254009) targets are mounted 21:13:29 (1716254009) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/d2d.replay-single has type dir OK PASS 2d (28s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 2e: O_CREAT|O_EXCL create replay === 21:13:39 (1716254019) fail_loc=0x8000013b UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1856 1285832 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1692 1285996 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1540 3605480 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1524 3605496 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3064 7210976 1% /mnt/lustre Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 21:13:45 (1716254025) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 21:14:00 (1716254040) targets are mounted 21:14:00 (1716254040) facet_failover done Succeed in opening file "/mnt/lustre/f2e.replay-single"(flags=O_CREAT) oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/f2e.replay-single has type file OK PASS 2e (29s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 3a: replay failed open(O_DIRECTORY) ========================================================== 21:14:10 (1716254050) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1856 1285832 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1692 1285996 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1540 3605480 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1524 3605496 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3064 7210976 1% /mnt/lustre Error in opening file "/mnt/lustre/f3a.replay-single"(flags=O_DIRECTORY) 20: Not a directory Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 21:14:14 (1716254054) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 21:14:30 (1716254070) targets are mounted 21:14:30 (1716254070) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/f3a.replay-single has type file OK PASS 3a (27s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 3b: replay failed open -ENOMEM ===== 21:14:39 (1716254079) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1856 1285832 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1692 1285996 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1540 3605480 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1524 3605496 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3064 7210976 1% /mnt/lustre fail_loc=0x80000114 touch: cannot touch '/mnt/lustre/f3b.replay-single': Cannot allocate memory fail_loc=0 Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 21:14:44 (1716254084) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 21:15:00 (1716254100) targets are mounted 21:15:00 (1716254100) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Can't lstat /mnt/lustre/f3b.replay-single: No such file or directory PASS 3b (28s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 3c: replay failed open -ENOMEM ===== 21:15:09 (1716254109) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1856 1285832 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1692 1285996 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1540 3605480 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1524 3605496 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3064 7210976 1% /mnt/lustre fail_loc=0x80000128 touch: cannot touch '/mnt/lustre/f3c.replay-single': Cannot allocate memory fail_loc=0 Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 21:15:14 (1716254114) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 21:15:30 (1716254130) targets are mounted 21:15:30 (1716254130) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Can't lstat /mnt/lustre/f3c.replay-single: No such file or directory PASS 3c (28s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 4a: |x| 10 open(O_CREAT)s ========== 21:15:39 (1716254139) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1856 1285832 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1692 1285996 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1540 3605480 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1524 3605496 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3064 7210976 1% /mnt/lustre Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 21:15:44 (1716254144) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 21:16:00 (1716254160) targets are mounted 21:16:00 (1716254160) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 4a (28s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 4b: |x| rm 10 files ================ 21:16:10 (1716254170) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1856 1285832 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1692 1285996 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1560 3605460 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1544 3605476 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3104 7210936 1% /mnt/lustre Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 21:16:14 (1716254174) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 21:16:30 (1716254190) targets are mounted 21:16:30 (1716254190) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Can't lstat /mnt/lustre/f4b.replay-single-*: No such file or directory PASS 4b (27s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 5: |x| 220 open(O_CREAT) =========== 21:16:39 (1716254199) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1856 1285832 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1692 1285996 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1560 3605460 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1544 3605476 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3104 7210936 1% /mnt/lustre Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 21:16:46 (1716254206) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 21:17:01 (1716254221) targets are mounted 21:17:01 (1716254221) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 5 (38s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 6a: mkdir + contained create ======= 21:17:20 (1716254240) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1888 1285800 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1692 1285996 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1560 3605460 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1544 3605476 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3104 7210936 1% /mnt/lustre Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 21:17:24 (1716254244) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 21:17:39 (1716254259) targets are mounted 21:17:39 (1716254259) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/d6a.replay-single has type dir OK /mnt/lustre/d6a.replay-single/f6a.replay-single has type file OK PASS 6a (30s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 6b: |X| rmdir ====================== 21:17:51 (1716254271) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1884 1285804 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1692 1285996 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1560 3605460 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1544 3605476 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3104 7210936 1% /mnt/lustre Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 21:17:55 (1716254275) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 21:18:11 (1716254291) targets are mounted 21:18:11 (1716254291) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Can't lstat /mnt/lustre/d6b.replay-single: No such file or directory PASS 6b (27s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 7: mkdir |X| contained create ====== 21:18:20 (1716254300) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1884 1285804 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1692 1285996 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1560 3605460 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1544 3605476 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3104 7210936 1% /mnt/lustre Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 21:18:25 (1716254305) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 21:18:40 (1716254320) targets are mounted 21:18:40 (1716254320) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/d7.replay-single has type dir OK /mnt/lustre/d7.replay-single/f7.replay-single has type file OK PASS 7 (28s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 8: creat open |X| close ============ 21:18:50 (1716254330) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1880 1285808 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1692 1285996 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1560 3605460 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1544 3605476 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3104 7210936 1% /mnt/lustre multiop /mnt/lustre/f8.replay-single vmo_c TMPPIPE=/tmp/multiop_open_wait_pipe.7517 Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 21:18:54 (1716254334) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 21:19:10 (1716254350) targets are mounted 21:19:10 (1716254350) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/f8.replay-single /mnt/lustre/f8.replay-single has type file OK PASS 8 (27s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 9: |X| create (same inum/gen) ====== 21:19:19 (1716254359) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1880 1285808 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1692 1285996 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1560 3605460 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1544 3605476 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3104 7210936 1% /mnt/lustre Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 21:19:24 (1716254364) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 21:19:39 (1716254379) targets are mounted 21:19:39 (1716254379) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec old_inum == 144115305935798546, new_inum == 144115305935798546 old_inum and new_inum match PASS 9 (28s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 10: create |X| rename unlink ======= 21:19:49 (1716254389) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1880 1285808 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1692 1285996 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1560 3605460 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1544 3605476 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3104 7210936 1% /mnt/lustre Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 21:19:53 (1716254393) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 21:20:09 (1716254409) targets are mounted 21:20:09 (1716254409) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Can't lstat /mnt/lustre/f10.replay-single: No such file or directory PASS 10 (27s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 11: create open write rename |X| create-old-name read ========================================================== 21:20:18 (1716254418) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1880 1285808 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1692 1285996 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1560 3605460 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1544 3605476 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3104 7210936 1% /mnt/lustre new old Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 21:20:23 (1716254423) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 21:20:38 (1716254438) targets are mounted 21:20:38 (1716254438) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec new old PASS 11 (28s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 12: open, unlink |X| close ========= 21:20:48 (1716254448) multiop /mnt/lustre/f12.replay-single vo_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7517 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1880 1285808 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1692 1285996 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1564 3605456 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3112 7210928 1% /mnt/lustre Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 21:20:53 (1716254453) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 21:21:08 (1716254468) targets are mounted 21:21:08 (1716254468) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 12 (27s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 13: open chmod 0 |x| write close === 21:21:18 (1716254478) multiop /mnt/lustre/f13.replay-single vO_wc TMPPIPE=/tmp/multiop_open_wait_pipe.7517 /mnt/lustre/f13.replay-single has perms 00 OK UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1880 1285808 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1692 1285996 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1564 3605456 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3112 7210928 1% /mnt/lustre Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 21:21:22 (1716254482) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 21:21:38 (1716254498) targets are mounted 21:21:38 (1716254498) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/f13.replay-single has perms 00 OK /mnt/lustre/f13.replay-single has size 1 OK PASS 13 (27s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 14: open(O_CREAT), unlink |X| close ========================================================== 21:21:47 (1716254507) multiop /mnt/lustre/f14.replay-single vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7517 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1880 1285808 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1692 1285996 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1564 3605456 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3112 7210928 1% /mnt/lustre Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 21:21:52 (1716254512) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 21:22:07 (1716254527) targets are mounted 21:22:07 (1716254527) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 14 (28s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 15: open(O_CREAT), unlink |X| touch new, close ========================================================== 21:22:17 (1716254537) multiop /mnt/lustre/f15.replay-single vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7517 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1880 1285808 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1692 1285996 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1564 3605456 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3112 7210928 1% /mnt/lustre Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 21:22:21 (1716254541) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 21:22:37 (1716254557) targets are mounted 21:22:37 (1716254557) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 15 (27s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 16: |X| open(O_CREAT), unlink, touch new, unlink new ========================================================== 21:22:46 (1716254566) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1880 1285808 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1692 1285996 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1564 3605456 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3112 7210928 1% /mnt/lustre Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 21:22:51 (1716254571) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 21:23:06 (1716254586) targets are mounted 21:23:06 (1716254586) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 16 (28s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 17: |X| open(O_CREAT), |replay| close ========================================================== 21:23:16 (1716254596) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1880 1285808 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1692 1285996 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1564 3605456 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3112 7210928 1% /mnt/lustre multiop /mnt/lustre/f17.replay-single vO_c TMPPIPE=/tmp/multiop_open_wait_pipe.7517 Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 21:23:20 (1716254600) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 21:23:36 (1716254616) targets are mounted 21:23:36 (1716254616) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/f17.replay-single has type file OK PASS 17 (27s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 18: open(O_CREAT), unlink, touch new, close, touch, unlink ========================================================== 21:23:45 (1716254625) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1880 1285808 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1692 1285996 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1564 3605456 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3112 7210928 1% /mnt/lustre multiop /mnt/lustre/f18.replay-single vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7517 pid: 25074 will close Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 21:23:50 (1716254630) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 21:24:05 (1716254645) targets are mounted 21:24:05 (1716254645) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 18 (28s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 19: mcreate, open, write, rename === 21:24:15 (1716254655) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1880 1285808 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1692 1285996 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1564 3605456 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3112 7210928 1% /mnt/lustre old Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 21:24:20 (1716254660) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 21:24:35 (1716254675) targets are mounted 21:24:35 (1716254675) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec old PASS 19 (28s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 20a: |X| open(O_CREAT), unlink, replay, close (test mds_cleanup_orphans) ========================================================== 21:24:45 (1716254685) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1880 1285808 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1692 1285996 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1568 3605452 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3116 7210924 1% /mnt/lustre multiop /mnt/lustre/f20a.replay-single vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7517 Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 21:24:50 (1716254690) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 21:25:05 (1716254705) targets are mounted 21:25:05 (1716254705) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 20a (27s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 20b: write, unlink, eviction, replay (test mds_cleanup_orphans) ========================================================== 21:25:14 (1716254714) /mnt/lustre/f20b.replay-single lmm_stripe_count: 1 lmm_stripe_size: 4194304 lmm_pattern: raid0 lmm_layout_gen: 0 lmm_stripe_offset: 0 obdidx objid objid group 0 1090 0x442 0x280000401 10000+0 records in 10000+0 records out 40960000 bytes (41 MB) copied, 1.33543 s, 30.7 MB/s Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 21:25:19 (1716254719) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 21:25:33 (1716254733) targets are mounted 21:25:33 (1716254733) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec affected facets: mds1 oleg237-server: oleg237-server.virtnet: executing _wait_recovery_complete *.lustre-MDT0000.recovery_status 1475 oleg237-server: *.lustre-MDT0000.recovery_status status: COMPLETE Waiting for MDT destroys to complete before 3116, after 3116 PASS 20b (30s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 20c: check that client eviction does not affect file content ========================================================== 21:25:46 (1716254746) multiop /mnt/lustre/f20c.replay-single vOw_c TMPPIPE=/tmp/multiop_open_wait_pipe.7517 -rw-r--r-- 1 root root 1 May 20 21:25 /mnt/lustre/f20c.replay-single pdsh@oleg237-client: oleg237-client: ssh exited with exit code 5 PASS 20c (4s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 21: |X| open(O_CREAT), unlink touch new, replay, close (test mds_cleanup_orphans) ========================================================== 21:25:53 (1716254753) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1880 1285808 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1692 1285996 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1568 3605408 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1548 3605472 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3116 7210880 1% /mnt/lustre multiop /mnt/lustre/f21.replay-single vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7517 Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 21:25:57 (1716254757) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 21:26:12 (1716254772) targets are mounted 21:26:12 (1716254772) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 21 (27s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 22: open(O_CREAT), |X| unlink, replay, close (test mds_cleanup_orphans) ========================================================== 21:26:22 (1716254782) multiop /mnt/lustre/f22.replay-single vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7517 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1880 1285808 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1692 1285996 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1568 3605408 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1552 3605468 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3120 7210876 1% /mnt/lustre Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 21:26:27 (1716254787) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 21:26:42 (1716254802) targets are mounted 21:26:42 (1716254802) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 22 (27s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 23: open(O_CREAT), |X| unlink touch new, replay, close (test mds_cleanup_orphans) ========================================================== 21:26:52 (1716254812) multiop /mnt/lustre/f23.replay-single vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7517 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1880 1285808 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1692 1285996 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1568 3605408 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1552 3605468 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3120 7210876 1% /mnt/lustre Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 21:26:56 (1716254816) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 21:27:11 (1716254831) targets are mounted 21:27:11 (1716254831) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 23 (27s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 24: open(O_CREAT), replay, unlink, close (test mds_cleanup_orphans) ========================================================== 21:27:21 (1716254841) multiop /mnt/lustre/f24.replay-single vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7517 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1880 1285808 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1692 1285996 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1568 3605408 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1552 3605468 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3120 7210876 1% /mnt/lustre Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 21:27:26 (1716254846) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 21:27:41 (1716254861) targets are mounted 21:27:41 (1716254861) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 24 (28s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 25: open(O_CREAT), unlink, replay, close (test mds_cleanup_orphans) ========================================================== 21:27:51 (1716254871) multiop /mnt/lustre/f25.replay-single vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7517 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1880 1285808 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1692 1285996 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1568 3605408 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1552 3605468 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3120 7210876 1% /mnt/lustre Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 21:27:55 (1716254875) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 21:28:11 (1716254891) targets are mounted 21:28:11 (1716254891) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 25 (27s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 26: |X| open(O_CREAT), unlink two, close one, replay, close one (test mds_cleanup_orphans) ========================================================== 21:28:20 (1716254900) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1880 1285808 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1692 1285996 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1568 3605408 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1552 3605468 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3120 7210876 1% /mnt/lustre multiop /mnt/lustre/f26.replay-single-1 vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7517 multiop /mnt/lustre/f26.replay-single-2 vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7517 Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 21:28:25 (1716254905) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 21:28:40 (1716254920) targets are mounted 21:28:40 (1716254920) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 26 (28s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 27: |X| open(O_CREAT), unlink two, replay, close two (test mds_cleanup_orphans) ========================================================== 21:28:50 (1716254930) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1880 1285808 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1692 1285996 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1568 3605408 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1552 3605468 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3120 7210876 1% /mnt/lustre multiop /mnt/lustre/f27.replay-single-1 vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7517 multiop /mnt/lustre/f27.replay-single-2 vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7517 Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 21:28:55 (1716254935) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 21:29:10 (1716254950) targets are mounted 21:29:10 (1716254950) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 27 (27s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 28: open(O_CREAT), |X| unlink two, close one, replay, close one (test mds_cleanup_orphans) ========================================================== 21:29:20 (1716254960) multiop /mnt/lustre/f28.replay-single-1 vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7517 multiop /mnt/lustre/f28.replay-single-2 vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7517 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1880 1285808 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1692 1285996 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1568 3605408 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1552 3605468 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3120 7210876 1% /mnt/lustre Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 21:29:24 (1716254964) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 21:29:40 (1716254980) targets are mounted 21:29:40 (1716254980) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 28 (27s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 29: open(O_CREAT), |X| unlink two, replay, close two (test mds_cleanup_orphans) ========================================================== 21:29:49 (1716254989) multiop /mnt/lustre/f29.replay-single-1 vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7517 multiop /mnt/lustre/f29.replay-single-2 vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7517 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1880 1285808 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1692 1285996 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1568 3605408 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1552 3605468 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3120 7210876 1% /mnt/lustre Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 21:29:54 (1716254994) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 21:30:09 (1716255009) targets are mounted 21:30:09 (1716255009) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 29 (28s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 30: open(O_CREAT) two, unlink two, replay, close two (test mds_cleanup_orphans) ========================================================== 21:30:19 (1716255019) multiop /mnt/lustre/f30.replay-single-1 vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7517 multiop /mnt/lustre/f30.replay-single-2 vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7517 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1880 1285808 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1692 1285996 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1568 3605408 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1552 3605468 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3120 7210876 1% /mnt/lustre Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 21:30:23 (1716255023) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 21:30:39 (1716255039) targets are mounted 21:30:39 (1716255039) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 30 (27s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 31: open(O_CREAT) two, unlink one, |X| unlink one, close two (test mds_cleanup_orphans) ========================================================== 21:30:48 (1716255048) multiop /mnt/lustre/f31.replay-single-1 vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7517 multiop /mnt/lustre/f31.replay-single-2 vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7517 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1880 1285808 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1692 1285996 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1568 3605408 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1552 3605468 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3120 7210876 1% /mnt/lustre Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 21:30:53 (1716255053) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 21:31:08 (1716255068) targets are mounted 21:31:08 (1716255068) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 31 (28s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 32: close() notices client eviction; close() after client eviction ========================================================== 21:31:18 (1716255078) multiop /mnt/lustre/f32.replay-single vO_c TMPPIPE=/tmp/multiop_open_wait_pipe.7517 multiop /mnt/lustre/f32.replay-single vO_c TMPPIPE=/tmp/multiop_open_wait_pipe.7517 pdsh@oleg237-client: oleg237-client: ssh exited with exit code 5 PASS 32 (5s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 33a: fid seq shouldn't be reused after abort recovery ========================================================== 21:31:25 (1716255085) total: 10 open/close in 0.07 seconds: 142.22 ops/second Replay barrier on lustre-MDT0000 Stopping /mnt/lustre-mds1 (opts:) on oleg237-server Failover mds1 to oleg237-server Starting mds1: -o localrecov -o abort_recovery /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 total: 10 open/close in 0.07 seconds: 142.13 ops/second PASS 33a (20s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 33b: test fid seq allocation ======= 21:31:48 (1716255108) fail_loc=0x1311 total: 10 open/close in 0.07 seconds: 140.91 ops/second Replay barrier on lustre-MDT0000 Stopping /mnt/lustre-mds1 (opts:) on oleg237-server Failover mds1 to oleg237-server Starting mds1: -o localrecov -o abort_recovery /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 total: 10 open/close in 0.08 seconds: 133.14 ops/second PASS 33b (20s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 34: abort recovery before client does replay (test mds_cleanup_orphans) ========================================================== 21:32:10 (1716255130) multiop /mnt/lustre/f34.replay-single vO_c TMPPIPE=/tmp/multiop_open_wait_pipe.7517 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1912 1285776 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1724 1285964 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1568 3605408 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1552 3605468 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3120 7210876 1% /mnt/lustre Stopping /mnt/lustre-mds1 (opts:) on oleg237-server Failover mds1 to oleg237-server Starting mds1: -o localrecov -o abort_recovery /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 PASS 34 (21s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 35: test recovery from llog for unlink op ========================================================== 21:32:33 (1716255153) fail_loc=0x80000119 Stopping /mnt/lustre-mds1 (opts:) on oleg237-server Failover mds1 to oleg237-server Starting mds1: -o localrecov -o abort_recovery /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 rm: cannot remove '/mnt/lustre/f35.replay-single': Input/output error oleg237-client: error: invalid path '/mnt/lustre': Input/output error pdsh@oleg237-client: oleg237-client: ssh exited with exit code 5 first stat failed: 5 Can't lstat /mnt/lustre/f35.replay-single: No such file or directory PASS 35 (20s) debug_raw_pointers=0 debug_raw_pointers=0 SKIP: replay-single test_36 skipping ALWAYS excluded test 36 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 37: abort recovery before client does replay (test mds_cleanup_orphans for directories) ========================================================== 21:32:56 (1716255176) multiop /mnt/lustre/d37.replay-single/f37.replay-single vdD_c TMPPIPE=/tmp/multiop_open_wait_pipe.7517 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1984 1285704 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1788 1285900 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1568 3605408 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1552 3605468 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3120 7210876 1% /mnt/lustre Stopping /mnt/lustre-mds1 (opts:) on oleg237-server Failover mds1 to oleg237-server Starting mds1: -o localrecov -o abort_recovery /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 oleg237-client: error: invalid path '/mnt/lustre': Input/output error pdsh@oleg237-client: oleg237-client: ssh exited with exit code 5 first stat failed: 5 PASS 37 (22s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 38: test recovery from unlink llog (test llog_gen_rec) ========================================================== 21:33:20 (1716255200) total: 800 open/close in 4.89 seconds: 163.64 ops/second - unlinked 0 (time 1716255208 ; total 0 ; last 0) total: 400 unlinks in 1 seconds: 400.000000 unlinks/second UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2116 1285572 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1820 1285868 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1568 3605408 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1552 3605468 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3120 7210876 1% /mnt/lustre Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 21:33:34 (1716255214) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 21:33:49 (1716255229) targets are mounted 21:33:49 (1716255229) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec - unlinked 0 (time 1716255236 ; total 0 ; last 0) total: 400 unlinks in 2 seconds: 200.000000 unlinks/second Can't lstat /mnt/lustre/f38.replay-single-*: No such file or directory PASS 38 (42s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 39: test recovery from unlink llog (test llog_gen_rec) ========================================================== 21:34:05 (1716255245) total: 800 open/close in 5.01 seconds: 159.69 ops/second UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2120 1285568 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1820 1285868 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1568 3605408 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1552 3605468 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3120 7210876 1% /mnt/lustre - unlinked 0 (time 1716255255 ; total 0 ; last 0) total: 400 unlinks in 1 seconds: 400.000000 unlinks/second Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 21:34:18 (1716255258) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 21:34:33 (1716255273) targets are mounted 21:34:33 (1716255273) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec - unlinked 0 (time 1716255281 ; total 0 ; last 0) total: 400 unlinks in 3 seconds: 133.333328 unlinks/second Can't lstat /mnt/lustre/f39.replay-single-*: No such file or directory PASS 39 (43s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 41: read from a valid osc while other oscs are invalid ========================================================== 21:34:50 (1716255290) 1+0 records in 1+0 records out 4096 bytes (4.1 kB) copied, 0.00279469 s, 1.5 MB/s 1+0 records in 1+0 records out 4096 bytes (4.1 kB) copied, 0.00722035 s, 567 kB/s PASS 41 (3s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 42: recovery after ost failure ===== 21:34:55 (1716255295) total: 800 open/close in 4.15 seconds: 192.75 ops/second UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2148 1285540 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1820 1285868 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1572 3605448 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1552 3605468 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3124 7210916 1% /mnt/lustre - unlinked 0 (time 1716255305 ; total 0 ; last 0) total: 400 unlinks in 1 seconds: 400.000000 unlinks/second debug=-1 Failing ost1 on oleg237-server Stopping /mnt/lustre-ost1 (opts:) on oleg237-server 21:35:08 (1716255308) shut down Failover ost1 to oleg237-server mount facets: ost1 Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-OST0000 21:35:24 (1716255324) targets are mounted 21:35:24 (1716255324) facet_failover done wait for MDS to timeout and recover debug=super ioctl neterror warning dlmtrace error emerg ha rpctrace vfstrace config console lfsck debug=super ioctl neterror warning dlmtrace error emerg ha rpctrace vfstrace config console lfsck - unlinked 0 (time 1716255365 ; total 0 ; last 0) total: 400 unlinks in 2 seconds: 200.000000 unlinks/second Can't lstat /mnt/lustre/f42.replay-single-*: No such file or directory PASS 42 (74s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 43: mds osc import failure during recovery; don't LBUG ========================================================== 21:36:11 (1716255371) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2204 1285484 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1820 1285868 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1572 3605448 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1552 3605468 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3124 7210916 1% /mnt/lustre fail_loc=0x80000204 Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 21:36:16 (1716255376) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 21:36:31 (1716255391) targets are mounted 21:36:31 (1716255391) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 43 (38s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 44a: race in target handle connect ========================================================== 21:36:51 (1716255411) at_max=40 1 of 10 (1716255413) service : cur 5 worst 5 (at 1716253538, 1875s ago) 4 4 4 4 fail_loc=0x80000701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2100 1285588 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1820 1285868 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1572 3605448 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1552 3605468 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3124 7210916 1% /mnt/lustre 2 of 10 (1716255419) service : cur 6 worst 6 (at 1716255419, 0s ago) 6 4 4 4 fail_loc=0x80000701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2100 1285588 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1820 1285868 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1572 3605448 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1552 3605468 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3124 7210916 1% /mnt/lustre 3 of 10 (1716255424) service : cur 6 worst 6 (at 1716255419, 6s ago) 6 4 4 4 fail_loc=0x80000701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2100 1285588 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1820 1285868 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1572 3605448 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1552 3605468 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3124 7210916 1% /mnt/lustre 4 of 10 (1716255430) service : cur 6 worst 6 (at 1716255419, 11s ago) 6 4 4 4 fail_loc=0x80000701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2100 1285588 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1820 1285868 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1572 3605448 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1552 3605468 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3124 7210916 1% /mnt/lustre 5 of 10 (1716255436) service : cur 6 worst 6 (at 1716255419, 17s ago) 6 4 4 4 fail_loc=0x80000701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2100 1285588 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1820 1285868 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1572 3605448 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1552 3605468 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3124 7210916 1% /mnt/lustre 6 of 10 (1716255441) service : cur 6 worst 6 (at 1716255419, 23s ago) 6 4 4 4 fail_loc=0x80000701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2100 1285588 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1820 1285868 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1572 3605448 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1552 3605468 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3124 7210916 1% /mnt/lustre 7 of 10 (1716255447) service : cur 6 worst 6 (at 1716255419, 28s ago) 6 4 4 4 fail_loc=0x80000701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2100 1285588 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1820 1285868 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1572 3605448 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1552 3605468 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3124 7210916 1% /mnt/lustre 8 of 10 (1716255453) service : cur 6 worst 6 (at 1716255419, 34s ago) 6 4 4 4 fail_loc=0x80000701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2100 1285588 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1820 1285868 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1572 3605448 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1552 3605468 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3124 7210916 1% /mnt/lustre 9 of 10 (1716255458) service : cur 6 worst 6 (at 1716255419, 39s ago) 6 4 4 4 fail_loc=0x80000701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2100 1285588 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1820 1285868 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1572 3605448 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1552 3605468 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3124 7210916 1% /mnt/lustre 10 of 10 (1716255464) service : cur 6 worst 6 (at 1716255419, 45s ago) 6 4 4 4 fail_loc=0x80000701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2100 1285588 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1820 1285868 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1572 3605448 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1552 3605468 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3124 7210916 1% /mnt/lustre fail_loc=0 at_max=600 PASS 44a (61s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 44b: race in target handle connect ========================================================== 21:37:54 (1716255474) 1 of 10 (1716255475) service : cur 6 worst 6 (at 1716255419, 56s ago) 6 4 4 4 fail_loc=0x80000704 error: recover: Connection timed out Filesystem 1K-blocks Used Available Use% Mounted on 192.168.202.137@tcp:/lustre 7666232 3124 7210916 1% /mnt/lustre 2 of 10 (1716255515) service : cur 40 worst 40 (at 1716255515, 1s ago) 40 6 4 4 fail_loc=0x80000704 error: recover: Connection timed out Filesystem 1K-blocks Used Available Use% Mounted on 192.168.202.137@tcp:/lustre 7666232 3124 7210916 1% /mnt/lustre 3 of 10 (1716255536) service : cur 40 worst 40 (at 1716255515, 21s ago) 40 6 4 4 fail_loc=0x80000704 error: recover: Connection timed out Filesystem 1K-blocks Used Available Use% Mounted on 192.168.202.137@tcp:/lustre 7666232 3124 7210916 1% /mnt/lustre 4 of 10 (1716255557) service : cur 40 worst 40 (at 1716255515, 42s ago) 40 6 4 4 fail_loc=0x80000704 error: recover: Connection timed out Filesystem 1K-blocks Used Available Use% Mounted on 192.168.202.137@tcp:/lustre 7666232 3124 7210916 1% /mnt/lustre 5 of 10 (1716255577) service : cur 40 worst 40 (at 1716255515, 63s ago) 40 6 4 4 fail_loc=0x80000704 error: recover: Connection timed out Filesystem 1K-blocks Used Available Use% Mounted on 192.168.202.137@tcp:/lustre 7666232 3124 7210916 1% /mnt/lustre 6 of 10 (1716255598) service : cur 40 worst 40 (at 1716255515, 83s ago) 40 6 4 4 fail_loc=0x80000704 error: recover: Connection timed out Filesystem 1K-blocks Used Available Use% Mounted on 192.168.202.137@tcp:/lustre 7666232 3124 7210916 1% /mnt/lustre 7 of 10 (1716255618) service : cur 40 worst 40 (at 1716255515, 104s ago) 40 6 4 4 fail_loc=0x80000704 error: recover: Connection timed out Filesystem 1K-blocks Used Available Use% Mounted on 192.168.202.137@tcp:/lustre 7666232 3124 7210916 1% /mnt/lustre 8 of 10 (1716255639) service : cur 40 worst 40 (at 1716255515, 124s ago) 40 40 6 4 fail_loc=0x80000704 error: recover: Connection timed out Filesystem 1K-blocks Used Available Use% Mounted on 192.168.202.137@tcp:/lustre 7666232 3124 7210916 1% /mnt/lustre 9 of 10 (1716255660) service : cur 40 worst 40 (at 1716255515, 145s ago) 40 40 6 4 fail_loc=0x80000704 error: recover: Connection timed out Filesystem 1K-blocks Used Available Use% Mounted on 192.168.202.137@tcp:/lustre 7666232 3124 7210916 1% /mnt/lustre 10 of 10 (1716255680) service : cur 40 worst 40 (at 1716255515, 166s ago) 40 40 6 4 fail_loc=0x80000704 error: recover: Connection timed out Filesystem 1K-blocks Used Available Use% Mounted on 192.168.202.137@tcp:/lustre 7666232 3124 7210916 1% /mnt/lustre PASS 44b (229s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 44c: race in target handle connect ========================================================== 21:41:45 (1716255705) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2100 1285588 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1820 1285868 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1572 3605448 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1552 3605468 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3124 7210916 1% /mnt/lustre total: 100 create in 0.47 seconds: 214.05 ops/second fail_loc=0x80000712 Stopping /mnt/lustre-mds1 (opts:) on oleg237-server Failover mds1 to oleg237-server Starting mds1: -o localrecov -o abort_recovery /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 unlink(/mnt/lustre/f44c.replay-single-0) error: No such file or directory total: 0 unlinks in 0 seconds: -nan unlinks/second Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 21:42:06 (1716255726) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 21:42:20 (1716255740) targets are mounted 21:42:20 (1716255740) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec unlink(/mnt/lustre/f44c.replay-single-0) error: No such file or directory total: 0 unlinks in 0 seconds: -nan unlinks/second PASS 44c (43s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 45: Handle failed close ============ 21:42:30 (1716255750) multiop /mnt/lustre/f45.replay-single vO_c TMPPIPE=/tmp/multiop_open_wait_pipe.7517 /mnt/lustre/f45.replay-single has type file OK PASS 45 (3s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 46: Don't leak file handle after open resend (3325) ========================================================== 21:42:35 (1716255755) fail_loc=0x122 fail_loc=0 Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 21:42:54 (1716255774) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 21:43:08 (1716255788) targets are mounted 21:43:08 (1716255788) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec lfs path2fid: cannot get fid for 'f46.replay-single': No such file or directory PASS 46 (41s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 47: MDS->OSC failure during precreate cleanup (2824) ========================================================== 21:43:18 (1716255798) total: 20 open/close in 0.15 seconds: 130.90 ops/second Failing ost1 on oleg237-server Stopping /mnt/lustre-ost1 (opts:) on oleg237-server 21:43:20 (1716255800) shut down Failover ost1 to oleg237-server mount facets: ost1 Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-OST0000 21:43:35 (1716255815) targets are mounted 21:43:35 (1716255815) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec fail_loc=0x80000204 total: 20 open/close in 0.13 seconds: 157.29 ops/second - unlinked 0 (time 1716255880 ; total 0 ; last 0) total: 20 unlinks in 0 seconds: inf unlinks/second PASS 47 (84s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 48: MDS->OSC failure during precreate cleanup (2824) ========================================================== 21:44:44 (1716255884) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2132 1285556 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1852 1285836 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1572 3605448 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1552 3605468 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3124 7210916 1% /mnt/lustre total: 20 open/close in 0.15 seconds: 137.65 ops/second Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 21:44:49 (1716255889) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 21:45:04 (1716255904) targets are mounted 21:45:04 (1716255904) facet_failover done fail_loc=0x80000216 total: 20 open/close in 0.14 seconds: 138.83 ops/second - unlinked 0 (time 1716255968 ; total 0 ; last 0) total: 40 unlinks in 0 seconds: inf unlinks/second PASS 48 (86s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 50: Double OSC recovery, don't LASSERT (3812) ========================================================== 21:46:12 (1716255972) PASS 50 (8s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 52: time out lock replay (3764) ==== 21:46:22 (1716255982) multiop /mnt/lustre/f52.replay-single vs_s TMPPIPE=/tmp/multiop_open_wait_pipe.7517 fail_loc=0x80000157 Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 21:46:24 (1716255984) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 21:46:38 (1716255998) targets are mounted 21:46:38 (1716255998) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec fail_loc=0x0 fail_loc=0x0 PASS 52 (81s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 53a: |X| close request while two MDC requests in flight ========================================================== 21:47:45 (1716256065) fail_loc=0x80000115 fail_loc=0 Replay barrier on lustre-MDT0000 Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 21:47:52 (1716256072) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 21:48:07 (1716256087) targets are mounted 21:48:07 (1716256087) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/d53a.replay-single-1/f has type file OK /mnt/lustre/d53a.replay-single-2/f has type file OK PASS 53a (29s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 53b: |X| open request while two MDC requests in flight ========================================================== 21:48:17 (1716256097) multiop /mnt/lustre/d53b.replay-single-1/f vO_c TMPPIPE=/tmp/multiop_open_wait_pipe.7517 fail_loc=0x80000107 fail_loc=0 Replay barrier on lustre-MDT0000 Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 21:48:23 (1716256103) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 21:48:38 (1716256118) targets are mounted 21:48:38 (1716256118) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/d53b.replay-single-1/f has type file OK /mnt/lustre/d53b.replay-single-2/f has type file OK PASS 53b (29s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 53c: |X| open request and close request while two MDC requests in flight ========================================================== 21:48:48 (1716256128) fail_loc=0x80000107 fail_loc=0x80000115 Replay barrier on lustre-MDT0000 Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 21:48:54 (1716256134) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 21:49:09 (1716256149) targets are mounted 21:49:09 (1716256149) facet_failover done fail_loc=0 /mnt/lustre/d53c.replay-single-1/f has type file OK /mnt/lustre/d53c.replay-single-2/f has type file OK PASS 53c (29s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 53d: close reply while two MDC requests in flight ========================================================== 21:49:19 (1716256159) fail_loc=0x8000013b fail_loc=0 Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 21:49:22 (1716256162) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 21:49:36 (1716256176) targets are mounted 21:49:36 (1716256176) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/d53d.replay-single-1/f has type file OK /mnt/lustre/d53d.replay-single-2/f has type file OK PASS 53d (25s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 53e: |X| open reply while two MDC requests in flight ========================================================== 21:49:46 (1716256186) fail_loc=0x119 fail_loc=0 Replay barrier on lustre-MDT0000 Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 21:49:52 (1716256192) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 21:50:08 (1716256208) targets are mounted 21:50:08 (1716256208) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/d53e.replay-single-1/f has type file OK /mnt/lustre/d53e.replay-single-2/f has type file OK PASS 53e (29s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 53f: |X| open reply and close reply while two MDC requests in flight ========================================================== 21:50:17 (1716256217) fail_loc=0x119 fail_loc=0x8000013b Replay barrier on lustre-MDT0000 Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 21:50:24 (1716256224) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 21:50:39 (1716256239) targets are mounted 21:50:39 (1716256239) facet_failover done fail_loc=0 /mnt/lustre/d53f.replay-single-1/f has type file OK /mnt/lustre/d53f.replay-single-2/f has type file OK PASS 53f (29s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 53g: |X| drop open reply and close request while close and open are both in flight ========================================================== 21:50:48 (1716256248) fail_loc=0x119 fail_loc=0x80000115 fail_loc=0 Replay barrier on lustre-MDT0000 Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 21:50:55 (1716256255) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 21:51:10 (1716256270) targets are mounted 21:51:10 (1716256270) facet_failover done /mnt/lustre/d53g.replay-single-1/f has type file OK /mnt/lustre/d53g.replay-single-2/f has type file OK PASS 53g (29s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 53h: open request and close reply while two MDC requests in flight ========================================================== 21:51:19 (1716256279) fail_loc=0x80000107 fail_loc=0x8000013b Replay barrier on lustre-MDT0000 Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 21:51:27 (1716256287) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 21:51:42 (1716256302) targets are mounted 21:51:42 (1716256302) facet_failover done fail_loc=0 /mnt/lustre/d53h.replay-single-1/f has type file OK /mnt/lustre/d53h.replay-single-2/f has type file OK PASS 53h (30s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 55: let MDS_CHECK_RESENT return the original return code instead of 0 ========================================================== 21:51:51 (1716256311) fail_loc=0x8000012b fail_loc=0x0 touch: rm: cannot touch '/mnt/lustre/f55.replay-single'cannot remove '/mnt/lustre/f55.replay-single': No such file or directory: No such file or directory PASS 55 (63s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 56: don't replay a symlink open request (3440) ========================================================== 21:52:56 (1716256376) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2140 1285548 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1852 1285836 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1572 3605448 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1552 3605468 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3124 7210916 1% /mnt/lustre Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 21:53:00 (1716256380) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 21:53:16 (1716256396) targets are mounted 21:53:16 (1716256396) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 56 (37s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 57: test recovery from llog for setattr op ========================================================== 21:53:35 (1716256415) fail_loc=0x8000012c UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2140 1285548 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1852 1285836 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1572 3605448 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1552 3605468 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3124 7210916 1% /mnt/lustre Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 21:53:40 (1716256420) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 21:53:56 (1716256436) targets are mounted 21:53:56 (1716256436) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec affected facets: mds1 oleg237-server: oleg237-server.virtnet: executing _wait_recovery_complete *.lustre-MDT0000.recovery_status 1475 oleg237-server: *.lustre-MDT0000.recovery_status status: COMPLETE Waiting for orphan cleanup... osp.lustre-OST0000-osc-MDT0000.old_sync_processed osp.lustre-OST0000-osc-MDT0001.old_sync_processed osp.lustre-OST0001-osc-MDT0000.old_sync_processed osp.lustre-OST0001-osc-MDT0001.old_sync_processed wait 40 secs maximumly for oleg237-server mds-ost sync done. /mnt/lustre/f57.replay-single has type file OK fail_loc=0x0 PASS 57 (32s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 58a: test recovery from llog for setattr op (test llog_gen_rec) ========================================================== 21:54:09 (1716256449) fail_loc=0x8000012c - open/close 2288 (time 1716256461.45 total 10.00 last 228.76) total: 2500 open/close in 10.92 seconds: 228.91 ops/second UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2448 1285240 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1852 1285836 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1572 3605448 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1552 3605468 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3124 7210916 1% /mnt/lustre Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 21:54:26 (1716256466) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 21:54:42 (1716256482) targets are mounted 21:54:42 (1716256482) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec fail_loc=0x0 - unlinked 0 (time 1716256503 ; total 0 ; last 0) total: 2500 unlinks in 6 seconds: 416.666656 unlinks/second PASS 58a (62s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 58b: test replay of setxattr op ==== 21:55:13 (1716256513) Starting client: oleg237-client.virtnet: -o user_xattr,flock oleg237-server@tcp:/lustre /mnt/lustre2 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2416 1285272 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1852 1285836 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1572 3605448 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1552 3605468 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3124 7210916 1% /mnt/lustre Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 21:55:19 (1716256519) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 21:55:34 (1716256534) targets are mounted 21:55:34 (1716256534) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Stopping client oleg237-client.virtnet /mnt/lustre2 (opts:) oleg237-client.virtnet: executing wait_import_state_mount FULL mgc.*.mgs_server_uuid mgc.*.mgs_server_uuid in FULL state after 0 sec PASS 58b (30s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 58c: resend/reconstruct setxattr op ========================================================== 21:55:46 (1716256546) Starting client: oleg237-client.virtnet: -o user_xattr,flock oleg237-server@tcp:/lustre /mnt/lustre2 fail_val=0 fail_loc=0x123 fail_loc=0 fail_loc=0x119 fail_loc=0 Stopping client oleg237-client.virtnet /mnt/lustre2 (opts:) PASS 58c (129s) debug_raw_pointers=0 debug_raw_pointers=0 SKIP: replay-single test_59 skipping ALWAYS excluded test 59 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 60: test llog post recovery init vs llog unlink ========================================================== 21:57:58 (1716256678) total: 200 open/close in 0.88 seconds: 227.85 ops/second UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2276 1285412 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1852 1285836 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1572 3605448 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1552 3605468 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3124 7210916 1% /mnt/lustre - unlinked 0 (time 1716256683 ; total 0 ; last 0) total: 100 unlinks in 0 seconds: inf unlinks/second Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 21:58:05 (1716256685) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 21:58:20 (1716256700) targets are mounted 21:58:20 (1716256700) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec - unlinked 0 (time 1716256706 ; total 0 ; last 0) total: 100 unlinks in 1 seconds: 100.000000 unlinks/second PASS 60 (31s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 61a: test race llog recovery vs llog cleanup ========================================================== 21:58:31 (1716256711) total: 800 open/close in 3.51 seconds: 227.60 ops/second UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2360 1285328 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2024 1285664 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1572 3605448 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1552 3605468 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3124 7210916 1% /mnt/lustre - unlinked 0 (time 1716256720 ; total 0 ; last 0) total: 800 unlinks in 2 seconds: 400.000000 unlinks/second fail_val=0 fail_loc=0x80000221 Failing ost1 on oleg237-server Stopping /mnt/lustre-ost1 (opts:) on oleg237-server 21:58:44 (1716256724) shut down Failover ost1 to oleg237-server mount facets: ost1 Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-OST0000 21:59:00 (1716256740) targets are mounted 21:59:00 (1716256740) facet_failover done Failing ost1 on oleg237-server Stopping /mnt/lustre-ost1 (opts:) on oleg237-server 21:59:11 (1716256751) shut down Failover ost1 to oleg237-server mount facets: ost1 Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-OST0000 21:59:26 (1716256766) targets are mounted 21:59:26 (1716256766) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec fail_val=0 fail_loc=0x0 Can't lstat /mnt/lustre/d61a.replay-single/f61a.replay-single-*: No such file or directory PASS 61a (91s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 61b: test race mds llog sync vs llog cleanup ========================================================== 22:00:04 (1716256804) fail_loc=0x8000013a Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 22:00:06 (1716256806) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 22:00:20 (1716256820) targets are mounted 22:00:20 (1716256820) facet_failover done Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 22:00:32 (1716256832) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 22:00:46 (1716256846) targets are mounted 22:00:46 (1716256846) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec 1+0 records in 1+0 records out 4096 bytes (4.1 kB) copied, 0.00517322 s, 792 kB/s PASS 61b (49s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 61c: test race mds llog sync vs llog cleanup ========================================================== 22:00:55 (1716256855) fail_val=0 fail_loc=0x80000222 Failing ost1 on oleg237-server Stopping /mnt/lustre-ost1 (opts:) on oleg237-server 22:01:08 (1716256868) shut down Failover ost1 to oleg237-server mount facets: ost1 Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-OST0000 22:01:22 (1716256882) targets are mounted 22:01:22 (1716256882) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec fail_val=0 fail_loc=0x0 PASS 61c (33s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 61d: error in llog_setup should cleanup the llog context correctly ========================================================== 22:01:31 (1716256891) Stopping /mnt/lustre-mds1 (opts:) on oleg237-server fail_loc=0x80000605 Starting mgs: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: mount.lustre: mount /dev/mapper/mds1_flakey at /mnt/lustre-mds1 failed: Operation not supported pdsh@oleg237-client: oleg237-server: ssh exited with exit code 95 Start of /dev/mapper/mds1_flakey on mgs failed 95 fail_loc=0 Starting mgs: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 PASS 61d (11s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 62: don't mis-drop resent replay === 22:01:44 (1716256904) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2352 1285336 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2008 1285680 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1576 3605444 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1552 3605468 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3128 7210912 1% /mnt/lustre total: 25 open/close in 0.13 seconds: 189.59 ops/second fail_loc=0x80000707 Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 22:01:49 (1716256909) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 22:02:04 (1716256924) targets are mounted 22:02:04 (1716256924) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec fail_loc=0 - unlinked 0 (time 1716256986 ; total 0 ; last 0) total: 25 unlinks in 0 seconds: inf unlinks/second PASS 62 (84s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 65a: AT: verify early replies ====== 22:03:10 (1716256990) at_history=8 at_history=8 debug=other fail_val=11000 fail_loc=0x8000050a 00000100:00001000:0.0:1716257020.906512:0:14881:0:(client.c:537:ptlrpc_at_recv_early_reply()) @@@ Early reply #1, new deadline in 35s (25s) req@ffff8800a8dcc700 x1799622196190272/t0(0) o101->lustre-MDT0000-mdc-ffff8800aa9e2800@192.168.202.137@tcp:12/10 lens 664/66320 e 1 to 0 dl 1716257055 ref 2 fl Rpc:PQr/200/ffffffff rc 0/-1 job:'createmany.0' uid:0 gid:0 portal 12 : cur 36 worst 40 (at 1716255515, 1511s ago) 40 40 40 40 portal 29 : cur 5 worst 5 (at 1716253879, 3147s ago) 5 0 0 0 portal 23 : cur 5 worst 5 (at 1716253879, 3147s ago) 5 5 5 5 portal 30 : cur 5 worst 5 (at 1716253884, 3142s ago) 5 0 0 0 portal 17 : cur 5 worst 5 (at 1716253964, 3062s ago) 5 5 0 5 portal 13 : cur 5 worst 5 (at 1716254392, 2634s ago) 5 5 0 0 portal 12 : cur 5 worst 40 (at 1716255515, 1520s ago) 5 40 40 40 portal 29 : cur 5 worst 5 (at 1716253879, 3156s ago) 5 0 0 0 portal 23 : cur 5 worst 5 (at 1716253879, 3156s ago) 5 5 5 5 portal 30 : cur 5 worst 5 (at 1716253884, 3151s ago) 5 0 0 0 portal 17 : cur 5 worst 5 (at 1716253964, 3071s ago) 5 5 0 5 portal 13 : cur 5 worst 5 (at 1716254392, 2643s ago) 5 5 0 0 PASS 65a (47s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 65b: AT: verify early replies on packed reply / bulk ========================================================== 22:03:59 (1716257039) at_history=8 at_history=8 debug=other trace fail_val=11 fail_loc=0x224 fail_loc=0 00000100:00001000:1.0:1716257069.322619:0:2009:0:(client.c:537:ptlrpc_at_recv_early_reply()) @@@ Early reply #1, new deadline in 35s (25s) req@ffff8801316e0000 x1799622196200128/t0(0) o4->lustre-OST0000-osc-ffff8800aa9e2800@192.168.202.137@tcp:6/4 lens 4584/448 e 1 to 0 dl 1716257104 ref 2 fl Rpc:Qr/200/ffffffff rc 0/-1 job:'multiop.0' uid:0 gid:0 debug=super ioctl neterror warning dlmtrace error emerg ha rpctrace vfstrace config console lfsck debug=super ioctl neterror warning dlmtrace error emerg ha rpctrace vfstrace config console lfsck portal 28 : cur 5 worst 5 (at 1716253879, 3196s ago) 5 5 5 5 portal 7 : cur 5 worst 5 (at 1716253880, 3195s ago) 5 5 5 5 portal 17 : cur 5 worst 5 (at 1716253935, 3140s ago) 5 0 5 5 portal 6 : cur 36 worst 36 (at 1716257074, 1s ago) 36 5 0 0 PASS 65b (37s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 66a: AT: verify MDT service time adjusts with no early replies ========================================================== 22:04:39 (1716257079) at_history=8 at_history=8 portal 12 : cur 5 worst 40 (at 1716255515, 1586s ago) 5 40 40 40 fail_val=5000 fail_loc=0x8000050a portal 12 : cur 5 worst 40 (at 1716255515, 1592s ago) 5 40 40 40 fail_val=10000 fail_loc=0x8000050a portal 12 : cur 36 worst 40 (at 1716255515, 1603s ago) 36 40 40 40 fail_loc=0 portal 12 : cur 5 worst 40 (at 1716255515, 1613s ago) 36 40 40 40 Current MDT timeout 5, worst 40 PASS 66a (51s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 66b: AT: verify net latency adjusts ========================================================== 22:05:32 (1716257132) at_history=8 at_history=8 fail_val=10 fail_loc=0x50c fail_loc=0 network timeout orig 5, cur 10, worst 10 PASS 66b (95s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 67a: AT: verify slow request processing doesn't induce reconnects ========================================================== 22:07:09 (1716257229) at_history=8 at_history=8 fail_val=400 fail_loc=0x50a fail_loc=0 0 osc reconnect attempts on gradual slow PASS 67a (74s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 67b: AT: verify instant slowdown doesn't induce reconnects ========================================================== 22:08:25 (1716257305) at_history=8 at_history=8 Creating to objid 5057 on ost lustre-OST0000... fail_val=20000 fail_loc=0x80000223 total: 18 open/close in 0.09 seconds: 205.07 ops/second Connected clients: oleg237-client.virtnet oleg237-client.virtnet service : cur 5 worst 5 (at 1716253551, 3779s ago) 1 1 1 1 phase 2 0 osc reconnect attempts on instant slow fail_loc=0x80000223 fail_loc=0 Connected clients: oleg237-client.virtnet oleg237-client.virtnet service : cur 5 worst 5 (at 1716253551, 3780s ago) 1 1 1 1 0 osc reconnect attempts on 2nd slow PASS 67b (28s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 68: AT: verify slowing locks ======= 22:08:55 (1716257335) at_history=8 at_history=8 /home/green/git/lustre-release/lustre/tests/replay-single.sh: line 1973: $ldlm_enqueue_min: ambiguous redirect fail_val=19 fail_loc=0x80000312 fail_val=25 fail_loc=0x80000312 fail_loc=0 /home/green/git/lustre-release/lustre/tests/replay-single.sh: line 1988: $ldlm_enqueue_min: ambiguous redirect PASS 68 (71s) debug_raw_pointers=0 debug_raw_pointers=0 Cleaning up AT ... debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 70a: check multi client t-f ======== 22:10:08 (1716257408) SKIP: replay-single test_70a Need two or more clients, have 1 SKIP 70a (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 70b: dbench 2mdts recovery; 1 clients ========================================================== 22:10:11 (1716257411) Starting client oleg237-client.virtnet: -o user_xattr,flock oleg237-server@tcp:/lustre /mnt/lustre Started clients oleg237-client.virtnet: 192.168.202.137@tcp:/lustre on /mnt/lustre type lustre (rw,checksum,flock,user_xattr,lruresize,lazystatfs,nouser_fid2path,verbose,noencrypt,statfs_project) striped dir -i0 -c2 -H fnv_1a_64 /mnt/lustre/d70b.replay-single + MISSING_DBENCH_OK= + PATH=/opt/iozone/bin:/opt/iozone/bin:/home/green/git/lustre-release/lustre/tests/mpi:/home/green/git/lustre-release/lustre/tests/racer:/home/green/git/lustre-release/lustre/../lustre-iokit/sgpdd-survey:/home/green/git/lustre-release/lustre/tests:/home/green/git/lustre-release/lustre/utils/gss:/home/green/git/lustre-release/lustre/utils:/opt/iozone/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin::/home/green/git/lustre-release/lustre/utils:/home/green/git/lustre-release/lustre/tests:/sbin:/usr/sbin:/home/green/git/lustre-release/lustre/utils:/home/green/git/lustre-release/lustre/tests/: + DBENCH_LIB= + TESTSUITE=replay-single + TESTNAME=test_70b + MOUNT=/mnt/lustre ++ hostname + DIR=/mnt/lustre/d70b.replay-single/oleg237-client.virtnet + LCTL=/home/green/git/lustre-release/lustre/utils/lctl + rundbench 1 -t 300 dbench: no process found Started rundbench load pid=22014 ... UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2524 1285164 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2116 1285572 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1580 3593908 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 26132 3580888 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 27712 7174796 1% /mnt/lustre test_70b fail mds1 1 times Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 22:10:20 (1716257420) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 22:10:35 (1716257435) targets are mounted 22:10:35 (1716257435) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2484 1285204 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2076 1285612 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1776 3593500 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 26488 3566976 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 28264 7160476 1% /mnt/lustre oleg237-client.virtnet: looking for dbench program oleg237-client.virtnet: /usr/bin/dbench oleg237-client.virtnet: creating output directory /mnt/lustre/d70b.replay-single/oleg237-client.virtnet oleg237-client.virtnet: mkdir: created directory '/mnt/lustre/d70b.replay-single/oleg237-client.virtnet' oleg237-client.virtnet: found dbench client file /usr/share/dbench/client.txt oleg237-client.virtnet: '/usr/share/dbench/client.txt' -> 'client.txt' oleg237-client.virtnet: running 'dbench 1 -t 300' on /mnt/lustre/d70b.replay-single/oleg237-client.virtnet at Mon May 20 22:10:12 EDT 2024 oleg237-client.virtnet: waiting for dbench pid 22038 oleg237-client.virtnet: dbench version 4.00 - Copyright Andrew Tridgell 1999-2004 oleg237-client.virtnet: oleg237-client.virtnet: Running for 300 seconds with load 'client.txt' and minimum warmup 60 secs oleg237-client.virtnet: failed to create barrier semaphore oleg237-client.virtnet: 0 of 1 processes prepared for launch 0 sec oleg237-client.virtnet: 1 of 1 processes prepared for launch 0 sec oleg237-client.virtnet: releasing clients oleg237-client.virtnet: 1 298 10.96 MB/sec warmup 1 sec latency 24.035 ms oleg237-client.virtnet: 1 656 10.53 MB/sec warmup 2 sec latency 18.328 ms oleg237-client.virtnet: 1 959 7.27 MB/sec warmup 3 sec latency 15.755 ms oleg237-client.virtnet: 1 1393 6.50 MB/sec warmup 4 sec latency 15.389 ms oleg237-client.virtnet: 1 1729 5.25 MB/sec warmup 5 sec latency 28.284 ms oleg237-client.virtnet: 1 2134 4.44 MB/sec warmup 6 sec latency 10.657 ms oleg237-client.virtnet: 1 2473 4.09 MB/sec warmup 7 sec latency 54.359 ms oleg237-client.virtnet: 1 2473 3.58 MB/sec warmup 8 sec latency 1054.552 ms oleg237-client.virtnet: 1 2473 3.18 MB/sec warmup 9 sec latency 2054.717 ms oleg237-client.virtnet: 1 2473 2.86 MB/sec warmup 10 sec latency 3054.878 ms oleg237-client.virtnet: 1 2473 2.60 MB/sec warmup 11 sec latency 4055.070 ms oleg237-client.virtnet: 1 2473 2.38 MB/sec warmup 12 sec latency 5055.212 ms oleg237-client.virtnet: 1 2473 2.20 MB/sec warmup 13 sec latency 6055.363 ms oleg237-client.virtnet: 1 2473 2.04 MB/sec warmup 14 sec latency 7055.543 ms oleg237-client.virtnet: 1 2473 1.91 MB/sec warmup 15 sec latency 8055.710 ms oleg237-client.virtnet: 1 2473 1.79 MB/sec warmup 16 sec latency 9055.867 ms oleg237-client.virtnet: 1 2473 1.68 MB/sec warmup 17 sec latency 10056.050 ms oleg237-client.virtnet: 1 2473 1.59 MB/sec warmup 18 sec latency 11056.246 ms oleg237-client.virtnet: 1 2473 1.51 MB/sec warmup 19 sec latency 12056.444 ms oleg237-client.virtnet: 1 2473 1.43 MB/sec warmup 20 sec latency 13056.680 ms oleg237-client.virtnet: 1 2473 1.36 MB/sec warmup 21 sec latency 14056.874 ms oleg237-client.virtnet: 1 2473 1.30 MB/sec warmup 22 sec latency 15057.081 ms oleg237-client.virtnet: 1 2473 1.24 MB/sec warmup 23 sec latency 16057.285 ms oleg237-client.virtnet: 1 2473 1.19 MB/sec warmup 24 sec latency 17057.473 ms oleg237-client.virtnet: 1 2473 1.14 MB/sec warmup 25 sec latency 18057.679 ms oleg237-client.virtnet: 1 2473 1.10 MB/sec warmup 26 sec latency 19057.853 ms oleg237-client.virtnet: 1 2761 1.18 MB/sec warmup 27 sec latency 19611.409 ms oleg237-client.virtnet: 1 3378 1.27 MB/sec warmup 28 sec latency 16.491 ms oleg237-client.virtnet: 1 3813 1.37 MB/sec warmup 29 sec latency 15.485 ms oleg237-client.virtnet: 1 4051 1.34 MB/sec warmup 30 sec latency 16.040 ms oleg237-client.virtnet: 1 4369 1.31 MB/sec warmup 31 sec latency 19.257 ms oleg237-client.virtnet: 1 4739 1.32 MB/sec warmup 32 sec latency 15.590 ms oleg237-client.virtnet: 1 5137 1.37 MB/sec warmup 33 sec latencytest_70b fail mds2 2 times Failing mds2 on oleg237-server Stopping /mnt/lustre-mds2 (opts:) on oleg237-server 22:10:49 (1716257449) shut down Failover mds2 to oleg237-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0001 22:11:04 (1716257464) targets are mounted 22:11:04 (1716257464) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2468 1285220 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2008 1285680 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 11916 3593952 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 37616 3568644 2% /mnt/lustre[OST:1] filesystem_summary: 7666232 49532 7162596 1% /mnt/lustre test_70b fail mds1 3 times Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 22:11:23 (1716257483) shut down 16.512 ms oleg237-client.virtnet: 1 5426 1.34 MB/sec warmup 34 sec latency 17.390 ms oleg237-client.virtnet: 1 5834 1.32 MB/sec warmup 35 sec latency 11.103 ms oleg237-client.virtnet: 1 6305 1.42 MB/sec warmup 36 sec latency 15.937 ms oleg237-client.virtnet: 1 6920 1.47 MB/sec warmup 37 sec latency 16.367 ms oleg237-client.virtnet: 1 7345 1.55 MB/sec warmup 38 sec latency 16.199 ms oleg237-client.virtnet: 1 7567 1.52 MB/sec warmup 39 sec latency 16.127 ms oleg237-client.virtnet: 1 7893 1.49 MB/sec warmup 40 sec latency 19.881 ms oleg237-client.virtnet: 1 8221 1.49 MB/sec warmup 41 sec latency 15.892 ms oleg237-client.virtnet: 1 8606 1.53 MB/sec warmup 42 sec latency 15.901 ms oleg237-client.virtnet: 1 8903 1.50 MB/sec warmup 43 sec latency 18.396 ms oleg237-client.virtnet: 1 9281 1.48 MB/sec warmup 44 sec latency 10.741 ms oleg237-client.virtnet: 1 9669 1.49 MB/sec warmup 45 sec latency 15.526 ms oleg237-client.virtnet: 1 10311 1.58 MB/sec warmup 46 sec latency 15.846 ms oleg237-client.virtnet: 1 10810 1.64 MB/sec warmup 47 sec latency 12.219 ms oleg237-client.virtnet: 1 11074 1.63 MB/sec warmup 48 sec latency 16.024 ms oleg237-client.virtnet: 1 11393 1.61 MB/sec warmup 49 sec latency 18.743 ms oleg237-client.virtnet: 1 11711 1.60 MB/sec warmup 50 sec latency 15.960 ms oleg237-client.virtnet: 1 12113 1.63 MB/sec warmup 51 sec latency 16.699 ms oleg237-client.virtnet: 1 12233 1.61 MB/sec warmup 52 sec latency 607.714 ms oleg237-client.virtnet: 1 12233 1.58 MB/sec warmup 53 sec latency 1607.891 ms oleg237-client.virtnet: 1 12233 1.55 MB/sec warmup 54 sec latency 2608.065 ms oleg237-client.virtnet: 1 12335 1.52 MB/sec warmup 55 sec latency 3311.726 ms oleg237-client.virtnet: 1 12674 1.50 MB/sec warmup 56 sec latency 17.045 ms oleg237-client.virtnet: 1 13060 1.50 MB/sec warmup 57 sec latency 15.441 ms oleg237-client.virtnet: 1 13581 1.57 MB/sec warmup 58 sec latency 16.297 ms oleg237-client.virtnet: 1 14205 1.62 MB/sec warmup 59 sec latency 16.231 ms oleg237-client.virtnet: 1 14760 0.33 MB/sec execute 1 sec latency 15.637 ms oleg237-client.virtnet: 1 15091 0.51 MB/sec execute 2 sec latency 17.193 ms oleg237-client.virtnet: 1 15416 0.80 MB/sec execute 3 sec latency 16.839 ms oleg237-client.virtnet: 1 15653 1.34 MB/sec execute 4 sec latency 490.255 ms oleg237-client.virtnet: 1 15653 1.07 MB/sec execute 5 sec latency 1490.445 ms oleg237-client.virtnet: 1 15653 0.90 MB/sec execute 6 sec latency 2490.607 ms oleg237-client.virtnet: 1 15653 0.77 MB/sec execute 7 sec latency 3490.769 ms oleg237-client.virtnet: 1 15653 0.67 MB/sec execute 8 sec latency 4490.980 ms oleg237-client.virtnet: 1 15653 0.60 MB/sec execute 9 sec latency 5491.146 ms oleg237-client.virtnet: 1 15653 0.54 MB/sec execute 10 sec latency 6491.296 ms oleg237-client.virtnet: 1 15653 0.49 MB/sec execute 11 sec latency 7491.481 ms oleg237-client.virtnet: 1 15653 0.45 MB/sec execute 12 sec latency 8491.652 ms oleg237-client.virtnet: 1 15653 0.41 MB/sec execute 13 sec latency 9491.820 ms oleg237-client.virtnet: 1 15653 0.38 MB/sec execute 14 sec latency 10491.983 ms oleg237-client.virtnet: 1 15653 0.36 MB/sec execute 15 sec latency 11492.151 ms oleg237-client.virtnet: 1 15653 0.34 MB/sec execute 16 sec latency 12492.318 ms oleg237-client.virtnet: 1 15653 0.32 MB/sec execute 17 sec latency 13492.489 ms oleg237-client.virtnet: 1 15653 0.30 MB/sec execute 18 sec latency 14492.645 ms oleg237-client.virtnet: 1 15653 0.28 MB/sec execute 1Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 22:11:38 (1716257498) targets are mounted 22:11:38 (1716257498) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2444 1285244 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2008 1285680 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 11920 3594056 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 37572 3568804 2% /mnt/lustre[OST:1] filesystem_summary: 7666232 49492 7162860 1% /mnt/lustre test_70b fail mds2 4 times Failing mds2 on oleg237-server Stopping /mnt/lustre-mds2 (opts:) on oleg237-server 22:11:51 (1716257511) shut down Failover mds2 to oleg237-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0001 22:12:07 (1716257527) targets are mounted 22:12:07 (1716257527) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2468 1285220 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2008 1285680 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 12076 3594056 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 37400 3569108 2% /mnt/lustre[OST:1] filesystem_summary: 7666232 49476 7163164 1% /mnt/lustre 9 sec latency 15492.839 ms oleg237-client.virtnet: 1 15653 0.27 MB/sec execute 20 sec latency 16493.037 ms oleg237-client.virtnet: 1 15653 0.26 MB/sec execute 21 sec latency 17493.224 ms oleg237-client.virtnet: 1 15653 0.24 MB/sec execute 22 sec latency 18493.419 ms oleg237-client.virtnet: 1 15653 0.23 MB/sec execute 23 sec latency 19493.607 ms oleg237-client.virtnet: 1 15653 0.22 MB/sec execute 24 sec latency 20493.800 ms oleg237-client.virtnet: 1 15653 0.21 MB/sec execute 25 sec latency 21493.934 ms oleg237-client.virtnet: 1 15653 0.21 MB/sec execute 26 sec latency 22494.110 ms oleg237-client.virtnet: 1 15653 0.20 MB/sec execute 27 sec latency 23494.297 ms oleg237-client.virtnet: 1 15653 0.19 MB/sec execute 28 sec latency 24494.472 ms oleg237-client.virtnet: 1 15653 0.19 MB/sec execute 29 sec latency 25494.603 ms oleg237-client.virtnet: 1 15787 0.18 MB/sec execute 30 sec latency 26044.332 ms oleg237-client.virtnet: 1 16125 0.19 MB/sec execute 31 sec latency 17.027 ms oleg237-client.virtnet: 1 16538 0.22 MB/sec execute 32 sec latency 14.202 ms oleg237-client.virtnet: 1 17095 0.37 MB/sec execute 33 sec latency 16.358 ms oleg237-client.virtnet: 1 17722 0.52 MB/sec execute 34 sec latency 21.986 ms oleg237-client.virtnet: 1 18022 0.55 MB/sec execute 35 sec latency 15.701 ms oleg237-client.virtnet: 1 18258 0.54 MB/sec execute 36 sec latency 15.555 ms oleg237-client.virtnet: 1 18602 0.54 MB/sec execute 37 sec latency 21.324 ms oleg237-client.virtnet: 1 18914 0.57 MB/sec execute 38 sec latency 15.649 ms oleg237-client.virtnet: 1 19332 0.63 MB/sec execute 39 sec latency 15.686 ms oleg237-client.virtnet: 1 19660 0.62 MB/sec execute 40 sec latency 17.754 ms oleg237-client.virtnet: 1 20077 0.64 MB/sec execute 41 sec latency 10.607 ms oleg237-client.virtnet: 1 20501 0.73 MB/sec execute 42 sec latency 15.480 ms oleg237-client.virtnet: 1 21140 0.79 MB/sec execute 43 sec latency 15.783 ms oleg237-client.virtnet: 1 21561 0.87 MB/sec execute 44 sec latency 15.497 ms oleg237-client.virtnet: 1 21791 0.86 MB/sec execute 45 sec latency 15.646 ms oleg237-client.virtnet: 1 22122 0.85 MB/sec execute 46 sec latency 18.461 ms oleg237-client.virtnet: 1 22450 0.86 MB/sec execute 47 sec latency 15.568 ms oleg237-client.virtnet: 1 22836 0.91 MB/sec execute 48 sec latency 15.992 ms oleg237-client.virtnet: 1 23144 0.90 MB/sec execute 49 sec latency 17.527 ms oleg237-client.virtnet: 1 23556 0.89 MB/sec execute 50 sec latency 10.886 ms oleg237-client.virtnet: 1 23929 0.91 MB/sec execute 51 sec latency 15.471 ms oleg237-client.virtnet: 1 24672 1.02 MB/sec execute 52 sec latency 15.755 ms oleg237-client.virtnet: 1 25100 1.08 MB/sec execute 53 sec latency 15.279 ms oleg237-client.virtnet: 1 25341 1.07 MB/sec execute 54 sec latency 15.797 ms oleg237-client.virtnet: 1 25467 1.05 MB/sec execute 55 sec latency 647.093 ms oleg237-client.virtnet: 1 25467 1.03 MB/sec execute 56 sec latency 1647.269 ms oleg237-client.virtnet: 1 25467 1.01 MB/sec execute 57 sec latency 2647.477 ms oleg237-client.virtnet: 1 25651 1.00 MB/sec execute 58 sec latency 3150.925 ms oleg237-client.virtnet: 1 25969 1.01 MB/sec execute 59 sec latency 15.652 ms oleg237-client.virtnet: 1 26379 1.05 MB/sec execute 60 sec latency 15.743 ms oleg237-client.virtnet: 1 26685 1.03 MB/sec execute 61 sec latency 18.432 ms oleg237-client.virtnet: 1 27076 1.02 MB/sec execute 62 sec latency 11.282 ms oleg237-client.virtnet: 1 27460 1.04 MB/sec execute 63 sec latency 16.389 ms oleg237-client.virtnet: test_70b fail mds1 5 times Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 22:12:19 (1716257539) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 22:12:35 (1716257555) targets are mounted 22:12:35 (1716257555) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2444 1285244 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2008 1285680 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 14424 3591568 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 37400 3569192 2% /mnt/lustre[OST:1] filesystem_summary: 7666232 51824 7160760 1% /mnt/lustre test_70b fail mds2 6 times Failing mds2 on oleg237-server Stopping /mnt/lustre-mds2 (opts:) on oleg237-server 22:12:48 (1716257568) shut down Failover mds2 to oleg237-server mount facets: mds2 1 27864 1.10 MB/sec execute 64 sec latency 373.893 ms oleg237-client.virtnet: 1 28488 1.16 MB/sec execute 65 sec latency 16.155 ms oleg237-client.virtnet: 1 28761 1.16 MB/sec execute 66 sec latency 16.547 ms oleg237-client.virtnet: 1 28811 1.15 MB/sec execute 67 sec latency 796.319 ms oleg237-client.virtnet: 1 28811 1.13 MB/sec execute 68 sec latency 1796.499 ms oleg237-client.virtnet: 1 28811 1.11 MB/sec execute 69 sec latency 2796.702 ms oleg237-client.virtnet: 1 28811 1.10 MB/sec execute 70 sec latency 3796.870 ms oleg237-client.virtnet: 1 28811 1.08 MB/sec execute 71 sec latency 4797.029 ms oleg237-client.virtnet: 1 28811 1.07 MB/sec execute 72 sec latency 5797.198 ms oleg237-client.virtnet: 1 28811 1.05 MB/sec execute 73 sec latency 6797.353 ms oleg237-client.virtnet: 1 28811 1.04 MB/sec execute 74 sec latency 7797.545 ms oleg237-client.virtnet: 1 28811 1.02 MB/sec execute 75 sec latency 8797.693 ms oleg237-client.virtnet: 1 28811 1.01 MB/sec execute 76 sec latency 9797.859 ms oleg237-client.virtnet: 1 28811 1.00 MB/sec execute 77 sec latency 10798.011 ms oleg237-client.virtnet: 1 28811 0.98 MB/sec execute 78 sec latency 11798.191 ms oleg237-client.virtnet: 1 28811 0.97 MB/sec execute 79 sec latency 12798.390 ms oleg237-client.virtnet: 1 28811 0.96 MB/sec execute 80 sec latency 13798.598 ms oleg237-client.virtnet: 1 28811 0.95 MB/sec execute 81 sec latency 14798.779 ms oleg237-client.virtnet: 1 28811 0.94 MB/sec execute 82 sec latency 15798.954 ms oleg237-client.virtnet: 1 28811 0.93 MB/sec execute 83 sec latency 16799.142 ms oleg237-client.virtnet: 1 28811 0.91 MB/sec execute 84 sec latency 17799.332 ms oleg237-client.virtnet: 1 28811 0.90 MB/sec execute 85 sec latency 18799.554 ms oleg237-client.virtnet: 1 28845 0.89 MB/sec execute 86 sec latency 19703.552 ms oleg237-client.virtnet: 1 29122 0.89 MB/sec execute 87 sec latency 16.051 ms oleg237-client.virtnet: 1 29398 0.88 MB/sec execute 88 sec latency 16.115 ms oleg237-client.virtnet: 1 29815 0.92 MB/sec execute 89 sec latency 16.043 ms oleg237-client.virtnet: 1 30112 0.91 MB/sec execute 90 sec latency 16.808 ms oleg237-client.virtnet: 1 30481 0.91 MB/sec execute 91 sec latency 15.070 ms oleg237-client.virtnet: 1 30854 0.91 MB/sec execute 92 sec latency 16.367 ms oleg237-client.virtnet: 1 31420 0.96 MB/sec execute 93 sec latency 15.633 ms oleg237-client.virtnet: 1 32036 1.01 MB/sec execute 94 sec latency 15.928 ms oleg237-client.virtnet: 1 32322 1.01 MB/sec execute 95 sec latency 15.688 ms oleg237-client.virtnet: 1 32489 1.00 MB/sec execute 96 sec latency 272.885 ms oleg237-client.virtnet: 1 32489 0.99 MB/sec execute 97 sec latency 1273.179 ms oleg237-client.virtnet: 1 32489 0.98 MB/sec execute 98 sec latency 2273.428 ms oleg237-client.virtnet: 1 32489 0.97 MB/sec execute 99 sec latency 3273.690 ms oleg237-client.virtnet: 1 32489 0.96 MB/sec execute 100 sec latency 4273.896 ms oleg237-client.virtnet: 1 32489 0.95 MB/sec execute 101 sec latency 5274.083 ms oleg237-client.virtnet: 1 32489 0.94 MB/sec execute 102 sec latency 6274.251 ms oleg237-client.virtnet: 1 32489 0.93 MB/sec execute 103 sec latency 7274.427 ms oleg237-client.virtnet: 1 32489 0.92 MB/sec execute 104 sec latency 8274.632 ms oleg237-client.virtnet: 1 32489 0.91 MB/sec execute 105 sec latency 9274.841 ms oleg237-client.virtnet: 1 32489 0.91 MB/sec execute 106 sec latency 10275.015 ms oleg237-client.virtnet: 1 32489 0.90 MB/sec execute 107 sec latency 11275.188 ms oleg237-client.virtnet: 1 32489 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0001 22:13:03 (1716257583) targets are mounted 22:13:03 (1716257583) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2452 1285236 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2008 1285680 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 12076 3593916 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 37408 3569072 2% /mnt/lustre[OST:1] filesystem_summary: 7666232 49484 7162988 1% /mnt/lustre test_70b fail mds1 7 times Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 22:13:16 (1716257596) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 22:13:31 (1716257611) targets are mounted 22:13:31 (1716257611) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2444 1285244 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2008 1285680 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 14180 3591188 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 37480 3569112 2% /mnt/lustre[OST:1] filesystem_summary: 7666232 51660 7160300 1% /mnt/lustre test_70b fail mds2 8 times Failing mds2 on oleg237-server Stopping /mnt/lustre-mds2 (opts:) on oleg237-server 0.89 MB/sec execute 108 sec latency 12275.369 ms oleg237-client.virtnet: 1 32489 0.88 MB/sec execute 109 sec latency 13275.570 ms oleg237-client.virtnet: 1 32489 0.87 MB/sec execute 110 sec latency 14275.776 ms oleg237-client.virtnet: 1 32489 0.86 MB/sec execute 111 sec latency 15275.976 ms oleg237-client.virtnet: 1 32489 0.86 MB/sec execute 112 sec latency 16276.162 ms oleg237-client.virtnet: 1 32489 0.85 MB/sec execute 113 sec latency 17276.340 ms oleg237-client.virtnet: 1 32502 0.84 MB/sec execute 114 sec latency 18236.178 ms oleg237-client.virtnet: 1 32863 0.84 MB/sec execute 115 sec latency 15.728 ms oleg237-client.virtnet: 1 33186 0.85 MB/sec execute 116 sec latency 16.069 ms oleg237-client.virtnet: 1 33577 0.86 MB/sec execute 117 sec latency 16.202 ms oleg237-client.virtnet: 1 33909 0.86 MB/sec execute 118 sec latency 17.700 ms oleg237-client.virtnet: 1 34325 0.86 MB/sec execute 119 sec latency 13.577 ms oleg237-client.virtnet: 1 34731 0.89 MB/sec execute 120 sec latency 103.074 ms oleg237-client.virtnet: 1 35289 0.91 MB/sec execute 121 sec latency 297.272 ms oleg237-client.virtnet: 1 35711 0.94 MB/sec execute 122 sec latency 13.219 ms oleg237-client.virtnet: 1 35910 0.94 MB/sec execute 123 sec latency 153.342 ms oleg237-client.virtnet: 1 35910 0.93 MB/sec execute 124 sec latency 1153.532 ms oleg237-client.virtnet: 1 35910 0.92 MB/sec execute 125 sec latency 2153.780 ms oleg237-client.virtnet: 1 35910 0.91 MB/sec execute 126 sec latency 3153.952 ms oleg237-client.virtnet: 1 35910 0.91 MB/sec execute 127 sec latency 4154.118 ms oleg237-client.virtnet: 1 35910 0.90 MB/sec execute 128 sec latency 5154.261 ms oleg237-client.virtnet: 1 35910 0.89 MB/sec execute 129 sec latency 6154.411 ms oleg237-client.virtnet: 1 35910 0.89 MB/sec execute 130 sec latency 7154.579 ms oleg237-client.virtnet: 1 35910 0.88 MB/sec execute 131 sec latency 8154.733 ms oleg237-client.virtnet: 1 35910 0.87 MB/sec execute 132 sec latency 9155.146 ms oleg237-client.virtnet: 1 35910 0.87 MB/sec execute 133 sec latency 10155.355 ms oleg237-client.virtnet: 1 35910 0.86 MB/sec execute 134 sec latency 11155.564 ms oleg237-client.virtnet: 1 35910 0.85 MB/sec execute 135 sec latency 12155.745 ms oleg237-client.virtnet: 1 35910 0.85 MB/sec execute 136 sec latency 13155.950 ms oleg237-client.virtnet: 1 35910 0.84 MB/sec execute 137 sec latency 14156.163 ms oleg237-client.virtnet: 1 35910 0.83 MB/sec execute 138 sec latency 15156.358 ms oleg237-client.virtnet: 1 35910 0.83 MB/sec execute 139 sec latency 16156.584 ms oleg237-client.virtnet: 1 35910 0.82 MB/sec execute 140 sec latency 17156.797 ms oleg237-client.virtnet: 1 35910 0.82 MB/sec execute 141 sec latency 18156.974 ms oleg237-client.virtnet: 1 35910 0.81 MB/sec execute 142 sec latency 19157.153 ms oleg237-client.virtnet: 1 35997 0.80 MB/sec execute 143 sec latency 19757.152 ms oleg237-client.virtnet: 1 36289 0.80 MB/sec execute 144 sec latency 21.043 ms oleg237-client.virtnet: 1 36588 0.81 MB/sec execute 145 sec latency 18.940 ms oleg237-client.virtnet: 1 36985 0.82 MB/sec execute 146 sec latency 16.048 ms oleg237-client.virtnet: 1 37288 0.82 MB/sec execute 147 sec latency 17.615 ms oleg237-client.virtnet: 1 37665 0.82 MB/sec execute 148 sec latency 13.210 ms oleg237-client.virtnet: 1 38061 0.83 MB/sec execute 149 sec latency 15.493 ms oleg237-client.virtnet: 1 38693 0.86 MB/sec execute 150 sec latency 17.301 ms oleg237-client.virtnet: 1 39178 0.88 MB/sec execute 151 sec latency 15.448 ms oleg237-client.virtnet: 1 39443 0.88 MB/sec ex22:13:45 (1716257625) shut down Failover mds2 to oleg237-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0001 22:14:00 (1716257640) targets are mounted 22:14:00 (1716257640) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2468 1285220 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2008 1285680 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 12100 3591868 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 37524 3568908 2% /mnt/lustre[OST:1] filesystem_summary: 7666232 49624 7160776 1% /mnt/lustre test_70b fail mds1 9 times Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 22:14:13 (1716257653) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 22:14:28 (1716257668) targets are mounted 22:14:28 (1716257668) facet_failover done ecute 152 sec latency 16.114 ms oleg237-client.virtnet: 1 39780 0.88 MB/sec execute 153 sec latency 15.907 ms oleg237-client.virtnet: 1 40060 0.88 MB/sec execute 154 sec latency 16.237 ms oleg237-client.virtnet: 1 40491 0.90 MB/sec execute 155 sec latency 16.465 ms oleg237-client.virtnet: 1 40789 0.89 MB/sec execute 156 sec latency 17.652 ms oleg237-client.virtnet: 1 41182 0.89 MB/sec execute 157 sec latency 11.049 ms oleg237-client.virtnet: 1 41526 0.90 MB/sec execute 158 sec latency 15.788 ms oleg237-client.virtnet: 1 42067 0.92 MB/sec execute 159 sec latency 16.059 ms oleg237-client.virtnet: 1 42683 0.95 MB/sec execute 160 sec latency 15.500 ms oleg237-client.virtnet: 1 42957 0.95 MB/sec execute 161 sec latency 15.993 ms oleg237-client.virtnet: 1 43249 0.95 MB/sec execute 162 sec latency 15.876 ms oleg237-client.virtnet: 1 43527 0.95 MB/sec execute 163 sec latency 17.756 ms oleg237-client.virtnet: 1 43872 0.95 MB/sec execute 164 sec latency 16.593 ms oleg237-client.virtnet: 1 44279 0.96 MB/sec execute 165 sec latency 16.022 ms oleg237-client.virtnet: 1 44642 0.96 MB/sec execute 166 sec latency 17.592 ms oleg237-client.virtnet: 1 45033 0.96 MB/sec execute 167 sec latency 15.509 ms oleg237-client.virtnet: 1 45034 0.96 MB/sec execute 168 sec latency 993.989 ms oleg237-client.virtnet: 1 45034 0.95 MB/sec execute 169 sec latency 1994.177 ms oleg237-client.virtnet: 1 45034 0.94 MB/sec execute 170 sec latency 2994.346 ms oleg237-client.virtnet: 1 45171 0.94 MB/sec execute 171 sec latency 3696.839 ms oleg237-client.virtnet: 1 45867 0.97 MB/sec execute 172 sec latency 15.688 ms oleg237-client.virtnet: 1 46303 0.99 MB/sec execute 173 sec latency 13.571 ms oleg237-client.virtnet: 1 46559 0.99 MB/sec execute 174 sec latency 16.503 ms oleg237-client.virtnet: 1 46870 0.99 MB/sec execute 175 sec latency 17.672 ms oleg237-client.virtnet: 1 47152 0.99 MB/sec execute 176 sec latency 16.423 ms oleg237-client.virtnet: 1 47589 1.00 MB/sec execute 177 sec latency 15.730 ms oleg237-client.virtnet: 1 47884 1.00 MB/sec execute 178 sec latency 17.658 ms oleg237-client.virtnet: 1 48288 1.00 MB/sec execute 179 sec latency 11.290 ms oleg237-client.virtnet: 1 48507 1.00 MB/sec execute 180 sec latency 460.952 ms oleg237-client.virtnet: 1 48507 0.99 MB/sec execute 181 sec latency 1461.139 ms oleg237-client.virtnet: 1 48507 0.99 MB/sec execute 182 sec latency 2461.316 ms oleg237-client.virtnet: 1 48507 0.98 MB/sec execute 183 sec latency 3461.523 ms oleg237-client.virtnet: 1 48507 0.97 MB/sec execute 184 sec latency 4461.679 ms oleg237-client.virtnet: 1 48507 0.97 MB/sec execute 185 sec latency 5461.839 ms oleg237-client.virtnet: 1 48507 0.96 MB/sec execute 186 sec latency 6462.059 ms oleg237-client.virtnet: 1 48507 0.96 MB/sec execute 187 sec latency 7462.221 ms oleg237-client.virtnet: 1 48507 0.95 MB/sec execute 188 sec latency 8462.381 ms oleg237-client.virtnet: 1 48507 0.95 MB/sec execute 189 sec latency 9462.603 ms oleg237-client.virtnet: 1 48507 0.94 MB/sec execute 190 sec latency 10462.745 ms oleg237-client.virtnet: 1 48507 0.94 MB/sec execute 191 sec latency 11462.926 ms oleg237-client.virtnet: 1 48507 0.93 MB/sec execute 192 sec latency 12463.109 ms oleg237-client.virtnet: 1 48507 0.93 MB/sec execute 193 sec latency 13463.286 ms oleg237-client.virtnet: 1 48507 0.92 MB/sec execute 194 sec latency 14463.464 ms oleg237-client.virtnet: 1 48507 0.92 MB/sec execute 195 sec latency 15463.633 ms oleg237-client.virtnet: 1 48507 0.92 MB/sec execute 196 sec latency 16463.825 ms oleg237-clieoleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2444 1285244 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2008 1285680 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 12212 3593652 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 37400 3567020 2% /mnt/lustre[OST:1] filesystem_summary: 7666232 49612 7160672 1% /mnt/lustre test_70b fail mds2 10 times Failing mds2 on oleg237-server Stopping /mnt/lustre-mds2 (opts:) on oleg237-server 22:14:41 (1716257681) shut down Failover mds2 to oleg237-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0001 22:14:57 (1716257697) targets are mounted 22:14:57 (1716257697) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2468 1285220 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2008 1285680 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 11924 3593944 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 37568 3568612 2% /mnt/lustre[OST:1] filesystem_summary: 7666232 49492 7162556 1% /mnt/lustre test_70b fail mds1 11 times Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 22:15:09 (1716257709) shut down nt.virtnet: 1 48507 0.91 MB/sec execute 197 sec latency 17464.006 ms oleg237-client.virtnet: 1 48507 0.91 MB/sec execute 198 sec latency 18464.191 ms oleg237-client.virtnet: 1 48507 0.90 MB/sec execute 199 sec latency 19464.365 ms oleg237-client.virtnet: 1 48923 0.92 MB/sec execute 200 sec latency 19572.380 ms oleg237-client.virtnet: 1 49633 0.94 MB/sec execute 201 sec latency 15.426 ms oleg237-client.virtnet: 1 49943 0.95 MB/sec execute 202 sec latency 16.618 ms oleg237-client.virtnet: 1 50173 0.94 MB/sec execute 203 sec latency 15.690 ms oleg237-client.virtnet: 1 50486 0.94 MB/sec execute 204 sec latency 16.236 ms oleg237-client.virtnet: 1 50823 0.94 MB/sec execute 205 sec latency 15.814 ms oleg237-client.virtnet: 1 51229 0.95 MB/sec execute 206 sec latency 16.015 ms oleg237-client.virtnet: 1 51528 0.95 MB/sec execute 207 sec latency 17.909 ms oleg237-client.virtnet: 1 51944 0.95 MB/sec execute 208 sec latency 12.004 ms oleg237-client.virtnet: 1 52406 0.97 MB/sec execute 209 sec latency 16.006 ms oleg237-client.virtnet: 1 53054 0.98 MB/sec execute 210 sec latency 15.715 ms oleg237-client.virtnet: 1 53479 1.00 MB/sec execute 211 sec latency 15.432 ms oleg237-client.virtnet: 1 53706 0.99 MB/sec execute 212 sec latency 16.149 ms oleg237-client.virtnet: 1 54063 0.99 MB/sec execute 213 sec latency 19.300 ms oleg237-client.virtnet: 1 54434 0.99 MB/sec execute 214 sec latency 15.571 ms oleg237-client.virtnet: 1 54835 1.00 MB/sec execute 215 sec latency 15.404 ms oleg237-client.virtnet: 1 55190 1.00 MB/sec execute 216 sec latency 16.766 ms oleg237-client.virtnet: 1 55611 1.00 MB/sec execute 217 sec latency 12.326 ms oleg237-client.virtnet: 1 56137 1.02 MB/sec execute 218 sec latency 15.490 ms oleg237-client.virtnet: 1 56771 1.04 MB/sec execute 219 sec latency 15.884 ms oleg237-client.virtnet: 1 57071 1.04 MB/sec execute 220 sec latency 15.425 ms oleg237-client.virtnet: 1 57322 1.04 MB/sec execute 221 sec latency 15.490 ms oleg237-client.virtnet: 1 57669 1.04 MB/sec execute 222 sec latency 18.345 ms oleg237-client.virtnet: 1 58017 1.04 MB/sec execute 223 sec latency 15.559 ms oleg237-client.virtnet: 1 58389 1.05 MB/sec execute 224 sec latency 136.427 ms oleg237-client.virtnet: 1 58389 1.04 MB/sec execute 225 sec latency 1136.609 ms oleg237-client.virtnet: 1 58389 1.04 MB/sec execute 226 sec latency 2136.781 ms oleg237-client.virtnet: 1 58389 1.04 MB/sec execute 227 sec latency 3136.943 ms oleg237-client.virtnet: 1 58583 1.03 MB/sec execute 228 sec latency 3553.481 ms oleg237-client.virtnet: 1 58984 1.03 MB/sec execute 229 sec latency 12.979 ms oleg237-client.virtnet: 1 59368 1.03 MB/sec execute 230 sec latency 16.007 ms oleg237-client.virtnet: 1 60055 1.06 MB/sec execute 231 sec latency 15.832 ms oleg237-client.virtnet: 1 60529 1.07 MB/sec execute 232 sec latency 13.671 ms oleg237-client.virtnet: 1 60767 1.07 MB/sec execute 233 sec latency 15.791 ms oleg237-client.virtnet: 1 61101 1.07 MB/sec execute 234 sec latency 15.856 ms oleg237-client.virtnet: 1 61411 1.07 MB/sec execute 235 sec latency 15.918 ms oleg237-client.virtnet: 1 61835 1.08 MB/sec execute 236 sec latency 16.558 ms oleg237-client.virtnet: 1 61848 1.07 MB/sec execute 237 sec latency 946.849 ms oleg237-client.virtnet: 1 61848 1.07 MB/sec execute 238 sec latency 1947.024 ms oleg237-client.virtnet: 1 61848 1.06 MB/sec execute 239 sec latency 2947.181 ms oleg237-client.virtnet: 1 61848 1.06 MB/sec execute 240 sec latency 3947.338 ms oleg237-client.virtnet: 1 61848 1.05 MB/sec execute 241 sec latFailover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 22:15:24 (1716257724) targets are mounted 22:15:24 (1716257724) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec ency 4947.494 ms oleg237-client.virtnet: 1 61848 1.05 MB/sec execute 242 sec latency 5947.658 ms oleg237-client.virtnet: 1 61848 1.05 MB/sec execute 243 sec latency 6947.826 ms oleg237-client.virtnet: 1 61848 1.04 MB/sec execute 244 sec latency 7948.005 ms oleg237-client.virtnet: 1 61848 1.04 MB/sec execute 245 sec latency 8948.216 ms oleg237-client.virtnet: 1 61848 1.03 MB/sec execute 246 sec latency 9948.391 ms oleg237-client.virtnet: 1 61848 1.03 MB/sec execute 247 sec latency 10948.548 ms oleg237-client.virtnet: 1 61848 1.02 MB/sec execute 248 sec latency 11948.747 ms oleg237-client.virtnet: 1 61848 1.02 MB/sec execute 249 sec latency 12948.920 ms oleg237-client.virtnet: 1 61848 1.02 MB/sec execute 250 sec latency 13949.120 ms oleg237-client.virtnet: 1 61848 1.01 MB/sec execute 251 sec latency 14949.297 ms oleg237-client.virtnet: 1 61848 1.01 MB/sec execute 252 sec latency 15949.477 ms oleg237-client.virtnet: 1 61848 1.00 MB/sec execute 253 sec latency 16949.640 ms oleg237-client.virtnet: 1 61848 1.00 MB/sec execute 254 sec latency 17949.815 ms oleg237-client.virtnet: 1 61848 1.00 MB/sec execute 255 sec latency 18949.995 ms oleg237-client.virtnet: 1 61874 0.99 MB/sec execute 256 sec latency 19857.321 ms oleg237-client.virtnet: 1 62259 0.99 MB/sec execute 257 sec latency 19.162 ms oleg237-client.virtnet: 1 62678 0.99 MB/sec execute 258 sec latency 14.403 ms oleg237-client.virtnet: 1 63103 1.00 MB/sec execute 259 sec latency 16.894 ms oleg237-client.virtnet: 1 63707 1.01 MB/sec execute 260 sec latency 15.992 ms oleg237-client.virtnet: 1 64111 1.03 MB/sec execute 261 sec latency 15.811 ms oleg237-client.virtnet: 1 64335 1.02 MB/sec execute 262 sec latency 16.523 ms oleg237-client.virtnet: 1 64657 1.02 MB/sec execute 263 sec latency 19.752 ms oleg237-client.virtnet: 1 64964 1.02 MB/sec execute 264 sec latency 18.049 ms oleg237-client.virtnet: 1 65375 1.03 MB/sec execute 265 sec latency 15.829 ms oleg237-client.virtnet: 1 65675 1.03 MB/sec execute 266 sec latency 17.842 ms oleg237-client.virtnet: 1 66053 1.03 MB/sec execute 267 sec latency 11.009 ms oleg237-client.virtnet: 1 66441 1.03 MB/sec execute 268 sec latency 15.713 ms oleg237-client.virtnet: 1 67105 1.05 MB/sec execute 269 sec latency 15.598 ms oleg237-client.virtnet: 1 67577 1.06 MB/sec execute 270 sec latency 12.925 ms oleg237-client.virtnet: 1 67844 1.06 MB/sec execute 271 sec latency 16.064 ms oleg237-client.virtnet: 1 68163 1.06 MB/sec execute 272 sec latency 20.054 ms oleg237-client.virtnet: 1 68474 1.06 MB/sec execute 273 sec latency 15.701 ms oleg237-client.virtnet: 1 68943 1.07 MB/sec execute 274 sec latency 15.549 ms oleg237-client.virtnet: 1 69258 1.06 MB/sec execute 275 sec latency 17.481 ms oleg237-client.virtnet: 1 69667 1.06 MB/sec execute 276 sec latency 10.524 ms oleg237-client.virtnet: 1 70028 1.07 MB/sec execute 277 sec latency 15.534 ms oleg237-client.virtnet: 1 70765 1.08 MB/sec execute 278 sec latency 15.463 ms oleg237-client.virtnet: 1 71178 1.10 MB/sec execute 279 sec latency 15.619 ms oleg237-client.virtnet: 1 71407 1.09 MB/sec execute 280 sec latency 15.997 ms oleg237-client.virtnet: 1 71736 1.09 MB/sec execute 281 sec latency 15.875 ms oleg237-client.virtnet: 1 72047 1.09 MB/sec execute 282 sec latency 15.582 ms oleg237-client.virtnet: 1 72459 1.10 MB/sec execute 283 sec latency 15.386 ms oleg237-client.virtnet: 1 72761 1.10 MB/sec execute 284 sec latency 17.599 ms oleg237-client.virtnet: 1 73152 1.09 MB/sec execute 285 sec latency 11.605 ms oleg237-client.virtnet: 1 73498 1.10 MB/sec execute 286 sec latency 15.299 ms oleg237-client.virtnet: 1 74170 1.11 MB/sec execute 287 sec latency 16.333 ms oleg237-client.virtnet: 1 74662 1.13 MB/sec execute 288 sec latency 15.997 ms oleg237-client.virtnet: 1 74923 1.13 MB/sec execute 289 sec latency 16.022 ms oleg237-client.virtnet: 1 75229 1.12 MB/sec execute 290 sec latency 19.822 ms oleg237-client.virtnet: 1 75512 1.12 MB/sec execute 291 sec latency 16.015 ms oleg237-client.virtnet: 1 75928 1.13 MB/sec execute 292 sec latency 15.817 ms oleg237-client.virtnet: 1 76238 1.13 MB/sec execute 293 sec latency 17.720 ms oleg237-client.virtnet: 1 76580 1.13 MB/sec execute 294 sec latency 15.725 ms oleg237-client.virtnet: 1 76971 1.13 MB/sec execute 295 sec latency 15.526 ms oleg237-client.virtnet: 1 77517 1.14 MB/sec execute 296 sec latency 15.526 ms oleg237-client.virtnet: 1 78116 1.15 MB/sec execute 297 sec latency 15.921 ms oleg237-client.virtnet: 1 78403 1.16 MB/sec execute 298 sec latency 15.828 ms oleg237-client.virtnet: 1 78665 1.15 MB/sec execute 299 sec latency 16.045 ms oleg237-client.virtnet: 1 cleanup 300 sec oleg237-client.virtnet: 0 cleanup 300 sec oleg237-client.virtnet: oleg237-client.virtnet: Operation Count AvgLat MaxLat oleg237-client.virtnet: ---------------------------------------- oleg237-client.virtnet: NTCreateX 11197 17.885 26044.319 oleg237-client.virtnet: Close 8200 1.852 7.386 oleg237-client.virtnet: Rename 470 11.772 21.956 oleg237-client.virtnet: Unlink 2287 2.966 7.339 oleg237-client.virtnet: Qpathinfo 10112 4.663 18236.147 oleg237-client.virtnet: Qfileinfo 1754 0.200 2.545 oleg237-client.virtnet: Qfsinfo 1859 0.333 6.947 oleg237-client.virtnet: Sfileinfo 912 6.297 13.346 oleg237-client.virtnet: Find 3910 0.635 16.692 oleg237-client.virtnet: WriteX 5498 1.354 10.089 oleg237-client.virtnet: ReadX 17418 0.024 4.077 oleg237-client.virtnet: LockX 36 1.956 2.331 oleg237-client.virtnet: UnlockX 36 2.030 2.229 oleg237-client.virtnet: Flush 794 9.237 373.870 oleg237-client.virtnet: oleg237-client.virtnet: Throughput 1.15236 MB/sec 1 clients 1 procs max_latency=26044.332 ms oleg237-client.virtnet: stopping dbench on /mnt/lustre/d70b.replay-single/oleg237-client.virtnet at Mon May 20 22:16:13 EDT 2024 with return code 0 oleg237-client.virtnet: clean dbench files on /mnt/lustre/d70b.replay-single/oleg237-client.virtnet oleg237-client.virtnet: /mnt/lustre/d70b.replay-single/oleg237-client.virtnet /mnt/lustre/d70b.replay-single/oleg237-client.virtnet oleg237-client.virtnet: removed directory: 'clients/client0/~dmtmp/ACCESS' oleg237-client.virtnet: removed directory: 'clients/client0/~dmtmp/WORD' oleg237-client.virtnet: removed directory: 'clients/client0/~dmtmp/WORDPRO' oleg237-client.virtnet: removed directory: 'clients/client0/~dmtmp/COREL' oleg237-client.virtnet: removed directory: 'clients/client0/~dmtmp/SEED' oleg237-client.virtnet: removed directory: 'clients/client0/~dmtmp/PARADOX' oleg237-client.virtnet: removed directory: 'clients/client0/~dmtmp/EXCEL' oleg237-client.virtnet: removed directory: 'clients/client0/~dmtmp/PM' oleg237-client.virtnet: removed directory: 'clients/client0/~dmtmp/PWRPNT' oleg237-client.virtnet: removed directory: 'clients/client0/~dmtmp' oleg237-client.virtnet: removed directory: 'clients/client0' oleg237-client.virtnet: removed directory: 'clients' oleg237-client.virtnet: removed 'client.txt' oleg237-client.virtnet: /mnt/lustre/d70b.replay-single/oleg237-client.virtnet oleg237-client.virtnet: dbench successfully finished PASS 70b (364s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 70c: tar 2mdts recovery ============ 22:16:17 (1716257777) Starting client oleg237-client.virtnet: -o user_xattr,flock oleg237-server@tcp:/lustre /mnt/lustre Started clients oleg237-client.virtnet: 192.168.202.137@tcp:/lustre on /mnt/lustre type lustre (rw,checksum,flock,user_xattr,lruresize,lazystatfs,nouser_fid2path,verbose,noencrypt,statfs_project) Started tar 2634 striped dir -i0 -c2 -H crush2 /mnt/lustre/d70c.replay-single tar: Removing leading `/' from member names tar: Removing leading `/' from hard link targets tar: Removing leading `/' from member names tar: Removing leading `/' from hard link targets tar: Removing leading `/' from member names UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 6640 1281048 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 4316 1283372 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 16032 3576776 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 7624 3587816 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 23656 7164592 1% /mnt/lustre tar: Removing leading `/' from hard link targets test_70c fail mds2 1 times Failing mds2 on oleg237-server Stopping /mnt/lustre-mds2 (opts:) on oleg237-server 22:18:32 (1716257912) shut down Failover mds2 to oleg237-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0001 22:18:48 (1716257928) targets are mounted 22:18:48 (1716257928) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec tar: Removing leading `/' from member names tar: Removing leading `/' from hard link targets tar: Removing leading `/' from member names tar: Removing leading `/' from hard link targets UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 6900 1280788 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 4308 1283380 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 10060 3582184 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 15768 3575196 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 25828 7157380 1% /mnt/lustre test_70c fail mds2 2 times Failing mds2 on oleg237-server Stopping /mnt/lustre-mds2 (opts:) on oleg237-server 22:21:09 (1716258069) shut down Failover mds2 to oleg237-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0001 22:21:25 (1716258085) targets are mounted 22:21:25 (1716258085) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 70c (355s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 70d: mkdir/rmdir striped dir 2mdts recovery ========================================================== 22:22:14 (1716258134) Starting client oleg237-client.virtnet: -o user_xattr,flock oleg237-server@tcp:/lustre /mnt/lustre Started clients oleg237-client.virtnet: 192.168.202.137@tcp:/lustre on /mnt/lustre type lustre (rw,checksum,flock,user_xattr,lruresize,lazystatfs,nouser_fid2path,verbose,noencrypt,statfs_project) Started 5974 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 5300 1282388 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 4172 1283516 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1580 3605356 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605436 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210792 1% /mnt/lustre test_70d fail mds1 1 times Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 22:24:35 (1716258275) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 22:24:50 (1716258290) targets are mounted 22:24:50 (1716258290) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 5904 1281784 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 5208 1282480 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1580 3605356 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605436 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210792 1% /mnt/lustre test_70d fail mds2 2 times Failing mds2 on oleg237-server Stopping /mnt/lustre-mds2 (opts:) on oleg237-server 22:27:12 (1716258432) shut down Failover mds2 to oleg237-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0001 22:27:27 (1716258447) targets are mounted 22:27:27 (1716258447) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec /home/green/git/lustre-release/lustre/tests/test-framework.sh: line 4716: 5974 Killed ( while true; do $LFS mkdir -i0 -c2 $DIR/$tdir/test || { echo "mkdir fails"; break; }; $LFS mkdir -i1 -c2 $DIR/$tdir/test1 || { echo "mkdir fails"; break; }; touch $DIR/$tdir/test/a || { echo "touch fails"; break; }; mkdir $DIR/$tdir/test/b || { echo "mkdir fails"; break; }; rm -rf $DIR/$tdir/test || { echo "rmdir fails"; ls -lR $DIR/$tdir; break; }; touch $DIR/$tdir/test1/a || { echo "touch fails"; break; }; mkdir $DIR/$tdir/test1/b || { echo "mkdir fails"; break; }; rm -rf $DIR/$tdir/test1 || { echo "rmdir fails"; ls -lR $DIR/$tdir/test1; break; }; done ) (wd: ~) PASS 70d (322s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 70e: rename cross-MDT with random fails ========================================================== 22:27:38 (1716258458) debug=+ha Starting client oleg237-client.virtnet: -o user_xattr,flock oleg237-server@tcp:/lustre /mnt/lustre Started clients oleg237-client.virtnet: 192.168.202.137@tcp:/lustre on /mnt/lustre type lustre (rw,checksum,flock,user_xattr,lruresize,lazystatfs,nouser_fid2path,verbose,noencrypt,statfs_project) Started PID=23085 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 4656 1283032 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 4168 1283520 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1580 3605356 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605436 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210792 1% /mnt/lustre test_70e fail mds2 1 times Failing mds2 on oleg237-server Stopping /mnt/lustre-mds2 (opts:) on oleg237-server 22:29:54 (1716258594) shut down Failover mds2 to oleg237-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0001 22:30:09 (1716258609) targets are mounted 22:30:09 (1716258609) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 4796 1282892 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 3940 1283748 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1580 3605356 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605436 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210792 1% /mnt/lustre test_70e fail mds1 2 times Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 22:32:30 (1716258750) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 22:32:45 (1716258765) targets are mounted 22:32:45 (1716258765) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 70e (319s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 70f: OSS O_DIRECT recovery with 1 clients ========================================================== 22:32:59 (1716258779) mount clients oleg237-client.virtnet ... Starting client oleg237-client.virtnet: -o user_xattr,flock oleg237-server@tcp:/lustre /mnt/lustre Started clients oleg237-client.virtnet: 192.168.202.137@tcp:/lustre on /mnt/lustre type lustre (rw,checksum,flock,user_xattr,lruresize,lazystatfs,nouser_fid2path,verbose,noencrypt,statfs_project) Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2496 1285192 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2056 1285632 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 5676 3570192 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3584720 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 7232 7154912 1% /mnt/lustre ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... test_70f failing OST 1 times Failing ost1 on oleg237-server Stopping /mnt/lustre-ost1 (opts:) on oleg237-server 22:33:08 (1716258788) shut down Failover ost1 to oleg237-server mount facets: ost1 Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... Started lustre-OST0000 22:33:24 (1716258804) targets are mounted 22:33:24 (1716258804) facet_failover done ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2496 1285192 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2056 1285632 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1580 3568000 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 5652 3593056 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 7232 7161056 1% /mnt/lustre Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... test_70f failing OST 2 times Failing ost1 on oleg237-server Stopping /mnt/lustre-ost1 (opts:) on oleg237-server 22:33:38 (1716258818) shut down Failover ost1 to oleg237-server mount facets: ost1 Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Started lustre-OST0000 22:33:54 (1716258834) targets are mounted 22:33:54 (1716258834) facet_failover done Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2496 1285192 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2056 1285632 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 5676 3599272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 5652 3588936 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 11328 7188208 1% /mnt/lustre ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... test_70f failing OST 3 times Failing ost1 on oleg237-server Stopping /mnt/lustre-ost1 (opts:) on oleg237-server 22:34:09 (1716258849) shut down Failover ost1 to oleg237-server mount facets: ost1 Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... Started lustre-OST0000 22:34:24 (1716258864) targets are mounted 22:34:24 (1716258864) facet_failover done ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2496 1285192 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2056 1285632 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1580 3592960 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3588840 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7181800 1% /mnt/lustre ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... test_70f failing OST 4 times Failing ost1 on oleg237-server Stopping /mnt/lustre-ost1 (opts:) on oleg237-server 22:34:39 (1716258879) shut down Failover ost1 to oleg237-server mount facets: ost1 Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... Started lustre-OST0000 22:34:55 (1716258895) targets are mounted 22:34:55 (1716258895) facet_failover done ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg237-client.virtnet' ... ldlm.namespaces.MGC192.168.202.137@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8800aa9e2800.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8800aa9e2800.lru_size=clear PASS 70f (123s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 71a: mkdir/rmdir striped dir with 2 mdts recovery ========================================================== 22:35:04 (1716258904) Starting client oleg237-client.virtnet: -o user_xattr,flock oleg237-server@tcp:/lustre /mnt/lustre Started clients oleg237-client.virtnet: 192.168.202.137@tcp:/lustre on /mnt/lustre type lustre (rw,checksum,flock,user_xattr,lruresize,lazystatfs,nouser_fid2path,verbose,noencrypt,statfs_project) Started 769 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2916 1284772 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 3316 1284372 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1580 3605440 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605464 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3044 1284644 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 3484 1284204 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1580 3605440 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605464 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre fail mds2 mds1 1 times Failing mds2 on oleg237-server Stopping /mnt/lustre-mds2 (opts:) on oleg237-server Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 22:37:24 (1716259044) shut down Failover mds1 to oleg237-server Failover mds2 to oleg237-server mount facets: mds2 mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 Started lustre-MDT0001 22:37:39 (1716259059) targets are mounted 22:37:39 (1716259059) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid,mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3976 1283712 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2540 1285148 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1580 3605440 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605464 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 4128 1283560 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2704 1284984 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1580 3605440 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605464 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre fail mds2 mds1 2 times Failing mds2 on oleg237-server Stopping /mnt/lustre-mds2 (opts:) on oleg237-server Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 22:40:10 (1716259210) shut down Failover mds1 to oleg237-server Failover mds2 to oleg237-server mount facets: mds2 mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0001 Started lustre-MDT0000 22:40:25 (1716259225) targets are mounted 22:40:25 (1716259225) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid,mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec /home/green/git/lustre-release/lustre/tests/test-framework.sh: line 4716: 769 Killed ( while true; do $LFS mkdir -i0 -c2 $DIR/$tdir/test; rmdir $DIR/$tdir/test; done ) (wd: ~) PASS 71a (336s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 73a: open(O_CREAT), unlink, replay, reconnect before open replay, close ========================================================== 22:40:42 (1716259242) multiop /mnt/lustre/f73a.replay-single vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7517 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3704 1283984 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 3456 1284232 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1580 3605440 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605464 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre fail_loc=0x80000302 Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 22:40:47 (1716259247) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 22:41:03 (1716259263) targets are mounted 22:41:03 (1716259263) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 73a (43s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 73b: open(O_CREAT), unlink, replay, reconnect at open_replay reply, close ========================================================== 22:41:28 (1716259288) multiop /mnt/lustre/f73b.replay-single vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.7517 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2440 1285248 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2020 1285668 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1580 3605440 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605464 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre fail_loc=0x80000157 Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 22:41:33 (1716259293) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 22:41:48 (1716259308) targets are mounted 22:41:48 (1716259308) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 73b (42s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 74: Ensure applications don't fail waiting for OST recovery ========================================================== 22:42:13 (1716259333) Stopping clients: oleg237-client.virtnet /mnt/lustre (opts:) Stopping client oleg237-client.virtnet /mnt/lustre opts: Stopping /mnt/lustre-ost1 (opts:) on oleg237-server Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 22:42:16 (1716259336) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 22:42:30 (1716259350) targets are mounted 22:42:30 (1716259350) facet_failover done Starting client oleg237-client.virtnet: -o user_xattr,flock oleg237-server@tcp:/lustre /mnt/lustre Started clients oleg237-client.virtnet: 192.168.202.137@tcp:/lustre on /mnt/lustre type lustre (rw,checksum,flock,user_xattr,lruresize,lazystatfs,nouser_fid2path,verbose,noencrypt,statfs_project) Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-OST0000 PASS 74 (31s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 80a: DNE: create remote dir, drop update rep from MDT0, fail MDT0 ========================================================== 22:42:46 (1716259366) fail_loc=0x1701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2444 1285244 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2060 1285628 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1580 3605440 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605464 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 22:42:51 (1716259371) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 22:43:06 (1716259386) targets are mounted 22:43:06 (1716259386) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec total: 20 open/close in 0.10 seconds: 190.55 ops/second PASS 80a (28s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 80b: DNE: create remote dir, drop update rep from MDT0, fail MDT1 ========================================================== 22:43:16 (1716259396) fail_loc=0x1701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2484 1285204 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2060 1285628 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1580 3605440 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605464 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2484 1285204 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2060 1285628 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1580 3605440 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605464 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre Failing mds2 on oleg237-server Stopping /mnt/lustre-mds2 (opts:) on oleg237-server 22:43:30 (1716259410) shut down Failover mds2 to oleg237-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0001 22:43:45 (1716259425) targets are mounted 22:43:45 (1716259425) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec total: 20 open/close in 0.10 seconds: 191.29 ops/second PASS 80b (37s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 80c: DNE: create remote dir, drop update rep from MDT1, fail MDT[0,1] ========================================================== 22:43:55 (1716259435) fail_loc=0x1701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2520 1285168 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2096 1285592 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1580 3605440 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605464 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2520 1285168 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2096 1285592 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1580 3605440 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605464 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 22:44:03 (1716259443) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 22:44:18 (1716259458) targets are mounted 22:44:18 (1716259458) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Failing mds2 on oleg237-server Stopping /mnt/lustre-mds2 (opts:) on oleg237-server 22:44:25 (1716259465) shut down Failover mds2 to oleg237-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0001 22:44:41 (1716259481) targets are mounted 22:44:41 (1716259481) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec total: 20 open/close in 0.10 seconds: 190.74 ops/second PASS 80c (54s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 80d: DNE: create remote dir, drop update rep from MDT1, fail 2 MDTs ========================================================== 22:44:51 (1716259491) fail_loc=0x1701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2556 1285132 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2060 1285628 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1580 3605440 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605464 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2556 1285132 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2060 1285628 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1580 3605440 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605464 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server Failing mds2 on oleg237-server Stopping /mnt/lustre-mds2 (opts:) on oleg237-server 22:45:09 (1716259509) shut down Failover mds1 to oleg237-server mount facets: mds1 Failover mds2 to oleg237-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 Started lustre-MDT0001 22:45:29 (1716259529) targets are mounted 22:45:29 (1716259529) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid,mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec total: 20 open/close in 0.10 seconds: 194.94 ops/second PASS 80d (59s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 80e: DNE: create remote dir, drop MDT1 rep, fail MDT0 ========================================================== 22:45:52 (1716259552) fail_loc=0x119 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2484 1285204 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2060 1285628 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1580 3605440 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605464 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 22:46:00 (1716259560) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 22:46:15 (1716259575) targets are mounted 22:46:15 (1716259575) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec total: 20 open/close in 0.10 seconds: 197.90 ops/second PASS 80e (58s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 80f: DNE: create remote dir, drop MDT1 rep, fail MDT1 ========================================================== 22:46:52 (1716259612) fail_loc=0x119 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2484 1285204 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2060 1285628 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1580 3605440 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605464 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre Failing mds2 on oleg237-server Stopping /mnt/lustre-mds2 (opts:) on oleg237-server 22:46:57 (1716259617) shut down Failover mds2 to oleg237-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0001 22:47:13 (1716259633) targets are mounted 22:47:13 (1716259633) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec total: 20 open/close in 0.10 seconds: 201.29 ops/second PASS 80f (28s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 80g: DNE: create remote dir, drop MDT1 rep, fail MDT0, then MDT1 ========================================================== 22:47:23 (1716259643) fail_loc=0x119 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2484 1285204 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2060 1285628 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1580 3605440 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605464 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2484 1285204 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2060 1285628 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1580 3605440 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605464 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 22:47:33 (1716259653) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 22:47:49 (1716259669) targets are mounted 22:47:49 (1716259669) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Failing mds2 on oleg237-server Stopping /mnt/lustre-mds2 (opts:) on oleg237-server 22:47:56 (1716259676) shut down Failover mds2 to oleg237-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0001 22:48:11 (1716259691) targets are mounted 22:48:11 (1716259691) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec total: 20 open/close in 0.10 seconds: 193.20 ops/second PASS 80g (56s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 80h: DNE: create remote dir, drop MDT1 rep, fail 2 MDTs ========================================================== 22:48:21 (1716259701) fail_loc=0x119 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2484 1285204 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2060 1285628 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1580 3605440 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605464 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2484 1285204 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2060 1285628 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1580 3605440 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605464 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server Failing mds2 on oleg237-server Stopping /mnt/lustre-mds2 (opts:) on oleg237-server 22:48:33 (1716259713) shut down Failover mds1 to oleg237-server mount facets: mds1 Failover mds2 to oleg237-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 Started lustre-MDT0001 22:48:58 (1716259738) targets are mounted 22:48:58 (1716259738) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid,mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec total: 20 open/close in 0.10 seconds: 196.08 ops/second PASS 80h (47s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 81a: DNE: unlink remote dir, drop MDT0 update rep, fail MDT1 ========================================================== 22:49:10 (1716259750) fail_loc=0x1701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2480 1285208 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2060 1285628 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1580 3605440 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605464 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre Failing mds2 on oleg237-server Stopping /mnt/lustre-mds2 (opts:) on oleg237-server 22:49:21 (1716259761) shut down Failover mds2 to oleg237-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0001 22:49:36 (1716259776) targets are mounted 22:49:36 (1716259776) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 81a (34s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 81b: DNE: unlink remote dir, drop MDT0 update reply, fail MDT0 ========================================================== 22:49:46 (1716259786) fail_loc=0x1701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2516 1285172 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2064 1285624 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1580 3605440 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605464 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 22:49:51 (1716259791) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 22:50:06 (1716259806) targets are mounted 22:50:06 (1716259806) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 81b (28s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 81c: DNE: unlink remote dir, drop MDT0 update reply, fail MDT0,MDT1 ========================================================== 22:50:16 (1716259816) fail_loc=0x1701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2480 1285208 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2064 1285624 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1580 3605440 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605464 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2480 1285208 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2064 1285624 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1580 3605440 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605464 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 22:50:23 (1716259823) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 22:50:39 (1716259839) targets are mounted 22:50:39 (1716259839) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Failing mds2 on oleg237-server Stopping /mnt/lustre-mds2 (opts:) on oleg237-server 22:50:46 (1716259846) shut down Failover mds2 to oleg237-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0001 22:51:01 (1716259861) targets are mounted 22:51:01 (1716259861) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 81c (53s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 81d: DNE: unlink remote dir, drop MDT0 update reply, fail 2 MDTs ========================================================== 22:51:11 (1716259871) fail_loc=0x1701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2516 1285172 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2068 1285620 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1580 3605440 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605464 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2516 1285172 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2068 1285620 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1580 3605440 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605464 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server Failing mds2 on oleg237-server Stopping /mnt/lustre-mds2 (opts:) on oleg237-server 22:51:26 (1716259886) shut down Failover mds1 to oleg237-server mount facets: mds1 Failover mds2 to oleg237-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0001 Started lustre-MDT0000 22:51:47 (1716259907) targets are mounted 22:51:47 (1716259907) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid,mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 81d (45s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 81e: DNE: unlink remote dir, drop MDT1 req reply, fail MDT0 ========================================================== 22:51:58 (1716259918) fail_loc=0x119 fail_loc=0 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2480 1285208 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2068 1285620 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1580 3605440 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605464 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 22:52:03 (1716259923) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 22:52:19 (1716259939) targets are mounted 22:52:19 (1716259939) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 81e (28s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 81f: DNE: unlink remote dir, drop MDT1 req reply, fail MDT1 ========================================================== 22:52:28 (1716259948) fail_loc=0x119 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2480 1285208 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2068 1285620 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1580 3605440 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605464 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre Failing mds2 on oleg237-server Stopping /mnt/lustre-mds2 (opts:) on oleg237-server 22:52:34 (1716259954) shut down Failover mds2 to oleg237-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0001 22:52:49 (1716259969) targets are mounted 22:52:49 (1716259969) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 81f (28s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 81g: DNE: unlink remote dir, drop req reply, fail M0, then M1 ========================================================== 22:52:59 (1716259979) fail_loc=0x119 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2480 1285208 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2068 1285620 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1580 3605440 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605464 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2480 1285208 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2068 1285620 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1580 3605440 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605464 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 22:53:06 (1716259986) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 22:53:22 (1716260002) targets are mounted 22:53:22 (1716260002) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Failing mds2 on oleg237-server Stopping /mnt/lustre-mds2 (opts:) on oleg237-server 22:53:29 (1716260009) shut down Failover mds2 to oleg237-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0001 22:53:44 (1716260024) targets are mounted 22:53:44 (1716260024) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 81g (53s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 81h: DNE: unlink remote dir, drop request reply, fail 2 MDTs ========================================================== 22:53:54 (1716260034) fail_loc=0x119 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2480 1285208 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2068 1285620 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1580 3605440 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605464 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2480 1285208 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2068 1285620 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1580 3605440 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1556 3605464 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3136 7210904 1% /mnt/lustre Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server Failing mds2 on oleg237-server Stopping /mnt/lustre-mds2 (opts:) on oleg237-server 22:54:03 (1716260043) shut down Failover mds1 to oleg237-server mount facets: mds1 Failover mds2 to oleg237-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0001 Started lustre-MDT0000 22:54:29 (1716260069) targets are mounted 22:54:29 (1716260069) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid,mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 81h (45s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 84a: stale open during export disconnect ========================================================== 22:54:41 (1716260081) fail_loc=0x80000144 total: 1 open/close in 0.01 seconds: 100.49 ops/second oleg237-client: error: invalid path '/mnt/lustre': Input/output error pdsh@oleg237-client: oleg237-client: ssh exited with exit code 5 PASS 84a (5s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 85a: check the cancellation of unused locks during recovery(IBITS) ========================================================== 22:54:49 (1716260089) before recovery: unused locks count = 201 Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 22:54:52 (1716260092) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 22:55:06 (1716260106) targets are mounted 22:55:06 (1716260106) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec after recovery: unused locks count = 101 PASS 85a (26s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 85b: check the cancellation of unused locks during recovery(EXTENT) ========================================================== 22:55:16 (1716260116) before recovery: unused locks count = 100 Failing ost1 on oleg237-server Stopping /mnt/lustre-ost1 (opts:) on oleg237-server 22:55:22 (1716260122) shut down Failover ost1 to oleg237-server mount facets: ost1 Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-OST0000 22:55:37 (1716260137) targets are mounted 22:55:37 (1716260137) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec after recovery: unused locks count = 0 PASS 85b (27s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 86: umount server after clear nid_stats should not hit LBUG ========================================================== 22:55:45 (1716260145) Stopping clients: oleg237-client.virtnet /mnt/lustre (opts:) Stopping client oleg237-client.virtnet /mnt/lustre opts: mdt.lustre-MDT0000.exports.clear=0 mdt.lustre-MDT0001.exports.clear=0 Stopping /mnt/lustre-mds1 (opts:) on oleg237-server Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 Starting client oleg237-client.virtnet: -o user_xattr,flock oleg237-server@tcp:/lustre /mnt/lustre Started clients oleg237-client.virtnet: 192.168.202.137@tcp:/lustre on /mnt/lustre type lustre (rw,checksum,flock,user_xattr,lruresize,lazystatfs,nouser_fid2path,verbose,noencrypt,statfs_project) PASS 86 (14s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 87a: write replay ================== 22:56:01 (1716260161) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2476 1285212 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2072 1285616 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1780 3605240 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1756 3605264 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3536 7210504 1% /mnt/lustre 8+0 records in 8+0 records out 8388608 bytes (8.4 MB) copied, 0.158749 s, 52.8 MB/s Failing ost1 on oleg237-server Stopping /mnt/lustre-ost1 (opts:) on oleg237-server 22:56:06 (1716260166) shut down Failover ost1 to oleg237-server mount facets: ost1 Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-OST0000 22:56:22 (1716260182) targets are mounted 22:56:22 (1716260182) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec 8+0 records in 8+0 records out 8388608 bytes (8.4 MB) copied, 0.0612076 s, 137 MB/s PASS 87a (27s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 87b: write replay with changed data (checksum resend) ========================================================== 22:56:30 (1716260190) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2476 1285212 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2072 1285616 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 9972 3597048 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1756 3605264 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 11728 7202312 1% /mnt/lustre 8+0 records in 8+0 records out 8388608 bytes (8.4 MB) copied, 0.133438 s, 62.9 MB/s 8+0 records in 8+0 records out 8 bytes (8 B) copied, 0.00261938 s, 3.1 kB/s Failing ost1 on oleg237-server Stopping /mnt/lustre-ost1 (opts:) on oleg237-server 22:56:36 (1716260196) shut down Failover ost1 to oleg237-server mount facets: ost1 Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-OST0000 22:56:52 (1716260212) targets are mounted 22:56:52 (1716260212) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec 0+1 records in 0+1 records out 72 bytes (72 B) copied, 0.00466485 s, 15.4 kB/s PASS 87b (28s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 88: MDS should not assign same objid to different files ========================================================== 22:57:00 (1716260220) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2480 1285208 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2072 1285616 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 9976 3597044 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1756 3605264 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 11732 7202308 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2480 1285208 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2072 1285616 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 9976 3597044 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1756 3605264 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 11732 7202308 1% /mnt/lustre before test: last_id = 14145, next_id = 14116 Creating to objid 14145 on ost lustre-OST0000... total: 31 open/close in 0.15 seconds: 200.46 ops/second total: 8 open/close in 0.04 seconds: 201.24 ops/second before recovery: last_id = 14177, next_id = 14154 Stopping /mnt/lustre-mds1 (opts:) on oleg237-server Stopping /mnt/lustre-ost1 (opts:) on oleg237-server Failover mds1 to oleg237-server Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 Failover ost1 to oleg237-server Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-OST0000 after recovery: last_id = 14185, next_id = 14154 128+0 records in 128+0 records out 524288 bytes (524 kB) copied, 0.0482841 s, 10.9 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB) copied, 0.0438068 s, 12.0 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB) copied, 0.0399445 s, 13.1 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB) copied, 0.0369376 s, 14.2 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB) copied, 0.0324281 s, 16.2 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB) copied, 0.028275 s, 18.5 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB) copied, 0.0260756 s, 20.1 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB) copied, 0.0253736 s, 20.7 MB/s -rw-r--r-- 1 root root 0 May 20 22:57 /mnt/lustre/d88.replay-single/f-14116 -rw-r--r-- 1 root root 0 May 20 22:57 /mnt/lustre/d88.replay-single/f-14117 -rw-r--r-- 1 root root 0 May 20 22:57 /mnt/lustre/d88.replay-single/f-14118 -rw-r--r-- 1 root root 0 May 20 22:57 /mnt/lustre/d88.replay-single/f-14119 -rw-r--r-- 1 root root 0 May 20 22:57 /mnt/lustre/d88.replay-single/f-14120 -rw-r--r-- 1 root root 0 May 20 22:57 /mnt/lustre/d88.replay-single/f-14121 -rw-r--r-- 1 root root 0 May 20 22:57 /mnt/lustre/d88.replay-single/f-14122 -rw-r--r-- 1 root root 0 May 20 22:57 /mnt/lustre/d88.replay-single/f-14123 -rw-r--r-- 1 root root 0 May 20 22:57 /mnt/lustre/d88.replay-single/f-14124 -rw-r--r-- 1 root root 0 May 20 22:57 /mnt/lustre/d88.replay-single/f-14125 -rw-r--r-- 1 root root 0 May 20 22:57 /mnt/lustre/d88.replay-single/f-14126 -rw-r--r-- 1 root root 0 May 20 22:57 /mnt/lustre/d88.replay-single/f-14127 -rw-r--r-- 1 root root 0 May 20 22:57 /mnt/lustre/d88.replay-single/f-14128 -rw-r--r-- 1 root root 0 May 20 22:57 /mnt/lustre/d88.replay-single/f-14129 -rw-r--r-- 1 root root 0 May 20 22:57 /mnt/lustre/d88.replay-single/f-14130 -rw-r--r-- 1 root root 0 May 20 22:57 /mnt/lustre/d88.replay-single/f-14131 -rw-r--r-- 1 root root 0 May 20 22:57 /mnt/lustre/d88.replay-single/f-14132 -rw-r--r-- 1 root root 0 May 20 22:57 /mnt/lustre/d88.replay-single/f-14133 -rw-r--r-- 1 root root 0 May 20 22:57 /mnt/lustre/d88.replay-single/f-14134 -rw-r--r-- 1 root root 0 May 20 22:57 /mnt/lustre/d88.replay-single/f-14135 -rw-r--r-- 1 root root 0 May 20 22:57 /mnt/lustre/d88.replay-single/f-14136 -rw-r--r-- 1 root root 0 May 20 22:57 /mnt/lustre/d88.replay-single/f-14137 -rw-r--r-- 1 root root 0 May 20 22:57 /mnt/lustre/d88.replay-single/f-14138 -rw-r--r-- 1 root root 0 May 20 22:57 /mnt/lustre/d88.replay-single/f-14139 -rw-r--r-- 1 root root 0 May 20 22:57 /mnt/lustre/d88.replay-single/f-14140 -rw-r--r-- 1 root root 0 May 20 22:57 /mnt/lustre/d88.replay-single/f-14141 -rw-r--r-- 1 root root 0 May 20 22:57 /mnt/lustre/d88.replay-single/f-14142 -rw-r--r-- 1 root root 0 May 20 22:57 /mnt/lustre/d88.replay-single/f-14143 -rw-r--r-- 1 root root 0 May 20 22:57 /mnt/lustre/d88.replay-single/f-14144 -rw-r--r-- 1 root root 0 May 20 22:57 /mnt/lustre/d88.replay-single/f-14145 -rw-r--r-- 1 root root 0 May 20 22:57 /mnt/lustre/d88.replay-single/f-14146 -rw-r--r-- 1 root root 0 May 20 22:57 /mnt/lustre/d88.replay-single/f-14147 -rw-r--r-- 1 root root 0 May 20 22:57 /mnt/lustre/d88.replay-single/f-14148 -rw-r--r-- 1 root root 0 May 20 22:57 /mnt/lustre/d88.replay-single/f-14149 -rw-r--r-- 1 root root 0 May 20 22:57 /mnt/lustre/d88.replay-single/f-14150 -rw-r--r-- 1 root root 0 May 20 22:57 /mnt/lustre/d88.replay-single/f-14151 -rw-r--r-- 1 root root 0 May 20 22:57 /mnt/lustre/d88.replay-single/f-14152 -rw-r--r-- 1 root root 0 May 20 22:57 /mnt/lustre/d88.replay-single/f-14153 -rw-r--r-- 1 root root 524288 May 20 22:57 /mnt/lustre/d88.replay-single/f-14157 -rw-r--r-- 1 root root 524288 May 20 22:57 /mnt/lustre/d88.replay-single/f-14158 -rw-r--r-- 1 root root 524288 May 20 22:57 /mnt/lustre/d88.replay-single/f-14159 -rw-r--r-- 1 root root 524288 May 20 22:57 /mnt/lustre/d88.replay-single/f-14160 -rw-r--r-- 1 root root 524288 May 20 22:57 /mnt/lustre/d88.replay-single/f-14161 -rw-r--r-- 1 root root 524288 May 20 22:57 /mnt/lustre/d88.replay-single/f-14162 -rw-r--r-- 1 root root 524288 May 20 22:57 /mnt/lustre/d88.replay-single/f-14163 -rw-r--r-- 1 root root 524288 May 20 22:57 /mnt/lustre/d88.replay-single/f-14164 128+0 records in 128+0 records out 524288 bytes (524 kB) copied, 0.0414169 s, 12.7 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB) copied, 0.0395644 s, 13.3 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB) copied, 0.0394927 s, 13.3 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB) copied, 0.0387764 s, 13.5 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB) copied, 0.0386199 s, 13.6 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB) copied, 0.0385364 s, 13.6 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB) copied, 0.0385103 s, 13.6 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB) copied, 0.0432076 s, 12.1 MB/s PASS 88 (57s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 89: no disk space leak on late ost connection ========================================================== 22:57:59 (1716260279) Waiting for orphan cleanup... osp.lustre-OST0000-osc-MDT0000.old_sync_processed osp.lustre-OST0000-osc-MDT0001.old_sync_processed osp.lustre-OST0001-osc-MDT0000.old_sync_processed osp.lustre-OST0001-osc-MDT0001.old_sync_processed wait 40 secs maximumly for oleg237-server mds-ost sync done. Waiting for MDT destroys to complete 10+0 records in 10+0 records out 10485760 bytes (10 MB) copied, 0.0889879 s, 118 MB/s Stopping /mnt/lustre-ost1 (opts:) on oleg237-server Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 22:58:06 (1716260286) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 22:58:20 (1716260300) targets are mounted 22:58:20 (1716260300) facet_failover done Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-OST0000 Starting client: oleg237-client.virtnet: -o user_xattr,flock oleg237-server@tcp:/lustre /mnt/lustre osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 68 sec Waiting for orphan cleanup... osp.lustre-OST0000-osc-MDT0000.old_sync_processed osp.lustre-OST0000-osc-MDT0001.old_sync_processed osp.lustre-OST0001-osc-MDT0000.old_sync_processed osp.lustre-OST0001-osc-MDT0001.old_sync_processed wait 40 secs maximumly for oleg237-server mds-ost sync done. Waiting for MDT destroys to complete free_before: 7646308 free_after: 7646308 PASS 89 (111s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 90: lfs find identifies the missing striped file segments ========================================================== 22:59:53 (1716260393) Create the files Fail ost1 lustre-OST0000_UUID, display the list of affected files Stopping /mnt/lustre-ost1 (opts:) on oleg237-server General Query: lfs find /mnt/lustre/d90.replay-single /mnt/lustre/d90.replay-single /mnt/lustre/d90.replay-single/all /mnt/lustre/d90.replay-single/f1 /mnt/lustre/d90.replay-single/f0 Querying files on shutdown ost1: lfs find --obd lustre-OST0000_UUID /mnt/lustre/d90.replay-single/all /mnt/lustre/d90.replay-single/f0 Check getstripe: /home/green/git/lustre-release/lustre/utils/lfs getstripe -r --obd lustre-OST0000_UUID /mnt/lustre/d90.replay-single/all lmm_stripe_count: 2 lmm_stripe_size: 4194304 lmm_pattern: raid0 lmm_layout_gen: 0 lmm_stripe_offset: 0 obdidx objid objid group 0 14186 0x376a 0x280000401 * /mnt/lustre/d90.replay-single/f0 lmm_stripe_count: 1 lmm_stripe_size: 4194304 lmm_pattern: raid0 lmm_layout_gen: 0 lmm_stripe_offset: 0 obdidx objid objid group 0 14187 0x376b 0x280000401 * /mnt/lustre/d90.replay-single/all /mnt/lustre/d90.replay-single/f0 Failover ost1 to oleg237-server Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-OST0000 PASS 90 (20s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 93a: replay + reconnect ============ 23:00:16 (1716260416) 1+0 records in 1+0 records out 1024 bytes (1.0 kB) copied, 0.00288002 s, 356 kB/s fail_val=40 fail_loc=0x715 Failing ost1 on oleg237-server Stopping /mnt/lustre-ost1 (opts:) on oleg237-server 23:00:18 (1716260418) shut down Failover ost1 to oleg237-server mount facets: ost1 Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-OST0000 23:00:33 (1716260433) targets are mounted 23:00:33 (1716260433) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec PASS 93a (60s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 93b: replay + reconnect on mds ===== 23:01:18 (1716260478) total: 20 open/close in 0.15 seconds: 132.95 ops/second fail_val=80 fail_loc=0x715 Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 23:01:21 (1716260481) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 23:01:35 (1716260495) targets are mounted 23:01:35 (1716260495) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 93b (105s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 100a: DNE: create striped dir, drop update rep from MDT1, fail MDT1 ========================================================== 23:03:05 (1716260585) fail_loc=0x1701 Failing mds2 on oleg237-server Stopping /mnt/lustre-mds2 (opts:) on oleg237-server 23:03:07 (1716260587) shut down Failover mds2 to oleg237-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0001 23:03:21 (1716260601) targets are mounted 23:03:21 (1716260601) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec lmv_stripe_count: 2 lmv_stripe_offset: 0 lmv_hash_type: crush mdtidx FID[seq:oid:ver] 0 [0x200033080:0x1:0x0] 1 [0x24000bb9a:0x1:0x0] total: 20 open/close in 0.11 seconds: 187.02 ops/second PASS 100a (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 100b: DNE: create striped dir, fail MDT0 ========================================================== 23:03:31 (1716260611) fail_loc=0x119 Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 23:03:34 (1716260614) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 23:03:48 (1716260628) targets are mounted 23:03:48 (1716260628) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec lmv_stripe_count: 2 lmv_stripe_offset: 0 lmv_hash_type: crush mdtidx FID[seq:oid:ver] 0 [0x200033080:0x4:0x0] 1 [0x24000bb9a:0x4:0x0] total: 20 open/close in 0.10 seconds: 191.48 ops/second PASS 100b (25s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 100c: DNE: create striped dir, abort_recov_mdt mds2 ========================================================== 23:03:58 (1716260638) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2572 1285116 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2148 1285540 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18172 3588848 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1756 3605264 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19928 7194112 1% /mnt/lustre Stopping /mnt/lustre-mds2 (opts:) on oleg237-server Failover mds2 to oleg237-server Starting mds2: -o localrecov -o abort_recov_mdt /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0001 total: 20 open/close in 0.11 seconds: 177.83 ops/second Failing mds2 on oleg237-server Stopping /mnt/lustre-mds2 (opts:) on oleg237-server 23:04:21 (1716260661) shut down Failover mds2 to oleg237-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0001 23:04:35 (1716260675) targets are mounted 23:04:35 (1716260675) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec lmv_stripe_count: 2 lmv_stripe_offset: 1 lmv_hash_type: crush mdtidx FID[seq:oid:ver] 1 [0x24000c368:0x3:0x0] 0 [0x200033081:0x3:0x0] total: 20 open/close in 0.10 seconds: 204.74 ops/second PASS 100c (44s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 100d: DNE: cancel update logs upon recovery abort ========================================================== 23:04:45 (1716260685) striped dir -i0 -c2 -H crush2 /mnt/lustre/d100d.replay-single total: 100 mkdir in 0.48 seconds: 206.77 ops/second lustre-MDT0000-osd [catalog]: [0x20001a210:0x1:0x0] [index]: 00017 [logid]: [0x200033850:0x1:0x0] lustre-MDT0001-osp-MDT0000 [catalog]: [0x2400007ec:0x1:0x0] [index]: 00025 [logid]: [0x24000c369:0x1:0x0] [index]: 00026 [logid]: [0x24000c369:0x2:0x0] Stopping /mnt/lustre-mds1 (opts:) on oleg237-server Failover mds1 to oleg237-server Starting mds1: -o localrecov -o abort_recovery /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 oleg237-client: error: invalid path '/mnt/lustre': Input/output error pdsh@oleg237-client: oleg237-client: ssh exited with exit code 5 first stat failed: 5 pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 find: '/mnt/lustre/d100d.replay-single': No such file or directory find: '/mnt/lustre/d100d.replay-single': No such file or directory PASS 100d (23s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 100e: DNE: create striped dir on MDT0 and MDT1, fail MDT0, MDT1 ========================================================== 23:05:10 (1716260710) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2988 1284700 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2532 1285156 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18172 3588848 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1756 3605264 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19928 7194112 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2988 1284700 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2532 1285156 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18172 3588848 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1756 3605264 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19928 7194112 1% /mnt/lustre lmv_stripe_count: 2 lmv_stripe_offset: 0 lmv_hash_type: crush mdtidx FID[seq:oid:ver] 0 [0x200034020:0x2:0x0] 1 [0x24000d309:0x2:0x0] Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server Failing mds2 on oleg237-server Stopping /mnt/lustre-mds2 (opts:) on oleg237-server 23:05:19 (1716260719) shut down Failover mds1 to oleg237-server mount facets: mds1 Failover mds2 to oleg237-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 Started lustre-MDT0001 23:05:46 (1716260746) targets are mounted 23:05:46 (1716260746) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid,mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec lmv_stripe_count: 2 lmv_stripe_offset: 0 lmv_hash_type: crush mdtidx FID[seq:oid:ver] 0 [0x200034020:0x2:0x0] 1 [0x24000d309:0x2:0x0] PASS 100e (42s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 101: Shouldn't reassign precreated objs to other files after recovery ========================================================== 23:05:54 (1716260754) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3104 1284584 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2636 1285052 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18172 3588848 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1756 3605264 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19928 7194112 1% /mnt/lustre Stopping /mnt/lustre-mds1 (opts:) on oleg237-server Failover mds1 to oleg237-server Starting mds1: -o localrecov -o abort_recovery /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 pdsh@oleg237-client: oleg237-client: ssh exited with exit code 5 first stat failed: 5 PASS 101 (53s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 102a: check resend (request lost) with multiple modify RPCs in flight ========================================================== 23:06:49 (1716260809) creating 7 files ... fail_loc=0x159 launch 7 chmod in parallel (23:06:50) ... fail_loc=0 done (23:07:06) /mnt/lustre/d102a.replay-single/file-1 has perms 0600 OK /mnt/lustre/d102a.replay-single/file-2 has perms 0600 OK /mnt/lustre/d102a.replay-single/file-3 has perms 0600 OK /mnt/lustre/d102a.replay-single/file-4 has perms 0600 OK /mnt/lustre/d102a.replay-single/file-5 has perms 0600 OK /mnt/lustre/d102a.replay-single/file-6 has perms 0600 OK /mnt/lustre/d102a.replay-single/file-7 has perms 0600 OK PASS 102a (19s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 102b: check resend (reply lost) with multiple modify RPCs in flight ========================================================== 23:07:10 (1716260830) creating 7 files ... fail_loc=0x15a launch 7 chmod in parallel (23:07:11) ... fail_loc=0 done (23:07:27) /mnt/lustre/d102b.replay-single/file-1 has perms 0600 OK /mnt/lustre/d102b.replay-single/file-2 has perms 0600 OK /mnt/lustre/d102b.replay-single/file-3 has perms 0600 OK /mnt/lustre/d102b.replay-single/file-4 has perms 0600 OK /mnt/lustre/d102b.replay-single/file-5 has perms 0600 OK /mnt/lustre/d102b.replay-single/file-6 has perms 0600 OK /mnt/lustre/d102b.replay-single/file-7 has perms 0600 OK PASS 102b (19s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 102c: check replay w/o reconstruction with multiple mod RPCs in flight ========================================================== 23:07:31 (1716260851) creating 7 files ... UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2992 1284696 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2456 1285232 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18172 3580644 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1756 3597060 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19928 7177704 1% /mnt/lustre fail_loc=0x15a launch 7 chmod in parallel (23:07:35) ... fail_loc=0 Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 23:07:38 (1716260858) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 23:07:53 (1716260873) targets are mounted 23:07:53 (1716260873) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec done (23:07:59) /mnt/lustre/d102c.replay-single/file-1 has perms 0600 OK /mnt/lustre/d102c.replay-single/file-2 has perms 0600 OK /mnt/lustre/d102c.replay-single/file-3 has perms 0600 OK /mnt/lustre/d102c.replay-single/file-4 has perms 0600 OK /mnt/lustre/d102c.replay-single/file-5 has perms 0600 OK /mnt/lustre/d102c.replay-single/file-6 has perms 0600 OK /mnt/lustre/d102c.replay-single/file-7 has perms 0600 OK PASS 102c (30s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 102d: check replay & reconstruction with multiple mod RPCs in flight ========================================================== 23:08:03 (1716260883) creating 7 files ... fail_loc=0x15a launch 7 chmod in parallel (23:08:04) ... fail_loc=0 Failing mds2 on oleg237-server Stopping /mnt/lustre-mds2 (opts:) on oleg237-server 23:08:07 (1716260887) shut down Failover mds2 to oleg237-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0001 23:08:21 (1716260901) targets are mounted 23:08:21 (1716260901) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec done (23:08:27) /mnt/lustre/d102d.replay-single/file-1 has perms 0600 OK /mnt/lustre/d102d.replay-single/file-2 has perms 0600 OK /mnt/lustre/d102d.replay-single/file-3 has perms 0600 OK /mnt/lustre/d102d.replay-single/file-4 has perms 0600 OK /mnt/lustre/d102d.replay-single/file-5 has perms 0600 OK /mnt/lustre/d102d.replay-single/file-6 has perms 0600 OK /mnt/lustre/d102d.replay-single/file-7 has perms 0600 OK PASS 102d (26s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 103: Check otr_next_id overflow ==== 23:08:31 (1716260911) fail_loc=0x80000162 total: 30 open/close in 0.16 seconds: 189.45 ops/second Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 23:08:34 (1716260914) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 23:08:48 (1716260928) targets are mounted 23:08:48 (1716260928) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 103 (25s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 110a: DNE: create striped dir, fail MDT1 ========================================================== 23:08:58 (1716260938) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2944 1284744 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2460 1285228 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18172 3580644 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1756 3597060 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19928 7177704 1% /mnt/lustre Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 23:09:03 (1716260943) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 23:09:18 (1716260958) targets are mounted 23:09:18 (1716260958) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/d110a.replay-single/striped_dir has type dir OK PASS 110a (28s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 110b: DNE: create striped dir, fail MDT1 and client ========================================================== 23:09:28 (1716260968) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2948 1284740 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2464 1285224 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18172 3580644 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1756 3597060 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19928 7177704 1% /mnt/lustre Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 23:09:33 (1716260973) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 23:09:48 (1716260988) targets are mounted 23:09:48 (1716260988) facet_failover done pdsh@oleg237-client: oleg237-client: ssh exited with exit code 95 oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid pdsh@oleg237-client: oleg237-client: ssh exited with exit code 95 Starting client: oleg237-client.virtnet: -o user_xattr,flock oleg237-server@tcp:/lustre /mnt/lustre /mnt/lustre/d110b.replay-single/striped_dir has type dir OK PASS 110b (97s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 110c: DNE: create striped dir, fail MDT2 ========================================================== 23:11:08 (1716261068) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2952 1284736 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2464 1285224 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18172 3588848 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1756 3605264 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19928 7194112 1% /mnt/lustre Failing mds2 on oleg237-server Stopping /mnt/lustre-mds2 (opts:) on oleg237-server 23:11:13 (1716261073) shut down Failover mds2 to oleg237-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0001 23:11:28 (1716261088) targets are mounted 23:11:28 (1716261088) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/d110c.replay-single/striped_dir has type dir OK PASS 110c (28s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 110d: DNE: create striped dir, fail MDT2 and client ========================================================== 23:11:38 (1716261098) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2992 1284696 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2508 1285180 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18172 3588848 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1756 3605264 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19928 7194112 1% /mnt/lustre Failing mds2 on oleg237-server Stopping /mnt/lustre-mds2 (opts:) on oleg237-server 23:11:43 (1716261103) shut down Failover mds2 to oleg237-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0001 23:11:58 (1716261118) targets are mounted 23:11:58 (1716261118) facet_failover done pdsh@oleg237-client: oleg237-client: ssh exited with exit code 95 oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid pdsh@oleg237-client: oleg237-client: ssh exited with exit code 95 Starting client: oleg237-client.virtnet: -o user_xattr,flock oleg237-server@tcp:/lustre /mnt/lustre /mnt/lustre/d110d.replay-single/striped_dir has type dir OK PASS 110d (96s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 110e: DNE: create striped dir, uncommit on MDT2, fail client/MDT1/MDT2 ========================================================== 23:13:16 (1716261196) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2940 1284748 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2456 1285232 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18172 3588848 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1756 3605264 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19928 7194112 1% /mnt/lustre df:/mnt/lustre Not a Lustre filesystem Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server Failing mds2 on oleg237-server Stopping /mnt/lustre-mds2 (opts:) on oleg237-server 23:13:25 (1716261205) shut down Failover mds1 to oleg237-server mount facets: mds1 Failover mds2 to oleg237-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 Started lustre-MDT0001 23:13:51 (1716261231) targets are mounted 23:13:51 (1716261231) facet_failover done pdsh@oleg237-client: oleg237-client: ssh exited with exit code 95 oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid,mdc.lustre-MDT0001-mdc-*.mds_server_uuid pdsh@oleg237-client: oleg237-client: ssh exited with exit code 95 Starting client: oleg237-client.virtnet: -o user_xattr,flock oleg237-server@tcp:/lustre /mnt/lustre /mnt/lustre/d110e.replay-single/striped_dir has type dir OK PASS 110e (111s) debug_raw_pointers=0 debug_raw_pointers=0 SKIP: replay-single test_110f skipping excluded test 110f debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 110g: DNE: create striped dir, uncommit on MDT1, fail client/MDT1/MDT2 ========================================================== 23:15:10 (1716261310) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2984 1284704 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2492 1285196 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18172 3588848 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1756 3605264 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19928 7194112 1% /mnt/lustre df:/mnt/lustre Not a Lustre filesystem Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server Failing mds2 on oleg237-server Stopping /mnt/lustre-mds2 (opts:) on oleg237-server 23:15:24 (1716261324) shut down Failover mds1 to oleg237-server mount facets: mds1 Failover mds2 to oleg237-server mount facets: mds2 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0001 Started lustre-MDT0000 23:15:45 (1716261345) targets are mounted 23:15:45 (1716261345) facet_failover done pdsh@oleg237-client: oleg237-client: ssh exited with exit code 95 oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid,mdc.lustre-MDT0001-mdc-*.mds_server_uuid pdsh@oleg237-client: oleg237-client: ssh exited with exit code 95 Starting client: oleg237-client.virtnet: -o user_xattr,flock oleg237-server@tcp:/lustre /mnt/lustre /mnt/lustre/d110g.replay-single/striped_dir has type dir OK PASS 110g (111s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 111a: DNE: unlink striped dir, fail MDT1 ========================================================== 23:17:04 (1716261424) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2956 1284732 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2468 1285220 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18172 3588848 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1756 3605264 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19928 7194112 1% /mnt/lustre Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 23:17:08 (1716261428) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 23:17:24 (1716261444) targets are mounted 23:17:24 (1716261444) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Can't lstat /mnt/lustre/d111a.replay-single/striped_dir: No such file or directory PASS 111a (27s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 111b: DNE: unlink striped dir, fail MDT2 ========================================================== 23:17:34 (1716261454) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2960 1284728 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2476 1285212 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18172 3588848 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1756 3605264 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19928 7194112 1% /mnt/lustre Failing mds2 on oleg237-server Stopping /mnt/lustre-mds2 (opts:) on oleg237-server 23:17:38 (1716261458) shut down Failover mds2 to oleg237-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0001 23:17:54 (1716261474) targets are mounted 23:17:54 (1716261474) facet_failover done pdsh@oleg237-client: oleg237-client: ssh exited with exit code 95 oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid pdsh@oleg237-client: oleg237-client: ssh exited with exit code 95 Starting client: oleg237-client.virtnet: -o user_xattr,flock oleg237-server@tcp:/lustre /mnt/lustre Can't lstat /mnt/lustre/d111b.replay-single/striped_dir: No such file or directory PASS 111b (95s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 111c: DNE: unlink striped dir, uncommit on MDT1, fail client/MDT1/MDT2 ========================================================== 23:19:11 (1716261551) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3004 1284684 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2472 1285216 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18172 3588848 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1756 3605264 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19928 7194112 1% /mnt/lustre df:/mnt/lustre Not a Lustre filesystem Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server Failing mds2 on oleg237-server Stopping /mnt/lustre-mds2 (opts:) on oleg237-server 23:19:26 (1716261566) shut down Failover mds1 to oleg237-server mount facets: mds1 Failover mds2 to oleg237-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 Started lustre-MDT0001 23:19:47 (1716261587) targets are mounted 23:19:47 (1716261587) facet_failover done pdsh@oleg237-client: oleg237-client: ssh exited with exit code 95 oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid,mdc.lustre-MDT0001-mdc-*.mds_server_uuid pdsh@oleg237-client: oleg237-client: ssh exited with exit code 95 Starting client: oleg237-client.virtnet: -o user_xattr,flock oleg237-server@tcp:/lustre /mnt/lustre Can't lstat /mnt/lustre/d111c.replay-single/striped_dir: No such file or directory PASS 111c (113s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 111d: DNE: unlink striped dir, uncommit on MDT2, fail client/MDT1/MDT2 ========================================================== 23:21:06 (1716261666) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2964 1284724 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2472 1285216 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18172 3588848 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1756 3605264 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19928 7194112 1% /mnt/lustre df:/mnt/lustre Not a Lustre filesystem Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server Failing mds2 on oleg237-server Stopping /mnt/lustre-mds2 (opts:) on oleg237-server 23:21:15 (1716261675) shut down Failover mds1 to oleg237-server mount facets: mds1 Failover mds2 to oleg237-server mount facets: mds2 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0001 Started lustre-MDT0000 23:21:41 (1716261701) targets are mounted 23:21:41 (1716261701) facet_failover done pdsh@oleg237-client: oleg237-client: ssh exited with exit code 95 oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid,mdc.lustre-MDT0001-mdc-*.mds_server_uuid pdsh@oleg237-client: oleg237-client: ssh exited with exit code 95 Starting client: oleg237-client.virtnet: -o user_xattr,flock oleg237-server@tcp:/lustre /mnt/lustre Can't lstat /mnt/lustre/d111d.replay-single/striped_dir: No such file or directory PASS 111d (110s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 111e: DNE: unlink striped dir, uncommit on MDT2, fail MDT1/MDT2 ========================================================== 23:22:59 (1716261779) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3008 1284680 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2508 1285180 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18172 3588848 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1756 3605264 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19928 7194112 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3000 1284688 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2500 1285188 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18172 3588848 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1756 3605264 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19928 7194112 1% /mnt/lustre Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server Failing mds2 on oleg237-server Stopping /mnt/lustre-mds2 (opts:) on oleg237-server 23:23:08 (1716261788) shut down Failover mds1 to oleg237-server mount facets: mds1 Failover mds2 to oleg237-server mount facets: mds2 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0001 Started lustre-MDT0000 23:23:33 (1716261813) targets are mounted 23:23:33 (1716261813) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid,mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec Can't lstat /mnt/lustre/d111e.replay-single/striped_dir: No such file or directory PASS 111e (43s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 111f: DNE: unlink striped dir, uncommit on MDT1, fail MDT1/MDT2 ========================================================== 23:23:44 (1716261824) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2968 1284720 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2480 1285208 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18172 3588848 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1756 3605264 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19928 7194112 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2968 1284720 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2468 1285220 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18172 3588848 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1756 3605264 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19928 7194112 1% /mnt/lustre Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server Failing mds2 on oleg237-server Stopping /mnt/lustre-mds2 (opts:) on oleg237-server 23:23:53 (1716261833) shut down Failover mds1 to oleg237-server mount facets: mds1 Failover mds2 to oleg237-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0001 Started lustre-MDT0000 23:24:19 (1716261859) targets are mounted 23:24:19 (1716261859) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid,mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec Can't lstat /mnt/lustre/d111f.replay-single/striped_dir: No such file or directory PASS 111f (44s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 111g: DNE: unlink striped dir, fail MDT1/MDT2 ========================================================== 23:24:30 (1716261870) UUID Inodes IUsed IFree IUse% Mounted on lustre-MDT0000_UUID 1024000 551 1023449 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1024000 319 1023681 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 262144 554 261590 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 262144 491 261653 1% /mnt/lustre[OST:1] filesystem_summary: 524113 870 523243 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2976 1284712 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2476 1285212 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18172 3588848 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1756 3605264 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19928 7194112 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2976 1284712 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2476 1285212 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18172 3588848 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1756 3605264 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19928 7194112 1% /mnt/lustre Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server Failing mds2 on oleg237-server Stopping /mnt/lustre-mds2 (opts:) on oleg237-server 23:24:39 (1716261879) shut down Failover mds1 to oleg237-server mount facets: mds1 Failover mds2 to oleg237-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 Started lustre-MDT0001 23:25:05 (1716261905) targets are mounted 23:25:05 (1716261905) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid,mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec Can't lstat /mnt/lustre/d111g.replay-single/striped_dir: No such file or directory PASS 111g (44s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 112a: DNE: cross MDT rename, fail MDT1 ========================================================== 23:25:16 (1716261916) SKIP: replay-single test_112a needs >= 4 MDTs SKIP 112a (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 112b: DNE: cross MDT rename, fail MDT2 ========================================================== 23:25:20 (1716261920) SKIP: replay-single test_112b needs >= 4 MDTs SKIP 112b (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 112c: DNE: cross MDT rename, fail MDT3 ========================================================== 23:25:23 (1716261923) SKIP: replay-single test_112c needs >= 4 MDTs SKIP 112c (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 112d: DNE: cross MDT rename, fail MDT4 ========================================================== 23:25:26 (1716261926) SKIP: replay-single test_112d needs >= 4 MDTs SKIP 112d (2s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 112e: DNE: cross MDT rename, fail MDT1 and MDT2 ========================================================== 23:25:30 (1716261930) SKIP: replay-single test_112e needs >= 4 MDTs SKIP 112e (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 112f: DNE: cross MDT rename, fail MDT1 and MDT3 ========================================================== 23:25:33 (1716261933) SKIP: replay-single test_112f needs >= 4 MDTs SKIP 112f (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 112g: DNE: cross MDT rename, fail MDT1 and MDT4 ========================================================== 23:25:36 (1716261936) SKIP: replay-single test_112g needs >= 4 MDTs SKIP 112g (2s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 112h: DNE: cross MDT rename, fail MDT2 and MDT3 ========================================================== 23:25:40 (1716261940) SKIP: replay-single test_112h needs >= 4 MDTs SKIP 112h (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 112i: DNE: cross MDT rename, fail MDT2 and MDT4 ========================================================== 23:25:43 (1716261943) SKIP: replay-single test_112i needs >= 4 MDTs SKIP 112i (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 112j: DNE: cross MDT rename, fail MDT3 and MDT4 ========================================================== 23:25:46 (1716261946) SKIP: replay-single test_112j needs >= 4 MDTs SKIP 112j (2s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 112k: DNE: cross MDT rename, fail MDT1,MDT2,MDT3 ========================================================== 23:25:50 (1716261950) SKIP: replay-single test_112k needs >= 4 MDTs SKIP 112k (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 112l: DNE: cross MDT rename, fail MDT1,MDT2,MDT4 ========================================================== 23:25:53 (1716261953) SKIP: replay-single test_112l needs >= 4 MDTs SKIP 112l (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 112m: DNE: cross MDT rename, fail MDT1,MDT3,MDT4 ========================================================== 23:25:56 (1716261956) SKIP: replay-single test_112m needs >= 4 MDTs SKIP 112m (2s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 112n: DNE: cross MDT rename, fail MDT2,MDT3,MDT4 ========================================================== 23:26:00 (1716261960) SKIP: replay-single test_112n needs >= 4 MDTs SKIP 112n (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 115: failover for create/unlink striped directory ========================================================== 23:26:03 (1716261963) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2972 1284716 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2464 1285224 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18172 3588848 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1756 3605264 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19928 7194112 1% /mnt/lustre striped dir -i1 -c2 -H crush2 /mnt/lustre/d115.replay-single/test_0 striped dir -i1 -c2 -H all_char /mnt/lustre/d115.replay-single/test_1 striped dir -i1 -c2 -H fnv_1a_64 /mnt/lustre/d115.replay-single/test_2 striped dir -i1 -c2 -H all_char /mnt/lustre/d115.replay-single/test_3 striped dir -i1 -c2 -H all_char /mnt/lustre/d115.replay-single/test_4 Failing mds2 on oleg237-server Stopping /mnt/lustre-mds2 (opts:) on oleg237-server 23:26:08 (1716261968) shut down Failover mds2 to oleg237-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0001 23:26:24 (1716261984) targets are mounted 23:26:24 (1716261984) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3024 1284664 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2532 1285156 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18172 3588848 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1756 3605264 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19928 7194112 1% /mnt/lustre striped dir -i0 -c2 -H fnv_1a_64 /mnt/lustre/d115.replay-single/test_0 striped dir -i0 -c2 -H crush2 /mnt/lustre/d115.replay-single/test_1 striped dir -i0 -c2 -H all_char /mnt/lustre/d115.replay-single/test_2 striped dir -i0 -c2 -H crush /mnt/lustre/d115.replay-single/test_3 striped dir -i0 -c2 -H crush2 /mnt/lustre/d115.replay-single/test_4 Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 23:26:34 (1716261994) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 23:26:50 (1716262010) targets are mounted 23:26:50 (1716262010) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 115 (55s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 116a: large update log master MDT recovery ========================================================== 23:27:00 (1716262020) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3064 1284624 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2596 1285092 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18172 3588848 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1756 3605264 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19928 7194112 1% /mnt/lustre fail_loc=0x80001702 Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 23:27:05 (1716262025) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 23:27:20 (1716262040) targets are mounted 23:27:20 (1716262040) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/d116a.replay-single/striped_dir has type dir OK PASS 116a (28s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 116b: large update log slave MDT recovery ========================================================== 23:27:30 (1716262050) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3116 1284572 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2768 1284920 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18172 3588848 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1756 3605264 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19928 7194112 1% /mnt/lustre fail_loc=0x80001702 Failing mds2 on oleg237-server Stopping /mnt/lustre-mds2 (opts:) on oleg237-server 23:27:35 (1716262055) shut down Failover mds2 to oleg237-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0001 23:27:51 (1716262071) targets are mounted 23:27:51 (1716262071) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/d116b.replay-single/striped_dir has type dir OK PASS 116b (29s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 117: DNE: cross MDT unlink, fail MDT1 and MDT2 ========================================================== 23:28:01 (1716262081) SKIP: replay-single test_117 needs >= 4 MDTs SKIP 117 (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 118: invalidate osp update will not cause update log corruption ========================================================== 23:28:04 (1716262084) fail_loc=0x1705 lfs setdirstripe: dirstripe error on '/mnt/lustre/d118.replay-single/striped_dir': Input/output error lfs setdirstripe: cannot create dir '/mnt/lustre/d118.replay-single/striped_dir': Input/output error lfs setdirstripe: dirstripe error on '/mnt/lustre/d118.replay-single/striped_dir1': Input/output error lfs setdirstripe: cannot create dir '/mnt/lustre/d118.replay-single/striped_dir1': Input/output error fail_loc=0x0 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3212 1284476 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2648 1285040 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18172 3588848 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1756 3605264 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19928 7194112 1% /mnt/lustre Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 23:28:10 (1716262090) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 23:28:25 (1716262105) targets are mounted 23:28:25 (1716262105) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 118 (29s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 119: timeout of normal replay does not cause DNE replay fails ========================================================== 23:28:35 (1716262115) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3280 1284408 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2708 1284980 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18172 3588848 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1756 3605264 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19928 7194112 1% /mnt/lustre Stopping /mnt/lustre-mds1 (opts:) on oleg237-server Failover mds1 to oleg237-server fail_loc=0x80000714 fail_val=65 Starting mds1: -o localrecov -o recovery_time_hard=60 /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 oleg237-client.virtnet: executing wait_import_state_mount FULL mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 72 sec mdt.lustre-MDT0000.recovery_time_hard=180 PASS 119 (90s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 120: DNE fail abort should stop both normal and DNE replay ========================================================== 23:30:08 (1716262208) Replay barrier on lustre-MDT0000 Stopping /mnt/lustre-mds1 (opts:) on oleg237-server Failover mds1 to oleg237-server Starting mds1: -o localrecov -o abort_recovery /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 find: '/mnt/lustre/d120.replay-single': No such file or directory PASS 120 (27s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 121: lock replay timed out and race ========================================================== 23:30:37 (1716262237) multiop /mnt/lustre/f121.replay-single vs_s TMPPIPE=/tmp/multiop_open_wait_pipe.7517 Stopping /mnt/lustre-mds1 (opts:) on oleg237-server Failover mds1 to oleg237-server fail_loc=0x721 fail_val=0 at_max=0 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 pdsh@oleg237-client: oleg237-client: ssh exited with exit code 5 fail_loc=0x0 at_max=600 PASS 121 (200s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 130a: DoM file create (setstripe) replay ========================================================== 23:33:59 (1716262439) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3672 1284016 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 3008 1284680 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18172 3588848 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1756 3605264 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19928 7194112 1% /mnt/lustre Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 23:34:04 (1716262444) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 23:34:19 (1716262459) targets are mounted 23:34:19 (1716262459) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 130a (28s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 130b: DoM file create (inherited) replay ========================================================== 23:34:29 (1716262469) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3676 1284012 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 3008 1284680 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18172 3588848 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1756 3605264 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19928 7194112 1% /mnt/lustre Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 23:34:34 (1716262474) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 23:34:49 (1716262489) targets are mounted 23:34:49 (1716262489) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 130b (28s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 131a: DoM file write lock replay === 23:34:59 (1716262499) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3676 1284012 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 3008 1284680 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18172 3588848 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1756 3605264 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19928 7194112 1% /mnt/lustre 1+0 records in 1+0 records out 8 bytes (8 B) copied, 0.00239081 s, 3.3 kB/s Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 23:35:04 (1716262504) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 23:35:19 (1716262519) targets are mounted 23:35:19 (1716262519) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 131a (28s) debug_raw_pointers=0 debug_raw_pointers=0 SKIP: replay-single test_131b skipping excluded test 131b debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 132a: PFL new component instantiate replay ========================================================== 23:35:30 (1716262530) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3688 1284000 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 3008 1284680 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 18172 3588848 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1756 3605264 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 19928 7194112 1% /mnt/lustre 1+0 records in 1+0 records out 1048576 bytes (1.0 MB) copied, 0.0283386 s, 37.0 MB/s /mnt/lustre/f132a.replay-single lcm_layout_gen: 3 lcm_mirror_count: 1 lcm_entry_count: 2 lcme_id: 1 lcme_mirror_id: 0 lcme_flags: init lcme_extent.e_start: 0 lcme_extent.e_end: 1048576 lmm_stripe_count: 1 lmm_stripe_size: 1048576 lmm_pattern: raid0 lmm_layout_gen: 0 lmm_stripe_offset: 1 lmm_objects: - 0: { l_ost_idx: 1, l_fid: [0x2c0000401:0x3e62:0x0] } lcme_id: 2 lcme_mirror_id: 0 lcme_flags: init lcme_extent.e_start: 1048576 lcme_extent.e_end: EOF lmm_stripe_count: 2 lmm_stripe_size: 4194304 lmm_pattern: raid0 lmm_layout_gen: 0 lmm_stripe_offset: 0 lmm_objects: - 0: { l_ost_idx: 0, l_fid: [0x280000401:0x3eaa:0x0] } - 1: { l_ost_idx: 1, l_fid: [0x2c0000401:0x3e63:0x0] } Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 23:35:34 (1716262534) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 23:35:50 (1716262550) targets are mounted 23:35:50 (1716262550) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/f132a.replay-single lcm_layout_gen: 1 lcm_mirror_count: 1 lcm_entry_count: 2 lcme_id: 1 lcme_mirror_id: 0 lcme_flags: init lcme_extent.e_start: 0 lcme_extent.e_end: 1048576 lmm_stripe_count: 1 lmm_stripe_size: 1048576 lmm_pattern: raid0 lmm_layout_gen: 65535 lmm_stripe_offset: 1 lmm_objects: - 0: { l_ost_idx: 1, l_fid: [0x2c0000401:0x3e62:0x0] } lcme_id: 2 lcme_mirror_id: 0 lcme_flags: init lcme_extent.e_start: 1048576 lcme_extent.e_end: EOF lmm_stripe_count: 2 lmm_stripe_size: 4194304 lmm_pattern: raid0 lmm_layout_gen: 65535 lmm_stripe_offset: 0 lmm_objects: - 0: { l_ost_idx: 0, l_fid: [0x280000401:0x3eaa:0x0] } - 1: { l_ost_idx: 1, l_fid: [0x2c0000401:0x3e63:0x0] } PASS 132a (28s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 133: check resend of ongoing requests for lwp during failover ========================================================== 23:36:00 (1716262560) seq.srv-lustre-MDT0001.space=clear Starting client: oleg237-client.virtnet: -o user_xattr,flock oleg237-server@tcp:/lustre /mnt/lustre fail_val=700 fail_loc=0x80000123 Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 23:36:05 (1716262565) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 23:36:19 (1716262579) targets are mounted 23:36:19 (1716262579) facet_failover done PASS 133 (24s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 134: replay creation of a file created in a pool ========================================================== 23:36:26 (1716262586) Creating new pool oleg237-server: Pool lustre.pool_134 created Adding targets to pool oleg237-server: OST lustre-OST0001_UUID added to pool lustre.pool_134 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3768 1283920 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 3052 1284636 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 19196 3587824 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1756 3605264 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 20952 7193088 1% /mnt/lustre Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 23:36:36 (1716262596) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 23:36:52 (1716262612) targets are mounted 23:36:52 (1716262612) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Destroy the created pools: pool_134 lustre.pool_134 oleg237-server: OST lustre-OST0001_UUID removed from pool lustre.pool_134 oleg237-server: Pool lustre.pool_134 destroyed PASS 134 (39s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 135: Server failure in lock replay phase ========================================================== 23:37:07 (1716262627) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3748 1283940 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 3052 1284636 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 19196 3587824 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1756 3605264 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 20952 7193088 1% /mnt/lustre ldlm.cancel_unused_locks_before_replay=0 Stopping /mnt/lustre-ost1 (opts:) on oleg237-server Failover ost1 to oleg237-server oleg237-server: oleg237-server.virtnet: executing load_module ../libcfs/libcfs/libcfs fail_loc=0x32d fail_val=20 Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-OST0000 oleg237-client.virtnet: executing wait_import_state_mount REPLAY_LOCKS osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in REPLAY_LOCKS state after 0 sec Stopping /mnt/lustre-ost1 (opts:) on oleg237-server Failover ost1 to oleg237-server oleg237-server: oleg237-server.virtnet: executing load_module ../libcfs/libcfs/libcfs fail_loc=0 Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all End of sync pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-OST0000 Stopping /mnt/lustre-ost1 (opts:-f) on oleg237-server Stopping /mnt/lustre-ost2 (opts:-f) on oleg237-server Starting ost1: -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-OST0000 Starting ost2: -o localrecov /dev/mapper/ost2_flakey /mnt/lustre-ost2 seq.cli-lustre-OST0001-super.width=65536 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-OST0001 pdsh@oleg237-client: oleg237-client: ssh exited with exit code 5 ldlm.cancel_unused_locks_before_replay=1 PASS 135 (71s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 136: MDS to disconnect all OSPs first, then cleanup ldlm ========================================================== 23:38:20 (1716262700) SKIP: replay-single test_136 needs > 2 MDTs SKIP 136 (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 137a: DNE: create under striped dir, fail MDT1 ========================================================== 23:38:24 (1716262704) llite.lustre-ffff8800a9d77000.intent_mkdir=1 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3756 1283932 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 3072 1284616 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 19196 3587824 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1772 3605248 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 20968 7193072 1% /mnt/lustre Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 23:38:28 (1716262708) shut down Failover mds1 to oleg237-server mount facets: mds1 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0000 23:38:44 (1716262724) targets are mounted 23:38:44 (1716262724) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/d137a.replay-single/striped_dir/dir0 has type dir OK lmv_stripe_count: 0 lmv_stripe_offset: 0 lmv_hash_type: none /mnt/lustre/d137a.replay-single/striped_dir/dir1 has type dir OK lmv_stripe_count: 0 lmv_stripe_offset: 1 lmv_hash_type: none PASS 137a (28s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 137b: DNE: create under striped dir, fail MDT2 ========================================================== 23:38:54 (1716262734) llite.lustre-ffff8800a9d77000.intent_mkdir=1 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3764 1283924 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 3076 1284612 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 19196 3587824 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1772 3605248 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 20968 7193072 1% /mnt/lustre Failing mds2 on oleg237-server Stopping /mnt/lustre-mds2 (opts:) on oleg237-server 23:38:59 (1716262739) shut down Failover mds2 to oleg237-server mount facets: mds2 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0001 23:39:14 (1716262754) targets are mounted 23:39:14 (1716262754) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/d137b.replay-single/striped_dir/dir0 has type dir OK lmv_stripe_count: 0 lmv_stripe_offset: 0 lmv_hash_type: none /mnt/lustre/d137b.replay-single/striped_dir/dir1 has type dir OK lmv_stripe_count: 0 lmv_stripe_offset: 1 lmv_hash_type: none PASS 137b (28s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 137c: DNE: create under striped dir, fail MDT1/MDT2 ========================================================== 23:39:24 (1716262764) llite.lustre-ffff8800a9d77000.intent_mkdir=1 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3760 1283928 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 3064 1284624 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 19196 3587824 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1772 3605248 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 20968 7193072 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3760 1283928 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 3064 1284624 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 19196 3587824 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1772 3605248 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 20968 7193072 1% /mnt/lustre Failing mds2 on oleg237-server Stopping /mnt/lustre-mds2 (opts:) on oleg237-server Failing mds1 on oleg237-server Stopping /mnt/lustre-mds1 (opts:) on oleg237-server 23:39:33 (1716262773) shut down Failover mds1 to oleg237-server Failover mds2 to oleg237-server mount facets: mds2 mount facets: mds1 Starting mds2: -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 Starting mds1: -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all oleg237-server: oleg237-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 pdsh@oleg237-client: oleg237-server: ssh exited with exit code 1 Started lustre-MDT0001 Started lustre-MDT0000 23:39:48 (1716262788) targets are mounted 23:39:48 (1716262788) facet_failover done oleg237-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid,mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/d137c.replay-single/striped_dir/dir0 has type dir OK lmv_stripe_count: 0 lmv_stripe_offset: 0 lmv_hash_type: none /mnt/lustre/d137c.replay-single/striped_dir/dir1 has type dir OK lmv_stripe_count: 0 lmv_stripe_offset: 1 lmv_hash_type: none PASS 137c (33s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-single test 200: Dropping one OBD_PING should not cause disconnect ========================================================== 23:39:59 (1716262799) SKIP: replay-single test_200 Need remote client SKIP 200 (1s) debug_raw_pointers=0 debug_raw_pointers=0 == replay-single test complete, duration 9212 sec ======== 23:40:01 (1716262801) === replay-single: start cleanup 23:40:02 (1716262802) === === replay-single: finish cleanup 23:40:06 (1716262806) ===