-----============= acceptance-small: replay-dual ============----- Mon May 20 21:11:12 EDT 2024 mgs: CentOS Linux release 7.9.2009 (Core) MGS_OS_VERSION_ID=7 MGS_OS_ID=centos MGS_OS_VERSION_CODE=117440512 MGS_OS_ID_LIKE=rhel fedora centos mds1: CentOS Linux release 7.9.2009 (Core) MDS1_OS_ID_LIKE=rhel fedora centos MDS1_OS_ID=centos MDS1_OS_VERSION_ID=7 MDS1_OS_VERSION_CODE=117440512 ost1: CentOS Linux release 7.9.2009 (Core) OST1_OS_VERSION_CODE=117440512 OST1_OS_VERSION_ID=7 OST1_OS_ID_LIKE=rhel fedora centos OST1_OS_ID=centos client: CentOS Linux release 7.9.2009 (Core) CLIENT_OS_ID=centos CLIENT_OS_ID_LIKE=rhel fedora centos CLIENT_OS_VERSION_ID=7 CLIENT_OS_VERSION_CODE=117440512 excepting tests: 14b 21b 21b skipping tests SLOW=no: 21b === replay-dual: start setup 21:11:16 (1716253876) === Starting client oleg455-client.virtnet: -o user_xattr,flock oleg455-server@tcp:/lustre /mnt/lustre2 Started clients oleg455-client.virtnet: 192.168.204.155@tcp:/lustre on /mnt/lustre2 type lustre (rw,checksum,flock,user_xattr,lruresize,lazystatfs,nouser_fid2path,verbose,noencrypt,statfs_project) oleg455-client.virtnet: executing check_config_client /mnt/lustre oleg455-client.virtnet: Checking config lustre mounted on /mnt/lustre Checking servers environments Checking clients oleg455-client.virtnet environments Using TIMEOUT=20 osc.lustre-OST0000-osc-ffff8800aa66f000.idle_timeout=debug osc.lustre-OST0000-osc-ffff8800b6cf8800.idle_timeout=debug osc.lustre-OST0001-osc-ffff8800aa66f000.idle_timeout=debug osc.lustre-OST0001-osc-ffff8800b6cf8800.idle_timeout=debug disable quota as required oleg455-server: oleg455-server.virtnet: executing set_default_debug -1 all === replay-dual: finish setup 21:11:22 (1716253882) === debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 0a: expired recovery with lost client ========================================================== 21:11:23 (1716253883) Check file is LU482_FAILED=/tmp/replay-dual.lu482.Pbf44F UUID 1K-blocks Used Available 
Use% Mounted on lustre-MDT0000_UUID 2210688 3200 2205440 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre total: 50 open/close in 0.46 seconds: 109.15 ops/second fail_loc=0x80000514 Failing mds1 on oleg455-server Stopping /mnt/lustre-mds1 (opts:) on oleg455-server 21:11:26 (1716253886) shut down Failover mds1 to oleg455-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg455-server: oleg455-server.virtnet: executing set_default_debug -1 all pdsh@oleg455-client: oleg455-server: ssh exited with exit code 1 Started lustre-MDT0000 21:11:39 (1716253899) targets are mounted 21:11:39 (1716253899) facet_failover done Starting client: oleg455-client.virtnet: -o user_xattr,flock oleg455-server@tcp:/lustre /mnt/lustre2 - unlinked 0 (time 1716253981 ; total 0 ; last 0) total: 50 unlinks in 1 seconds: 50.000000 unlinks/second PASS 0a (100s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 0b: lost client during waiting for next transno ========================================================== 21:13:05 (1716253985) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3200 2205440 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing mds1 on oleg455-server Stopping /mnt/lustre-mds1 (opts:) on oleg455-server 21:13:08 (1716253988) shut down Failover mds1 to oleg455-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg455-server: oleg455-server.virtnet: executing set_default_debug -1 all pdsh@oleg455-client: oleg455-server: ssh exited with exit code 1 Started lustre-MDT0000 21:13:21 (1716254001) targets are mounted 
21:13:21 (1716254001) facet_failover done Starting client: oleg455-client.virtnet: -o user_xattr,flock oleg455-server@tcp:/lustre /mnt/lustre Starting client: oleg455-client.virtnet: -o user_xattr,flock oleg455-server@tcp:/lustre /mnt/lustre2 PASS 0b (88s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 1: |X| simple create ================= 21:14:35 (1716254075) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3200 2205440 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing mds1 on oleg455-server Stopping /mnt/lustre-mds1 (opts:) on oleg455-server 21:14:38 (1716254078) shut down Failover mds1 to oleg455-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg455-server: oleg455-server.virtnet: executing set_default_debug -1 all pdsh@oleg455-client: oleg455-server: ssh exited with exit code 1 Started lustre-MDT0000 21:14:52 (1716254092) targets are mounted 21:14:52 (1716254092) facet_failover done oleg455-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 1 (22s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 2: |X| mkdir adir ==================== 21:15:00 (1716254100) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3200 2205440 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing mds1 on oleg455-server Stopping /mnt/lustre-mds1 (opts:) on oleg455-server 21:15:03 (1716254103) shut down Failover mds1 to oleg455-server mount facets: 
mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg455-server: oleg455-server.virtnet: executing set_default_debug -1 all pdsh@oleg455-client: oleg455-server: ssh exited with exit code 1 Started lustre-MDT0000 21:15:17 (1716254117) targets are mounted 21:15:17 (1716254117) facet_failover done oleg455-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 2 (22s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 3: |X| mkdir adir, mkdir adir/bdir === 21:15:25 (1716254125) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3200 2205440 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing mds1 on oleg455-server Stopping /mnt/lustre-mds1 (opts:) on oleg455-server 21:15:28 (1716254128) shut down Failover mds1 to oleg455-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg455-server: oleg455-server.virtnet: executing set_default_debug -1 all pdsh@oleg455-client: oleg455-server: ssh exited with exit code 1 Started lustre-MDT0000 21:15:42 (1716254142) targets are mounted 21:15:42 (1716254142) facet_failover done oleg455-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 3 (23s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 4: |X| mkdir adir (-EEXIST), mkdir adir/bdir ========================================================== 21:15:50 (1716254150) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3200 2205440 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 
3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre mkdir: cannot create directory '/mnt/lustre/adir': File exists Failing mds1 on oleg455-server Stopping /mnt/lustre-mds1 (opts:) on oleg455-server 21:15:53 (1716254153) shut down Failover mds1 to oleg455-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg455-server: oleg455-server.virtnet: executing set_default_debug -1 all pdsh@oleg455-client: oleg455-server: ssh exited with exit code 1 Started lustre-MDT0000 21:16:07 (1716254167) targets are mounted 21:16:07 (1716254167) facet_failover done oleg455-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 4 (23s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 5: open, unlink |X| close ============ 21:16:15 (1716254175) multiop /mnt/lustre2/a vo_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.6881 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3200 2205440 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing mds1 on oleg455-server Stopping /mnt/lustre-mds1 (opts:) on oleg455-server 21:16:19 (1716254179) shut down Failover mds1 to oleg455-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg455-server: oleg455-server.virtnet: executing set_default_debug -1 all pdsh@oleg455-client: oleg455-server: ssh exited with exit code 1 Started lustre-MDT0000 21:16:33 (1716254193) targets are mounted 21:16:33 (1716254193) facet_failover done oleg455-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid 
mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 5 (23s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 6: open1, open2, unlink |X| close1 [fail mds1] close2 ========================================================== 21:16:40 (1716254200) multiop /mnt/lustre2/a vo_c TMPPIPE=/tmp/multiop_open_wait_pipe.6881 multiop /mnt/lustre/a vo_c TMPPIPE=/tmp/multiop_open_wait_pipe.6881 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3200 2205440 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing mds1 on oleg455-server Stopping /mnt/lustre-mds1 (opts:) on oleg455-server 21:16:44 (1716254204) shut down Failover mds1 to oleg455-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg455-server: oleg455-server.virtnet: executing set_default_debug -1 all pdsh@oleg455-client: oleg455-server: ssh exited with exit code 1 Started lustre-MDT0000 21:16:58 (1716254218) targets are mounted 21:16:58 (1716254218) facet_failover done oleg455-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 6 (23s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 8: replay of resent request ========== 21:17:06 (1716254226) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3328 2205312 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre fail_loc=0x119 fail_loc=0 Failing mds1 on oleg455-server Stopping /mnt/lustre-mds1 (opts:) on oleg455-server 21:17:26 
(1716254246) shut down Failover mds1 to oleg455-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg455-server: oleg455-server.virtnet: executing set_default_debug -1 all pdsh@oleg455-client: oleg455-server: ssh exited with exit code 1 Started lustre-MDT0000 21:17:40 (1716254260) targets are mounted 21:17:40 (1716254260) facet_failover done oleg455-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 8 (39s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 9: resending a replayed create ======= 21:17:48 (1716254268) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3328 2205312 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre fail_loc=0x80000119 Failing mds1 on oleg455-server Stopping /mnt/lustre-mds1 (opts:) on oleg455-server 21:17:51 (1716254271) shut down Failover mds1 to oleg455-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg455-server: oleg455-server.virtnet: executing set_default_debug -1 all pdsh@oleg455-client: oleg455-server: ssh exited with exit code 1 Started lustre-MDT0000 21:18:05 (1716254285) targets are mounted 21:18:05 (1716254285) facet_failover done oleg455-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec fail_loc=0 PASS 9 (37s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 10: resending a replayed unlink ====== 21:18:27 (1716254307) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210560 3200 2205312 1% 
/mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre fail_loc=0x80000119 Failing mds1 on oleg455-server Stopping /mnt/lustre-mds1 (opts:) on oleg455-server 21:18:31 (1716254311) shut down Failover mds1 to oleg455-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg455-server: oleg455-server.virtnet: executing set_default_debug -1 all pdsh@oleg455-client: oleg455-server: ssh exited with exit code 1 Started lustre-MDT0000 21:18:45 (1716254325) targets are mounted 21:18:45 (1716254325) facet_failover done oleg455-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec fail_loc=0 PASS 10 (38s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 11: both clients timeout during replay ========================================================== 21:19:07 (1716254347) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3200 2205440 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre fail_loc=0x0119 Failing mds1 on oleg455-server Stopping /mnt/lustre-mds1 (opts:) on oleg455-server 21:19:11 (1716254351) shut down Failover mds1 to oleg455-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg455-server: oleg455-server.virtnet: executing set_default_debug -1 all pdsh@oleg455-client: oleg455-server: ssh exited with exit code 1 Started lustre-MDT0000 21:19:25 (1716254365) targets are mounted 21:19:25 (1716254365) facet_failover done oleg455-client.virtnet: executing wait_import_state_mount FULL 
mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 15 sec fail_loc=0 PASS 11 (37s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 12: open resend timeout ============== 21:19:46 (1716254386) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3200 2205440 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre multiop /mnt/lustre/f12.replay-dual vmo_c TMPPIPE=/tmp/multiop_open_wait_pipe.6881 fail_loc=0x80000302 Failing mds1 on oleg455-server Stopping /mnt/lustre-mds1 (opts:) on oleg455-server 21:19:50 (1716254390) shut down Failover mds1 to oleg455-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg455-server: oleg455-server.virtnet: executing set_default_debug -1 all pdsh@oleg455-client: oleg455-server: ssh exited with exit code 1 Started lustre-MDT0000 21:20:03 (1716254403) targets are mounted 21:20:03 (1716254403) facet_failover done fail_loc=0 /mnt/lustre/f12.replay-dual /mnt/lustre/f12.replay-dual has type file OK PASS 12 (21s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 13: close resend timeout ============= 21:20:09 (1716254409) multiop /mnt/lustre/f13.replay-dual vmo_c TMPPIPE=/tmp/multiop_open_wait_pipe.6881 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3200 2205440 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre fail_loc=0x80000115 Failing mds1 on oleg455-server Stopping /mnt/lustre-mds1 (opts:) on oleg455-server 21:20:13 (1716254413) shut down Failover mds1 to oleg455-server mount facets: mds1 
Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg455-server: oleg455-server.virtnet: executing set_default_debug -1 all pdsh@oleg455-client: oleg455-server: ssh exited with exit code 1 Started lustre-MDT0000 21:20:26 (1716254426) targets are mounted 21:20:26 (1716254426) facet_failover done fail_loc=0 /mnt/lustre/f13.replay-dual /mnt/lustre/f13.replay-dual has type file OK PASS 13 (22s) debug_raw_pointers=0 debug_raw_pointers=0 SKIP: replay-dual test_14b skipping ALWAYS excluded test 14b debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 15a: timeout waiting for lost client during replay, 1 client completes ========================================================== 21:20:34 (1716254434) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3328 2205312 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre total: 25 open/close in 0.33 seconds: 74.84 ops/second total: 1 open/close in 0.01 seconds: 75.45 ops/second Failing mds1 on oleg455-server Stopping /mnt/lustre-mds1 (opts:) on oleg455-server 21:20:38 (1716254438) shut down Failover mds1 to oleg455-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg455-server: oleg455-server.virtnet: executing set_default_debug -1 all pdsh@oleg455-client: oleg455-server: ssh exited with exit code 1 Started lustre-MDT0000 21:20:52 (1716254452) targets are mounted 21:20:52 (1716254452) facet_failover done oleg455-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec - unlinked 0 (time 1716254526 ; total 0 ; last 0) total: 25 unlinks in 0 seconds: inf unlinks/second Starting client: oleg455-client.virtnet: -o user_xattr,flock oleg455-server@tcp:/lustre /mnt/lustre2 PASS 15a (94s) 
debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 15c: remove multiple OST orphans ===== 21:22:10 (1716254530) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3200 2205440 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing mds1 on oleg455-server Stopping /mnt/lustre-mds1 (opts:) on oleg455-server 21:22:53 (1716254573) shut down Failover mds1 to oleg455-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg455-server: oleg455-server.virtnet: executing set_default_debug -1 all pdsh@oleg455-client: oleg455-server: ssh exited with exit code 1 Started lustre-MDT0000 21:23:07 (1716254587) targets are mounted 21:23:07 (1716254587) facet_failover done oleg455-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Starting client: oleg455-client.virtnet: -o user_xattr,flock oleg455-server@tcp:/lustre /mnt/lustre2 PASS 15c (130s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 16: fail MDS during recovery (3571) == 21:24:22 (1716254662) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3200 2205440 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre total: 25 open/close in 0.35 seconds: 72.22 ops/second total: 1 open/close in 0.01 seconds: 68.92 ops/second Failing mds1 on oleg455-server Stopping /mnt/lustre-mds1 (opts:) on oleg455-server 21:24:26 (1716254666) shut down Failover mds1 to oleg455-server mount facets: mds1 Starting mds1: -o localrecov 
lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg455-server: oleg455-server.virtnet: executing set_default_debug -1 all pdsh@oleg455-client: oleg455-server: ssh exited with exit code 1 Started lustre-MDT0000 21:24:40 (1716254680) targets are mounted 21:24:40 (1716254680) facet_failover done Failing mds1 on oleg455-server Stopping /mnt/lustre-mds1 (opts:) on oleg455-server 21:25:02 (1716254702) shut down Failover mds1 to oleg455-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg455-server: oleg455-server.virtnet: executing set_default_debug -1 all pdsh@oleg455-client: oleg455-server: ssh exited with exit code 1 Started lustre-MDT0000 21:25:16 (1716254716) targets are mounted 21:25:16 (1716254716) facet_failover done oleg455-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec - unlinked 0 (time 1716254794 ; total 0 ; last 0) total: 25 unlinks in 0 seconds: inf unlinks/second Starting client: oleg455-client.virtnet: -o user_xattr,flock oleg455-server@tcp:/lustre /mnt/lustre2 PASS 16 (134s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 17: fail OST during recovery (3571) == 21:26:38 (1716254798) total: 25 open/close in 0.33 seconds: 75.97 ops/second total: 1 open/close in 0.02 seconds: 60.22 ops/second UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3328 2205312 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing ost1 on oleg455-server Stopping /mnt/lustre-ost1 (opts:) on oleg455-server 21:26:42 (1716254802) shut down Failover ost1 to oleg455-server mount facets: ost1 Starting ost1: -o localrecov lustre-ost1/ost1 /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 
oleg455-server: oleg455-server.virtnet: executing set_default_debug -1 all pdsh@oleg455-client: oleg455-server: ssh exited with exit code 1 Started lustre-OST0000 21:26:56 (1716254816) targets are mounted 21:26:56 (1716254816) facet_failover done Failing ost1 on oleg455-server Stopping /mnt/lustre-ost1 (opts:) on oleg455-server 21:27:17 (1716254837) shut down Failover ost1 to oleg455-server mount facets: ost1 Starting ost1: -o localrecov lustre-ost1/ost1 /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg455-server: oleg455-server.virtnet: executing set_default_debug -1 all pdsh@oleg455-client: oleg455-server: ssh exited with exit code 1 Started lustre-OST0000 21:27:32 (1716254852) targets are mounted 21:27:32 (1716254852) facet_failover done oleg455-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec - unlinked 0 (time 1716254922 ; total 0 ; last 0) total: 25 unlinks in 1 seconds: 25.000000 unlinks/second Starting client: oleg455-client.virtnet: -o user_xattr,flock oleg455-server@tcp:/lustre /mnt/lustre2 PASS 17 (126s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 18: ldlm_handle_enqueue succeeds on evicted export (3822) ========================================================== 21:28:47 (1716254927) debug=+dlmtrace using seed 4151856273 running for 500 iterations total: 500 stats in 0 seconds: inf stats/second fail_loc=0x8000030b ldlm.namespaces.MGC192.168.204.155@tcp.early_lock_cancel=0 ldlm.namespaces.lustre-MDT0000-mdc-ffff8800a8ac6800.early_lock_cancel=0 ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa631800.early_lock_cancel=0 ldlm.namespaces.lustre-OST0000-osc-ffff8800a8ac6800.early_lock_cancel=0 ldlm.namespaces.lustre-OST0000-osc-ffff8800aa631800.early_lock_cancel=0 ldlm.namespaces.lustre-OST0001-osc-ffff8800a8ac6800.early_lock_cancel=0 
ldlm.namespaces.lustre-OST0001-osc-ffff8800aa631800.early_lock_cancel=0 fail_loc=0x80000305 Error in opening file "/mnt/lustre2/d18.replay-dual/f18.replay-dual"(flags=O_RDONLY) 2: No such file or directory ldlm.namespaces.MGC192.168.204.155@tcp.early_lock_cancel=1 ldlm.namespaces.lustre-MDT0000-mdc-ffff8800a8ac6800.early_lock_cancel=1 ldlm.namespaces.lustre-MDT0000-mdc-ffff8800aa631800.early_lock_cancel=1 ldlm.namespaces.lustre-OST0000-osc-ffff8800a8ac6800.early_lock_cancel=1 ldlm.namespaces.lustre-OST0000-osc-ffff8800aa631800.early_lock_cancel=1 ldlm.namespaces.lustre-OST0001-osc-ffff8800a8ac6800.early_lock_cancel=1 ldlm.namespaces.lustre-OST0001-osc-ffff8800aa631800.early_lock_cancel=1 fail_loc=0 fail_loc=0 PASS 18 (46s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 19: resend of open request =========== 21:29:35 (1716254975) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3328 2205312 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre fail_loc=0x157 - open/close 0 (time 1716255063.92 total 86.02 last 0.00) total: 1 open/close in 86.02 seconds: 0.01 ops/second fail_loc=0 Failing mds1 on oleg455-server Stopping /mnt/lustre-mds1 (opts:) on oleg455-server 21:31:05 (1716255065) shut down Failover mds1 to oleg455-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg455-server: oleg455-server.virtnet: executing set_default_debug -1 all pdsh@oleg455-client: oleg455-server: ssh exited with exit code 1 Started lustre-MDT0000 21:31:19 (1716255079) targets are mounted 21:31:19 (1716255079) facet_failover done oleg455-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 19 (110s) 
debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 20: recovery time is not increasing == 21:31:27 (1716255087) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3328 2205312 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing mds1 on oleg455-server Stopping /mnt/lustre-mds1 (opts:) on oleg455-server 21:31:31 (1716255091) shut down Failover mds1 to oleg455-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg455-server: oleg455-server.virtnet: executing set_default_debug -1 all pdsh@oleg455-client: oleg455-server: ssh exited with exit code 1 Started lustre-MDT0000 21:31:44 (1716255104) targets are mounted 21:31:44 (1716255104) facet_failover done oleg455-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Starting client: oleg455-client.virtnet: -o user_xattr,flock oleg455-server@tcp:/lustre /mnt/lustre2 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3328 2205312 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3766272 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7532544 1% /mnt/lustre Failing mds1 on oleg455-server Stopping /mnt/lustre-mds1 (opts:) on oleg455-server 21:34:09 (1716255249) shut down Failover mds1 to oleg455-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg455-server: oleg455-server.virtnet: executing set_default_debug -1 all pdsh@oleg455-client: oleg455-server: ssh exited with exit code 1 Started lustre-MDT0000 21:34:23 (1716255263) targets are mounted 21:34:23 (1716255263) facet_failover done 
oleg455-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Starting client: oleg455-client.virtnet: -o user_xattr,flock oleg455-server@tcp:/lustre /mnt/lustre2 PASS 20 (319s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 21a: commit on sharing =============== 21:36:48 (1716255408) mdt.lustre-MDT0000.commit_on_sharing=1 Replay barrier on lustre-MDT0000 Failing mds1 on oleg455-server Stopping /mnt/lustre-mds1 (opts:) on oleg455-server 21:36:52 (1716255412) shut down Failover mds1 to oleg455-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 oleg455-server: oleg455-server.virtnet: executing set_default_debug -1 all pdsh@oleg455-client: oleg455-server: ssh exited with exit code 1 Started lustre-MDT0000 21:37:06 (1716255426) targets are mounted 21:37:06 (1716255426) facet_failover done Starting client: oleg455-client.virtnet: -o user_xattr,flock oleg455-server@tcp:/lustre /mnt/lustre2 mdt.lustre-MDT0000.commit_on_sharing=0 PASS 21a (157s) debug_raw_pointers=0 debug_raw_pointers=0 SKIP: replay-dual test_21b skipping SLOW test 21b debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 22a: c1 lfs mkdir -i 1 dir1, M1 drop reply & fail, c2 mkdir dir1/dir ========================================================== 21:39:27 (1716255567) SKIP: replay-dual test_22a needs >= 2 MDTs SKIP 22a (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 22b: c1 lfs mkdir -i 1 d1, M1 drop reply & fail M0/M1, c2 mkdir d1/dir ========================================================== 21:39:30 (1716255570) SKIP: replay-dual test_22b needs >= 2 MDTs SKIP 22b (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 22c: c1 lfs mkdir -i 1 d1, M1 drop update & fail M1, c2 
mkdir d1/dir ========================================================== 21:39:33 (1716255573) SKIP: replay-dual test_22c needs >= 2 MDTs SKIP 22c (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 22d: c1 lfs mkdir -i 1 d1, M1 drop update & fail M0/M1,c2 mkdir d1/dir ========================================================== 21:39:37 (1716255577) SKIP: replay-dual test_22d needs >= 2 MDTs SKIP 22d (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 23a: c1 rmdir d1, M1 drop reply and fail, client2 mkdir d1 ========================================================== 21:39:40 (1716255580) SKIP: replay-dual test_23a needs >= 2 MDTs SKIP 23a (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 23b: c1 rmdir d1, M1 drop reply and fail M0/M1, c2 mkdir d1 ========================================================== 21:39:43 (1716255583) SKIP: replay-dual test_23b needs >= 2 MDTs SKIP 23b (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 23c: c1 rmdir d1, M0 drop update reply and fail M0, c2 mkdir d1 ========================================================== 21:39:47 (1716255587) SKIP: replay-dual test_23c needs >= 2 MDTs SKIP 23c (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 23d: c1 rmdir d1, M0 drop update reply and fail M0/M1, c2 mkdir d1 ========================================================== 21:39:50 (1716255590) SKIP: replay-dual test_23d needs >= 2 MDTs SKIP 23d (1s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 24: reconstruct on non-existing object ========================================================== 21:39:53 (1716255593) fail_loc=0x119 fail_loc=0 truncate: cannot truncate '/mnt/lustre/f24.replay-dual' to length 
100: No such file or directory PASS 24 (87s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 25: replay|resend ==================== 21:41:22 (1716255682) 1+0 records in 1+0 records out 512 bytes (512 B) copied, 0.00313927 s, 163 kB/s fail_loc=0x304 fail_loc=0x80000325 Failing ost1 on oleg455-server Stopping /mnt/lustre-ost1 (opts:) on oleg455-server 21:41:24 (1716255684) shut down Failover ost1 to oleg455-server mount facets: ost1 Starting ost1: -o localrecov lustre-ost1/ost1 /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg455-server: oleg455-server.virtnet: executing set_default_debug -1 all pdsh@oleg455-client: oleg455-server: ssh exited with exit code 1 Started lustre-OST0000 21:41:39 (1716255699) targets are mounted 21:41:39 (1716255699) facet_failover done oleg455-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec /home/green/git/lustre-release/lustre/tests/test-framework.sh: line 6534: 15873 Terminated LUSTRE="/home/green/git/lustre-release/lustre" bash -c "multiop /mnt/lustre2/f25.replay-dual Ow512" fail_loc=0 PASS 25 (23s) debug_raw_pointers=0 debug_raw_pointers=0 debug_raw_pointers=Y debug_raw_pointers=Y == replay-dual test 26: dbench and tar with mds failover ========================================================== 21:41:47 (1716255707) Starting client oleg455-client.virtnet: -o user_xattr,flock oleg455-server@tcp:/lustre /mnt/lustre Started clients oleg455-client.virtnet: 192.168.204.155@tcp:/lustre on /mnt/lustre type lustre (rw,checksum,flock,user_xattr,lruresize,lazystatfs,nouser_fid2path,verbose,noencrypt,statfs_project) Started tar loop with pid 17481 Started dbench loop with 17482 looking for dbench program /usr/bin/dbench found dbench client file /usr/share/dbench/client.txt '/usr/share/dbench/client.txt' -> 'client.txt' running 'dbench 1 -D 
/mnt/lustre2/d26.replay-dual/run_dbench -t 100' on /mnt/lustre at Mon May 20 21:41:49 EDT 2024 waiting for dbench pid 17513 dbench version 4.00 - Copyright Andrew Tridgell 1999-2004 Running for 100 seconds with load 'client.txt' and minimum warmup 20 secs failed to create barrier semaphore 0 of 1 processes prepared for launch 0 sec 1 of 1 processes prepared for launch 0 sec releasing clients 1 155 6.54 MB/sec warmup 1 sec latency 69.644 ms UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 3328 2205312 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 3072 3726336 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 3072 3717120 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 6144 7443456 1% /mnt/lustre 1 289 5.35 MB/sec warmup 2 sec latency 34.832 ms 1 459 5.37 MB/sec warmup 3 sec latency 32.773 ms 1 664 5.29 MB/sec warmup 4 sec latency 64.569 ms test_26 fail mds1 1 times Failing mds1 on oleg455-server 1 791 4.29 MB/sec warmup 5 sec latency 102.294 ms Stopping /mnt/lustre-mds1 (opts:) on oleg455-server 1 900 3.61 MB/sec warmup 6 sec latency 474.001 ms 21:41:55 (1716255715) shut down 1 900 3.10 MB/sec warmup 7 sec latency 1474.219 ms 1 900 2.71 MB/sec warmup 8 sec latency 2474.503 ms 1 900 2.41 MB/sec warmup 9 sec latency 3474.731 ms 1 900 2.17 MB/sec warmup 10 sec latency 4474.977 ms 1 900 1.97 MB/sec warmup 11 sec latency 5475.253 ms 1 900 1.81 MB/sec warmup 12 sec latency 6475.523 ms 1 900 1.67 MB/sec warmup 13 sec latency 7475.812 ms 1 900 1.55 MB/sec warmup 14 sec latency 8476.020 ms 1 900 1.45 MB/sec warmup 15 sec latency 9476.286 ms 1 900 1.35 MB/sec warmup 16 sec latency 10476.603 ms Failover mds1 to oleg455-server mount facets: mds1 1 900 1.28 MB/sec warmup 17 sec latency 11476.885 ms Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 1 900 1.20 MB/sec warmup 18 sec latency 12477.172 ms 1 900 1.14 MB/sec warmup 19 sec latency 13477.386 ms oleg455-server: oleg455-server.virtnet: executing set_default_debug -1 all 
pdsh@oleg455-client: oleg455-server: ssh exited with exit code 1 Started lustre-MDT0000 21:42:09 (1716255729) targets are mounted 21:42:09 (1716255729) facet_failover done 1 900 0.00 MB/sec execute 1 sec latency 15477.705 ms 1 900 0.00 MB/sec execute 2 sec latency 16477.986 ms 1 900 0.00 MB/sec execute 3 sec latency 17478.248 ms 1 900 0.00 MB/sec execute 4 sec latency 18478.509 ms 1 900 0.00 MB/sec execute 5 sec latency 19478.725 ms 1 900 0.00 MB/sec execute 6 sec latency 20478.972 ms 1 900 0.00 MB/sec execute 7 sec latency 21479.140 ms oleg455-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid 1 1000 0.05 MB/sec execute 8 sec latency 21707.221 ms mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec 1 1202 0.15 MB/sec execute 9 sec latency 43.465 ms 1 1447 0.43 MB/sec execute 10 sec latency 29.432 ms 1 1580 0.41 MB/sec execute 11 sec latency 75.028 ms 1 1747 0.38 MB/sec execute 12 sec latency 69.146 ms UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 4480 2204160 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 10240 3645440 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 41984 3570688 2% /mnt/lustre[OST:1] filesystem_summary: 7542784 52224 7216128 1% /mnt/lustre 1 1950 0.37 MB/sec execute 13 sec latency 56.444 ms 1 2150 0.36 MB/sec execute 14 sec latency 25.332 ms 1 2376 0.42 MB/sec execute 15 sec latency 78.252 ms test_26 fail mds1 2 times Failing mds1 on oleg455-server 1 2508 0.44 MB/sec execute 16 sec latency 76.243 ms Stopping /mnt/lustre-mds1 (opts:) on oleg455-server 1 2768 0.60 MB/sec execute 17 sec latency 314.661 ms 21:42:26 (1716255746) shut down 1 2768 0.57 MB/sec execute 18 sec latency 1314.976 ms 1 2768 0.54 MB/sec execute 19 sec latency 2315.288 ms 1 2768 0.51 MB/sec execute 20 sec latency 3315.603 ms 1 2768 0.49 MB/sec execute 21 sec latency 4315.978 ms 1 2768 0.47 MB/sec execute 22 sec latency 5316.266 ms 1 2768 0.45 MB/sec execute 23 sec latency 6316.496 
ms 1 2768 0.43 MB/sec execute 24 sec latency 7316.826 ms 1 2768 0.41 MB/sec execute 25 sec latency 8317.173 ms 1 2768 0.40 MB/sec execute 26 sec latency 9317.472 ms 1 2768 0.38 MB/sec execute 27 sec latency 10317.705 ms Failover mds1 to oleg455-server mount facets: mds1 1 2768 0.37 MB/sec execute 28 sec latency 11317.903 ms Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 1 2768 0.35 MB/sec execute 29 sec latency 12318.111 ms 1 2768 0.34 MB/sec execute 30 sec latency 13318.308 ms oleg455-server: oleg455-server.virtnet: executing set_default_debug -1 all 1 2768 0.33 MB/sec execute 31 sec latency 14318.503 ms pdsh@oleg455-client: oleg455-server: ssh exited with exit code 1 Started lustre-MDT0000 21:42:40 (1716255760) targets are mounted 21:42:40 (1716255760) facet_failover done 1 2768 0.32 MB/sec execute 32 sec latency 15318.697 ms 1 2768 0.31 MB/sec execute 33 sec latency 16318.915 ms 1 2768 0.30 MB/sec execute 34 sec latency 17319.158 ms 1 2768 0.29 MB/sec execute 35 sec latency 18319.392 ms 1 2768 0.29 MB/sec execute 36 sec latency 19319.615 ms 1 2768 0.28 MB/sec execute 37 sec latency 20319.925 ms oleg455-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec 1 3170 0.33 MB/sec execute 38 sec latency 20338.670 ms 1 3493 0.43 MB/sec execute 39 sec latency 48.729 ms 1 3693 0.43 MB/sec execute 40 sec latency 19.295 ms 1 3855 0.44 MB/sec execute 41 sec latency 23.385 ms UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 4992 2203648 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 12288 3574784 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 48128 3512320 2% /mnt/lustre[OST:1] filesystem_summary: 7542784 60416 7087104 1% /mnt/lustre 1 3985 0.44 MB/sec execute 42 sec latency 25.163 ms 1 4112 0.43 MB/sec execute 43 sec latency 35.923 ms 1 4281 0.42 MB/sec execute 44 sec latency 60.462 ms 1 4468 0.42 MB/sec execute 45 
sec latency 66.136 ms test_26 fail mds1 3 times Failing mds1 on oleg455-server Stopping /mnt/lustre-mds1 (opts:) on oleg455-server 1 4706 0.44 MB/sec execute 46 sec latency 70.291 ms 21:42:56 (1716255776) shut down 1 4722 0.43 MB/sec execute 47 sec latency 935.250 ms 1 4722 0.43 MB/sec execute 48 sec latency 1935.532 ms 1 4722 0.42 MB/sec execute 49 sec latency 2935.826 ms 1 4722 0.41 MB/sec execute 50 sec latency 3936.070 ms 1 4722 0.40 MB/sec execute 51 sec latency 4936.266 ms 1 4722 0.39 MB/sec execute 52 sec latency 5936.447 ms 1 4722 0.39 MB/sec execute 53 sec latency 6936.606 ms 1 4722 0.38 MB/sec execute 54 sec latency 7936.782 ms 1 4722 0.37 MB/sec execute 55 sec latency 8936.989 ms 1 4722 0.37 MB/sec execute 56 sec latency 9937.173 ms 1 4722 0.36 MB/sec execute 57 sec latency 10937.277 ms Failover mds1 to oleg455-server mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 1 4722 0.35 MB/sec execute 58 sec latency 11937.425 ms 1 4722 0.35 MB/sec execute 59 sec latency 12937.519 ms oleg455-server: oleg455-server.virtnet: executing set_default_debug -1 all 1 4722 0.34 MB/sec execute 60 sec latency 13937.672 ms pdsh@oleg455-client: oleg455-server: ssh exited with exit code 1 Started lustre-MDT0000 21:43:09 (1716255789) targets are mounted 21:43:09 (1716255789) facet_failover done 1 4722 0.34 MB/sec execute 61 sec latency 14937.912 ms 1 4722 0.33 MB/sec execute 62 sec latency 15938.156 ms 1 4722 0.32 MB/sec execute 63 sec latency 16938.381 ms 1 4722 0.32 MB/sec execute 64 sec latency 17938.585 ms 1 4722 0.31 MB/sec execute 65 sec latency 18938.883 ms 1 4722 0.31 MB/sec execute 66 sec latency 19939.089 ms 1 4722 0.31 MB/sec execute 67 sec latency 20939.303 ms oleg455-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid 1 4818 0.30 MB/sec execute 68 sec latency 21412.477 ms mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec 1 5058 0.34 MB/sec execute 69 sec latency 
87.815 ms 1 5199 0.34 MB/sec execute 70 sec latency 78.431 ms 1 5360 0.33 MB/sec execute 71 sec latency 27.469 ms 1 5547 0.33 MB/sec execute 72 sec latency 72.726 ms UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 6400 2202240 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 15360 3595264 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 50176 3559424 2% /mnt/lustre[OST:1] filesystem_summary: 7542784 65536 7154688 1% /mnt/lustre 1 5732 0.33 MB/sec execute 73 sec latency 19.113 ms 1 5949 0.34 MB/sec execute 74 sec latency 57.186 ms 1 6121 0.35 MB/sec execute 75 sec latency 70.557 ms test_26 fail mds1 4 times 1 6462 0.40 MB/sec execute 76 sec latency 70.472 ms Failing mds1 on oleg455-server Stopping /mnt/lustre-mds1 (opts:) on oleg455-server 1 6825 0.43 MB/sec execute 77 sec latency 70.513 ms 21:43:27 (1716255807) shut down 1 6825 0.42 MB/sec execute 78 sec latency 1004.520 ms 1 6825 0.41 MB/sec execute 79 sec latency 2004.776 ms 1 6825 0.41 MB/sec execute 80 sec latency 3004.994 ms 1 6825 0.40 MB/sec execute 81 sec latency 4005.209 ms 1 6825 0.40 MB/sec execute 82 sec latency 5005.418 ms 1 6825 0.39 MB/sec execute 83 sec latency 6005.678 ms 1 6825 0.39 MB/sec execute 84 sec latency 7005.935 ms 1 6825 0.39 MB/sec execute 85 sec latency 8006.108 ms 1 6825 0.38 MB/sec execute 86 sec latency 9006.257 ms 1 6825 0.38 MB/sec execute 87 sec latency 10006.403 ms Failover mds1 to oleg455-server 1 6825 0.37 MB/sec execute 88 sec latency 11006.546 ms mount facets: mds1 Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 1 6825 0.37 MB/sec execute 89 sec latency 12006.683 ms 1 6825 0.36 MB/sec execute 90 sec latency 13006.845 ms oleg455-server: oleg455-server.virtnet: executing set_default_debug -1 all pdsh@oleg455-client: oleg455-server: ssh exited with exit code 1 Started lustre-MDT0000 21:43:40 (1716255820) targets are mounted 21:43:40 (1716255820) facet_failover done 1 6825 0.36 MB/sec execute 91 sec latency 14006.976 ms 1 6825 0.36 
MB/sec execute 92 sec latency 15007.091 ms 1 6825 0.35 MB/sec execute 93 sec latency 16007.268 ms 1 6825 0.35 MB/sec execute 94 sec latency 17007.471 ms 1 6825 0.34 MB/sec execute 95 sec latency 18007.622 ms 1 6825 0.34 MB/sec execute 96 sec latency 19007.795 ms 1 6851 0.34 MB/sec execute 97 sec latency 19905.334 ms oleg455-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec 1 7211 0.37 MB/sec execute 98 sec latency 37.550 ms 1 7408 0.38 MB/sec execute 99 sec latency 19.672 ms 1 cleanup 100 sec 0 cleanup 101 sec Operation Count AvgLat MaxLat ---------------------------------------- NTCreateX 1150 32.335 21707.207 Close 846 2.778 9.548 Rename 50 18.275 53.758 Unlink 211 6.989 23.435 Qpathinfo 1065 22.637 19905.321 Qfileinfo 187 0.650 4.376 Qfsinfo 186 0.743 3.820 Sfileinfo 87 255.940 21412.346 Find 395 1.750 15.892 WriteX 569 2.825 10.382 ReadX 1849 0.290 36.301 LockX 4 2.369 2.704 UnlockX 4 2.162 2.605 Flush 66 348.730 20338.658 Throughput 0.375689 MB/sec 1 clients 1 procs max_latency=21707.221 ms stopping dbench on /mnt/lustre at Mon May 20 21:43:50 EDT 2024 with return code 0 clean dbench files on /mnt/lustre /mnt/lustre /mnt/lustre removed 'client.txt' /mnt/lustre dbench successfully finished UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 7168 2201472 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 16384 3489792 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 51200 3465216 2% /mnt/lustre[OST:1] filesystem_summary: 7542784 67584 6955008 1% /mnt/lustre looking for dbench program /usr/bin/dbench found dbench client file /usr/share/dbench/client.txt '/usr/share/dbench/client.txt' -> 'client.txt' running 'dbench 1 -D /mnt/lustre2/d26.replay-dual/run_dbench -t 100' on /mnt/lustre at Mon May 20 21:43:51 EDT 2024 waiting for dbench pid 21703 dbench version 4.00 - Copyright Andrew Tridgell 1999-2004 Running for 100 seconds 
with load 'client.txt' and minimum warmup 20 secs 0 of 1 processes prepared for launch 0 sec 1 of 1 processes prepared for launch 0 sec releasing clients 1 199 7.84 MB/sec warmup 1 sec latency 32.874 ms 1 358 6.39 MB/sec warmup 2 sec latency 29.226 ms test_26 fail mds1 5 times Failing mds1 on oleg455-server Stopping /mnt/lustre-mds1 (opts:) on oleg455-server 1 523 6.02 MB/sec warmup 3 sec latency 30.158 ms 1 606 5.24 MB/sec warmup 4 sec latency 814.526 ms 21:43:56 (1716255836) shut down 1 606 4.19 MB/sec warmup 5 sec latency 1814.832 ms 1 606 3.49 MB/sec warmup 6 sec latency 2815.240 ms 1 606 3.00 MB/sec warmup 7 sec latency 3815.585 ms 1 606 2.62 MB/sec warmup 8 sec latency 4815.935 ms 1 606 2.33 MB/sec warmup 9 sec latency 5816.171 ms 1 606 2.10 MB/sec warmup 10 sec latency 6816.485 ms 1 606 1.91 MB/sec warmup 11 sec latency 7816.723 ms 1 606 1.75 MB/sec warmup 12 sec latency 8817.024 ms 1 606 1.61 MB/sec warmup 13 sec latency 9817.275 ms 1 606 1.50 MB/sec warmup 14 sec latency 10817.554 ms Failover mds1 to oleg455-server mount facets: mds1 1 606 1.40 MB/sec warmup 15 sec latency 11817.814 ms Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 1 606 1.31 MB/sec warmup 16 sec latency 12818.113 ms 1 606 1.23 MB/sec warmup 17 sec latency 13818.359 ms oleg455-server: oleg455-server.virtnet: executing set_default_debug -1 all pdsh@oleg455-client: oleg455-server: ssh exited with exit code 1 Started lustre-MDT0000 21:44:09 (1716255849) targets are mounted 21:44:09 (1716255849) facet_failover done 1 606 1.16 MB/sec warmup 18 sec latency 14818.594 ms 1 606 1.10 MB/sec warmup 19 sec latency 15818.824 ms 1 606 0.00 MB/sec execute 1 sec latency 17819.370 ms 1 606 0.00 MB/sec execute 2 sec latency 18819.766 ms 1 606 0.00 MB/sec execute 3 sec latency 19820.054 ms 1 606 0.00 MB/sec execute 4 sec latency 20820.314 ms 1 654 0.02 MB/sec execute 5 sec latency 21669.926 ms oleg455-client.virtnet: executing wait_import_state_mount (FULL|IDLE) 
mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec 1 827 0.09 MB/sec execute 6 sec latency 59.631 ms 1 980 0.13 MB/sec execute 7 sec latency 58.856 ms 1 1160 0.26 MB/sec execute 8 sec latency 56.765 ms 1 1421 0.56 MB/sec execute 9 sec latency 25.211 ms UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 8960 2199680 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 37888 3433472 2% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 20480 3552256 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 58368 6985728 1% /mnt/lustre 1 1587 0.52 MB/sec execute 10 sec latency 77.140 ms 1 1825 0.49 MB/sec execute 11 sec latency 38.595 ms 1 2030 0.47 MB/sec execute 12 sec latency 62.092 ms test_26 fail mds1 6 times Failing mds1 on oleg455-server 1 2284 0.49 MB/sec execute 13 sec latency 21.311 ms Stopping /mnt/lustre-mds1 (opts:) on oleg455-server 1 2377 0.50 MB/sec execute 14 sec latency 511.242 ms 21:44:26 (1716255866) shut down 1 2377 0.47 MB/sec execute 15 sec latency 1511.562 ms 1 2377 0.44 MB/sec execute 16 sec latency 2511.903 ms 1 2377 0.41 MB/sec execute 17 sec latency 3512.253 ms 1 2377 0.39 MB/sec execute 18 sec latency 4512.487 ms 1 2377 0.37 MB/sec execute 19 sec latency 5512.877 ms 1 2377 0.35 MB/sec execute 20 sec latency 6513.241 ms 1 2377 0.33 MB/sec execute 21 sec latency 7513.592 ms 1 2377 0.32 MB/sec execute 22 sec latency 8513.934 ms 1 2377 0.30 MB/sec execute 23 sec latency 9514.266 ms 1 2377 0.29 MB/sec execute 24 sec latency 10514.595 ms Failover mds1 to oleg455-server mount facets: mds1 1 2377 0.28 MB/sec execute 25 sec latency 11514.864 ms Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 1 2377 0.27 MB/sec execute 26 sec latency 12515.076 ms oleg455-server: oleg455-server.virtnet: executing set_default_debug -1 all pdsh@oleg455-client: oleg455-server: ssh exited with exit code 1 1 2377 0.26 MB/sec execute 27 sec latency 13515.255 ms Started lustre-MDT0000 21:44:39 
(1716255879) targets are mounted 21:44:39 (1716255879) facet_failover done 1 2377 0.25 MB/sec execute 28 sec latency 14515.467 ms 1 2377 0.24 MB/sec execute 29 sec latency 15515.746 ms 1 2377 0.23 MB/sec execute 30 sec latency 16515.996 ms 1 2377 0.23 MB/sec execute 31 sec latency 17516.215 ms 1 2377 0.22 MB/sec execute 32 sec latency 18516.445 ms 1 2377 0.21 MB/sec execute 33 sec latency 19516.685 ms 1 2377 0.21 MB/sec execute 34 sec latency 20516.866 ms 1 2377 0.20 MB/sec execute 35 sec latency 21517.122 ms oleg455-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec 1 2508 0.22 MB/sec execute 36 sec latency 21616.570 ms 1 2902 0.32 MB/sec execute 37 sec latency 73.018 ms 1 3185 0.35 MB/sec execute 38 sec latency 73.751 ms 1 3399 0.37 MB/sec execute 39 sec latency 55.720 ms 1 3674 0.44 MB/sec execute 40 sec latency 17.647 ms UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 2210688 9856 2198784 1% /mnt/lustre[MDT:0] lustre-OST0000_UUID 3771392 47104 3491840 2% /mnt/lustre[OST:0] lustre-OST0001_UUID 3771392 28672 3502080 1% /mnt/lustre[OST:1] filesystem_summary: 7542784 75776 6993920 2% /mnt/lustre 1 3852 0.46 MB/sec execute 41 sec latency 25.447 ms 1 3983 0.45 MB/sec execute 42 sec latency 25.479 ms 1 4120 0.44 MB/sec execute 43 sec latency 26.896 ms test_26 fail mds1 7 times Failing mds1 on oleg455-server Stopping /mnt/lustre-mds1 (opts:) on oleg455-server 1 4292 0.44 MB/sec execute 44 sec latency 106.736 ms 1 4340 0.43 MB/sec execute 45 sec latency 772.895 ms 21:44:56 (1716255896) shut down 1 4340 0.42 MB/sec execute 46 sec latency 1773.173 ms 1 4340 0.42 MB/sec execute 47 sec latency 2773.367 ms 1 4340 0.41 MB/sec execute 48 sec latency 3773.556 ms 1 4340 0.40 MB/sec execute 49 sec latency 4773.722 ms 1 4340 0.39 MB/sec execute 50 sec latency 5773.896 ms 1 4340 0.38 MB/sec execute 51 sec latency 6774.074 ms 1 4340 0.38 MB/sec 
execute 52 sec latency 7774.248 ms 1 4340 0.37 MB/sec execute 53 sec latency 8774.452 ms 1 4340 0.36 MB/sec execute 54 sec latency 9774.615 ms 1 4340 0.35 MB/sec execute 55 sec latency 10774.801 ms Failover mds1 to oleg455-server mount facets: mds1 1 4340 0.35 MB/sec execute 56 sec latency 11774.968 ms Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1 1 4340 0.34 MB/sec execute 57 sec latency 12775.105 ms oleg455-server: oleg455-server.virtnet: executing set_default_debug -1 all 1 4340 0.34 MB/sec execute 58 sec latency 13775.238 ms pdsh@oleg455-client: oleg455-server: ssh exited with exit code 1 Started lustre-MDT0000 21:45:10 (1716255910) targets are mounted 21:45:10 (1716255910) facet_failover done 1 4340 0.33 MB/sec execute 59 sec latency 14775.419 ms 1 4340 0.33 MB/sec execute 60 sec latency 15775.622 ms 1 4340 0.32 MB/sec execute 61 sec latency 16775.912 ms 1 4340 0.31 MB/sec execute 62 sec latency 17776.162 ms 1 4340 0.31 MB/sec execute 63 sec latency 18776.405 ms 1 4340 0.30 MB/sec execute 64 sec latency 19776.612 ms 1 4340 0.30 MB/sec execute 65 sec latency 20776.850 ms oleg455-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid 1 4458 0.30 MB/sec execute 66 sec latency 20968.995 ms mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec 1 4661 0.31 MB/sec execute 67 sec latency 65.021 ms tar: Unexpected EOF in archive tar: Unexpected EOF in archive 1 4862 0.32 MB/sec execute 68 sec latency 25.087 ms 1 5123 0.35 MB/sec execute 69 sec latency 72.784 ms 1 5294 0.35 MB/sec execute 70 sec latency 55.570 ms 1 5530 0.35 MB/sec execute 71 sec latency 78.799 ms tar: Error is not recoverable: exiting now dbench killed by signal 15 stopping dbench on /mnt/lustre at Mon May 20 21:45:23 EDT 2024 with return code 0 21703 pts/0 S+ 0:00 dbench -c client.txt 1 -D /mnt/lustre2/d26.replay-dual/run_dbench -t 100 killed dbench main pid 21703 clean dbench files on /mnt/lustre /mnt/lustre /mnt/lustre 
removed 'client.txt'
/mnt/lustre
dbench successfully finished
PASS 26 (257s)
debug_raw_pointers=0
debug_raw_pointers=0
debug_raw_pointers=Y
debug_raw_pointers=Y
== replay-dual test 28: lock replay should be ordered: waiting after granted ========================================================== 21:46:07 (1716255967)
1+0 records in
1+0 records out
4096 bytes (4.1 kB) copied, 0.00388709 s, 1.1 MB/s
fail_loc=0x80000324
fail_loc=0x32a
Failing ost1 on oleg455-server
Stopping /mnt/lustre-ost1 (opts:) on oleg455-server
21:46:11 (1716255971) shut down
Failover ost1 to oleg455-server
mount facets: ost1
Starting ost1: -o localrecov lustre-ost1/ost1 /mnt/lustre-ost1
seq.cli-lustre-OST0000-super.width=65536
oleg455-server: oleg455-server.virtnet: executing set_default_debug -1 all
pdsh@oleg455-client: oleg455-server: ssh exited with exit code 1
1+0 records in
1+0 records out
4096 bytes (4.1 kB) copied, 0.00569381 s, 719 kB/s
Started lustre-OST0000
21:46:25 (1716255985) targets are mounted
21:46:25 (1716255985) facet_failover done
oleg455-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid
osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec
PASS 28 (25s)
debug_raw_pointers=0
debug_raw_pointers=0
debug_raw_pointers=Y
debug_raw_pointers=Y
== replay-dual test 29: replay vs update with the same xid ========================================================== 21:46:34 (1716255994)
SKIP: replay-dual test_29 needs >= 2 MDTs
SKIP 29 (1s)
debug_raw_pointers=0
debug_raw_pointers=0
debug_raw_pointers=Y
debug_raw_pointers=Y
== replay-dual test 30: layout lock replay is not blocked on IO ========================================================== 21:46:38 (1716255998)
10+0 records in
10+0 records out
40960 bytes (41 kB) copied, 0.013379 s, 3.1 MB/s
10+0 records in
10+0 records out
40960 bytes (41 kB) copied, 0.0132383 s, 3.1 MB/s
fail_loc=0x32e
fail_val=4
Failing mds1 on oleg455-server
Stopping /mnt/lustre-mds1 (opts:) on oleg455-server
21:46:41 (1716256001) shut down
Failover mds1 to oleg455-server
mount facets: mds1
Starting mds1: -o localrecov lustre-mdt1/mdt1 /mnt/lustre-mds1
oleg455-server: oleg455-server.virtnet: executing set_default_debug -1 all
pdsh@oleg455-client: oleg455-server: ssh exited with exit code 1
Started lustre-MDT0000
21:46:55 (1716256015) targets are mounted
21:46:55 (1716256015) facet_failover done
160+0 records in
160+0 records out
81920 bytes (82 kB) copied, 19.5409 s, 4.2 kB/s
oleg455-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid
mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec
PASS 30 (24s)
debug_raw_pointers=0
debug_raw_pointers=0
debug_raw_pointers=Y
debug_raw_pointers=Y
== replay-dual test 31: deadlock on file_remove_privs and occupied mod rpc slots ========================================================== 21:47:04 (1716256024)
Failing ost1 on oleg455-server
Stopping /mnt/lustre-ost1 (opts:) on oleg455-server
21:47:07 (1716256027) shut down
Creating to objid 2945 on ost lustre-OST0000...
total: 32 open/close in 0.30 seconds: 105.98 ops/second
at_max=0
fail_loc=0x80001420
file /mnt/lustre2/d31.replay-dual/mdtdir/f31.replay-dual is not ready, wait 0.5 second...
file /mnt/lustre2/d31.replay-dual/mdtdir/f31.replay-dual is ready
Failover ost1 to oleg455-server
mount facets: ost1
Starting ost1: -o localrecov lustre-ost1/ost1 /mnt/lustre-ost1
seq.cli-lustre-OST0000-super.width=65536
oleg455-server: oleg455-server.virtnet: executing set_default_debug -1 all
pdsh@oleg455-client: oleg455-server: ssh exited with exit code 1
Started lustre-OST0000
21:47:21 (1716256041) targets are mounted
21:47:21 (1716256041) facet_failover done
oleg455-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid
osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL IDLE state after 0 sec
pids: 29323 29326 29331 29332 29333 29334 29335 29336 29337
at_max=600
PASS 31 (22s)
debug_raw_pointers=0
debug_raw_pointers=0
debug_raw_pointers=Y
debug_raw_pointers=Y
== replay-dual test 32: gap in update llog shouldn't break recovery ========================================================== 21:47:28 (1716256048)
SKIP: replay-dual test_32 needs >= 2 MDTs
SKIP 32 (1s)
debug_raw_pointers=0
debug_raw_pointers=0
debug_raw_pointers=Y
debug_raw_pointers=Y
== replay-dual test 33: Check for OBD_INCOMPAT_MULTI_RPCS in last_rcvd after abort_recovery ========================================================== 21:47:31 (1716256051)
SKIP: replay-dual test_33 ldiskfs only test
SKIP 33 (0s)
debug_raw_pointers=0
debug_raw_pointers=0
== replay-dual test complete, duration 2180 sec ========== 21:47:32 (1716256052)
=== replay-dual: start cleanup 21:47:32 (1716256052) ===
Stopping clients: oleg455-client.virtnet /mnt/lustre2 (opts:)
Stopping client oleg455-client.virtnet /mnt/lustre2 opts:
=== replay-dual: finish cleanup 21:47:33 (1716256053) ===