-----============= acceptance-small: replay-single ============----- Mon Mar 16 09:38:34 EDT 2026 mgs: Rocky Linux release 8.10 (Green Obsidian) MGS_OS_ID_LIKE=rhel centos fedora rocky MGS_OS_VERSION_ID=8.10 MGS_OS_ID=rocky MGS_OS_VERSION_CODE=134873088 mds1: Rocky Linux release 8.10 (Green Obsidian) MDS1_OS_VERSION_ID=8.10 MDS1_OS_VERSION_CODE=134873088 MDS1_OS_ID_LIKE=rhel centos fedora rocky MDS1_OS_ID=rocky ost1: Rocky Linux release 8.10 (Green Obsidian) OST1_OS_VERSION_CODE=134873088 OST1_OS_ID_LIKE=rhel centos fedora rocky OST1_OS_VERSION_ID=8.10 OST1_OS_ID=rocky client: Rocky Linux release 8.10 (Green Obsidian) CLIENT_OS_ID=rocky CLIENT_OS_VERSION_CODE=134873088 CLIENT_OS_VERSION_ID=8.10 CLIENT_OS_ID_LIKE=rhel centos fedora rocky oleg132-server: ls: cannot access '/home/green/git/lustre-release/lustre/tests/except/replay-single.*ex': No such file or directory excepting tests: 59 36 === replay-single: start setup 09:38:44 (1773668324) === oleg132-client.virtnet: executing check_config_client /mnt/lustre oleg132-client.virtnet: Checking config lustre mounted on /mnt/lustre Checking servers environments Checking clients oleg132-client.virtnet environments Using TIMEOUT=20 osc.lustre-OST0000-osc-ffff8a4b9072d000.idle_timeout=debug osc.lustre-OST0001-osc-ffff8a4b9072d000.idle_timeout=debug disable quota as required oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all osd-ldiskfs.track_declares_assert=1 === replay-single: finish setup 09:39:04 (1773668344) === == replay-single test 0a: empty replay =================== 09:39:05 (1773668345) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1712 1269592 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1532 1269772 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1544 3605476 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1544 3605476 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3088 7210952 1% /mnt/lustre Failing mds1 on oleg132-server Stopping /mnt/lustre-mds1 (opts:) on oleg132-server 09:39:12 (1773668352) shut down facet: mds1 facet_host: oleg132-server facet_failover_host: oleg132-server Failover mds1 to oleg132-server mount facets: mds1 Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0000 09:39:29 (1773668369) targets are mounted 09:39:29 (1773668369) facet_failover done oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 0a (33s) == replay-single test 0b: ensure object created after recover exists. (3284) ========================================================== 09:39:38 (1773668378) Failing ost1 on oleg132-server Stopping /mnt/lustre-ost1 (opts:) on oleg132-server 09:39:41 (1773668381) shut down facet: ost1 facet_host: oleg132-server facet_failover_host: oleg132-server Failover ost1 to oleg132-server mount facets: ost1 Start ost1: mount -t lustre -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-OST0000 09:39:58 (1773668398) targets are mounted 09:39:58 (1773668398) facet_failover done oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec total: 20 open/close in 0.20 seconds: 98.43 ops/second - unlinked 0 (time 1773668403 ; total 0 ; last 0) total: 20 unlinks in 0 seconds: inf unlinks/second PASS 0b (28s) == replay-single test 0c: check replay-barrier =========== 09:40:06 (1773668406) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1676 1269628 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1532 1269772 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1560 3605460 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1544 3605476 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3104 7210936 1% /mnt/lustre Failing mds1 on oleg132-server Stopping /mnt/lustre-mds1 (opts:) on oleg132-server 09:40:12 (1773668412) shut down facet: mds1 facet_host: oleg132-server facet_failover_host: oleg132-server Failover mds1 to oleg132-server mount facets: mds1 Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0000 09:40:30 (1773668430) targets are mounted 09:40:30 (1773668430) facet_failover done Starting client: oleg132-client.virtnet: -o user_xattr,flock 192.168.201.132@tcp:/lustre /mnt/lustre rm: cannot remove '/mnt/lustre/f0c.replay-single': No such file or directory PASS 0c (100s) == replay-single test 0d: expired recovery with no clients ========================================================== 09:41:46 (1773668506) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1676 1269628 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1532 1269772 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1560 3605460 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1544 3605476 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3104 7210936 1% /mnt/lustre Failing mds1 on oleg132-server Stopping /mnt/lustre-mds1 (opts:) on oleg132-server 09:41:52 (1773668512) shut down facet: mds1 facet_host: oleg132-server facet_failover_host: oleg132-server Failover mds1 to oleg132-server mount facets: mds1 Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0000 09:42:11 (1773668531) targets are mounted 09:42:11 (1773668531) facet_failover done Starting client: oleg132-client.virtnet: -o user_xattr,flock 192.168.201.132@tcp:/lustre /mnt/lustre PASS 0d (108s) == replay-single test 1: simple create =================== 09:43:35 (1773668615) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1676 1269628 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1532 1269772 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1560 3605460 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1544 3605476 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3104 7210936 1% /mnt/lustre Failing mds1 on oleg132-server Stopping /mnt/lustre-mds1 (opts:) on oleg132-server 09:43:48 (1773668628) shut down facet: mds1 facet_host: oleg132-server facet_failover_host: oleg132-server Failover mds1 to oleg132-server mount facets: mds1 Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0000 09:44:23 (1773668663) targets are mounted 09:44:23 (1773668663) facet_failover done oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/f1.replay-single has type file OK PASS 1 (62s) == replay-single test 2a: touch ========================== 09:44:36 (1773668676) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1676 1269628 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1532 1269772 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1560 3605460 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1544 3605476 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3104 7210936 1% /mnt/lustre Failing mds1 on oleg132-server Stopping /mnt/lustre-mds1 (opts:) on oleg132-server 09:44:47 (1773668687) shut down facet: mds1 facet_host: oleg132-server facet_failover_host: oleg132-server Failover mds1 to oleg132-server mount facets: mds1 Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0000 09:45:21 (1773668721) targets are mounted 09:45:21 (1773668721) facet_failover done oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/f2a.replay-single has type file OK PASS 2a (57s) == replay-single test 2b: touch ========================== 09:45:33 (1773668733) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1676 1269628 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1532 1269772 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1560 3605460 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1544 3605476 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3104 7210936 1% /mnt/lustre Failing mds1 on oleg132-server Stopping /mnt/lustre-mds1 (opts:) on oleg132-server 09:45:43 (1773668743) shut down facet: mds1 facet_host: oleg132-server facet_failover_host: oleg132-server Failover mds1 to oleg132-server mount facets: mds1 Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0000 09:46:07 (1773668767) targets are mounted 09:46:07 (1773668767) facet_failover done oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/f2b.replay-single has type file OK PASS 2b (45s) == replay-single test 2c: setstripe replay =============== 09:46:18 (1773668778) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1676 1269628 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1532 1269772 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1560 3605460 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1544 3605476 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3104 7210936 1% /mnt/lustre Failing mds1 on oleg132-server Stopping /mnt/lustre-mds1 (opts:) on oleg132-server 09:46:29 (1773668789) shut down facet: mds1 facet_host: oleg132-server facet_failover_host: oleg132-server Failover mds1 to oleg132-server mount facets: mds1 Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0000 09:46:54 (1773668814) targets are mounted 09:46:54 (1773668814) facet_failover done oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/f2c.replay-single has type file OK PASS 2c (49s) == replay-single test 2d: setdirstripe replay ============ 09:47:07 (1773668827) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1676 1269628 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1532 1269772 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1560 3605460 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1544 3605476 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3104 7210936 1% /mnt/lustre Failing mds1 on oleg132-server Stopping /mnt/lustre-mds1 (opts:) on oleg132-server 09:47:19 (1773668839) shut down facet: mds1 facet_host: oleg132-server facet_failover_host: oleg132-server Failover mds1 to oleg132-server mount facets: mds1 Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0000 09:47:40 (1773668860) targets are mounted 09:47:40 (1773668860) facet_failover done oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/d2d.replay-single has type dir OK PASS 2d (45s) == replay-single test 2e: O_CREAT|O_EXCL create replay === 09:47:53 (1773668873) fail_loc=0x8000013b UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1784 1269520 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1608 1269696 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1560 3605460 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1544 3605476 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3104 7210936 1% /mnt/lustre Failing mds1 on oleg132-server Stopping /mnt/lustre-mds1 (opts:) on oleg132-server 09:48:06 (1773668886) shut down facet: mds1 facet_host: oleg132-server facet_failover_host: oleg132-server Failover mds1 to oleg132-server mount facets: mds1 Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Succeed in opening file "/mnt/lustre/f2e.replay-single"(flags=O_CREAT) Started lustre-MDT0000 09:48:40 (1773668920) targets are mounted 09:48:40 (1773668920) facet_failover done oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/f2e.replay-single has type file OK PASS 2e (60s) == replay-single test 3a: replay failed open(O_DIRECTORY) ========================================================== 09:48:53 (1773668933) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1760 1269544 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1608 1269696 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1560 3605460 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1544 3605476 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3104 7210936 1% /mnt/lustre Error in opening file "/mnt/lustre/f3a.replay-single"(flags=O_DIRECTORY) 20: Not a directory Failing mds1 on oleg132-server Stopping /mnt/lustre-mds1 (opts:) on oleg132-server 09:49:04 (1773668944) shut down facet: mds1 facet_host: oleg132-server facet_failover_host: oleg132-server Failover mds1 to oleg132-server mount facets: mds1 Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0000 09:49:35 (1773668975) targets are mounted 09:49:35 (1773668975) facet_failover done oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/f3a.replay-single has type file OK PASS 3a (53s) == replay-single test 3b: replay failed open -ENOMEM ===== 09:49:46 (1773668986) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1760 1269544 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1608 1269696 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1560 3605460 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1544 3605476 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3104 7210936 1% /mnt/lustre fail_loc=0x80000114 touch: cannot touch '/mnt/lustre/f3b.replay-single': Cannot allocate memory fail_loc=0 Failing mds1 on oleg132-server Stopping /mnt/lustre-mds1 (opts:) on oleg132-server 09:49:58 (1773668998) shut down facet: mds1 facet_host: oleg132-server facet_failover_host: oleg132-server Failover mds1 to oleg132-server mount facets: mds1 Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0000 09:50:20 (1773669020) targets are mounted 09:50:20 (1773669020) facet_failover done oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Can't lstat /mnt/lustre/f3b.replay-single: No such file or directory PASS 3b (44s) == replay-single test 3c: replay failed open -ENOMEM ===== 09:50:31 (1773669031) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1760 1269544 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1608 1269696 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1560 3605460 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1544 3605476 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3104 7210936 1% /mnt/lustre fail_loc=0x80000128 touch: cannot touch '/mnt/lustre/f3c.replay-single': Cannot allocate memory fail_loc=0 Failing mds1 on oleg132-server Stopping /mnt/lustre-mds1 (opts:) on oleg132-server 09:50:39 (1773669039) shut down facet: mds1 facet_host: oleg132-server facet_failover_host: oleg132-server Failover mds1 to oleg132-server mount facets: mds1 Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0000 09:51:01 (1773669061) targets are mounted 09:51:01 (1773669061) facet_failover done oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Can't lstat /mnt/lustre/f3c.replay-single: No such file or directory PASS 3c (39s) == replay-single test 4a: |x| 10 open(O_CREAT)s ========== 09:51:11 (1773669071) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1760 1269544 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1608 1269696 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1560 3605460 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1544 3605476 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3104 7210936 1% /mnt/lustre Failing mds1 on oleg132-server Stopping /mnt/lustre-mds1 (opts:) on oleg132-server 09:51:20 (1773669080) shut down facet: mds1 facet_host: oleg132-server facet_failover_host: oleg132-server Failover mds1 to oleg132-server mount facets: mds1 Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0000 09:51:42 (1773669102) targets are mounted 09:51:42 (1773669102) facet_failover done oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 4a (43s) == replay-single test 4b: |x| rm 10 files ================ 09:51:53 (1773669113) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1760 1269544 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1608 1269696 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1580 3605440 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1564 3605456 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3144 7210896 1% /mnt/lustre Failing mds1 on oleg132-server Stopping /mnt/lustre-mds1 (opts:) on oleg132-server 09:52:02 (1773669122) shut down facet: mds1 facet_host: oleg132-server facet_failover_host: oleg132-server Failover mds1 to oleg132-server mount facets: mds1 Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0000 09:52:21 (1773669141) targets are mounted 09:52:21 (1773669141) facet_failover done oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Can't lstat /mnt/lustre/f4b.replay-single-*: No such file or directory PASS 4b (38s) == replay-single test 5: |x| 220 open(O_CREAT) =========== 09:52:31 (1773669151) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1760 1269544 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1608 1269696 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1584 3605436 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1564 3605456 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3148 7210892 1% /mnt/lustre Failing mds1 on oleg132-server Stopping /mnt/lustre-mds1 (opts:) on oleg132-server 09:52:45 (1773669165) shut down facet: mds1 facet_host: oleg132-server facet_failover_host: oleg132-server Failover mds1 to oleg132-server mount facets: mds1 Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0000 09:53:27 (1773669207) targets are mounted 09:53:27 (1773669207) facet_failover done oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 5 (106s) == replay-single test 6a: mkdir + contained create ======= 09:54:17 (1773669257) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1784 1269520 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1608 1269696 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1580 3605440 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1564 3605456 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3144 7210896 1% /mnt/lustre Failing mds1 on oleg132-server Stopping /mnt/lustre-mds1 (opts:) on oleg132-server 09:54:37 (1773669277) shut down facet: mds1 facet_host: oleg132-server facet_failover_host: oleg132-server Failover mds1 to oleg132-server mount facets: mds1 Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0000 09:55:12 (1773669312) targets are mounted 09:55:12 (1773669312) facet_failover done oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/d6a.replay-single has type dir OK /mnt/lustre/d6a.replay-single/f6a.replay-single has type file OK PASS 6a (75s) == replay-single test 6b: |X| rmdir ====================== 09:55:32 (1773669332) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1788 1269516 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1608 1269696 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1580 3605440 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1564 3605456 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3144 7210896 1% /mnt/lustre Failing mds1 on oleg132-server Stopping /mnt/lustre-mds1 (opts:) on oleg132-server 09:55:49 (1773669349) shut down facet: mds1 facet_host: oleg132-server facet_failover_host: oleg132-server Failover mds1 to oleg132-server mount facets: mds1 Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0000 09:56:24 (1773669384) targets are mounted 09:56:24 (1773669384) facet_failover done oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Can't lstat /mnt/lustre/d6b.replay-single: No such file or directory PASS 6b (71s) == replay-single test 7: mkdir |X| contained create ====== 09:56:44 (1773669404) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1788 1269516 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1608 1269696 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1580 3605440 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1564 3605456 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3144 7210896 1% /mnt/lustre Failing mds1 on oleg132-server Stopping /mnt/lustre-mds1 (opts:) on oleg132-server 09:57:00 (1773669420) shut down facet: mds1 facet_host: oleg132-server facet_failover_host: oleg132-server Failover mds1 to oleg132-server mount facets: mds1 Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0000 09:57:36 (1773669456) targets are mounted 09:57:36 (1773669456) facet_failover done oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/d7.replay-single has type dir OK /mnt/lustre/d7.replay-single/f7.replay-single has type file OK PASS 7 (68s) == replay-single test 8: creat open |X| close ============ 09:57:51 (1773669471) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1784 1269520 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1608 1269696 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1580 3605440 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1564 3605456 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3144 7210896 1% /mnt/lustre multiop /mnt/lustre/f8.replay-single vmo_c TMPPIPE=/tmp/multiop_open_wait_pipe.8185 Failing mds1 on oleg132-server Stopping /mnt/lustre-mds1 (opts:) on oleg132-server 09:58:05 (1773669485) shut down facet: mds1 facet_host: oleg132-server facet_failover_host: oleg132-server Failover mds1 to oleg132-server mount facets: mds1 Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0000 09:58:38 (1773669518) targets are mounted 09:58:38 (1773669518) facet_failover done oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/f8.replay-single /mnt/lustre/f8.replay-single has type file OK PASS 8 (64s) == replay-single test 9: |X| create (same inum/gen) ====== 09:58:56 (1773669536) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1784 1269520 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1608 1269696 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1580 3605440 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1564 3605456 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3144 7210896 1% /mnt/lustre Failing mds1 on oleg132-server Stopping /mnt/lustre-mds1 (opts:) on oleg132-server 09:59:14 (1773669554) shut down facet: mds1 facet_host: oleg132-server facet_failover_host: oleg132-server Failover mds1 to oleg132-server mount facets: mds1 Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0000 10:00:01 (1773669601) targets are mounted 10:00:01 (1773669601) facet_failover done oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec old_inum == 144115305935798546, new_inum == 144115305935798546 old_inum and new_inum match PASS 9 (83s) == replay-single test 10: create |X| rename unlink ======= 10:00:18 (1773669618) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1784 1269520 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1608 1269696 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1580 3605440 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1564 3605456 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3144 7210896 1% /mnt/lustre Failing mds1 on oleg132-server Stopping /mnt/lustre-mds1 (opts:) on oleg132-server 10:00:35 (1773669635) shut down facet: mds1 facet_host: oleg132-server facet_failover_host: oleg132-server Failover mds1 to oleg132-server mount facets: mds1 Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0000 10:01:07 (1773669667) targets are mounted 10:01:07 (1773669667) facet_failover done oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Can't lstat /mnt/lustre/f10.replay-single: No such file or directory PASS 10 (65s) == replay-single test 11: create open write rename |X| create-old-name read ========================================================== 10:01:23 (1773669683) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1784 1269520 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1608 1269696 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1580 3605440 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1564 3605456 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3144 7210896 1% /mnt/lustre new old Failing mds1 on oleg132-server Stopping /mnt/lustre-mds1 (opts:) on oleg132-server 10:01:37 (1773669697) shut down facet: mds1 facet_host: oleg132-server facet_failover_host: oleg132-server Failover mds1 to oleg132-server mount facets: mds1 Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0000 10:02:09 (1773669729) targets are mounted 10:02:09 (1773669729) facet_failover done oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec new old PASS 11 (59s) == replay-single test 12: open, unlink |X| close ========= 10:02:22 (1773669742) multiop /mnt/lustre/f12.replay-single vo_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.8185 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1784 1269520 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1608 1269696 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1584 3605436 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1568 3605452 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3152 7210888 1% /mnt/lustre Failing mds1 on oleg132-server Stopping /mnt/lustre-mds1 (opts:) on oleg132-server 10:02:34 (1773669754) shut down facet: mds1 facet_host: oleg132-server facet_failover_host: oleg132-server Failover mds1 to oleg132-server mount facets: mds1 Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0000 10:03:04 (1773669784) targets are mounted 10:03:05 (1773669785) facet_failover done oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 12 (55s) == replay-single test 13: open chmod 0 |x| write close === 10:03:17 (1773669797) multiop /mnt/lustre/f13.replay-single vO_wc TMPPIPE=/tmp/multiop_open_wait_pipe.8185 /mnt/lustre/f13.replay-single has perms 00 OK UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1784 1269520 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1608 1269696 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1584 3605436 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1568 3605452 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3152 7210888 1% /mnt/lustre Failing mds1 on oleg132-server Stopping /mnt/lustre-mds1 (opts:) on oleg132-server 10:03:29 (1773669809) shut down facet: mds1 facet_host: oleg132-server facet_failover_host: oleg132-server Failover mds1 to oleg132-server mount facets: mds1 Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0000 10:04:03 (1773669843) targets are mounted 10:04:03 (1773669843) facet_failover done oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/f13.replay-single has perms 00 OK /mnt/lustre/f13.replay-single has size 1 OK PASS 13 (57s) == replay-single test 14: open(O_CREAT), unlink |X| close ========================================================== 10:04:15 (1773669855) multiop /mnt/lustre/f14.replay-single vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.8185 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1784 1269520 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1608 1269696 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1584 3605436 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1568 3605452 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3152 7210888 1% /mnt/lustre Failing mds1 on oleg132-server Stopping /mnt/lustre-mds1 (opts:) on oleg132-server 10:04:24 (1773669864) shut down facet: mds1 facet_host: oleg132-server facet_failover_host: oleg132-server Failover mds1 to oleg132-server mount facets: mds1 Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0000 10:04:47 (1773669887) targets are mounted 10:04:47 (1773669887) facet_failover done oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 14 (43s) == replay-single test 15: open(O_CREAT), unlink |X| touch new, close ========================================================== 10:04:58 (1773669898) multiop /mnt/lustre/f15.replay-single vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.8185 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1784 1269520 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1608 1269696 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1584 3605436 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1568 3605452 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3152 7210888 1% /mnt/lustre Failing mds1 on oleg132-server Stopping /mnt/lustre-mds1 (opts:) on oleg132-server 10:05:06 (1773669906) shut down facet: mds1 facet_host: oleg132-server facet_failover_host: oleg132-server Failover mds1 to oleg132-server mount facets: mds1 Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0000 10:05:25 (1773669925) targets are mounted 10:05:25 (1773669925) facet_failover done oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 15 (36s) == replay-single test 16: |X| open(O_CREAT), unlink, touch new, unlink new ========================================================== 10:05:34 (1773669934) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1784 1269520 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1608 1269696 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1584 3605436 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1568 3605452 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3152 7210888 1% /mnt/lustre Failing mds1 on oleg132-server Stopping /mnt/lustre-mds1 (opts:) on oleg132-server 10:05:41 (1773669941) shut down facet: mds1 facet_host: oleg132-server facet_failover_host: oleg132-server Failover mds1 to oleg132-server mount facets: mds1 Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0000 10:06:00 (1773669960) targets are mounted 10:06:00 (1773669960) facet_failover done oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 16 (35s) == replay-single test 17: |X| open(O_CREAT), |replay| close ========================================================== 10:06:09 (1773669969) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1784 1269520 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1608 1269696 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1584 3605436 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1568 3605452 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3152 7210888 1% /mnt/lustre multiop /mnt/lustre/f17.replay-single vO_c TMPPIPE=/tmp/multiop_open_wait_pipe.8185 Failing mds1 on oleg132-server Stopping /mnt/lustre-mds1 (opts:) on oleg132-server 10:06:16 (1773669976) shut down facet: mds1 facet_host: oleg132-server facet_failover_host: oleg132-server Failover mds1 to oleg132-server mount facets: mds1 Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0000 10:06:35 (1773669995) targets are mounted 10:06:35 (1773669995) facet_failover done oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/f17.replay-single has type file OK PASS 17 (34s) == replay-single test 18: open(O_CREAT), unlink, touch new, close, touch, unlink ========================================================== 10:06:43 (1773670003) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1784 1269520 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1608 1269696 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1584 3605436 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1568 3605452 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3152 7210888 1% /mnt/lustre multiop /mnt/lustre/f18.replay-single vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.8185 pid: 52844 will close Failing mds1 on oleg132-server Stopping /mnt/lustre-mds1 (opts:) on oleg132-server 10:06:50 (1773670010) shut down facet: mds1 facet_host: oleg132-server facet_failover_host: oleg132-server Failover mds1 to oleg132-server mount facets: mds1 Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0000 10:07:07 (1773670027) targets are mounted 10:07:07 (1773670027) facet_failover done oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 18 (33s) == replay-single test 19: mcreate, open, write, rename === 10:07:16 (1773670036) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1784 1269520 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1608 1269696 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1584 3605436 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1568 3605452 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3152 7210888 1% /mnt/lustre old Failing mds1 on oleg132-server Stopping /mnt/lustre-mds1 (opts:) on oleg132-server 10:07:22 (1773670042) shut down facet: mds1 facet_host: oleg132-server facet_failover_host: oleg132-server Failover mds1 to oleg132-server mount facets: mds1 Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0000 10:07:40 (1773670060) targets are mounted 10:07:40 (1773670060) facet_failover done oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec old PASS 19 (32s) == replay-single test 20a: |X| open(O_CREAT), unlink, replay, close (test mds_cleanup_orphans) ========================================================== 10:07:48 (1773670068) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1784 1269520 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1608 1269696 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1584 3605436 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1572 3605448 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3156 7210884 1% /mnt/lustre multiop /mnt/lustre/f20a.replay-single vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.8185 Failing mds1 on oleg132-server Stopping /mnt/lustre-mds1 (opts:) on oleg132-server 10:07:54 (1773670074) shut down facet: mds1 facet_host: oleg132-server facet_failover_host: oleg132-server Failover mds1 to oleg132-server mount facets: mds1 Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0000 10:08:11 (1773670091) targets are mounted 10:08:11 (1773670091) facet_failover done oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 20a (31s) == replay-single test 20b: write, unlink, eviction, replay (test mds_cleanup_orphans) ========================================================== 10:08:20 (1773670100) /mnt/lustre/f20b.replay-single lmm_stripe_count: 1 lmm_stripe_size: 4194304 lmm_pattern: raid0 lmm_layout_gen: 0 lmm_stripe_offset: 0 obdidx objid objid group 0 1090 0x442 0x280000401 pdsh@oleg132-client: oleg132-client: ssh exited with exit code 5 dd: error writing '/mnt/lustre/f20b.replay-single': Cannot send after transport endpoint shutdown 5472+0 records in 5471+0 records out 22409216 bytes (22 MB, 21 MiB) copied, 1.8973 s, 11.8 MB/s Failing mds1 on oleg132-server Stopping /mnt/lustre-mds1 (opts:) on oleg132-server 10:08:27 (1773670107) shut down facet: mds1 facet_host: oleg132-server facet_failover_host: oleg132-server Failover mds1 to oleg132-server mount facets: mds1 Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0000 10:08:44 (1773670124) targets are mounted 10:08:44 (1773670124) facet_failover done oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec affected facets: mds1 oleg132-server: oleg132-server.virtnet: executing _wait_recovery_complete *.lustre-MDT0000.recovery_status 1475 oleg132-server: *.lustre-MDT0000.recovery_status status: COMPLETE Waiting for MDT destroys to complete before 3156, after 3156 PASS 20b (38s) == replay-single test 20c: check that client eviction does not affect file content ========================================================== 10:08:58 (1773670138) multiop /mnt/lustre/f20c.replay-single vOw_c TMPPIPE=/tmp/multiop_open_wait_pipe.8185 -rw-r--r-- 1 root root 1 Mar 16 10:08 /mnt/lustre/f20c.replay-single PASS 20c (5s) == replay-single test 21: |X| open(O_CREAT), unlink touch new, replay, close (test mds_cleanup_orphans) ========================================================== 10:09:03 (1773670143) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1784 1269520 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1608 1269696 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1584 3605408 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1572 3605448 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3156 7210856 1% /mnt/lustre multiop /mnt/lustre/f21.replay-single vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.8185 Failing mds1 on oleg132-server Stopping /mnt/lustre-mds1 (opts:) on oleg132-server 10:09:10 (1773670150) shut down facet: mds1 facet_host: oleg132-server facet_failover_host: oleg132-server Failover mds1 to oleg132-server mount facets: mds1 Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0000 10:09:28 (1773670168) targets are mounted 10:09:28 (1773670168) facet_failover done oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 21 (34s) == replay-single test 22: open(O_CREAT), |X| unlink, replay, close (test mds_cleanup_orphans) ========================================================== 10:09:37 (1773670177) multiop /mnt/lustre/f22.replay-single vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.8185 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1784 1269520 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1608 1269696 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1588 3605432 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1572 3605448 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3160 7210880 1% /mnt/lustre Failing mds1 on oleg132-server Stopping /mnt/lustre-mds1 (opts:) on oleg132-server 10:09:43 (1773670183) shut down facet: mds1 facet_host: oleg132-server facet_failover_host: oleg132-server Failover mds1 to oleg132-server mount facets: mds1 Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0000 10:10:01 (1773670201) targets are mounted 10:10:01 (1773670201) facet_failover done oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 22 (33s) == replay-single test 23: open(O_CREAT), |X| unlink touch new, replay, close (test mds_cleanup_orphans) ========================================================== 10:10:10 (1773670210) multiop /mnt/lustre/f23.replay-single vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.8185 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1784 1269520 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1608 1269696 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1588 3605432 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1572 3605448 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3160 7210880 1% /mnt/lustre Failing mds1 on oleg132-server Stopping /mnt/lustre-mds1 (opts:) on oleg132-server 10:10:16 (1773670216) shut down facet: mds1 facet_host: oleg132-server facet_failover_host: oleg132-server Failover mds1 to oleg132-server mount facets: mds1 Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0000 10:10:34 (1773670234) targets are mounted 10:10:34 (1773670234) facet_failover done oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 23 (33s) == replay-single test 24: open(O_CREAT), replay, unlink, close (test mds_cleanup_orphans) ========================================================== 10:10:43 (1773670243) multiop /mnt/lustre/f24.replay-single vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.8185 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1784 1269520 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1608 1269696 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1588 3605432 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1572 3605448 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3160 7210880 1% /mnt/lustre Failing mds1 on oleg132-server Stopping /mnt/lustre-mds1 (opts:) on oleg132-server 10:10:50 (1773670250) shut down facet: mds1 facet_host: oleg132-server facet_failover_host: oleg132-server Failover mds1 to oleg132-server mount facets: mds1 Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0000 10:11:08 (1773670268) targets are mounted 10:11:08 (1773670268) facet_failover done oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 24 (34s) == replay-single test 25: open(O_CREAT), unlink, replay, close (test mds_cleanup_orphans) ========================================================== 10:11:17 (1773670277) multiop /mnt/lustre/f25.replay-single vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.8185 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1784 1269520 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1608 1269696 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1588 3605432 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1572 3605448 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3160 7210880 1% /mnt/lustre Failing mds1 on oleg132-server Stopping /mnt/lustre-mds1 (opts:) on oleg132-server 10:11:23 (1773670283) shut down facet: mds1 facet_host: oleg132-server facet_failover_host: oleg132-server Failover mds1 to oleg132-server mount facets: mds1 Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0000 10:11:40 (1773670300) targets are mounted 10:11:40 (1773670300) facet_failover done oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 25 (32s) == replay-single test 26: |X| open(O_CREAT), unlink two, close one, replay, close one (test mds_cleanup_orphans) ========================================================== 10:11:49 (1773670309) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1784 1269520 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1608 1269696 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1588 3605432 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1572 3605448 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3160 7210880 1% /mnt/lustre multiop /mnt/lustre/f26.replay-single-1 vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.8185 multiop /mnt/lustre/f26.replay-single-2 vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.8185 Failing mds1 on oleg132-server Stopping /mnt/lustre-mds1 (opts:) on oleg132-server 10:11:54 (1773670314) shut down facet: mds1 facet_host: oleg132-server facet_failover_host: oleg132-server Failover mds1 to oleg132-server mount facets: mds1 Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0000 10:12:12 (1773670332) targets are mounted 10:12:12 (1773670332) facet_failover done oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 26 (32s) == replay-single test 27: |X| open(O_CREAT), unlink two, replay, close two (test mds_cleanup_orphans) ========================================================== 10:12:21 (1773670341) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1784 1269520 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1608 1269696 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1588 3605432 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1572 3605448 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3160 7210880 1% /mnt/lustre multiop /mnt/lustre/f27.replay-single-1 vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.8185 multiop /mnt/lustre/f27.replay-single-2 vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.8185 Failing mds1 on oleg132-server Stopping /mnt/lustre-mds1 (opts:) on oleg132-server 10:12:27 (1773670347) shut down facet: mds1 facet_host: oleg132-server facet_failover_host: oleg132-server Failover mds1 to oleg132-server mount facets: mds1 Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0000 10:12:45 (1773670365) targets are mounted 10:12:45 (1773670365) facet_failover done oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 27 (32s) == replay-single test 28: open(O_CREAT), |X| unlink two, close one, replay, close one (test mds_cleanup_orphans) ========================================================== 10:12:53 (1773670373) multiop /mnt/lustre/f28.replay-single-1 vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.8185 multiop /mnt/lustre/f28.replay-single-2 vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.8185 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1784 1269520 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1608 1269696 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1588 3605432 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1572 3605448 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3160 7210880 1% /mnt/lustre Failing mds1 on oleg132-server Stopping /mnt/lustre-mds1 (opts:) on oleg132-server 10:12:59 (1773670379) shut down facet: mds1 facet_host: oleg132-server facet_failover_host: oleg132-server Failover mds1 to oleg132-server mount facets: mds1 Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0000 10:13:16 (1773670396) targets are mounted 10:13:16 (1773670396) facet_failover done oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 28 (31s) == replay-single test 29: open(O_CREAT), |X| unlink two, replay, close two (test mds_cleanup_orphans) ========================================================== 10:13:25 (1773670405) multiop /mnt/lustre/f29.replay-single-1 vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.8185 multiop /mnt/lustre/f29.replay-single-2 vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.8185 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1784 1269520 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1608 1269696 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1588 3605432 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1572 3605448 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3160 7210880 1% /mnt/lustre Failing mds1 on oleg132-server Stopping /mnt/lustre-mds1 (opts:) on oleg132-server 10:13:30 (1773670410) shut down facet: mds1 facet_host: oleg132-server facet_failover_host: oleg132-server Failover mds1 to oleg132-server mount facets: mds1 Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0000 10:13:48 (1773670428) targets are mounted 10:13:48 (1773670428) facet_failover done oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 29 (32s) == replay-single test 30: open(O_CREAT) two, unlink two, replay, close two (test mds_cleanup_orphans) ========================================================== 10:13:56 (1773670436) multiop /mnt/lustre/f30.replay-single-1 vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.8185 multiop /mnt/lustre/f30.replay-single-2 vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.8185 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1784 1269520 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1608 1269696 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1588 3605432 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1572 3605448 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3160 7210880 1% /mnt/lustre Failing mds1 on oleg132-server Stopping /mnt/lustre-mds1 (opts:) on oleg132-server 10:14:02 (1773670442) shut down facet: mds1 facet_host: oleg132-server facet_failover_host: oleg132-server Failover mds1 to oleg132-server mount facets: mds1 Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0000 10:14:19 (1773670459) targets are mounted 10:14:19 (1773670459) facet_failover done oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 30 (32s) == replay-single test 31: open(O_CREAT) two, unlink one, |X| unlink one, close two (test mds_cleanup_orphans) ========================================================== 10:14:28 (1773670468) multiop /mnt/lustre/f31.replay-single-1 vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.8185 multiop /mnt/lustre/f31.replay-single-2 vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.8185 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1784 1269520 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1608 1269696 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1588 3605432 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1572 3605448 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3160 7210880 1% /mnt/lustre Failing mds1 on oleg132-server Stopping /mnt/lustre-mds1 (opts:) on oleg132-server 10:14:35 (1773670475) shut down facet: mds1 facet_host: oleg132-server facet_failover_host: oleg132-server Failover mds1 to oleg132-server mount facets: mds1 Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0000 10:14:52 (1773670492) targets are mounted 10:14:52 (1773670492) facet_failover done oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 31 (33s) == replay-single test 32: close() notices client eviction; close() after client eviction ========================================================== 10:15:01 (1773670501) multiop /mnt/lustre/f32.replay-single vO_c TMPPIPE=/tmp/multiop_open_wait_pipe.8185 multiop /mnt/lustre/f32.replay-single vO_c TMPPIPE=/tmp/multiop_open_wait_pipe.8185 pdsh@oleg132-client: oleg132-client: ssh exited with exit code 5 PASS 32 (6s) == replay-single test 33a: fid seq shouldn't be reused after abort recovery ========================================================== 10:15:07 (1773670507) total: 10 open/close in 0.08 seconds: 119.77 ops/second Replay barrier on lustre-MDT0000 Stopping /mnt/lustre-mds1 (opts:) on oleg132-server Failover mds1 to oleg132-server oleg132-server.virtnet Start mds1: mount -t lustre -o localrecov -o abort_recovery /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0000 total: 10 open/close in 0.10 seconds: 104.35 ops/second PASS 33a (25s) == replay-single test 33b: test fid seq allocation ======= 10:15:32 (1773670532) fail_loc=0x1311 total: 10 open/close in 0.08 seconds: 119.50 ops/second Replay barrier on lustre-MDT0000 Stopping /mnt/lustre-mds1 (opts:) on oleg132-server Failover mds1 to oleg132-server oleg132-server.virtnet Start mds1: mount -t lustre -o localrecov -o abort_recovery /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0000 total: 10 open/close in 0.09 seconds: 112.91 ops/second PASS 33b (25s) == replay-single test 34: abort recovery before client does replay (test mds_cleanup_orphans) ========================================================== 10:15:58 (1773670558) multiop /mnt/lustre/f34.replay-single vO_c TMPPIPE=/tmp/multiop_open_wait_pipe.8185 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1816 1269488 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1640 1269664 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1588 3605432 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1572 3605448 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3160 7210880 1% /mnt/lustre Stopping /mnt/lustre-mds1 (opts:) on oleg132-server Failover mds1 to oleg132-server oleg132-server.virtnet Start mds1: mount -t lustre -o localrecov -o abort_recovery /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0000 PASS 34 (26s) == replay-single test 35: test recovery from llog for unlink op ========================================================== 10:16:24 (1773670584) fail_loc=0x80000119 Stopping /mnt/lustre-mds1 (opts:) on oleg132-server Failover mds1 to oleg132-server oleg132-server.virtnet Start mds1: mount -t lustre -o localrecov -o abort_recovery /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0000 rm: cannot remove '/mnt/lustre/f35.replay-single': Input/output error oleg132-client: error: invalid path '/mnt/lustre': Input/output error pdsh@oleg132-client: oleg132-client: ssh exited with exit code 5 first stat failed: 5 Can't lstat /mnt/lustre/f35.replay-single: No such file or directory PASS 35 (22s) SKIP: replay-single test_36 skipping ALWAYS excluded test 36 == replay-single test 37: abort recovery before client does replay (test mds_cleanup_orphans for directories) ========================================================== 10:16:47 (1773670607) multiop /mnt/lustre/d37.replay-single/f37.replay-single vdD_c TMPPIPE=/tmp/multiop_open_wait_pipe.8185 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 1888 1269416 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1704 1269600 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1588 3605432 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1572 3605448 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3160 7210880 1% /mnt/lustre Stopping /mnt/lustre-mds1 (opts:) on oleg132-server Failover mds1 to oleg132-server oleg132-server.virtnet Start mds1: mount -t lustre -o localrecov -o abort_recovery /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0000 oleg132-client: error: invalid path '/mnt/lustre': Input/output error pdsh@oleg132-client: oleg132-client: ssh exited with exit code 5 first stat failed: 5 PASS 37 (26s) == replay-single test 38: test recovery from unlink llog (test llog_gen_rec) ========================================================== 10:17:13 (1773670633) total: 800 open/close in 5.82 seconds: 137.57 ops/second - unlinked 0 (time 1773670642 ; total 0 ; last 0) total: 400 unlinks in 2 seconds: 200.000000 unlinks/second UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2028 1269276 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1736 1269568 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1588 3605432 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1572 3605448 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3160 7210880 1% /mnt/lustre Failing mds1 on oleg132-server Stopping /mnt/lustre-mds1 (opts:) on oleg132-server 10:17:35 (1773670655) shut down facet: mds1 facet_host: oleg132-server facet_failover_host: oleg132-server Failover mds1 to oleg132-server mount facets: mds1 Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0000 10:17:53 (1773670673) targets are mounted 10:17:53 (1773670673) facet_failover done oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec - unlinked 0 (time 1773670681 ; total 0 ; last 0) total: 400 unlinks in 2 seconds: 200.000000 unlinks/second Can't lstat /mnt/lustre/f38.replay-single-*: No such file or directory PASS 38 (55s) == replay-single test 39: test recovery from unlink llog (test llog_gen_rec) ========================================================== 10:18:08 (1773670688) total: 800 open/close in 6.05 seconds: 132.25 ops/second UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2032 1269272 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1736 1269568 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1588 3605432 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1572 3605448 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3160 7210880 1% /mnt/lustre - unlinked 0 (time 1773670701 ; total 0 ; last 0) total: 400 unlinks in 2 seconds: 200.000000 unlinks/second Failing mds1 on oleg132-server Stopping /mnt/lustre-mds1 (opts:) on oleg132-server 10:18:25 (1773670705) shut down facet: mds1 facet_host: oleg132-server facet_failover_host: oleg132-server Failover mds1 to oleg132-server mount facets: mds1 Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0000 10:18:43 (1773670723) targets are mounted 10:18:43 (1773670723) facet_failover done oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec - unlinked 0 (time 1773670732 ; total 0 ; last 0) total: 400 unlinks in 3 seconds: 133.333328 unlinks/second Can't lstat /mnt/lustre/f39.replay-single-*: No such file or directory PASS 39 (52s) == replay-single test 41: read from a valid osc while other oscs are invalid ========================================================== 10:19:00 (1773670740) 1+0 records in 1+0 records out 4096 bytes (4.1 kB, 4.0 KiB) copied, 0.00236286 s, 1.7 MB/s 1+0 records in 1+0 records out 4096 bytes (4.1 kB, 4.0 KiB) copied, 0.0128321 s, 319 kB/s PASS 41 (4s) == replay-single test 42: recovery after ost failure ===== 10:19:04 (1773670744) total: 800 open/close in 6.87 seconds: 116.41 ops/second UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2064 1269240 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1736 1269568 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1592 3605428 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1572 3605448 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3164 7210876 1% /mnt/lustre - unlinked 0 (time 1773670758 ; total 0 ; last 0) total: 400 unlinks in 3 seconds: 133.333328 unlinks/second debug=-1 Failing ost1 on oleg132-server Stopping /mnt/lustre-ost1 (opts:) on oleg132-server 10:19:24 (1773670764) shut down facet: ost1 facet_host: oleg132-server facet_failover_host: oleg132-server Failover ost1 to oleg132-server mount facets: ost1 Start ost1: mount -t lustre -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-OST0000 10:19:42 (1773670782) targets are mounted 10:19:42 (1773670782) facet_failover done wait for MDS to timeout and recover - unlinked 0 (time 1773670825 ; total 0 ; last 0) total: 400 unlinks in 2 seconds: 200.000000 unlinks/second Can't lstat /mnt/lustre/f42.replay-single-*: No such file or directory PASS 42 (87s) == replay-single test 43: mds osc import failure during recovery; don't LBUG ========================================================== 10:20:31 (1773670831) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2096 1269208 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1736 1269568 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1592 3605428 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1572 3605448 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3164 7210876 1% /mnt/lustre fail_loc=0x80000204 Failing mds1 on oleg132-server Stopping /mnt/lustre-mds1 (opts:) on oleg132-server 10:20:38 (1773670838) shut down facet: mds1 facet_host: oleg132-server facet_failover_host: oleg132-server Failover mds1 to oleg132-server mount facets: mds1 Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0000 10:20:56 (1773670856) targets are mounted 10:20:56 (1773670856) facet_failover done oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 43 (44s) == replay-single test 44a: race in target handle connect ========================================================== 10:21:15 (1773670875) at_max=40 1 of 10 (1773670878) service : cur 5 worst 5 (at 1773668263, 2615s ago) 4 4 1 4 fail_loc=0x80000701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2000 1269304 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1736 1269568 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1592 3605428 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1572 3605448 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3164 7210876 1% /mnt/lustre 2 of 10 (1773670884) service : cur 5 worst 5 (at 1773668263, 2621s ago) 5 4 1 4 fail_loc=0x80000701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2000 1269304 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1736 1269568 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1592 3605428 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1572 3605448 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3164 7210876 1% /mnt/lustre 3 of 10 (1773670890) service : cur 6 worst 6 (at 1773670890, 0s ago) 6 4 1 4 fail_loc=0x80000701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2000 1269304 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1736 1269568 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1592 3605428 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1572 3605448 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3164 7210876 1% /mnt/lustre 4 of 10 (1773670896) service : cur 6 worst 6 (at 1773670890, 6s ago) 6 4 1 4 fail_loc=0x80000701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2000 1269304 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1736 1269568 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1592 3605428 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1572 3605448 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3164 7210876 1% /mnt/lustre 5 of 10 (1773670902) service : cur 6 worst 6 (at 1773670890, 12s ago) 6 4 1 4 fail_loc=0x80000701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2000 1269304 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1736 1269568 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1592 3605428 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1572 3605448 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3164 7210876 1% /mnt/lustre 6 of 10 (1773670908) service : cur 6 worst 6 (at 1773670890, 18s ago) 6 4 1 4 fail_loc=0x80000701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2000 1269304 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1736 1269568 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1592 3605428 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1572 3605448 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3164 7210876 1% /mnt/lustre 7 of 10 (1773670914) service : cur 6 worst 6 (at 1773670890, 25s ago) 6 4 1 4 fail_loc=0x80000701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2000 1269304 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1736 1269568 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1592 3605428 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1572 3605448 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3164 7210876 1% /mnt/lustre 8 of 10 (1773670920) service : cur 6 worst 6 (at 1773670890, 31s ago) 6 4 1 4 fail_loc=0x80000701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2000 1269304 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1736 1269568 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1592 3605428 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1572 3605448 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3164 7210876 1% /mnt/lustre 9 of 10 (1773670927) service : cur 6 worst 6 (at 1773670890, 37s ago) 6 4 1 4 fail_loc=0x80000701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2000 1269304 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1736 1269568 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1592 3605428 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1572 3605448 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3164 7210876 1% /mnt/lustre 10 of 10 (1773670933) service : cur 6 worst 6 (at 1773670890, 43s ago) 6 4 1 4 fail_loc=0x80000701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2000 1269304 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1736 1269568 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1592 3605428 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1572 3605448 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3164 7210876 1% /mnt/lustre fail_loc=0 at_max=600 PASS 44a (67s) == replay-single test 44b: race in target handle connect ========================================================== 10:22:22 (1773670942) 1 of 10 (1773670943) service : cur 6 worst 6 (at 1773670890, 53s ago) 6 4 1 4 fail_loc=0x80000704 error: recover: Connection timed out Filesystem 1K-blocks Used Available Use% Mounted on 192.168.201.132@tcp:/lustre 7666232 3164 7210876 1% /mnt/lustre 2 of 10 (1773670965) service : cur 6 worst 6 (at 1773670890, 75s ago) 1 6 4 1 fail_loc=0x80000704 error: recover: Connection timed out Filesystem 1K-blocks Used Available Use% Mounted on 192.168.201.132@tcp:/lustre 7666232 3164 7210876 1% /mnt/lustre 3 of 10 (1773670986) service : cur 40 worst 40 (at 1773670984, 3s ago) 40 6 4 1 fail_loc=0x80000704 error: recover: Connection timed out Filesystem 1K-blocks Used Available Use% Mounted on 192.168.201.132@tcp:/lustre 7666232 3164 7210876 1% /mnt/lustre 4 of 10 (1773671008) service : cur 40 worst 40 (at 1773670984, 24s ago) 40 6 4 1 fail_loc=0x80000704 error: recover: Connection timed out Filesystem 1K-blocks Used Available Use% Mounted on 192.168.201.132@tcp:/lustre 7666232 3164 7210876 1% /mnt/lustre 5 of 10 (1773671029) service : cur 40 worst 40 (at 1773670984, 45s ago) 40 6 4 1 fail_loc=0x80000704 error: recover: Connection timed out Filesystem 1K-blocks Used Available Use% Mounted on 192.168.201.132@tcp:/lustre 7666232 3164 7210876 1% /mnt/lustre 6 of 10 (1773671050) service : cur 40 worst 40 (at 1773670984, 67s ago) 40 6 4 1 fail_loc=0x80000704 error: recover: Connection timed out Filesystem 1K-blocks Used Available Use% Mounted on 192.168.201.132@tcp:/lustre 7666232 3164 7210876 1% /mnt/lustre 7 of 10 (1773671072) service : cur 40 worst 40 (at 1773670984, 88s ago) 40 6 4 1 fail_loc=0x80000704 error: recover: Connection timed out Filesystem 1K-blocks Used Available Use% Mounted on 192.168.201.132@tcp:/lustre 7666232 3164 7210876 1% /mnt/lustre 8 of 10 (1773671093) service : cur 40 worst 40 (at 1773670984, 110s ago) 40 6 4 1 fail_loc=0x80000704 error: recover: Connection timed out Filesystem 1K-blocks Used Available Use% Mounted on 192.168.201.132@tcp:/lustre 7666232 3164 7210876 1% /mnt/lustre 9 of 10 (1773671115) service : cur 40 worst 40 (at 1773670984, 131s ago) 40 40 6 4 fail_loc=0x80000704 error: recover: Connection timed out Filesystem 1K-blocks Used Available Use% Mounted on 192.168.201.132@tcp:/lustre 7666232 3164 7210876 1% /mnt/lustre 10 of 10 (1773671136) service : cur 40 worst 40 (at 1773670984, 153s ago) 40 40 6 4 fail_loc=0x80000704 error: recover: Connection timed out Filesystem 1K-blocks Used Available Use% Mounted on 192.168.201.132@tcp:/lustre 7666232 3164 7210876 1% /mnt/lustre PASS 44b (217s) == replay-single test 44c: race in target handle connect ========================================================== 10:25:59 (1773671159) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2000 1269304 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1736 1269568 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1592 3605428 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1572 3605448 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3164 7210876 1% /mnt/lustre total: 100 create in 0.43 seconds: 229.97 ops/second fail_loc=0x80000712 Stopping /mnt/lustre-mds1 (opts:) on oleg132-server Failover mds1 to oleg132-server oleg132-server.virtnet Start mds1: mount -t lustre -o localrecov -o abort_recovery /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0000 oleg132-client: error: invalid path '/mnt/lustre': Input/output error pdsh@oleg132-client: oleg132-client: ssh exited with exit code 5 first stat failed: 5 unlink(/mnt/lustre/f44c.replay-single-0) error: No such file or directory total: 0 unlinks in 0 seconds: -nan unlinks/second Failing mds1 on oleg132-server Stopping /mnt/lustre-mds1 (opts:) on oleg132-server 10:26:25 (1773671185) shut down facet: mds1 facet_host: oleg132-server facet_failover_host: oleg132-server Failover mds1 to oleg132-server mount facets: mds1 Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0000 10:26:40 (1773671200) targets are mounted 10:26:40 (1773671200) facet_failover done oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec unlink(/mnt/lustre/f44c.replay-single-0) error: No such file or directory total: 0 unlinks in 0 seconds: -nan unlinks/second PASS 44c (50s) == replay-single test 45: Handle failed close ============ 10:26:49 (1773671209) multiop /mnt/lustre/f45.replay-single vO_c TMPPIPE=/tmp/multiop_open_wait_pipe.8185 /mnt/lustre/f45.replay-single has type file OK PASS 45 (4s) == replay-single test 46: Don't leak file handle after open resend (3325) ========================================================== 10:26:53 (1773671213) fail_loc=0x122 fail_loc=0 Failing mds1 on oleg132-server Stopping /mnt/lustre-mds1 (opts:) on oleg132-server 10:27:13 (1773671233) shut down facet: mds1 facet_host: oleg132-server facet_failover_host: oleg132-server Failover mds1 to oleg132-server mount facets: mds1 Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0000 10:27:28 (1773671248) targets are mounted 10:27:28 (1773671248) facet_failover done oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec lfs path2fid: cannot get fid for 'f46.replay-single': No such file or directory PASS 46 (44s) == replay-single test 47: MDS->OSC failure during precreate cleanup (2824) ========================================================== 10:27:37 (1773671257) total: 20 open/close in 0.20 seconds: 102.41 ops/second Failing ost1 on oleg132-server Stopping /mnt/lustre-ost1 (opts:) on oleg132-server 10:27:40 (1773671260) shut down facet: ost1 facet_host: oleg132-server facet_failover_host: oleg132-server Failover ost1 to oleg132-server mount facets: ost1 Start ost1: mount -t lustre -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-OST0000 10:27:56 (1773671276) targets are mounted 10:27:56 (1773671276) facet_failover done oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec fail_loc=0x80000204 total: 20 open/close in 0.09 seconds: 213.98 ops/second - unlinked 0 (time 1773671343 ; total 0 ; last 0) total: 20 unlinks in 0 seconds: inf unlinks/second PASS 47 (88s) == replay-single test 48: MDS->OSC failure during precreate cleanup (2824) ========================================================== 10:29:05 (1773671345) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2032 1269272 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1768 1269536 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1592 3605428 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1572 3605448 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3164 7210876 1% /mnt/lustre total: 20 open/close in 0.16 seconds: 128.24 ops/second Failing mds1 on oleg132-server Stopping /mnt/lustre-mds1 (opts:) on oleg132-server 10:29:12 (1773671352) shut down facet: mds1 facet_host: oleg132-server facet_failover_host: oleg132-server Failover mds1 to oleg132-server mount facets: mds1 Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0000 10:29:29 (1773671369) targets are mounted 10:29:29 (1773671369) facet_failover done fail_loc=0x80000216 total: 20 open/close in 0.39 seconds: 50.84 ops/second - unlinked 0 (time 1773671432 ; total 0 ; last 0) total: 40 unlinks in 1 seconds: 40.000000 unlinks/second PASS 48 (93s) == replay-single test 50: Double OSC recovery, don't LASSERT (3812) ========================================================== 10:30:38 (1773671438) PASS 50 (13s) == replay-single test 52: time out lock replay (3764) ==== 10:30:51 (1773671451) multiop /mnt/lustre/f52.replay-single vs_s TMPPIPE=/tmp/multiop_open_wait_pipe.8185 fail_loc=0x80000157 Failing mds1 on oleg132-server Stopping /mnt/lustre-mds1 (opts:) on oleg132-server 10:30:58 (1773671458) shut down facet: mds1 facet_host: oleg132-server facet_failover_host: oleg132-server Failover mds1 to oleg132-server mount facets: mds1 Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0000 10:31:16 (1773671476) targets are mounted 10:31:16 (1773671476) facet_failover done oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec fail_loc=0x0 fail_loc=0x0 PASS 52 (91s) == replay-single test 53a: |X| close request while two MDC requests in flight ========================================================== 10:32:22 (1773671542) fail_loc=0x80000115 fail_loc=0 Replay barrier on lustre-MDT0000 Failing mds1 on oleg132-server Stopping /mnt/lustre-mds1 (opts:) on oleg132-server 10:32:32 (1773671552) shut down facet: mds1 facet_host: oleg132-server facet_failover_host: oleg132-server Failover mds1 to oleg132-server mount facets: mds1 Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0000 10:32:50 (1773671570) targets are mounted 10:32:50 (1773671570) facet_failover done oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/d53a.replay-single-1/f has type file OK /mnt/lustre/d53a.replay-single-2/f has type file OK PASS 53a (37s) == replay-single test 53b: |X| open request while two MDC requests in flight ========================================================== 10:32:59 (1773671579) multiop /mnt/lustre/d53b.replay-single-1/f vO_c TMPPIPE=/tmp/multiop_open_wait_pipe.8185 fail_loc=0x80000107 fail_loc=0 Replay barrier on lustre-MDT0000 Failing mds1 on oleg132-server Stopping /mnt/lustre-mds1 (opts:) on oleg132-server 10:33:08 (1773671588) shut down facet: mds1 facet_host: oleg132-server facet_failover_host: oleg132-server Failover mds1 to oleg132-server mount facets: mds1 Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0000 10:33:26 (1773671606) targets are mounted 10:33:26 (1773671606) facet_failover done oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/d53b.replay-single-1/f has type file OK /mnt/lustre/d53b.replay-single-2/f has type file OK PASS 53b (37s) == replay-single test 53c: |X| open request and close request while two MDC requests in flight ========================================================== 10:33:36 (1773671616) fail_loc=0x80000107 fail_loc=0x80000115 Replay barrier on lustre-MDT0000 Failing mds1 on oleg132-server Stopping /mnt/lustre-mds1 (opts:) on oleg132-server 10:33:45 (1773671625) shut down facet: mds1 facet_host: oleg132-server facet_failover_host: oleg132-server Failover mds1 to oleg132-server mount facets: mds1 Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0000 10:34:13 (1773671653) targets are mounted 10:34:13 (1773671653) facet_failover done fail_loc=0 /mnt/lustre/d53c.replay-single-1/f has type file OK /mnt/lustre/d53c.replay-single-2/f has type file OK PASS 53c (44s) == replay-single test 53d: close reply while two MDC requests in flight ========================================================== 10:34:20 (1773671660) fail_loc=0x8000013b fail_loc=0 Failing mds1 on oleg132-server Stopping /mnt/lustre-mds1 (opts:) on oleg132-server 10:34:24 (1773671664) shut down facet: mds1 facet_host: oleg132-server facet_failover_host: oleg132-server Failover mds1 to oleg132-server mount facets: mds1 Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0000 10:34:39 (1773671679) targets are mounted 10:34:39 (1773671679) facet_failover done oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/d53d.replay-single-1/f has type file OK /mnt/lustre/d53d.replay-single-2/f has type file OK PASS 53d (28s) == replay-single test 53e: |X| open reply while two MDC requests in flight ========================================================== 10:34:48 (1773671688) fail_loc=0x119 fail_loc=0 Replay barrier on lustre-MDT0000 Failing mds1 on oleg132-server Stopping /mnt/lustre-mds1 (opts:) on oleg132-server 10:34:54 (1773671694) shut down facet: mds1 facet_host: oleg132-server facet_failover_host: oleg132-server Failover mds1 to oleg132-server mount facets: mds1 Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0000 10:35:11 (1773671711) targets are mounted 10:35:11 (1773671711) facet_failover done oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec /mnt/lustre/d53e.replay-single-1/f has type file OK /mnt/lustre/d53e.replay-single-2/f has type file OK PASS 53e (32s) == replay-single test 53f: |X| open reply and close reply while two MDC requests in flight ========================================================== 10:35:20 (1773671720) fail_loc=0x119 fail_loc=0x8000013b Replay barrier on lustre-MDT0000 Failing mds1 on oleg132-server Stopping /mnt/lustre-mds1 (opts:) on oleg132-server 10:35:26 (1773671726) shut down facet: mds1 facet_host: oleg132-server facet_failover_host: oleg132-server Failover mds1 to oleg132-server mount facets: mds1 Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0000 10:35:43 (1773671743) targets are mounted 10:35:43 (1773671743) facet_failover done fail_loc=0 /mnt/lustre/d53f.replay-single-1/f has type file OK /mnt/lustre/d53f.replay-single-2/f has type file OK PASS 53f (30s) == replay-single test 53g: |X| drop open reply and close request while close and open are both in flight ========================================================== 10:35:50 (1773671750) fail_loc=0x119 fail_loc=0x80000115 fail_loc=0 Replay barrier on lustre-MDT0000 Failing mds1 on oleg132-server Stopping /mnt/lustre-mds1 (opts:) on oleg132-server 10:35:57 (1773671757) shut down facet: mds1 facet_host: oleg132-server facet_failover_host: oleg132-server Failover mds1 to oleg132-server mount facets: mds1 Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0000 10:36:14 (1773671774) targets are mounted 10:36:14 (1773671774) facet_failover done /mnt/lustre/d53g.replay-single-1/f has type file OK /mnt/lustre/d53g.replay-single-2/f has type file OK PASS 53g (31s) == replay-single test 53h: open request and close reply while two MDC requests in flight ========================================================== 10:36:21 (1773671781) fail_loc=0x80000107 fail_loc=0x8000013b Replay barrier on lustre-MDT0000 Failing mds1 on oleg132-server Stopping /mnt/lustre-mds1 (opts:) on oleg132-server 10:36:28 (1773671788) shut down facet: mds1 facet_host: oleg132-server facet_failover_host: oleg132-server Failover mds1 to oleg132-server mount facets: mds1 Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0000 10:36:45 (1773671805) targets are mounted 10:36:45 (1773671805) facet_failover done fail_loc=0 /mnt/lustre/d53h.replay-single-1/f has type file OK /mnt/lustre/d53h.replay-single-2/f has type file OK PASS 53h (31s) == replay-single test 55: let MDS_CHECK_RESENT return the original return code instead of 0 ========================================================== 10:36:52 (1773671812) fail_loc=0x8000012b fail_loc=0x0 rm: cannot remove '/mnt/lustre/f55.replay-single': No such file or directory touch: cannot touch '/mnt/lustre/f55.replay-single': No such file or directory PASS 55 (63s) == replay-single test 56: don't replay a symlink open request (3440) ========================================================== 10:37:55 (1773671875) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2064 1269240 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1768 1269536 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1592 3605428 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1572 3605448 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3164 7210876 1% /mnt/lustre Failing mds1 on oleg132-server Stopping /mnt/lustre-mds1 (opts:) on oleg132-server 10:38:00 (1773671880) shut down facet: mds1 facet_host: oleg132-server facet_failover_host: oleg132-server Failover mds1 to oleg132-server mount facets: mds1 Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0000 10:38:16 (1773671896) targets are mounted 10:38:16 (1773671896) facet_failover done oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 56 (40s) == replay-single test 57: test recovery from llog for setattr op ========================================================== 10:38:35 (1773671915) fail_loc=0x8000012c UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2040 1269264 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1768 1269536 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1592 3605428 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1572 3605448 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3164 7210876 1% /mnt/lustre Failing mds1 on oleg132-server Stopping /mnt/lustre-mds1 (opts:) on oleg132-server 10:38:41 (1773671921) shut down facet: mds1 facet_host: oleg132-server facet_failover_host: oleg132-server Failover mds1 to oleg132-server mount facets: mds1 Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0000 10:38:57 (1773671937) targets are mounted 10:38:57 (1773671937) facet_failover done oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec affected facets: mds1 oleg132-server: oleg132-server.virtnet: executing _wait_recovery_complete *.lustre-MDT0000.recovery_status 1475 oleg132-server: *.lustre-MDT0000.recovery_status status: COMPLETE Waiting for orphan cleanup... osp.lustre-OST0000-osc-MDT0000.old_sync_processed osp.lustre-OST0000-osc-MDT0001.old_sync_processed osp.lustre-OST0001-osc-MDT0000.old_sync_processed osp.lustre-OST0001-osc-MDT0001.old_sync_processed wait 40 secs maximumly for oleg132-server mds-ost sync done. /mnt/lustre/f57.replay-single has type file OK fail_loc=0x0 PASS 57 (35s) == replay-single test 58a: test recovery from llog for setattr op (test llog_gen_rec) ========================================================== 10:39:10 (1773671950) fail_loc=0x8000012c total: 2500 open/close in 8.55 seconds: 292.34 ops/second UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2336 1268968 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1768 1269536 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1592 3605428 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1572 3605448 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3164 7210876 1% /mnt/lustre Failing mds1 on oleg132-server Stopping /mnt/lustre-mds1 (opts:) on oleg132-server 10:39:26 (1773671966) shut down facet: mds1 facet_host: oleg132-server facet_failover_host: oleg132-server Failover mds1 to oleg132-server mount facets: mds1 Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0000 10:39:58 (1773671998) targets are mounted 10:39:58 (1773671998) facet_failover done oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec fail_loc=0x0 - unlinked 0 (time 1773672032 ; total 0 ; last 0) total: 2500 unlinks in 78 seconds: 32.051281 unlinks/second PASS 58a (175s) == replay-single test 58b: test replay of setxattr op ==== 10:42:05 (1773672125) Starting client: oleg132-client.virtnet: -o user_xattr,flock 192.168.201.132@tcp:/lustre /mnt/lustre2 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2328 1268976 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1768 1269536 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1592 3605428 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1572 3605448 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3164 7210876 1% /mnt/lustre Failing mds1 on oleg132-server Stopping /mnt/lustre-mds1 (opts:) on oleg132-server 10:42:34 (1773672154) shut down facet: mds1 facet_host: oleg132-server facet_failover_host: oleg132-server Failover mds1 to oleg132-server mount facets: mds1 Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0000 10:43:22 (1773672202) targets are mounted 10:43:22 (1773672202) facet_failover done oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Stopping client oleg132-client.virtnet /mnt/lustre2 (opts:) oleg132-client.virtnet: executing wait_import_state_mount FULL mgc.*.mgs_server_uuid mgc.*.mgs_server_uuid in FULL state after 0 sec PASS 58b (111s) == replay-single test 58c: resend/reconstruct setxattr op ========================================================== 10:43:56 (1773672236) Starting client: oleg132-client.virtnet: -o user_xattr,flock 192.168.201.132@tcp:/lustre /mnt/lustre2 fail_val=0 fail_loc=0x123 fail_loc=0 fail_loc=0x119 fail_loc=0 Stopping client oleg132-client.virtnet /mnt/lustre2 (opts:) PASS 58c (149s) SKIP: replay-single test_59 skipping ALWAYS excluded test 59 == replay-single test 60: test llog post recovery init vs llog unlink ========================================================== 10:46:27 (1773672387) total: 200 open/close in 4.81 seconds: 41.59 ops/second UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2176 1269128 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1768 1269536 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1592 3605428 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1572 3605448 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3164 7210876 1% /mnt/lustre - unlinked 0 (time 1773672413 ; total 0 ; last 0) total: 100 unlinks in 1 seconds: 100.000000 unlinks/second Failing mds1 on oleg132-server Stopping /mnt/lustre-mds1 (opts:) on oleg132-server 10:47:02 (1773672422) shut down facet: mds1 facet_host: oleg132-server facet_failover_host: oleg132-server Failover mds1 to oleg132-server mount facets: mds1 Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0000 10:47:51 (1773672471) targets are mounted 10:47:51 (1773672471) facet_failover done oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec - unlinked 0 (time 1773672485 ; total 0 ; last 0) total: 100 unlinks in 2 seconds: 50.000000 unlinks/second PASS 60 (109s) == replay-single test 61a: test race llog recovery vs llog cleanup ========================================================== 10:48:17 (1773672497) - open/close 417 (time 1773672514.04 total 10.02 last 41.64) total: 800 open/close in 19.37 seconds: 41.31 ops/second UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2240 1269064 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1768 1269536 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1592 3605428 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1572 3605448 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3164 7210876 1% /mnt/lustre - unlinked 0 (time 1773672543 ; total 0 ; last 0) total: 800 unlinks in 10 seconds: 80.000000 unlinks/second fail_val=0 fail_loc=0x80000221 Failing ost1 on oleg132-server Stopping /mnt/lustre-ost1 (opts:) on oleg132-server 10:49:25 (1773672565) shut down facet: ost1 facet_host: oleg132-server facet_failover_host: oleg132-server Failover ost1 to oleg132-server mount facets: ost1 Start ost1: mount -t lustre -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-OST0000 10:50:10 (1773672610) targets are mounted 10:50:10 (1773672610) facet_failover done Failing ost1 on oleg132-server Stopping /mnt/lustre-ost1 (opts:) on oleg132-server 10:50:31 (1773672631) shut down facet: ost1 facet_host: oleg132-server facet_failover_host: oleg132-server Failover ost1 to oleg132-server mount facets: ost1 Start ost1: mount -t lustre -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-OST0000 10:51:10 (1773672670) targets are mounted 10:51:10 (1773672670) facet_failover done oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec fail_val=0 fail_loc=0x0 Can't lstat /mnt/lustre/d61a.replay-single/f61a.replay-single-*: No such file or directory PASS 61a (236s) == replay-single test 61b: test race mds llog sync vs llog cleanup ========================================================== 10:52:13 (1773672733) fail_loc=0x8000013a Failing mds1 on oleg132-server Stopping /mnt/lustre-mds1 (opts:) on oleg132-server 10:52:25 (1773672745) shut down facet: mds1 facet_host: oleg132-server facet_failover_host: oleg132-server Failover mds1 to oleg132-server mount facets: mds1 Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0000 10:53:02 (1773672782) targets are mounted 10:53:02 (1773672782) facet_failover done Failing mds1 on oleg132-server Stopping /mnt/lustre-mds1 (opts:) on oleg132-server 10:53:21 (1773672801) shut down facet: mds1 facet_host: oleg132-server facet_failover_host: oleg132-server Failover mds1 to oleg132-server mount facets: mds1 Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0000 10:54:00 (1773672840) targets are mounted 10:54:00 (1773672840) facet_failover done oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec 1+0 records in 1+0 records out 4096 bytes (4.1 kB, 4.0 KiB) copied, 0.025926 s, 158 kB/s PASS 61b (135s) == replay-single test 61c: test race mds llog sync vs llog cleanup ========================================================== 10:54:28 (1773672868) fail_val=0 fail_loc=0x80000222 Failing ost1 on oleg132-server Stopping /mnt/lustre-ost1 (opts:) on oleg132-server 10:54:53 (1773672893) shut down facet: ost1 facet_host: oleg132-server facet_failover_host: oleg132-server Failover ost1 to oleg132-server mount facets: ost1 Start ost1: mount -t lustre -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-OST0000 10:55:33 (1773672933) targets are mounted 10:55:33 (1773672933) facet_failover done oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec fail_val=0 fail_loc=0x0 PASS 61c (93s) == replay-single test 61d: error in llog_setup should cleanup the llog context correctly ========================================================== 10:56:01 (1773672961) Stopping /mnt/lustre-mds1 (opts:) on oleg132-server fail_loc=0x80000605 Start mgs: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg132-server: mount.lustre: mount /dev/mapper/mds1_flakey at /mnt/lustre-mds1 failed: Operation not supported pdsh@oleg132-client: oleg132-server: ssh exited with exit code 95 Start of /dev/mapper/mds1_flakey on mgs failed 95 fail_loc=0 Start mgs: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0000 PASS 61d (75s) == replay-single test 62: don't mis-drop resent replay === 10:57:16 (1773673036) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2180 1269124 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1768 1269536 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1592 3605428 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1576 3605444 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3168 7210872 1% /mnt/lustre total: 25 open/close in 0.86 seconds: 29.08 ops/second fail_loc=0x80000707 Failing mds1 on oleg132-server Stopping /mnt/lustre-mds1 (opts:) on oleg132-server 10:57:44 (1773673064) shut down facet: mds1 facet_host: oleg132-server facet_failover_host: oleg132-server Failover mds1 to oleg132-server mount facets: mds1 Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0000 10:58:40 (1773673120) targets are mounted 10:58:40 (1773673120) facet_failover done oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec fail_loc=0 - unlinked 0 (time 1773673184 ; total 0 ; last 0) total: 25 unlinks in 1 seconds: 25.000000 unlinks/second PASS 62 (159s) == replay-single test 65a: AT: verify early replies ====== 10:59:55 (1773673195) at_history=8 at_history=8 debug=other fail_val=11000 fail_loc=0x8000050a 00000100:00001000:2.0:1773673245.436236:0:128729:0:(client.c:568:ptlrpc_at_recv_early_reply()) @@@ Early reply #1, new deadline in 36s (26s) req@ffff8a4b9051b800 x1859825910103680/t0(0) o101->lustre-MDT0000-mdc-ffff8a4b91a9f000@192.168.201.132@tcp:12/10 lens 664/66320 e 1 to 0 dl 1773673281 ref 2 fl Rpc:PQr/600/ffffffff rc 0/-1 job:'createmany.0' uid:0 gid:0 projid:0 portal 12 : cur 12 worst 40 (at 1773671010, 2244s ago) 37 40 40 40 portal 29 : cur 5 worst 5 (at 1773668602, 4652s ago) 5 5 0 0 portal 23 : cur 5 worst 5 (at 1773668602, 4652s ago) 5 5 5 5 portal 30 : cur 5 worst 5 (at 1773668618, 4636s ago) 5 0 0 0 portal 17 : cur 5 worst 5 (at 1773668786, 4468s ago) 5 5 5 0 portal 24 : cur 5 worst 5 (at 1773669109, 4145s ago) 5 0 0 0 portal 13 : cur 5 worst 5 (at 1773669629, 3625s ago) 5 0 5 0 portal 12 : cur 5 worst 40 (at 1773671010, 2253s ago) 37 40 40 40 portal 29 : cur 5 worst 5 (at 1773668602, 4661s ago) 5 5 0 0 portal 23 : cur 5 worst 5 (at 1773668602, 4661s ago) 5 5 5 5 portal 30 : cur 5 worst 5 (at 1773668618, 4645s ago) 5 0 0 0 portal 17 : cur 5 worst 5 (at 1773668786, 4477s ago) 5 5 5 0 portal 24 : cur 5 worst 5 (at 1773669109, 4154s ago) 5 0 0 0 portal 13 : cur 5 worst 5 (at 1773669629, 3634s ago) 5 0 5 0 PASS 65a (79s) == replay-single test 65b: AT: verify early replies on packed reply / bulk ========================================================== 11:01:14 (1773673274) at_history=8 at_history=8 debug=other trace fail_val=11 fail_loc=0x224 fail_loc=0 00000100:00001000:0.0:1773673322.238565:0:2406:0:(client.c:568:ptlrpc_at_recv_early_reply()) @@@ Early reply #1, new deadline in 35s (26s) req@ffff8a4b85148e00 x1859825910126720/t0(0) o4->lustre-OST0000-osc-ffff8a4b91a9f000@192.168.201.132@tcp:6/4 lens 4584/448 e 1 to 0 dl 1773673357 ref 2 fl Rpc:Qr/600/ffffffff rc 0/-1 job:'multiop.0' uid:0 gid:0 projid:0 portal 28 : cur 5 worst 5 (at 1773668602, 4730s ago) 5 5 5 5 portal 7 : cur 5 worst 5 (at 1773668605, 4727s ago) 5 5 5 5 portal 17 : cur 5 worst 5 (at 1773668734, 4598s ago) 5 0 0 5 portal 6 : cur 37 worst 37 (at 1773673326, 6s ago) 37 0 0 0 PASS 65b (66s) == replay-single test 66a: AT: verify MDT service time adjusts with no early replies ========================================================== 11:02:20 (1773673340) at_history=8 at_history=8 portal 12 : cur 5 worst 40 (at 1773671010, 2365s ago) 37 40 40 40 fail_val=5000 fail_loc=0x8000050a portal 12 : cur 5 worst 40 (at 1773671010, 2375s ago) 37 40 40 40 fail_val=10000 fail_loc=0x8000050a portal 12 : cur 36 worst 40 (at 1773671010, 2390s ago) 37 40 40 40 fail_loc=0 portal 12 : cur 5 worst 40 (at 1773671010, 2403s ago) 5 37 40 40 Current MDT timeout 5, worst 40 PASS 66a (83s) == replay-single test 66b: AT: verify net latency adjusts ========================================================== 11:03:44 (1773673424) at_history=8 at_history=8 fail_val=10 fail_loc=0x50c fail_loc=0 network timeout orig 5, cur 10, worst 10 PASS 66b (105s) == replay-single test 67a: AT: verify slow request processing doesn't induce reconnects ========================================================== 11:05:30 (1773673530) at_history=8 at_history=8 fail_val=400 fail_loc=0x50a fail_loc=0 0 osc reconnect attempts on gradual slow PASS 67a (94s) == replay-single test 67b: AT: verify instant slowdown doesn't induce reconnects ========================================================== 11:07:04 (1773673624) at_history=8 at_history=8 Creating to objid 5441 on ost lustre-OST0000... fail_val=20000 fail_loc=0x80000223 total: 17 open/close in 0.58 seconds: 29.18 ops/second Connected clients: oleg132-client.virtnet oleg132-client.virtnet service : cur 5 worst 5 (at 1773668278, 5392s ago) 1 1 1 1 phase 2 1 osc reconnect attempts on instant slow fail_loc=0x80000223 fail_loc=0 Connected clients: oleg132-client.virtnet oleg132-client.virtnet service : cur 5 worst 5 (at 1773668278, 5400s ago) 1 1 1 1 0 osc reconnect attempts on 2nd slow PASS 67b (62s) == replay-single test 68: AT: verify slowing locks ======= 11:08:06 (1773673686) at_history=8 at_history=8 oleg132-server: bash: line 1: /sys/module/ptlrpc/parameters/ldlm_enqueue_min: Permission denied pdsh@oleg132-client: oleg132-server: ssh exited with exit code 126 /home/green/git/lustre-release/lustre/tests/replay-single.sh: line 1974: $ldlm_enqueue_min: ambiguous redirect oleg132-server: bash: /sys/fs/lustre/ldlm/ldlm_enqueue_min: Permission denied oleg132-server: bash: line 1: /sys/module/ptlrpc/parameters/ldlm_enqueue_min: Permission denied pdsh@oleg132-client: oleg132-server: ssh exited with exit code 126 fail_val=19 fail_loc=0x80000312 fail_val=25 fail_loc=0x80000312 fail_loc=0 /home/green/git/lustre-release/lustre/tests/replay-single.sh: line 1989: $ldlm_enqueue_min: ambiguous redirect oleg132-server: bash: /sys/fs/lustre/ldlm/ldlm_enqueue_min: Permission denied oleg132-server: bash: line 1: /sys/module/ptlrpc/parameters/ldlm_enqueue_min: Permission denied pdsh@oleg132-client: oleg132-server: ssh exited with exit code 126 PASS 68 (100s) Cleaning up AT ... == replay-single test 70a: check multi client t-f ======== 11:09:46 (1773673786) SKIP: replay-single test_70a Need two or more clients, have 1 SKIP 70a (6s) == replay-single test 70b: dbench 2mdts recovery; 1 clients ========================================================== 11:09:52 (1773673792) Starting client oleg132-client.virtnet: -o user_xattr,flock 192.168.201.132@tcp:/lustre /mnt/lustre Started clients oleg132-client.virtnet: 192.168.201.132@tcp:/lustre on /mnt/lustre type lustre (rw,checksum,encrypt,flock,lazystatfs,lruresize,nolock,statfs_project,nouser_fid2path,user_xattr,verbose) striped dir -i0 -c2 -H crush /mnt/lustre/d70b.replay-single + MISSING_DBENCH_OK= + PATH=/opt/iozone/bin:/opt/iozone/bin:/home/green/git/lustre-release/lustre/tests/mpi:/home/green/git/lustre-release/lustre/tests/racer:/home/green/git/lustre-release/lustre/../lustre-iokit/sgpdd-survey:/home/green/git/lustre-release/lustre/tests:/home/green/git/lustre-release/lustre/utils/gss:/home/green/git/lustre-release/lustre/utils:/opt/iozone/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin::/home/green/git/lustre-release/lustre/utils:/home/green/git/lustre-release/lustre/tests:/sbin:/usr/sbin:/home/green/git/lustre-release/lustre/utils:/home/green/git/lustre-release/lustre/tests/: + DBENCH_LIB= + TESTSUITE=replay-single + TESTNAME=test_70b + MOUNT=/mnt/lustre ++ hostname + DIR=/mnt/lustre/d70b.replay-single/oleg132-client.virtnet + LCTL=/home/green/git/lustre-release/lustre/utils/lctl + rundbench 1 -t 120 dbench: no process found dbench: no process found dbench: no process found Started rundbench load pid=134407 ... UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2396 1268908 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2004 1269300 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 4672 3600460 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 28204 3576648 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 32876 7177108 1% /mnt/lustre test_70b fail mds1 1 times Failing mds1 on oleg132-server Stopping /mnt/lustre-mds1 (opts:) on oleg132-server 11:10:29 (1773673829) shut down facet: mds1 facet_host: oleg132-server facet_failover_host: oleg132-server oleg132-client.virtnet: looking for dbench program oleg132-client.virtnet: /usr/bin/dbench oleg132-client.virtnet: creating output directory /mnt/lustre/d70b.replay-single/oleg132-client.virtnet oleg132-client.virtnet: mkdir: created directory '/mnt/lustre/d70b.replay-single/oleg132-client.virtnet' oleg132-client.virtnet: found dbench client file /usr/share/dbench/client.txt oleg132-client.virtnet: '/usr/share/dbench/client.txt' -> 'client.txt' oleg132-client.virtnet: running 'dbench 1 -t 120' on /mnt/lustre/d70b.replay-single/oleg132-client.virtnet at Mon Mar 16 11:09:59 EDT 2026 oleg132-client.virtnet: waiting for dbench pid 134445 oleg132-client.virtnet: dbench version 4.00 - Copyright Andrew Tridgell 1999-2004 oleg132-client.virtnet: oleg132-client.virtnet: Running for 120 seconds with load 'client.txt' and minimum warmup 24 secs oleg132-client.virtnet: failed to create barrier semaphore oleg132-client.virtnet: 0 of 1 processes prepared for launch 0 sec oleg132-client.virtnet: 1 of 1 processes prepared for launch 0 sec oleg132-client.virtnet: releasing clients oleg132-client.virtnet: 1 39 1.37 MB/sec warmup 1 sec latency 77.141 ms oleg132-client.virtnet: 1 111 2.61 MB/sec warmup 2 sec latency 175.639 ms oleg132-client.virtnet: 1 140 2.00 MB/sec warmup 3 sec latency 205.215 ms oleg132-client.virtnet: 1 165 1.69 MB/sec warmup 4 sec latency 221.381 ms oleg132-client.virtnet: 1 200 1.56 MB/sec warmup 5 sec latency 165.012 ms oleg132-client.virtnet: 1 247 1.56 MB/sec warmup 6 sec latency 173.929 ms oleg132-client.virtnet: 1 292 1.52 MB/sec warmup 7 sec latency 128.467 ms oleg132-client.virtnet: 1 331 1.50 MB/sec warmup 8 sec latency 129.885 ms oleg132-client.virtnet: 1 373 1.48 MB/sec warmup 9 sec latency 131.987 ms oleg132-client.virtnet: 1 421 1.47 MB/sec warmup 10 sec latency 142.806 ms oleg132-client.virtnet: 1 461 1.46 MB/sec warmup 11 sec latency 238.286 ms oleg132-client.virtnet: 1 511 1.47 MB/sec warmup 12 sec latency 141.650 ms oleg132-client.virtnet: 1 612 1.60 MB/sec warmup 13 sec latency 135.147 ms oleg132-client.virtnet: 1 673 1.50 MB/sec warmup 14 sec latency 87.113 ms oleg132-client.virtnet: 1 711 1.41 MB/sec warmup 15 sec latency 118.622 ms oleg132-client.virtnet: 1 740 1.33 MB/sec warmup 16 sec latency 127.429 ms oleg132-client.virtnet: 1 779 1.25 MB/sec warmup 17 sec latency 131.311 ms oleg132-client.virtnet: 1 815 1.19 MB/sec warmup 18 sec latency 104.083 ms oleg132-client.virtnet: 1 872 1.13 MB/sec warmup 19 sec latency 116.366 ms oleg132-client.virtnet: 1 885 1.08 MB/sec warmup 20 sec latency 311.647 ms oleg132-client.virtnet: 1 913 1.03 MB/sec warmup 21 sec latency 512.975 ms oleg132-client.virtnet: 1 955 0.99 MB/sec warmup 22 sec latency 99.446 ms oleg132-client.virtnet: 1 984 0.95 MB/sec warmup 23 sec latency 154.687 ms oleg132-client.virtnet: 1 1129 0.09 MB/sec execute 1 sec latency 90.259 ms oleg132-client.virtnet: 1 1187 0.07 MB/sec execute 2 sec latency 104.486 ms oleg132-client.virtnet: 1 1236 0.06 MB/sec execute 3 sec latency 115.870 ms oleg132-client.virtnet: 1 1275 0.06 MB/sec execute 4 sec latency 121.677 ms oleg132-client.virtnet: 1 1412 0.62 MB/sec execute 5 sec latency 101.904 ms oleg132-client.virtnet: 1 1462 0.52 MB/sec execute 6 sec latency 96.863 ms oleg132-client.virtnet: 1 1520 0.46 MB/sec execute 7 sec latency 92.198 ms oleg132-client.virtnet: 1 1612 0.41 MB/sec execute 8 sec latency 86.256 ms oleg132-client.virtnet: 1 1689 0.37 MB/sec execute 9 sec latency 98.179 ms oleg132-client.virtnet: 1 1761 0.34 MB/sec execute 10 sec latency 107.906 ms olegFailover mds1 to oleg132-server mount facets: mds1 Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all 132-client.virtnet: 1 1808 0.31 MB/sec execute 11 sec latency 112.815 ms oleg132-client.virtnet: 1 1871 0.29 MB/sec execute 12 sec latency 76.147 ms oleg132-client.virtnet: 1 1978 0.27 MB/sec execute 13 sec latency 63.138 ms oleg132-client.virtnet: 1 2056 0.27 MB/sec execute 14 sec latency 70.886 ms oleg132-client.virtnet: 1 2117 0.25 MB/sec execute 15 sec latency 135.073 ms oleg132-client.virtnet: 1 2162 0.24 MB/sec execute 16 sec latency 89.371 ms oleg132-client.virtnet: 1 2198 0.23 MB/sec execute 17 sec latency 233.304 ms oleg132-client.virtnet: 1 2305 0.27 MB/sec execute 18 sec latency 72.060 ms oleg132-client.virtnet: 1 2373 0.26 MB/sec execute 19 sec latency 97.083 ms oleg132-client.virtnet: 1 2408 0.26 MB/sec execute 20 sec latency 84.269 ms oleg132-client.virtnet: 1 2454 0.25 MB/sec execute 21 sec latency 87.600 ms oleg132-client.virtnet: 1 2489 0.26 MB/sec execute 22 sec latency 159.762 ms oleg132-client.virtnet: 1 2583 0.26 MB/sec execute 23 sec latency 112.660 ms oleg132-client.virtnet: 1 2765 0.38 MB/sec execute 24 sec latency 78.899 ms oleg132-client.virtnet: 1 2898 0.40 MB/sec execute 25 sec latency 93.449 ms oleg132-client.virtnet: 1 2979 0.41 MB/sec execute 26 sec latency 97.229 ms oleg132-client.virtnet: 1 3026 0.39 MB/sec execute 27 sec latency 112.023 ms oleg132-client.virtnet: 1 3260 0.44 MB/sec execute 28 sec latency 87.610 ms oleg132-client.virtnet: 1 3361 0.43 MB/sec execute 29 sec latency 78.388 ms oleg132-client.virtnet: 1 3535 0.51 MB/sec execute 30 sec latency 104.404 ms oleg132-client.virtnet: 1 3624 0.51 MB/sec execute 31 sec latency 82.090 ms oleg132-client.virtnet: 1 3685 0.49 MB/sec execute 32 sec latency 71.224 ms oleg132-client.virtnet: 1 3738 0.49 MB/sec execute 33 sec latency 75.059 ms oleg132-client.virtnet: 1 3820 0.49 MB/sec execute 34 sec latency 89.887 ms oleg132-client.virtnet: 1 3906 0.49 MB/sec execute 35 sec latency 51.902 ms oleg132-client.virtnet: 1 3975 0.48 MB/sec execute 36 sec latency 69.398 ms oleg132-client.virtnet: 1 4052 0.46 MB/sec execute 37 sec latency 61.028 ms oleg132-client.virtnet: 1 4152 0.45 MB/sec execute 38 sec latency 123.155 ms oleg132-client.virtnet: 1 4266 0.45 MB/sec execute 39 sec latency 43.774 ms oleg132-client.virtnet: 1 4335 0.44 MB/sec execute 40 sec latency 81.613 ms oleg132-client.virtnet: 1 4395 0.43 MB/sec execute 41 sec latency 112.239 ms oleg132-client.virtnet: 1 4470 0.43 MB/sec execute 42 sec latency 125.161 ms oleg132-client.virtnet: 1 4529 0.42 MB/sec execute 43 sec latency 83.987 ms oleg132-client.virtnet: 1 4597 0.43 MB/sec execute 44 sec latency 101.361 ms oleg132-client.virtnet: 1 4705 0.43 MB/sec execute 45 sec latency 42.488 ms oleg132-client.virtnet: 1 4787 0.42 MB/sec execute 46 sec latency 57.637 ms oleg132-client.virtnet: 1 4830 0.41 MB/sec execute 47 sec latency 81.298 ms oleg132-client.virtnet: 1 4949 0.46 MB/sec execute 48 sec latency 187.743 ms oleg132-client.virtnet: 1 4957 0.45 MB/sec execute 49 sec latency 578.963 ms oleg132-client.virtnet: 1 4957 0.44 MB/sec execute 50 sec latency 1580.677 ms oleg132-client.virtnet: 1 4957 0.44 MB/sec execute 51 sec latency 2583.087 ms oleg132-client.virtnet: 1 4957 0.43 MB/sec execute 52 sec latency 3583.526 ms oleg132-client.virtnet: 1 4957 0.42 MB/sec execute 53 sec latency 4584.865 ms oleg132-client.virtnet: 1 4992 0.41 MB/sec execute 54 sec latency 4883.974 ms oleg132-client.virtnet: 1 5040 0.40 MB/sec execute 55 sec pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0000 11:11:21 (1773673881) targets are mounted 11:11:21 (1773673881) facet_failover done oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2336 1268968 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1984 1269320 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 13188 3592876 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 36448 3570144 2% /mnt/lustre[OST:1] filesystem_summary: 7666232 49636 7163020 1% /mnt/lustre test_70b fail mds2 2 times Failing mds2 on oleg132-server Stopping /mnt/lustre-mds2 (opts:) on oleg132-server 11:11:59 (1773673919) shut down facet: mds2 facet_host: oleg132-server facet_failover_host: oleg132-server latency 104.319 ms oleg132-client.virtnet: 1 5078 0.40 MB/sec execute 56 sec latency 110.887 ms oleg132-client.virtnet: 1 5135 0.39 MB/sec execute 57 sec latency 105.197 ms oleg132-client.virtnet: 1 5181 0.39 MB/sec execute 58 sec latency 99.348 ms oleg132-client.virtnet: 1 5237 0.38 MB/sec execute 59 sec latency 81.317 ms oleg132-client.virtnet: 1 5276 0.37 MB/sec execute 60 sec latency 115.651 ms oleg132-client.virtnet: 1 5321 0.37 MB/sec execute 61 sec latency 109.887 ms oleg132-client.virtnet: 1 5360 0.36 MB/sec execute 62 sec latency 79.157 ms oleg132-client.virtnet: 1 5420 0.36 MB/sec execute 63 sec latency 85.981 ms oleg132-client.virtnet: 1 5475 0.35 MB/sec execute 64 sec latency 85.932 ms oleg132-client.virtnet: 1 5543 0.35 MB/sec execute 65 sec latency 88.041 ms oleg132-client.virtnet: 1 5573 0.34 MB/sec execute 66 sec latency 150.540 ms oleg132-client.virtnet: 1 5649 0.34 MB/sec execute 67 sec latency 90.693 ms oleg132-client.virtnet: 1 5710 0.34 MB/sec execute 68 sec latency 95.982 ms oleg132-client.virtnet: 1 5743 0.33 MB/sec execute 69 sec latency 140.602 ms oleg132-client.virtnet: 1 5819 0.33 MB/sec execute 70 sec latency 83.147 ms oleg132-client.virtnet: 1 5907 0.34 MB/sec execute 71 sec latency 102.413 ms oleg132-client.virtnet: 1 5954 0.34 MB/sec execute 72 sec latency 101.977 ms oleg132-client.virtnet: 1 6000 0.33 MB/sec execute 73 sec latency 87.606 ms oleg132-client.virtnet: 1 6037 0.34 MB/sec execute 74 sec latency 123.187 ms oleg132-client.virtnet: 1 6124 0.33 MB/sec execute 75 sec latency 130.800 ms oleg132-client.virtnet: 1 6154 0.33 MB/sec execute 76 sec latency 128.528 ms oleg132-client.virtnet: 1 6321 0.37 MB/sec execute 77 sec latency 100.437 ms oleg132-client.virtnet: 1 6451 0.37 MB/sec execute 78 sec latency 115.102 ms oleg132-client.virtnet: 1 6525 0.38 MB/sec execute 79 sec latency 176.623 ms oleg132-client.virtnet: 1 6575 0.37 MB/sec execute 80 sec latency 114.581 ms oleg132-client.virtnet: 1 6752 0.38 MB/sec execute 81 sec latency 387.358 ms oleg132-client.virtnet: 1 6752 0.37 MB/sec execute 82 sec latency 1387.563 ms oleg132-client.virtnet: 1 6752 0.37 MB/sec execute 83 sec latency 2387.778 ms oleg132-client.virtnet: 1 6801 0.37 MB/sec execute 84 sec latency 2694.029 ms oleg132-client.virtnet: 1 6907 0.37 MB/sec execute 85 sec latency 84.688 ms oleg132-client.virtnet: 1 6986 0.38 MB/sec execute 86 sec latency 101.785 ms oleg132-client.virtnet: 1 7102 0.40 MB/sec execute 87 sec latency 109.605 ms oleg132-client.virtnet: 1 7173 0.39 MB/sec execute 88 sec latency 75.370 ms oleg132-client.virtnet: 1 7219 0.39 MB/sec execute 89 sec latency 106.830 ms oleg132-client.virtnet: 1 7283 0.39 MB/sec execute 90 sec latency 100.289 ms oleg132-client.virtnet: 1 7345 0.39 MB/sec execute 91 sec latency 115.668 ms oleg132-client.virtnet: 1 7380 0.39 MB/sec execute 92 sec latency 198.614 ms oleg132-client.virtnet: 1 7380 0.39 MB/sec execute 93 sec latency 1159.139 ms oleg132-client.virtnet: 1 7380 0.38 MB/sec execute 94 sec latency 2160.556 ms oleg132-client.virtnet: 1 7380 0.38 MB/sec execute 95 sec latency 3160.989 ms oleg132-client.virtnet: 1 7380 0.37 MB/sec execute 96 sec latency 4164.459 ms oleg132-client.virtnet: 1 7380 0.37 MB/sec execute 97 sec latency 5165.582 ms oleg132-client.virtnet: 1 7380 0.37 MB/sec execute 98 sec latency 6167.349 ms oleg132-client.virtnet: 1 7380 0.36 MB/sec execute 99 sec latency 7167.749 ms oleg132-client.virtnet: 1 Failover mds2 to oleg132-server mount facets: mds2 Start mds2: mount -t lustre -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0001 11:12:38 (1773673958) targets are mounted 11:12:38 (1773673958) facet_failover done 7380 0.36 MB/sec execute 100 sec latency 8168.060 ms oleg132-client.virtnet: 1 7380 0.36 MB/sec execute 101 sec latency 9168.294 ms oleg132-client.virtnet: 1 7380 0.35 MB/sec execute 102 sec latency 10168.997 ms oleg132-client.virtnet: 1 7380 0.35 MB/sec execute 103 sec latency 11172.737 ms oleg132-client.virtnet: 1 7380 0.35 MB/sec execute 104 sec latency 12173.210 ms oleg132-client.virtnet: 1 7380 0.34 MB/sec execute 105 sec latency 13174.212 ms oleg132-client.virtnet: 1 7380 0.34 MB/sec execute 106 sec latency 14182.171 ms oleg132-client.virtnet: 1 7380 0.34 MB/sec execute 107 sec latency 15182.370 ms oleg132-client.virtnet: 1 7380 0.33 MB/sec execute 108 sec latency 16183.111 ms oleg132-client.virtnet: 1 7380 0.33 MB/sec execute 109 sec latency 17183.870 ms oleg132-client.virtnet: 1 7380 0.33 MB/sec execute 110 sec latency 18187.445 ms oleg132-client.virtnet: 1 7380 0.32 MB/sec execute 111 sec latency 19188.002 ms oleg132-client.virtnet: 1 7380 0.32 MB/sec execute 112 sec latency 20188.228 ms oleg132-client.virtnet: 1 7380 0.32 MB/sec execute 113 sec latency 21188.409 ms oleg132-client.virtnet: 1 7380 0.32 MB/sec execute 114 sec latency 22190.516 ms oleg132-client.virtnet: 1 7380 0.31 MB/sec execute 115 sec latency 23190.938 ms oleg132-client.virtnet: 1 7380 0.31 MB/sec execute 116 sec latency 24192.736 ms oleg132-client.virtnet: 1 7380 0.31 MB/sec execute 117 sec latency 25193.564 ms oleg132-client.virtnet: 1 7380 0.30 MB/sec execute 118 sec latency 26193.863 ms oleg132-client.virtnet: 1 7380 0.30 MB/sec execute 119 sec latency 27195.802 ms oleg132-client.virtnet: 1 cleanup 120 sec oleg132-client.virtnet: 1 cleanup 121 sec oleg132-client.virtnet: 1 cleanup 122 sec oleg132-client.virtnet: 1 cleanup 123 sec oleg132-client.virtnet: 1 cleanup 124 sec oleg132-client.virtnet: 1 cleanup 125 sec oleg132-client.virtnet: 1 cleanup 126 sec oleg132-client.virtnet: 1 cleanup 127 sec oleg132-client.virtnet: 1 cleanup 128 sec oleg132-client.virtnet: 1 cleanup 129 sec oleg132-client.virtnet: 1 cleanup 130 sec oleg132-client.virtnet: 1 cleanup 131 sec oleg132-client.virtnet: 1 cleanup 132 sec oleg132-client.virtnet: 0 cleanup 133 sec oleg132-client.virtnet: oleg132-client.virtnet: Operation Count AvgLat MaxLat oleg132-client.virtnet: ---------------------------------------- oleg132-client.virtnet: NTCreateX 1045 83.896 38803.415 oleg132-client.virtnet: Close 794 10.748 75.347 oleg132-client.virtnet: Rename 48 67.382 176.585 oleg132-client.virtnet: Unlink 185 14.743 48.652 oleg132-client.virtnet: Qpathinfo 1009 12.368 187.714 oleg132-client.virtnet: Qfileinfo 183 1.177 21.905 oleg132-client.virtnet: Qfsinfo 173 2.116 66.107 oleg132-client.virtnet: Sfileinfo 82 46.635 125.190 oleg132-client.virtnet: Find 379 2.431 119.661 oleg132-client.virtnet: WriteX 536 8.589 85.958 oleg132-client.virtnet: ReadX 1817 0.055 8.444 oleg132-client.virtnet: LockX 4 18.191 30.742 oleg132-client.virtnet: UnlockX 4 20.766 34.132 oleg132-client.virtnet: Flush 62 92.695 2694.010 oleg132-client.virtnet: oleg132-client.virtnet: Throughput 0.30186 MB/sec 1 clients 1 procs max_latency=27195.802 ms oleg132-client.virtnet: stopping dbench on /mnt/lustre/d70b.replay-single/oleg132-client.virtnet at Mon Mar 16 11:12:37 EDT 2026 with return code 0 oleg132-client.virtnet: clean dbench files on /mnt/lustre/d70b.replay-single/oleg132-client.virtnet oleg132-client.virtnet: /mnt/lustre/d70b.replay-single/oleg132-client.virtnet /mnt/lustre/d70b.replay-single/oleg132-client.virtnet oleg132-client.virtnet: removed directory 'clients/client0/~dmtmp/WORD' oleg132-client.virtnet: removed directory 'clients/client0/~dmtmp/COREL' oleg132-client.virtnet: removed directory 'clients/client0/~dmtmp/SEED' oleg132-client.virtnet: removed directory 'clients/client0/~dmtmp/PARADOX' oleg132-client.virtnet: removed directory 'clients/client0/~dmtmp/PWRPNT' oleg132-client.virtnet: removed directory 'clients/client0/~dmtmp/ACCESS' oleg132-client.virtnet: removed directory 'clients/client0/~dmtmp/PM' oleg132-client.virtnet: removed directory 'clients/client0/~dmtmp/EXCEL' oleg132-client.virtnet: removed directory 'clients/client0/~dmtmp/WORDPRO' oleg132-client.virtnet: removed directory 'clients/client0/~dmtmp' oleg132-client.virtnet: removed directory 'clients/client0' oleg132-client.virtnet: removed directory 'clients' oleg132-client.virtnet: removed 'client.txt' oleg132-client.virtnet: /mnt/lustre/d70b.replay-single/oleg132-client.virtnet oleg132-client.virtnet: dbench successfully finished oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 70b (184s) == replay-single test 70c: tar 2mdts recovery ============ 11:12:56 (1773673976) Starting client oleg132-client.virtnet: -o user_xattr,flock 192.168.201.132@tcp:/lustre /mnt/lustre Started clients oleg132-client.virtnet: 192.168.201.132@tcp:/lustre on /mnt/lustre type lustre (rw,checksum,encrypt,flock,lazystatfs,lruresize,nolock,statfs_project,nouser_fid2path,user_xattr,verbose) Started tar 137312 striped dir -i0 -c2 -H fnv_1a_64 /mnt/lustre/d70c.replay-single tar: Removing leading `/' from member names tar: Removing leading `/' from hard link targets UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 4748 1266556 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 4388 1266916 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 23964 3582508 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 3304 3603192 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 27268 7185700 1% /mnt/lustre tar: Removing leading `/' from member names tar: Removing leading `/' from hard link targets test_70c fail mds2 1 times Failing mds2 on oleg132-server Stopping /mnt/lustre-mds2 (opts:) on oleg132-server 11:15:36 (1773674136) shut down facet: mds2 facet_host: oleg132-server facet_failover_host: oleg132-server Failover mds2 to oleg132-server mount facets: mds2 Start mds2: mount -t lustre -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0001 11:16:20 (1773674180) targets are mounted 11:16:20 (1773674180) facet_failover done oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 70c (367s) == replay-single test 70d: mkdir/rmdir striped dir 2mdts recovery ========================================================== 11:19:04 (1773674344) Starting client oleg132-client.virtnet: -o user_xattr,flock 192.168.201.132@tcp:/lustre /mnt/lustre Started clients oleg132-client.virtnet: 192.168.201.132@tcp:/lustre on /mnt/lustre type lustre (rw,checksum,encrypt,flock,lazystatfs,lruresize,nolock,statfs_project,nouser_fid2path,user_xattr,verbose) Started 139139 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3944 1267360 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 3612 1267692 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1600 3605392 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1576 3605444 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3176 7210836 1% /mnt/lustre test_70d fail mds2 1 times Failing mds2 on oleg132-server Stopping /mnt/lustre-mds2 (opts:) on oleg132-server 11:21:47 (1773674507) shut down facet: mds2 facet_host: oleg132-server facet_failover_host: oleg132-server Failover mds2 to oleg132-server mount facets: mds2 Start mds2: mount -t lustre -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0001 11:22:26 (1773674546) targets are mounted 11:22:26 (1773674546) facet_failover done oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec /home/green/git/lustre-release/lustre/tests/test-framework.sh: line 7707: 139139 Killed ( while true; do $LFS mkdir -i0 -c2 $DIR/$tdir/test || { echo "mkdir fails"; break; }; $LFS mkdir -i1 -c2 $DIR/$tdir/test1 || { echo "mkdir fails"; break; }; touch $DIR/$tdir/test/a || { echo "touch fails"; break; }; mkdir $DIR/$tdir/test/b || { echo "mkdir fails"; break; }; rm -rf $DIR/$tdir/test || { echo "rmdir fails"; ls -lR $DIR/$tdir; break; }; touch $DIR/$tdir/test1/a || { echo "touch fails"; break; }; mkdir $DIR/$tdir/test1/b || { echo "mkdir fails"; break; }; rm -rf $DIR/$tdir/test1 || { echo "rmdir fails"; ls -lR $DIR/$tdir/test1; break; }; done ) (wd: ~) PASS 70d (223s) == replay-single test 70e: rename cross-MDT with random fails ========================================================== 11:22:47 (1773674567) debug=+ha Starting client oleg132-client.virtnet: -o user_xattr,flock 192.168.201.132@tcp:/lustre /mnt/lustre Started clients oleg132-client.virtnet: 192.168.201.132@tcp:/lustre on /mnt/lustre type lustre (rw,checksum,encrypt,flock,lazystatfs,lruresize,nolock,statfs_project,nouser_fid2path,user_xattr,verbose) Started PID=142597 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3492 1267812 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 3200 1268104 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1600 3605420 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1576 3605444 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3176 7210864 1% /mnt/lustre test_70e fail mds2 1 times Failing mds2 on oleg132-server Stopping /mnt/lustre-mds2 (opts:) on oleg132-server 11:25:28 (1773674728) shut down facet: mds2 facet_host: oleg132-server facet_failover_host: oleg132-server Failover mds2 to oleg132-server mount facets: mds2 Start mds2: mount -t lustre -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0001 11:26:08 (1773674768) targets are mounted 11:26:08 (1773674768) facet_failover done oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 70e (225s) == replay-single test 70f: OSS O_DIRECT recovery with 1 clients ========================================================== 11:26:33 (1773674793) mount clients oleg132-client.virtnet ... Starting client oleg132-client.virtnet: -o user_xattr,flock 192.168.201.132@tcp:/lustre /mnt/lustre Started clients oleg132-client.virtnet: 192.168.201.132@tcp:/lustre on /mnt/lustre type lustre (rw,checksum,encrypt,flock,lazystatfs,lruresize,nolock,statfs_project,nouser_fid2path,user_xattr,verbose) Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg132-client.virtnet' ... ldlm.namespaces.MGC192.168.201.132@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8a4b91a9f000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8a4b91a9f000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8a4b91a9f000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8a4b91a9f000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg132-client.virtnet' ... ldlm.namespaces.MGC192.168.201.132@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8a4b91a9f000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8a4b91a9f000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8a4b91a9f000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8a4b91a9f000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg132-client.virtnet' ... UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3420 1267884 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 3068 1268236 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1600 3599228 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1576 3601324 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3176 7200552 1% /mnt/lustre ldlm.namespaces.MGC192.168.201.132@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8a4b91a9f000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8a4b91a9f000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8a4b91a9f000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8a4b91a9f000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg132-client.virtnet' ... ldlm.namespaces.MGC192.168.201.132@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8a4b91a9f000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8a4b91a9f000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8a4b91a9f000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8a4b91a9f000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg132-client.virtnet' ... ldlm.namespaces.MGC192.168.201.132@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8a4b91a9f000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8a4b91a9f000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8a4b91a9f000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8a4b91a9f000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg132-client.virtnet' ... test_70f failing OST 1 times Failing ost1 on oleg132-server Stopping /mnt/lustre-ost1 (opts:) on oleg132-server 11:27:02 (1773674822) shut down facet: ost1 facet_host: oleg132-server facet_failover_host: oleg132-server Failover ost1 to oleg132-server mount facets: ost1 Start ost1: mount -t lustre -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 ldlm.namespaces.MGC192.168.201.132@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8a4b91a9f000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8a4b91a9f000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8a4b91a9f000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8a4b91a9f000.lru_size=clear seq.cli-lustre-OST0000-super.width=65536 Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg132-client.virtnet' ... ldlm.namespaces.MGC192.168.201.132@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8a4b91a9f000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8a4b91a9f000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8a4b91a9f000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8a4b91a9f000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg132-client.virtnet' ... ldlm.namespaces.MGC192.168.201.132@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8a4b91a9f000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8a4b91a9f000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8a4b91a9f000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8a4b91a9f000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg132-client.virtnet' ... ldlm.namespaces.MGC192.168.201.132@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8a4b91a9f000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8a4b91a9f000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8a4b91a9f000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8a4b91a9f000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg132-client.virtnet' ... oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all ldlm.namespaces.MGC192.168.201.132@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8a4b91a9f000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8a4b91a9f000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8a4b91a9f000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8a4b91a9f000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg132-client.virtnet' ... pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 ldlm.namespaces.MGC192.168.201.132@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8a4b91a9f000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8a4b91a9f000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8a4b91a9f000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8a4b91a9f000.lru_size=clear Started lustre-OST0000 11:27:43 (1773674863) targets are mounted 11:27:43 (1773674863) facet_failover done Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg132-client.virtnet' ... ldlm.namespaces.MGC192.168.201.132@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8a4b91a9f000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8a4b91a9f000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8a4b91a9f000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8a4b91a9f000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg132-client.virtnet' ... ldlm.namespaces.MGC192.168.201.132@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8a4b91a9f000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8a4b91a9f000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8a4b91a9f000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8a4b91a9f000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg132-client.virtnet' ... ldlm.namespaces.MGC192.168.201.132@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8a4b91a9f000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8a4b91a9f000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8a4b91a9f000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8a4b91a9f000.lru_size=clear oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg132-client.virtnet' ... ldlm.namespaces.MGC192.168.201.132@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8a4b91a9f000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8a4b91a9f000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8a4b91a9f000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8a4b91a9f000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg132-client.virtnet' ... osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec ldlm.namespaces.MGC192.168.201.132@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8a4b91a9f000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8a4b91a9f000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8a4b91a9f000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8a4b91a9f000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg132-client.virtnet' ... ldlm.namespaces.MGC192.168.201.132@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8a4b91a9f000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8a4b91a9f000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8a4b91a9f000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8a4b91a9f000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg132-client.virtnet' ... ldlm.namespaces.MGC192.168.201.132@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8a4b91a9f000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8a4b91a9f000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8a4b91a9f000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8a4b91a9f000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg132-client.virtnet' ... ldlm.namespaces.MGC192.168.201.132@tcp.lru_size=clear ldlm.namespaces.lustre-MDT0000-mdc-ffff8a4b91a9f000.lru_size=clear ldlm.namespaces.lustre-MDT0001-mdc-ffff8a4b91a9f000.lru_size=clear ldlm.namespaces.lustre-OST0000-osc-ffff8a4b91a9f000.lru_size=clear ldlm.namespaces.lustre-OST0001-osc-ffff8a4b91a9f000.lru_size=clear Write/read files in: '/mnt/lustre/d70f.replay-single', clients: 'oleg132-client.virtnet' ... PASS 70f (100s) == replay-single test 71a: mkdir/rmdir striped dir with 2 mdts recovery ========================================================== 11:28:14 (1773674894) Starting client oleg132-client.virtnet: -o user_xattr,flock 192.168.201.132@tcp:/lustre /mnt/lustre Started clients oleg132-client.virtnet: 192.168.201.132@tcp:/lustre on /mnt/lustre type lustre (rw,checksum,encrypt,flock,lazystatfs,lruresize,nolock,statfs_project,nouser_fid2path,user_xattr,verbose) Started 149983 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3028 1268276 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2820 1268484 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1600 3605420 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1576 3605444 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3176 7210864 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 3160 1268144 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2960 1268344 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1600 3605420 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1576 3605444 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3176 7210864 1% /mnt/lustre fail mds2 mds1 1 times Failing mds2 on oleg132-server Stopping /mnt/lustre-mds2 (opts:) on oleg132-server Failing mds1 on oleg132-server Stopping /mnt/lustre-mds1 (opts:) on oleg132-server 11:31:13 (1773675073) shut down facet: mds2 facet_host: oleg132-server facet_failover_host: oleg132-server facet: mds1 facet_host: oleg132-server facet_failover_host: oleg132-server Failover mds1 to oleg132-server Failover mds2 to oleg132-server mount facets: mds2 mount facets: mds1 Start mds2: mount -t lustre -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0001 Started lustre-MDT0000 11:32:05 (1773675125) targets are mounted 11:32:05 (1773675125) facet_failover done oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid,mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec /home/green/git/lustre-release/lustre/tests/test-framework.sh: line 4991: 149983 Killed ( while true; do $LFS mkdir -i0 -c2 $DIR/$tdir/test; rmdir $DIR/$tdir/test; done ) (wd: ~) PASS 71a (272s) == replay-single test 73a: open(O_CREAT), unlink, replay, reconnect before open replay, close ========================================================== 11:32:46 (1773675166) multiop /mnt/lustre/f73a.replay-single vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.8185 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2860 1268444 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2640 1268664 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1600 3605420 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1576 3605444 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3176 7210864 1% /mnt/lustre fail_loc=0x80000302 Failing mds1 on oleg132-server Stopping /mnt/lustre-mds1 (opts:) on oleg132-server 11:33:12 (1773675192) shut down facet: mds1 facet_host: oleg132-server facet_failover_host: oleg132-server Failover mds1 to oleg132-server mount facets: mds1 Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0000 11:34:04 (1773675244) targets are mounted 11:34:04 (1773675244) facet_failover done oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 73a (103s) == replay-single test 73b: open(O_CREAT), unlink, replay, reconnect at open_replay reply, close ========================================================== 11:34:30 (1773675270) multiop /mnt/lustre/f73b.replay-single vO_tSc TMPPIPE=/tmp/multiop_open_wait_pipe.8185 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2328 1268976 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 1960 1269344 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1600 3605420 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1576 3605444 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3176 7210864 1% /mnt/lustre fail_loc=0x80000157 Failing mds1 on oleg132-server Stopping /mnt/lustre-mds1 (opts:) on oleg132-server 11:34:55 (1773675295) shut down facet: mds1 facet_host: oleg132-server facet_failover_host: oleg132-server Failover mds1 to oleg132-server mount facets: mds1 Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0000 11:35:48 (1773675348) targets are mounted 11:35:48 (1773675348) facet_failover done oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 73b (104s) == replay-single test 74: Ensure applications don't fail waiting for OST recovery ========================================================== 11:36:13 (1773675373) Stopping clients: oleg132-client.virtnet /mnt/lustre (opts:) Stopping client oleg132-client.virtnet /mnt/lustre opts: Stopping /mnt/lustre-ost1 (opts:) on oleg132-server Failing mds1 on oleg132-server Stopping /mnt/lustre-mds1 (opts:) on oleg132-server 11:36:32 (1773675392) shut down facet: mds1 facet_host: oleg132-server facet_failover_host: oleg132-server Failover mds1 to oleg132-server mount facets: mds1 Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0000 11:37:12 (1773675432) targets are mounted 11:37:12 (1773675432) facet_failover done Starting client oleg132-client.virtnet: -o user_xattr,flock 192.168.201.132@tcp:/lustre /mnt/lustre Mount client oleg132-client.virtnet: mount -t lustre -o user_xattr,flock 192.168.201.132@tcp:/lustre /mnt/lustre Started clients oleg132-client.virtnet: 192.168.201.132@tcp:/lustre on /mnt/lustre type lustre (rw,checksum,encrypt,flock,lazystatfs,lruresize,nolock,statfs_project,nouser_fid2path,user_xattr,verbose) Start ost1: mount -t lustre -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-OST0000 PASS 74 (93s) == replay-single test 80a: DNE: create remote dir, drop update rep from MDT0, fail MDT0 ========================================================== 11:37:46 (1773675466) fail_loc=0x1701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2332 1268972 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2012 1269292 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1600 3605420 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1576 3605444 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3176 7210864 1% /mnt/lustre Failing mds1 on oleg132-server Stopping /mnt/lustre-mds1 (opts:) on oleg132-server 11:38:09 (1773675489) shut down facet: mds1 facet_host: oleg132-server facet_failover_host: oleg132-server Failover mds1 to oleg132-server mount facets: mds1 Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0000 11:38:59 (1773675539) targets are mounted 11:38:59 (1773675539) facet_failover done oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec total: 20 open/close in 0.39 seconds: 51.95 ops/second PASS 80a (93s) == replay-single test 80b: DNE: create remote dir, drop update rep from MDT0, fail MDT1 ========================================================== 11:39:19 (1773675559) fail_loc=0x1701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2372 1268932 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2012 1269292 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1600 3605420 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1576 3605444 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3176 7210864 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2372 1268932 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2012 1269292 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1600 3605420 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1576 3605444 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3176 7210864 1% /mnt/lustre Failing mds2 on oleg132-server Stopping /mnt/lustre-mds2 (opts:) on oleg132-server 11:39:55 (1773675595) shut down facet: mds2 facet_host: oleg132-server facet_failover_host: oleg132-server Failover mds2 to oleg132-server mount facets: mds2 Start mds2: mount -t lustre -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0001 11:40:34 (1773675634) targets are mounted 11:40:34 (1773675634) facet_failover done oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec total: 20 open/close in 0.58 seconds: 34.67 ops/second PASS 80b (97s) == replay-single test 80c: DNE: create remote dir, drop update rep from MDT1, fail MDT[0,1] ========================================================== 11:40:56 (1773675656) fail_loc=0x1701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2408 1268896 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2048 1269256 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1600 3605420 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1576 3605444 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3176 7210864 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2408 1268896 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2048 1269256 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1600 3605420 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1576 3605444 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3176 7210864 1% /mnt/lustre Failing mds1 on oleg132-server Stopping /mnt/lustre-mds1 (opts:) on oleg132-server 11:41:29 (1773675689) shut down facet: mds1 facet_host: oleg132-server facet_failover_host: oleg132-server Failover mds1 to oleg132-server mount facets: mds1 Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0000 11:42:17 (1773675737) targets are mounted 11:42:17 (1773675737) facet_failover done oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Failing mds2 on oleg132-server Stopping /mnt/lustre-mds2 (opts:) on oleg132-server 11:42:36 (1773675756) shut down facet: mds2 facet_host: oleg132-server facet_failover_host: oleg132-server Failover mds2 to oleg132-server mount facets: mds2 Start mds2: mount -t lustre -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0001 11:43:11 (1773675791) targets are mounted 11:43:11 (1773675791) facet_failover done oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec total: 20 open/close in 0.61 seconds: 32.57 ops/second PASS 80c (158s) == replay-single test 80d: DNE: create remote dir, drop update rep from MDT1, fail 2 MDTs ========================================================== 11:43:34 (1773675814) fail_loc=0x1701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2444 1268860 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2012 1269292 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1600 3605420 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1576 3605444 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3176 7210864 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2444 1268860 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2012 1269292 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1600 3605420 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1576 3605444 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3176 7210864 1% /mnt/lustre Failing mds1 on oleg132-server Stopping /mnt/lustre-mds1 (opts:) on oleg132-server Failing mds2 on oleg132-server Stopping /mnt/lustre-mds2 (opts:) on oleg132-server 11:44:20 (1773675860) shut down facet: mds1 facet_host: oleg132-server facet_failover_host: oleg132-server facet: mds2 facet_host: oleg132-server facet_failover_host: oleg132-server Failover mds1 to oleg132-server mount facets: mds1 Failover mds2 to oleg132-server mount facets: mds2 Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 Start mds2: mount -t lustre -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0000 Started lustre-MDT0001 11:45:06 (1773675906) targets are mounted 11:45:06 (1773675906) facet_failover done oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid,mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec total: 20 open/close in 0.39 seconds: 51.72 ops/second PASS 80d (126s) == replay-single test 80e: DNE: create remote dir, drop MDT1 rep, fail MDT0 ========================================================== 11:45:40 (1773675940) fail_loc=0x119 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2408 1268896 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2048 1269256 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1600 3605420 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1576 3605444 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3176 7210864 1% /mnt/lustre Failing mds1 on oleg132-server Stopping /mnt/lustre-mds1 (opts:) on oleg132-server 11:46:00 (1773675960) shut down facet: mds1 facet_host: oleg132-server facet_failover_host: oleg132-server Failover mds1 to oleg132-server mount facets: mds1 Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0000 11:46:38 (1773675998) targets are mounted 11:46:38 (1773675998) facet_failover done oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec total: 20 open/close in 0.46 seconds: 43.08 ops/second PASS 80e (77s) == replay-single test 80f: DNE: create remote dir, drop MDT1 rep, fail MDT1 ========================================================== 11:46:57 (1773676017) fail_loc=0x119 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2408 1268896 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2012 1269292 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1600 3605420 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1576 3605444 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3176 7210864 1% /mnt/lustre Failing mds2 on oleg132-server Stopping /mnt/lustre-mds2 (opts:) on oleg132-server 11:47:15 (1773676035) shut down facet: mds2 facet_host: oleg132-server facet_failover_host: oleg132-server Failover mds2 to oleg132-server mount facets: mds2 Start mds2: mount -t lustre -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0001 11:47:49 (1773676069) targets are mounted 11:47:49 (1773676069) facet_failover done oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec total: 20 open/close in 0.55 seconds: 36.23 ops/second PASS 80f (72s) == replay-single test 80g: DNE: create remote dir, drop MDT1 rep, fail MDT0, then MDT1 ========================================================== 11:48:09 (1773676089) fail_loc=0x119 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2408 1268896 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2012 1269292 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1600 3605420 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1576 3605444 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3176 7210864 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2408 1268896 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2012 1269292 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1600 3605420 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1576 3605444 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3176 7210864 1% /mnt/lustre Failing mds1 on oleg132-server Stopping /mnt/lustre-mds1 (opts:) on oleg132-server 11:48:41 (1773676121) shut down facet: mds1 facet_host: oleg132-server facet_failover_host: oleg132-server Failover mds1 to oleg132-server mount facets: mds1 Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0000 11:49:18 (1773676158) targets are mounted 11:49:18 (1773676158) facet_failover done oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Failing mds2 on oleg132-server Stopping /mnt/lustre-mds2 (opts:) on oleg132-server 11:49:33 (1773676173) shut down facet: mds2 facet_host: oleg132-server facet_failover_host: oleg132-server Failover mds2 to oleg132-server mount facets: mds2 Start mds2: mount -t lustre -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0001 11:50:05 (1773676205) targets are mounted 11:50:05 (1773676205) facet_failover done oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec total: 20 open/close in 0.58 seconds: 34.47 ops/second PASS 80g (136s) == replay-single test 80h: DNE: create remote dir, drop MDT1 rep, fail 2 MDTs ========================================================== 11:50:26 (1773676226) fail_loc=0x119 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2444 1268860 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2048 1269256 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1600 3605420 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1576 3605444 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3176 7210864 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2408 1268896 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2012 1269292 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1600 3605420 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1576 3605444 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3176 7210864 1% /mnt/lustre Failing mds1 on oleg132-server Stopping /mnt/lustre-mds1 (opts:) on oleg132-server Failing mds2 on oleg132-server Stopping /mnt/lustre-mds2 (opts:) on oleg132-server 11:51:01 (1773676261) shut down facet: mds1 facet_host: oleg132-server facet_failover_host: oleg132-server facet: mds2 facet_host: oleg132-server facet_failover_host: oleg132-server Failover mds1 to oleg132-server mount facets: mds1 Failover mds2 to oleg132-server mount facets: mds2 Start mds2: mount -t lustre -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0001 Started lustre-MDT0000 11:51:53 (1773676313) targets are mounted 11:51:53 (1773676313) facet_failover done oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid,mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec total: 20 open/close in 0.36 seconds: 55.77 ops/second PASS 80h (108s) == replay-single test 81a: DNE: unlink remote dir, drop MDT0 update rep, fail MDT1 ========================================================== 11:52:13 (1773676333) fail_loc=0x1701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2476 1268828 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2048 1269256 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1600 3605420 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1576 3605444 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3176 7210864 1% /mnt/lustre Failing mds2 on oleg132-server Stopping /mnt/lustre-mds2 (opts:) on oleg132-server 11:52:37 (1773676357) shut down facet: mds2 facet_host: oleg132-server facet_failover_host: oleg132-server Failover mds2 to oleg132-server mount facets: mds2 Start mds2: mount -t lustre -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0001 11:53:08 (1773676388) targets are mounted 11:53:08 (1773676388) facet_failover done oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 81a (73s) == replay-single test 81b: DNE: unlink remote dir, drop MDT0 update reply, fail MDT0 ========================================================== 11:53:26 (1773676406) fail_loc=0x1701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2476 1268828 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2052 1269252 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1600 3605420 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1576 3605444 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3176 7210864 1% /mnt/lustre Failing mds1 on oleg132-server Stopping /mnt/lustre-mds1 (opts:) on oleg132-server 11:53:43 (1773676423) shut down facet: mds1 facet_host: oleg132-server facet_failover_host: oleg132-server Failover mds1 to oleg132-server mount facets: mds1 Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0000 11:54:18 (1773676458) targets are mounted 11:54:18 (1773676458) facet_failover done oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 81b (69s) == replay-single test 81c: DNE: unlink remote dir, drop MDT0 update reply, fail MDT0,MDT1 ========================================================== 11:54:35 (1773676475) fail_loc=0x1701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2440 1268864 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2052 1269252 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1600 3605420 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1576 3605444 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3176 7210864 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2440 1268864 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2052 1269252 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1600 3605420 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1576 3605444 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3176 7210864 1% /mnt/lustre Failing mds1 on oleg132-server Stopping /mnt/lustre-mds1 (opts:) on oleg132-server 11:55:03 (1773676503) shut down facet: mds1 facet_host: oleg132-server facet_failover_host: oleg132-server Failover mds1 to oleg132-server mount facets: mds1 Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0000 11:55:35 (1773676535) targets are mounted 11:55:35 (1773676535) facet_failover done oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Failing mds2 on oleg132-server Stopping /mnt/lustre-mds2 (opts:) on oleg132-server 11:55:49 (1773676549) shut down facet: mds2 facet_host: oleg132-server facet_failover_host: oleg132-server Failover mds2 to oleg132-server mount facets: mds2 Start mds2: mount -t lustre -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0001 11:56:17 (1773676577) targets are mounted 11:56:17 (1773676577) facet_failover done oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 81c (116s) == replay-single test 81d: DNE: unlink remote dir, drop MDT0 update reply, fail 2 MDTs ========================================================== 11:56:31 (1773676591) fail_loc=0x1701 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2476 1268828 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2056 1269248 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1600 3605420 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1576 3605444 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3176 7210864 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2476 1268828 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2056 1269248 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1600 3605420 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1576 3605444 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3176 7210864 1% /mnt/lustre Failing mds1 on oleg132-server Stopping /mnt/lustre-mds1 (opts:) on oleg132-server Failing mds2 on oleg132-server Stopping /mnt/lustre-mds2 (opts:) on oleg132-server 11:56:59 (1773676619) shut down facet: mds1 facet_host: oleg132-server facet_failover_host: oleg132-server facet: mds2 facet_host: oleg132-server facet_failover_host: oleg132-server Failover mds1 to oleg132-server mount facets: mds1 Failover mds2 to oleg132-server mount facets: mds2 Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 Start mds2: mount -t lustre -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0000 Started lustre-MDT0001 11:57:50 (1773676670) targets are mounted 11:57:50 (1773676670) facet_failover done oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid,mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 81d (104s) == replay-single test 81e: DNE: unlink remote dir, drop MDT1 req reply, fail MDT0 ========================================================== 11:58:15 (1773676695) fail_loc=0x119 fail_loc=0 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2476 1268828 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2092 1269212 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1600 3605420 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1576 3605444 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3176 7210864 1% /mnt/lustre Failing mds1 on oleg132-server Stopping /mnt/lustre-mds1 (opts:) on oleg132-server 11:58:32 (1773676712) shut down facet: mds1 facet_host: oleg132-server facet_failover_host: oleg132-server Failover mds1 to oleg132-server mount facets: mds1 Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0000 11:59:05 (1773676745) targets are mounted 11:59:05 (1773676745) facet_failover done oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 81e (67s) == replay-single test 81f: DNE: unlink remote dir, drop MDT1 req reply, fail MDT1 ========================================================== 11:59:23 (1773676763) fail_loc=0x119 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2476 1268828 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2056 1269248 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1600 3605420 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1576 3605444 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3176 7210864 1% /mnt/lustre Failing mds2 on oleg132-server Stopping /mnt/lustre-mds2 (opts:) on oleg132-server 11:59:38 (1773676778) shut down facet: mds2 facet_host: oleg132-server facet_failover_host: oleg132-server Failover mds2 to oleg132-server mount facets: mds2 Start mds2: mount -t lustre -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0001 12:00:06 (1773676806) targets are mounted 12:00:06 (1773676806) facet_failover done oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 81f (58s) == replay-single test 81g: DNE: unlink remote dir, drop req reply, fail M0, then M1 ========================================================== 12:00:20 (1773676820) fail_loc=0x119 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2476 1268828 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2060 1269244 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1600 3605420 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1576 3605444 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3176 7210864 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2476 1268828 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2060 1269244 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1600 3605420 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1576 3605444 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3176 7210864 1% /mnt/lustre Failing mds1 on oleg132-server Stopping /mnt/lustre-mds1 (opts:) on oleg132-server 12:00:43 (1773676843) shut down facet: mds1 facet_host: oleg132-server facet_failover_host: oleg132-server Failover mds1 to oleg132-server mount facets: mds1 Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0000 12:01:19 (1773676879) targets are mounted 12:01:19 (1773676879) facet_failover done oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec Failing mds2 on oleg132-server Stopping /mnt/lustre-mds2 (opts:) on oleg132-server 12:01:32 (1773676892) shut down facet: mds2 facet_host: oleg132-server facet_failover_host: oleg132-server Failover mds2 to oleg132-server mount facets: mds2 Start mds2: mount -t lustre -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0001 12:01:58 (1773676918) targets are mounted 12:01:58 (1773676918) facet_failover done oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 81g (111s) == replay-single test 81h: DNE: unlink remote dir, drop request reply, fail 2 MDTs ========================================================== 12:02:11 (1773676931) fail_loc=0x119 UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2476 1268828 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2064 1269240 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1600 3605420 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1576 3605444 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3176 7210864 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2476 1268828 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2060 1269244 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1600 3605420 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1576 3605444 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3176 7210864 1% /mnt/lustre Failing mds1 on oleg132-server Stopping /mnt/lustre-mds1 (opts:) on oleg132-server Failing mds2 on oleg132-server Stopping /mnt/lustre-mds2 (opts:) on oleg132-server 12:02:35 (1773676955) shut down facet: mds1 facet_host: oleg132-server facet_failover_host: oleg132-server facet: mds2 facet_host: oleg132-server facet_failover_host: oleg132-server Failover mds1 to oleg132-server mount facets: mds1 Failover mds2 to oleg132-server mount facets: mds2 Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 Start mds2: mount -t lustre -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0001 Started lustre-MDT0000 12:03:03 (1773676983) targets are mounted 12:03:03 (1773676983) facet_failover done oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid,mdc.lustre-MDT0001-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec mdc.lustre-MDT0001-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 81h (83s) == replay-single test 84a: stale open during export disconnect ========================================================== 12:03:34 (1773677014) fail_loc=0x80000144 total: 1 open/close in 0.03 seconds: 34.18 ops/second pdsh@oleg132-client: oleg132-client: ssh exited with exit code 5 PASS 84a (11s) == replay-single test 85a: check the cancellation of unused locks during recovery(IBITS) ========================================================== 12:03:45 (1773677025) before recovery: unused locks count = 202 Failing mds1 on oleg132-server Stopping /mnt/lustre-mds1 (opts:) on oleg132-server 12:03:55 (1773677035) shut down facet: mds1 facet_host: oleg132-server facet_failover_host: oleg132-server Failover mds1 to oleg132-server mount facets: mds1 Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0000 12:04:16 (1773677056) targets are mounted 12:04:16 (1773677056) facet_failover done oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec after recovery: unused locks count = 101 PASS 85a (44s) == replay-single test 85b: check the cancellation of unused locks during recovery(EXTENT) ========================================================== 12:04:29 (1773677069) before recovery: unused locks count = 100 Failing ost1 on oleg132-server Stopping /mnt/lustre-ost1 (opts:) on oleg132-server 12:04:48 (1773677088) shut down facet: ost1 facet_host: oleg132-server facet_failover_host: oleg132-server Failover ost1 to oleg132-server mount facets: ost1 Start ost1: mount -t lustre -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-OST0000 12:05:09 (1773677109) targets are mounted 12:05:09 (1773677109) facet_failover done oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec after recovery: unused locks count = 0 PASS 85b (53s) == replay-single test 86: umount server after clear nid_stats should not hit LBUG ========================================================== 12:05:22 (1773677122) Stopping clients: oleg132-client.virtnet /mnt/lustre (opts:) Stopping client oleg132-client.virtnet /mnt/lustre opts: mdt.lustre-MDT0000.exports.clear=0 mdt.lustre-MDT0001.exports.clear=0 Stopping /mnt/lustre-mds1 (opts:) on oleg132-server Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0000 Starting client oleg132-client.virtnet: -o user_xattr,flock 192.168.201.132@tcp:/lustre /mnt/lustre Mount client oleg132-client.virtnet: mount -t lustre -o user_xattr,flock 192.168.201.132@tcp:/lustre /mnt/lustre Started clients oleg132-client.virtnet: 192.168.201.132@tcp:/lustre on /mnt/lustre type lustre (rw,checksum,encrypt,flock,lazystatfs,lruresize,nolock,statfs_project,nouser_fid2path,user_xattr,verbose) PASS 86 (26s) == replay-single test 87a: write replay ================== 12:05:48 (1773677148) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2472 1268832 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2060 1269244 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 1800 3605220 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1776 3605244 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 3576 7210464 1% /mnt/lustre 8+0 records in 8+0 records out 8388608 bytes (8.4 MB, 8.0 MiB) copied, 0.348314 s, 24.1 MB/s Failing ost1 on oleg132-server Stopping /mnt/lustre-ost1 (opts:) on oleg132-server 12:05:59 (1773677159) shut down facet: ost1 facet_host: oleg132-server facet_failover_host: oleg132-server Failover ost1 to oleg132-server mount facets: ost1 Start ost1: mount -t lustre -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-OST0000 12:06:23 (1773677183) targets are mounted 12:06:23 (1773677183) facet_failover done oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec 8+0 records in 8+0 records out 8388608 bytes (8.4 MB, 8.0 MiB) copied, 0.177205 s, 47.3 MB/s PASS 87a (46s) == replay-single test 87b: write replay with changed data (checksum resend) ========================================================== 12:06:34 (1773677194) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2472 1268832 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2060 1269244 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 9992 3597028 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1776 3605244 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 11768 7202272 1% /mnt/lustre 8+0 records in 8+0 records out 8388608 bytes (8.4 MB, 8.0 MiB) copied, 0.242597 s, 34.6 MB/s 8+0 records in 8+0 records out 8 bytes copied, 0.00504868 s, 1.6 kB/s Failing ost1 on oleg132-server Stopping /mnt/lustre-ost1 (opts:) on oleg132-server 12:06:45 (1773677205) shut down facet: ost1 facet_host: oleg132-server facet_failover_host: oleg132-server Failover ost1 to oleg132-server mount facets: ost1 Start ost1: mount -t lustre -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-OST0000 12:07:06 (1773677226) targets are mounted 12:07:06 (1773677226) facet_failover done oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec 0+1 records in 0+1 records out 72 bytes copied, 0.0110776 s, 6.5 kB/s PASS 87b (42s) == replay-single test 88: MDS should not assign same objid to different files ========================================================== 12:07:17 (1773677237) UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2476 1268828 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2060 1269244 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 9996 3597024 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1776 3605244 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 11772 7202268 1% /mnt/lustre UUID 1K-blocks Used Available Use% Mounted on lustre-MDT0000_UUID 1414116 2476 1268828 1% /mnt/lustre[MDT:0] lustre-MDT0001_UUID 1414116 2060 1269244 1% /mnt/lustre[MDT:1] lustre-OST0000_UUID 3833116 9996 3597024 1% /mnt/lustre[OST:0] lustre-OST0001_UUID 3833116 1776 3605244 1% /mnt/lustre[OST:1] filesystem_summary: 7666232 11772 7202268 1% /mnt/lustre before test: last_id = 6913, next_id = 6884 Creating to objid 6913 on ost lustre-OST0000... total: 31 open/close in 0.37 seconds: 82.91 ops/second total: 8 open/close in 0.11 seconds: 73.88 ops/second before recovery: last_id = 6945, next_id = 6922 Stopping /mnt/lustre-mds1 (opts:) on oleg132-server Stopping /mnt/lustre-ost1 (opts:) on oleg132-server Failover mds1 to oleg132-server oleg132-server.virtnet Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0000 Failover ost1 to oleg132-server oleg132-server.virtnet Start ost1: mount -t lustre -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-OST0000 after recovery: last_id = 6953, next_id = 6922 128+0 records in 128+0 records out 524288 bytes (524 kB, 512 KiB) copied, 0.10177 s, 5.2 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB, 512 KiB) copied, 0.0709122 s, 7.4 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB, 512 KiB) copied, 0.0840589 s, 6.2 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB, 512 KiB) copied, 0.0911756 s, 5.8 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB, 512 KiB) copied, 0.0837173 s, 6.3 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB, 512 KiB) copied, 0.0644884 s, 8.1 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB, 512 KiB) copied, 0.095128 s, 5.5 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB, 512 KiB) copied, 0.0631815 s, 8.3 MB/s -rw-r--r-- 1 root root 0 Mar 16 12:07 /mnt/lustre/d88.replay-single/f-6884 -rw-r--r-- 1 root root 0 Mar 16 12:07 /mnt/lustre/d88.replay-single/f-6885 -rw-r--r-- 1 root root 0 Mar 16 12:07 /mnt/lustre/d88.replay-single/f-6886 -rw-r--r-- 1 root root 0 Mar 16 12:07 /mnt/lustre/d88.replay-single/f-6887 -rw-r--r-- 1 root root 0 Mar 16 12:07 /mnt/lustre/d88.replay-single/f-6888 -rw-r--r-- 1 root root 0 Mar 16 12:07 /mnt/lustre/d88.replay-single/f-6889 -rw-r--r-- 1 root root 0 Mar 16 12:07 /mnt/lustre/d88.replay-single/f-6890 -rw-r--r-- 1 root root 0 Mar 16 12:07 /mnt/lustre/d88.replay-single/f-6891 -rw-r--r-- 1 root root 0 Mar 16 12:07 /mnt/lustre/d88.replay-single/f-6892 -rw-r--r-- 1 root root 0 Mar 16 12:07 /mnt/lustre/d88.replay-single/f-6893 -rw-r--r-- 1 root root 0 Mar 16 12:07 /mnt/lustre/d88.replay-single/f-6894 -rw-r--r-- 1 root root 0 Mar 16 12:07 /mnt/lustre/d88.replay-single/f-6895 -rw-r--r-- 1 root root 0 Mar 16 12:07 /mnt/lustre/d88.replay-single/f-6896 -rw-r--r-- 1 root root 0 Mar 16 12:07 /mnt/lustre/d88.replay-single/f-6897 -rw-r--r-- 1 root root 0 Mar 16 12:07 /mnt/lustre/d88.replay-single/f-6898 -rw-r--r-- 1 root root 0 Mar 16 12:07 /mnt/lustre/d88.replay-single/f-6899 -rw-r--r-- 1 root root 0 Mar 16 12:07 /mnt/lustre/d88.replay-single/f-6900 -rw-r--r-- 1 root root 0 Mar 16 12:07 /mnt/lustre/d88.replay-single/f-6901 -rw-r--r-- 1 root root 0 Mar 16 12:07 /mnt/lustre/d88.replay-single/f-6902 -rw-r--r-- 1 root root 0 Mar 16 12:07 /mnt/lustre/d88.replay-single/f-6903 -rw-r--r-- 1 root root 0 Mar 16 12:07 /mnt/lustre/d88.replay-single/f-6904 -rw-r--r-- 1 root root 0 Mar 16 12:07 /mnt/lustre/d88.replay-single/f-6905 -rw-r--r-- 1 root root 0 Mar 16 12:07 /mnt/lustre/d88.replay-single/f-6906 -rw-r--r-- 1 root root 0 Mar 16 12:07 /mnt/lustre/d88.replay-single/f-6907 -rw-r--r-- 1 root root 0 Mar 16 12:07 /mnt/lustre/d88.replay-single/f-6908 -rw-r--r-- 1 root root 0 Mar 16 12:07 /mnt/lustre/d88.replay-single/f-6909 -rw-r--r-- 1 root root 0 Mar 16 12:07 /mnt/lustre/d88.replay-single/f-6910 -rw-r--r-- 1 root root 0 Mar 16 12:07 /mnt/lustre/d88.replay-single/f-6911 -rw-r--r-- 1 root root 0 Mar 16 12:07 /mnt/lustre/d88.replay-single/f-6912 -rw-r--r-- 1 root root 0 Mar 16 12:07 /mnt/lustre/d88.replay-single/f-6913 -rw-r--r-- 1 root root 0 Mar 16 12:07 /mnt/lustre/d88.replay-single/f-6914 -rw-r--r-- 1 root root 0 Mar 16 12:07 /mnt/lustre/d88.replay-single/f-6915 -rw-r--r-- 1 root root 0 Mar 16 12:07 /mnt/lustre/d88.replay-single/f-6916 -rw-r--r-- 1 root root 0 Mar 16 12:07 /mnt/lustre/d88.replay-single/f-6917 -rw-r--r-- 1 root root 0 Mar 16 12:07 /mnt/lustre/d88.replay-single/f-6918 -rw-r--r-- 1 root root 0 Mar 16 12:07 /mnt/lustre/d88.replay-single/f-6919 -rw-r--r-- 1 root root 0 Mar 16 12:07 /mnt/lustre/d88.replay-single/f-6920 -rw-r--r-- 1 root root 0 Mar 16 12:07 /mnt/lustre/d88.replay-single/f-6921 -rw-r--r-- 1 root root 524288 Mar 16 12:08 /mnt/lustre/d88.replay-single/f-6925 -rw-r--r-- 1 root root 524288 Mar 16 12:08 /mnt/lustre/d88.replay-single/f-6926 -rw-r--r-- 1 root root 524288 Mar 16 12:08 /mnt/lustre/d88.replay-single/f-6927 -rw-r--r-- 1 root root 524288 Mar 16 12:08 /mnt/lustre/d88.replay-single/f-6928 -rw-r--r-- 1 root root 524288 Mar 16 12:08 /mnt/lustre/d88.replay-single/f-6929 -rw-r--r-- 1 root root 524288 Mar 16 12:08 /mnt/lustre/d88.replay-single/f-6930 -rw-r--r-- 1 root root 524288 Mar 16 12:08 /mnt/lustre/d88.replay-single/f-6931 -rw-r--r-- 1 root root 524288 Mar 16 12:08 /mnt/lustre/d88.replay-single/f-6932 128+0 records in 128+0 records out 524288 bytes (524 kB, 512 KiB) copied, 0.0720607 s, 7.3 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB, 512 KiB) copied, 0.067124 s, 7.8 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB, 512 KiB) copied, 0.0763597 s, 6.9 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB, 512 KiB) copied, 0.0640032 s, 8.2 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB, 512 KiB) copied, 0.0612627 s, 8.6 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB, 512 KiB) copied, 0.0720385 s, 7.3 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB, 512 KiB) copied, 0.0761369 s, 6.9 MB/s 128+0 records in 128+0 records out 524288 bytes (524 kB, 512 KiB) copied, 0.084777 s, 6.2 MB/s PASS 88 (78s) == replay-single test 89: no disk space leak on late ost connection ========================================================== 12:08:34 (1773677314) Waiting for orphan cleanup... osp.lustre-OST0000-osc-MDT0000.old_sync_processed osp.lustre-OST0000-osc-MDT0001.old_sync_processed osp.lustre-OST0001-osc-MDT0000.old_sync_processed osp.lustre-OST0001-osc-MDT0001.old_sync_processed wait 40 secs maximumly for oleg132-server mds-ost sync done. Waiting for MDT destroys to complete 10+0 records in 10+0 records out 10485760 bytes (10 MB, 10 MiB) copied, 0.187762 s, 55.8 MB/s -rw-r--r-- 1 root root 10485760 Mar 16 12:08 /mnt/lustre/d89.replay-single/f89.replay-single Stopping /mnt/lustre-ost1 (opts:) on oleg132-server Failing mds1 on oleg132-server Stopping /mnt/lustre-mds1 (opts:) on oleg132-server 12:08:47 (1773677327) shut down facet: mds1 facet_host: oleg132-server facet_failover_host: oleg132-server Failover mds1 to oleg132-server mount facets: mds1 Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0000 12:09:05 (1773677345) targets are mounted 12:09:05 (1773677345) facet_failover done Start ost1: mount -t lustre -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-OST0000 Starting client: oleg132-client.virtnet: -o user_xattr,flock 192.168.201.132@tcp:/lustre /mnt/lustre osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 67 sec Waiting for orphan cleanup... osp.lustre-OST0000-osc-MDT0000.old_sync_processed osp.lustre-OST0000-osc-MDT0001.old_sync_processed osp.lustre-OST0001-osc-MDT0000.old_sync_processed osp.lustre-OST0001-osc-MDT0001.old_sync_processed wait 40 secs maximumly for oleg132-server mds-ost sync done. Waiting for MDT destroys to complete free_before: 7646268 free_after: 7646268 PASS 89 (124s) == replay-single test 90: lfs find identifies the missing striped file segments ========================================================== 12:10:38 (1773677438) Create the files Fail ost1 lustre-OST0000_UUID, display the list of affected files Stopping /mnt/lustre-ost1 (opts:) on oleg132-server General Query: lfs find /mnt/lustre/d90.replay-single /mnt/lustre/d90.replay-single /mnt/lustre/d90.replay-single/f0 /mnt/lustre/d90.replay-single/f1 /mnt/lustre/d90.replay-single/all Querying files on shutdown ost1: lfs find --obd lustre-OST0000_UUID /mnt/lustre/d90.replay-single/f0 /mnt/lustre/d90.replay-single/all Check getstripe: /home/green/git/lustre-release/lustre/utils/lfs getstripe -r --obd lustre-OST0000_UUID /mnt/lustre/d90.replay-single/f0 lmm_stripe_count: 1 lmm_stripe_size: 4194304 lmm_pattern: raid0 lmm_layout_gen: 0 lmm_stripe_offset: 0 obdidx objid objid group 0 6955 0x1b2b 0x280000401 * /mnt/lustre/d90.replay-single/all lmm_stripe_count: 2 lmm_stripe_size: 4194304 lmm_pattern: raid0 lmm_layout_gen: 0 lmm_stripe_offset: 1 obdidx objid objid group 0 6954 0x1b2a 0x280000401 * /mnt/lustre/d90.replay-single/all /mnt/lustre/d90.replay-single/f0 Failover ost1 to oleg132-server oleg132-server.virtnet Start ost1: mount -t lustre -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-OST0000 PASS 90 (29s) == replay-single test 93a: replay + reconnect ============ 12:11:07 (1773677467) 1+0 records in 1+0 records out 1024 bytes (1.0 kB, 1.0 KiB) copied, 0.00751142 s, 136 kB/s fail_val=40 fail_loc=0x715 Failing ost1 on oleg132-server Stopping /mnt/lustre-ost1 (opts:) on oleg132-server 12:11:11 (1773677471) shut down facet: ost1 facet_host: oleg132-server facet_failover_host: oleg132-server Failover ost1 to oleg132-server mount facets: ost1 Start ost1: mount -t lustre -o localrecov /dev/mapper/ost1_flakey /mnt/lustre-ost1 seq.cli-lustre-OST0000-super.width=65536 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-OST0000 12:11:30 (1773677490) targets are mounted 12:11:30 (1773677490) facet_failover done oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec PASS 93a (68s) == replay-single test 93b: replay + reconnect on mds ===== 12:12:15 (1773677535) total: 20 open/close in 0.35 seconds: 56.57 ops/second fail_val=80 fail_loc=0x715 Failing mds1 on oleg132-server Stopping /mnt/lustre-mds1 (opts:) on oleg132-server 12:12:21 (1773677541) shut down facet: mds1 facet_host: oleg132-server facet_failover_host: oleg132-server Failover mds1 to oleg132-server mount facets: mds1 Start mds1: mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 oleg132-server: oleg132-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all pdsh@oleg132-client: oleg132-server: ssh exited with exit code 1 Started lustre-MDT0000 12:12:38 (1773677558) targets are mounted 12:12:38 (1773677558) facet_failover done oleg132-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec PASS 93b (114s) == replay-single test complete, duration 9334 sec ======== 12:14:09 (1773677649) === replay-single: start cleanup 12:14:10 (1773677650) === === replay-single: finish cleanup 12:14:17 (1773677657) ===