== sanity-lnet test 205: Check health and resends for multi-rail local failures ========================================================== 04:10:36 (1743495036)
Cleaning up LNet
/home/green/git/lustre-release/lustre/scripts/lustre_rmmod: line 114: echo: write error: Device or resource busy
LNET unconfigure error 22: Invalid argument
/home/green/git/lustre-release/lustre/scripts/lustre_rmmod: line 156: echo: write error: Device or resource busy
unloading modules via unload_modules_local on: 'oleg632-server'
oleg632-server: oleg632-server.virtnet: executing unload_modules_local
oleg632-server: /home/green/git/lustre-release/lustre/scripts/lustre_rmmod: line 114: echo: write error: Device or resource busy
oleg632-server: LNET unconfigure error 22: Invalid argument
oleg632-server: /home/green/git/lustre-release/lustre/scripts/lustre_rmmod: line 156: echo: write error: Device or resource busy
modules unloaded.
/home/green/git/lustre-release/lustre/tests/test-framework.sh: line 1001: echo: write error: Device or resource busy
/home/green/git/lustre-release/lustre/tests/test-framework.sh: line 1002: echo: write error: Device or resource busy
Loading modules from /home/green/git/lustre-release/lustre
detected 4 online CPUs by sysfs
MODOPTS_LIBCFS=
Force libcfs to create 2 CPU partitions
../libcfs/libcfs/libcfs options: 'cpu_npartitions=2'
/home/green/git/lustre-release/lustre/../lnet/utils/lnetctl lnet configure -a
oleg632-server: Writer error: failed to resolve Netlink family id
oleg632-server: opening /dev/lnet failed: No such file or directory
oleg632-server: hint: the kernel modules may not be loaded
oleg632-server: IOC_LIBCFS_GET_NI error 2: No such file or directory
pdsh@oleg632-client: oleg632-server: ssh exited with exit code 1
oleg632-server: oleg632-server.virtnet: executing load_lnet
oleg632-server: Loading modules from /home/green/git/lustre-release/lustre
oleg632-server: /home/green/git/lustre-release/lustre/tests/test-framework.sh: line 1001: echo: write error: Device or resource busy
oleg632-server: /home/green/git/lustre-release/lustre/tests/test-framework.sh: line 1002: echo: write error: Device or resource busy
oleg632-server: detected 4 online CPUs by sysfs
oleg632-server: MODOPTS_LIBCFS=
oleg632-server: Force libcfs to create 2 CPU partitions
/home/green/git/lustre-release/lustre/../lnet/utils/lnetctl discover 192.168.206.132@tcp
discover:
    - primary nid: 192.168.206.132@tcp
      Multi-Rail: true
      peer_ni:
        - nid: 192.168.206.132@tcp
oleg632-server: oleg632-server.virtnet: executing lnet_if_list
oleg632-server: oleg632-server.virtnet: executing /home/green/git/lustre-release/lustre/../lnet/utils/lnetctl net add --net tcp1 --if ens2
/home/green/git/lustre-release/lustre/../lnet/utils/lnetctl lnet configure
/home/green/git/lustre-release/lustre/../lnet/utils/lnetctl net add --net tcp1 --if ens2
default via 192.168.206.254 dev ens2
192.168.206.0/24 dev ens2 proto kernel scope link src 192.168.206.32
default via 192.168.206.254 dev ens2
192.168.206.0/24 dev ens2 proto kernel scope link src 192.168.206.32
net:
    - net type: lo
      local NI(s):
        - nid: 0@lo
          status: up
    - net type: tcp
      local NI(s):
        - nid: 192.168.206.32@tcp
          status: up
          interfaces:
              0: ens2
    - net type: tcp1
      local NI(s):
        - nid: 192.168.206.32@tcp1
          status: up
          interfaces:
              0: ens2
    - primary nid: 192.168.206.132@tcp
        - nid: 192.168.206.132@tcp
          health stats:
              health value: 1000
        - nid: 192.168.206.132@tcp1
          health stats:
              health value: 1000
debug=+net
Simulate local_interrupt
/home/green/git/lustre-release/lustre/../lnet/utils/lnetctl ping 192.168.206.132@tcp
manage:
    - ping:
          errno: -5
          descr: ! 'failed to ping 192.168.206.132@tcp: Input/output error'
/home/green/git/lustre-release/lustre/../lnet/utils/lnetctl peer set --health 1000 --all
/home/green/git/lustre-release/lustre/../lnet/utils/lnetctl net set --health 1000 --all
Check that 2 resends took place
Check that local NI health has been changed
Simulate local_dropped
/home/green/git/lustre-release/lustre/../lnet/utils/lnetctl ping 192.168.206.132@tcp
manage:
    - ping:
          errno: -5
          descr: ! 'failed to ping 192.168.206.132@tcp: Input/output error'
/home/green/git/lustre-release/lustre/../lnet/utils/lnetctl peer set --health 1000 --all
/home/green/git/lustre-release/lustre/../lnet/utils/lnetctl net set --health 1000 --all
Check that 2 resends took place
Check that local NI health has been changed
Simulate local_aborted
/home/green/git/lustre-release/lustre/../lnet/utils/lnetctl ping 192.168.206.132@tcp
manage:
    - ping:
          errno: -5
          descr: ! 'failed to ping 192.168.206.132@tcp: Input/output error'
/home/green/git/lustre-release/lustre/../lnet/utils/lnetctl peer set --health 1000 --all
/home/green/git/lustre-release/lustre/../lnet/utils/lnetctl net set --health 1000 --all
Check that 2 resends took place
Check that local NI health has been changed
Simulate local_no_route
/home/green/git/lustre-release/lustre/../lnet/utils/lnetctl ping 192.168.206.132@tcp
manage:
    - ping:
          errno: -5
          descr: ! 'failed to ping 192.168.206.132@tcp: Input/output error'
/home/green/git/lustre-release/lustre/../lnet/utils/lnetctl peer set --health 1000 --all
/home/green/git/lustre-release/lustre/../lnet/utils/lnetctl net set --health 1000 --all
Check that 2 resends took place
Check that local NI health has been changed
Simulate local_timeout
/home/green/git/lustre-release/lustre/../lnet/utils/lnetctl ping 192.168.206.132@tcp
manage:
    - ping:
          errno: -5
          descr: ! 'failed to ping 192.168.206.132@tcp: Input/output error'
/home/green/git/lustre-release/lustre/../lnet/utils/lnetctl peer set --health 1000 --all
/home/green/git/lustre-release/lustre/../lnet/utils/lnetctl net set --health 1000 --all
Check that 2 resends took place
Check that local NI health has been changed
Simulate local_error
/home/green/git/lustre-release/lustre/../lnet/utils/lnetctl ping 192.168.206.132@tcp
manage:
    - ping:
          errno: -5
          descr: ! 'failed to ping 192.168.206.132@tcp: Input/output error'
/home/green/git/lustre-release/lustre/../lnet/utils/lnetctl peer set --health 1000 --all
/home/green/git/lustre-release/lustre/../lnet/utils/lnetctl net set --health 1000 --all
Check that no resends took place
Check that local NI health has been changed
oleg632-server: oleg632-server.virtnet: executing /home/green/git/lustre-release/lustre/../lnet/utils/lnetctl net del --net tcp1 --if ens2
Writer error: failed to resolve Netlink family id
/home/green/git/lustre-release/lustre/scripts/lustre_rmmod: line 114: echo: write error: Device or resource busy
/home/green/git/lustre-release/lustre/scripts/lustre_rmmod: line 156: echo: write error: Device or resource busy
unloading modules via unload_modules_local on: 'oleg632-server'
oleg632-server: oleg632-server.virtnet: executing unload_modules_local
oleg632-server: /home/green/git/lustre-release/lustre/scripts/lustre_rmmod: line 114: echo: write error: Device or resource busy
oleg632-server: /home/green/git/lustre-release/lustre/scripts/lustre_rmmod: line 156: echo: write error: Device or resource busy
modules unloaded.
oleg632-server: oleg632-server.virtnet: executing unload_modules_local
oleg632-server: /home/green/git/lustre-release/lustre/scripts/lustre_rmmod: line 114: echo: write error: Device or resource busy
oleg632-server: LNET unconfigure error 22: Invalid argument
oleg632-server: /home/green/git/lustre-release/lustre/scripts/lustre_rmmod: line 156: echo: write error: Device or resource busy
pdsh@oleg632-client: oleg632-client: ssh exited with exit code 2
pdsh@oleg632-client: oleg632-server: ssh exited with exit code 2
pdsh@oleg632-client: oleg632-client: ssh exited with exit code 2
pdsh@oleg632-client: oleg632-server: ssh exited with exit code 2