[ 0.000000] Linux version 4.18.0rh8.10-debug (green@maintenance) (gcc version 8.5.0 20210514 (Red Hat 8.5.0-26) (GCC)) #2 SMP Mon Jul 14 01:24:22 EDT 2025 [ 0.000000] Command line: rd.shell root=nbd:192.168.200.253:rocky8.10:ext4:ro:-p,-b4096 ro crashkernel=256M panic=1 nomodeset ipmtu=9000 ip=dhcp rd.neednet=1 init_on_free=off mitigations=off console=ttyS1,115200 audit=0 [ 0.000000] x86/fpu: Supporting XSAVE feature 0x001: 'x87 floating point registers' [ 0.000000] x86/fpu: Supporting XSAVE feature 0x002: 'SSE registers' [ 0.000000] x86/fpu: Supporting XSAVE feature 0x004: 'AVX registers' [ 0.000000] x86/fpu: xstate_offset[2]: 576, xstate_sizes[2]: 256 [ 0.000000] x86/fpu: Enabled xstate features 0x7, context size is 832 bytes, using 'standard' format. [ 0.000000] signal: max sigframe size: 1776 [ 0.000000] BIOS-provided physical RAM map: [ 0.000000] BIOS-e820: [mem 0x0000000000000000-0x000000000009fbff] usable [ 0.000000] BIOS-e820: [mem 0x000000000009fc00-0x000000000009ffff] reserved [ 0.000000] BIOS-e820: [mem 0x00000000000f0000-0x00000000000fffff] reserved [ 0.000000] BIOS-e820: [mem 0x0000000000100000-0x00000000bffcdfff] usable [ 0.000000] BIOS-e820: [mem 0x00000000bffce000-0x00000000bfffffff] reserved [ 0.000000] BIOS-e820: [mem 0x00000000feffc000-0x00000000feffffff] reserved [ 0.000000] BIOS-e820: [mem 0x00000000fffc0000-0x00000000ffffffff] reserved [ 0.000000] BIOS-e820: [mem 0x0000000100000000-0x0000000146dfffff] usable [ 0.000000] NX (Execute Disable) protection: active [ 0.000000] SMBIOS 2.8 present. [ 0.000000] DMI: QEMU Standard PC (i440FX + PIIX, 1996), BIOS 1.17.0-8.fc42 06/10/2025 [ 0.000000] Hypervisor detected: KVM [ 0.000000] kvm-clock: Using msrs 4b564d01 and 4b564d00 [ 0.000000] kvm-clock: using sched offset of 800506112 cycles [ 0.000000] clocksource: kvm-clock: mask: 0xffffffffffffffff max_cycles: 0x1cd42e4dffb, max_idle_ns: 881590591483 ns [ 0.000000] tsc: Detected 2399.996 MHz processor [ 0.000000] last_pfn = 0x146e00 max_arch_pfn = 0x400000000 [ 0.000000] x86/PAT: Configuration [0-7]: WB WC UC- UC WB WP UC- WT [ 0.000000] last_pfn = 0xbffce max_arch_pfn = 0x400000000 [ 0.000000] found SMP MP-table at [mem 0x000f54b0-0x000f54bf] [ 0.000000] RAMDISK: [mem 0xbcc54000-0xbffbffff] [ 0.000000] ACPI: Early table checksum verification disabled [ 0.000000] ACPI: RSDP 0x00000000000F52D0 000014 (v00 BOCHS ) [ 0.000000] ACPI: RSDT 0x00000000BFFE2439 000034 (v01 BOCHS BXPC 00000001 BXPC 00000001) [ 0.000000] ACPI: FACP 0x00000000BFFE22D5 000074 (v01 BOCHS BXPC 00000001 BXPC 00000001) [ 0.000000] ACPI: DSDT 0x00000000BFFE0040 002295 (v01 BOCHS BXPC 00000001 BXPC 00000001) [ 0.000000] ACPI: FACS 0x00000000BFFE0000 000040 [ 0.000000] ACPI: APIC 0x00000000BFFE2349 000090 (v03 BOCHS BXPC 00000001 BXPC 00000001) [ 0.000000] ACPI: HPET 0x00000000BFFE23D9 000038 (v01 BOCHS BXPC 00000001 BXPC 00000001) [ 0.000000] ACPI: WAET 0x00000000BFFE2411 000028 (v01 BOCHS BXPC 00000001 BXPC 00000001) [ 0.000000] ACPI: Reserving FACP table memory at [mem 0xbffe22d5-0xbffe2348] [ 0.000000] ACPI: Reserving DSDT table memory at [mem 0xbffe0040-0xbffe22d4] [ 0.000000] ACPI: Reserving FACS table memory at [mem 0xbffe0000-0xbffe003f] [ 0.000000] ACPI: Reserving APIC table memory at [mem 0xbffe2349-0xbffe23d8] [ 0.000000] ACPI: Reserving HPET table memory at [mem 0xbffe23d9-0xbffe2410] [ 0.000000] ACPI: Reserving WAET table memory at [mem 0xbffe2411-0xbffe2438] [ 0.000000] No NUMA configuration found [ 0.000000] Faking a node at [mem 0x0000000000000000-0x0000000146dfffff] [ 0.000000] NODE_DATA(0) allocated [mem 0x1465a3000-0x1465cdfff] [ 0.000000] Reserving 256MB of memory at 2752MB for crashkernel (System RAM: 4205MB) [ 0.000000] Zone ranges: [ 0.000000] DMA [mem 0x0000000000001000-0x0000000000ffffff] [ 0.000000] DMA32 [mem 0x0000000001000000-0x00000000ffffffff] [ 0.000000] Normal [mem 0x0000000100000000-0x0000000146dfffff] [ 0.000000] Device empty [ 0.000000] Movable zone start for each node [ 0.000000] Early memory node ranges [ 0.000000] node 0: [mem 0x0000000000001000-0x000000000009efff] [ 0.000000] node 0: [mem 0x0000000000100000-0x00000000bffcdfff] [ 0.000000] node 0: [mem 0x0000000100000000-0x0000000146dfffff] [ 0.000000] Zeroed struct page in unavailable ranges: 4756 pages [ 0.000000] Initmem setup node 0 [mem 0x0000000000001000-0x0000000146dfffff] [ 0.000000] ACPI: PM-Timer IO Port: 0x608 [ 0.000000] ACPI: LAPIC_NMI (acpi_id[0xff] dfl dfl lint[0x1]) [ 0.000000] IOAPIC[0]: apic_id 0, version 17, address 0xfec00000, GSI 0-23 [ 0.000000] ACPI: INT_SRC_OVR (bus 0 bus_irq 0 global_irq 2 dfl dfl) [ 0.000000] ACPI: INT_SRC_OVR (bus 0 bus_irq 5 global_irq 5 high level) [ 0.000000] ACPI: INT_SRC_OVR (bus 0 bus_irq 9 global_irq 9 high level) [ 0.000000] ACPI: INT_SRC_OVR (bus 0 bus_irq 10 global_irq 10 high level) [ 0.000000] ACPI: INT_SRC_OVR (bus 0 bus_irq 11 global_irq 11 high level) [ 0.000000] Using ACPI (MADT) for SMP configuration information [ 0.000000] ACPI: HPET id: 0x8086a201 base: 0xfed00000 [ 0.000000] TSC deadline timer available [ 0.000000] smpboot: Allowing 4 CPUs, 0 hotplug CPUs [ 0.000000] kvm-guest: KVM setup pv remote TLB flush [ 0.000000] kvm-guest: setup PV sched yield [ 0.000000] PM: Registered nosave memory: [mem 0x00000000-0x00000fff] [ 0.000000] PM: Registered nosave memory: [mem 0x0009f000-0x0009ffff] [ 0.000000] PM: Registered nosave memory: [mem 0x000a0000-0x000effff] [ 0.000000] PM: Registered nosave memory: [mem 0x000f0000-0x000fffff] [ 0.000000] PM: Registered nosave memory: [mem 0xbffce000-0xbfffffff] [ 0.000000] PM: Registered nosave memory: [mem 0xc0000000-0xfeffbfff] [ 0.000000] PM: Registered nosave memory: [mem 0xfeffc000-0xfeffffff] [ 0.000000] PM: Registered nosave memory: [mem 0xff000000-0xfffbffff] [ 0.000000] PM: Registered nosave memory: [mem 0xfffc0000-0xffffffff] [ 0.000000] [mem 0xc0000000-0xfeffbfff] available for PCI devices [ 0.000000] Booting paravirtualized kernel on KVM [ 0.000000] clocksource: refined-jiffies: mask: 0xffffffff max_cycles: 0xffffffff, max_idle_ns: 1910969940391419 ns [ 0.000000] setup_percpu: NR_CPUS:8192 nr_cpumask_bits:4 nr_cpu_ids:4 nr_node_ids:1 [ 0.000000] percpu: Embedded 63 pages/cpu s221184 r8192 d28672 u524288 [ 0.000000] kvm-guest: PV spinlocks enabled [ 0.000000] PV qspinlock hash table entries: 256 (order: 0, 4096 bytes, linear) [ 0.000000] Built 1 zonelists, mobility grouping on. Total pages: 1059606 [ 0.000000] Policy zone: Normal [ 0.000000] Kernel command line: rd.shell root=nbd:192.168.200.253:rocky8.10:ext4:ro:-p,-b4096 ro crashkernel=256M panic=1 nomodeset ipmtu=9000 ip=dhcp rd.neednet=1 init_on_free=off mitigations=off console=ttyS1,115200 audit=0 [ 0.000000] Specific versions of hardware are certified with Red Hat Enterprise Linux 8. Please see the list of hardware certified with Red Hat Enterprise Linux 8 at https://catalog.redhat.com. [ 0.000000] audit: disabled (until reboot) [ 0.000000] software IO TLB: area num 4. [ 0.000000] Memory: 2829652K/4306352K available (18435K kernel code, 11221K rwdata, 7248K rodata, 2908K init, 18040K bss, 524580K reserved, 0K cma-reserved) [ 0.000000] SLUB: HWalign=64, Order=0-3, MinObjects=0, CPUs=4, Nodes=1 [ 0.000000] kmemleak: Kernel memory leak detector disabled [ 0.000000] ftrace: allocating 41240 entries in 162 pages [ 0.000000] ftrace: allocated 162 pages with 3 groups [ 0.000000] rcu: Hierarchical RCU implementation. [ 0.000000] rcu: RCU event tracing is enabled. [ 0.000000] rcu: RCU restricting CPUs from NR_CPUS=8192 to nr_cpu_ids=4. [ 0.000000] rcu: RCU callback double-/use-after-free debug enabled. [ 0.000000] Rude variant of Tasks RCU enabled. [ 0.000000] Tracing variant of Tasks RCU enabled. [ 0.000000] rcu: RCU calculated value of scheduler-enlistment delay is 100 jiffies. [ 0.000000] rcu: Adjusting geometry for rcu_fanout_leaf=16, nr_cpu_ids=4 [ 0.000000] NR_IRQS: 524544, nr_irqs: 456, preallocated irqs: 16 [ 0.000000] random: get_random_bytes called from start_kernel+0x622/0x9a8 with crng_init=0 [ 0.001000] Console: colour *CGA 80x25 [ 0.001000] printk: console [ttyS1] enabled [ 0.001000] ACPI: Core revision 20220331 [ 0.001000] clocksource: hpet: mask: 0xffffffff max_cycles: 0xffffffff, max_idle_ns: 19112604467 ns [ 0.001010] APIC: Switch to symmetric I/O mode setup [ 0.002382] x2apic enabled [ 0.003009] Switched APIC routing to physical x2apic. [ 0.004016] kvm-guest: setup PV IPIs [ 0.007394] ..TIMER: vector=0x30 apic1=0 pin1=2 apic2=-1 pin2=-1 [ 0.008000] clocksource: tsc-early: mask: 0xffffffffffffffff max_cycles: 0x229833f6470, max_idle_ns: 440795327230 ns [ 0.008032] Calibrating delay loop (skipped) preset value.. 4799.99 BogoMIPS (lpj=2399996) [ 0.009015] pid_max: default: 32768 minimum: 301 [ 0.010165] LSM: Security Framework initializing [ 0.011062] Yama: becoming mindful. [ 0.012043] SELinux: Initializing. [ 0.013110] *** VALIDATE selinux *** [ 0.025465] Dentry cache hash table entries: 1048576 (order: 11, 8388608 bytes, vmalloc) [ 0.032173] Inode-cache hash table entries: 524288 (order: 10, 4194304 bytes, vmalloc) [ 0.033173] Mount-cache hash table entries: 16384 (order: 5, 131072 bytes, vmalloc) [ 0.034136] Mountpoint-cache hash table entries: 16384 (order: 5, 131072 bytes, vmalloc) [ 0.035121] *** VALIDATE tmpfs *** [ 0.037467] *** VALIDATE proc *** [ 0.038241] *** VALIDATE cgroup *** [ 0.039007] *** VALIDATE cgroup2 *** [ 0.040264] x86/cpu: User Mode Instruction Prevention (UMIP) activated [ 0.041173] Last level iTLB entries: 4KB 0, 2MB 0, 4MB 0 [ 0.042009] Last level dTLB entries: 4KB 0, 2MB 0, 4MB 0, 1GB 0 [ 0.043099] Spectre V2 : User space: Vulnerable [ 0.044010] Speculative Store Bypass: Vulnerable [ 0.047652] debug: unmapping init [mem 0xffffffffbe059000-0xffffffffbe060fff] [ 0.049480] smpboot: CPU0: Intel(R) Xeon(R) CPU E5-2695 v2 @ 2.40GHz (family: 0x6, model: 0x3e, stepping: 0x4) [ 0.051284] Performance Events: IvyBridge events, full-width counters, Intel PMU driver. [ 0.052027] ... version: 2 [ 0.053012] ... bit width: 48 [ 0.054011] ... generic registers: 4 [ 0.055011] ... value mask: 0000ffffffffffff [ 0.056015] ... max period: 00007fffffffffff [ 0.057014] ... fixed-purpose events: 3 [ 0.058011] ... event mask: 000000070000000f [ 0.059298] rcu: Hierarchical SRCU implementation. [ 0.062050] smp: Bringing up secondary CPUs ... [ 0.064755] x86: Booting SMP configuration: [ 0.065136] .... node #0, CPUs: #1 #2 #3 [ 0.078017] smp: Brought up 1 node, 4 CPUs [ 0.080016] smpboot: Max logical packages: 1 [ 0.081016] smpboot: Total of 4 processors activated (19199.96 BogoMIPS) [ 0.163024] node 0 deferred pages initialised in 77ms [ 0.181260] devtmpfs: initialized [ 0.182410] x86/mm: Memory block size: 128MB [ 0.186824] gcov: version magic: 0x41383552 [ 0.190167] clocksource: jiffies: mask: 0xffffffff max_cycles: 0xffffffff, max_idle_ns: 1911260446275000 ns [ 0.193151] futex hash table entries: 1024 (order: 4, 65536 bytes, vmalloc) [ 0.196622] pinctrl core: initialized pinctrl subsystem [ 0.200345] [ 0.201000] ************************************************************* [ 0.204015] ** NOTICE NOTICE NOTICE NOTICE NOTICE NOTICE NOTICE ** [ 0.207012] ** ** [ 0.210015] ** IOMMU DebugFS SUPPORT HAS BEEN ENABLED IN THIS KERNEL ** [ 0.211014] ** ** [ 0.216018] ** This means that this kernel is built to expose internal ** [ 0.219014] ** IOMMU data structures, which may compromise security on ** [ 0.220017] ** your system. ** [ 0.223018] ** ** [ 0.225016] ** If you see this message and you are not debugging the ** [ 0.228017] ** kernel, report this immediately to your vendor! ** [ 0.233014] ** ** [ 0.237015] ** NOTICE NOTICE NOTICE NOTICE NOTICE NOTICE NOTICE ** [ 0.242016] ************************************************************* [ 0.248199] NET: Registered protocol family 16 [ 0.252130] DMA: preallocated 512 KiB GFP_KERNEL pool for atomic allocations [ 0.258168] DMA: preallocated 512 KiB GFP_KERNEL|GFP_DMA pool for atomic allocations [ 0.262340] DMA: preallocated 512 KiB GFP_KERNEL|GFP_DMA32 pool for atomic allocations [ 0.269403] cpuidle: using governor menu [ 0.273415] acpiphp: ACPI Hot Plug PCI Controller Driver version: 0.5 [ 0.280062] PCI: Using configuration type 1 for base access [ 0.283231] core: PMU erratum BJ122, BV98, HSD29 worked around, HT is on [ 0.297491] HugeTLB registered 1.00 GiB page size, pre-allocated 0 pages [ 0.298020] HugeTLB registered 2.00 MiB page size, pre-allocated 0 pages [ 0.300085] cryptd: max_cpu_qlen set to 1000 [ 0.311380] ACPI: Added _OSI(Module Device) [ 0.312017] ACPI: Added _OSI(Processor Device) [ 0.315021] ACPI: Added _OSI(3.0 _SCP Extensions) [ 0.318015] ACPI: Added _OSI(Processor Aggregator Device) [ 0.326800] ACPI: 1 ACPI AML tables successfully acquired and loaded [ 0.344652] ACPI: Interpreter enabled [ 0.347052] ACPI: PM: (supports S0 S3 S4 S5) [ 0.348012] ACPI: Using IOAPIC for interrupt routing [ 0.351109] PCI: Using host bridge windows from ACPI; if necessary, use "pci=nocrs" and report a bug [ 0.358116] ACPI: Enabled 2 GPEs in block 00 to 0F [ 0.373172] ACPI: PCI Root Bridge [PCI0] (domain 0000 [bus 00-ff]) [ 0.376053] acpi PNP0A03:00: _OSC: OS supports [ASPM ClockPM Segments MSI HPX-Type3] [ 0.380294] acpi PNP0A03:00: _OSC: not requesting OS control; OS requires [ExtendedConfig ASPM ClockPM MSI] [ 0.385085] acpi PNP0A03:00: fail to add MMCONFIG information, can't access extended PCI configuration space under this bridge. [ 0.394631] acpiphp: Slot [2] registered [ 0.396337] acpiphp: Slot [5] registered [ 0.398290] acpiphp: Slot [6] registered [ 0.400525] acpiphp: Slot [7] registered [ 0.403201] acpiphp: Slot [8] registered [ 0.407180] acpiphp: Slot [9] registered [ 0.409381] acpiphp: Slot [10] registered [ 0.412157] acpiphp: Slot [3] registered [ 0.413388] acpiphp: Slot [4] registered [ 0.415113] acpiphp: Slot [11] registered [ 0.417152] acpiphp: Slot [12] registered [ 0.420133] acpiphp: Slot [13] registered [ 0.423131] acpiphp: Slot [14] registered [ 0.425194] acpiphp: Slot [15] registered [ 0.427261] acpiphp: Slot [16] registered [ 0.430204] acpiphp: Slot [17] registered [ 0.433169] acpiphp: Slot [18] registered [ 0.436126] acpiphp: Slot [19] registered [ 0.440154] acpiphp: Slot [20] registered [ 0.444145] acpiphp: Slot [21] registered [ 0.450140] acpiphp: Slot [22] registered [ 0.460148] acpiphp: Slot [23] registered [ 0.464152] acpiphp: Slot [24] registered [ 0.468307] acpiphp: Slot [25] registered [ 0.471108] acpiphp: Slot [26] registered [ 0.474133] acpiphp: Slot [27] registered [ 0.478127] acpiphp: Slot [28] registered [ 0.480092] acpiphp: Slot [29] registered [ 0.483376] acpiphp: Slot [30] registered [ 0.490167] acpiphp: Slot [31] registered [ 0.493076] PCI host bridge to bus 0000:00 [ 0.494000] pci_bus 0000:00: root bus resource [io 0x0000-0x0cf7 window] [ 0.498030] pci_bus 0000:00: root bus resource [io 0x0d00-0xffff window] [ 0.499000] pci_bus 0000:00: root bus resource [mem 0x000a0000-0x000bffff window] [ 0.505030] pci_bus 0000:00: root bus resource [mem 0xc0000000-0xfebfffff window] [ 0.508025] pci_bus 0000:00: root bus resource [mem 0xe0000000000-0xe007fffffff window] [ 0.511029] pci_bus 0000:00: root bus resource [bus 00-ff] [ 0.516316] pci 0000:00:00.0: [8086:1237] type 00 class 0x060000 [ 0.522892] pci 0000:00:01.0: [8086:7000] type 00 class 0x060100 [ 0.527972] pci 0000:00:01.1: [8086:7010] type 00 class 0x010180 [ 0.546014] pci 0000:00:01.1: reg 0x20: [io 0xc320-0xc32f] [ 0.553605] pci 0000:00:01.1: legacy IDE quirk: reg 0x10: [io 0x01f0-0x01f7] [ 0.560021] pci 0000:00:01.1: legacy IDE quirk: reg 0x14: [io 0x03f6] [ 0.566368] pci 0000:00:01.1: legacy IDE quirk: reg 0x18: [io 0x0170-0x0177] [ 0.572026] pci 0000:00:01.1: legacy IDE quirk: reg 0x1c: [io 0x0376] [ 0.576000] pci 0000:00:01.3: [8086:7113] type 00 class 0x068000 [ 0.581211] pci 0000:00:01.3: quirk: [io 0x0600-0x063f] claimed by PIIX4 ACPI [ 0.586067] pci 0000:00:01.3: quirk: [io 0x0700-0x070f] claimed by PIIX4 SMB [ 0.592108] pci 0000:00:02.0: [1af4:1000] type 00 class 0x020000 [ 0.600016] pci 0000:00:02.0: reg 0x10: [io 0xc300-0xc31f] [ 0.620016] pci 0000:00:02.0: reg 0x20: [mem 0xe0000000000-0xe0000003fff 64bit pref] [ 0.628016] pci 0000:00:02.0: reg 0x30: [mem 0xfeb80000-0xfebbffff pref] [ 0.635531] pci 0000:00:05.0: [1af4:1001] type 00 class 0x010000 [ 0.644021] pci 0000:00:05.0: reg 0x10: [io 0xc000-0xc07f] [ 0.671026] pci 0000:00:05.0: reg 0x14: [mem 0xfebc0000-0xfebc0fff] [ 0.688000] pci 0000:00:05.0: reg 0x20: [mem 0xe0000004000-0xe0000007fff 64bit pref] [ 0.688000] pci 0000:00:06.0: [1af4:1001] type 00 class 0x010000 [ 0.705035] pci 0000:00:06.0: reg 0x10: [io 0xc080-0xc0ff] [ 0.720025] pci 0000:00:06.0: reg 0x14: [mem 0xfebc1000-0xfebc1fff] [ 0.744024] pci 0000:00:06.0: reg 0x20: [mem 0xe0000008000-0xe000000bfff 64bit pref] [ 0.755187] pci 0000:00:07.0: [1af4:1001] type 00 class 0x010000 [ 0.761019] pci 0000:00:07.0: reg 0x10: [io 0xc100-0xc17f] [ 0.768032] pci 0000:00:07.0: reg 0x14: [mem 0xfebc2000-0xfebc2fff] [ 0.788022] pci 0000:00:07.0: reg 0x20: [mem 0xe000000c000-0xe000000ffff 64bit pref] [ 0.799781] pci 0000:00:08.0: [1af4:1001] type 00 class 0x010000 [ 0.811026] pci 0000:00:08.0: reg 0x10: [io 0xc180-0xc1ff] [ 0.820022] pci 0000:00:08.0: reg 0x14: [mem 0xfebc3000-0xfebc3fff] [ 0.834022] pci 0000:00:08.0: reg 0x20: [mem 0xe0000010000-0xe0000013fff 64bit pref] [ 0.847183] pci 0000:00:09.0: [1af4:1001] type 00 class 0x010000 [ 0.857020] pci 0000:00:09.0: reg 0x10: [io 0xc200-0xc27f] [ 0.867019] pci 0000:00:09.0: reg 0x14: [mem 0xfebc4000-0xfebc4fff] [ 0.891219] pci 0000:00:09.0: reg 0x20: [mem 0xe0000014000-0xe0000017fff 64bit pref] [ 0.904699] pci 0000:00:0a.0: [1af4:1001] type 00 class 0x010000 [ 0.920018] pci 0000:00:0a.0: reg 0x10: [io 0xc280-0xc2ff] [ 0.927020] pci 0000:00:0a.0: reg 0x14: [mem 0xfebc5000-0xfebc5fff] [ 0.944020] pci 0000:00:0a.0: reg 0x20: [mem 0xe0000018000-0xe000001bfff 64bit pref] [ 0.957000] ACPI: PCI: Interrupt link LNKA configured for IRQ 10 [ 0.961426] ACPI: PCI: Interrupt link LNKB configured for IRQ 10 [ 0.964531] ACPI: PCI: Interrupt link LNKC configured for IRQ 11 [ 0.968339] ACPI: PCI: Interrupt link LNKD configured for IRQ 11 [ 0.971238] ACPI: PCI: Interrupt link LNKS configured for IRQ 9 [ 0.977309] iommu: Default domain type: Passthrough [ 0.978000] SCSI subsystem initialized [ 0.979463] ACPI: bus type USB registered [ 0.982149] usbcore: registered new interface driver usbfs [ 0.985125] usbcore: registered new interface driver hub [ 0.988100] usbcore: registered new device driver usb [ 0.991316] pps_core: LinuxPPS API ver. 1 registered [ 0.994015] pps_core: Software ver. 5.3.6 - Copyright 2005-2007 Rodolfo Giometti [ 1.000092] PTP clock support registered [ 1.002343] EDAC MC: Ver: 3.0.0 [ 1.006206] PCI: Using ACPI for IRQ routing [ 1.009704] NetLabel: Initializing [ 1.012240] NetLabel: domain hash size = 128 [ 1.016014] NetLabel: protocols = UNLABELED CIPSOv4 CALIPSO [ 1.020043] NetLabel: unlabeled traffic allowed by default [ 1.025298] vgaarb: loaded [ 1.028065] hpet0: at MMIO 0xfed00000, IRQs 2, 8, 0 [ 1.031010] hpet0: 3 comparators, 64-bit 100.000000 MHz counter [ 1.045118] clocksource: Switched to clocksource kvm-clock [ 1.345488] VFS: Disk quotas dquot_6.6.0 [ 1.355329] VFS: Dquot-cache hash table entries: 512 (order 0, 4096 bytes) [ 1.363159] *** VALIDATE ramfs *** [ 1.364946] *** VALIDATE hugetlbfs *** [ 1.371367] pnp: PnP ACPI init [ 1.379888] pnp: PnP ACPI: found 6 devices [ 1.414243] clocksource: acpi_pm: mask: 0xffffff max_cycles: 0xffffff, max_idle_ns: 2085701024 ns [ 1.419662] pci_bus 0000:00: resource 4 [io 0x0000-0x0cf7 window] [ 1.423625] pci_bus 0000:00: resource 5 [io 0x0d00-0xffff window] [ 1.426319] pci_bus 0000:00: resource 6 [mem 0x000a0000-0x000bffff window] [ 1.430515] pci_bus 0000:00: resource 7 [mem 0xc0000000-0xfebfffff window] [ 1.433869] pci_bus 0000:00: resource 8 [mem 0xe0000000000-0xe007fffffff window] [ 1.438659] NET: Registered protocol family 2 [ 1.441468] IP idents hash table entries: 131072 (order: 8, 1048576 bytes, vmalloc) [ 1.447902] tcp_listen_portaddr_hash hash table entries: 4096 (order: 5, 163840 bytes, vmalloc) [ 1.453629] TCP established hash table entries: 65536 (order: 7, 524288 bytes, vmalloc) [ 1.460968] TCP bind hash table entries: 65536 (order: 9, 2097152 bytes, vmalloc) [ 1.464367] TCP: Hash tables configured (established 65536 bind 65536) [ 1.468754] MPTCP token hash table entries: 8192 (order: 6, 393216 bytes, vmalloc) [ 1.510852] UDP hash table entries: 4096 (order: 6, 393216 bytes, vmalloc) [ 1.515366] UDP-Lite hash table entries: 4096 (order: 6, 393216 bytes, vmalloc) [ 1.521124] NET: Registered protocol family 1 [ 1.524703] RPC: Registered named UNIX socket transport module. [ 1.527115] RPC: Registered udp transport module. [ 1.532710] RPC: Registered tcp transport module. [ 1.536419] RPC: Registered tcp NFSv4.1 backchannel transport module. [ 1.544277] NET: Registered protocol family 44 [ 1.547664] pci 0000:00:00.0: Limiting direct PCI/PCI transfers [ 1.550836] pci 0000:00:01.0: PIIX3: Enabling Passive Release [ 1.559445] pci 0000:00:01.0: Activating ISA DMA hang workarounds [ 1.565164] PCI: CLS 0 bytes, default 64 [ 1.570568] Unpacking initramfs... [ 4.143613] debug: unmapping init [mem 0xffff8dcafcc54000-0xffff8dcafffbffff] [ 4.151537] PCI-DMA: Using software bounce buffering for IO (SWIOTLB) [ 4.161650] software IO TLB: mapped [mem 0x00000000a8000000-0x00000000ac000000] (64MB) [ 4.164898] clocksource: tsc: mask: 0xffffffffffffffff max_cycles: 0x229833f6470, max_idle_ns: 440795327230 ns [ 4.978723] Initialise system trusted keyrings [ 4.981438] Key type blacklist registered [ 4.984251] workingset: timestamp_bits=36 max_order=20 bucket_order=0 [ 4.995472] zbud: loaded [ 5.001507] *** VALIDATE nfs *** [ 5.003318] *** VALIDATE nfs4 *** [ 5.007062] pstore: using deflate compression [ 5.012716] Platform Keyring initialized [ 5.251583] NET: Registered protocol family 38 [ 5.254125] Key type asymmetric registered [ 5.256423] Asymmetric key parser 'x509' registered [ 5.259931] Block layer SCSI generic (bsg) driver version 0.4 loaded (major 247) [ 5.265456] io scheduler mq-deadline registered [ 5.268649] io scheduler kyber registered [ 5.271638] io scheduler bfq registered [ 5.275257] atomic64_test: passed for x86-64 platform with CX8 and with SSE [ 5.279781] shpchp: Standard Hot Plug PCI Controller Driver version: 0.4 [ 5.283940] input: Power Button as /devices/LNXSYSTM:00/LNXPWRBN:00/input/input0 [ 5.287965] ACPI: Power Button [PWRF] [ 5.298346] ACPI: \_SB_.LNKB: Enabled at IRQ 10 [ 5.312685] ACPI: \_SB_.LNKA: Enabled at IRQ 11 [ 5.351406] ACPI: \_SB_.LNKC: Enabled at IRQ 11 [ 5.370566] ACPI: \_SB_.LNKD: Enabled at IRQ 10 [ 5.428052] Serial: 8250/16550 driver, 4 ports, IRQ sharing enabled [ 5.472553] 00:03: ttyS1 at I/O 0x2f8 (irq = 3, base_baud = 115200) is a 16550A [ 5.507575] 00:04: ttyS0 at I/O 0x3f8 (irq = 4, base_baud = 115200) is a 16550A [ 5.515350] Non-volatile memory driver v1.3 [ 5.517744] Linux agpgart interface v0.103 [ 5.573551] virtio_blk virtio1: [vda] 134584 512-byte logical blocks (68.9 MB/65.7 MiB) [ 5.580013] vda: detected capacity change from 0 to 68907008 [ 5.601725] virtio_blk virtio2: [vdb] 2097152 512-byte logical blocks (1.07 GB/1.00 GiB) [ 5.605181] vdb: detected capacity change from 0 to 1073741824 [ 5.625594] virtio_blk virtio3: [vdc] 5120000 512-byte logical blocks (2.62 GB/2.44 GiB) [ 5.630334] vdc: detected capacity change from 0 to 2621440000 [ 5.649134] virtio_blk virtio4: [vdd] 5120000 512-byte logical blocks (2.62 GB/2.44 GiB) [ 5.655916] vdd: detected capacity change from 0 to 2621440000 [ 5.675321] virtio_blk virtio5: [vde] 8388608 512-byte logical blocks (4.29 GB/4.00 GiB) [ 5.679546] vde: detected capacity change from 0 to 4294967296 [ 5.716495] virtio_blk virtio6: [vdf] 8388608 512-byte logical blocks (4.29 GB/4.00 GiB) [ 5.762098] vdf: detected capacity change from 0 to 4294967296 [ 5.822669] libphy: Fixed MDIO Bus: probed [ 5.844437] usbcore: registered new interface driver usbserial_generic [ 5.847183] usbserial: USB Serial support registered for generic [ 5.849416] i8042: PNP: PS/2 Controller [PNP0303:KBD,PNP0f13:MOU] at 0x60,0x64 irq 1,12 [ 5.854616] serio: i8042 KBD port at 0x60,0x64 irq 1 [ 5.856407] serio: i8042 AUX port at 0x60,0x64 irq 12 [ 5.858405] mousedev: PS/2 mouse device common for all mice [ 5.861395] rtc_cmos 00:05: RTC can wake from S4 [ 5.867622] input: AT Translated Set 2 keyboard as /devices/platform/i8042/serio0/input/input1 [ 5.871530] rtc_cmos 00:05: registered as rtc0 [ 5.874326] rtc_cmos 00:05: alarms up to one day, y3k, 242 bytes nvram, hpet irqs [ 5.874803] input: VirtualPS/2 VMware VMMouse as /devices/platform/i8042/serio1/input/input4 [ 5.878108] intel_pstate: CPU model not supported [ 5.881559] input: VirtualPS/2 VMware VMMouse as /devices/platform/i8042/serio1/input/input3 [ 5.884765] hid: raw HID events driver (C) Jiri Kosina [ 5.885945] usbcore: registered new interface driver usbhid [ 5.887487] usbhid: USB HID core driver [ 5.888462] drop_monitor: Initializing network drop monitor service [ 5.889926] Initializing XFRM netlink socket [ 5.891323] NET: Registered protocol family 10 [ 5.893530] Segment Routing with IPv6 [ 5.894589] NET: Registered protocol family 17 [ 5.897486] mpls_gso: MPLS GSO support [ 5.905474] RAS: Correctable Errors collector initialized. [ 5.908574] AVX version of gcm_enc/dec engaged. [ 5.910966] AES CTR mode by8 optimization enabled [ 6.100902] sched_clock: Marking stable (6100883570, 0)->(8073743181, -1972859611) [ 6.106577] registered taskstats version 1 [ 6.111506] Loading compiled-in X.509 certificates [ 6.113926] zswap: loaded using pool lzo/zbud [ 6.153293] Key type big_key registered [ 6.171956] Key type encrypted registered [ 6.178975] ima: No TPM chip found, activating TPM-bypass! [ 6.181253] ima: Allocated hash algorithm: sha1 [ 6.183624] ima: No architecture policies found [ 6.188768] evm: Initialising EVM extended attributes: [ 6.191052] evm: security.selinux [ 6.192612] evm: security.ima [ 6.194553] evm: security.capability [ 6.196266] evm: HMAC attrs: 0x1 [ 6.199618] rtc_cmos 00:05: setting system clock to 2026-03-16 13:36:02 UTC (1773668162) [ 6.211637] debug: unmapping init [mem 0xffffffffbf003000-0xffffffffbf1fffff] [ 6.216294] debug: unmapping init [mem 0xffffffffbdd82000-0xffffffffbe058fff] [ 6.226157] Write protecting the kernel read-only data: 28672k [ 6.230535] debug: unmapping init [mem 0xffffffffbc403000-0xffffffffbc5fffff] [ 6.234612] debug: unmapping init [mem 0xffffffffbcd14000-0xffffffffbcdfffff] [ 6.345516] systemd[1]: systemd 239 (239-82.el8_10.5) running in system mode. (+PAM +AUDIT +SELINUX +IMA -APPARMOR +SMACK +SYSVINIT +UTMP +LIBCRYPTSETUP +GCRYPT +GNUTLS +ACL +XZ +LZ4 +SECCOMP +BLKID +ELFUTILS +KMOD +IDN2 -IDN +PCRE2 default-hierarchy=legacy) [ 6.374850] systemd[1]: Detected virtualization kvm. [ 6.382362] systemd[1]: Detected architecture x86-64. [ 6.393952] systemd[1]: Running in initial RAM disk. Welcome to Rocky Linux 8.10 (Green Obsidian) dracut-049-233.git20240115.el8 (Initramfs)! [ 6.433849] systemd[1]: No hostname configured. [ 6.437717] systemd[1]: Set hostname to . [ 6.442489] random: systemd: uninitialized urandom read (16 bytes read) [ 6.445484] systemd[1]: Initializing machine ID from random generator. [ 6.537212] random: ln: uninitialized urandom read (6 bytes read) [ 6.705791] random: systemd: uninitialized urandom read (16 bytes read) [ 6.709573] systemd[1]: Listening on Journal Socket (/dev/log). [ OK ] Listening on Journal Socket (/dev/log). [ 6.717595] systemd[1]: Reached target Timers. [ OK ] Reached target Timers. [ 6.727581] systemd[1]: Reached target Initrd Root Device. [ OK ] Reached target Initrd Root Device. [ OK ] Listening on udev Control Socket. [ OK ] Listening on Journal Socket. Starting Create list of required st…ce nodes for the current kernel... Starting Apply Kernel Variables... Starting Setup Virtual Console... [ OK ] Reached target Local File Systems. Starting Create Volatile Files and Directories... [ OK ] Listening on udev Kernel Socket. [ OK ] Reached target Sockets. [ OK ] Reached target Swap. Starting Journal Service... [ OK ] Started Dispatch Password Requests to Console Directory Watch. [ OK ] Reached target Paths. [ OK ] Started Memstrack Anylazing Service. [ OK ] Reached target Slices. [ OK ] Reached target Local Encrypted Volumes. [ OK ] Started Create list of required sta…vice nodes for the current kernel. [ OK ] Started Apply Kernel Variables. [ OK ] Started Setup Virtual Console. [ OK ] Started Create Volatile Files and Directories. Starting dracut cmdline hook... Starting Create Static Device Nodes in /dev... [ OK ] Started Create Static Device Nodes in /dev. [ OK ] Started Journal Service. [ OK ] Started dracut cmdline hook. Starting dracut pre-udev hook... [ 8.142340] device-mapper: uevent: version 1.0.3 [ 8.147120] device-mapper: ioctl: 4.46.0-ioctl (2022-02-22) initialised: dm-devel@redhat.com [ OK ] Started dracut pre-udev hook. Starting udev Kernel Device Manager... [ OK ] Started udev Kernel Device Manager. Starting dracut pre-trigger hook... [ OK ] Started dracut pre-trigger hook. Starting udev Coldplug all Devices... Mounting Kernel Configuration File System... [ OK ] Mounted Kernel Configuration File System. [ OK ] Started udev Coldplug all Devices. [ OK ] Reached target System Initialization. [ OK ] Reached target Basic System. [ OK ] Started Hardware RNG Entropy Gatherer Daemon. [ 9.918549] virtio_net virtio0 ens2: renamed from eth0 [ 9.973972] random: fast init done Starting dracut initqueue hook... [ 10.757874] scsi host0: ata_piix [ 10.771659] scsi host1: ata_piix [ 10.794590] ata1: PATA max MWDMA2 cmd 0x1f0 ctl 0x3f6 bmdma 0xc320 irq 14 [ 10.799291] ata2: PATA max MWDMA2 cmd 0x170 ctl 0x376 bmdma 0xc328 irq 15 [ 14.704704] random: crng init done [ 14.708976] random: 7 urandom warning(s) missed due to ratelimiting [ 16.493493] dracut-initqueue[593]: RTNETLINK answers: File exists Starting nbd nbd0... [ OK ] Started nbd nbd0. [ OK ] Started dracut initqueue hook. Mounting /sysroot... [ OK ] Reached target Remote File Systems (Pre). [ OK ] Reached target Remote File Systems. [ 17.625438] EXT4-fs (nbd0): mounted filesystem with ordered data mode. Opts: (null) [ OK ] Mounted /sysroot. [ OK ] Reached target Initrd Root File System. Starting Reload Configuration from the Real Root... [ OK ] Started Reload Configuration from the Real Root. [ OK ] Reached target Initrd File Systems. [ OK ] Reached target Initrd Default Target. Starting dracut pre-pivot and cleanup hook... [ OK ] Started dracut pre-pivot and cleanup hook. Starting Cleaning Up and Shutting Down Daemons... [ OK ] Stopped dracut pre-pivot and cleanup hook. [ OK ] Stopped target Remote File Systems. Stopping Hardware RNG Entropy Gatherer Daemon... [ OK ] Stopped target Timers. [ OK ] Stopped target Remote File Systems (Pre). [ OK ] Stopped dracut initqueue hook. [ OK ] Stopped target Initrd Default Target. [ OK ] Stopped target Initrd Root Device. [ OK ] Stopped Hardware RNG Entropy Gatherer Daemon. [ OK ] Stopped target Basic System. [ OK ] Stopped target System Initialization. [ OK ] Stopped udev Coldplug all Devices. [ OK ] Stopped dracut pre-trigger hook. Stopping udev Kernel Device Manager... [ OK ] Stopped target Swap. [ OK ] Stopped target Local Encrypted Volumes. [ OK ] Stopped Apply Kernel Variables. [ OK ] Stopped Create Volatile Files and Directories. [ OK ] Stopped target Local File Systems. [ OK ] Stopped target Paths. [ OK ] Stopped Dispatch Password Requests to Console Directory Watch. [ OK ] Stopped target Slices. [ OK ] Stopped target Sockets. [ OK ] Stopped udev Kernel Device Manager. [ OK ] Started Cleaning Up and Shutting Down Daemons. [ OK ] Stopped Create Static Device Nodes in /dev. [ OK ] Stopped Create list of required sta…vice nodes for the current kernel. [ OK ] Stopped dracut pre-udev hook. [ OK ] Stopped dracut cmdline hook. [ OK ] Closed udev Kernel Socket. [ OK ] Closed udev Control Socket. Starting Cleanup udevd DB... [ OK ] Started Cleanup udevd DB. [ OK ] Reached target Switch Root. Starting Switch Root... [ 20.177493] printk: systemd: 26 output lines suppressed due to ratelimiting [ 20.785687] SELinux: Disabled at runtime. [ 20.865111] systemd[1]: systemd 239 (239-82.el8_10.5) running in system mode. (+PAM +AUDIT +SELINUX +IMA -APPARMOR +SMACK +SYSVINIT +UTMP +LIBCRYPTSETUP +GCRYPT +GNUTLS +ACL +XZ +LZ4 +SECCOMP +BLKID +ELFUTILS +KMOD +IDN2 -IDN +PCRE2 default-hierarchy=legacy) [ 20.873738] systemd[1]: Detected virtualization kvm. [ 20.875984] systemd[1]: Detected architecture x86-64. Welcome to Rocky Linux 8.10 (Green Obsidian)! [ 21.987319] systemd[1]: initrd-switch-root.service: Succeeded. [ 21.991529] systemd[1]: Stopped Switch Root. [ OK ] Stopped Switch Root. [ 22.007061] systemd[1]: systemd-journald.service: Service has no hold-off time (RestartSec=0), scheduling restart. [ 22.013396] systemd[1]: systemd-journald.service: Scheduled restart job, restart counter is at 1. [ 22.021274] systemd[1]: Stopped Journal Service. [ OK ] Stopped Journal Service. [ 22.029362] systemd[1]: Starting Journal Service... Starting Journal Service... [ 22.038557] systemd[1]: Created slice User and Session Slice. [ OK ] Created slice User and Session Slice. [ OK ] Reached target rpc_pipefs.target. [ OK ] Reached target Slices. [ OK ] Listening on initctl Compatibility Named Pipe. [ OK ] Created slice system-getty.slice. [ OK ] Listening on Process Core Dump Socket. Starting Remount Root and Kernel File Systems... [ OK ] Created slice system-sshd\x2dkeygen.slice. [FAILED] Failed to set up automount Arbitrar…rmats File System Automount Point. See 'systemctl status proc-sys-fs-binfmt_misc.automount' for details. [ OK ] Created slice system-serial\x2dgetty.slice. Starting Apply Kernel Variables... Mounting Huge Pages File System... [ OK ] Listening on RPCbind Server Activation Socket. Activating swap /dev/disk/by-label/SWAP... Starting Create list of required st…ce nodes for the current kernel... Mounting POSIX Message Queue File System... [[ 22.304789] Adding 1048572k swap on /dev/vdb. Priority:-2 extents:1 across:1048572k FS  OK ] Stopped target Switch Root. [ OK ] Stopped target Initrd Root File System. [ OK ] Listening on udev Control Socket. [ OK ] Listening on udev Kernel Socket. [ OK ] Started Forward Password Requests to Wall Directory Watch. [ OK ] Reached target RPC Port Mapper. Starting udev Coldplug all Devices... Mounting Kernel Debug File System... [ OK ] Started Dispatch Password Requests to Console Directory Watch. [ OK ] Reached target Local Encrypted Volumes. [ OK ] Reached target Paths. [ OK ] Stopped target Initrd File Systems. [FAILED] Failed to start Remount Root and Kernel File Systems. See 'systemctl status systemd-remount-fs.service' for details. [ OK ] Started Journal Service. [ OK ] Started Apply Kernel Variables. [ OK ] Mounted Huge Pages File System. [ OK ] Activated swap /dev/disk/by-label/SWAP. [ OK ] Started Create list of required sta…vice nodes for the current kernel. [ OK ] Mounted POSIX Message Queue File System. [ OK ] Mounted Kernel Debug File System. [ OK ] Reached target Swap. Starting Configure read-only root support... Starting Flush Journal to Persistent Storage... Starting Create Static Device Nodes in /dev... [ OK ] Started Flush Journal to Persistent Storage. [ OK ] Started udev Coldplug all Devices. [ OK ] Started Create Static Device Nodes in /dev. Starting udev Kernel Device Manager... [ OK ] Reached target Local File Systems (Pre). Mounting /mnt... Mounting /home/green/git/lustre-release... [ OK ] Mounted /mnt. [ 23.208358] squashfs: version 4.0 (2009/01/31) Phillip Lougher [ OK ] Mounted /home/green/git/lustre-release. [ OK ] Started udev Kernel Device Manager. [ 23.980551] piix4_smbus 0000:00:01.3: SMBus Host Controller at 0x700, revision 0 [ 24.067754] input: PC Speaker as /devices/platform/pcspkr/input/input5 [ 24.535274] RAPL PMU: API unit is 2^-32 Joules, 0 fixed counters, 10737418240 ms ovfl timer [ 24.606864] EDAC sbridge: Ver: 1.1.2 [ 28.042049] Key type dns_resolver registered [* ] A start job is running for Configur…-only root support (6s / no limit)[ 28.724642] NFS: Registering the id_resolver key type [ 28.729405] Key type id_resolver registered [ 28.733141] Key type id_legacy registered [ OK ] Started Configure read-only root support. [ OK ] Reached target Local File Systems. Starting Mark the need to relabel after reboot... Starting Rebuild Dynamic Linker Cache... Starting Create Volatile Files and Directories... Starting Load/Save Random Seed... [ OK ] Started Mark the need to relabel after reboot. [ OK ] Started Load/Save Random Seed. [ OK ] Started Create Volatile Files and Directories. Starting Update UTMP about System Boot/Shutdown... Starting RPC Bind... [ OK ] Started Update UTMP about System Boot/Shutdown. [ OK ] Started RPC Bind. [ OK ] Started Rebuild Dynamic Linker Cache. Starting Update is Completed... [ OK ] Started Update is Completed. [ OK ] Reached target System Initialization. [ OK ] Started daily update of the root trust anchor for DNSSEC. [ OK ] Started dnf makecache --timer. [ OK ] Listening on D-Bus System Message Bus Socket. [ OK ] Reached target Sockets. [ OK ] Reached target Basic System. [ OK ] Started irqbalance daemon. [ OK ] Started Hardware RNG Entropy Gatherer Daemon. Starting Login Service... [ OK ] Started D-Bus System Message Bus. Starting Network Manager... Starting Restore /run/initramfs on shutdown... [ OK ] Reached target sshd-keygen.target. [ OK ] Started Daily Cleanup of Temporary Directories. [ OK ] Reached target Timers. [ OK ] Started Restore /run/initramfs on shutdown. [ OK ] Started Network Manager. [ OK ] Reached target Network. Starting Dynamic System Tuning Daemon... Starting OpenSSH server daemon... Starting GSSAPI Proxy Daemon... Starting Network Manager Wait Online... [ OK ] Started OpenSSH server daemon. [ OK ] Started GSSAPI Proxy Daemon. [ OK ] Started Login Service. [ OK ] Reached target NFS client services. [ OK ] Reached target Remote File Systems (Pre). [ OK ] Reached target Remote File Systems. Starting Permit User Sessions... Starting Hostname Service... [ OK ] Started Permit User Sessions. [ OK ] Started Serial Getty on ttyS1. [ OK ] Started Serial Getty on ttyS0. [ OK ] Started Getty on tty1. [ OK ] Reached target Login Prompts. [ OK ] Started Command Scheduler. [ OK ] Started Hostname Service. Starting Network Manager Script Dispatcher Service... [ OK ] Started Network Manager Script Dispatcher Service. [ OK ] Started Network Manager Wait Online. [ OK ] Reached target Network is Online. Starting Notify NFS peers of a restart... Starting Crash recovery kernel arming... Starting System Logging Service... [ OK ] Started Notify NFS peers of a restart. [ OK ] Started System Logging Service. Starting Authorization Manager... [ OK ] Started Dynamic System Tuning Daemon. [ OK ] Reached target Multi-User System. [ OK ] Reached target Graphical Interface. Starting Update UTMP about System Runlevel Changes... [ OK ] Started Update UTMP about System Runlevel Changes. [ OK ] Started Authorization Manager. Rocky Linux 8.10 (Green Obsidian) Kernel 4.18.0rh8.10-debug on an x86_64 oleg139-server login: [ 52.320618] spl: loading out-of-tree module taints kernel. [ 55.643576] ZFS: Loaded module v2.3.2-1, ZFS pool version 5000, ZFS filesystem version 5 [ 62.100685] Key type ._llcrypt registered [ 62.102312] Key type .llcrypt registered [ 62.158537] Lustre: DEBUG MARKER: oleg139-server.virtnet: executing set_hostid [ 73.065884] Lustre: DEBUG MARKER: oleg139-server.virtnet: executing load_modules_local [ 74.104592] libcfs: HW NUMA nodes: 1, HW CPU cores: 4, npartitions: 1 [ 74.117487] alg: No test for adler32 (adler32-zlib) [ 75.354734] Lustre: Lustre: Build Version: 2.17.51_1_gb548ff5 [ 75.952485] LNet: Added LNI 192.168.201.139@tcp [8/256/0/180] [ 77.617619] Key type lgssc registered [ 78.791655] Lustre: Echo OBD driver; http://www.lustre.org/ [ 85.191347] vdc: vdc1 vdc9 [ 90.255815] vde: vde1 vde9 [ 96.141439] vdf: vdf1 vdf9 [ 105.916467] Lustre: DEBUG MARKER: oleg139-server.virtnet: executing load_modules_local [ 110.649252] Lustre: lustre-MDT0000: mounting server target with '-t lustre' deprecated, use '-t lustre_tgt' [ 111.817737] Lustre: Setting parameter lustre-MDT0000.mdt.identity_upcall=/home/green/git/lustre-release/lustre/utils/l_getidentity in log lustre-MDT0000 [ 111.965295] Lustre: ctl-lustre-MDT0000: No data found on store. Initialize space. [ 112.020821] Lustre: lustre-MDT0000: new disk, initializing [ 112.254769] Lustre: lustre-MDT0000: Imperative Recovery not enabled, recovery window 60-180 [ 112.293604] Lustre: ctl-lustre-MDT0000: super-sequence allocation rc = 0 [0x0000000200000400-0x0000000240000400]:0:mdt [ 114.198373] Lustre: DEBUG MARKER: oleg139-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all [ 117.044797] Lustre: Modifying parameter general.debug_raw_pointers=Y in log params [ 120.088282] Lustre: lustre-OST0000: new disk, initializing [ 120.092639] Lustre: srv-lustre-OST0000: No data found on store. Initialize space. [ 120.097126] Lustre: Skipped 1 previous similar message [ 120.159246] Lustre: lustre-OST0000: Imperative Recovery not enabled, recovery window 60-180 [ 120.495408] Lustre: ctl-lustre-MDT0000: super-sequence allocation rc = 0 [0x0000000240000400-0x0000000280000400]:0:ost [ 120.499697] Lustre: cli-lustre-OST0000-super: Allocated super-sequence [0x0000000240000400-0x0000000280000400]:0:ost] [ 120.559749] Lustre: lustre-OST0000-osc-MDT0000: update sequence from 0x100000000 to 0x240000400 [ 123.153437] Lustre: DEBUG MARKER: oleg139-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all [ 129.031881] Lustre: lustre-OST0001: new disk, initializing [ 129.034423] Lustre: srv-lustre-OST0001: No data found on store. Initialize space. [ 129.078373] Lustre: lustre-OST0001: Imperative Recovery not enabled, recovery window 60-180 [ 132.989106] Lustre: DEBUG MARKER: oleg139-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all [ 137.766195] Lustre: ctl-lustre-MDT0000: super-sequence allocation rc = 0 [0x0000000280000400-0x00000002c0000400]:1:ost [ 137.771073] Lustre: cli-lustre-OST0001-super: Allocated super-sequence [0x0000000280000400-0x00000002c0000400]:1:ost] [ 137.854072] Lustre: lustre-OST0001-osc-MDT0000: update sequence from 0x100010000 to 0x280000400 [ 140.339679] Lustre: DEBUG MARKER: Using TIMEOUT=20 [ 144.653260] Lustre: Setting parameter general.lod.*.mdt_hash=crush in log params [ 152.578770] Lustre: DEBUG MARKER: oleg139-server.virtnet: executing check_logdir /tmp/testlogs/ [ 155.109319] Lustre: DEBUG MARKER: oleg139-server.virtnet: executing yml_node [ 157.664585] Lustre: DEBUG MARKER: Client: 2.17.51.1 [ 159.086514] Lustre: DEBUG MARKER: MDS: 2.17.51.1 [ 160.646829] Lustre: DEBUG MARKER: OSS: 2.17.51.1 [ 161.524482] Lustre: DEBUG MARKER: -----============= acceptance-small: recovery-small ============----- Mon Mar 16 09:38:36 EDT 2026 [ 170.722169] Lustre: DEBUG MARKER: excepting tests: 136 [ 171.820571] Lustre: DEBUG MARKER: === recovery-small: start setup 09:38:47 (1773668327) === [ 174.046563] Lustre: DEBUG MARKER: oleg139-client.virtnet: executing check_config_client /mnt/lustre [ 183.364812] Lustre: DEBUG MARKER: Using TIMEOUT=20 [ 185.372198] Lustre: 10992:0:(mgs_llog.c:1348:mgs_modify_param()) MGS: modify general/lod.*.mdt_hash=crush (mode = 0) failed: rc = -17 [ 187.255738] Lustre: DEBUG MARKER: oleg139-server.virtnet: executing set_default_debug -1 all [ 189.136550] Lustre: DEBUG MARKER: === recovery-small: finish setup 09:39:04 (1773668344) === [ 190.128620] Lustre: DEBUG MARKER: == recovery-small test 1: create, chmod, stat: drop req, drop rep ========================================================== 09:39:05 (1773668345) [ 190.781978] Lustre: *** cfs_fail_loc=123, val=2147483648*** [ 206.372907] Lustre: lustre-MDT0000: Client 23cc2862-3913-41ae-9c10-e71c229b7702 (at 192.168.201.39@tcp) reconnecting [ 207.558706] Lustre: *** cfs_fail_loc=119, val=2147483648*** [ 207.562095] LustreError: 5762:0:(ldlm_lib.c:3325:target_send_reply_msg()) @@@ dropping reply req@ffff8dcb6f08d880 x1859825927466624/t4294967298(0) o36->23cc2862-3913-41ae-9c10-e71c229b7702@192.168.201.39@tcp:479/0 lens 520/448 e 0 to 0 dl 1773668374 ref 1 fl Interpret:/200/0 rc 0/0 job:'mcreate.0' uid:0 gid:0 projid:4294967295 [ 223.773565] Lustre: lustre-MDT0000: Client 23cc2862-3913-41ae-9c10-e71c229b7702 (at 192.168.201.39@tcp) reconnecting [ 223.789976] Lustre: 7975:0:(mdt_recovery.c:102:mdt_req_from_lrd()) @@@ restoring transno req@ffff8dcb753f0380 x1859825927466624/t4294967298(0) o36->23cc2862-3913-41ae-9c10-e71c229b7702@192.168.201.39@tcp:496/0 lens 520/2880 e 0 to 0 dl 1773668391 ref 1 fl Interpret:/202/0 rc 0/0 job:'mcreate.0' uid:0 gid:0 projid:4294967295 [ 224.995557] Lustre: *** cfs_fail_loc=123, val=2147483648*** [ 241.181997] Lustre: lustre-MDT0000: Client 23cc2862-3913-41ae-9c10-e71c229b7702 (at 192.168.201.39@tcp) reconnecting [ 242.461161] Lustre: *** cfs_fail_loc=119, val=2147483648*** [ 242.463804] LustreError: 5763:0:(ldlm_lib.c:3325:target_send_reply_msg()) @@@ dropping reply req@ffff8dcb7b7c1c00 x1859825927470080/t4294967300(0) o36->23cc2862-3913-41ae-9c10-e71c229b7702@192.168.201.39@tcp:514/0 lens 488/456 e 0 to 0 dl 1773668409 ref 1 fl Interpret:/200/0 rc 0/0 job:'tchmod.0' uid:0 gid:0 projid:4294967295 [ 258.076105] Lustre: lustre-MDT0000: Client 23cc2862-3913-41ae-9c10-e71c229b7702 (at 192.168.201.39@tcp) reconnecting [ 258.092644] Lustre: 7435:0:(mdt_recovery.c:102:mdt_req_from_lrd()) @@@ restoring transno req@ffff8dcb7b7c3b80 x1859825927470080/t4294967300(0) o36->23cc2862-3913-41ae-9c10-e71c229b7702@192.168.201.39@tcp:530/0 lens 488/3152 e 0 to 0 dl 1773668425 ref 1 fl Interpret:/202/0 rc 0/0 job:'tchmod.0' uid:0 gid:0 projid:4294967295 [ 259.420368] Lustre: *** cfs_fail_loc=123, val=2147483648*** [ 274.972507] Lustre: lustre-MDT0000: Client 23cc2862-3913-41ae-9c10-e71c229b7702 (at 192.168.201.39@tcp) reconnecting [ 276.112160] Lustre: *** cfs_fail_loc=122, val=2147483648*** [ 276.115675] LustreError: 7435:0:(ldlm_lib.c:3325:target_send_reply_msg()) @@@ dropping reply req@ffff8dcb46275f80 x1859825927472640/t0(0) o34->23cc2862-3913-41ae-9c10-e71c229b7702@192.168.201.39@tcp:548/0 lens 472/464 e 0 to 0 dl 1773668443 ref 1 fl Interpret:/600/0 rc 0/0 job:'statone.0' uid:0 gid:0 projid:0 [ 292.382233] Lustre: lustre-MDT0000: Client 23cc2862-3913-41ae-9c10-e71c229b7702 (at 192.168.201.39@tcp) reconnecting [ 296.021435] Lustre: DEBUG MARKER: == recovery-small test 4: open: drop req, drop rep ======= 09:40:51 (1773668451) [ 296.579486] Lustre: *** cfs_fail_loc=123, val=2147483648*** [ 312.861146] Lustre: lustre-MDT0000: Client 23cc2862-3913-41ae-9c10-e71c229b7702 (at 192.168.201.39@tcp) reconnecting [ 314.008123] Lustre: *** cfs_fail_loc=122, val=2147483648*** [ 314.012232] LustreError: 5766:0:(ldlm_lib.c:3325:target_send_reply_msg()) @@@ dropping reply req@ffff8dcb44fe6d80 x1859825927478016/t4294967306(0) o35->23cc2862-3913-41ae-9c10-e71c229b7702@192.168.201.39@tcp:586/0 lens 392/456 e 0 to 0 dl 1773668481 ref 1 fl Interpret:/600/0 rc 0/0 job:'cat.0' uid:0 gid:0 projid:0 [ 330.250025] Lustre: 5766:0:(mdt_recovery.c:102:mdt_req_from_lrd()) @@@ restoring transno req@ffff8dcb7b2b8a80 x1859825927478016/t4294967306(0) o35->23cc2862-3913-41ae-9c10-e71c229b7702@192.168.201.39@tcp:602/0 lens 392/456 e 0 to 0 dl 1773668497 ref 1 fl Interpret:/602/0 rc 0/0 job:'cat.0' uid:0 gid:0 projid:0 [ 334.076242] Lustre: DEBUG MARKER: == recovery-small test 5: rename: drop req, drop rep ===== 09:41:29 (1773668489) [ 334.676761] Lustre: *** cfs_fail_loc=123, val=2147483648*** [ 350.239332] Lustre: lustre-MDT0000: Client 23cc2862-3913-41ae-9c10-e71c229b7702 (at 192.168.201.39@tcp) reconnecting [ 350.244877] Lustre: Skipped 1 previous similar message [ 351.425573] Lustre: *** cfs_fail_loc=119, val=2147483648*** [ 351.428713] LustreError: 5777:0:(ldlm_lib.c:3325:target_send_reply_msg()) @@@ dropping reply req@ffff8dca43f89f80 x1859825927484032/t4294967310(0) o36->23cc2862-3913-41ae-9c10-e71c229b7702@192.168.201.39@tcp:623/0 lens 552/456 e 0 to 0 dl 1773668518 ref 1 fl Interpret:/200/0 rc 0/0 job:'mv.0' uid:0 gid:0 projid:4294967295 [ 367.627770] Lustre: 5776:0:(mdt_recovery.c:102:mdt_req_from_lrd()) @@@ restoring transno req@ffff8dcb7b7e0e00 x1859825927484032/t4294967310(0) o36->23cc2862-3913-41ae-9c10-e71c229b7702@192.168.201.39@tcp:639/0 lens 552/2888 e 0 to 0 dl 1773668534 ref 1 fl Interpret:/202/0 rc 0/0 job:'mv.0' uid:0 gid:0 projid:4294967295 [ 371.871094] Lustre: DEBUG MARKER: == recovery-small test 6: link, unlink: drop req, drop rep ========================================================== 09:42:07 (1773668527) [ 372.473885] Lustre: *** cfs_fail_loc=123, val=2147483648*** [ 390.369037] hrtimer: interrupt took 3003406 ns [ 390.674582] Lustre: *** cfs_fail_loc=119, val=2147483648*** [ 390.680191] LustreError: 5764:0:(ldlm_lib.c:3325:target_send_reply_msg()) @@@ dropping reply req@ffff8dcb7b42c380 x1859825927490816/t4294967315(0) o36->23cc2862-3913-41ae-9c10-e71c229b7702@192.168.201.39@tcp:662/0 lens 512/440 e 0 to 0 dl 1773668557 ref 1 fl Interpret:/200/0 rc 0/0 job:'link.0' uid:0 gid:0 projid:4294967295 [ 406.043583] Lustre: 7435:0:(mdt_recovery.c:102:mdt_req_from_lrd()) @@@ restoring transno req@ffff8dcb75288e00 x1859825927490816/t4294967315(0) o36->23cc2862-3913-41ae-9c10-e71c229b7702@192.168.201.39@tcp:678/0 lens 512/440 e 0 to 0 dl 1773668573 ref 1 fl Interpret:/202/0 rc 0/0 job:'link.0' uid:0 gid:0 projid:4294967295 [ 409.376963] Lustre: *** cfs_fail_loc=123, val=2147483648*** [ 425.104279] Lustre: lustre-MDT0000: Client 23cc2862-3913-41ae-9c10-e71c229b7702 (at 192.168.201.39@tcp) reconnecting [ 425.125560] Lustre: Skipped 3 previous similar messages [ 428.442068] Lustre: *** cfs_fail_loc=119, val=2147483648*** [ 428.451325] LustreError: 7435:0:(ldlm_lib.c:3325:target_send_reply_msg()) @@@ dropping reply req@ffff8dcb7c8bbb80 x1859825927495168/t4294967317(0) o36->23cc2862-3913-41ae-9c10-e71c229b7702@192.168.201.39@tcp:700/0 lens 504/456 e 0 to 0 dl 1773668595 ref 1 fl Interpret:/200/0 rc 0/0 job:'unlink.0' uid:0 gid:0 projid:4294967295 [ 444.964547] Lustre: 7435:0:(mdt_recovery.c:102:mdt_req_from_lrd()) @@@ restoring transno req@ffff8dca449c4700 x1859825927495168/t4294967317(0) o36->23cc2862-3913-41ae-9c10-e71c229b7702@192.168.201.39@tcp:717/0 lens 504/2888 e 0 to 0 dl 1773668612 ref 1 fl Interpret:/202/0 rc 0/0 job:'unlink.0' uid:0 gid:0 projid:4294967295 [ 457.756733] Lustre: DEBUG MARKER: == recovery-small test 8: touch: drop rep (bug 1423) ===== 09:43:32 (1773668612) [ 475.670045] Lustre: 7435:0:(mdt_recovery.c:102:mdt_req_from_lrd()) @@@ restoring transno req@ffff8dcb42dba300 x1859825927498880/t4294967320(0) o36->23cc2862-3913-41ae-9c10-e71c229b7702@192.168.201.39@tcp:747/0 lens 488/3152 e 0 to 0 dl 1773668642 ref 1 fl Interpret:/202/0 rc 0/0 job:'touch.0' uid:0 gid:0 projid:4294967295 [ 482.824979] Lustre: DEBUG MARKER: == recovery-small test 9: pause bulk on OST (bug 1420) === 09:43:57 (1773668637) [ 484.674173] LustreError: 12292:0:(tgt_handler.c:2735:tgt_brw_write()) cfs_fail_timeout id 214 sleeping for 5000ms [ 489.680147] LustreError: 12292:0:(tgt_handler.c:2735:tgt_brw_write()) cfs_fail_timeout id 214 awake [ 496.592069] Lustre: DEBUG MARKER: == recovery-small test 10a: finish request on server after client eviction (bug 1521) ========================================================== 09:44:11 (1773668651) [ 513.504749] Lustre: 7435:0:(client.c:2478:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1773668653/real 1773668653] req@ffff8dcb6fffca80 x1859825940138496/t0(0) o104->lustre-MDT0000@192.168.201.39@tcp:15/16 lens 328/224 e 0 to 1 dl 1773668669 ref 1 fl Rpc:XQr/0/ffffffff rc 0/-1 job:'' uid:4294967295 gid:4294967295 projid:4294967295 [ 528.864385] Lustre: 7435:0:(client.c:2478:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1773668669/real 1773668669] req@ffff8dcb6fffca80 x1859825940138496/t0(0) o104->lustre-MDT0000@192.168.201.39@tcp:15/16 lens 328/224 e 0 to 1 dl 1773668685 ref 1 fl Rpc:XQr/2/ffffffff rc 0/-1 job:'' uid:4294967295 gid:4294967295 projid:4294967295 [ 537.568733] Lustre: mdt00_003: service thread pid 7435 was inactive for 40.619 seconds. The thread might be hung, or it might only be slow and will resume later. Dumping the stack trace for debugging purposes: [ 537.590734] task:mdt00_003 state:I stack:0 pid:7435 ppid:2 flags:0x80004000 [ 537.598915] Call Trace: [ 537.603591] __schedule+0x351/0xcb0 [ 537.611985] schedule+0xc0/0x180 [ 537.615043] schedule_timeout+0xb4/0x190 [ 537.618453] ? __next_timer_interrupt+0x160/0x160 [ 537.622342] wait_woken+0x9c/0xd0 [ 537.629701] ptlrpc_set_wait+0x3af/0xa50 [ptlrpc] [ 537.634063] ? do_wait_intr+0xf0/0xf0 [ 537.640423] ldlm_run_ast_work+0x10d/0x4d0 [ptlrpc] [ 537.647303] ldlm_handle_conflict_lock+0x97/0x490 [ptlrpc] [ 537.652090] ldlm_lock_enqueue+0x321/0xcd0 [ptlrpc] [ 537.655913] ldlm_cli_enqueue_local+0x6fe/0xbd0 [ptlrpc] [ 537.659372] ? ldlm_cli_enqueue_local+0xbd0/0xbd0 [ptlrpc] [ 537.663490] ? mdt_object_put+0x130/0x130 [mdt] [ 537.665996] mdt_object_lock_internal+0x20b/0x5a0 [mdt] [ 537.673660] ? mdt_object_put+0x130/0x130 [mdt] [ 537.677554] ? ldlm_cli_enqueue_local+0xbd0/0xbd0 [ptlrpc] [ 537.685581] mdt_object_lock+0x9e/0x240 [mdt] [ 537.689101] mdt_object_stripes_lock+0x28b/0x670 [mdt] [ 537.694264] mdt_reint_setattr+0xf58/0x1f90 [mdt] [ 537.699974] mdt_reint_rec+0x139/0x2b0 [mdt] [ 537.703729] mdt_reint_internal+0x6a0/0xdc0 [mdt] [ 537.707369] mdt_reint+0x163/0x190 [mdt] [ 537.710109] tgt_handle_request0+0x137/0xaf0 [ptlrpc] [ 537.715051] tgt_request_handle+0x573/0x1e70 [ptlrpc] [ 537.718812] ptlrpc_server_handle_request+0x443/0x13b0 [ptlrpc] [ 537.722595] ? lprocfs_counter_add+0x15b/0x210 [obdclass] [ 537.726320] ptlrpc_main+0xce8/0x1400 [ptlrpc] [ 537.731104] ? ptlrpc_wait_event+0x690/0x690 [ptlrpc] [ 537.734215] kthread+0x1d1/0x200 [ 537.737015] ? set_kthread_struct+0x70/0x70 [ 537.739914] ret_from_fork+0x1f/0x30 [ 545.248191] Lustre: 7435:0:(client.c:2478:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1773668685/real 1773668685] req@ffff8dcb6fffca80 x1859825940138496/t0(0) o104->lustre-MDT0000@192.168.201.39@tcp:15/16 lens 328/224 e 0 to 1 dl 1773668701 ref 1 fl Rpc:XQr/2/ffffffff rc 0/-1 job:'' uid:4294967295 gid:4294967295 projid:4294967295 [ 561.632205] Lustre: 7435:0:(client.c:2478:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1773668701/real 1773668701] req@ffff8dcb6fffca80 x1859825940138496/t0(0) o104->lustre-MDT0000@192.168.201.39@tcp:15/16 lens 328/224 e 0 to 1 dl 1773668717 ref 1 fl Rpc:XQr/2/ffffffff rc 0/-1 job:'' uid:4294967295 gid:4294967295 projid:4294967295 [ 576.992182] Lustre: 7435:0:(client.c:2478:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1773668717/real 1773668717] req@ffff8dcb6fffca80 x1859825940138496/t0(0) o104->lustre-MDT0000@192.168.201.39@tcp:15/16 lens 328/224 e 0 to 1 dl 1773668733 ref 1 fl Rpc:XQr/2/ffffffff rc 0/-1 job:'' uid:4294967295 gid:4294967295 projid:4294967295 [ 593.385207] Lustre: 7435:0:(client.c:2478:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1773668733/real 1773668733] req@ffff8dcb6fffca80 x1859825940138496/t0(0) o104->lustre-MDT0000@192.168.201.39@tcp:15/16 lens 328/224 e 0 to 1 dl 1773668749 ref 1 fl Rpc:XQr/2/ffffffff rc 0/-1 job:'' uid:4294967295 gid:4294967295 projid:4294967295 [ 608.737769] LustreError: 7435:0:(ldlm_lockd.c:727:ldlm_handle_ast_error()) ### client (nid 192.168.201.39@tcp) failed to reply to blocking AST (req@00000000da450ade x1859825940138496 status 0 rc -110), evict it ns: mdt-lustre-MDT0000_UUID lock: ffff8dcb7527e200/0x92ba224dac0eedf0 lrc: 4/0,0 mode: PR/PR res: [0x200000007:0x1:0x0].0x0 bits 0x13/0x0 rrc: 3 type: IBT gid 0 flags: 0x60200400000020 nid: 192.168.201.39@tcp remote: 0xe59155cbed4c3413 expref: 9 pid: 5764 timeout: 693 lvb_type: 0 lru_score: 0 lru_type: 0 [ 608.788833] LustreError: lustre-MDT0000: A client on nid 192.168.201.39@tcp was evicted due to a lock blocking callback time out: rc -110 [ 608.799582] LustreError: 5752:0:(ldlm_lockd.c:255:expired_lock_main()) ### lock callback timer expired after 15s: evicting client at 192.168.201.39@tcp ns: mdt-lustre-MDT0000_UUID lock: ffff8dcb7527e200/0x92ba224dac0eedf0 lrc: 3/0,0 mode: PR/PR res: [0x200000007:0x1:0x0].0x0 bits 0x13/0x0 rrc: 3 type: IBT gid 0 flags: 0x60200400000020 nid: 192.168.201.39@tcp remote: 0xe59155cbed4c3413 expref: 10 pid: 5764 timeout: 0 lvb_type: 0 lru_score: 0 lru_type: 0 [ 608.849556] Lustre: mdt00_003: service thread pid 7435 completed after 111.900s. This likely indicates the system was overloaded (too many service threads, or not enough hardware resources). [ 617.050750] Lustre: DEBUG MARKER: == recovery-small test 10b: re-send BL AST =============== 09:46:11 (1773668771) [ 632.801537] Lustre: 5764:0:(client.c:2478:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1773668773/real 1773668773] req@ffff8dca44a61f80 x1859825940162816/t0(0) o104->lustre-MDT0000@192.168.201.39@tcp:15/16 lens 328/224 e 0 to 1 dl 1773668789 ref 1 fl Rpc:XQr/0/ffffffff rc 0/-1 job:'' uid:4294967295 gid:4294967295 projid:4294967295 [ 632.819197] Lustre: 5764:0:(client.c:2478:ptlrpc_expire_one_request()) Skipped 1 previous similar message [ 639.954303] Lustre: DEBUG MARKER: == recovery-small test 10c: re-send BL AST vs reconnect race (LU-5569) ========================================================== 09:46:34 (1773668794) [ 641.314401] Lustre: lustre-MDT0000: Client 23cc2862-3913-41ae-9c10-e71c229b7702 (at 192.168.201.39@tcp) reconnecting [ 641.320038] Lustre: Skipped 2 previous similar messages [ 646.328269] Lustre: DEBUG MARKER: == recovery-small test 10d: test failed blocking ast ===== 09:46:41 (1773668801) [ 650.804316] LustreError: 6590:0:(ldlm_lockd.c:727:ldlm_handle_ast_error()) ### client (nid 192.168.201.39@tcp) returned error from blocking AST (req@00000000b5fe5610 x1859825940171648 status -71 rc -71), evict it ns: filter-lustre-OST0000_UUID lock: 000000002da22ab6/0x92ba224dac0ef146 lrc: 4/0,0 mode: PW/PW res: [0x240000400:0x7:0x0].0x0 rrc: 3 type: EXT [0->18446744073709551615] (req 0->18446744073709551615) gid 0 flags: 0x60000480000020 nid: 192.168.201.39@tcp remote: 0xe59155cbed4c35d3 expref: 6 pid: 6544 timeout: 750 lvb_type: 0 lru_score: 0 lru_type: 0 [ 650.834779] LustreError: lustre-OST0000: A client on nid 192.168.201.39@tcp was evicted due to a lock blocking callback time out: rc -71 [ 650.840866] LustreError: 5752:0:(ldlm_lockd.c:255:expired_lock_main()) ### lock callback timer expired after 0s: evicting client at 192.168.201.39@tcp ns: filter-lustre-OST0000_UUID lock: 000000002da22ab6/0x92ba224dac0ef146 lrc: 3/0,0 mode: PW/PW res: [0x240000400:0x7:0x0].0x0 rrc: 3 type: EXT [0->18446744073709551615] (req 0->18446744073709551615) gid 0 flags: 0x60000480000020 nid: 192.168.201.39@tcp remote: 0xe59155cbed4c35d3 expref: 7 pid: 6544 timeout: 0 lvb_type: 0 lru_score: 0 lru_type: 0 [ 659.845900] Lustre: DEBUG MARKER: == recovery-small test 10e: re-send BL AST vs reconnect race 2 ========================================================== 09:46:54 (1773668814) [ 661.185406] Lustre: DEBUG MARKER: SKIP: recovery-small test_10e need two clients [ 662.744902] Lustre: DEBUG MARKER: == recovery-small test 11: wake up a thread waiting for completion after eviction (b=2460) ========================================================== 09:46:57 (1773668817) [ 678.881264] Lustre: 6590:0:(client.c:2478:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1773668819/real 1773668819] req@ffff8dcb7b7c0a80 x1859825940174336/t0(0) o104->lustre-OST0001@192.168.201.39@tcp:15/16 lens 328/224 e 0 to 1 dl 1773668835 ref 1 fl Rpc:XQr/0/ffffffff rc 0/-1 job:'' uid:4294967295 gid:4294967295 projid:4294967295 [ 686.816783] Lustre: DEBUG MARKER: == recovery-small test 12: recover from timed out resend in ptlrpcd (b=2494) ========================================================== 09:47:21 (1773668841) [ 687.624501] Lustre: *** cfs_fail_loc=115, val=2147483648*** [ 730.519251] Lustre: DEBUG MARKER: == recovery-small test 13: mdc_readpage restart test (bug 1138) ========================================================== 09:48:05 (1773668885) [ 754.423733] Lustre: DEBUG MARKER: == recovery-small test 14: mdc_readpage resend test (bug 1138) ========================================================== 09:48:29 (1773668909) [ 755.717469] Lustre: *** cfs_fail_loc=106, val=0*** [ 755.722423] Lustre: Skipped 1 previous similar message [ 761.717298] Lustre: DEBUG MARKER: == recovery-small test 15: failed open (-ENOMEM) ========= 09:48:36 (1773668916) [ 762.492048] Lustre: *** cfs_fail_loc=128, val=0*** [ 768.438762] Lustre: DEBUG MARKER: == recovery-small test 16: timeout bulk put, don't evict client (2732) ========================================================== 09:48:43 (1773668923) [ 769.997541] Lustre: *** cfs_fail_loc=504, val=0*** [ 769.999746] LustreError: 6551:0:(ldlm_lib.c:3620:target_bulk_io()) @@@ truncated bulk READ 0(102400) req@ffff8dcb42db9500 x1859825927580160/t0(0) o3->3bb7fa52-952c-4b44-b5a6-0367668a8a30@192.168.201.39@tcp:287/0 lens 488/440 e 0 to 0 dl 1773668937 ref 1 fl Interpret:/600/0 rc 0/0 job:'cmp.0' uid:0 gid:0 projid:0 [ 770.039868] Lustre: lustre-OST0001: Bulk IO read error with 3bb7fa52-952c-4b44-b5a6-0367668a8a30 (at 192.168.201.39@tcp), client will retry: rc -110 [ 813.949959] Lustre: DEBUG MARKER: == recovery-small test 17a: timeout bulk get, don't evict client (2732) ========================================================== 09:49:28 (1773668968) [ 864.596221] Lustre: DEBUG MARKER: == recovery-small test 17b: timeout bulk get, dont evict client (3582) ========================================================== 09:50:19 (1773669019) [ 865.639212] Lustre: DEBUG MARKER: SKIP: recovery-small test_17b Needs multiple clients [ 866.910060] Lustre: DEBUG MARKER: == recovery-small test 18a: manual ost invalidate clears page cache immediately ========================================================== 09:50:21 (1773669021) [ 872.289824] Lustre: DEBUG MARKER: == recovery-small test 18b: eviction and reconnect clears page cache (2766) ========================================================== 09:50:27 (1773669027) [ 873.673972] Lustre: 18145:0:(genops.c:1793:obd_export_evict_by_uuid()) lustre-OST0000: evicting 3bb7fa52-952c-4b44-b5a6-0367668a8a30 at adminstrative request [ 900.825171] Lustre: DEBUG MARKER: == recovery-small test 18c: Dropped connect reply after eviction handing (14755) ========================================================== 09:50:55 (1773669055) [ 902.271611] Lustre: 18426:0:(genops.c:1793:obd_export_evict_by_uuid()) lustre-OST0000: evicting 3bb7fa52-952c-4b44-b5a6-0367668a8a30 at adminstrative request [ 904.009546] Lustre: *** cfs_fail_loc=225, val=0*** [ 904.010940] Lustre: Skipped 1 previous similar message [ 914.439855] Lustre: lustre-OST0000: Client 3bb7fa52-952c-4b44-b5a6-0367668a8a30 (at 192.168.201.39@tcp) reconnecting [ 914.451658] Lustre: Skipped 4 previous similar messages [ 921.361717] Lustre: DEBUG MARKER: == recovery-small test 19a: test expired_lock_main on mds (2867) ========================================================== 09:51:16 (1773669076) [ 922.707418] Lustre: *** cfs_fail_loc=304, val=0*** [ 937.999877] Lustre: *** cfs_fail_loc=304, val=0*** [ 954.382223] Lustre: *** cfs_fail_loc=304, val=0*** [ 970.785379] Lustre: *** cfs_fail_loc=304, val=0*** [ 986.154571] Lustre: *** cfs_fail_loc=304, val=0*** [ 1002.511345] Lustre: *** cfs_fail_loc=304, val=0*** [ 1018.901827] Lustre: *** cfs_fail_loc=304, val=0*** [ 1024.992404] LustreError: 5752:0:(ldlm_lockd.c:255:expired_lock_main()) ### lock callback timer expired after 102s: evicting client at 192.168.201.39@tcp ns: mdt-lustre-MDT0000_UUID lock: ffff8dcb6f1ef000/0x92ba224dac0ef9ce lrc: 3/0,0 mode: PR/PR res: [0x200000007:0x1:0x0].0x0 bits 0x13/0x0 rrc: 3 type: IBT gid 0 flags: 0x60200400000020 nid: 192.168.201.39@tcp remote: 0xe59155cbed4c388f expref: 16 pid: 5764 timeout: 1022 lvb_type: 0 lru_score: 0 lru_type: 0 [ 1041.357857] Lustre: DEBUG MARKER: == recovery-small test 19b: test expired_lock_main on ost (2867) ========================================================== 09:53:14 (1773669194) [ 1061.929934] Lustre: *** cfs_fail_loc=304, val=0*** [ 1061.936223] Lustre: Skipped 1 previous similar message [ 1126.448762] Lustre: *** cfs_fail_loc=304, val=0*** [ 1126.456325] Lustre: Skipped 3 previous similar messages [ 1147.872366] LustreError: 5752:0:(ldlm_lockd.c:255:expired_lock_main()) ### lock callback timer expired after 102s: evicting client at 192.168.201.39@tcp ns: filter-lustre-OST0000_UUID lock: 00000000112ddd35/0x92ba224dac0efc75 lrc: 3/0,0 mode: PW/PW res: [0x240000400:0xe:0x0].0x0 rrc: 3 type: EXT [0->18446744073709551615] (req 0->4095) gid 0 flags: 0x60000400000020 nid: 192.168.201.39@tcp remote: 0xe59155cbed4c3a02 expref: 6 pid: 6546 timeout: 1145 lvb_type: 0 lru_score: 0 lru_type: 0 [ 1163.468502] Lustre: DEBUG MARKER: == recovery-small test 19c: check reconnect and lock resend do not trigger expired_lock_main ========================================================== 09:55:17 (1773669317) [ 1183.643160] Lustre: DEBUG MARKER: == recovery-small test 20a: ldlm_handle_enqueue error (should return error) ========================================================== 09:55:38 (1773669338) [ 1192.770825] Lustre: DEBUG MARKER: == recovery-small test 20b: ldlm_handle_enqueue error (should return error) ========================================================== 09:55:47 (1773669347) [ 1202.017199] Lustre: DEBUG MARKER: == recovery-small test 21a: drop close request while close and open are both in flight ========================================================== 09:55:56 (1773669356) [ 1203.474737] LustreError: 5763:0:(mdt_open.c:1464:mdt_reint_open()) cfs_fail_timeout id 129 sleeping for 5000ms [ 1205.628738] LustreError: 5763:0:(mdt_open.c:1464:mdt_reint_open()) cfs_fail_timeout interrupted [ 1206.684717] Lustre: *** cfs_fail_loc=115, val=2147483648*** [ 1232.463082] Lustre: DEBUG MARKER: == recovery-small test 21b: drop open request while close and open are both in flight ========================================================== 09:56:26 (1773669386) [ 1381.132990] Lustre: DEBUG MARKER: == recovery-small test 21c: drop both request while close and open are both in flight ========================================================== 09:58:55 (1773669535) [ 1415.073914] Lustre: DEBUG MARKER: == recovery-small test 21d: drop close reply while close and open are both in flight ========================================================== 09:59:29 (1773669569) [ 1416.531746] LustreError: 7975:0:(mdt_open.c:1464:mdt_reint_open()) cfs_fail_timeout id 129 sleeping for 5000ms [ 1418.937547] LustreError: 7975:0:(mdt_open.c:1464:mdt_reint_open()) cfs_fail_timeout interrupted [ 1420.023373] Lustre: *** cfs_fail_loc=122, val=2147483648*** [ 1420.028155] LustreError: 14475:0:(ldlm_lib.c:3325:target_send_reply_msg()) @@@ dropping reply req@ffff8dca41a85500 x1859825927719424/t4294967559(0) o35->3bb7fa52-952c-4b44-b5a6-0367668a8a30@192.168.201.39@tcp:182/0 lens 392/456 e 0 to 0 dl 1773669587 ref 1 fl Interpret:/600/0 rc 0/0 job:'multiop.0' uid:0 gid:0 projid:0 [ 1420.074051] LustreError: 14475:0:(ldlm_lib.c:3325:target_send_reply_msg()) Skipped 1 previous similar message [ 1436.167515] Lustre: lustre-MDT0000: Client 3bb7fa52-952c-4b44-b5a6-0367668a8a30 (at 192.168.201.39@tcp) reconnecting [ 1436.185100] Lustre: Skipped 15 previous similar messages [ 1436.205821] Lustre: 5765:0:(mdt_recovery.c:102:mdt_req_from_lrd()) @@@ restoring transno req@ffff8dcb7036dc50 x1859825927719424/t4294967559(0) o35->3bb7fa52-952c-4b44-b5a6-0367668a8a30@192.168.201.39@tcp:198/0 lens 392/456 e 0 to 0 dl 1773669603 ref 1 fl Interpret:/602/0 rc 0/0 job:'multiop.0' uid:0 gid:0 projid:0 [ 1446.373931] Lustre: DEBUG MARKER: == recovery-small test 21e: drop open reply while close and open are both in flight ========================================================== 10:00:00 (1773669600) [ 1447.681099] LustreError: 14463:0:(ldlm_lib.c:3325:target_send_reply_msg()) @@@ dropping reply req@ffff8dca44a60380 x1859825927729792/t4294967576(0) o36->3bb7fa52-952c-4b44-b5a6-0367668a8a30@192.168.201.39@tcp:331/0 lens 488/456 e 0 to 0 dl 1773669736 ref 1 fl Interpret:/200/0 rc 0/0 job:'touch.0' uid:0 gid:0 projid:4294967295 [ 1585.213357] Lustre: 5762:0:(mdt_recovery.c:102:mdt_req_from_lrd()) @@@ restoring transno req@ffff8dca41a87b80 x1859825927729792/t4294967576(0) o36->3bb7fa52-952c-4b44-b5a6-0367668a8a30@192.168.201.39@tcp:469/0 lens 488/3152 e 0 to 0 dl 1773669874 ref 1 fl Interpret:/202/0 rc 0/0 job:'touch.0' uid:0 gid:0 projid:4294967295 [ 1587.387419] Lustre: DEBUG MARKER: == recovery-small test 21f: drop both reply while close and open are both in flight ========================================================== 10:02:21 (1773669741) [ 1588.381575] Lustre: *** cfs_fail_loc=119, val=2147483648*** [ 1588.384340] Lustre: Skipped 1 previous similar message [ 1588.386052] LustreError: 7975:0:(ldlm_lib.c:3325:target_send_reply_msg()) @@@ dropping reply req@ffff8dcb7b72c380 x1859825927750400/t4294967595(0) o36->3bb7fa52-952c-4b44-b5a6-0367668a8a30@192.168.201.39@tcp:472/0 lens 488/456 e 0 to 0 dl 1773669877 ref 1 fl Interpret:/200/0 rc 0/0 job:'touch.0' uid:0 gid:0 projid:4294967295 [ 1607.190948] Lustre: 5763:0:(mdt_recovery.c:102:mdt_req_from_lrd()) @@@ restoring transno req@ffff8dcb75289500 x1859825927750400/t4294967595(0) o36->3bb7fa52-952c-4b44-b5a6-0367668a8a30@192.168.201.39@tcp:491/0 lens 488/3152 e 0 to 0 dl 1773669896 ref 1 fl Interpret:/202/0 rc 0/0 job:'touch.0' uid:0 gid:0 projid:4294967295 [ 1607.228465] Lustre: 5763:0:(mdt_recovery.c:102:mdt_req_from_lrd()) Skipped 1 previous similar message [ 1614.678661] Lustre: DEBUG MARKER: == recovery-small test 21g: drop open reply and close request while close and open are both in flight ========================================================== 10:02:49 (1773669769) [ 1615.729274] LustreError: 5763:0:(ldlm_lib.c:3325:target_send_reply_msg()) @@@ dropping reply req@ffff8dcb47e89180 x1859825927762048/t4294967614(0) o36->3bb7fa52-952c-4b44-b5a6-0367668a8a30@192.168.201.39@tcp:500/0 lens 488/456 e 0 to 0 dl 1773669905 ref 1 fl Interpret:/200/0 rc 0/0 job:'touch.0' uid:0 gid:0 projid:4294967295 [ 1615.745326] LustreError: 5763:0:(ldlm_lib.c:3325:target_send_reply_msg()) Skipped 1 previous similar message [ 1618.381537] Lustre: *** cfs_fail_loc=115, val=2147483648*** [ 1618.390286] Lustre: Skipped 3 previous similar messages [ 1634.839455] Lustre: 7435:0:(mdt_recovery.c:102:mdt_req_from_lrd()) @@@ restoring transno req@ffff8dca44464000 x1859825927762048/t4294967614(0) o36->3bb7fa52-952c-4b44-b5a6-0367668a8a30@192.168.201.39@tcp:519/0 lens 488/3152 e 0 to 0 dl 1773669924 ref 1 fl Interpret:/202/0 rc 0/0 job:'touch.0' uid:0 gid:0 projid:4294967295 [ 1642.686798] Lustre: DEBUG MARKER: == recovery-small test 21h: drop open request and close reply while close and open are both in flight ========================================================== 10:03:17 (1773669797) [ 1666.601711] Lustre: DEBUG MARKER: == recovery-small test 22: drop close request and do mknod ========================================================== 10:03:41 (1773669821) [ 1689.374923] Lustre: DEBUG MARKER: == recovery-small test 23: client hang when close a file after mds crash ========================================================== 10:04:04 (1773669844) [ 1697.661270] Lustre: Failing over lustre-MDT0000 [ 1697.973528] Lustre: server umount lustre-MDT0000 complete [ 1714.004178] LustreError: MGC192.168.201.139@tcp: Connection to MGS (at 0@lo) was lost; in progress operations using this service will fail [ 1714.274131] Lustre: lustre-MDT0000-lwp-OST0001: Connection to lustre-MDT0000 (at 0@lo) was lost; in progress operations using this service will wait for recovery to complete [ 1714.289715] Lustre: Skipped 1 previous similar message [ 1714.437339] Lustre: lustre-MDT0000: Imperative Recovery not enabled, recovery window 60-180 [ 1714.498489] Lustre: lustre-MDT0000: in recovery but waiting for the first client to connect [ 1715.298441] Lustre: 3299:0:(client.c:2478:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1773669855/real 1773669855] req@ffff8dca410b3100 x1859825940377984/t0(0) o400->lustre-MDT0000-lwp-OST0000@0@lo:12/10 lens 224/224 e 0 to 1 dl 1773669871 ref 1 fl Rpc:XNQr/200/ffffffff rc 0/-1 job:'kworker.0' uid:0 gid:0 projid:4294967295 [ 1717.318370] Lustre: DEBUG MARKER: oleg139-server.virtnet: executing set_default_debug -1 all [ 1719.792553] Lustre: lustre-MDT0000-lwp-OST0000: Connection restored to 0@lo (at 0@lo) [ 1721.864510] Lustre: lustre-MDT0000: Will be in recovery for at least 1:00, or until 1 client reconnects [ 1721.926016] Lustre: lustre-MDT0000: Recovery over after 0:01, of 1 clients 1 recovered and 0 were evicted. [ 1721.950900] Lustre: lustre-OST0001: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x280000400:21 to 0x280000400:65) [ 1721.955080] Lustre: lustre-OST0000: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x240000400:26 to 0x240000400:65) [ 1724.838775] Lustre: DEBUG MARKER: oleg139-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid [ 1725.665739] Lustre: 3297:0:(client.c:2478:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1773669865/real 1773669865] req@ffff8dca43f88700 x1859825940378752/t0(0) o400->lustre-MDT0000-lwp-OST0000@0@lo:12/10 lens 224/224 e 0 to 1 dl 1773669881 ref 1 fl Rpc:XNQr/200/ffffffff rc 0/-1 job:'kworker.0' uid:0 gid:0 projid:4294967295 [ 1725.696928] Lustre: 3297:0:(client.c:2478:ptlrpc_expire_one_request()) Skipped 3 previous similar messages [ 1726.285660] Lustre: DEBUG MARKER: mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec [ 1732.905694] Lustre: DEBUG MARKER: == recovery-small test 24a: fsync error (should return error) ========================================================== 10:04:47 (1773669887) [ 1734.164724] Lustre: 25148:0:(genops.c:1793:obd_export_evict_by_uuid()) lustre-OST0000: evicting 3bb7fa52-952c-4b44-b5a6-0367668a8a30 at adminstrative request [ 1739.233233] Lustre: DEBUG MARKER: == recovery-small test 24b: test dirty page discard due to client eviction ========================================================== 10:04:54 (1773669894) [ 1740.296764] Lustre: 25395:0:(genops.c:1793:obd_export_evict_by_uuid()) lustre-OST0000: evicting 3bb7fa52-952c-4b44-b5a6-0367668a8a30 at adminstrative request [ 1745.333718] Lustre: DEBUG MARKER: == recovery-small test 26a: evict dead exports =========== 10:05:00 (1773669900) [ 1746.805876] Lustre: DEBUG MARKER: SKIP: recovery-small test_26a msg and ost1 are at the same node [ 1747.873323] Lustre: DEBUG MARKER: == recovery-small test 26b: evict dead exports =========== 10:05:03 (1773669903) [ 1748.902517] Lustre: DEBUG MARKER: SKIP: recovery-small test_26b msg and ost1 are at the same node [ 1749.854956] Lustre: DEBUG MARKER: == recovery-small test 27: fail LOV while using OSC's ==== 10:05:05 (1773669905) [ 1752.200593] Lustre: Failing over lustre-MDT0000 [ 1752.438756] Lustre: server umount lustre-MDT0000 complete [ 1766.646121] LustreError: MGC192.168.201.139@tcp: Connection to MGS (at 0@lo) was lost; in progress operations using this service will fail [ 1766.862690] Lustre: lustre-MDT0000-lwp-OST0000: Connection to lustre-MDT0000 (at 0@lo) was lost; in progress operations using this service will wait for recovery to complete [ 1766.871888] Lustre: Skipped 1 previous similar message [ 1767.090604] Lustre: lustre-MDT0000: Imperative Recovery not enabled, recovery window 60-180 [ 1767.134235] Lustre: lustre-MDT0000: in recovery but waiting for the first client to connect [ 1769.210234] Lustre: DEBUG MARKER: oleg139-server.virtnet: executing set_default_debug -1 all [ 1771.104676] Lustre: 3297:0:(client.c:2478:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1773669911/real 1773669911] req@ffff8dcb6ff8bb80 x1859825940401280/t0(0) o400->lustre-MDT0000-lwp-OST0001@0@lo:12/10 lens 224/224 e 0 to 1 dl 1773669927 ref 1 fl Rpc:XNQr/200/ffffffff rc 0/-1 job:'kworker.0' uid:0 gid:0 projid:4294967295 [ 1771.119064] Lustre: 3297:0:(client.c:2478:ptlrpc_expire_one_request()) Skipped 1 previous similar message [ 1772.007551] Lustre: lustre-MDT0000-lwp-OST0001: Connection restored to 0@lo (at 0@lo) [ 1772.013331] Lustre: Skipped 1 previous similar message [ 1772.100650] Lustre: lustre-MDT0000: Will be in recovery for at least 1:00, or until 1 client reconnects [ 1772.142297] Lustre: lustre-MDT0000: Recovery over after 0:01, of 1 clients 1 recovered and 0 were evicted. [ 1772.167156] Lustre: 26317:0:(mdt_recovery.c:102:mdt_req_from_lrd()) @@@ restoring transno req@ffff8dcb75289500 x1859825927871360/t8589934822(0) o101->3bb7fa52-952c-4b44-b5a6-0367668a8a30@192.168.201.39@tcp:534/0 lens 664/3488 e 0 to 0 dl 1773669939 ref 1 fl Interpret:/602/0 rc 0/0 job:'writemany.0' uid:0 gid:0 projid:0 [ 1772.172473] Lustre: lustre-OST0000: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x240000400:103 to 0x240000400:129) [ 1772.172624] Lustre: lustre-OST0001: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x280000400:101 to 0x280000400:129) [ 1772.178694] Lustre: 26317:0:(mdt_recovery.c:102:mdt_req_from_lrd()) Skipped 1 previous similar message [ 1861.944668] Lustre: Failing over lustre-MDT0000 [ 1862.115200] Lustre: lustre-MDT0000-lwp-OST0000: Connection to lustre-MDT0000 (at 0@lo) was lost; in progress operations using this service will wait for recovery to complete [ 1862.117480] Lustre: lustre-MDT0000: Not available for connect from 0@lo (stopping) [ 1862.122432] Lustre: Skipped 1 previous similar message [ 1862.211962] Lustre: server umount lustre-MDT0000 complete [ 1875.548200] LustreError: MGC192.168.201.139@tcp: Connection to MGS (at 0@lo) was lost; in progress operations using this service will fail [ 1876.139813] Lustre: lustre-MDT0000: Imperative Recovery not enabled, recovery window 60-180 [ 1876.205732] Lustre: lustre-MDT0000: in recovery but waiting for the first client to connect [ 1878.135584] Lustre: DEBUG MARKER: oleg139-server.virtnet: executing set_default_debug -1 all [ 1879.558900] Lustre: lustre-MDT0000: Will be in recovery for at least 1:00, or until 1 client reconnects [ 1879.619950] Lustre: lustre-MDT0000: Recovery over after 0:01, of 1 clients 1 recovered and 0 were evicted. [ 1879.636478] Lustre: 30177:0:(mdt_recovery.c:102:mdt_req_from_lrd()) @@@ restoring transno req@ffff8dca429e4700 x1859825930934656/t12884911469(0) o36->3bb7fa52-952c-4b44-b5a6-0367668a8a30@192.168.201.39@tcp:641/0 lens 512/2888 e 0 to 0 dl 1773670046 ref 1 fl Interpret:/202/0 rc 0/0 job:'writemany.0' uid:0 gid:0 projid:4294967295 [ 1879.643739] Lustre: lustre-OST0000: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x240000400:1699 to 0x240000400:1729) [ 1879.643928] Lustre: lustre-OST0001: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x280000400:1699 to 0x280000400:1729) [ 1882.089463] Lustre: lustre-MDT0000-lwp-OST0001: Connection restored to 0@lo (at 0@lo) [ 1882.092816] Lustre: Skipped 1 previous similar message [ 1882.992726] Lustre: DEBUG MARKER: == recovery-small test 28: handle error adding new clients (bug 6086) ========================================================== 10:07:18 (1773670038) [ 1899.488458] Lustre: 30179:0:(client.c:2478:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1773670039/real 1773670039] req@ffff8dcb42db9500 x1859825941141760/t0(0) o104->lustre-MDT0000@192.168.201.39@tcp:15/16 lens 328/224 e 0 to 1 dl 1773670055 ref 1 fl Rpc:XQr/0/ffffffff rc 0/-1 job:'' uid:4294967295 gid:4294967295 projid:4294967295 [ 1899.506332] Lustre: 30179:0:(client.c:2478:ptlrpc_expire_one_request()) Skipped 5 previous similar messages [ 1902.474381] Lustre: Failing over lustre-MDT0000 [ 1902.562863] Lustre: lustre-MDT0000-lwp-OST0000: Connection to lustre-MDT0000 (at 0@lo) was lost; in progress operations using this service will wait for recovery to complete [ 1902.563507] Lustre: lustre-MDT0000: Not available for connect from 0@lo (stopping) [ 1902.569656] Lustre: Skipped 1 previous similar message [ 1902.575025] Lustre: Skipped 2 previous similar messages [ 1902.818920] Lustre: server umount lustre-MDT0000 complete [ 1915.922600] LustreError: MGC192.168.201.139@tcp: Connection to MGS (at 0@lo) was lost; in progress operations using this service will fail [ 1916.086735] Lustre: *** cfs_fail_loc=12f, val=0*** [ 1916.088954] LustreError: 25895:0:(tgt_lastrcvd.c:1090:tgt_client_new()) lustre-OST0000: no room for 0 clients - fix LR_MAX_CLIENTS [ 1916.093915] LustreError: lustre-OST0000-osc-MDT0000: operation ost_connect to node 0@lo failed: rc = -75 [ 1916.145836] Lustre: lustre-MDT0000: Imperative Recovery not enabled, recovery window 60-180 [ 1916.179703] Lustre: lustre-MDT0000: in recovery but waiting for the first client to connect [ 1917.879447] Lustre: DEBUG MARKER: oleg139-server.virtnet: executing set_default_debug -1 all [ 1920.079903] Lustre: lustre-MDT0000: Will be in recovery for at least 1:00, or until 1 client reconnects [ 1920.107853] Lustre: lustre-MDT0000: Recovery over after 0:01, of 1 clients 1 recovered and 0 were evicted. [ 1920.131981] Lustre: lustre-OST0000: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x240000400:1699 to 0x240000400:1761) [ 1920.131996] Lustre: lustre-OST0001: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x280000400:1731 to 0x280000400:1825) [ 1921.522455] Lustre: DEBUG MARKER: oleg139-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid [ 1922.192671] Lustre: DEBUG MARKER: mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec [ 1922.534899] Lustre: lustre-MDT0000-lwp-OST0000: Connection restored to 0@lo (at 0@lo) [ 1922.537557] Lustre: Skipped 1 previous similar message [ 1926.046991] Lustre: DEBUG MARKER: == recovery-small test 29a: error adding new clients doesn't cause LBUG (bug 22273) ========================================================== 10:08:01 (1773670081) [ 1927.064392] Lustre: Failing over lustre-MDT0000 [ 1927.282719] Lustre: server umount lustre-MDT0000 complete [ 1929.834042] LustreError: MGC192.168.201.139@tcp: Connection to MGS (at 0@lo) was lost; in progress operations using this service will fail [ 1929.985844] Lustre: *** cfs_fail_loc=711, val=0*** [ 1929.986679] Lustre: lustre-MDT0000-lwp-OST0000: Connection to lustre-MDT0000 (at 0@lo) was lost; in progress operations using this service will wait for recovery to complete [ 1929.993717] Lustre: Skipped 1 previous similar message [ 1930.049304] Lustre: lustre-MDT0000: Imperative Recovery not enabled, recovery window 60-180 [ 1930.079819] Lustre: lustre-MDT0000: in recovery but waiting for the first client to connect [ 1930.079993] Lustre: lustre-MDT0000: Aborting client recovery [ 1930.083577] LustreError: 32621:0:(ldlm_lib.c:2983:target_stop_recovery_thread()) lustre-MDT0000: Aborting recovery [ 1930.086974] Lustre: 32670:0:(ldlm_lib.c:2386:target_recovery_overseer()) recovery is aborted, evict exports in recovery [ 1930.090081] Lustre: 32670:0:(genops.c:1620:class_disconnect_stale_exports()) lustre-MDT0000: disconnect stale client 3bb7fa52-952c-4b44-b5a6-0367668a8a30@ [ 1930.094257] Lustre: lustre-MDT0000: disconnecting 1 stale clients [ 1930.105915] Lustre: lustre-MDT0000-osd: cancel update llog [0x200000400:0x1:0x0] [ 1930.146494] Lustre: lustre-OST0000: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x240000400:1699 to 0x240000400:1793) [ 1930.146493] Lustre: lustre-OST0001: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x280000400:1731 to 0x280000400:1857) [ 1931.663223] Lustre: DEBUG MARKER: oleg139-server.virtnet: executing set_default_debug -1 all [ 1935.334560] Lustre: lustre-MDT0000-lwp-OST0000: Connection restored to 0@lo (at 0@lo) [ 1935.338383] Lustre: Skipped 1 previous similar message [ 1938.504844] Lustre: DEBUG MARKER: oleg139-server.virtnet: executing wait_import_state FULL os[cp].lustre-OST0000-osc-MDT0000.ost_server_uuid 50 [ 1938.614726] Lustre: DEBUG MARKER: os[cp].lustre-OST0000-osc-MDT0000.ost_server_uuid in FULL state after 0 sec [ 1941.374390] Lustre: DEBUG MARKER: == recovery-small test 29b: error adding new clients doesn't cause LBUG (bug 22273) ========================================================== 10:08:16 (1773670096) [ 1942.444831] Lustre: Failing over lustre-OST0000 [ 1942.497293] Lustre: server umount lustre-OST0000 complete [ 1945.571135] Lustre: lustre-OST0000-osc-MDT0000: Connection to lustre-OST0000 (at 0@lo) was lost; in progress operations using this service will wait for recovery to complete [ 1945.571972] LustreError: lustre-OST0000-osc-MDT0000: operation ost_statfs to node 0@lo failed: rc = -107 [ 1945.828531] Lustre: lustre-OST0000: Imperative Recovery not enabled, recovery window 60-180 [ 1945.837037] Lustre: lustre-OST0000: in recovery but waiting for the first client to connect [ 1945.837371] Lustre: lustre-OST0000: Aborting recovery [ 1945.840498] Lustre: Skipped 2 previous similar messages [ 1945.842882] LustreError: 34020:0:(ldlm_lib.c:2983:target_stop_recovery_thread()) lustre-OST0000: Aborting recovery [ 1945.845568] Lustre: 34034:0:(ldlm_lib.c:2386:target_recovery_overseer()) recovery is aborted, evict exports in recovery [ 1945.854262] Lustre: 34034:0:(ldlm_lib.c:2386:target_recovery_overseer()) Skipped 2 previous similar messages [ 1945.858115] Lustre: 34034:0:(genops.c:1620:class_disconnect_stale_exports()) lustre-OST0000: disconnect stale client 3bb7fa52-952c-4b44-b5a6-0367668a8a30@ [ 1945.864369] Lustre: lustre-OST0000: disconnecting 2 stale clients [ 1945.892096] LustreError: 34034:0:(ofd_obd.c:1325:ofd_iocontrol()) lustre-OST0000: iocontrol from 'tgt_recover_0' cmd=c00866c1 _IOWR('f', 193, 8) unrecognized: rc = -25 [ 1946.126350] Lustre: *** cfs_fail_loc=711, val=0*** [ 1947.508273] LustreError: lustre-OST0000-osc-MDT0000: This client was evicted by lustre-OST0000; in progress operations using this service will fail. [ 1947.517627] Lustre: lustre-OST0000-osc-MDT0000: Connection restored to 0@lo (at 0@lo) [ 1947.528439] Lustre: Skipped 1 previous similar message [ 1948.616333] Lustre: DEBUG MARKER: oleg139-server.virtnet: executing set_default_debug -1 all [ 1956.285303] Lustre: DEBUG MARKER: == recovery-small test 50: failover MDS under load ======= 10:08:31 (1773670111) [ 1967.939023] Lustre: Failing over lustre-MDT0000 [ 1968.160726] Lustre: server umount lustre-MDT0000 complete [ 1981.278560] LustreError: MGC192.168.201.139@tcp: Connection to MGS (at 0@lo) was lost; in progress operations using this service will fail [ 1981.472328] Lustre: lustre-MDT0000-lwp-OST0000: Connection to lustre-MDT0000 (at 0@lo) was lost; in progress operations using this service will wait for recovery to complete [ 1981.715150] Lustre: lustre-MDT0000: Imperative Recovery not enabled, recovery window 60-180 [ 1981.844775] Lustre: lustre-MDT0000: in recovery but waiting for the first client to connect [ 1981.852183] Lustre: Skipped 2 previous similar messages [ 1982.880120] Lustre: 3299:0:(client.c:2478:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1773670083/real 1773670083] req@ffff8dca429e5180 x1859825941157504/t0(0) o400->lustre-MDT0000-lwp-OST0001@0@lo:12/10 lens 224/224 e 0 to 1 dl 1773670138 ref 1 fl Rpc:XNQr/200/ffffffff rc 0/-1 job:'kworker.0' uid:0 gid:0 projid:4294967295 [ 1983.600907] Lustre: DEBUG MARKER: oleg139-server.virtnet: executing set_default_debug -1 all [ 1986.054612] Lustre: lustre-MDT0000: Will be in recovery for at least 1:00, or until 1 client reconnects [ 1986.097114] Lustre: lustre-MDT0000: Recovery over after 0:01, of 1 clients 1 recovered and 0 were evicted. [ 1986.124910] Lustre: lustre-OST0000: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x240000400:2012 to 0x240000400:2049) [ 1986.125316] Lustre: lustre-OST0001: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x280000400:2076 to 0x280000400:2113) [ 1987.062082] Lustre: lustre-MDT0000-lwp-OST0001: Connection restored to 0@lo (at 0@lo) [ 1988.196726] Lustre: DEBUG MARKER: oleg139-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid [ 1989.283458] Lustre: DEBUG MARKER: mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec [ 2052.471210] Lustre: Failing over lustre-MDT0000 [ 2052.759878] Lustre: server umount lustre-MDT0000 complete [ 2066.201413] LustreError: MGC192.168.201.139@tcp: Connection to MGS (at 0@lo) was lost; in progress operations using this service will fail [ 2066.391600] Lustre: lustre-MDT0000-lwp-OST0001: Connection to lustre-MDT0000 (at 0@lo) was lost; in progress operations using this service will wait for recovery to complete [ 2066.399128] Lustre: Skipped 1 previous similar message [ 2066.607462] Lustre: lustre-MDT0000: Imperative Recovery not enabled, recovery window 60-180 [ 2066.789473] Lustre: lustre-MDT0000: in recovery but waiting for the first client to connect [ 2068.586762] Lustre: DEBUG MARKER: oleg139-server.virtnet: executing set_default_debug -1 all [ 2072.038489] Lustre: lustre-MDT0000-lwp-OST0000: Connection restored to 0@lo (at 0@lo) [ 2072.041376] Lustre: Skipped 1 previous similar message [ 2072.069699] Lustre: lustre-MDT0000: Will be in recovery for at least 1:00, or until 1 client reconnects [ 2072.121722] Lustre: lustre-MDT0000: Recovery over after 0:01, of 1 clients 1 recovered and 0 were evicted. [ 2072.134073] Lustre: 36320:0:(mdt_recovery.c:102:mdt_req_from_lrd()) @@@ restoring transno req@ffff8dcb46cfed80 x1859825933833728/t30064778752(0) o101->3bb7fa52-952c-4b44-b5a6-0367668a8a30@192.168.201.39@tcp:79/0 lens 672/3488 e 0 to 0 dl 1773670239 ref 1 fl Interpret:/602/0 rc 0/0 job:'writemany.0' uid:0 gid:0 projid:0 [ 2072.142890] Lustre: lustre-OST0000: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x240000400:3311 to 0x240000400:3329) [ 2072.143280] Lustre: lustre-OST0001: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x280000400:3375 to 0x280000400:3393) [ 2073.975939] Lustre: DEBUG MARKER: oleg139-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid [ 2074.963804] Lustre: DEBUG MARKER: mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec [ 2137.787984] Lustre: Failing over lustre-MDT0000 [ 2137.861711] LustreError: 3297:0:(client.c:1380:ptlrpc_import_delay_req()) @@@ IMP_CLOSED req@ffff8dcb42de1180 x1859825942396288/t0(0) o6->lustre-OST0000-osc-MDT0000@0@lo:28/4 lens 544/432 e 0 to 0 dl 0 ref 1 fl Rpc:QU/200/ffffffff rc 0/-1 job:'osp-syn-0-0.0' uid:0 gid:0 projid:4294967295 [ 2138.008703] Lustre: server umount lustre-MDT0000 complete [ 2151.190210] LustreError: MGC192.168.201.139@tcp: Connection to MGS (at 0@lo) was lost; in progress operations using this service will fail [ 2151.355081] Lustre: lustre-MDT0000-lwp-OST0000: Connection to lustre-MDT0000 (at 0@lo) was lost; in progress operations using this service will wait for recovery to complete [ 2151.361245] Lustre: Skipped 2 previous similar messages [ 2151.667421] Lustre: lustre-MDT0000: Imperative Recovery not enabled, recovery window 60-180 [ 2151.895375] Lustre: lustre-MDT0000: in recovery but waiting for the first client to connect [ 2152.965430] Lustre: lustre-MDT0000: Will be in recovery for at least 1:00, or until 1 client reconnects [ 2153.039055] Lustre: lustre-MDT0000: Recovery over after 0:01, of 1 clients 1 recovered and 0 were evicted. [ 2153.062266] Lustre: lustre-OST0000: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x240000400:4639 to 0x240000400:4673) [ 2153.062293] Lustre: lustre-OST0001: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x280000400:4704 to 0x280000400:4737) [ 2153.913984] Lustre: DEBUG MARKER: oleg139-server.virtnet: executing set_default_debug -1 all [ 2153.953230] Lustre: 3297:0:(client.c:2478:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1773670294/real 1773670294] req@ffff8dcb42de0380 x1859825942397184/t0(0) o400->lustre-MDT0000-lwp-OST0000@0@lo:12/10 lens 224/224 e 0 to 1 dl 1773670310 ref 1 fl Rpc:XNQr/200/ffffffff rc 0/-1 job:'kworker.0' uid:0 gid:0 projid:4294967295 [ 2153.967602] Lustre: 3297:0:(client.c:2478:ptlrpc_expire_one_request()) Skipped 10 previous similar messages [ 2156.519782] Lustre: lustre-MDT0000-lwp-OST0000: Connection restored to 0@lo (at 0@lo) [ 2156.523151] Lustre: Skipped 1 previous similar message [ 2158.648264] Lustre: DEBUG MARKER: oleg139-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid [ 2159.522671] Lustre: DEBUG MARKER: mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec [ 2184.166624] Lustre: DEBUG MARKER: == recovery-small test 51: failover MDS during recovery == 10:12:19 (1773670339) [ 2186.567535] Lustre: Failing over lustre-MDT0000 [ 2186.835570] Lustre: server umount lustre-MDT0000 complete [ 2203.033778] Lustre: DEBUG MARKER: oleg139-server.virtnet: executing set_default_debug -1 all [ 2204.930952] Lustre: DEBUG MARKER: test_51: failover in 1 sec [ 2206.213839] Lustre: lustre-MDT0000: Will be in recovery for at least 1:00, or until 1 client reconnects [ 2206.277163] Lustre: lustre-MDT0000: Recovery over after 0:01, of 1 clients 1 recovered and 0 were evicted. [ 2206.304136] Lustre: lustre-OST0000: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x240000400:5296 to 0x240000400:5313) [ 2206.304309] Lustre: lustre-OST0001: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x280000400:5361 to 0x280000400:5377) [ 2207.025874] Lustre: Failing over lustre-MDT0000 [ 2207.306302] Lustre: server umount lustre-MDT0000 complete [ 2220.790757] LustreError: MGC192.168.201.139@tcp: Connection to MGS (at 0@lo) was lost; in progress operations using this service will fail [ 2220.797069] LustreError: Skipped 1 previous similar message [ 2222.914435] Lustre: DEBUG MARKER: oleg139-server.virtnet: executing set_default_debug -1 all [ 2224.416920] Lustre: DEBUG MARKER: test_51: failover in 5 sec [ 2226.726234] Lustre: lustre-OST0000: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x240000400:5324 to 0x240000400:5345) [ 2226.726301] Lustre: lustre-OST0001: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x280000400:5389 to 0x280000400:5409) [ 2230.324258] Lustre: Failing over lustre-MDT0000 [ 2230.646831] Lustre: server umount lustre-MDT0000 complete [ 2245.628461] Lustre: DEBUG MARKER: oleg139-server.virtnet: executing set_default_debug -1 all [ 2246.234353] Lustre: lustre-OST0000: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x240000400:5433 to 0x240000400:5473) [ 2246.234358] Lustre: lustre-OST0001: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x280000400:5496 to 0x280000400:5537) [ 2247.385695] Lustre: DEBUG MARKER: test_51: failover in 10 sec [ 2258.671242] Lustre: Failing over lustre-MDT0000 [ 2258.784059] LustreError: 3297:0:(client.c:1380:ptlrpc_import_delay_req()) @@@ IMP_CLOSED req@ffff8dcb702f2680 x1859825942925312/t0(0) o6->lustre-OST0000-osc-MDT0000@0@lo:28/4 lens 544/432 e 0 to 0 dl 0 ref 1 fl Rpc:QU/200/ffffffff rc 0/-1 job:'osp-syn-0-0.0' uid:0 gid:0 projid:4294967295 [ 2258.793694] LustreError: 3297:0:(client.c:1380:ptlrpc_import_delay_req()) Skipped 2 previous similar messages [ 2258.936106] Lustre: server umount lustre-MDT0000 complete [ 2274.730898] Lustre: DEBUG MARKER: oleg139-server.virtnet: executing set_default_debug -1 all [ 2276.270281] Lustre: DEBUG MARKER: test_51: failover in 20 sec [ 2276.869278] Lustre: lustre-MDT0000: Will be in recovery for at least 1:00, or until 1 client reconnects [ 2276.873765] Lustre: Skipped 2 previous similar messages [ 2276.908225] Lustre: lustre-MDT0000: Recovery over after 0:01, of 1 clients 1 recovered and 0 were evicted. [ 2276.911372] Lustre: Skipped 2 previous similar messages [ 2276.931306] Lustre: lustre-OST0000: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x240000400:5742 to 0x240000400:5761) [ 2276.931338] Lustre: lustre-OST0001: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x280000400:5806 to 0x280000400:5825) [ 2297.994563] Lustre: Failing over lustre-MDT0000 [ 2298.236218] Lustre: server umount lustre-MDT0000 complete [ 2312.079977] Lustre: lustre-MDT0000-lwp-OST0000: Connection to lustre-MDT0000 (at 0@lo) was lost; in progress operations using this service will wait for recovery to complete [ 2312.088843] Lustre: Skipped 9 previous similar messages [ 2312.457186] Lustre: lustre-MDT0000: Imperative Recovery not enabled, recovery window 60-180 [ 2312.460757] Lustre: Skipped 4 previous similar messages [ 2312.517740] Lustre: lustre-MDT0000: in recovery but waiting for the first client to connect [ 2312.521417] Lustre: Skipped 4 previous similar messages [ 2314.478142] Lustre: DEBUG MARKER: oleg139-server.virtnet: executing set_default_debug -1 all [ 2316.104019] Lustre: DEBUG MARKER: test_51: failover in 25 sec [ 2317.285718] Lustre: lustre-MDT0000-lwp-OST0000: Connection restored to 0@lo (at 0@lo) [ 2317.288375] Lustre: Skipped 9 previous similar messages [ 2318.382860] Lustre: lustre-OST0000: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x240000400:6223 to 0x240000400:6241) [ 2318.383877] Lustre: lustre-OST0001: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x280000400:6288 to 0x280000400:6305) [ 2342.630765] Lustre: Failing over lustre-MDT0000 [ 2342.896436] Lustre: server umount lustre-MDT0000 complete [ 2356.017632] LustreError: MGC192.168.201.139@tcp: Connection to MGS (at 0@lo) was lost; in progress operations using this service will fail [ 2356.024448] LustreError: Skipped 3 previous similar messages [ 2358.455239] Lustre: DEBUG MARKER: oleg139-server.virtnet: executing set_default_debug -1 all [ 2359.384835] Lustre: lustre-OST0001: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x280000400:6765 to 0x280000400:6849) [ 2359.384909] Lustre: lustre-OST0000: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x240000400:6701 to 0x240000400:6785) [ 2360.229831] Lustre: DEBUG MARKER: test_51: failover in 30 sec [ 2391.614396] Lustre: Failing over lustre-MDT0000 [ 2391.843715] Lustre: server umount lustre-MDT0000 complete [ 2407.772359] Lustre: DEBUG MARKER: oleg139-server.virtnet: executing set_default_debug -1 all [ 2410.502209] Lustre: lustre-MDT0000: Will be in recovery for at least 1:00, or until 1 client reconnects [ 2410.506765] Lustre: Skipped 2 previous similar messages [ 2410.539878] Lustre: lustre-MDT0000: Recovery over after 0:01, of 1 clients 1 recovered and 0 were evicted. [ 2410.545349] Lustre: Skipped 2 previous similar messages [ 2410.563915] Lustre: lustre-OST0000: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x240000400:7368 to 0x240000400:7393) [ 2410.564144] Lustre: lustre-OST0001: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x280000400:7432 to 0x280000400:7457) [ 2412.960121] Lustre: 3297:0:(client.c:2478:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1773670553/real 1773670553] req@ffff8dcb75196a00 x1859825943707648/t0(0) o400->lustre-MDT0000-lwp-OST0001@0@lo:12/10 lens 224/224 e 0 to 1 dl 1773670569 ref 1 fl Rpc:XNQr/200/ffffffff rc 0/-1 job:'kworker.0' uid:0 gid:0 projid:4294967295 [ 2412.973402] Lustre: 3297:0:(client.c:2478:ptlrpc_expire_one_request()) Skipped 41 previous similar messages [ 2432.258101] Lustre: DEBUG MARKER: == recovery-small test 52: failover OST under load ======= 10:16:27 (1773670587) [ 2443.927566] Lustre: Failing over lustre-OST0000 [ 2443.994981] Lustre: lustre-OST0000: Not available for connect from 192.168.201.39@tcp (stopping) [ 2444.033433] Lustre: server umount lustre-OST0000 complete [ 2444.943195] LustreError: lustre-OST0000-osc-MDT0000: operation ost_create to node 0@lo failed: rc = -107 [ 2444.948290] LustreError: 6544:0:(ldlm_lib.c:1178:target_handle_connect()) lustre-OST0000: not available for connect from 0@lo (no target). If you are running an HA pair check that the target is mounted on the other server. [ 2444.958266] LustreError: 6544:0:(ldlm_lib.c:1178:target_handle_connect()) Skipped 2 previous similar messages [ 2446.383357] LustreError: 26337:0:(ldlm_lib.c:1178:target_handle_connect()) lustre-OST0000: not available for connect from 192.168.201.39@tcp (no target). If you are running an HA pair check that the target is mounted on the other server. [ 2451.463503] LustreError: 27490:0:(ldlm_lib.c:1178:target_handle_connect()) lustre-OST0000: not available for connect from 192.168.201.39@tcp (no target). If you are running an HA pair check that the target is mounted on the other server. [ 2451.471730] LustreError: 27490:0:(ldlm_lib.c:1178:target_handle_connect()) Skipped 1 previous similar message [ 2456.582408] LustreError: 26325:0:(ldlm_lib.c:1178:target_handle_connect()) lustre-OST0000: not available for connect from 192.168.201.39@tcp (no target). If you are running an HA pair check that the target is mounted on the other server. [ 2456.592694] LustreError: 26325:0:(ldlm_lib.c:1178:target_handle_connect()) Skipped 1 previous similar message [ 2460.405256] Lustre: DEBUG MARKER: oleg139-server.virtnet: executing set_default_debug -1 all [ 2465.021371] Lustre: DEBUG MARKER: oleg139-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid [ 2465.923325] Lustre: DEBUG MARKER: osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec [ 2774.496450] Lustre: Failing over lustre-OST0000 [ 2774.614564] Lustre: lustre-OST0000: Not available for connect from 192.168.201.39@tcp (stopping) [ 2775.556892] LustreError: lustre-OST0000-osc-MDT0000: operation ost_destroy to node 0@lo failed: rc = -107 [ 2775.562451] Lustre: lustre-OST0000-osc-MDT0000: Connection to lustre-OST0000 (at 0@lo) was lost; in progress operations using this service will wait for recovery to complete [ 2775.568803] Lustre: Skipped 5 previous similar messages [ 2776.667329] Lustre: server umount lustre-OST0000 complete [ 2778.600525] LustreError: 25894:0:(ldlm_lib.c:1178:target_handle_connect()) lustre-OST0000: not available for connect from 0@lo (no target). If you are running an HA pair check that the target is mounted on the other server. [ 2778.627761] LustreError: 25894:0:(ldlm_lib.c:1178:target_handle_connect()) Skipped 1 previous similar message [ 2788.838233] LustreError: 25896:0:(ldlm_lib.c:1178:target_handle_connect()) lustre-OST0000: not available for connect from 0@lo (no target). If you are running an HA pair check that the target is mounted on the other server. [ 2788.848877] LustreError: 25896:0:(ldlm_lib.c:1178:target_handle_connect()) Skipped 3 previous similar messages [ 2791.034754] Lustre: lustre-OST0000: Imperative Recovery not enabled, recovery window 60-180 [ 2791.038394] Lustre: Skipped 3 previous similar messages [ 2791.045389] Lustre: lustre-OST0000: in recovery but waiting for the first client to connect [ 2791.049040] Lustre: Skipped 3 previous similar messages [ 2792.027994] Lustre: lustre-OST0000: Will be in recovery for at least 1:00, or until 2 clients reconnect [ 2792.033919] Lustre: Skipped 1 previous similar message [ 2792.297509] Lustre: lustre-OST0000-osc-MDT0000: Connection restored to 0@lo (at 0@lo) [ 2792.298197] Lustre: lustre-OST0000: Recovery over after 0:01, of 2 clients 2 recovered and 0 were evicted. [ 2792.301517] Lustre: Skipped 6 previous similar messages [ 2792.314562] Lustre: Skipped 1 previous similar message [ 2794.148800] Lustre: DEBUG MARKER: oleg139-server.virtnet: executing set_default_debug -1 all [ 2798.602609] Lustre: DEBUG MARKER: oleg139-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid [ 2799.566941] Lustre: DEBUG MARKER: osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec [ 3105.589769] Lustre: Failing over lustre-OST0000 [ 3105.671737] Lustre: server umount lustre-OST0000 complete [ 3106.494608] LustreError: lustre-OST0000-osc-MDT0000: operation ost_create to node 0@lo failed: rc = -107 [ 3106.498359] LustreError: Skipped 5 previous similar messages [ 3106.502342] LustreError: 14785:0:(ldlm_lib.c:1178:target_handle_connect()) lustre-OST0000: not available for connect from 0@lo (no target). If you are running an HA pair check that the target is mounted on the other server. [ 3106.511208] LustreError: 14785:0:(ldlm_lib.c:1178:target_handle_connect()) Skipped 1 previous similar message [ 3119.465850] Lustre: lustre-OST0000: Imperative Recovery enabled, recovery window shrunk from 60-180 down to 60-180 [ 3122.408672] Lustre: DEBUG MARKER: oleg139-server.virtnet: executing set_default_debug -1 all [ 3126.876103] Lustre: DEBUG MARKER: oleg139-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid [ 3127.952357] Lustre: DEBUG MARKER: osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec [ 3399.242306] Lustre: DEBUG MARKER: == recovery-small test 53a: touch: drop rep ============== 10:32:34 (1773671554) [ 3399.887524] Lustre: *** cfs_fail_loc=157, val=2147483648*** [ 3399.889540] Lustre: Skipped 3 previous similar messages [ 3399.891150] LustreError: 44528:0:(ldlm_lib.c:3325:target_send_reply_msg()) @@@ dropping reply req@ffff8dcb459c1500 x1859825974942336/t0(0) o101->3bb7fa52-952c-4b44-b5a6-0367668a8a30@192.168.201.39@tcp:652/0 lens 576/688 e 0 to 0 dl 1773671567 ref 1 fl Interpret:/600/0 rc 0/0 job:'openfile.0' uid:0 gid:0 projid:0 [ 3399.904619] LustreError: 44528:0:(ldlm_lib.c:3325:target_send_reply_msg()) Skipped 1 previous similar message [ 3415.074599] Lustre: lustre-MDT0000: Client 3bb7fa52-952c-4b44-b5a6-0367668a8a30 (at 192.168.201.39@tcp) reconnecting [ 3415.078802] Lustre: Skipped 5 previous similar messages [ 3419.039079] Lustre: DEBUG MARKER: == recovery-small test 53b: touch: drop rep ============== 10:32:54 (1773671574) [ 3419.807712] LustreError: 44527:0:(ldlm_lib.c:3325:target_send_reply_msg()) @@@ dropping reply req@ffff8dcb46265f80 x1859825974947456/t0(0) o101->3bb7fa52-952c-4b44-b5a6-0367668a8a30@192.168.201.39@tcp:672/0 lens 576/688 e 0 to 0 dl 1773671587 ref 1 fl Interpret:/600/0 rc 0/0 job:'openfile.0' uid:0 gid:0 projid:0 [ 3439.142815] Lustre: DEBUG MARKER: == recovery-small test 53c: touch: drop rep ============== 10:33:14 (1773671594) [ 3439.901652] Lustre: *** cfs_fail_loc=157, val=2147483648*** [ 3439.905065] Lustre: Skipped 1 previous similar message [ 3439.907215] LustreError: 44527:0:(ldlm_lib.c:3325:target_send_reply_msg()) @@@ dropping reply req@ffff8dcb48ae4000 x1859825974950400/t68719583588(0) o101->3bb7fa52-952c-4b44-b5a6-0367668a8a30@192.168.201.39@tcp:692/0 lens 664/664 e 0 to 0 dl 1773671607 ref 1 fl Interpret:/600/0 rc 0/0 job:'openfile.0' uid:0 gid:0 projid:0 [ 3455.505400] Lustre: 44528:0:(mdt_recovery.c:102:mdt_req_from_lrd()) @@@ restoring transno req@ffff8dca490d6a00 x1859825974950400/t68719583588(0) o101->3bb7fa52-952c-4b44-b5a6-0367668a8a30@192.168.201.39@tcp:707/0 lens 664/3488 e 0 to 0 dl 1773671622 ref 1 fl Interpret:H/602/0 rc 0/0 job:'openfile.0' uid:0 gid:0 projid:0 [ 3455.522800] Lustre: 44528:0:(mdt_recovery.c:102:mdt_req_from_lrd()) Skipped 1 previous similar message [ 3459.730885] Lustre: DEBUG MARKER: == recovery-small test 54: back in time ================== 10:33:34 (1773671614) [ 3471.638336] Lustre: Failing over lustre-MDT0000 [ 3472.087307] Lustre: server umount lustre-MDT0000 complete [ 3486.084430] LustreError: MGC192.168.201.139@tcp: Connection to MGS (at 0@lo) was lost; in progress operations using this service will fail [ 3486.090765] LustreError: Skipped 1 previous similar message [ 3486.324345] Lustre: lustre-MDT0000-lwp-OST0001: Connection to lustre-MDT0000 (at 0@lo) was lost; in progress operations using this service will wait for recovery to complete [ 3486.332336] Lustre: Skipped 1 previous similar message [ 3486.441508] Lustre: lustre-MDT0000: Imperative Recovery not enabled, recovery window 60-180 [ 3486.493587] Lustre: lustre-MDT0000: in recovery but waiting for the first client to connect [ 3486.496630] Lustre: Skipped 1 previous similar message [ 3488.558701] Lustre: DEBUG MARKER: oleg139-server.virtnet: executing set_default_debug -1 all [ 3489.184100] Lustre: 3298:0:(client.c:2478:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1773671629/real 1773671629] req@ffff8dcb6ef75c00 x1859825951111808/t0(0) o400->lustre-MDT0000-lwp-OST0001@0@lo:12/10 lens 224/224 e 0 to 1 dl 1773671645 ref 1 fl Rpc:XNQr/200/ffffffff rc 0/-1 job:'kworker.0' uid:0 gid:0 projid:4294967295 [ 3489.222209] Lustre: 3298:0:(client.c:2478:ptlrpc_expire_one_request()) Skipped 3 previous similar messages [ 3490.828628] Lustre: lustre-MDT0000: Will be in recovery for at least 1:00, or until 2 clients reconnect [ 3490.832281] Lustre: Skipped 1 previous similar message [ 3490.866508] Lustre: lustre-MDT0000: Recovery over after 0:01, of 2 clients 2 recovered and 0 were evicted. [ 3490.869674] Lustre: Skipped 1 previous similar message [ 3490.887720] Lustre: lustre-OST0000: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x240000400:23953 to 0x240000400:24257) [ 3490.887746] Lustre: lustre-OST0001: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x280000400:25950 to 0x280000400:25985) [ 3491.813205] Lustre: lustre-MDT0000-lwp-OST0000: Connection restored to 0@lo (at 0@lo) [ 3491.815290] Lustre: Skipped 1 previous similar message [ 3492.569392] Lustre: DEBUG MARKER: oleg139-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid [ 3493.346239] Lustre: DEBUG MARKER: mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec [ 3498.098421] Lustre: DEBUG MARKER: == recovery-small test 55: ost_brw_read/write drops timed-out read/write request ========================================================== 10:34:13 (1773671653) [ 3502.092203] Lustre: *** cfs_fail_loc=21d, val=0*** [ 3502.092231] LustreError: 6551:0:(tgt_handler.c:2808:tgt_brw_write()) lustre-OST0000: Dropping timed-out write from 12345-192.168.201.39@tcp because locking object 0x240000400:24259 took 0 seconds (limit was 11). [ 3502.093916] Lustre: Skipped 4 previous similar messages [ 3502.096145] Lustre: lustre-OST0000: Bulk IO write error with 3bb7fa52-952c-4b44-b5a6-0367668a8a30 (at 192.168.201.39@tcp), client will retry: rc = -110 [ 3502.099274] LustreError: 6551:0:(tgt_handler.c:2808:tgt_brw_write()) Skipped 1 previous similar message [ 3518.468210] Lustre: lustre-OST0000: Client 3bb7fa52-952c-4b44-b5a6-0367668a8a30 (at 192.168.201.39@tcp) reconnecting [ 3518.471342] Lustre: Skipped 2 previous similar messages [ 3518.476642] LustreError: 12292:0:(tgt_handler.c:2808:tgt_brw_write()) lustre-OST0000: Dropping timed-out write from 12345-192.168.201.39@tcp because locking object 0x240000400:24259 took 0 seconds (limit was 11). [ 3518.477288] Lustre: lustre-OST0000: Bulk IO write error with 3bb7fa52-952c-4b44-b5a6-0367668a8a30 (at 192.168.201.39@tcp), client will retry: rc = -110 [ 3518.483129] LustreError: 12292:0:(tgt_handler.c:2808:tgt_brw_write()) Skipped 15 previous similar messages [ 3518.492249] Lustre: Skipped 16 previous similar messages [ 3534.857795] LustreError: 6549:0:(tgt_handler.c:2808:tgt_brw_write()) lustre-OST0000: Dropping timed-out write from 12345-192.168.201.39@tcp because locking object 0x240000400:24259 took 0 seconds (limit was 11). [ 3534.858516] Lustre: lustre-OST0000: Bulk IO write error with 3bb7fa52-952c-4b44-b5a6-0367668a8a30 (at 192.168.201.39@tcp), client will retry: rc = -110 [ 3534.865886] LustreError: 6549:0:(tgt_handler.c:2808:tgt_brw_write()) Skipped 8 previous similar messages [ 3534.870098] Lustre: Skipped 7 previous similar messages [ 3550.155991] LustreError: 12292:0:(tgt_handler.c:2808:tgt_brw_write()) lustre-OST0000: Dropping timed-out write from 12345-192.168.201.39@tcp because locking object 0x240000400:24259 took 0 seconds (limit was 11). [ 3550.156352] Lustre: lustre-OST0000: Bulk IO write error with 3bb7fa52-952c-4b44-b5a6-0367668a8a30 (at 192.168.201.39@tcp), client will retry: rc = -110 [ 3550.163236] LustreError: 12292:0:(tgt_handler.c:2808:tgt_brw_write()) Skipped 5 previous similar messages [ 3550.168017] Lustre: Skipped 7 previous similar messages [ 3566.600362] Lustre: *** cfs_fail_loc=21d, val=0*** [ 3566.601403] LustreError: 14134:0:(tgt_handler.c:2808:tgt_brw_write()) lustre-OST0000: Dropping timed-out write from 12345-192.168.201.39@tcp because locking object 0x240000400:24259 took 0 seconds (limit was 11). [ 3566.601898] Lustre: Skipped 32 previous similar messages [ 3566.602925] Lustre: lustre-OST0000: Bulk IO write error with 3bb7fa52-952c-4b44-b5a6-0367668a8a30 (at 192.168.201.39@tcp), client will retry: rc = -110 [ 3566.606759] LustreError: 14134:0:(tgt_handler.c:2808:tgt_brw_write()) Skipped 7 previous similar messages [ 3581.967658] LustreError: 14236:0:(tgt_handler.c:2808:tgt_brw_write()) lustre-OST0000: Dropping timed-out write from 12345-192.168.201.39@tcp because locking object 0x240000400:24259 took 0 seconds (limit was 11). [ 3581.968429] Lustre: lustre-OST0000: Bulk IO write error with 3bb7fa52-952c-4b44-b5a6-0367668a8a30 (at 192.168.201.39@tcp), client will retry: rc = -110 [ 3581.975891] LustreError: 14236:0:(tgt_handler.c:2808:tgt_brw_write()) Skipped 9 previous similar messages [ 3581.980413] Lustre: Skipped 15 previous similar messages [ 3598.346506] LustreError: 46360:0:(tgt_handler.c:2808:tgt_brw_write()) lustre-OST0000: Dropping timed-out write from 12345-192.168.201.39@tcp because locking object 0x240000400:24259 took 0 seconds (limit was 11). [ 3598.347447] Lustre: lustre-OST0000: Bulk IO write error with 3bb7fa52-952c-4b44-b5a6-0367668a8a30 (at 192.168.201.39@tcp), client will retry: rc = -110 [ 3598.354812] LustreError: 46360:0:(tgt_handler.c:2808:tgt_brw_write()) Skipped 8 previous similar messages [ 3598.364742] Lustre: Skipped 8 previous similar messages [ 3646.480873] LustreError: 12292:0:(tgt_handler.c:2808:tgt_brw_write()) lustre-OST0000: Dropping timed-out write from 12345-192.168.201.39@tcp because locking object 0x240000400:24259 took 0 seconds (limit was 11). [ 3646.481511] Lustre: lustre-OST0000: Bulk IO write error with 3bb7fa52-952c-4b44-b5a6-0367668a8a30 (at 192.168.201.39@tcp), client will retry: rc = -110 [ 3646.489026] LustreError: 12292:0:(tgt_handler.c:2808:tgt_brw_write()) Skipped 26 previous similar messages [ 3646.493970] Lustre: Skipped 25 previous similar messages [ 3678.216109] Lustre: lustre-OST0000: Client 3bb7fa52-952c-4b44-b5a6-0367668a8a30 (at 192.168.201.39@tcp) reconnecting [ 3678.219907] Lustre: Skipped 9 previous similar messages [ 3694.603424] Lustre: *** cfs_fail_loc=21d, val=0*** [ 3694.604876] Lustre: Skipped 66 previous similar messages [ 3726.348290] LustreError: 51931:0:(tgt_handler.c:2808:tgt_brw_write()) lustre-OST0000: Dropping timed-out write from 12345-192.168.201.39@tcp because locking object 0x240000400:24259 took 0 seconds (limit was 11). [ 3726.349621] Lustre: lustre-OST0000: Bulk IO write error with 3bb7fa52-952c-4b44-b5a6-0367668a8a30 (at 192.168.201.39@tcp), client will retry: rc = -110 [ 3726.355513] LustreError: 51931:0:(tgt_handler.c:2808:tgt_brw_write()) Skipped 43 previous similar messages [ 3726.358638] Lustre: Skipped 44 previous similar messages [ 3779.530922] Lustre: DEBUG MARKER: == recovery-small test 56: do not fail on getattr resend ========================================================== 10:38:55 (1773671935) [ 3779.915304] LustreError: 51217:0:(mdt_handler.c:2334:mdt_getattr_name_lock()) cfs_fail_timeout id 136 sleeping for 40000ms [ 3819.976176] LustreError: 51217:0:(mdt_handler.c:2334:mdt_getattr_name_lock()) cfs_fail_timeout id 136 awake [ 3827.267337] Lustre: DEBUG MARKER: == recovery-small test 57: read procfs entries causes kernel crash ========================================================== 10:39:41 (1773671981) [ 3831.507887] Lustre: Failing over lustre-MDT0000 [ 3831.865928] Lustre: server umount lustre-MDT0000 complete [ 3838.046383] LustreError: MGC192.168.201.139@tcp: Connection to MGS (at 0@lo) was lost; in progress operations using this service will fail [ 3838.699922] Lustre: lustre-OST0001: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x280000400:25950 to 0x280000400:26017) [ 3838.701531] Lustre: lustre-MDT0000: Aborting client recovery [ 3838.713727] Lustre: lustre-OST0000: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x240000400:24261 to 0x240000400:24289) [ 3842.887142] Lustre: DEBUG MARKER: oleg139-server.virtnet: executing set_default_debug -1 all [ 3854.539458] Lustre: DEBUG MARKER: == recovery-small test 58: Eviction in the middle of open RPC reply processing ========================================================== 10:40:09 (1773672009) [ 3889.398873] Lustre: DEBUG MARKER: == recovery-small test 59: Read cancel race on client eviction ========================================================== 10:40:41 (1773672041) [ 3911.726414] LustreError: 26336:0:(ldlm_lockd.c:727:ldlm_handle_ast_error()) ### client (nid 192.168.201.39@tcp) returned error from blocking AST (req@000000006cdcd108 x1859825951211648 status -107 rc -107), evict it ns: filter-lustre-OST0000_UUID lock: 00000000a9f8bfd8/0x92ba224dac46abe0 lrc: 4/0,0 mode: PW/PW res: [0x240000400:0x5ee2:0x0].0x0 rrc: 3 type: EXT [0->18446744073709551615] (req 0->4095) gid 0 flags: 0x60000400000020 nid: 192.168.201.39@tcp remote: 0xe59155cbed548ebd expref: 5 pid: 26337 timeout: 4011 lvb_type: 0 lru_score: 0 lru_type: 0 [ 3911.798924] LustreError: lustre-OST0000: A client on nid 192.168.201.39@tcp was evicted due to a lock blocking callback time out: rc -107 [ 3911.822617] LustreError: 5752:0:(ldlm_lockd.c:255:expired_lock_main()) ### lock callback timer expired after 0s: evicting client at 192.168.201.39@tcp ns: filter-lustre-OST0000_UUID lock: 00000000a9f8bfd8/0x92ba224dac46abe0 lrc: 3/0,0 mode: PW/PW res: [0x240000400:0x5ee2:0x0].0x0 rrc: 3 type: EXT [0->18446744073709551615] (req 0->4095) gid 0 flags: 0x60000400000020 nid: 192.168.201.39@tcp remote: 0xe59155cbed548ebd expref: 6 pid: 26337 timeout: 0 lvb_type: 0 lru_score: 0 lru_type: 0 [ 3938.002934] Lustre: DEBUG MARKER: == recovery-small test 60: Add Changelog entries during MDS failover ========================================================== 10:41:30 (1773672090) [ 3938.422713] LustreError: 53176:0:(ldlm_lockd.c:727:ldlm_handle_ast_error()) ### client (nid 192.168.201.39@tcp) returned error from blocking AST (req@00000000c7be5f2a x1859825951216640 status -107 rc -107), evict it ns: mdt-lustre-MDT0000_UUID lock: ffff8dcb4b0d2400/0x92ba224dac46abfc lrc: 4/0,0 mode: PR/PR res: [0x200000007:0x1:0x0].0x0 bits 0x13/0x0 rrc: 3 type: IBT gid 0 flags: 0x60200400000020 nid: 192.168.201.39@tcp remote: 0xe59155cbed548ecb expref: 6 pid: 53773 timeout: 4038 lvb_type: 0 lru_score: 0 lru_type: 0 [ 3938.510748] LustreError: lustre-MDT0000: A client on nid 192.168.201.39@tcp was evicted due to a lock blocking callback time out: rc -107 [ 3938.539368] LustreError: 5752:0:(ldlm_lockd.c:255:expired_lock_main()) ### lock callback timer expired after 0s: evicting client at 192.168.201.39@tcp ns: mdt-lustre-MDT0000_UUID lock: ffff8dcb4b0d2400/0x92ba224dac46abfc lrc: 3/0,0 mode: PR/PR res: [0x200000007:0x1:0x0].0x0 bits 0x13/0x0 rrc: 3 type: IBT gid 0 flags: 0x60200400000020 nid: 192.168.201.39@tcp remote: 0xe59155cbed548ecb expref: 7 pid: 53773 timeout: 0 lvb_type: 0 lru_score: 0 lru_type: 0 [ 3945.821557] Lustre: lustre-MDD0000: changelog on [ 3996.925447] Lustre: lustre-OST0001: haven't heard from client f0d8a0ea-d3a6-4eeb-8a7b-c9157ba290af (at 192.168.201.39@tcp) in 101 seconds. I think it's dead, and I am evicting it. exp ffff8dcb48785000, cur 1773672153 deadline 1773672152 last 1773672052 [ 4093.911442] Lustre: Failing over lustre-MDT0000 [ 4094.947352] Lustre: lustre-MDT0000-lwp-OST0001: Connection to lustre-MDT0000 (at 0@lo) was lost; in progress operations using this service will wait for recovery to complete [ 4094.957816] Lustre: Skipped 3 previous similar messages [ 4094.966429] Lustre: lustre-MDT0000: Not available for connect from 0@lo (stopping) [ 4094.981385] Lustre: Skipped 2 previous similar messages [ 4095.064993] Lustre: server umount lustre-MDT0000 complete [ 4116.449099] Lustre: 3298:0:(client.c:2478:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1773672256/real 1773672256] req@ffff8dcb6ec67b80 x1859825951265664/t0(0) o400->MGC192.168.201.139@tcp@0@lo:26/25 lens 224/224 e 0 to 1 dl 1773672272 ref 1 fl Rpc:XNQr/200/ffffffff rc 0/-1 job:'kworker.0' uid:0 gid:0 projid:4294967295 [ 4116.476333] Lustre: 3298:0:(client.c:2478:ptlrpc_expire_one_request()) Skipped 8 previous similar messages [ 4116.488915] LustreError: MGC192.168.201.139@tcp: Connection to MGS (at 0@lo) was lost; in progress operations using this service will fail [ 4127.955368] Lustre: lustre-MDT0000: Imperative Recovery not enabled, recovery window 60-180 [ 4127.973578] Lustre: Skipped 1 previous similar message [ 4128.230565] Lustre: lustre-MDD0000: changelog on [ 4128.269530] Lustre: lustre-MDT0000: in recovery but waiting for the first client to connect [ 4130.954154] Lustre: lustre-MDT0000: Will be in recovery for at least 1:00, or until 1 client reconnects [ 4131.907647] Lustre: lustre-MDT0000: Recovery over after 0:01, of 1 clients 1 recovered and 0 were evicted. [ 4132.026744] Lustre: lustre-OST0000: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x240000400:26791 to 0x240000400:26817) [ 4132.039701] Lustre: lustre-OST0001: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x280000400:28519 to 0x280000400:28545) [ 4134.897434] Lustre: lustre-MDT0000-lwp-OST0001: Connection restored to 0@lo (at 0@lo) [ 4134.911611] Lustre: Skipped 3 previous similar messages [ 4135.181276] Lustre: DEBUG MARKER: oleg139-server.virtnet: executing set_default_debug -1 all [ 4366.249440] Lustre: lustre-MDD0000: changelog off [ 4382.438216] Lustre: DEBUG MARKER: == recovery-small test 61: Verify to not reuse orphan objects - bug 17025 ========================================================== 10:48:56 (1773672536) [ 4389.663121] LustreError: 56408:0:(osd_handler.c:720:osd_ro()) lustre-MDT0000: *** setting device osd-zfs read-only *** [ 4391.138783] Lustre: DEBUG MARKER: mds1 REPLAY BARRIER on lustre-MDT0000 [ 4397.408970] Lustre: Failing over lustre-MDT0000 [ 4397.539459] Lustre: lustre-MDT0000: Not available for connect from 0@lo (stopping) [ 4397.554924] Lustre: Skipped 1 previous similar message [ 4398.105708] Lustre: server umount lustre-MDT0000 complete [ 4410.874082] LustreError: MGC192.168.201.139@tcp: Connection to MGS (at 0@lo) was lost; in progress operations using this service will fail [ 4412.361216] Lustre: lustre-MDT0000: Aborting client recovery [ 4412.373181] LustreError: 57026:0:(ldlm_lib.c:2983:target_stop_recovery_thread()) lustre-MDT0000: Aborting recovery [ 4412.395651] Lustre: 57076:0:(ldlm_lib.c:2386:target_recovery_overseer()) recovery is aborted, evict exports in recovery [ 4412.407934] Lustre: 57076:0:(ldlm_lib.c:2386:target_recovery_overseer()) Skipped 2 previous similar messages [ 4412.433820] Lustre: 57076:0:(genops.c:1620:class_disconnect_stale_exports()) lustre-MDT0000: disconnect stale client d20e2220-6037-4c9b-9382-b9ba1fd6bf21@ [ 4412.461138] Lustre: 57076:0:(genops.c:1620:class_disconnect_stale_exports()) Skipped 1 previous similar message [ 4412.474243] Lustre: lustre-MDT0000: disconnecting 1 stale clients [ 4412.659186] Lustre: lustre-MDT0000-osd: cancel update llog [0x200002b10:0x1:0x0] [ 4412.931857] Lustre: lustre-OST0001: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x280000400:28519 to 0x280000400:28577) [ 4412.932670] Lustre: lustre-OST0000: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x240000400:26791 to 0x240000400:26849) [ 4421.455727] Lustre: DEBUG MARKER: oleg139-server.virtnet: executing set_default_debug -1 all [ 4444.027206] Lustre: DEBUG MARKER: == recovery-small test 65: lock enqueue for destroyed export ========================================================== 10:49:57 (1773672597) [ 4447.420411] LustreError: 26336:0:(ldlm_lockd.c:1431:ldlm_handle_enqueue()) cfs_fail_timeout id 31e sleeping for 6000ms [ 4447.694869] Lustre: *** cfs_fail_loc=31e, val=0*** [ 4447.697031] Lustre: Skipped 3 previous similar messages [ 4449.408842] LustreError: 14785:0:(ldlm_lockd.c:1431:ldlm_handle_enqueue()) cfs_fail_timeout id 31e sleeping for 6000ms [ 4453.027198] Lustre: 57721:0:(genops.c:1793:obd_export_evict_by_uuid()) lustre-OST0000: evicting d20e2220-6037-4c9b-9382-b9ba1fd6bf21 at adminstrative request [ 4453.046685] LustreError: 6559:0:(ldlm_lockd.c:2933:ldlm_bl_thread_exports()) cfs_fail_timeout id 31e sleeping for 4000ms [ 4453.522310] LustreError: 26336:0:(ldlm_lockd.c:1431:ldlm_handle_enqueue()) cfs_fail_timeout id 31e awake [ 4453.539849] LustreError: 26336:0:(ldlm_lockd.c:1453:ldlm_handle_enqueue()) ### lock on destroyed export 00000000f7a480bf ns: filter-lustre-OST0000_UUID lock: 00000000c16446d5/0x92ba224dac4e2dc9 lrc: 3/0,0 mode: --/PW res: [0x240000400:0x68e3:0x0].0x0 rrc: 4 type: EXT [0->4095] (req 0->4095) gid 0 flags: 0x70000000020020 nid: 192.168.201.39@tcp remote: 0xe59155cbed55a0ff expref: 5 pid: 26336 timeout: 0 lvb_type: 0 lru_score: 0 lru_type: 0 [ 4455.519062] LustreError: 14785:0:(ldlm_lockd.c:1431:ldlm_handle_enqueue()) cfs_fail_timeout id 31e awake [ 4456.936173] LustreError: 6559:0:(ldlm_lockd.c:2933:ldlm_bl_thread_exports()) cfs_fail_timeout interrupted [ 4463.561453] Lustre: lustre-OST0000: Client cbeb1868-defe-41c4-a3ef-886b04b78e90 (at 192.168.201.39@tcp) reconnecting [ 4463.585267] Lustre: Skipped 8 previous similar messages [ 4482.353100] Lustre: DEBUG MARKER: == recovery-small test 66: lock enqueue re-send vs client eviction ========================================================== 10:50:35 (1773672635) [ 4485.507467] Lustre: *** cfs_fail_loc=157, val=2147483648*** [ 4485.517147] LustreError: 57712:0:(ldlm_lib.c:3325:target_send_reply_msg()) @@@ dropping reply req@ffff8dcb4a265f80 x1859825979671552/t0(0) o101->d20e2220-6037-4c9b-9382-b9ba1fd6bf21@192.168.201.39@tcp:227/0 lens 576/688 e 0 to 0 dl 1773672652 ref 1 fl Interpret:/600/0 rc 0/0 job:'stat.0' uid:0 gid:0 projid:0 [ 4487.277764] LustreError: 57036:0:(mdt_handler.c:2334:mdt_getattr_name_lock()) cfs_fail_timeout id 136 sleeping for 40000ms [ 4490.812962] Lustre: 58145:0:(genops.c:1793:obd_export_evict_by_uuid()) lustre-MDT0000: evicting d20e2220-6037-4c9b-9382-b9ba1fd6bf21 at adminstrative request [ 4492.960096] LustreError: 57036:0:(mdt_handler.c:2334:mdt_getattr_name_lock()) cfs_fail_timeout interrupted [ 4506.852254] Lustre: DEBUG MARKER: == recovery-small test 67: connect vs import invalidate race ========================================================== 10:50:59 (1773672659) [ 4511.538624] Lustre: 58446:0:(genops.c:1793:obd_export_evict_by_uuid()) lustre-MDT0000: evicting d20e2220-6037-4c9b-9382-b9ba1fd6bf21 at adminstrative request [ 4538.970122] Lustre: DEBUG MARKER: == recovery-small test 100: IR: Make sure normal recovery still works w/o IR ========================================================== 10:51:32 (1773672692) [ 4548.648314] Lustre: Failing over lustre-OST0000 [ 4549.147582] Lustre: server umount lustre-OST0000 complete [ 4550.113196] LustreError: lustre-OST0000-osc-MDT0000: operation ost_statfs to node 0@lo failed: rc = -107 [ 4550.145363] LustreError: 6545:0:(ldlm_lib.c:1178:target_handle_connect()) lustre-OST0000: not available for connect from 0@lo (no target). If you are running an HA pair check that the target is mounted on the other server. [ 4550.184239] LustreError: 6545:0:(ldlm_lib.c:1178:target_handle_connect()) Skipped 6 previous similar messages [ 4555.245359] LustreError: 27490:0:(ldlm_lib.c:1178:target_handle_connect()) lustre-OST0000: not available for connect from 0@lo (no target). If you are running an HA pair check that the target is mounted on the other server. [ 4565.480354] LustreError: 25896:0:(ldlm_lib.c:1178:target_handle_connect()) lustre-OST0000: not available for connect from 0@lo (no target). If you are running an HA pair check that the target is mounted on the other server. [ 4565.558068] LustreError: 25896:0:(ldlm_lib.c:1178:target_handle_connect()) Skipped 1 previous similar message [ 4585.677453] Lustre: DEBUG MARKER: oleg139-server.virtnet: executing set_default_debug -1 all [ 4599.597252] Lustre: DEBUG MARKER: oleg139-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid [ 4602.090765] Lustre: DEBUG MARKER: osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec [ 4621.341911] Lustre: DEBUG MARKER: == recovery-small test 101a: IR: Make sure IR works w/o normal recovery ========================================================== 10:52:54 (1773672774) [ 4628.140275] Lustre: Failing over lustre-OST0000 [ 4628.436660] Lustre: server umount lustre-OST0000 complete [ 4631.044688] LustreError: 6590:0:(ldlm_lib.c:1178:target_handle_connect()) lustre-OST0000: not available for connect from 0@lo (no target). If you are running an HA pair check that the target is mounted on the other server. [ 4631.079991] LustreError: 6590:0:(ldlm_lib.c:1178:target_handle_connect()) Skipped 1 previous similar message [ 4651.662879] Lustre: lustre-OST0000: Imperative Recovery enabled, recovery window shrunk from 60-180 down to 60-180 [ 4663.207969] Lustre: DEBUG MARKER: oleg139-server.virtnet: executing set_default_debug -1 all [ 4676.802085] Lustre: DEBUG MARKER: oleg139-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid [ 4680.310816] Lustre: DEBUG MARKER: osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec [ 4700.056329] Lustre: DEBUG MARKER: == recovery-small test 101b: IR: Make sure IR works w/o normal recovery and proceed EAGAIN ========================================================== 10:54:13 (1773672853) [ 4712.188928] Lustre: Failing over lustre-OST0000 [ 4712.374446] Lustre: server umount lustre-OST0000 complete [ 4712.968592] Lustre: lustre-OST0000-osc-MDT0000: Connection to lustre-OST0000 (at 0@lo) was lost; in progress operations using this service will wait for recovery to complete [ 4713.043530] Lustre: Skipped 5 previous similar messages [ 4713.067551] LustreError: 26337:0:(ldlm_lib.c:1178:target_handle_connect()) lustre-OST0000: not available for connect from 0@lo (no target). If you are running an HA pair check that the target is mounted on the other server. [ 4713.108411] LustreError: 26337:0:(ldlm_lib.c:1178:target_handle_connect()) Skipped 3 previous similar messages [ 4738.069976] Lustre: lustre-OST0000: Imperative Recovery enabled, recovery window shrunk from 60-180 down to 60-180 [ 4738.118535] Lustre: lustre-OST0000: in recovery but waiting for the first client to connect [ 4738.128364] LustreError: 62324:0:(ofd_dev.c:633:ofd_prepare()) cfs_fail_timeout id 247 sleeping for 25000ms [ 4738.130934] Lustre: Skipped 5 previous similar messages [ 4763.163391] LustreError: 62324:0:(ofd_dev.c:633:ofd_prepare()) cfs_fail_timeout id 247 awake [ 4763.649350] Lustre: lustre-OST0000: Will be in recovery for at least 1:00, or until 1 client reconnects [ 4763.675184] Lustre: Skipped 2 previous similar messages [ 4763.890595] Lustre: lustre-OST0000: Recovery over after 0:01, of 1 clients 1 recovered and 0 were evicted. [ 4763.891527] Lustre: lustre-OST0000-osc-MDT0000: Connection restored to 0@lo (at 0@lo) [ 4763.909389] Lustre: Skipped 2 previous similar messages [ 4763.941881] Lustre: Skipped 5 previous similar messages [ 4776.627280] Lustre: DEBUG MARKER: oleg139-server.virtnet: executing set_default_debug -1 all [ 4790.894295] Lustre: DEBUG MARKER: oleg139-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid [ 4793.887833] Lustre: DEBUG MARKER: osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec [ 4807.526747] Lustre: DEBUG MARKER: == recovery-small test 102: IR: New client gets updated nidtbl after MGS restart ========================================================== 10:56:01 (1773672961) [ 4813.483366] Lustre: Failing over lustre-OST0000 [ 4813.674766] Lustre: server umount lustre-OST0000 complete [ 4814.831699] LustreError: 6546:0:(ldlm_lib.c:1178:target_handle_connect()) lustre-OST0000: not available for connect from 0@lo (no target). If you are running an HA pair check that the target is mounted on the other server. [ 4814.864871] LustreError: 6546:0:(ldlm_lib.c:1178:target_handle_connect()) Skipped 4 previous similar messages [ 4837.497608] Lustre: lustre-OST0000: Imperative Recovery enabled, recovery window shrunk from 60-180 down to 60-180 [ 4847.977069] Lustre: DEBUG MARKER: oleg139-server.virtnet: executing set_default_debug -1 all [ 4860.902128] Lustre: DEBUG MARKER: oleg139-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid [ 4863.698948] Lustre: DEBUG MARKER: osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec [ 4878.438539] Lustre: Failing over lustre-MDT0000 [ 4879.516632] Lustre: server umount lustre-MDT0000 complete [ 4892.989649] LustreError: MGC192.168.201.139@tcp: Connection to MGS (at 0@lo) was lost; in progress operations using this service will fail [ 4894.184559] Lustre: lustre-MDT0000: Imperative Recovery not enabled, recovery window 60-180 [ 4894.204720] Lustre: Skipped 2 previous similar messages [ 4894.555305] Lustre: lustre-OST0000: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x240000400:26853 to 0x240000400:26881) [ 4894.563153] Lustre: lustre-OST0001: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x280000400:28579 to 0x280000400:28609) [ 4895.084327] Lustre: 3296:0:(client.c:2478:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1773673035/real 1773673035] req@ffff8dcb4738b480 x1859825952072576/t0(0) o400->lustre-MDT0000-lwp-OST0000@0@lo:12/10 lens 224/224 e 0 to 1 dl 1773673051 ref 1 fl Rpc:XNQr/200/ffffffff rc 0/-1 job:'kworker.0' uid:0 gid:0 projid:4294967295 [ 4902.884670] Lustre: DEBUG MARKER: oleg139-server.virtnet: executing set_default_debug -1 all [ 4910.897854] Lustre: Failing over lustre-OST0000 [ 4911.117345] Lustre: server umount lustre-OST0000 complete [ 4914.151781] LustreError: lustre-OST0000-osc-MDT0000: operation ost_statfs to node 0@lo failed: rc = -107 [ 4948.913562] Lustre: DEBUG MARKER: oleg139-server.virtnet: executing set_default_debug -1 all [ 4961.652304] Lustre: DEBUG MARKER: oleg139-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid [ 4978.936948] Lustre: DEBUG MARKER: == recovery-small test 103: IR: MDS can start w/o MGS and get updated nidtbl later ========================================================== 10:58:52 (1773673132) [ 4983.519541] Lustre: DEBUG MARKER: SKIP: recovery-small test_103 needs separate mgs and mds [ 4986.837710] Lustre: DEBUG MARKER: == recovery-small test 104: IR: ost can disable IR voluntarily ========================================================== 10:59:00 (1773673140) [ 4993.464253] Lustre: Failing over lustre-OST0000 [ 4993.669466] Lustre: server umount lustre-OST0000 complete [ 4994.025814] LustreError: 26990:0:(ldlm_lib.c:1178:target_handle_connect()) lustre-OST0000: not available for connect from 0@lo (no target). If you are running an HA pair check that the target is mounted on the other server. [ 4994.055270] LustreError: 26990:0:(ldlm_lib.c:1178:target_handle_connect()) Skipped 9 previous similar messages [ 5018.295424] Lustre: DEBUG MARKER: oleg139-server.virtnet: executing set_default_debug -1 all [ 5039.194758] Lustre: DEBUG MARKER: == recovery-small test 105: IR: NON IR clients support === 10:59:53 (1773673193) [ 5042.329699] Lustre: DEBUG MARKER: SKIP: recovery-small test_105 Needs multiple clients [ 5045.964184] Lustre: DEBUG MARKER: == recovery-small test 106: lightweight connection support ========================================================== 10:59:58 (1773673198) [ 5057.972583] LustreError: 68373:0:(osd_handler.c:720:osd_ro()) lustre-MDT0000: *** setting device osd-zfs read-only *** [ 5059.602960] Lustre: DEBUG MARKER: mds1 REPLAY BARRIER on lustre-MDT0000 [ 5063.197318] Lustre: Failing over lustre-MDT0000 [ 5063.668045] Lustre: server umount lustre-MDT0000 complete [ 5095.397314] LustreError: 3295:0:(client.c:1390:ptlrpc_import_delay_req()) @@@ invalidate in flight req@ffff8dca4c820e00 x1859825952123264/t0(0) o250->MGC192.168.201.139@tcp@0@lo:26/25 lens 520/544 e 0 to 0 dl 0 ref 1 fl Rpc:NQU/200/ffffffff rc 0/-1 job:'kworker.0' uid:0 gid:0 projid:4294967295 [ 5104.224428] Lustre: DEBUG MARKER: oleg139-server.virtnet: executing set_default_debug -1 all [ 5105.978982] Lustre: lustre-OST0000: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x240000400:26853 to 0x240000400:26913) [ 5105.985774] Lustre: lustre-OST0001: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x280000400:28611 to 0x280000400:28641) [ 5126.781520] Lustre: DEBUG MARKER: == recovery-small test 107: drop reint reply, then restart MDT ========================================================== 11:01:19 (1773673279) [ 5129.319084] Lustre: *** cfs_fail_loc=119, val=2147483648*** [ 5129.327934] LustreError: 69341:0:(ldlm_lib.c:3325:target_send_reply_msg()) @@@ dropping reply req@ffff8dcb4a264a80 x1859825979753216/t94489280517(0) o36->4c8f0584-8ea1-4eae-a656-ec0012913222@192.168.201.39@tcp:116/0 lens 504/448 e 0 to 0 dl 1773673296 ref 1 fl Interpret:/200/0 rc 0/0 job:'mkdir.0' uid:0 gid:0 projid:4294967295 [ 5134.043679] Lustre: Failing over lustre-MDT0000 [ 5134.976609] Lustre: server umount lustre-MDT0000 complete [ 5161.953079] LustreError: 3295:0:(client.c:1390:ptlrpc_import_delay_req()) @@@ invalidate in flight req@ffff8dcb49511f80 x1859825952141440/t0(0) o250->MGC192.168.201.139@tcp@0@lo:26/25 lens 520/544 e 0 to 0 dl 0 ref 1 fl Rpc:NQU/200/ffffffff rc 0/-1 job:'kworker.0' uid:0 gid:0 projid:4294967295 [ 5170.640972] Lustre: 70179:0:(mdt_recovery.c:102:mdt_req_from_lrd()) @@@ restoring transno req@ffff8dcb49513b80 x1859825979753216/t94489280517(0) o36->4c8f0584-8ea1-4eae-a656-ec0012913222@192.168.201.39@tcp:157/0 lens 504/2880 e 0 to 0 dl 1773673337 ref 1 fl Interpret:/202/0 rc 0/0 job:'mkdir.0' uid:0 gid:0 projid:4294967295 [ 5170.725097] Lustre: lustre-OST0000: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x240000400:26853 to 0x240000400:26945) [ 5170.730571] Lustre: lustre-OST0001: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x280000400:28611 to 0x280000400:28673) [ 5171.141872] Lustre: DEBUG MARKER: oleg139-server.virtnet: executing set_default_debug -1 all [ 5182.805682] Lustre: DEBUG MARKER: oleg139-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid [ 5185.503640] Lustre: DEBUG MARKER: mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec [ 5199.427072] Lustre: DEBUG MARKER: == recovery-small test 108: client eviction don't crash == 11:02:32 (1773673352) [ 5201.643319] Lustre: 70956:0:(genops.c:1793:obd_export_evict_by_uuid()) lustre-OST0000: evicting 4c8f0584-8ea1-4eae-a656-ec0012913222 at adminstrative request [ 5201.675756] LustreError: 12292:0:(ldlm_lib.c:3599:target_bulk_io()) @@@ Eviction on bulk WRITE req@ffff8dcb48ae7480 x1859825979764096/t0(0) o4->4c8f0584-8ea1-4eae-a656-ec0012913222@192.168.201.39@tcp:188/0 lens 488/448 e 0 to 0 dl 1773673368 ref 1 fl Interpret:/600/0 rc 0/0 job:'dd.0' uid:0 gid:0 projid:0 [ 5201.728200] Lustre: lustre-OST0000: Bulk IO write error with 4c8f0584-8ea1-4eae-a656-ec0012913222 (at 192.168.201.39@tcp), client will retry: rc = -107 [ 5201.746556] Lustre: Skipped 19 previous similar messages [ 5214.704960] Lustre: DEBUG MARKER: == recovery-small test 110a: create remote directory: drop client req ========================================================== 11:02:48 (1773673368) [ 5217.169229] Lustre: DEBUG MARKER: SKIP: recovery-small test_110a needs >= 2 MDTs [ 5220.466430] Lustre: DEBUG MARKER: == recovery-small test 110b: create remote directory: drop Master rep ========================================================== 11:02:53 (1773673373) [ 5223.229544] Lustre: DEBUG MARKER: SKIP: recovery-small test_110b needs >= 2 MDTs [ 5225.438504] Lustre: DEBUG MARKER: == recovery-small test 110c: create remote directory: drop update rep on slave MDT ========================================================== 11:02:59 (1773673379) [ 5227.986801] Lustre: DEBUG MARKER: SKIP: recovery-small test_110c needs >= 2 MDTs [ 5230.506958] Lustre: DEBUG MARKER: == recovery-small test 110d: remove remote directory: drop client req ========================================================== 11:03:04 (1773673384) [ 5232.718737] Lustre: DEBUG MARKER: SKIP: recovery-small test_110d needs >= 2 MDTs [ 5235.627074] Lustre: DEBUG MARKER: == recovery-small test 110e: remove remote directory: drop master rep ========================================================== 11:03:09 (1773673389) [ 5238.434349] Lustre: DEBUG MARKER: SKIP: recovery-small test_110e needs >= 2 MDTs [ 5241.118750] Lustre: DEBUG MARKER: == recovery-small test 110f: remove remote directory: drop slave rep ========================================================== 11:03:14 (1773673394) [ 5244.027181] Lustre: DEBUG MARKER: SKIP: recovery-small test_110f needs >= 2 MDTs [ 5247.673796] Lustre: DEBUG MARKER: == recovery-small test 110g: drop reply during migration ========================================================== 11:03:20 (1773673400) [ 5250.304589] Lustre: DEBUG MARKER: SKIP: recovery-small test_110g needs >= 2 MDTs [ 5253.191687] Lustre: DEBUG MARKER: == recovery-small test 110h: drop update reply during cross-MDT file rename ========================================================== 11:03:26 (1773673406) [ 5255.763467] Lustre: DEBUG MARKER: SKIP: recovery-small test_110h needs >= 2 MDTs [ 5259.302425] Lustre: DEBUG MARKER: == recovery-small test 110i: drop update reply during cross-MDT dir rename ========================================================== 11:03:32 (1773673412) [ 5262.422575] Lustre: DEBUG MARKER: SKIP: recovery-small test_110i needs >= 2 MDTs [ 5265.456389] Lustre: DEBUG MARKER: == recovery-small test 110j: drop update reply during cross-MDT ln ========================================================== 11:03:39 (1773673419) [ 5267.645316] Lustre: DEBUG MARKER: SKIP: recovery-small test_110j needs >= 2 MDTs [ 5271.224475] Lustre: DEBUG MARKER: == recovery-small test 110k: FID_QUERY failed during recovery ========================================================== 11:03:44 (1773673424) [ 5274.268946] Lustre: DEBUG MARKER: SKIP: recovery-small test_110k needs >= 2 MDTS [ 5278.088594] Lustre: DEBUG MARKER: == recovery-small test 110m: update resent vs original RPC race ========================================================== 11:03:50 (1773673430) [ 5282.838684] Lustre: DEBUG MARKER: SKIP: recovery-small test_110m needs at least 2 MDTs [ 5286.528687] Lustre: DEBUG MARKER: == recovery-small test 111: mdd setup fail should not cause umount oops ========================================================== 11:03:59 (1773673439) [ 5290.920479] Lustre: Failing over lustre-MDT0000 [ 5291.559813] Lustre: server umount lustre-MDT0000 complete [ 5305.503587] Lustre: *** cfs_fail_loc=151, val=0*** [ 5305.507297] LustreError: 72943:0:(mdd_device.c:674:mdd_changelog_init()) lustre-MDD0000: changelog setup during init failed: rc = -5 [ 5305.516411] LustreError: 72943:0:(mdd_device.c:1406:mdd_prepare()) lustre-MDD0000: failed to initialize changelog: rc = -5 [ 5305.524559] LustreError: 72943:0:(tgt_mount.c:2566:server_fill_super()) Unable to start targets: -5 [ 5305.536670] Lustre: Failing over lustre-MDT0000 [ 5305.944757] Lustre: server umount lustre-MDT0000 complete [ 5305.949982] LustreError: 72943:0:(super25.c:186:lustre_fill_super()) llite: Unable to mount : rc = -5 [ 5315.340685] LustreError: 73244:0:(ldlm_resource.c:1170:ldlm_resource_complain()) MGC192.168.201.139@tcp: namespace resource [0x65727473756c:0x0:0x0].0x0 (ffff8dcb44f5cc00) refcount nonzero (1) after lock cleanup; forcing cleanup. [ 5315.408497] LustreError: 5759:0:(mgc_request.c:614:do_requeue()) failed processing log: -5 [ 5319.978066] Lustre: lustre-OST0001: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x280000400:28611 to 0x280000400:28705) [ 5319.978872] Lustre: lustre-OST0000: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x240000400:26947 to 0x240000400:26977) [ 5324.133412] Lustre: DEBUG MARKER: oleg139-server.virtnet: executing set_default_debug -1 all [ 5339.438529] Lustre: DEBUG MARKER: == recovery-small test 112a: bulk resend while orignal request is in progress ========================================================== 11:04:53 (1773673493) [ 5342.233071] LustreError: 51932:0:(tgt_handler.c:2735:tgt_brw_write()) cfs_fail_timeout id 214 sleeping for 20000ms [ 5362.336331] LustreError: 51932:0:(tgt_handler.c:2735:tgt_brw_write()) cfs_fail_timeout id 214 awake [ 5377.503610] Lustre: DEBUG MARKER: == recovery-small test 115a: read: late REQ MDunlink and no bulk ========================================================== 11:05:30 (1773673530) [ 5394.043130] Lustre: DEBUG MARKER: == recovery-small test 115b: write: late REQ MDunlink and no bulk ========================================================== 11:05:47 (1773673547) [ 5398.868813] Lustre: *** cfs_fail_loc=215, val=2*** [ 5398.874413] Lustre: Skipped 39 previous similar messages [ 5412.675928] Lustre: DEBUG MARKER: == recovery-small test 115c: read: late Reply MDunlink and no bulk ========================================================== 11:06:06 (1773673566) [ 5429.413520] Lustre: DEBUG MARKER: == recovery-small test 115d: write: late Reply MDunlink and no bulk ========================================================== 11:06:22 (1773673582) [ 5446.252920] Lustre: DEBUG MARKER: == recovery-small test 115e: read: late Bulk MDunlink and no reply ========================================================== 11:06:39 (1773673599) [ 5461.258157] Lustre: DEBUG MARKER: == recovery-small test 115f: read: late REQ MDunlink and no reply ========================================================== 11:06:54 (1773673614) [ 5465.940234] Lustre: *** cfs_fail_loc=211, val=2147483648*** [ 5465.956524] LustreError: 51928:0:(ldlm_lib.c:3325:target_send_reply_msg()) @@@ dropping reply req@ffff8dcb65dd8700 x1859825979803136/t0(0) o3->4c8f0584-8ea1-4eae-a656-ec0012913222@192.168.201.39@tcp:492/0 lens 488/4536 e 0 to 0 dl 1773673672 ref 1 fl Interpret:/600/0 rc 4096/0 job:'dd.0' uid:0 gid:0 projid:0 [ 5521.426806] Lustre: lustre-OST0001: Client 4c8f0584-8ea1-4eae-a656-ec0012913222 (at 192.168.201.39@tcp) reconnecting [ 5521.463420] Lustre: Skipped 3 previous similar messages [ 5534.387557] Lustre: DEBUG MARKER: == recovery-small test 115g: read: late REQ MDunlink and Reply MDunlink ========================================================== 11:08:07 (1773673687) [ 5606.242861] Lustre: DEBUG MARKER: == recovery-small test 120: flock race: completion vs. evict ========================================================== 11:09:20 (1773673760) [ 5610.395659] Lustre: 76006:0:(genops.c:1793:obd_export_evict_by_uuid()) lustre-MDT0000: evicting 4c8f0584-8ea1-4eae-a656-ec0012913222 at adminstrative request [ 5617.082283] Lustre: 76060:0:(genops.c:1793:obd_export_evict_by_uuid()) lustre-MDT0000: evicting 4c8f0584-8ea1-4eae-a656-ec0012913222 at adminstrative request [ 5625.223059] Lustre: 76112:0:(genops.c:1793:obd_export_evict_by_uuid()) lustre-MDT0000: evicting 4c8f0584-8ea1-4eae-a656-ec0012913222 at adminstrative request [ 5629.425822] Lustre: 76161:0:(genops.c:1793:obd_export_evict_by_uuid()) lustre-MDT0000: evicting 4c8f0584-8ea1-4eae-a656-ec0012913222 at adminstrative request [ 5642.510159] Lustre: 76240:0:(genops.c:1793:obd_export_evict_by_uuid()) lustre-MDT0000: evicting 4c8f0584-8ea1-4eae-a656-ec0012913222 at adminstrative request [ 5671.892855] Lustre: 76468:0:(genops.c:1793:obd_export_evict_by_uuid()) lustre-MDT0000: evicting 4c8f0584-8ea1-4eae-a656-ec0012913222 at adminstrative request [ 5671.920078] Lustre: 76468:0:(genops.c:1793:obd_export_evict_by_uuid()) Skipped 3 previous similar messages [ 5683.528487] Lustre: DEBUG MARKER: == recovery-small test 113: ldlm enqueue dropped reply should not cause deadlocks ========================================================== 11:10:36 (1773673836) [ 5686.111230] Lustre: *** cfs_fail_loc=157, val=2147483648*** [ 5686.126370] LustreError: 74369:0:(ldlm_lib.c:3325:target_send_reply_msg()) @@@ dropping reply req@ffff8dcb6ef77100 x1859825979869440/t0(0) o101->4c8f0584-8ea1-4eae-a656-ec0012913222@192.168.201.39@tcp:673/0 lens 576/688 e 0 to 0 dl 1773673853 ref 1 fl Interpret:/600/0 rc 0/0 job:'stat.0' uid:0 gid:0 projid:0 [ 5722.442460] Lustre: DEBUG MARKER: == recovery-small test 130a: enqueue resend on not existing file ========================================================== 11:11:16 (1773673876) [ 5725.273720] LustreError: 73257:0:(mdt_handler.c:5427:mdt_intent_opc()) cfs_fail_timeout id 160 sleeping for 10000ms [ 5735.336456] LustreError: 73257:0:(mdt_handler.c:5427:mdt_intent_opc()) cfs_fail_timeout id 160 awake [ 5735.347425] Lustre: *** cfs_fail_loc=157, val=2147483648*** [ 5735.375067] LustreError: 73257:0:(ldlm_lib.c:3325:target_send_reply_msg()) @@@ dropping reply req@ffff8dcb669a1f80 x1859825979878528/t0(0) o101->4c8f0584-8ea1-4eae-a656-ec0012913222@192.168.201.39@tcp:737/0 lens 576/584 e 1 to 0 dl 1773673917 ref 1 fl Interpret:/600/0 rc 301/0 job:'stat.0' uid:0 gid:0 projid:0 [ 5776.941806] Lustre: DEBUG MARKER: == recovery-small test 130b: enqueue resend on a stale inode ========================================================== 11:12:10 (1773673930) [ 5779.983669] LustreError: 73254:0:(mdt_handler.c:5427:mdt_intent_opc()) cfs_fail_timeout id 160 sleeping for 10000ms [ 5790.048722] LustreError: 73254:0:(mdt_handler.c:5427:mdt_intent_opc()) cfs_fail_timeout id 160 awake [ 5790.060565] Lustre: *** cfs_fail_loc=157, val=2147483648*** [ 5790.071778] LustreError: 73254:0:(ldlm_lib.c:3325:target_send_reply_msg()) @@@ dropping reply req@ffff8dcb46265c00 x1859825979885568/t0(0) o101->4c8f0584-8ea1-4eae-a656-ec0012913222@192.168.201.39@tcp:51/0 lens 576/584 e 0 to 0 dl 1773673986 ref 1 fl Interpret:/600/0 rc 301/0 job:'stat.0' uid:0 gid:0 projid:0 [ 5835.811618] Lustre: *** cfs_fail_loc=217, val=0*** [ 5848.213677] Lustre: DEBUG MARKER: == recovery-small test 130c: layout intent resend on a stale inode ========================================================== 11:13:21 (1773674001) [ 5853.201183] LustreError: 73256:0:(mdt_handler.c:5427:mdt_intent_opc()) cfs_fail_timeout id 160 sleeping for 10000ms [ 5863.248149] LustreError: 73256:0:(mdt_handler.c:5427:mdt_intent_opc()) cfs_fail_timeout id 160 awake [ 5891.575292] Lustre: DEBUG MARKER: == recovery-small test 132: long punch =================== 11:14:05 (1773674045) [ 5936.096209] Lustre: ll_ost_io00_003: service thread pid 12292 was inactive for 41.392 seconds. The thread might be hung, or it might only be slow and will resume later. Dumping the stack trace for debugging purposes: [ 5936.122353] task:ll_ost_io00_003 state:D stack:0 pid:12292 ppid:2 flags:0x80004000 [ 5936.136256] Call Trace: [ 5936.147626] __schedule+0x351/0xcb0 [ 5936.167551] schedule+0xc0/0x180 [ 5936.174266] schedule_timeout+0xb4/0x190 [ 5936.181219] ? __next_timer_interrupt+0x160/0x160 [ 5936.188661] ? kvm_clock_get_cycles+0x2c/0x50 [ 5936.192680] ? ktime_get+0x65/0x110 [ 5936.201510] schedule_timeout_uninterruptible+0x2d/0x40 [ 5936.206816] __cfs_fail_timeout_set+0x13b/0x240 [libcfs] [ 5936.218488] ofd_punch_hdl+0x4ec/0xbc0 [ofd] [ 5936.226870] tgt_handle_request0+0x137/0xaf0 [ptlrpc] [ 5936.236647] tgt_request_handle+0x573/0x1e70 [ptlrpc] [ 5936.250149] ptlrpc_server_handle_request+0x443/0x13b0 [ptlrpc] [ 5936.274058] ? lprocfs_counter_add+0x15b/0x210 [obdclass] [ 5936.287419] ptlrpc_main+0xce8/0x1400 [ptlrpc] [ 5936.299281] ? ptlrpc_wait_event+0x690/0x690 [ptlrpc] [ 5936.305733] kthread+0x1d1/0x200 [ 5936.312519] ? set_kthread_struct+0x70/0x70 [ 5936.326301] ret_from_fork+0x1f/0x30 [ 6014.776906] LustreError: 12292:0:(ofd_dev.c:2175:ofd_punch_hdl()) cfs_fail_timeout id 236 awake [ 6014.792414] Lustre: ll_ost_io00_003: service thread pid 12292 completed after 120.088s. This likely indicates the system was overloaded (too many service threads, or not enough hardware resources). [ 6030.906831] Lustre: DEBUG MARKER: == recovery-small test 131: IO vs evict results to IO under staled lock ========================================================== 11:16:24 (1773674184) [ 6037.186978] Lustre: 78501:0:(genops.c:1793:obd_export_evict_by_uuid()) lustre-OST0000: evicting 4c8f0584-8ea1-4eae-a656-ec0012913222 at adminstrative request [ 6037.213033] LustreError: 6559:0:(ldlm_lockd.c:2933:ldlm_bl_thread_exports()) cfs_fail_timeout id 31e sleeping for 4000ms [ 6037.230111] LustreError: 6559:0:(ldlm_lockd.c:2933:ldlm_bl_thread_exports()) Skipped 1 previous similar message [ 6040.921328] Lustre: DEBUG MARKER: recovery-small test_131: @@@@@@ FAIL: dd succeeded [ 6070.784515] Lustre: DEBUG MARKER: == recovery-small test 133: don't fail on flock resend === 11:17:04 (1773674224) [ 6075.579562] Lustre: *** cfs_fail_loc=157, val=2147483648*** [ 6075.587249] LustreError: 73256:0:(ldlm_lib.c:3325:target_send_reply_msg()) @@@ dropping reply req@ffff8dcb7cb50380 x1859825979935360/t0(0) o101->4c8f0584-8ea1-4eae-a656-ec0012913222@192.168.201.39@tcp:346/0 lens 328/344 e 0 to 0 dl 1773674281 ref 1 fl Interpret:/600/0 rc 0/0 job:'multiop.0' uid:0 gid:0 projid:0 [ 6130.707127] Lustre: lustre-MDT0000: Client 4c8f0584-8ea1-4eae-a656-ec0012913222 (at 192.168.201.39@tcp) reconnecting [ 6130.722470] Lustre: Skipped 4 previous similar messages [ 6143.013878] Lustre: DEBUG MARKER: == recovery-small test 134: race between failover and search for reply data free slot ========================================================== 11:18:16 (1773674296) [ 6145.661080] Lustre: DEBUG MARKER: SKIP: recovery-small test_134 Need 2+ clients, have 1 [ 6149.142624] Lustre: DEBUG MARKER: == recovery-small test 135: DOM: open/create resend to return size ========================================================== 11:18:22 (1773674302) [ 6151.613605] Lustre: *** cfs_fail_loc=157, val=2147483648*** [ 6151.624851] LustreError: 73254:0:(ldlm_lib.c:3325:target_send_reply_msg()) @@@ dropping reply req@ffff8dca497dd180 x1859825979945344/t103079215281(0) o101->4c8f0584-8ea1-4eae-a656-ec0012913222@192.168.201.39@tcp:422/0 lens 648/720 e 0 to 0 dl 1773674357 ref 1 fl Interpret:/600/0 rc 301/0 job:'openfile.0' uid:0 gid:0 projid:0 [ 6206.521496] Lustre: 73567:0:(mdt_recovery.c:102:mdt_req_from_lrd()) @@@ restoring transno req@ffff8dcb7ad62680 x1859825979945344/t103079215281(0) o101->4c8f0584-8ea1-4eae-a656-ec0012913222@192.168.201.39@tcp:477/0 lens 648/3488 e 0 to 0 dl 1773674412 ref 1 fl Interpret:/602/0 rc 0/0 job:'openfile.0' uid:0 gid:0 projid:0 [ 6220.634391] Lustre: DEBUG MARKER: SKIP: recovery-small test_136 skipping excluded test 136 [ 6223.943789] Lustre: DEBUG MARKER: == recovery-small test 137: late resend must be skipped if already applied ========================================================== 11:19:37 (1773674377) [ 6227.405612] LustreError: 73256:0:(mdt_reint.c:979:mdt_reint_setattr()) cfs_race id 525 sleeping [ 6232.544402] LustreError: 73256:0:(mdt_reint.c:979:mdt_reint_setattr()) cfs_fail_race id 525 awake: rc=0 [ 6232.643571] LustreError: 73256:0:(mdt_reint.c:979:mdt_reint_setattr()) cfs_fail_race id 525 waking [ 6250.968308] Lustre: DEBUG MARKER: == recovery-small test 138: Umount MDT during recovery === 11:20:04 (1773674404) [ 6254.163720] Lustre: DEBUG MARKER: SKIP: recovery-small test_138 needs >= 2 MDTs [ 6256.864283] Lustre: DEBUG MARKER: == recovery-small test 139: corrupted catid won't cause crash ========================================================== 11:20:10 (1773674410) [ 6259.803949] Lustre: DEBUG MARKER: SKIP: recovery-small test_139 needs >= 2 MDTs [ 6262.615590] Lustre: DEBUG MARKER: == recovery-small test 140a: local mount is flagged properly ========================================================== 11:20:16 (1773674416) [ 6269.129313] Lustre: lustre-MDT0000: local client 1b74d887-3363-4b1f-a88f-9057dd2d9266 w/o recovery [ 6269.173517] Lustre: Mounted lustre-client [ 6275.172365] Lustre: DEBUG MARKER: oleg139-server.virtnet: executing set_default_debug -1 all [ 6286.900794] LustreError: 80998:0:(lov_obd.c:783:lov_cleanup()) lustre-clilov-ffff8dca48532000: lov tgt 0 not cleaned! deathrow=0, lovrc=1 [ 6287.011437] Lustre: Unmounted lustre-client [ 6293.119977] Lustre: Mounted lustre-client [ 6299.659488] Lustre: DEBUG MARKER: oleg139-server.virtnet: executing set_default_debug -1 all [ 6310.090473] LustreError: 81595:0:(lov_obd.c:783:lov_cleanup()) lustre-clilov-ffff8dca48046000: lov tgt 0 not cleaned! deathrow=0, lovrc=1 [ 6310.113529] LustreError: 81595:0:(lov_obd.c:783:lov_cleanup()) Skipped 1 previous similar message [ 6310.163141] Lustre: Unmounted lustre-client [ 6323.752863] Lustre: DEBUG MARKER: == recovery-small test 140b: local mount is excluded from recovery ========================================================== 11:21:16 (1773674476) [ 6329.447317] Lustre: lustre-MDT0000: local client 85b6f4ae-2196-405c-a8ba-c980236e8986 w/o recovery [ 6329.494463] Lustre: Mounted lustre-client [ 6335.333345] Lustre: DEBUG MARKER: oleg139-server.virtnet: executing set_default_debug -1 all [ 6339.918374] LustreError: 82437:0:(osd_handler.c:720:osd_ro()) lustre-MDT0000: *** setting device osd-zfs read-only *** [ 6341.551879] Lustre: DEBUG MARKER: mds1 REPLAY BARRIER on lustre-MDT0000 [ 6350.352303] LustreError: 82634:0:(lov_obd.c:783:lov_cleanup()) lustre-clilov-ffff8dca4795b000: lov tgt 0 not cleaned! deathrow=0, lovrc=1 [ 6350.394529] LustreError: 82634:0:(lov_obd.c:783:lov_cleanup()) Skipped 1 previous similar message [ 6350.488201] Lustre: Unmounted lustre-client [ 6355.318797] Lustre: Failing over lustre-MDT0000 [ 6355.434120] Lustre: lustre-MDT0000-lwp-OST0000: Connection to lustre-MDT0000 (at 0@lo) was lost; in progress operations using this service will wait for recovery to complete [ 6355.449685] Lustre: lustre-MDT0000: Not available for connect from 0@lo (stopping) [ 6355.478830] Lustre: Skipped 12 previous similar messages [ 6355.506438] Lustre: Skipped 2 previous similar messages [ 6356.125247] Lustre: server umount lustre-MDT0000 complete [ 6375.914429] Lustre: 3296:0:(client.c:2478:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1773674516/real 1773674516] req@ffff8dca41b48700 x1859825952418304/t0(0) o400->MGC192.168.201.139@tcp@0@lo:26/25 lens 224/224 e 0 to 1 dl 1773674532 ref 1 fl Rpc:XNQr/200/ffffffff rc 0/-1 job:'kworker.0' uid:0 gid:0 projid:4294967295 [ 6375.972891] Lustre: 3296:0:(client.c:2478:ptlrpc_expire_one_request()) Skipped 27 previous similar messages [ 6375.999769] LustreError: MGC192.168.201.139@tcp: Connection to MGS (at 0@lo) was lost; in progress operations using this service will fail [ 6376.032812] LustreError: Skipped 4 previous similar messages [ 6387.703922] Lustre: lustre-MDT0000: Imperative Recovery not enabled, recovery window 60-180 [ 6387.710681] Lustre: Skipped 6 previous similar messages [ 6388.023908] Lustre: lustre-MDT0000: in recovery but waiting for the first client to connect [ 6388.051436] Lustre: Skipped 6 previous similar messages [ 6394.874633] Lustre: lustre-MDT0000-lwp-OST0001: Connection restored to 0@lo (at 0@lo) [ 6394.896946] Lustre: Skipped 11 previous similar messages [ 6395.448646] Lustre: DEBUG MARKER: oleg139-server.virtnet: executing set_default_debug -1 all [ 6395.473487] Lustre: lustre-MDT0000: Will be in recovery for at least 1:00, or until 1 client reconnects [ 6395.507476] Lustre: Skipped 6 previous similar messages [ 6395.697228] Lustre: lustre-MDT0000: Recovery over after 0:01, of 1 clients 1 recovered and 0 were evicted. [ 6395.716763] Lustre: Skipped 6 previous similar messages [ 6395.786429] Lustre: lustre-OST0001: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x280000400:28718 to 0x280000400:28737) [ 6395.789018] Lustre: lustre-OST0000: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x240000400:26992 to 0x240000400:27009) [ 6408.670623] Lustre: DEBUG MARKER: oleg139-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid [ 6411.747898] Lustre: DEBUG MARKER: mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec [ 6427.782453] Lustre: DEBUG MARKER: == recovery-small test 141: do not lose locks on MGS restart ========================================================== 11:23:01 (1773674581) [ 6431.296482] Lustre: DEBUG MARKER: SKIP: recovery-small test_141 cannot run in local mode or from build tree [ 6434.694341] Lustre: DEBUG MARKER: == recovery-small test 142: orphan name stub can be cleaned up in startup ========================================================== 11:23:08 (1773674588) [ 6436.662383] Lustre: *** cfs_fail_loc=165, val=0*** [ 6439.556105] Lustre: Failing over lustre-MDT0000 [ 6440.429671] Lustre: server umount lustre-MDT0000 complete [ 6452.498906] LustreError: MGC192.168.201.139@tcp: Connection to MGS (at 0@lo) was lost; in progress operations using this service will fail [ 6453.158295] Lustre: lustre-MDT0000-lwp-OST0000: Connection to lustre-MDT0000 (at 0@lo) was lost; in progress operations using this service will wait for recovery to complete [ 6453.195866] Lustre: Skipped 1 previous similar message [ 6457.135836] Lustre: lustre-OST0001: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x280000400:28718 to 0x280000400:28769) [ 6457.143431] Lustre: lustre-OST0000: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x240000400:27011 to 0x240000400:27041) [ 6462.253741] Lustre: DEBUG MARKER: oleg139-server.virtnet: executing set_default_debug -1 all [ 6462.368396] Lustre: 3297:0:(client.c:2478:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1773674597/real 1773674597] req@ffff8dca41b4a300 x1859825952438144/t0(0) o400->lustre-MDT0000-lwp-OST0000@0@lo:12/10 lens 224/224 e 0 to 1 dl 1773674618 ref 1 fl Rpc:XNQr/200/ffffffff rc 0/-1 job:'kworker.0' uid:0 gid:0 projid:4294967295 [ 6477.670585] Lustre: DEBUG MARKER: == recovery-small test 143: orphan cleanup thread shouldn't be blocked even delete failed ========================================================== 11:23:51 (1773674631) [ 6480.178049] Lustre: Failing over lustre-MDT0000 [ 6480.691276] Lustre: server umount lustre-MDT0000 complete [ 6525.347788] LustreError: 3295:0:(client.c:1390:ptlrpc_import_delay_req()) @@@ invalidate in flight req@ffff8dcb66a6ed80 x1859825952454912/t0(0) o250->MGC192.168.201.139@tcp@0@lo:26/25 lens 520/544 e 0 to 0 dl 0 ref 1 fl Rpc:NQU/200/ffffffff rc 0/-1 job:'kworker.0' uid:0 gid:0 projid:4294967295 [ 6526.647991] Lustre: lustre-MDT0000: Imperative Recovery not enabled, recovery window 60-180 [ 6526.654659] Lustre: Skipped 1 previous similar message [ 6526.816940] Lustre: lustre-MDT0000: in recovery but waiting for the first client to connect [ 6526.834124] Lustre: Skipped 1 previous similar message [ 6528.593779] Lustre: lustre-MDT0000: Will be in recovery for at least 1:00, or until 1 client reconnects [ 6528.611789] Lustre: Skipped 1 previous similar message [ 6528.848279] Lustre: lustre-MDT0000: Recovery over after 0:01, of 1 clients 1 recovered and 0 were evicted. [ 6528.863628] Lustre: Skipped 1 previous similar message [ 6528.977762] Lustre: lustre-OST0001: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x280000400:28718 to 0x280000400:28801) [ 6528.978993] Lustre: lustre-OST0000: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x240000400:27011 to 0x240000400:27073) [ 6528.985915] LustreError: 86638:0:(mdd_orphans.c:441:mdd_orphan_index_iterate()) lustre-MDD0000: bad FID [0x0:0x0:0x0] cleaning 'PENDING' [ 6529.601880] Lustre: lustre-MDT0000-lwp-OST0001: Connection restored to 0@lo (at 0@lo) [ 6529.617324] Lustre: Skipped 3 previous similar messages [ 6534.547873] Lustre: DEBUG MARKER: oleg139-server.virtnet: executing set_default_debug -1 all [ 6543.785529] Lustre: DEBUG MARKER: oleg139-server.virtnet: executing _wait_recovery_complete *.lustre-MDT0000.recovery_status 1475 [ 6558.061821] Lustre: DEBUG MARKER: == recovery-small test 144a: MDT failover should stop precreation threads ========================================================== 11:25:11 (1773674711) [ 6570.749327] LustreError: lustre-OST0000-osc-MDT0000: operation ost_create to node 0@lo failed: rc = -19 [ 6570.755816] Lustre: Failing over lustre-OST0000 [ 6573.805842] Lustre: server umount lustre-OST0000 complete [ 6574.600546] LustreError: 87511:0:(ldlm_lib.c:1178:target_handle_connect()) lustre-OST0000: not available for connect from 192.168.201.39@tcp (no target). If you are running an HA pair check that the target is mounted on the other server. [ 6574.649086] LustreError: 87511:0:(ldlm_lib.c:1178:target_handle_connect()) Skipped 2 previous similar messages [ 6619.671768] Lustre: DEBUG MARKER: oleg139-server.virtnet: executing set_default_debug -1 all [ 6652.703610] Lustre: DEBUG MARKER: oleg139-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid [ 6657.945582] Lustre: DEBUG MARKER: osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec [ 6727.400367] Lustre: Failing over lustre-MDT0000 [ 6729.659685] Lustre: server umount lustre-MDT0000 complete [ 6746.080341] Lustre: 3297:0:(client.c:2478:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1773674886/real 1773674886] req@ffff8dca59d6e300 x1859825952697856/t0(0) o400->MGC192.168.201.139@tcp@0@lo:26/25 lens 224/224 e 0 to 1 dl 1773674902 ref 1 fl Rpc:XNQr/200/ffffffff rc 0/-1 job:'kworker.0' uid:0 gid:0 projid:4294967295 [ 6746.151503] Lustre: 3297:0:(client.c:2478:ptlrpc_expire_one_request()) Skipped 14 previous similar messages [ 6746.163311] LustreError: MGC192.168.201.139@tcp: Connection to MGS (at 0@lo) was lost; in progress operations using this service will fail [ 6746.183385] LustreError: Skipped 1 previous similar message [ 6755.858630] Lustre: MGS: Client 80858b98-1e85-4253-bd78-ba77b114b5a5 (at 0@lo) reconnecting [ 6755.865339] Lustre: Skipped 2 previous similar messages [ 6755.887401] Lustre: MGC192.168.201.139@tcp: Connection restored to 0@lo (at 0@lo) [ 6755.897446] Lustre: Skipped 2 previous similar messages [ 6756.854300] Lustre: lustre-MDT0000-lwp-OST0000: Connection to lustre-MDT0000 (at 0@lo) was lost; in progress operations using this service will wait for recovery to complete [ 6756.881915] Lustre: Skipped 4 previous similar messages [ 6757.396754] Lustre: lustre-MDT0000: Imperative Recovery not enabled, recovery window 60-180 [ 6757.403821] Lustre: Skipped 1 previous similar message [ 6757.587934] Lustre: lustre-MDT0000: in recovery but waiting for the first client to connect [ 6757.606501] Lustre: Skipped 1 previous similar message [ 6757.918992] Lustre: lustre-MDT0000: Will be in recovery for at least 1:00, or until 1 client reconnects [ 6757.938919] Lustre: Skipped 1 previous similar message [ 6758.217586] Lustre: lustre-MDT0000: Recovery over after 0:01, of 1 clients 1 recovered and 0 were evicted. [ 6758.231955] Lustre: Skipped 1 previous similar message [ 6758.296968] Lustre: lustre-OST0001: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x280000400:53770 to 0x280000400:53793) [ 6758.302316] Lustre: lustre-OST0000: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x240000400:52106 to 0x240000400:52129) [ 6758.304379] LustreError: 89256:0:(mdd_orphans.c:441:mdd_orphan_index_iterate()) lustre-MDD0000: bad FID [0x0:0x0:0x0] cleaning 'PENDING' [ 6765.612778] Lustre: DEBUG MARKER: oleg139-server.virtnet: executing set_default_debug -1 all [ 6776.397798] Lustre: DEBUG MARKER: oleg139-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid [ 6779.140890] Lustre: DEBUG MARKER: mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec [ 6785.522379] Lustre: Failing over lustre-MDT0000 [ 6786.231382] Lustre: server umount lustre-MDT0000 complete [ 6814.689990] LustreError: 3295:0:(client.c:1390:ptlrpc_import_delay_req()) @@@ invalidate in flight req@ffff8dca64385180 x1859825952716544/t0(0) o250->MGC192.168.201.139@tcp@0@lo:26/25 lens 520/544 e 0 to 0 dl 0 ref 1 fl Rpc:NQU/200/ffffffff rc 0/-1 job:'kworker.0' uid:0 gid:0 projid:4294967295 [ 6819.622568] LustreError: 90356:0:(mdd_orphans.c:441:mdd_orphan_index_iterate()) lustre-MDD0000: bad FID [0x0:0x0:0x0] cleaning 'PENDING' [ 6819.625239] Lustre: lustre-OST0000: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x240000400:52106 to 0x240000400:52161) [ 6819.627394] Lustre: lustre-OST0001: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x280000400:53770 to 0x280000400:53825) [ 6824.000983] Lustre: DEBUG MARKER: oleg139-server.virtnet: executing set_default_debug -1 all [ 6836.250798] Lustre: DEBUG MARKER: oleg139-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid [ 6838.656576] Lustre: DEBUG MARKER: mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec [ 7049.603684] Lustre: DEBUG MARKER: == recovery-small test 144b: orphan cleanup shouldn't be blocked for no objects+failover situation ========================================================== 11:33:17 (1773675197) [ 7063.394788] Lustre: Failing over lustre-OST0000 [ 7063.418036] LustreError: lustre-OST0000-osc-MDT0000: operation ost_create to node 0@lo failed: rc = -19 [ 7063.438030] Lustre: lustre-OST0000-osc-MDT0000: Connection to lustre-OST0000 (at 0@lo) was lost; in progress operations using this service will wait for recovery to complete [ 7063.468975] Lustre: Skipped 2 previous similar messages [ 7064.705591] Lustre: server umount lustre-OST0000 complete [ 7065.109395] LustreError: 88126:0:(ldlm_lib.c:1178:target_handle_connect()) lustre-OST0000: not available for connect from 192.168.201.39@tcp (no target). If you are running an HA pair check that the target is mounted on the other server. [ 7065.163838] LustreError: 88126:0:(ldlm_lib.c:1178:target_handle_connect()) Skipped 5 previous similar messages [ 7088.935570] Lustre: lustre-OST0000: Imperative Recovery not enabled, recovery window 60-180 [ 7088.942373] Lustre: Skipped 1 previous similar message [ 7088.957914] Lustre: lustre-OST0000: in recovery but waiting for the first client to connect [ 7088.966477] Lustre: Skipped 1 previous similar message [ 7090.339782] Lustre: lustre-OST0000: Will be in recovery for at least 1:00, or until 2 clients reconnect [ 7090.370128] Lustre: Skipped 1 previous similar message [ 7090.839522] Lustre: lustre-OST0000: Recovery over after 0:01, of 2 clients 2 recovered and 0 were evicted. [ 7090.859761] Lustre: Skipped 1 previous similar message [ 7090.872202] Lustre: lustre-OST0000-osc-MDT0000: Connection restored to 0@lo (at 0@lo) [ 7090.880856] Lustre: Skipped 4 previous similar messages [ 7111.669203] Lustre: DEBUG MARKER: oleg139-server.virtnet: executing set_default_debug -1 all [ 7138.375873] Lustre: DEBUG MARKER: oleg139-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid [ 7144.649536] Lustre: DEBUG MARKER: osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec [ 7150.791464] Lustre: lustre-OST0001-osc-MDT0000: update sequence from 0x280000400 to 0x280000401 [ 7159.770572] Lustre: lustre-OST0000-osc-MDT0000: update sequence from 0x240000400 to 0x240000401 [ 7458.850865] Lustre: DEBUG MARKER: == recovery-small test 144c: reconnection during orphan cleanup shouldn't lose LAST_ID synchronization ========================================================== 11:40:07 (1773675607) [ 7657.095639] Lustre: Failing over lustre-MDT0000 [ 7657.451766] Lustre: lustre-MDT0000: Not available for connect from 0@lo (stopping) [ 7657.454639] Lustre: Skipped 2 previous similar messages [ 7659.539036] Lustre: lustre-MDT0000: Not available for connect from 192.168.201.39@tcp (stopping) [ 7660.759692] Lustre: server umount lustre-MDT0000 complete [ 7673.010144] LustreError: MGC192.168.201.139@tcp: Connection to MGS (at 0@lo) was lost; in progress operations using this service will fail [ 7673.025259] LustreError: Skipped 2 previous similar messages [ 7679.635116] Lustre: DEBUG MARKER: oleg139-server.virtnet: executing set_default_debug -1 all [ 7685.763092] LustreError: 88662:0:(ofd_dev.c:1616:ofd_create_hdl()) cfs_fail_timeout id 254 sleeping for 5000ms [ 7685.768253] LustreError: 94104:0:(mdd_orphans.c:441:mdd_orphan_index_iterate()) lustre-MDD0000: bad FID [0x0:0x0:0x0] cleaning 'PENDING' [ 7685.773743] LustreError: 88662:0:(ofd_dev.c:1616:ofd_create_hdl()) Skipped 1 previous similar message [ 7688.276540] Lustre: DEBUG MARKER: oleg139-server.virtnet: executing _wait_recovery_complete *.lustre-MDT0000.recovery_status 1475 [ 7690.771167] LustreError: 88662:0:(ofd_dev.c:1616:ofd_create_hdl()) cfs_fail_timeout id 254 awake [ 7690.783548] LustreError: 88662:0:(ofd_dev.c:1616:ofd_create_hdl()) Skipped 1 previous similar message [ 7690.819982] Lustre: lustre-OST0001: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x280000401:13259 to 0x280000401:13313) [ 7690.848347] Lustre: lustre-OST0000: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x240000401:20659 to 0x240000401:20737) [ 7691.067089] Lustre: lustre-OST0000-osc-MDT0000: Connection to lustre-OST0000 (at 0@lo) was lost; in progress operations using this service will wait for recovery to complete [ 7691.090529] Lustre: Skipped 2 previous similar messages [ 7691.107809] Lustre: lustre-OST0000: Client lustre-MDT0000-mdtlov_UUID (at 0@lo) reconnecting [ 7691.130692] Lustre: lustre-OST0000-osc-MDT0000: Connection restored to 0@lo (at 0@lo) [ 7691.142054] Lustre: Skipped 2 previous similar messages [ 7713.870528] Lustre: DEBUG MARKER: == recovery-small test 145: connect mdtlovs and process update logs after recovery expire ========================================================== 11:44:28 (1773675868) [ 7715.843965] Lustre: DEBUG MARKER: SKIP: recovery-small test_145 needs >= 3 MDTs [ 7718.284963] Lustre: DEBUG MARKER: == recovery-small test 146: test eviction is counted properly ========================================================== 11:44:32 (1773675872) [ 7721.136256] Lustre: 94761:0:(genops.c:1793:obd_export_evict_by_uuid()) lustre-MDT0000: evicting 4c8f0584-8ea1-4eae-a656-ec0012913222 at adminstrative request [ 7731.949938] Lustre: DEBUG MARKER: == recovery-small test 147: Check client reconnect ======= 11:44:45 (1773675885) [ 7734.758551] Lustre: 95108:0:(genops.c:1793:obd_export_evict_by_uuid()) lustre-OST0000: evicting 4c8f0584-8ea1-4eae-a656-ec0012913222 at adminstrative request [ 7734.910701] Lustre: *** cfs_fail_loc=225, val=0*** [ 7749.140473] Lustre: *** cfs_fail_loc=225, val=0*** [ 7769.606063] Lustre: *** cfs_fail_loc=225, val=0*** [ 7794.184299] Lustre: *** cfs_fail_loc=225, val=0*** [ 7859.717882] Lustre: *** cfs_fail_loc=225, val=0*** [ 7859.722820] Lustre: Skipped 1 previous similar message [ 7902.011646] Lustre: DEBUG MARKER: == recovery-small test 148: data corruption through resend ========================================================== 11:47:36 (1773676056) [ 7908.228363] LustreError: 12292:0:(tgt_handler.c:2907:tgt_brw_write()) cfs_fail_timeout id 227 sleeping for 27000ms [ 7935.304476] LustreError: 12292:0:(tgt_handler.c:2907:tgt_brw_write()) cfs_fail_timeout id 227 awake [ 7935.312384] LustreError: 12292:0:(tgt_handler.c:2907:tgt_brw_write()) Skipped 1 previous similar message [ 7949.526544] Lustre: DEBUG MARKER: == recovery-small test 149: skip orphan removal at umount ========================================================== 11:48:23 (1773676103) [ 7951.581713] Lustre: DEBUG MARKER: SKIP: recovery-small test_149 needs >= 2 MDTs [ 7954.003530] Lustre: DEBUG MARKER: == recovery-small test 150: statfs when MDT0 offline with lazystatfs option ========================================================== 11:48:28 (1773676108) [ 7956.252664] Lustre: DEBUG MARKER: SKIP: recovery-small test_150 needs >= 2 MDTs [ 7958.605872] Lustre: DEBUG MARKER: == recovery-small test 152: QoS object allocation could be awakened in case of OST failover ========================================================== 11:48:32 (1773676112) [ 7966.748527] ODEBUG: object 000000005201c749 is on stack 00000000c4d29f37, but NOT annotated. [ 7966.757615] WARNING: CPU: 3 PID: 94307 at lib/debugobjects.c:368 __debug_object_init.cold.5+0x35/0x15f [ 7966.770718] Modules linked in: lustre(O) osp(O) ofd(O) lod(O) mdt(O) mdd(O) mgs(O) osd_zfs(O) lquota(O) lfsck(O) obdecho(O) mgc(O) mdc(O) lov(O) osc(O) lmv(O) ec(O) fid(O) fld(O) ptlrpc_gss(O) ptlrpc(O) obdclass(O) ksocklnd(O) lnet(O) libcfs(O) zfs(O) spl(O) rpcsec_gss_krb5 auth_rpcgss nfsv4 dns_resolver intel_rapl_msr intel_rapl_common sb_edac rapl pcspkr i2c_piix4 squashfs crct10dif_pclmul crc32_pclmul crc32c_intel ata_generic ata_piix ghash_clmulni_intel serio_raw libata dm_mirror dm_region_hash dm_log dm_mod sha512_ssse3 sha512_generic [ 7966.832385] CPU: 3 PID: 94307 Comm: mdt00_003 Kdump: loaded Tainted: G O -------- - - 4.18.0rh8.10-debug #2 [ 7966.839494] Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS 1.17.0-8.fc42 06/10/2025 [ 7966.845370] RIP: 0010:__debug_object_init.cold.5+0x35/0x15f [ 7966.849207] Code: be bc 48 83 05 23 61 0c 03 01 89 05 59 69 0c 03 65 48 8b 04 25 00 dd 01 00 48 8b 50 18 e8 93 68 99 ff 48 83 05 1b 61 0c 03 01 <0f> 0b 48 83 05 19 61 0c 03 01 48 83 05 19 61 0c 03 01 e9 3f ee ff [ 7966.860158] RSP: 0018:ffff9a0242d974a0 EFLAGS: 00010002 [ 7966.864412] RAX: 0000000000000050 RBX: ffff9a0242d975a8 RCX: 0000000000000000 [ 7966.872486] RDX: 0000000000000000 RSI: ffff8dcb8219e5a8 RDI: ffff8dcb8219e5a8 [ 7966.879380] RBP: ffffffffbd306ae0 R08: 0000000000000000 R09: c0000000ffff7fff [ 7966.884955] R10: 0000000000000001 R11: ffff9a0242d97298 R12: ffffffffbeb39f88 [ 7966.889335] R13: 000000000004c7a0 R14: ffffffffbeb39f80 R15: ffff8dcb515556b8 [ 7966.896045] FS: 0000000000000000(0000) GS:ffff8dcb82180000(0000) knlGS:0000000000000000 [ 7966.899450] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 [ 7966.903434] CR2: 00007fe3c497b040 CR3: 000000001fa16004 CR4: 0000000000170ee0 [ 7966.909069] Call Trace: [ 7966.911571] ? show_regs.cold.9+0x22/0x2f [ 7966.914249] ? __warn+0xc8/0x150 [ 7966.915941] ? __debug_object_init.cold.5+0x35/0x15f [ 7966.919219] ? report_bug+0x113/0x140 [ 7966.921275] ? do_error_trap+0xb6/0x130 [ 7966.923278] ? do_invalid_op+0x46/0x60 [ 7966.925878] ? __debug_object_init.cold.5+0x35/0x15f [ 7966.928432] ? invalid_op+0x14/0x20 [ 7966.930294] ? __debug_object_init.cold.5+0x35/0x15f [ 7966.933513] ? lod_set_pool+0x260/0x260 [lod] [ 7966.936450] debug_object_init+0x22/0x30 [ 7966.938524] init_timer_key+0x28/0x120 [ 7966.940732] lod_ost_alloc_qos+0x770/0x1c30 [lod] [ 7966.942979] lod_qos_prep_create+0x134e/0x1bc0 [lod] [ 7966.945300] lod_prepare_create+0x204/0x460 [lod] [ 7966.948018] lod_declare_striped_create+0x270/0xf80 [lod] [ 7966.951058] ? lod_sub_declare_create+0x111/0x320 [lod] [ 7966.957487] lod_declare_create+0x3d4/0x9c0 [lod] [ 7966.963798] ? _raw_spin_unlock+0x12/0x30 [ 7966.967359] mdd_declare_create_object_internal+0x107/0x4a0 [mdd] [ 7966.978689] mdd_declare_create_object.isra.25+0x55/0xc40 [mdd] [ 7966.983248] mdd_declare_create+0x6a/0x6c0 [mdd] [ 7966.987266] mdd_create+0x5bd/0x1d00 [mdd] [ 7966.990282] ? mdt_version_save+0xa8/0x210 [mdt] [ 7966.994289] mdt_reint_open+0x337c/0x3c10 [mdt] [ 7966.996533] mdt_reint_rec+0x139/0x2b0 [mdt] [ 7967.004517] mdt_reint_internal+0x6a0/0xdc0 [mdt] [ 7967.007348] mdt_intent_open+0x180/0x5b0 [mdt] [ 7967.010547] mdt_intent_opc.constprop.43+0x153/0xfb0 [mdt] [ 7967.013549] ? mdt_intent_fixup_resent+0x2e0/0x2e0 [mdt] [ 7967.017245] mdt_intent_policy+0x14b/0x670 [mdt] [ 7967.020969] ldlm_lock_enqueue+0x43c/0xcd0 [ptlrpc] [ 7967.024232] ? _raw_read_unlock+0x12/0x30 [ 7967.026277] ? cfs_hash_rw_unlock+0x11/0x30 [obdclass] [ 7967.029269] ldlm_handle_enqueue+0xcaf/0x2280 [ptlrpc] [ 7967.032367] tgt_enqueue+0xd0/0x300 [ptlrpc] [ 7967.036321] tgt_handle_request0+0x137/0xaf0 [ptlrpc] [ 7967.039341] tgt_request_handle+0x573/0x1e70 [ptlrpc] [ 7967.042828] ptlrpc_server_handle_request+0x443/0x13b0 [ptlrpc] [ 7967.046560] ? lprocfs_counter_add+0x15b/0x210 [obdclass] [ 7967.049980] ptlrpc_main+0xce8/0x1400 [ptlrpc] [ 7967.053003] ? ptlrpc_wait_event+0x690/0x690 [ptlrpc] [ 7967.056228] kthread+0x1d1/0x200 [ 7967.058515] ? set_kthread_struct+0x70/0x70 [ 7967.059971] ret_from_fork+0x1f/0x30 [ 7967.061452] ---[ end trace a4dde8fbfe60a4fe ]--- [ 7969.138476] ODEBUG: object 000000001ca1a777 is on stack 0000000006b6f2fc, but NOT annotated. [ 7969.155549] WARNING: CPU: 0 PID: 96040 at lib/debugobjects.c:368 __debug_object_init.cold.5+0x35/0x15f [ 7969.160234] Modules linked in: lustre(O) osp(O) ofd(O) lod(O) mdt(O) mdd(O) mgs(O) osd_zfs(O) lquota(O) lfsck(O) obdecho(O) mgc(O) mdc(O) lov(O) osc(O) lmv(O) ec(O) fid(O) fld(O) ptlrpc_gss(O) ptlrpc(O) obdclass(O) ksocklnd(O) lnet(O) libcfs(O) zfs(O) spl(O) rpcsec_gss_krb5 auth_rpcgss nfsv4 dns_resolver intel_rapl_msr intel_rapl_common sb_edac rapl pcspkr i2c_piix4 squashfs crct10dif_pclmul crc32_pclmul crc32c_intel ata_generic ata_piix ghash_clmulni_intel serio_raw libata dm_mirror dm_region_hash dm_log dm_mod sha512_ssse3 sha512_generic [ 7969.198408] CPU: 0 PID: 96040 Comm: mdt00_004 Kdump: loaded Tainted: G W O -------- - - 4.18.0rh8.10-debug #2 [ 7969.204917] Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS 1.17.0-8.fc42 06/10/2025 [ 7969.211634] RIP: 0010:__debug_object_init.cold.5+0x35/0x15f [ 7969.214528] Code: be bc 48 83 05 23 61 0c 03 01 89 05 59 69 0c 03 65 48 8b 04 25 00 dd 01 00 48 8b 50 18 e8 93 68 99 ff 48 83 05 1b 61 0c 03 01 <0f> 0b 48 83 05 19 61 0c 03 01 48 83 05 19 61 0c 03 01 e9 3f ee ff [ 7969.228917] RSP: 0018:ffff9a02432a74a0 EFLAGS: 00010002 [ 7969.235440] RAX: 0000000000000050 RBX: ffff9a02432a75a8 RCX: 0000000000000000 [ 7969.239853] RDX: 0000000000000000 RSI: ffff8dcb8201e5a8 RDI: ffff8dcb8201e5a8 [ 7969.243221] RBP: ffffffffbd306ae0 R08: 0000000000000000 R09: c0000000ffff7fff [ 7969.247586] R10: 0000000000000001 R11: ffff9a02432a7298 R12: ffffffffbeb3d8c8 [ 7969.251666] R13: 00000000000500e0 R14: ffffffffbeb3d8c0 R15: ffff8dcb7b807050 [ 7969.258424] FS: 0000000000000000(0000) GS:ffff8dcb82000000(0000) knlGS:0000000000000000 [ 7969.262797] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 [ 7969.268236] CR2: 00007f73541b0000 CR3: 000000001fa16004 CR4: 0000000000170ef0 [ 7969.274168] Call Trace: [ 7969.275616] ? show_regs.cold.9+0x22/0x2f [ 7969.280238] ? __warn+0xc8/0x150 [ 7969.288619] ? __debug_object_init.cold.5+0x35/0x15f [ 7969.295295] ? report_bug+0x113/0x140 [ 7969.298085] ? do_error_trap+0xb6/0x130 [ 7969.299773] ? do_invalid_op+0x46/0x60 [ 7969.302727] ? __debug_object_init.cold.5+0x35/0x15f [ 7969.306112] ? invalid_op+0x14/0x20 [ 7969.307559] ? __debug_object_init.cold.5+0x35/0x15f [ 7969.311146] ? lod_set_pool+0x260/0x260 [lod] [ 7969.315838] debug_object_init+0x22/0x30 [ 7969.317934] init_timer_key+0x28/0x120 [ 7969.319047] lod_ost_alloc_qos+0x770/0x1c30 [lod] [ 7969.324305] lod_qos_prep_create+0x134e/0x1bc0 [lod] [ 7969.326135] lod_prepare_create+0x204/0x460 [lod] [ 7969.329388] lod_declare_striped_create+0x270/0xf80 [lod] [ 7969.332376] ? lod_sub_declare_create+0x111/0x320 [lod] [ 7969.335388] lod_declare_create+0x3d4/0x9c0 [lod] [ 7969.337141] ? _raw_spin_unlock+0x12/0x30 [ 7969.338444] mdd_declare_create_object_internal+0x107/0x4a0 [mdd] [ 7969.341503] mdd_declare_create_object.isra.25+0x55/0xc40 [mdd] [ 7969.344218] mdd_declare_create+0x6a/0x6c0 [mdd] [ 7969.346010] mdd_create+0x5bd/0x1d00 [mdd] [ 7969.348229] ? mdt_version_save+0xa8/0x210 [mdt] [ 7969.351292] mdt_reint_open+0x337c/0x3c10 [mdt] [ 7969.353340] mdt_reint_rec+0x139/0x2b0 [mdt] [ 7969.354706] mdt_reint_internal+0x6a0/0xdc0 [mdt] [ 7969.357167] mdt_intent_open+0x180/0x5b0 [mdt] [ 7969.358492] mdt_intent_opc.constprop.43+0x153/0xfb0 [mdt] [ 7969.361583] ? mdt_intent_fixup_resent+0x2e0/0x2e0 [mdt] [ 7969.363799] mdt_intent_policy+0x14b/0x670 [mdt] [ 7969.365783] ldlm_lock_enqueue+0x43c/0xcd0 [ptlrpc] [ 7969.367377] ? _raw_read_unlock+0x12/0x30 [ 7969.368391] ? cfs_hash_rw_unlock+0x11/0x30 [obdclass] [ 7969.371170] ldlm_handle_enqueue+0xcaf/0x2280 [ptlrpc] [ 7969.376227] tgt_enqueue+0xd0/0x300 [ptlrpc] [ 7969.379217] tgt_handle_request0+0x137/0xaf0 [ptlrpc] [ 7969.383514] tgt_request_handle+0x573/0x1e70 [ptlrpc] [ 7969.392388] ptlrpc_server_handle_request+0x443/0x13b0 [ptlrpc] [ 7969.396421] ? lprocfs_counter_add+0x15b/0x210 [obdclass] [ 7969.400229] ptlrpc_main+0xce8/0x1400 [ptlrpc] [ 7969.404053] ? ptlrpc_wait_event+0x690/0x690 [ptlrpc] [ 7969.407563] kthread+0x1d1/0x200 [ 7969.412458] ? set_kthread_struct+0x70/0x70 [ 7969.415375] ret_from_fork+0x1f/0x30 [ 7969.418906] ---[ end trace a4dde8fbfe60a4ff ]--- [ 7971.552447] ODEBUG: object 000000003c5b6de7 is on stack 00000000768785b2, but NOT annotated. [ 7971.570325] WARNING: CPU: 2 PID: 93631 at lib/debugobjects.c:368 __debug_object_init.cold.5+0x35/0x15f [ 7971.584643] Modules linked in: lustre(O) osp(O) ofd(O) lod(O) mdt(O) mdd(O) mgs(O) osd_zfs(O) lquota(O) lfsck(O) obdecho(O) mgc(O) mdc(O) lov(O) osc(O) lmv(O) ec(O) fid(O) fld(O) ptlrpc_gss(O) ptlrpc(O) obdclass(O) ksocklnd(O) lnet(O) libcfs(O) zfs(O) spl(O) rpcsec_gss_krb5 auth_rpcgss nfsv4 dns_resolver intel_rapl_msr intel_rapl_common sb_edac rapl pcspkr i2c_piix4 squashfs crct10dif_pclmul crc32_pclmul crc32c_intel ata_generic ata_piix ghash_clmulni_intel serio_raw libata dm_mirror dm_region_hash dm_log dm_mod sha512_ssse3 sha512_generic [ 7971.647319] CPU: 2 PID: 93631 Comm: mdt00_002 Kdump: loaded Tainted: G W O -------- - - 4.18.0rh8.10-debug #2 [ 7971.656478] Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS 1.17.0-8.fc42 06/10/2025 [ 7971.671550] RIP: 0010:__debug_object_init.cold.5+0x35/0x15f [ 7971.674087] Code: be bc 48 83 05 23 61 0c 03 01 89 05 59 69 0c 03 65 48 8b 04 25 00 dd 01 00 48 8b 50 18 e8 93 68 99 ff 48 83 05 1b 61 0c 03 01 <0f> 0b 48 83 05 19 61 0c 03 01 48 83 05 19 61 0c 03 01 e9 3f ee ff [ 7971.689442] RSP: 0018:ffff9a0242a4b4a0 EFLAGS: 00010006 [ 7971.694586] RAX: 0000000000000050 RBX: ffff9a0242a4b5a8 RCX: 0000000000000000 [ 7971.700500] RDX: 0000000000000000 RSI: ffff8dcb8211e5a8 RDI: ffff8dcb8211e5a8 [ 7971.705996] RBP: ffffffffbd306ae0 R08: 0000000000000000 R09: c0000000ffff7fff [ 7971.712307] R10: 0000000000000001 R11: ffff9a0242a4b298 R12: ffffffffbeb096a8 [ 7971.718502] R13: 000000000001bec0 R14: ffffffffbeb096a0 R15: ffff8dcb48131dc0 [ 7971.724377] FS: 0000000000000000(0000) GS:ffff8dcb82100000(0000) knlGS:0000000000000000 [ 7971.731077] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 [ 7971.735455] CR2: 0000558abf9ba930 CR3: 000000001fa16003 CR4: 0000000000170ee0 [ 7971.745669] Call Trace: [ 7971.747480] ? show_regs.cold.9+0x22/0x2f [ 7971.753403] ? __warn+0xc8/0x150 [ 7971.755472] ? __debug_object_init.cold.5+0x35/0x15f [ 7971.760098] ? report_bug+0x113/0x140 [ 7971.763066] ? do_error_trap+0xb6/0x130 [ 7971.764205] ? do_invalid_op+0x46/0x60 [ 7971.765228] ? __debug_object_init.cold.5+0x35/0x15f [ 7971.769420] ? invalid_op+0x14/0x20 [ 7971.770881] ? __debug_object_init.cold.5+0x35/0x15f [ 7971.775039] ? lod_set_pool+0x260/0x260 [lod] [ 7971.778751] debug_object_init+0x22/0x30 [ 7971.780847] init_timer_key+0x28/0x120 [ 7971.783262] lod_ost_alloc_qos+0x770/0x1c30 [lod] [ 7971.787689] lod_qos_prep_create+0x134e/0x1bc0 [lod] [ 7971.789548] lod_prepare_create+0x204/0x460 [lod] [ 7971.791911] lod_declare_striped_create+0x270/0xf80 [lod] [ 7971.794851] ? lod_sub_declare_create+0x111/0x320 [lod] [ 7971.797131] lod_declare_create+0x3d4/0x9c0 [lod] [ 7971.798940] ? _raw_spin_unlock+0x12/0x30 [ 7971.800756] mdd_declare_create_object_internal+0x107/0x4a0 [mdd] [ 7971.803844] mdd_declare_create_object.isra.25+0x55/0xc40 [mdd] [ 7971.806131] mdd_declare_create+0x6a/0x6c0 [mdd] [ 7971.808864] mdd_create+0x5bd/0x1d00 [mdd] [ 7971.810821] ? mdt_version_save+0xa8/0x210 [mdt] [ 7971.812658] mdt_reint_open+0x337c/0x3c10 [mdt] [ 7971.814895] mdt_reint_rec+0x139/0x2b0 [mdt] [ 7971.816878] mdt_reint_internal+0x6a0/0xdc0 [mdt] [ 7971.819202] mdt_intent_open+0x180/0x5b0 [mdt] [ 7971.821374] mdt_intent_opc.constprop.43+0x153/0xfb0 [mdt] [ 7971.823709] ? mdt_intent_fixup_resent+0x2e0/0x2e0 [mdt] [ 7971.825529] mdt_intent_policy+0x14b/0x670 [mdt] [ 7971.827701] ldlm_lock_enqueue+0x43c/0xcd0 [ptlrpc] [ 7971.830703] ? _raw_read_unlock+0x12/0x30 [ 7971.831960] ? cfs_hash_rw_unlock+0x11/0x30 [obdclass] [ 7971.834355] ldlm_handle_enqueue+0xcaf/0x2280 [ptlrpc] [ 7971.838072] tgt_enqueue+0xd0/0x300 [ptlrpc] [ 7971.840486] tgt_handle_request0+0x137/0xaf0 [ptlrpc] [ 7971.843630] tgt_request_handle+0x573/0x1e70 [ptlrpc] [ 7971.845767] ptlrpc_server_handle_request+0x443/0x13b0 [ptlrpc] [ 7971.848411] ? lprocfs_counter_add+0x15b/0x210 [obdclass] [ 7971.850762] ptlrpc_main+0xce8/0x1400 [ptlrpc] [ 7971.854181] ? ptlrpc_wait_event+0x690/0x690 [ptlrpc] [ 7971.856160] kthread+0x1d1/0x200 [ 7971.857800] ? set_kthread_struct+0x70/0x70 [ 7971.859379] ret_from_fork+0x1f/0x30 [ 7971.860699] ---[ end trace a4dde8fbfe60a500 ]--- [ 7981.781476] LustreError: 93629:0:(lod_qos.c:789:lod_ost_alloc_rr()) cfs_fail_timeout id 173 awake [ 8000.226958] Lustre: DEBUG MARKER: == recovery-small test 153: evict vs reconnect race ====== 11:49:13 (1773676153) [ 8002.535740] Lustre: *** cfs_fail_loc=174, val=0*** [ 8002.538556] Lustre: Skipped 3 previous similar messages [ 8027.761790] Lustre: Failing over lustre-MDT0000 [ 8028.150109] Lustre: lustre-MDT0000: Not available for connect from 0@lo (stopping) [ 8028.162127] Lustre: Skipped 1 previous similar message [ 8028.450444] Lustre: server umount lustre-MDT0000 complete [ 8038.506651] LustreError: MGC192.168.201.139@tcp: Connection to MGS (at 0@lo) was lost; in progress operations using this service will fail [ 8039.215279] Lustre: lustre-MDT0000: Imperative Recovery not enabled, recovery window 60-180 [ 8039.224986] Lustre: Skipped 1 previous similar message [ 8039.382276] Lustre: lustre-MDT0000: in recovery but waiting for the first client to connect [ 8039.394084] Lustre: Skipped 1 previous similar message [ 8040.999848] Lustre: lustre-MDT0000: Will be in recovery for at least 1:00, or until 1 client reconnects [ 8041.007085] Lustre: Skipped 1 previous similar message [ 8041.182133] Lustre: lustre-MDT0000: Recovery over after 0:01, of 1 clients 1 recovered and 0 were evicted. [ 8041.193708] Lustre: Skipped 1 previous similar message [ 8041.263062] Lustre: lustre-OST0001: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x280000401:13802 to 0x280000401:13825) [ 8041.267447] Lustre: lustre-OST0000: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x240000401:21257 to 0x240000401:21281) [ 8041.270559] LustreError: 97340:0:(mdd_orphans.c:441:mdd_orphan_index_iterate()) lustre-MDD0000: bad FID [0x0:0x0:0x0] cleaning 'PENDING' [ 8045.508218] Lustre: DEBUG MARKER: oleg139-server.virtnet: executing set_default_debug -1 all [ 8052.460162] Lustre: DEBUG MARKER: oleg139-server.virtnet: executing _wait_recovery_complete *.lustre-MDT0000.recovery_status 1475 [ 8060.737192] Lustre: DEBUG MARKER: == recovery-small test 154a: corruption update llog can be skipped ========================================================== 11:50:14 (1773676214) [ 8062.921394] Lustre: DEBUG MARKER: SKIP: recovery-small test_154a needs >= 2 MDTs [ 8065.090814] Lustre: DEBUG MARKER: == recovery-small test 154b: restore update llog after failed recovery ========================================================== 11:50:19 (1773676219) [ 8067.188637] Lustre: DEBUG MARKER: SKIP: recovery-small test_154b needs >= 2 MDTs [ 8069.642570] Lustre: DEBUG MARKER: == recovery-small test 155: failover after client remount ========================================================== 11:50:23 (1773676223) [ 8091.613442] LustreError: 98357:0:(osd_handler.c:720:osd_ro()) lustre-MDT0000: *** setting device osd-zfs read-only *** [ 8092.699977] Lustre: DEBUG MARKER: mds1 REPLAY BARRIER on lustre-MDT0000 [ 8094.710331] Lustre: Failing over lustre-MDT0000 [ 8094.747725] Lustre: lustre-MDT0000: Not available for connect from 192.168.201.39@tcp (stopping) [ 8095.055451] Lustre: server umount lustre-MDT0000 complete [ 8113.530755] LustreError: MGC192.168.201.139@tcp: Connection to MGS (at 0@lo) was lost; in progress operations using this service will fail [ 8115.653776] LustreError: 99017:0:(mdd_orphans.c:441:mdd_orphan_index_iterate()) lustre-MDD0000: bad FID [0x0:0x0:0x0] cleaning 'PENDING' [ 8115.654266] Lustre: lustre-OST0001: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x280000401:13802 to 0x280000401:13857) [ 8115.654556] Lustre: lustre-OST0000: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x240000401:21283 to 0x240000401:21313) [ 8119.263642] Lustre: DEBUG MARKER: oleg139-server.virtnet: executing set_default_debug -1 all [ 8130.902628] Lustre: DEBUG MARKER: == recovery-small test 156: tot_granted miscount after client eviction ========================================================== 11:51:25 (1773676285) [ 8133.084656] Lustre: Setting parameter general.timeout=5 in log params [ 8138.577588] LustreError: 99789:0:(osd_handler.c:720:osd_ro()) lustre-OST0000: *** setting device osd-zfs read-only *** [ 8139.693285] Lustre: DEBUG MARKER: ost1 REPLAY BARRIER on lustre-OST0000 [ 8142.912426] Lustre: Failing over lustre-OST0000 [ 8143.627911] Lustre: server umount lustre-OST0000 complete [ 8143.859932] LustreError: 92117:0:(ldlm_lib.c:1178:target_handle_connect()) lustre-OST0000: not available for connect from 0@lo (no target). If you are running an HA pair check that the target is mounted on the other server. [ 8143.885100] LustreError: 92117:0:(ldlm_lib.c:1178:target_handle_connect()) Skipped 5 previous similar messages [ 8152.034887] LustreError: 88662:0:(ldlm_lib.c:1178:target_handle_connect()) lustre-OST0000: not available for connect from 0@lo (no target). If you are running an HA pair check that the target is mounted on the other server. [ 8152.065842] LustreError: 88662:0:(ldlm_lib.c:1178:target_handle_connect()) Skipped 14 previous similar messages [ 8155.107571] Lustre: 3298:0:(client.c:2478:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1773676256/real 1773676256] req@ffff8dcb7cb53800 x1859825966045312/t0(0) o400->lustre-MDT0000-lwp-OST0001@0@lo:12/10 lens 224/224 e 0 to 1 dl 1773676311 ref 1 fl Rpc:XNQr/200/ffffffff rc 0/-1 job:'kworker.0' uid:0 gid:0 projid:4294967295 [ 8155.189229] Lustre: 3298:0:(client.c:2478:ptlrpc_expire_one_request()) Skipped 29 previous similar messages [ 8170.930747] Lustre: DEBUG MARKER: oleg139-server.virtnet: executing set_default_debug -1 all [ 8203.502828] Lustre: lustre-OST0000: recovery is timed out, evict stale exports [ 8203.525945] Lustre: 100349:0:(genops.c:1620:class_disconnect_stale_exports()) lustre-OST0000: disconnect stale client f83e3cd8-14c0-4dd9-a128-64d2c1fe6eea@192.168.201.39@tcp [ 8203.543205] Lustre: lustre-OST0000: disconnecting 1 stale clients [ 8203.553734] Lustre: 100349:0:(ldlm_lib.c:2067:extend_recovery_timer()) lustre-OST0000: extended recovery timer reached hard limit: 45, extend: 1 [ 8203.666174] Lustre: 100349:0:(ldlm_lib.c:2930:target_recovery_thread()) too long recovery - read logs [ 8203.685360] LustreError: dumping log to /tmp/lustre-log.1773676359.100349 [ 8214.349373] Lustre: DEBUG MARKER: oleg139-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid [ 8216.290790] Lustre: DEBUG MARKER: osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec [ 8227.601839] Lustre: Modifying parameter general.timeout=20 in log params [ 8229.605281] Lustre: DEBUG MARKER: == recovery-small test 157: eviction during mmaped i/o === 11:53:04 (1773676384) [ 8232.046752] Lustre: 101351:0:(genops.c:1793:obd_export_evict_by_uuid()) lustre-OST0000: evicting f83e3cd8-14c0-4dd9-a128-64d2c1fe6eea at adminstrative request [ 8240.125412] Lustre: DEBUG MARKER: == recovery-small test 158a: connect without access right ========================================================== 11:53:14 (1773676394) [ 8241.910252] Lustre: DEBUG MARKER: SKIP: recovery-small test_158a needs >= 2 MDTS [ 8244.068419] Lustre: DEBUG MARKER: == recovery-small test 160: MDT destroys are blocked by grouplocks ========================================================== 11:53:18 (1773676398) [ 8250.476965] LustreError: 6590:0:(ofd_dev.c:1877:ofd_destroy_hdl()) lustre-OST0000: error destroying object [0x240000401:0x534a:0x0]: -5 [ 8250.509913] LustreError: 6590:0:(ofd_dev.c:1877:ofd_destroy_hdl()) Skipped 9 previous similar messages [ 8296.728161] Lustre: DEBUG MARKER: == recovery-small test 161: evict osp by ping evictor ==== 11:54:10 (1773676450) [ 8298.779349] Lustre: DEBUG MARKER: SKIP: recovery-small test_161 needs >= 2 MDTs [ 8301.144883] Lustre: DEBUG MARKER: == recovery-small test 162: File attributes should be persisted after MDS failover ========================================================== 11:54:15 (1773676455) [ 8304.226987] Lustre: Failing over lustre-MDT0000 [ 8305.019359] Lustre: server umount lustre-MDT0000 complete [ 8314.547084] LustreError: MGC192.168.201.139@tcp: Connection to MGS (at 0@lo) was lost; in progress operations using this service will fail [ 8314.960982] Lustre: lustre-MDT0000-lwp-OST0001: Connection to lustre-MDT0000 (at 0@lo) was lost; in progress operations using this service will wait for recovery to complete [ 8314.981894] Lustre: Skipped 5 previous similar messages [ 8317.318549] Lustre: lustre-OST0001: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x280000401:13859 to 0x280000401:13889) [ 8317.325960] Lustre: lustre-OST0000: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x240000401:21327 to 0x240000401:21345) [ 8317.350465] LustreError: 102976:0:(mdd_orphans.c:441:mdd_orphan_index_iterate()) lustre-MDD0000: bad FID [0x0:0x0:0x0] cleaning 'PENDING' [ 8320.503019] Lustre: lustre-MDT0000-lwp-OST0001: Connection restored to 0@lo (at 0@lo) [ 8320.518821] Lustre: Skipped 5 previous similar messages [ 8321.514399] Lustre: DEBUG MARKER: oleg139-server.virtnet: executing set_default_debug -1 all [ 8329.000267] Lustre: DEBUG MARKER: oleg139-server.virtnet: executing _wait_recovery_complete *.lustre-MDT0000.recovery_status 1475 [ 8336.596336] Lustre: DEBUG MARKER: == recovery-small test 163: changelog check for fail write and processing records ========================================================== 11:54:50 (1773676490) [ 8340.264633] Lustre: lustre-MDD0000: changelog on [ 8344.545400] Lustre: *** cfs_fail_loc=18f, val=30*** [ 8345.080605] Lustre: *** cfs_fail_loc=18f, val=30*** [ 8345.085436] Lustre: Skipped 18 previous similar messages [ 8345.318659] LustreError: 102860:0:(mdd_dir.c:830:mdd_changelog_write_rec()) lustre-MDD0000: failed to write changelog record file [0x1:0x94:0x0] rec idx 31 off 13488 chnlg idx 31: rc = -5 [ 8345.340231] LustreError: 102860:0:(llog_cat.c:592:llog_cat_add_rec()) llog_write_rec -5: lh=ffff8dca56f29a00 [ 8345.349118] LustreError: 102860:0:(mdd_dir.c:1492:mdd_changelog_ns_store()) lustre-MDD0000: cannot store changelog record: type = 1, name = '02', t = [0x20000fe02:0x23:0x0], p = [0x20000fe02:0x5:0x0]: rc = -5 [ 8346.029731] LustreError: 102862:0:(mdd_dir.c:830:mdd_changelog_write_rec()) lustre-MDD0000: failed to write changelog record file [0x1:0x94:0x0] rec idx 62 off 19024 chnlg idx 62: rc = -5 [ 8346.058954] LustreError: 102862:0:(llog_cat.c:592:llog_cat_add_rec()) llog_write_rec -5: lh=ffff8dca56f29a00 [ 8346.068731] LustreError: 102862:0:(mdd_dir.c:1492:mdd_changelog_ns_store()) lustre-MDD0000: cannot store changelog record: type = 1, name = '56', t = [0x20000fe02:0x42:0x0], p = [0x20000fe02:0x5:0x0]: rc = -5 [ 8346.109890] Lustre: *** cfs_fail_loc=18f, val=30*** [ 8346.118702] Lustre: Skipped 41 previous similar messages [ 8353.748445] Lustre: Hit invalid llog record: idx 0, type 0, id 0 [ 8353.778883] Lustre: Hit invalid llog record: idx 0, type 0, id 0 [ 8354.936219] Lustre: lustre-MDD0000: changelog off [ 8360.483824] Lustre: DEBUG MARKER: == recovery-small test complete, duration 8197 sec ======= 11:55:14 (1773676514) [ 8361.440207] Lustre: 3296:0:(client.c:2478:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1773676462/real 1773676462] req@ffff8dcb49512680 x1859825966154368/t0(0) o400->lustre-MDT0000-lwp-OST0001@0@lo:12/10 lens 224/224 e 0 to 1 dl 1773676517 ref 1 fl Rpc:XNQr/200/ffffffff rc 0/-1 job:'kworker.0' uid:0 gid:0 projid:4294967295 [ 8361.496053] Lustre: 3296:0:(client.c:2478:ptlrpc_expire_one_request()) Skipped 3 previous similar messages [ 8362.722819] Lustre: DEBUG MARKER: === recovery-small: start cleanup 11:55:16 (1773676516) === [ 8727.575351] Lustre: DEBUG MARKER: === recovery-small: finish cleanup 12:01:22 (1773676882) === [ 8729.684347] Lustre: Failing over lustre-MDT0000 [ 8730.083886] LustreError: 102863:0:(ldlm_lib.c:1178:target_handle_connect()) lustre-MDT0000: not available for connect from 0@lo (no target). If you are running an HA pair check that the target is mounted on the other server. [ 8730.094562] LustreError: 102863:0:(ldlm_lib.c:1178:target_handle_connect()) Skipped 21 previous similar messages [ 8730.200183] Lustre: server umount lustre-MDT0000 complete [ 8746.465736] Lustre: 3298:0:(client.c:2478:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1773676886/real 1773676886] req@ffff8dcb6eac1500 x1859825967399552/t0(0) o400->MGC192.168.201.139@tcp@0@lo:26/25 lens 224/224 e 0 to 1 dl 1773676902 ref 1 fl Rpc:XNQr/200/ffffffff rc 0/-1 job:'kworker.0' uid:0 gid:0 projid:4294967295 [ 8746.499194] Lustre: 3298:0:(client.c:2478:ptlrpc_expire_one_request()) Skipped 2 previous similar messages [ 8746.508691] LustreError: MGC192.168.201.139@tcp: Connection to MGS (at 0@lo) was lost; in progress operations using this service will fail [ 8755.746818] LustreError: 3295:0:(client.c:1390:ptlrpc_import_delay_req()) @@@ invalidate in flight req@ffff8dcb6eac0380 x1859825967401216/t0(0) o250->MGC192.168.201.139@tcp@0@lo:26/25 lens 520/544 e 0 to 0 dl 0 ref 1 fl Rpc:NQU/200/ffffffff rc 0/-1 job:'kworker.0' uid:0 gid:0 projid:4294967295 [ 8756.418238] Lustre: lustre-MDT0000: Imperative Recovery not enabled, recovery window 60-180 [ 8756.423970] Lustre: Skipped 3 previous similar messages [ 8756.511310] Lustre: lustre-MDT0000: in recovery but waiting for the first client to connect [ 8756.523555] Lustre: Skipped 3 previous similar messages [ 8757.317094] Lustre: lustre-MDT0000: Will be in recovery for at least 1:00, or until 1 client reconnects [ 8757.322932] Lustre: Skipped 3 previous similar messages [ 8757.374690] Lustre: lustre-MDT0000: Recovery over after 0:01, of 1 clients 1 recovered and 0 were evicted. [ 8757.380505] Lustre: Skipped 3 previous similar messages [ 8757.406189] Lustre: lustre-OST0001: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x280000401:13859 to 0x280000401:13921) [ 8757.406189] Lustre: lustre-OST0000: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x240000401:21327 to 0x240000401:21377) [ 8757.410567] LustreError: 105374:0:(mdd_orphans.c:441:mdd_orphan_index_iterate()) lustre-MDD0000: bad FID [0x0:0x0:0x0] cleaning 'PENDING' [ 8760.196301] Lustre: DEBUG MARKER: oleg139-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all [ 8767.227877] Lustre: DEBUG MARKER: oleg139-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid [ 8768.770551] Lustre: DEBUG MARKER: mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec [ 8774.036048] Lustre: server umount lustre-MDT0000 complete [ 8777.783310] LustreError: 5748:0:(ldlm_lockd.c:2564:ldlm_cancel_handler()) ldlm_cancel from 0@lo arrived at 1773676934 with bad export cookie 10572800792205948184 [ 8788.082863] Lustre: server umount lustre-OST0000 complete [ 8802.430754] Lustre: server umount lustre-OST0001 complete [ 8812.678130] Lustre: DEBUG MARKER: oleg139-server.virtnet: executing unload_modules_local [ 8815.201840] Key type lgssc unregistered [ 8815.424944] LNet: 106900:0:(lib-ptl.c:967:lnet_clear_lazy_portal()) Active lazy portal 0 on exit [ 8815.436880] LNetError: 106900:0:(acceptor.c:252:lnet_acceptor_remove_socket()) Interface ens2 not found [ 8815.452617] LNet: Removed LNI 192.168.201.139@tcp [ 8816.007366] Key type .llcrypt unregistered [ 8816.011028] Key type ._llcrypt unregistered