[ 0.000000] Linux version 4.18.0rh8.10-debug (green@maintenance) (gcc version 8.5.0 20210514 (Red Hat 8.5.0-26) (GCC)) #2 SMP Mon Jul 14 01:24:22 EDT 2025 [ 0.000000] Command line: rd.shell root=nbd:192.168.200.253:rocky8.10:ext4:ro:-p,-b4096 ro crashkernel=256M panic=1 nomodeset ipmtu=9000 ip=dhcp rd.neednet=1 init_on_free=off mitigations=off console=ttyS1,115200 audit=0 [ 0.000000] x86/fpu: Supporting XSAVE feature 0x001: 'x87 floating point registers' [ 0.000000] x86/fpu: Supporting XSAVE feature 0x002: 'SSE registers' [ 0.000000] x86/fpu: Supporting XSAVE feature 0x004: 'AVX registers' [ 0.000000] x86/fpu: xstate_offset[2]: 576, xstate_sizes[2]: 256 [ 0.000000] x86/fpu: Enabled xstate features 0x7, context size is 832 bytes, using 'standard' format. [ 0.000000] signal: max sigframe size: 1776 [ 0.000000] BIOS-provided physical RAM map: [ 0.000000] BIOS-e820: [mem 0x0000000000000000-0x000000000009fbff] usable [ 0.000000] BIOS-e820: [mem 0x000000000009fc00-0x000000000009ffff] reserved [ 0.000000] BIOS-e820: [mem 0x00000000000f0000-0x00000000000fffff] reserved [ 0.000000] BIOS-e820: [mem 0x0000000000100000-0x00000000bffcdfff] usable [ 0.000000] BIOS-e820: [mem 0x00000000bffce000-0x00000000bfffffff] reserved [ 0.000000] BIOS-e820: [mem 0x00000000feffc000-0x00000000feffffff] reserved [ 0.000000] BIOS-e820: [mem 0x00000000fffc0000-0x00000000ffffffff] reserved [ 0.000000] BIOS-e820: [mem 0x0000000100000000-0x0000000146dfffff] usable [ 0.000000] NX (Execute Disable) protection: active [ 0.000000] SMBIOS 2.8 present. [ 0.000000] DMI: QEMU Standard PC (i440FX + PIIX, 1996), BIOS 1.17.0-8.fc42 06/10/2025 [ 0.000000] Hypervisor detected: KVM [ 0.000000] kvm-clock: Using msrs 4b564d01 and 4b564d00 [ 0.000000] kvm-clock: using sched offset of 502656814 cycles [ 0.000000] clocksource: kvm-clock: mask: 0xffffffffffffffff max_cycles: 0x1cd42e4dffb, max_idle_ns: 881590591483 ns [ 0.000000] tsc: Detected 2399.996 MHz processor [ 0.000000] e820: update [mem 0x00000000-0x00000fff] usable ==> reserved [ 0.000000] e820: remove [mem 0x000a0000-0x000fffff] usable [ 0.000000] last_pfn = 0x146e00 max_arch_pfn = 0x400000000 [ 0.000000] MTRR default type: write-back [ 0.000000] MTRR fixed ranges enabled: [ 0.000000] 00000-9FFFF write-back [ 0.000000] A0000-BFFFF uncachable [ 0.000000] C0000-FFFFF write-protect [ 0.000000] MTRR variable ranges enabled: [ 0.000000] 0 base 0000C0000000 mask 3FFFC0000000 uncachable [ 0.000000] 1 disabled [ 0.000000] 2 disabled [ 0.000000] 3 disabled [ 0.000000] 4 disabled [ 0.000000] 5 disabled [ 0.000000] 6 disabled [ 0.000000] 7 disabled [ 0.000000] x86/PAT: Configuration [0-7]: WB WC UC- UC WB WP UC- WT [ 0.000000] last_pfn = 0xbffce max_arch_pfn = 0x400000000 [ 0.000000] found SMP MP-table at [mem 0x000f54b0-0x000f54bf] [ 0.000000] BRK [0x65801000, 0x65801fff] PGTABLE [ 0.000000] BRK [0x65802000, 0x65802fff] PGTABLE [ 0.000000] BRK [0x65803000, 0x65803fff] PGTABLE [ 0.000000] BRK [0x65804000, 0x65804fff] PGTABLE [ 0.000000] BRK [0x65805000, 0x65805fff] PGTABLE [ 0.000000] BRK [0x65806000, 0x65806fff] PGTABLE [ 0.000000] BRK [0x65807000, 0x65807fff] PGTABLE [ 0.000000] BRK [0x65808000, 0x65808fff] PGTABLE [ 0.000000] BRK [0x65809000, 0x65809fff] PGTABLE [ 0.000000] BRK [0x6580a000, 0x6580afff] PGTABLE [ 0.000000] BRK [0x6580b000, 0x6580bfff] PGTABLE [ 0.000000] BRK [0x6580c000, 0x6580cfff] PGTABLE [ 0.000000] RAMDISK: [mem 0xbcc54000-0xbffbffff] [ 0.000000] ACPI: Early table checksum verification disabled [ 0.000000] ACPI: RSDP 0x00000000000F52D0 000014 (v00 BOCHS ) [ 0.000000] ACPI: RSDT 0x00000000BFFE2439 000034 (v01 BOCHS BXPC 00000001 BXPC 00000001) [ 0.000000] ACPI: FACP 0x00000000BFFE22D5 000074 (v01 BOCHS BXPC 00000001 BXPC 00000001) [ 0.000000] ACPI: DSDT 0x00000000BFFE0040 002295 (v01 BOCHS BXPC 00000001 BXPC 00000001) [ 0.000000] ACPI: FACS 0x00000000BFFE0000 000040 [ 0.000000] ACPI: APIC 0x00000000BFFE2349 000090 (v03 BOCHS BXPC 00000001 BXPC 00000001) [ 0.000000] ACPI: HPET 0x00000000BFFE23D9 000038 (v01 BOCHS BXPC 00000001 BXPC 00000001) [ 0.000000] ACPI: WAET 0x00000000BFFE2411 000028 (v01 BOCHS BXPC 00000001 BXPC 00000001) [ 0.000000] ACPI: Reserving FACP table memory at [mem 0xbffe22d5-0xbffe2348] [ 0.000000] ACPI: Reserving DSDT table memory at [mem 0xbffe0040-0xbffe22d4] [ 0.000000] ACPI: Reserving FACS table memory at [mem 0xbffe0000-0xbffe003f] [ 0.000000] ACPI: Reserving APIC table memory at [mem 0xbffe2349-0xbffe23d8] [ 0.000000] ACPI: Reserving HPET table memory at [mem 0xbffe23d9-0xbffe2410] [ 0.000000] ACPI: Reserving WAET table memory at [mem 0xbffe2411-0xbffe2438] [ 0.000000] ACPI: Local APIC address 0xfee00000 [ 0.000000] No NUMA configuration found [ 0.000000] Faking a node at [mem 0x0000000000000000-0x0000000146dfffff] [ 0.000000] NODE_DATA(0) allocated [mem 0x1465a3000-0x1465cdfff] [ 0.000000] Reserving 256MB of memory at 2752MB for crashkernel (System RAM: 4205MB) [ 0.000000] Zone ranges: [ 0.000000] DMA [mem 0x0000000000001000-0x0000000000ffffff] [ 0.000000] DMA32 [mem 0x0000000001000000-0x00000000ffffffff] [ 0.000000] Normal [mem 0x0000000100000000-0x0000000146dfffff] [ 0.000000] Device empty [ 0.000000] Movable zone start for each node [ 0.000000] Early memory node ranges [ 0.000000] node 0: [mem 0x0000000000001000-0x000000000009efff] [ 0.000000] node 0: [mem 0x0000000000100000-0x00000000bffcdfff] [ 0.000000] node 0: [mem 0x0000000100000000-0x0000000146dfffff] [ 0.000000] Zeroed struct page in unavailable ranges: 4756 pages [ 0.000000] Initmem setup node 0 [mem 0x0000000000001000-0x0000000146dfffff] [ 0.000000] On node 0 totalpages: 1076588 [ 0.000000] DMA zone: 64 pages used for memmap [ 0.000000] DMA zone: 158 pages reserved [ 0.000000] DMA zone: 3998 pages, LIFO batch:0 [ 0.000000] DMA32 zone: 12224 pages used for memmap [ 0.000000] DMA32 zone: 782286 pages, LIFO batch:63 [ 0.000000] Normal zone: 4536 pages used for memmap [ 0.000000] Normal zone: 290304 pages, LIFO batch:63 [ 0.000000] ACPI: PM-Timer IO Port: 0x608 [ 0.000000] ACPI: Local APIC address 0xfee00000 [ 0.000000] ACPI: LAPIC_NMI (acpi_id[0xff] dfl dfl lint[0x1]) [ 0.000000] IOAPIC[0]: apic_id 0, version 17, address 0xfec00000, GSI 0-23 [ 0.000000] ACPI: INT_SRC_OVR (bus 0 bus_irq 0 global_irq 2 dfl dfl) [ 0.000000] ACPI: INT_SRC_OVR (bus 0 bus_irq 5 global_irq 5 high level) [ 0.000000] ACPI: INT_SRC_OVR (bus 0 bus_irq 9 global_irq 9 high level) [ 0.000000] ACPI: INT_SRC_OVR (bus 0 bus_irq 10 global_irq 10 high level) [ 0.000000] ACPI: INT_SRC_OVR (bus 0 bus_irq 11 global_irq 11 high level) [ 0.000000] ACPI: IRQ0 used by override. [ 0.000000] ACPI: IRQ5 used by override. [ 0.000000] ACPI: IRQ9 used by override. [ 0.000000] ACPI: IRQ10 used by override. [ 0.000000] ACPI: IRQ11 used by override. [ 0.000000] Using ACPI (MADT) for SMP configuration information [ 0.000000] ACPI: HPET id: 0x8086a201 base: 0xfed00000 [ 0.000000] TSC deadline timer available [ 0.000000] smpboot: Allowing 4 CPUs, 0 hotplug CPUs [ 0.000000] kvm-guest: KVM setup pv remote TLB flush [ 0.000000] kvm-guest: setup PV sched yield [ 0.000000] PM: Registered nosave memory: [mem 0x00000000-0x00000fff] [ 0.000000] PM: Registered nosave memory: [mem 0x0009f000-0x0009ffff] [ 0.000000] PM: Registered nosave memory: [mem 0x000a0000-0x000effff] [ 0.000000] PM: Registered nosave memory: [mem 0x000f0000-0x000fffff] [ 0.000000] PM: Registered nosave memory: [mem 0xbffce000-0xbfffffff] [ 0.000000] PM: Registered nosave memory: [mem 0xc0000000-0xfeffbfff] [ 0.000000] PM: Registered nosave memory: [mem 0xfeffc000-0xfeffffff] [ 0.000000] PM: Registered nosave memory: [mem 0xff000000-0xfffbffff] [ 0.000000] PM: Registered nosave memory: [mem 0xfffc0000-0xffffffff] [ 0.000000] [mem 0xc0000000-0xfeffbfff] available for PCI devices [ 0.000000] Booting paravirtualized kernel on KVM [ 0.000000] clocksource: refined-jiffies: mask: 0xffffffff max_cycles: 0xffffffff, max_idle_ns: 1910969940391419 ns [ 0.000000] setup_percpu: NR_CPUS:8192 nr_cpumask_bits:4 nr_cpu_ids:4 nr_node_ids:1 [ 0.000000] percpu: Embedded 63 pages/cpu s221184 r8192 d28672 u524288 [ 0.000000] pcpu-alloc: s221184 r8192 d28672 u524288 alloc=1*2097152 [ 0.000000] pcpu-alloc: [0] 0 1 2 3 [ 0.000000] kvm-guest: PV spinlocks enabled [ 0.000000] PV qspinlock hash table entries: 256 (order: 0, 4096 bytes, linear) [ 0.000000] Built 1 zonelists, mobility grouping on. Total pages: 1059606 [ 0.000000] Policy zone: Normal [ 0.000000] Kernel command line: rd.shell root=nbd:192.168.200.253:rocky8.10:ext4:ro:-p,-b4096 ro crashkernel=256M panic=1 nomodeset ipmtu=9000 ip=dhcp rd.neednet=1 init_on_free=off mitigations=off console=ttyS1,115200 audit=0 [ 0.000000] Specific versions of hardware are certified with Red Hat Enterprise Linux 8. Please see the list of hardware certified with Red Hat Enterprise Linux 8 at https://catalog.redhat.com. [ 0.000000] audit: disabled (until reboot) [ 0.000000] software IO TLB: area num 4. [ 0.000000] Memory: 2829652K/4306352K available (18435K kernel code, 11221K rwdata, 7248K rodata, 2908K init, 18040K bss, 524580K reserved, 0K cma-reserved) [ 0.000000] SLUB: HWalign=64, Order=0-3, MinObjects=0, CPUs=4, Nodes=1 [ 0.000000] kmemleak: Kernel memory leak detector disabled [ 0.000000] ftrace: allocating 41240 entries in 162 pages [ 0.000000] ftrace: allocated 162 pages with 3 groups [ 0.000000] rcu: Hierarchical RCU implementation. [ 0.000000] rcu: RCU event tracing is enabled. [ 0.000000] rcu: RCU restricting CPUs from NR_CPUS=8192 to nr_cpu_ids=4. [ 0.000000] rcu: RCU callback double-/use-after-free debug enabled. [ 0.000000] Rude variant of Tasks RCU enabled. [ 0.000000] Tracing variant of Tasks RCU enabled. [ 0.000000] rcu: RCU calculated value of scheduler-enlistment delay is 100 jiffies. [ 0.000000] rcu: Adjusting geometry for rcu_fanout_leaf=16, nr_cpu_ids=4 [ 0.000000] NR_IRQS: 524544, nr_irqs: 456, preallocated irqs: 16 [ 0.000000] random: get_random_bytes called from start_kernel+0x622/0x9a8 with crng_init=0 [ 0.001000] Console: colour *CGA 80x25 [ 0.001000] printk: console [ttyS1] enabled [ 0.001000] ACPI: Core revision 20220331 [ 0.001000] clocksource: hpet: mask: 0xffffffff max_cycles: 0xffffffff, max_idle_ns: 19112604467 ns [ 0.001000] hpet clockevent registered [ 0.001010] APIC: Switch to symmetric I/O mode setup [ 0.003116] x2apic enabled [ 0.004008] Switched APIC routing to physical x2apic. [ 0.005011] kvm-guest: setup PV IPIs [ 0.008000] ..TIMER: vector=0x30 apic1=0 pin1=2 apic2=-1 pin2=-1 [ 0.008000] clocksource: tsc-early: mask: 0xffffffffffffffff max_cycles: 0x229833f6470, max_idle_ns: 440795327230 ns [ 0.008018] Calibrating delay loop (skipped) preset value.. 4799.99 BogoMIPS (lpj=2399996) [ 0.009010] pid_max: default: 32768 minimum: 301 [ 0.010352] LSM: Security Framework initializing [ 0.011053] Yama: becoming mindful. [ 0.012038] SELinux: Initializing. [ 0.013060] *** VALIDATE selinux *** [ 0.021668] Dentry cache hash table entries: 1048576 (order: 11, 8388608 bytes, vmalloc) [ 0.026371] Inode-cache hash table entries: 524288 (order: 10, 4194304 bytes, vmalloc) [ 0.028069] Mount-cache hash table entries: 16384 (order: 5, 131072 bytes, vmalloc) [ 0.029110] Mountpoint-cache hash table entries: 16384 (order: 5, 131072 bytes, vmalloc) [ 0.030114] *** VALIDATE tmpfs *** [ 0.031473] *** VALIDATE proc *** [ 0.032220] *** VALIDATE cgroup *** [ 0.033010] *** VALIDATE cgroup2 *** [ 0.035251] x86/cpu: User Mode Instruction Prevention (UMIP) activated [ 0.036347] Last level iTLB entries: 4KB 0, 2MB 0, 4MB 0 [ 0.037005] Last level dTLB entries: 4KB 0, 2MB 0, 4MB 0, 1GB 0 [ 0.038025] Spectre V2 : User space: Vulnerable [ 0.039004] Speculative Store Bypass: Vulnerable [ 0.042288] debug: unmapping init [mem 0xffffffff9c059000-0xffffffff9c060fff] [ 0.044000] smpboot: CPU0: Intel(R) Xeon(R) CPU E5-2695 v2 @ 2.40GHz (family: 0x6, model: 0x3e, stepping: 0x4) [ 0.044707] Performance Events: IvyBridge events, full-width counters, Intel PMU driver. [ 0.045016] ... version: 2 [ 0.046011] ... bit width: 48 [ 0.047008] ... generic registers: 4 [ 0.048009] ... value mask: 0000ffffffffffff [ 0.049012] ... max period: 00007fffffffffff [ 0.050011] ... fixed-purpose events: 3 [ 0.050837] ... event mask: 000000070000000f [ 0.051255] rcu: Hierarchical SRCU implementation. [ 0.053394] smp: Bringing up secondary CPUs ... [ 0.054523] x86: Booting SMP configuration: [ 0.055015] .... node #0, CPUs: #1 #2 #3 [ 0.059073] smp: Brought up 1 node, 4 CPUs [ 0.061011] smpboot: Max logical packages: 1 [ 0.062013] smpboot: Total of 4 processors activated (19199.96 BogoMIPS) [ 0.213289] node 0 deferred pages initialised in 150ms [ 0.217266] devtmpfs: initialized [ 0.218220] x86/mm: Memory block size: 128MB [ 0.221633] gcov: version magic: 0x41383552 [ 0.224214] clocksource: jiffies: mask: 0xffffffff max_cycles: 0xffffffff, max_idle_ns: 1911260446275000 ns [ 0.226074] futex hash table entries: 1024 (order: 4, 65536 bytes, vmalloc) [ 0.228475] pinctrl core: initialized pinctrl subsystem [ 0.230674] ************************************************************* [ 0.233012] ** NOTICE NOTICE NOTICE NOTICE NOTICE NOTICE NOTICE ** [ 0.235008] ** ** [ 0.238010] ** IOMMU DebugFS SUPPORT HAS BEEN ENABLED IN THIS KERNEL ** [ 0.240008] ** ** [ 0.242010] ** This means that this kernel is built to expose internal ** [ 0.244020] ** IOMMU data structures, which may compromise security on ** [ 0.247009] ** your system. ** [ 0.250015] ** ** [ 0.252007] ** If you see this message and you are not debugging the ** [ 0.254008] ** kernel, report this immediately to your vendor! ** [ 0.257009] ** ** [ 0.259009] ** NOTICE NOTICE NOTICE NOTICE NOTICE NOTICE NOTICE ** [ 0.260010] ************************************************************* [ 0.262456] NET: Registered protocol family 16 [ 0.264662] DMA: preallocated 512 KiB GFP_KERNEL pool for atomic allocations [ 0.267047] DMA: preallocated 512 KiB GFP_KERNEL|GFP_DMA pool for atomic allocations [ 0.270045] DMA: preallocated 512 KiB GFP_KERNEL|GFP_DMA32 pool for atomic allocations [ 0.273430] cpuidle: using governor menu [ 0.275508] acpiphp: ACPI Hot Plug PCI Controller Driver version: 0.5 [ 0.277422] PCI: Using configuration type 1 for base access [ 0.279085] core: PMU erratum BJ122, BV98, HSD29 worked around, HT is on [ 0.287226] HugeTLB registered 1.00 GiB page size, pre-allocated 0 pages [ 0.289061] HugeTLB registered 2.00 MiB page size, pre-allocated 0 pages [ 0.292067] cryptd: max_cpu_qlen set to 1000 [ 0.296038] ACPI: Added _OSI(Module Device) [ 0.297013] ACPI: Added _OSI(Processor Device) [ 0.298000] ACPI: Added _OSI(3.0 _SCP Extensions) [ 0.299011] ACPI: Added _OSI(Processor Aggregator Device) [ 0.302497] ACPI: 1 ACPI AML tables successfully acquired and loaded [ 0.308367] ACPI: Interpreter enabled [ 0.309036] ACPI: PM: (supports S0 S3 S4 S5) [ 0.310008] ACPI: Using IOAPIC for interrupt routing [ 0.311104] PCI: Using host bridge windows from ACPI; if necessary, use "pci=nocrs" and report a bug [ 0.315439] ACPI: Enabled 2 GPEs in block 00 to 0F [ 0.326572] ACPI: PCI Root Bridge [PCI0] (domain 0000 [bus 00-ff]) [ 0.329026] acpi PNP0A03:00: _OSC: OS supports [ASPM ClockPM Segments MSI HPX-Type3] [ 0.332014] acpi PNP0A03:00: _OSC: not requesting OS control; OS requires [ExtendedConfig ASPM ClockPM MSI] [ 0.335066] acpi PNP0A03:00: fail to add MMCONFIG information, can't access extended PCI configuration space under this bridge. [ 0.340313] acpiphp: Slot [2] registered [ 0.341085] acpiphp: Slot [5] registered [ 0.342045] acpiphp: Slot [6] registered [ 0.344102] acpiphp: Slot [7] registered [ 0.345100] acpiphp: Slot [8] registered [ 0.347134] acpiphp: Slot [9] registered [ 0.349232] acpiphp: Slot [10] registered [ 0.351116] acpiphp: Slot [3] registered [ 0.353069] acpiphp: Slot [4] registered [ 0.354064] acpiphp: Slot [11] registered [ 0.355069] acpiphp: Slot [12] registered [ 0.356074] acpiphp: Slot [13] registered [ 0.357046] acpiphp: Slot [14] registered [ 0.359063] acpiphp: Slot [15] registered [ 0.361071] acpiphp: Slot [16] registered [ 0.362048] acpiphp: Slot [17] registered [ 0.363043] acpiphp: Slot [18] registered [ 0.364089] acpiphp: Slot [19] registered [ 0.366047] acpiphp: Slot [20] registered [ 0.366990] acpiphp: Slot [21] registered [ 0.368065] acpiphp: Slot [22] registered [ 0.369008] acpiphp: Slot [23] registered [ 0.370068] acpiphp: Slot [24] registered [ 0.371260] acpiphp: Slot [25] registered [ 0.373081] acpiphp: Slot [26] registered [ 0.374066] acpiphp: Slot [27] registered [ 0.376068] acpiphp: Slot [28] registered [ 0.378068] acpiphp: Slot [29] registered [ 0.379051] acpiphp: Slot [30] registered [ 0.380046] acpiphp: Slot [31] registered [ 0.382047] PCI host bridge to bus 0000:00 [ 0.383014] pci_bus 0000:00: root bus resource [io 0x0000-0x0cf7 window] [ 0.386018] pci_bus 0000:00: root bus resource [io 0x0d00-0xffff window] [ 0.388014] pci_bus 0000:00: root bus resource [mem 0x000a0000-0x000bffff window] [ 0.391019] pci_bus 0000:00: root bus resource [mem 0xc0000000-0xfebfffff window] [ 0.394019] pci_bus 0000:00: root bus resource [mem 0xe0000000000-0xe007fffffff window] [ 0.396018] pci_bus 0000:00: root bus resource [bus 00-ff] [ 0.399169] pci 0000:00:00.0: [8086:1237] type 00 class 0x060000 [ 0.402158] pci 0000:00:01.0: [8086:7000] type 00 class 0x060100 [ 0.406127] pci 0000:00:01.1: [8086:7010] type 00 class 0x010180 [ 0.415012] pci 0000:00:01.1: reg 0x20: [io 0xc320-0xc32f] [ 0.419932] pci 0000:00:01.1: legacy IDE quirk: reg 0x10: [io 0x01f0-0x01f7] [ 0.422011] pci 0000:00:01.1: legacy IDE quirk: reg 0x14: [io 0x03f6] [ 0.424009] pci 0000:00:01.1: legacy IDE quirk: reg 0x18: [io 0x0170-0x0177] [ 0.427010] pci 0000:00:01.1: legacy IDE quirk: reg 0x1c: [io 0x0376] [ 0.429581] pci 0000:00:01.3: [8086:7113] type 00 class 0x068000 [ 0.433157] pci 0000:00:01.3: quirk: [io 0x0600-0x063f] claimed by PIIX4 ACPI [ 0.435021] pci 0000:00:01.3: quirk: [io 0x0700-0x070f] claimed by PIIX4 SMB [ 0.437523] pci 0000:00:02.0: [1af4:1000] type 00 class 0x020000 [ 0.442010] pci 0000:00:02.0: reg 0x10: [io 0xc300-0xc31f] [ 0.454012] pci 0000:00:02.0: reg 0x20: [mem 0xe0000000000-0xe0000003fff 64bit pref] [ 0.459000] pci 0000:00:02.0: reg 0x30: [mem 0xfeb80000-0xfebbffff pref] [ 0.466471] pci 0000:00:05.0: [1af4:1001] type 00 class 0x010000 [ 0.472013] pci 0000:00:05.0: reg 0x10: [io 0xc000-0xc07f] [ 0.478023] pci 0000:00:05.0: reg 0x14: [mem 0xfebc0000-0xfebc0fff] [ 0.495015] pci 0000:00:05.0: reg 0x20: [mem 0xe0000004000-0xe0000007fff 64bit pref] [ 0.510158] pci 0000:00:06.0: [1af4:1001] type 00 class 0x010000 [ 0.519018] pci 0000:00:06.0: reg 0x10: [io 0xc080-0xc0ff] [ 0.527014] pci 0000:00:06.0: reg 0x14: [mem 0xfebc1000-0xfebc1fff] [ 0.549016] pci 0000:00:06.0: reg 0x20: [mem 0xe0000008000-0xe000000bfff 64bit pref] [ 0.559593] pci 0000:00:07.0: [1af4:1001] type 00 class 0x010000 [ 0.565014] pci 0000:00:07.0: reg 0x10: [io 0xc100-0xc17f] [ 0.572013] pci 0000:00:07.0: reg 0x14: [mem 0xfebc2000-0xfebc2fff] [ 0.588024] pci 0000:00:07.0: reg 0x20: [mem 0xe000000c000-0xe000000ffff 64bit pref] [ 0.598262] pci 0000:00:08.0: [1af4:1001] type 00 class 0x010000 [ 0.605017] pci 0000:00:08.0: reg 0x10: [io 0xc180-0xc1ff] [ 0.617024] pci 0000:00:08.0: reg 0x14: [mem 0xfebc3000-0xfebc3fff] [ 0.638023] pci 0000:00:08.0: reg 0x20: [mem 0xe0000010000-0xe0000013fff 64bit pref] [ 0.650346] pci 0000:00:09.0: [1af4:1001] type 00 class 0x010000 [ 0.657022] pci 0000:00:09.0: reg 0x10: [io 0xc200-0xc27f] [ 0.666013] pci 0000:00:09.0: reg 0x14: [mem 0xfebc4000-0xfebc4fff] [ 0.681018] pci 0000:00:09.0: reg 0x20: [mem 0xe0000014000-0xe0000017fff 64bit pref] [ 0.691010] pci 0000:00:0a.0: [1af4:1001] type 00 class 0x010000 [ 0.695016] pci 0000:00:0a.0: reg 0x10: [io 0xc280-0xc2ff] [ 0.703014] pci 0000:00:0a.0: reg 0x14: [mem 0xfebc5000-0xfebc5fff] [ 0.725015] pci 0000:00:0a.0: reg 0x20: [mem 0xe0000018000-0xe000001bfff 64bit pref] [ 0.738035] ACPI: PCI: Interrupt link LNKA configured for IRQ 10 [ 0.741422] ACPI: PCI: Interrupt link LNKB configured for IRQ 10 [ 0.744878] ACPI: PCI: Interrupt link LNKC configured for IRQ 11 [ 0.749560] ACPI: PCI: Interrupt link LNKD configured for IRQ 11 [ 0.752232] ACPI: PCI: Interrupt link LNKS configured for IRQ 9 [ 0.756138] iommu: Default domain type: Passthrough [ 0.759471] SCSI subsystem initialized [ 0.760130] ACPI: bus type USB registered [ 0.762127] usbcore: registered new interface driver usbfs [ 0.764068] usbcore: registered new interface driver hub [ 0.767076] usbcore: registered new device driver usb [ 0.768144] pps_core: LinuxPPS API ver. 1 registered [ 0.771011] pps_core: Software ver. 5.3.6 - Copyright 2005-2007 Rodolfo Giometti [ 0.775057] PTP clock support registered [ 0.776621] EDAC MC: Ver: 3.0.0 [ 0.780068] PCI: Using ACPI for IRQ routing [ 0.782010] PCI: pci_cache_line_size set to 64 bytes [ 0.782747] e820: reserve RAM buffer [mem 0x0009fc00-0x0009ffff] [ 0.782760] e820: reserve RAM buffer [mem 0xbffce000-0xbfffffff] [ 0.782765] e820: reserve RAM buffer [mem 0x146e00000-0x147ffffff] [ 0.783365] NetLabel: Initializing [ 0.784010] NetLabel: domain hash size = 128 [ 0.784915] NetLabel: protocols = UNLABELED CIPSOv4 CALIPSO [ 0.787111] NetLabel: unlabeled traffic allowed by default [ 0.790120] vgaarb: loaded [ 0.792271] hpet0: at MMIO 0xfed00000, IRQs 2, 8, 0 [ 0.793006] hpet0: 3 comparators, 64-bit 100.000000 MHz counter [ 0.798963] clocksource: Switched to clocksource kvm-clock [ 0.946960] VFS: Disk quotas dquot_6.6.0 [ 0.948382] VFS: Dquot-cache hash table entries: 512 (order 0, 4096 bytes) [ 0.951081] *** VALIDATE ramfs *** [ 0.952469] *** VALIDATE hugetlbfs *** [ 0.954540] pnp: PnP ACPI init [ 0.956042] pnp 00:00: Plug and Play ACPI device, IDs PNP0303 (active) [ 0.956117] pnp 00:01: Plug and Play ACPI device, IDs PNP0f13 (active) [ 0.956145] pnp 00:02: [dma 2] [ 0.956178] pnp 00:02: Plug and Play ACPI device, IDs PNP0700 (active) [ 0.956221] pnp 00:03: Plug and Play ACPI device, IDs PNP0501 (active) [ 0.956318] pnp 00:04: Plug and Play ACPI device, IDs PNP0501 (active) [ 0.956398] pnp 00:05: Plug and Play ACPI device, IDs PNP0b00 (active) [ 0.957062] pnp: PnP ACPI: found 6 devices [ 0.977548] clocksource: acpi_pm: mask: 0xffffff max_cycles: 0xffffff, max_idle_ns: 2085701024 ns [ 0.981373] pci_bus 0000:00: resource 4 [io 0x0000-0x0cf7 window] [ 0.985483] pci_bus 0000:00: resource 5 [io 0x0d00-0xffff window] [ 0.988219] pci_bus 0000:00: resource 6 [mem 0x000a0000-0x000bffff window] [ 0.992917] pci_bus 0000:00: resource 7 [mem 0xc0000000-0xfebfffff window] [ 0.997032] pci_bus 0000:00: resource 8 [mem 0xe0000000000-0xe007fffffff window] [ 1.000751] NET: Registered protocol family 2 [ 1.003642] IP idents hash table entries: 131072 (order: 8, 1048576 bytes, vmalloc) [ 1.012270] tcp_listen_portaddr_hash hash table entries: 4096 (order: 5, 163840 bytes, vmalloc) [ 1.020895] TCP established hash table entries: 65536 (order: 7, 524288 bytes, vmalloc) [ 1.028652] TCP bind hash table entries: 65536 (order: 9, 2097152 bytes, vmalloc) [ 1.032419] TCP: Hash tables configured (established 65536 bind 65536) [ 1.034776] MPTCP token hash table entries: 8192 (order: 6, 393216 bytes, vmalloc) [ 1.038663] UDP hash table entries: 4096 (order: 6, 393216 bytes, vmalloc) [ 1.042094] UDP-Lite hash table entries: 4096 (order: 6, 393216 bytes, vmalloc) [ 1.046133] NET: Registered protocol family 1 [ 1.051058] RPC: Registered named UNIX socket transport module. [ 1.053627] RPC: Registered udp transport module. [ 1.054945] RPC: Registered tcp transport module. [ 1.056472] RPC: Registered tcp NFSv4.1 backchannel transport module. [ 1.058744] NET: Registered protocol family 44 [ 1.060333] pci 0000:00:00.0: Limiting direct PCI/PCI transfers [ 1.062961] pci 0000:00:01.0: PIIX3: Enabling Passive Release [ 1.064780] pci 0000:00:01.0: Activating ISA DMA hang workarounds [ 1.067379] PCI: CLS 0 bytes, default 64 [ 1.069848] Unpacking initramfs... [ 2.779971] debug: unmapping init [mem 0xffff994ffcc54000-0xffff994ffffbffff] [ 2.784966] PCI-DMA: Using software bounce buffering for IO (SWIOTLB) [ 2.787383] software IO TLB: mapped [mem 0x00000000a8000000-0x00000000ac000000] (64MB) [ 2.789699] clocksource: tsc: mask: 0xffffffffffffffff max_cycles: 0x229833f6470, max_idle_ns: 440795327230 ns [ 3.478406] Initialise system trusted keyrings [ 3.480898] Key type blacklist registered [ 3.484068] workingset: timestamp_bits=36 max_order=20 bucket_order=0 [ 3.493304] zbud: loaded [ 3.497570] *** VALIDATE nfs *** [ 3.499073] *** VALIDATE nfs4 *** [ 3.500705] pstore: using deflate compression [ 3.503875] Platform Keyring initialized [ 3.641269] NET: Registered protocol family 38 [ 3.642749] Key type asymmetric registered [ 3.644411] Asymmetric key parser 'x509' registered [ 3.646358] Block layer SCSI generic (bsg) driver version 0.4 loaded (major 247) [ 3.649529] io scheduler mq-deadline registered [ 3.650561] io scheduler kyber registered [ 3.651874] io scheduler bfq registered [ 3.653273] atomic64_test: passed for x86-64 platform with CX8 and with SSE [ 3.655557] shpchp: Standard Hot Plug PCI Controller Driver version: 0.4 [ 3.658597] input: Power Button as /devices/LNXSYSTM:00/LNXPWRBN:00/input/input0 [ 3.661113] ACPI: Power Button [PWRF] [ 3.667063] ACPI: \_SB_.LNKB: Enabled at IRQ 10 [ 3.676080] ACPI: \_SB_.LNKA: Enabled at IRQ 11 [ 3.694702] ACPI: \_SB_.LNKC: Enabled at IRQ 11 [ 3.701720] ACPI: \_SB_.LNKD: Enabled at IRQ 10 [ 3.718528] Serial: 8250/16550 driver, 4 ports, IRQ sharing enabled [ 3.749252] 00:03: ttyS1 at I/O 0x2f8 (irq = 3, base_baud = 115200) is a 16550A [ 3.784034] 00:04: ttyS0 at I/O 0x3f8 (irq = 4, base_baud = 115200) is a 16550A [ 3.793551] Non-volatile memory driver v1.3 [ 3.795573] Linux agpgart interface v0.103 [ 3.835813] virtio_blk virtio1: [vda] 134584 512-byte logical blocks (68.9 MB/65.7 MiB) [ 3.841958] vda: detected capacity change from 0 to 68907008 [ 3.861491] virtio_blk virtio2: [vdb] 2097152 512-byte logical blocks (1.07 GB/1.00 GiB) [ 3.864826] vdb: detected capacity change from 0 to 1073741824 [ 3.884240] virtio_blk virtio3: [vdc] 5120000 512-byte logical blocks (2.62 GB/2.44 GiB) [ 3.887824] vdc: detected capacity change from 0 to 2621440000 [ 3.901130] virtio_blk virtio4: [vdd] 5120000 512-byte logical blocks (2.62 GB/2.44 GiB) [ 3.903775] vdd: detected capacity change from 0 to 2621440000 [ 3.920294] virtio_blk virtio5: [vde] 8388608 512-byte logical blocks (4.29 GB/4.00 GiB) [ 3.928493] vde: detected capacity change from 0 to 4294967296 [ 3.965391] virtio_blk virtio6: [vdf] 8388608 512-byte logical blocks (4.29 GB/4.00 GiB) [ 3.970886] vdf: detected capacity change from 0 to 4294967296 [ 3.995492] libphy: Fixed MDIO Bus: probed [ 4.017961] usbcore: registered new interface driver usbserial_generic [ 4.020665] usbserial: USB Serial support registered for generic [ 4.023967] i8042: PNP: PS/2 Controller [PNP0303:KBD,PNP0f13:MOU] at 0x60,0x64 irq 1,12 [ 4.030103] serio: i8042 KBD port at 0x60,0x64 irq 1 [ 4.032319] serio: i8042 AUX port at 0x60,0x64 irq 12 [ 4.035504] mousedev: PS/2 mouse device common for all mice [ 4.040422] rtc_cmos 00:05: RTC can wake from S4 [ 4.045114] input: AT Translated Set 2 keyboard as /devices/platform/i8042/serio0/input/input1 [ 4.049208] rtc_cmos 00:05: registered as rtc0 [ 4.053209] rtc_cmos 00:05: alarms up to one day, y3k, 242 bytes nvram, hpet irqs [ 4.055843] intel_pstate: CPU model not supported [ 4.058496] input: VirtualPS/2 VMware VMMouse as /devices/platform/i8042/serio1/input/input4 [ 4.065942] hid: raw HID events driver (C) Jiri Kosina [ 4.068182] usbcore: registered new interface driver usbhid [ 4.068282] input: VirtualPS/2 VMware VMMouse as /devices/platform/i8042/serio1/input/input3 [ 4.069910] usbhid: USB HID core driver [ 4.079035] drop_monitor: Initializing network drop monitor service [ 4.082553] Initializing XFRM netlink socket [ 4.085028] NET: Registered protocol family 10 [ 4.089186] Segment Routing with IPv6 [ 4.092117] NET: Registered protocol family 17 [ 4.095855] mpls_gso: MPLS GSO support [ 4.097657] start plist test [ 4.102491] end plist test [ 4.105159] RAS: Correctable Errors collector initialized. [ 4.108335] AVX version of gcm_enc/dec engaged. [ 4.111841] AES CTR mode by8 optimization enabled [ 4.215290] sched_clock: Marking stable (4215271271, 0)->(5193892773, -978621502) [ 4.218889] registered taskstats version 1 [ 4.221726] Loading compiled-in X.509 certificates [ 4.223888] zswap: loaded using pool lzo/zbud [ 4.248642] Key type big_key registered [ 4.289183] Key type encrypted registered [ 4.292553] ima: No TPM chip found, activating TPM-bypass! [ 4.296966] ima: Allocated hash algorithm: sha1 [ 4.298645] ima: No architecture policies found [ 4.301456] evm: Initialising EVM extended attributes: [ 4.304460] evm: security.selinux [ 4.307171] evm: security.ima [ 4.308615] evm: security.capability [ 4.310936] evm: HMAC attrs: 0x1 [ 4.314155] rtc_cmos 00:05: setting system clock to 2026-03-16 13:35:12 UTC (1773668112) [ 4.323309] debug: unmapping init [mem 0xffffffff9d003000-0xffffffff9d1fffff] [ 4.327756] debug: unmapping init [mem 0xffffffff9bd82000-0xffffffff9c058fff] [ 4.336437] Write protecting the kernel read-only data: 28672k [ 4.343572] debug: unmapping init [mem 0xffffffff9a403000-0xffffffff9a5fffff] [ 4.347373] debug: unmapping init [mem 0xffffffff9ad14000-0xffffffff9adfffff] [ 4.394556] systemd[1]: systemd 239 (239-82.el8_10.5) running in system mode. (+PAM +AUDIT +SELINUX +IMA -APPARMOR +SMACK +SYSVINIT +UTMP +LIBCRYPTSETUP +GCRYPT +GNUTLS +ACL +XZ +LZ4 +SECCOMP +BLKID +ELFUTILS +KMOD +IDN2 -IDN +PCRE2 default-hierarchy=legacy) [ 4.405615] systemd[1]: Detected virtualization kvm. [ 4.407557] systemd[1]: Detected architecture x86-64. [ 4.409674] systemd[1]: Running in initial RAM disk. [ 4.437437] systemd[1]: No hostname configured. [ 4.439108] systemd[1]: Set hostname to . [ 4.441206] random: systemd: uninitialized urandom read (16 bytes read) [ 4.444387] systemd[1]: Initializing machine ID from random generator. [ 4.743476] random: systemd: uninitialized urandom read (16 bytes read) [ 4.747259] systemd[1]: Listening on udev Control Socket. [ 4.758613] random: systemd: uninitialized urandom read (16 bytes read) [ 4.762210] systemd[1]: Reached target Local File Systems. [ 4.770746] systemd[1]: Listening on Journal Socket. [ 5.942673] device-mapper: uevent: version 1.0.3 [ 5.945423] device-mapper: ioctl: 4.46.0-ioctl (2022-02-22) initialised: dm-devel@redhat.com [ 7.195380] random: fast init done [ 7.281073] virtio_net virtio0 ens2: renamed from eth0 [ 7.297969] libata version 3.00 loaded. [ 7.311792] ata_piix 0000:00:01.1: version 2.13 [ 7.325589] scsi host0: ata_piix [ 7.373437] scsi host1: ata_piix [ 7.377278] ata1: PATA max MWDMA2 cmd 0x1f0 ctl 0x3f6 bmdma 0xc320 irq 14 [ 7.383499] ata2: PATA max MWDMA2 cmd 0x170 ctl 0x376 bmdma 0xc328 irq 15 [ 12.826554] random: crng init done [ 12.828904] random: 7 urandom warning(s) missed due to ratelimiting [ 14.614620] EXT4-fs (nbd0): mounted filesystem with ordered data mode. Opts: (null) [ 16.984405] printk: systemd: 26 output lines suppressed due to ratelimiting [ 17.551781] SELinux: Disabled at runtime. [ 17.644775] systemd[1]: systemd 239 (239-82.el8_10.5) running in system mode. (+PAM +AUDIT +SELINUX +IMA -APPARMOR +SMACK +SYSVINIT +UTMP +LIBCRYPTSETUP +GCRYPT +GNUTLS +ACL +XZ +LZ4 +SECCOMP +BLKID +ELFUTILS +KMOD +IDN2 -IDN +PCRE2 default-hierarchy=legacy) [ 17.656745] systemd[1]: Detected virtualization kvm. [ 17.659471] systemd[1]: Detected architecture x86-64. [ 18.663869] systemd[1]: initrd-switch-root.service: Succeeded. [ 18.668894] systemd[1]: Stopped Switch Root. [ 18.677477] systemd[1]: systemd-journald.service: Service has no hold-off time (RestartSec=0), scheduling restart. [ 18.684820] systemd[1]: systemd-journald.service: Scheduled restart job, restart counter is at 1. [ 18.693942] systemd[1]: Stopped Journal Service. [ 18.711482] systemd[1]: Starting Journal Service... [ 18.718097] systemd[1]: Listening on udev Kernel Socket. [ 19.444808] Adding 1048572k swap on /dev/vdb. Priority:-2 extents:1 across:1048572k FS [ 20.141079] squashfs: version 4.0 (2009/01/31) Phillip Lougher [ 20.792140] input: PC Speaker as /devices/platform/pcspkr/input/input5 [ 20.850715] piix4_smbus 0000:00:01.3: SMBus Host Controller at 0x700, revision 0 [ 21.879406] RAPL PMU: API unit is 2^-32 Joules, 0 fixed counters, 10737418240 ms ovfl timer [ 21.907334] EDAC sbridge: Seeking for: PCI ID 8086:0ea0 [ 21.907366] EDAC sbridge: Ver: 1.1.2 [ 26.522533] Key type dns_resolver registered [ 27.369666] NFS: Registering the id_resolver key type [ 27.373304] Key type id_resolver registered [ 27.375706] Key type id_legacy registered [ 57.507106] hrtimer: interrupt took 2177138 ns [ 87.541113] libcfs: loading out-of-tree module taints kernel. [ 87.586511] Key type ._llcrypt registered [ 87.588620] Key type .llcrypt registered [ 87.649640] Lustre: DEBUG MARKER: oleg103-server.virtnet: executing set_hostid [ 97.969115] Lustre: DEBUG MARKER: oleg103-server.virtnet: executing load_modules_local [ 98.771305] libcfs: HW NUMA nodes: 1, HW CPU cores: 4, npartitions: 1 [ 98.781057] alg: No test for adler32 (adler32-zlib) [ 99.989222] Lustre: Lustre: Build Version: 2.17.51_1_gb548ff5 [ 100.502098] LNet: Added LNI 192.168.201.103@tcp [8/256/0/180] [ 102.191424] Key type lgssc registered [ 103.086879] Lustre: Echo OBD driver; http://www.lustre.org/ [ 113.291190] ZFS: Loaded module v2.3.2-1, ZFS pool version 5000, ZFS filesystem version 5 [ 137.136919] Lustre: DEBUG MARKER: oleg103-server.virtnet: executing load_modules_local [ 145.161887] Lustre: lustre-MDT0000: mounting server target with '-t lustre' deprecated, use '-t lustre_tgt' [ 145.189887] LDISKFS-fs (dm-0): mounted filesystem with ordered data mode. Opts: user_xattr,errors=remount-ro,no_mbcache,nodelalloc [ 146.355198] Lustre: Setting parameter lustre-MDT0000.mdt.identity_upcall=/home/green/git/lustre-release/lustre/utils/l_getidentity in log lustre-MDT0000 [ 146.373635] Lustre: ctl-lustre-MDT0000: No data found on store. Initialize space. [ 146.420627] Lustre: lustre-MDT0000: new disk, initializing [ 146.469139] Lustre: lustre-MDT0000: Imperative Recovery not enabled, recovery window 60-180 [ 146.486649] Lustre: ctl-lustre-MDT0000: super-sequence allocation rc = 0 [0x0000000200000400-0x0000000240000400]:0:mdt [ 148.943102] Lustre: DEBUG MARKER: oleg103-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all [ 156.760151] LDISKFS-fs (dm-1): mounted filesystem with ordered data mode. Opts: user_xattr,errors=remount-ro,no_mbcache,nodelalloc [ 156.839414] Lustre: 6483:0:(mgs_llog.c:1348:mgs_modify_param()) MGS: modify lustre-MDT0001/mdt.identity_upcall=/home/green/git/lustre-release/lustre/utils/l_getidentity (mode = 0) failed: rc = -17 [ 156.869107] Lustre: srv-lustre-MDT0001: No data found on store. Initialize space. [ 156.874383] Lustre: Skipped 1 previous similar message [ 156.938198] Lustre: lustre-MDT0001: new disk, initializing [ 156.981783] Lustre: lustre-MDT0001: Imperative Recovery not enabled, recovery window 60-180 [ 157.000249] Lustre: ctl-lustre-MDT0000: super-sequence allocation rc = 0 [0x0000000240000400-0x0000000280000400]:1:mdt [ 157.008294] Lustre: cli-ctl-lustre-MDT0001: Allocated super-sequence [0x0000000240000400-0x0000000280000400]:1:mdt] [ 159.004989] Lustre: DEBUG MARKER: oleg103-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all [ 161.795138] Lustre: Modifying parameter general.debug_raw_pointers=Y in log params [ 166.871783] LDISKFS-fs (dm-2): mounted filesystem with ordered data mode. Opts: errors=remount-ro,no_mbcache,nodelalloc [ 167.002041] Lustre: lustre-OST0000: new disk, initializing [ 167.004683] Lustre: srv-lustre-OST0000: No data found on store. Initialize space. [ 167.032672] Lustre: lustre-OST0000: Imperative Recovery not enabled, recovery window 60-180 [ 169.331321] Lustre: ctl-lustre-MDT0000: super-sequence allocation rc = 0 [0x0000000280000400-0x00000002c0000400]:0:ost [ 169.335479] Lustre: cli-lustre-OST0000-super: Allocated super-sequence [0x0000000280000400-0x00000002c0000400]:0:ost] [ 169.382399] Lustre: lustre-OST0000-osc-MDT0000: update sequence from 0x100000000 to 0x280000401 [ 169.839668] Lustre: DEBUG MARKER: oleg103-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all [ 177.716542] LDISKFS-fs (dm-3): mounted filesystem with ordered data mode. Opts: errors=remount-ro,no_mbcache,nodelalloc [ 177.808510] Lustre: lustre-OST0001: new disk, initializing [ 177.812281] Lustre: srv-lustre-OST0001: No data found on store. Initialize space. [ 177.859843] Lustre: lustre-OST0001: Imperative Recovery not enabled, recovery window 60-180 [ 181.273071] Lustre: DEBUG MARKER: oleg103-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all [ 183.816795] Lustre: ctl-lustre-MDT0000: super-sequence allocation rc = 0 [0x00000002c0000400-0x0000000300000400]:1:ost [ 183.822904] Lustre: cli-lustre-OST0001-super: Allocated super-sequence [0x00000002c0000400-0x0000000300000400]:1:ost] [ 183.858864] Lustre: lustre-OST0001-osc-MDT0000: update sequence from 0x100010000 to 0x2c0000401 [ 188.563406] Lustre: DEBUG MARKER: Using TIMEOUT=20 [ 191.036220] Lustre: Setting parameter general.lod.*.mdt_hash=crush in log params [ 198.863097] Lustre: DEBUG MARKER: oleg103-server.virtnet: executing check_logdir /tmp/testlogs/ [ 201.123635] Lustre: DEBUG MARKER: oleg103-server.virtnet: executing yml_node [ 203.882276] Lustre: DEBUG MARKER: Client: 2.17.51.1 [ 205.459963] Lustre: DEBUG MARKER: MDS: 2.17.51.1 [ 206.937201] Lustre: DEBUG MARKER: OSS: 2.17.51.1 [ 207.952163] Lustre: DEBUG MARKER: -----============= acceptance-small: recovery-small ============----- Mon Mar 16 09:38:35 EDT 2026 [ 216.813945] Lustre: DEBUG MARKER: excepting tests: 136 [ 217.984335] Lustre: DEBUG MARKER: === recovery-small: start setup 09:38:45 (1773668325) === [ 220.265223] Lustre: DEBUG MARKER: oleg103-client.virtnet: executing check_config_client /mnt/lustre [ 231.273303] Lustre: DEBUG MARKER: Using TIMEOUT=20 [ 233.366839] Lustre: 13149:0:(mgs_llog.c:1348:mgs_modify_param()) MGS: modify general/lod.*.mdt_hash=crush (mode = 0) failed: rc = -17 [ 235.175939] Lustre: DEBUG MARKER: oleg103-server.virtnet: executing set_default_debug -1 all [ 238.249851] Lustre: DEBUG MARKER: === recovery-small: finish setup 09:39:05 (1773668345) === [ 239.365636] Lustre: DEBUG MARKER: == recovery-small test 1: create, chmod, stat: drop req, drop rep ========================================================== 09:39:06 (1773668346) [ 239.934831] Lustre: *** cfs_fail_loc=123, val=2147483648*** [ 255.163912] Lustre: lustre-MDT0000: Client 22349dc6-b7f5-4900-94db-a78e37956fa6 (at 192.168.201.3@tcp) reconnecting [ 256.275551] Lustre: *** cfs_fail_loc=119, val=2147483648*** [ 256.277385] LustreError: 6489:0:(ldlm_lib.c:3325:target_send_reply_msg()) @@@ dropping reply req@ffff99507f68bb80 x1859825904400128/t4294967300(0) o36->22349dc6-b7f5-4900-94db-a78e37956fa6@192.168.201.3@tcp:480/0 lens 520/448 e 0 to 0 dl 1773668375 ref 1 fl Interpret:/200/0 rc 0/0 job:'mcreate.0' uid:0 gid:0 projid:4294967295 [ 272.568788] Lustre: lustre-MDT0000: Client 22349dc6-b7f5-4900-94db-a78e37956fa6 (at 192.168.201.3@tcp) reconnecting [ 272.581872] Lustre: 9879:0:(mdt_recovery.c:102:mdt_req_from_lrd()) @@@ restoring transno req@ffff994f44460e00 x1859825904400128/t4294967300(0) o36->22349dc6-b7f5-4900-94db-a78e37956fa6@192.168.201.3@tcp:496/0 lens 520/2880 e 0 to 0 dl 1773668391 ref 1 fl Interpret:/202/0 rc 0/0 job:'mcreate.0' uid:0 gid:0 projid:4294967295 [ 273.847320] Lustre: *** cfs_fail_loc=123, val=2147483648*** [ 289.462388] Lustre: lustre-MDT0000: Client 22349dc6-b7f5-4900-94db-a78e37956fa6 (at 192.168.201.3@tcp) reconnecting [ 290.682131] Lustre: *** cfs_fail_loc=119, val=2147483648*** [ 290.684859] LustreError: 6489:0:(ldlm_lib.c:3325:target_send_reply_msg()) @@@ dropping reply req@ffff995067485c00 x1859825904404736/t4294967302(0) o36->22349dc6-b7f5-4900-94db-a78e37956fa6@192.168.201.3@tcp:514/0 lens 488/456 e 0 to 0 dl 1773668409 ref 1 fl Interpret:/200/0 rc 0/0 job:'tchmod.0' uid:0 gid:0 projid:4294967295 [ 306.879631] Lustre: lustre-MDT0000: Client 22349dc6-b7f5-4900-94db-a78e37956fa6 (at 192.168.201.3@tcp) reconnecting [ 306.898329] Lustre: 11311:0:(mdt_recovery.c:102:mdt_req_from_lrd()) @@@ restoring transno req@ffff994f450a0e00 x1859825904404736/t4294967302(0) o36->22349dc6-b7f5-4900-94db-a78e37956fa6@192.168.201.3@tcp:531/0 lens 488/3152 e 0 to 0 dl 1773668426 ref 1 fl Interpret:/202/0 rc 0/0 job:'tchmod.0' uid:0 gid:0 projid:4294967295 [ 308.160804] Lustre: *** cfs_fail_loc=123, val=2147483648*** [ 324.796466] Lustre: lustre-MDT0000: Client 22349dc6-b7f5-4900-94db-a78e37956fa6 (at 192.168.201.3@tcp) reconnecting [ 326.020908] Lustre: *** cfs_fail_loc=122, val=2147483648*** [ 326.023406] LustreError: 6490:0:(ldlm_lib.c:3325:target_send_reply_msg()) @@@ dropping reply req@ffff994f4598fb80 x1859825904408320/t0(0) o34->22349dc6-b7f5-4900-94db-a78e37956fa6@192.168.201.3@tcp:550/0 lens 472/464 e 0 to 0 dl 1773668445 ref 1 fl Interpret:/600/0 rc 0/0 job:'statone.0' uid:0 gid:0 projid:0 [ 342.203663] Lustre: lustre-MDT0000: Client 22349dc6-b7f5-4900-94db-a78e37956fa6 (at 192.168.201.3@tcp) reconnecting [ 346.235643] Lustre: DEBUG MARKER: == recovery-small test 4: open: drop req, drop rep ======= 09:40:53 (1773668453) [ 346.868099] Lustre: *** cfs_fail_loc=123, val=2147483648*** [ 362.166772] Lustre: lustre-MDT0000: Client 22349dc6-b7f5-4900-94db-a78e37956fa6 (at 192.168.201.3@tcp) reconnecting [ 363.331082] Lustre: *** cfs_fail_loc=122, val=2147483648*** [ 363.332754] LustreError: 6492:0:(ldlm_lib.c:3325:target_send_reply_msg()) @@@ dropping reply req@ffff994f45dbb480 x1859825904414592/t4294967308(0) o35->22349dc6-b7f5-4900-94db-a78e37956fa6@192.168.201.3@tcp:587/0 lens 392/456 e 0 to 0 dl 1773668482 ref 1 fl Interpret:/600/0 rc 0/0 job:'cat.0' uid:0 gid:0 projid:0 [ 379.559350] Lustre: 6492:0:(mdt_recovery.c:102:mdt_req_from_lrd()) @@@ restoring transno req@ffff994f4513df80 x1859825904414592/t4294967308(0) o35->22349dc6-b7f5-4900-94db-a78e37956fa6@192.168.201.3@tcp:603/0 lens 392/456 e 0 to 0 dl 1773668498 ref 1 fl Interpret:/602/0 rc 0/0 job:'cat.0' uid:0 gid:0 projid:0 [ 383.533466] Lustre: DEBUG MARKER: == recovery-small test 5: rename: drop req, drop rep ===== 09:41:30 (1773668490) [ 384.040545] Lustre: *** cfs_fail_loc=123, val=2147483648*** [ 400.577220] Lustre: lustre-MDT0000: Client 22349dc6-b7f5-4900-94db-a78e37956fa6 (at 192.168.201.3@tcp) reconnecting [ 400.586492] Lustre: Skipped 1 previous similar message [ 401.729683] Lustre: *** cfs_fail_loc=119, val=2147483648*** [ 401.732968] LustreError: 6504:0:(ldlm_lib.c:3325:target_send_reply_msg()) @@@ dropping reply req@ffff994f460f5c00 x1859825904422272/t4294967312(0) o36->22349dc6-b7f5-4900-94db-a78e37956fa6@192.168.201.3@tcp:625/0 lens 552/456 e 0 to 0 dl 1773668520 ref 1 fl Interpret:/200/0 rc 0/0 job:'mv.0' uid:0 gid:0 projid:4294967295 [ 417.959044] Lustre: 6503:0:(mdt_recovery.c:102:mdt_req_from_lrd()) @@@ restoring transno req@ffff994f46140700 x1859825904422272/t4294967312(0) o36->22349dc6-b7f5-4900-94db-a78e37956fa6@192.168.201.3@tcp:642/0 lens 552/2888 e 0 to 0 dl 1773668537 ref 1 fl Interpret:/202/0 rc 0/0 job:'mv.0' uid:0 gid:0 projid:4294967295 [ 422.182677] Lustre: DEBUG MARKER: == recovery-small test 6: link, unlink: drop req, drop rep ========================================================== 09:42:09 (1773668529) [ 422.721277] Lustre: *** cfs_fail_loc=123, val=2147483648*** [ 442.422424] Lustre: *** cfs_fail_loc=119, val=2147483648*** [ 442.426625] LustreError: 6491:0:(ldlm_lib.c:3325:target_send_reply_msg()) @@@ dropping reply req@ffff995067480e00 x1859825904429952/t4294967317(0) o36->22349dc6-b7f5-4900-94db-a78e37956fa6@192.168.201.3@tcp:666/0 lens 512/440 e 0 to 0 dl 1773668561 ref 1 fl Interpret:/200/0 rc 0/0 job:'link.0' uid:0 gid:0 projid:4294967295 [ 458.937868] Lustre: 6490:0:(mdt_recovery.c:102:mdt_req_from_lrd()) @@@ restoring transno req@ffff994f46034a80 x1859825904429952/t4294967317(0) o36->22349dc6-b7f5-4900-94db-a78e37956fa6@192.168.201.3@tcp:683/0 lens 512/440 e 0 to 0 dl 1773668578 ref 1 fl Interpret:/202/0 rc 0/0 job:'link.0' uid:0 gid:0 projid:4294967295 [ 462.539972] Lustre: *** cfs_fail_loc=123, val=2147483648*** [ 478.948634] Lustre: lustre-MDT0000: Client 22349dc6-b7f5-4900-94db-a78e37956fa6 (at 192.168.201.3@tcp) reconnecting [ 478.958992] Lustre: Skipped 3 previous similar messages [ 482.305673] Lustre: *** cfs_fail_loc=119, val=2147483648*** [ 482.312357] LustreError: 6491:0:(ldlm_lib.c:3325:target_send_reply_msg()) @@@ dropping reply req@ffff994f45c9bb80 x1859825904435456/t4294967319(0) o36->22349dc6-b7f5-4900-94db-a78e37956fa6@192.168.201.3@tcp:706/0 lens 504/456 e 0 to 0 dl 1773668601 ref 1 fl Interpret:/200/0 rc 0/0 job:'unlink.0' uid:0 gid:0 projid:4294967295 [ 498.901133] Lustre: 6489:0:(mdt_recovery.c:102:mdt_req_from_lrd()) @@@ restoring transno req@ffff994f45cb6a00 x1859825904435456/t4294967319(0) o36->22349dc6-b7f5-4900-94db-a78e37956fa6@192.168.201.3@tcp:723/0 lens 504/2888 e 0 to 0 dl 1773668618 ref 1 fl Interpret:/202/0 rc 0/0 job:'unlink.0' uid:0 gid:0 projid:4294967295 [ 509.560333] Lustre: DEBUG MARKER: == recovery-small test 8: touch: drop rep (bug 1423) ===== 09:43:35 (1773668615) [ 527.027971] Lustre: 13715:0:(mdt_recovery.c:102:mdt_req_from_lrd()) @@@ restoring transno req@ffff994f45abc700 x1859825904439936/t4294967322(0) o36->22349dc6-b7f5-4900-94db-a78e37956fa6@192.168.201.3@tcp:751/0 lens 488/3152 e 0 to 0 dl 1773668646 ref 1 fl Interpret:/202/0 rc 0/0 job:'touch.0' uid:0 gid:0 projid:4294967295 [ 533.894465] Lustre: DEBUG MARKER: == recovery-small test 9: pause bulk on OST (bug 1420) === 09:44:00 (1773668640) [ 535.619869] LustreError: 8375:0:(tgt_handler.c:2735:tgt_brw_write()) cfs_fail_timeout id 214 sleeping for 5000ms [ 540.719150] LustreError: 8375:0:(tgt_handler.c:2735:tgt_brw_write()) cfs_fail_timeout id 214 awake [ 547.645589] Lustre: DEBUG MARKER: == recovery-small test 10a: finish request on server after client eviction (bug 1521) ========================================================== 09:44:14 (1773668654) [ 564.191263] Lustre: 6489:0:(client.c:2478:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1773668656/real 1773668656] req@ffff994f4677ad80 x1859825915089280/t0(0) o104->lustre-MDT0000@192.168.201.3@tcp:15/16 lens 328/224 e 0 to 1 dl 1773668672 ref 1 fl Rpc:XQr/0/ffffffff rc 0/-1 job:'' uid:4294967295 gid:4294967295 projid:4294967295 [ 568.799284] Lustre: 8369:0:(client.c:2478:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1773668660/real 1773668660] req@ffff995076463100 x1859825915091712/t0(0) o104->lustre-OST0001@192.168.201.3@tcp:15/16 lens 328/224 e 0 to 1 dl 1773668676 ref 1 fl Rpc:XQr/0/ffffffff rc 0/-1 job:'' uid:4294967295 gid:4294967295 projid:4294967295 [ 580.575191] Lustre: 6489:0:(client.c:2478:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1773668672/real 1773668672] req@ffff994f4677ad80 x1859825915089280/t0(0) o104->lustre-MDT0000@192.168.201.3@tcp:15/16 lens 328/224 e 0 to 1 dl 1773668688 ref 1 fl Rpc:XQr/2/ffffffff rc 0/-1 job:'' uid:4294967295 gid:4294967295 projid:4294967295 [ 585.183325] Lustre: 8369:0:(client.c:2478:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1773668677/real 1773668677] req@ffff995076463100 x1859825915091712/t0(0) o104->lustre-OST0001@192.168.201.3@tcp:15/16 lens 328/224 e 0 to 1 dl 1773668693 ref 1 fl Rpc:XQr/2/ffffffff rc 0/-1 job:'' uid:4294967295 gid:4294967295 projid:4294967295 [ 590.815124] Lustre: mdt00_000: service thread pid 6489 was inactive for 42.845 seconds. The thread might be hung, or it might only be slow and will resume later. Dumping the stack trace for debugging purposes: [ 590.838820] task:mdt00_000 state:I stack:0 pid:6489 ppid:2 flags:0x80004000 [ 590.843725] Call Trace: [ 590.849291] __schedule+0x351/0xcb0 [ 590.853565] schedule+0xc0/0x180 [ 590.855845] schedule_timeout+0xb4/0x190 [ 590.860581] ? __next_timer_interrupt+0x160/0x160 [ 590.868278] wait_woken+0x9c/0xd0 [ 590.872701] ptlrpc_set_wait+0x3af/0xa50 [ptlrpc] [ 590.880504] ? do_wait_intr+0xf0/0xf0 [ 590.888780] ldlm_run_ast_work+0x10d/0x4d0 [ptlrpc] [ 590.896917] ldlm_handle_conflict_lock+0x97/0x490 [ptlrpc] [ 590.907484] ldlm_lock_enqueue+0x321/0xcd0 [ptlrpc] [ 590.916724] ldlm_cli_enqueue_local+0x6fe/0xbd0 [ptlrpc] [ 590.920203] ? ldlm_cli_enqueue_local+0xbd0/0xbd0 [ptlrpc] [ 590.932849] ? mdt_object_put+0x130/0x130 [mdt] [ 590.935864] mdt_object_lock_internal+0x20b/0x5a0 [mdt] [ 590.940769] ? mdt_object_put+0x130/0x130 [mdt] [ 590.945752] ? ldlm_cli_enqueue_local+0xbd0/0xbd0 [ptlrpc] [ 590.958510] mdt_object_lock+0x9e/0x240 [mdt] [ 590.964995] mdt_object_stripes_lock+0x28b/0x670 [mdt] [ 590.968793] mdt_reint_setattr+0xf58/0x1f90 [mdt] [ 590.972241] mdt_reint_rec+0x139/0x2b0 [mdt] [ 590.975473] mdt_reint_internal+0x6a0/0xdc0 [mdt] [ 590.982884] mdt_reint+0x163/0x190 [mdt] [ 590.992688] tgt_handle_request0+0x137/0xaf0 [ptlrpc] [ 590.999892] tgt_request_handle+0x573/0x1e70 [ptlrpc] [ 591.006523] ptlrpc_server_handle_request+0x443/0x13b0 [ptlrpc] [ 591.011770] ? lprocfs_counter_add+0x15b/0x210 [obdclass] [ 591.015170] ptlrpc_main+0xce8/0x1400 [ptlrpc] [ 591.025728] ? ptlrpc_wait_event+0x690/0x690 [ptlrpc] [ 591.030664] kthread+0x1d1/0x200 [ 591.032807] ? set_kthread_struct+0x70/0x70 [ 591.036476] ret_from_fork+0x1f/0x30 [ 594.912356] Lustre: ll_ost00_000: service thread pid 8369 was inactive for 42.479 seconds. The thread might be hung, or it might only be slow and will resume later. Dumping the stack trace for debugging purposes: [ 594.932165] task:ll_ost00_000 state:I stack:0 pid:8369 ppid:2 flags:0x80004000 [ 594.938782] Call Trace: [ 594.941208] __schedule+0x351/0xcb0 [ 594.942874] schedule+0xc0/0x180 [ 594.944887] schedule_timeout+0xb4/0x190 [ 594.946504] ? __next_timer_interrupt+0x160/0x160 [ 594.949538] wait_woken+0x9c/0xd0 [ 594.951866] ptlrpc_set_wait+0x3af/0xa50 [ptlrpc] [ 594.954884] ? do_wait_intr+0xf0/0xf0 [ 594.959205] ldlm_run_ast_work+0x10d/0x4d0 [ptlrpc] [ 594.967574] ldlm_handle_conflict_lock+0x97/0x490 [ptlrpc] [ 594.977166] ldlm_lock_enqueue+0x321/0xcd0 [ptlrpc] [ 594.981943] ldlm_cli_enqueue_local+0x58d/0xbd0 [ptlrpc] [ 594.988038] ? ldlm_cli_enqueue_local+0xbd0/0xbd0 [ptlrpc] [ 594.992039] ? ldlm_blocking_ast_nocheck+0x3f0/0x3f0 [ptlrpc] [ 594.997595] ofd_destroy_by_fid+0x391/0x7c0 [ofd] [ 595.001383] ? ldlm_blocking_ast_nocheck+0x3f0/0x3f0 [ptlrpc] [ 595.008907] ? ldlm_cli_enqueue_local+0xbd0/0xbd0 [ptlrpc] [ 595.011773] ? ofd_destroy_hdl+0x34a/0xfc0 [ofd] [ 595.013908] ofd_destroy_hdl+0x34a/0xfc0 [ofd] [ 595.016602] tgt_handle_request0+0x137/0xaf0 [ptlrpc] [ 595.018605] tgt_request_handle+0x573/0x1e70 [ptlrpc] [ 595.022949] ptlrpc_server_handle_request+0x443/0x13b0 [ptlrpc] [ 595.025784] ? lprocfs_counter_add+0x15b/0x210 [obdclass] [ 595.029663] ptlrpc_main+0xce8/0x1400 [ptlrpc] [ 595.034824] ? ptlrpc_wait_event+0x690/0x690 [ptlrpc] [ 595.039197] kthread+0x1d1/0x200 [ 595.042195] ? set_kthread_struct+0x70/0x70 [ 595.044184] ret_from_fork+0x1f/0x30 [ 595.935452] Lustre: 6489:0:(client.c:2478:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1773668688/real 1773668688] req@ffff994f4677ad80 x1859825915089280/t0(0) o104->lustre-MDT0000@192.168.201.3@tcp:15/16 lens 328/224 e 0 to 1 dl 1773668704 ref 1 fl Rpc:XQr/2/ffffffff rc 0/-1 job:'' uid:4294967295 gid:4294967295 projid:4294967295 [ 612.319165] Lustre: 6489:0:(client.c:2478:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1773668704/real 1773668704] req@ffff994f4677ad80 x1859825915089280/t0(0) o104->lustre-MDT0000@192.168.201.3@tcp:15/16 lens 328/224 e 0 to 1 dl 1773668720 ref 1 fl Rpc:XQr/2/ffffffff rc 0/-1 job:'' uid:4294967295 gid:4294967295 projid:4294967295 [ 612.341671] Lustre: 6489:0:(client.c:2478:ptlrpc_expire_one_request()) Skipped 1 previous similar message [ 628.703234] Lustre: 6489:0:(client.c:2478:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1773668720/real 1773668720] req@ffff994f4677ad80 x1859825915089280/t0(0) o104->lustre-MDT0000@192.168.201.3@tcp:15/16 lens 328/224 e 0 to 1 dl 1773668736 ref 1 fl Rpc:XQr/2/ffffffff rc 0/-1 job:'' uid:4294967295 gid:4294967295 projid:4294967295 [ 628.746365] Lustre: 6489:0:(client.c:2478:ptlrpc_expire_one_request()) Skipped 1 previous similar message [ 660.455738] LustreError: 6489:0:(ldlm_lockd.c:727:ldlm_handle_ast_error()) ### client (nid 192.168.201.3@tcp) failed to reply to blocking AST (req@00000000a7a87578 x1859825915089280 status 0 rc -110), evict it ns: mdt-lustre-MDT0000_UUID lock: ffff99506fbaf600/0x441c89a012505003 lrc: 4/0,0 mode: PR/PR res: [0x200000007:0x1:0x0].0x0 bits 0x13/0x0 rrc: 3 type: IBT gid 0 flags: 0x60200400000020 nid: 192.168.201.3@tcp remote: 0x2f77f56113783f9a expref: 9 pid: 13715 timeout: 744 lvb_type: 0 lru_score: 0 lru_type: 0 [ 660.514286] LustreError: lustre-MDT0000: A client on nid 192.168.201.3@tcp was evicted due to a lock blocking callback time out: rc -110 [ 660.525341] LustreError: 6479:0:(ldlm_lockd.c:255:expired_lock_main()) ### lock callback timer expired after 16s: evicting client at 192.168.201.3@tcp ns: mdt-lustre-MDT0000_UUID lock: ffff99506fbaf600/0x441c89a012505003 lrc: 3/0,0 mode: PR/PR res: [0x200000007:0x1:0x0].0x0 bits 0x13/0x0 rrc: 3 type: IBT gid 0 flags: 0x60200400000020 nid: 192.168.201.3@tcp remote: 0x2f77f56113783f9a expref: 10 pid: 13715 timeout: 0 lvb_type: 0 lru_score: 0 lru_type: 0 [ 660.564786] Lustre: mdt00_000: service thread pid 6489 completed after 112.595s. This likely indicates the system was overloaded (too many service threads, or not enough hardware resources). [ 665.058933] Lustre: 8369:0:(client.c:2478:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1773668757/real 1773668757] req@ffff995076463100 x1859825915091712/t0(0) o104->lustre-OST0001@192.168.201.3@tcp:15/16 lens 328/224 e 0 to 1 dl 1773668773 ref 1 fl Rpc:XQr/2/ffffffff rc 0/-1 job:'' uid:4294967295 gid:4294967295 projid:4294967295 [ 665.093667] Lustre: 8369:0:(client.c:2478:ptlrpc_expire_one_request()) Skipped 4 previous similar messages [ 665.101856] LustreError: 8369:0:(ldlm_lockd.c:727:ldlm_handle_ast_error()) ### client (nid 192.168.201.3@tcp) failed to reply to blocking AST (req@000000002bc82cbd x1859825915091712 status 0 rc -110), evict it ns: filter-lustre-OST0001_UUID lock: 000000002aaff569/0x441c89a012504fcb lrc: 4/0,0 mode: PR/PR res: [0x2c0000401:0x4:0x0].0x0 rrc: 3 type: EXT [0->18446744073709551615] (req 0->18446744073709551615) gid 0 flags: 0x60000400010020 nid: 192.168.201.3@tcp remote: 0x2f77f56113783f93 expref: 6 pid: 8660 timeout: 749 lvb_type: 1 lru_score: 0 lru_type: 0 [ 665.142689] LustreError: lustre-OST0001: A client on nid 192.168.201.3@tcp was evicted due to a lock blocking callback time out: rc -110 [ 665.163558] LustreError: 6479:0:(ldlm_lockd.c:255:expired_lock_main()) ### lock callback timer expired after 16s: evicting client at 192.168.201.3@tcp ns: filter-lustre-OST0001_UUID lock: 000000002aaff569/0x441c89a012504fcb lrc: 3/0,0 mode: PR/PR res: [0x2c0000401:0x4:0x0].0x0 rrc: 3 type: EXT [0->18446744073709551615] (req 0->18446744073709551615) gid 0 flags: 0x60000400010020 nid: 192.168.201.3@tcp remote: 0x2f77f56113783f93 expref: 7 pid: 8660 timeout: 0 lvb_type: 1 lru_score: 0 lru_type: 0 [ 665.210698] Lustre: ll_ost00_000: service thread pid 8369 completed after 112.777s. This likely indicates the system was overloaded (too many service threads, or not enough hardware resources). [ 668.183397] Lustre: DEBUG MARKER: == recovery-small test 10b: re-send BL AST =============== 09:46:15 (1773668775) [ 691.203575] Lustre: DEBUG MARKER: == recovery-small test 10c: re-send BL AST vs reconnect race (LU-5569) ========================================================== 09:46:37 (1773668797) [ 692.544960] Lustre: lustre-MDT0000: Client 22349dc6-b7f5-4900-94db-a78e37956fa6 (at 192.168.201.3@tcp) reconnecting [ 692.551048] Lustre: Skipped 2 previous similar messages [ 698.083901] Lustre: DEBUG MARKER: == recovery-small test 10d: test failed blocking ast ===== 09:46:44 (1773668804) [ 702.321525] LustreError: 8660:0:(ldlm_lockd.c:727:ldlm_handle_ast_error()) ### client (nid 192.168.201.3@tcp) returned error from blocking AST (req@00000000dcf2a820 x1859825915160704 status -71 rc -71), evict it ns: filter-lustre-OST0000_UUID lock: 00000000a7a1f081/0x441c89a0125053a6 lrc: 4/0,0 mode: PW/PW res: [0x280000401:0x7:0x0].0x0 rrc: 3 type: EXT [0->18446744073709551615] (req 0->18446744073709551615) gid 0 flags: 0x60000480000020 nid: 192.168.201.3@tcp remote: 0x2f77f5611378418b expref: 5 pid: 8371 timeout: 802 lvb_type: 0 lru_score: 0 lru_type: 0 [ 702.349349] LustreError: lustre-OST0000: A client on nid 192.168.201.3@tcp was evicted due to a lock blocking callback time out: rc -71 [ 702.359812] LustreError: 6479:0:(ldlm_lockd.c:255:expired_lock_main()) ### lock callback timer expired after 0s: evicting client at 192.168.201.3@tcp ns: filter-lustre-OST0000_UUID lock: 00000000a7a1f081/0x441c89a0125053a6 lrc: 3/0,0 mode: PW/PW res: [0x280000401:0x7:0x0].0x0 rrc: 3 type: EXT [0->18446744073709551615] (req 0->18446744073709551615) gid 0 flags: 0x60000480000020 nid: 192.168.201.3@tcp remote: 0x2f77f5611378418b expref: 6 pid: 8371 timeout: 0 lvb_type: 0 lru_score: 0 lru_type: 0 [ 710.445772] Lustre: DEBUG MARKER: == recovery-small test 10e: re-send BL AST vs reconnect race 2 ========================================================== 09:46:57 (1773668817) [ 711.771969] Lustre: DEBUG MARKER: SKIP: recovery-small test_10e need two clients [ 713.630794] Lustre: DEBUG MARKER: == recovery-small test 11: wake up a thread waiting for completion after eviction (b=2460) ========================================================== 09:47:00 (1773668820) [ 730.591455] Lustre: 8369:0:(client.c:2478:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1773668822/real 1773668822] req@ffff994f46036300 x1859825915166208/t0(0) o104->lustre-OST0000@192.168.201.3@tcp:15/16 lens 328/224 e 0 to 1 dl 1773668838 ref 1 fl Rpc:XQr/0/ffffffff rc 0/-1 job:'' uid:4294967295 gid:4294967295 projid:4294967295 [ 730.642378] Lustre: 8369:0:(client.c:2478:ptlrpc_expire_one_request()) Skipped 1 previous similar message [ 737.854124] Lustre: DEBUG MARKER: == recovery-small test 12: recover from timed out resend in ptlrpcd (b=2494) ========================================================== 09:47:24 (1773668844) [ 738.597386] Lustre: *** cfs_fail_loc=115, val=2147483648*** [ 781.167570] Lustre: DEBUG MARKER: == recovery-small test 13: mdc_readpage restart test (bug 1138) ========================================================== 09:48:08 (1773668888) [ 805.725494] Lustre: DEBUG MARKER: == recovery-small test 14: mdc_readpage resend test (bug 1138) ========================================================== 09:48:32 (1773668912) [ 806.713172] Lustre: *** cfs_fail_loc=106, val=0*** [ 806.726479] Lustre: Skipped 1 previous similar message [ 812.778330] Lustre: DEBUG MARKER: == recovery-small test 15: failed open (-ENOMEM) ========= 09:48:39 (1773668919) [ 813.625622] Lustre: *** cfs_fail_loc=128, val=0*** [ 819.696798] Lustre: DEBUG MARKER: == recovery-small test 16: timeout bulk put, don't evict client (2732) ========================================================== 09:48:46 (1773668926) [ 821.338229] Lustre: *** cfs_fail_loc=504, val=0*** [ 821.345330] LustreError: 8377:0:(ldlm_lib.c:3620:target_bulk_io()) @@@ truncated bulk READ 0(102400) req@ffff994f45abca80 x1859825904529536/t0(0) o3->e4121254-1881-4044-b64c-77d9b653b80f@192.168.201.3@tcp:290/0 lens 488/440 e 0 to 0 dl 1773668940 ref 1 fl Interpret:/600/0 rc 0/0 job:'cmp.0' uid:0 gid:0 projid:0 [ 821.374991] Lustre: lustre-OST0000: Bulk IO read error with e4121254-1881-4044-b64c-77d9b653b80f (at 192.168.201.3@tcp), client will retry: rc -110 [ 864.270934] Lustre: DEBUG MARKER: == recovery-small test 17a: timeout bulk get, don't evict client (2732) ========================================================== 09:49:31 (1773668971) [ 913.884985] Lustre: DEBUG MARKER: == recovery-small test 17b: timeout bulk get, dont evict client (3582) ========================================================== 09:50:20 (1773669020) [ 914.779600] Lustre: DEBUG MARKER: SKIP: recovery-small test_17b Needs multiple clients [ 916.082670] Lustre: DEBUG MARKER: == recovery-small test 18a: manual ost invalidate clears page cache immediately ========================================================== 09:50:23 (1773669023) [ 921.312760] Lustre: DEBUG MARKER: == recovery-small test 18b: eviction and reconnect clears page cache (2766) ========================================================== 09:50:28 (1773669028) [ 922.597712] Lustre: 20212:0:(genops.c:1793:obd_export_evict_by_uuid()) lustre-OST0000: evicting e4121254-1881-4044-b64c-77d9b653b80f at adminstrative request [ 949.948434] Lustre: DEBUG MARKER: == recovery-small test 18c: Dropped connect reply after eviction handing (14755) ========================================================== 09:50:56 (1773669056) [ 951.399493] Lustre: 20478:0:(genops.c:1793:obd_export_evict_by_uuid()) lustre-OST0000: evicting e4121254-1881-4044-b64c-77d9b653b80f at adminstrative request [ 953.081950] Lustre: *** cfs_fail_loc=225, val=0*** [ 953.084456] Lustre: Skipped 1 previous similar message [ 963.241000] Lustre: lustre-OST0000: Client e4121254-1881-4044-b64c-77d9b653b80f (at 192.168.201.3@tcp) reconnecting [ 963.248042] Lustre: Skipped 4 previous similar messages [ 969.941614] Lustre: DEBUG MARKER: == recovery-small test 19a: test expired_lock_main on mds (2867) ========================================================== 09:51:17 (1773669077) [ 971.443285] Lustre: *** cfs_fail_loc=304, val=0*** [ 987.825186] Lustre: *** cfs_fail_loc=304, val=0*** [ 1003.183177] Lustre: *** cfs_fail_loc=304, val=0*** [ 1012.703167] Lustre: mdt00_005: service thread pid 13715 was inactive for 41.268 seconds. The thread might be hung, or it might only be slow and will resume later. Dumping the stack trace for debugging purposes: [ 1012.713347] task:mdt00_005 state:I stack:0 pid:13715 ppid:2 flags:0x80004000 [ 1012.719662] Call Trace: [ 1012.725894] __schedule+0x351/0xcb0 [ 1012.727951] schedule+0xc0/0x180 [ 1012.730804] schedule_timeout+0xb4/0x190 [ 1012.734331] ? __next_timer_interrupt+0x160/0x160 [ 1012.738329] ? do_raw_spin_unlock+0x75/0x190 [ 1012.740563] ldlm_completion_ast+0xc26/0x12b0 [ptlrpc] [ 1012.743777] ? woken_wake_function+0x30/0x30 [ 1012.751232] ldlm_cli_enqueue_local+0x601/0xbd0 [ptlrpc] [ 1012.760207] ? ldlm_cli_enqueue_local+0xbd0/0xbd0 [ptlrpc] [ 1012.765225] ? mdt_object_put+0x130/0x130 [mdt] [ 1012.767692] mdt_object_lock_internal+0x20b/0x5a0 [mdt] [ 1012.770161] ? mdt_object_put+0x130/0x130 [mdt] [ 1012.773058] ? ldlm_cli_enqueue_local+0xbd0/0xbd0 [ptlrpc] [ 1012.776273] mdt_object_lock+0x9e/0x240 [mdt] [ 1012.779034] mdt_object_stripes_lock+0x28b/0x670 [mdt] [ 1012.781525] mdt_reint_setattr+0xf58/0x1f90 [mdt] [ 1012.783471] mdt_reint_rec+0x139/0x2b0 [mdt] [ 1012.785857] mdt_reint_internal+0x6a0/0xdc0 [mdt] [ 1012.789324] mdt_reint+0x163/0x190 [mdt] [ 1012.790980] tgt_handle_request0+0x137/0xaf0 [ptlrpc] [ 1012.799741] tgt_request_handle+0x573/0x1e70 [ptlrpc] [ 1012.803993] ptlrpc_server_handle_request+0x443/0x13b0 [ptlrpc] [ 1012.808158] ? lprocfs_counter_add+0x15b/0x210 [obdclass] [ 1012.811160] ptlrpc_main+0xce8/0x1400 [ptlrpc] [ 1012.814201] ? ptlrpc_wait_event+0x690/0x690 [ptlrpc] [ 1012.817159] kthread+0x1d1/0x200 [ 1012.819014] ? set_kthread_struct+0x70/0x70 [ 1012.821230] ret_from_fork+0x1f/0x30 [ 1019.597330] Lustre: *** cfs_fail_loc=304, val=0*** [ 1035.951303] Lustre: *** cfs_fail_loc=304, val=0*** [ 1051.307118] Lustre: *** cfs_fail_loc=304, val=0*** [ 1067.705244] Lustre: *** cfs_fail_loc=304, val=0*** [ 1074.146029] LustreError: 6479:0:(ldlm_lockd.c:255:expired_lock_main()) ### lock callback timer expired after 103s: evicting client at 192.168.201.3@tcp ns: mdt-lustre-MDT0000_UUID lock: ffff994f44b2c600/0x441c89a012505c9e lrc: 3/0,0 mode: PR/PR res: [0x200000007:0x1:0x0].0x0 bits 0x13/0x0 rrc: 4 type: IBT gid 0 flags: 0x60200400000020 nid: 192.168.201.3@tcp remote: 0x2f77f56113784463 expref: 17 pid: 9879 timeout: 1071 lvb_type: 0 lru_score: 0 lru_type: 0 [ 1074.216666] Lustre: mdt00_005: service thread pid 13715 completed after 102.781s. This likely indicates the system was overloaded (too many service threads, or not enough hardware resources). [ 1093.236339] Lustre: DEBUG MARKER: == recovery-small test 19b: test expired_lock_main on ost (2867) ========================================================== 09:53:18 (1773669198) [ 1114.332436] Lustre: *** cfs_fail_loc=304, val=0*** [ 1114.342141] Lustre: Skipped 1 previous similar message [ 1179.900634] Lustre: *** cfs_fail_loc=304, val=0*** [ 1179.903795] Lustre: Skipped 3 previous similar messages [ 1201.120350] LustreError: 6479:0:(ldlm_lockd.c:255:expired_lock_main()) ### lock callback timer expired after 103s: evicting client at 192.168.201.3@tcp ns: filter-lustre-OST0001_UUID lock: 0000000052b4db1b/0x441c89a012505f76 lrc: 3/0,0 mode: PW/PW res: [0x2c0000401:0xc:0x0].0x0 rrc: 3 type: EXT [0->18446744073709551615] (req 0->4095) gid 0 flags: 0x60000400000020 nid: 192.168.201.3@tcp remote: 0x2f77f56113784600 expref: 6 pid: 8371 timeout: 1198 lvb_type: 0 lru_score: 0 lru_type: 0 [ 1215.902469] Lustre: DEBUG MARKER: == recovery-small test 19c: check reconnect and lock resend do not trigger expired_lock_main ========================================================== 09:55:22 (1773669322) [ 1237.034530] Lustre: DEBUG MARKER: == recovery-small test 20a: ldlm_handle_enqueue error (should return error) ========================================================== 09:55:43 (1773669343) [ 1246.172484] Lustre: DEBUG MARKER: == recovery-small test 20b: ldlm_handle_enqueue error (should return error) ========================================================== 09:55:52 (1773669352) [ 1256.005304] Lustre: DEBUG MARKER: == recovery-small test 21a: drop close request while close and open are both in flight ========================================================== 09:56:02 (1773669362) [ 1257.301363] LustreError: 6489:0:(mdt_open.c:1464:mdt_reint_open()) cfs_fail_timeout id 129 sleeping for 5000ms [ 1259.193507] LustreError: 6489:0:(mdt_open.c:1464:mdt_reint_open()) cfs_fail_timeout interrupted [ 1260.220250] Lustre: *** cfs_fail_loc=115, val=2147483648*** [ 1287.657623] Lustre: DEBUG MARKER: == recovery-small test 21b: drop open request while close and open are both in flight ========================================================== 09:56:33 (1773669393) [ 1471.995087] Lustre: DEBUG MARKER: == recovery-small test 21c: drop both request while close and open are both in flight ========================================================== 09:59:38 (1773669578) [ 1494.186803] Lustre: lustre-MDT0000: Client e4121254-1881-4044-b64c-77d9b653b80f (at 192.168.201.3@tcp) reconnecting [ 1494.204330] Lustre: Skipped 14 previous similar messages [ 1504.689433] Lustre: DEBUG MARKER: == recovery-small test 21d: drop close reply while close and open are both in flight ========================================================== 10:00:10 (1773669610) [ 1506.448836] LustreError: 6490:0:(mdt_open.c:1464:mdt_reint_open()) cfs_fail_timeout id 129 sleeping for 5000ms [ 1508.431319] LustreError: 6490:0:(mdt_open.c:1464:mdt_reint_open()) cfs_fail_timeout interrupted [ 1509.354915] Lustre: *** cfs_fail_loc=122, val=2147483648*** [ 1509.367882] LustreError: 6493:0:(ldlm_lib.c:3325:target_send_reply_msg()) @@@ dropping reply req@ffff99507f684a80 x1859825904700416/t4294967543(0) o35->e4121254-1881-4044-b64c-77d9b653b80f@192.168.201.3@tcp:223/0 lens 392/456 e 0 to 0 dl 1773669628 ref 1 fl Interpret:/600/0 rc 0/0 job:'multiop.0' uid:0 gid:0 projid:0 [ 1509.410781] LustreError: 6493:0:(ldlm_lib.c:3325:target_send_reply_msg()) Skipped 1 previous similar message [ 1525.952717] Lustre: 21750:0:(mdt_recovery.c:102:mdt_req_from_lrd()) @@@ restoring transno req@ffff99507ff4e300 x1859825904700416/t4294967543(0) o35->e4121254-1881-4044-b64c-77d9b653b80f@192.168.201.3@tcp:240/0 lens 392/456 e 0 to 0 dl 1773669645 ref 1 fl Interpret:/602/0 rc 0/0 job:'multiop.0' uid:0 gid:0 projid:0 [ 1536.319183] Lustre: DEBUG MARKER: == recovery-small test 21e: drop open reply while close and open are both in flight ========================================================== 10:00:42 (1773669642) [ 1537.887410] Lustre: *** cfs_fail_loc=119, val=2147483648*** [ 1537.895823] LustreError: 13715:0:(ldlm_lib.c:3325:target_send_reply_msg()) @@@ dropping reply req@ffff99507fe31f80 x1859825904711808/t4294967560(0) o36->e4121254-1881-4044-b64c-77d9b653b80f@192.168.201.3@tcp:409/0 lens 488/456 e 0 to 0 dl 1773669814 ref 1 fl Interpret:/200/0 rc 0/0 job:'touch.0' uid:0 gid:0 projid:4294967295 [ 1710.306131] Lustre: 13715:0:(mdt_recovery.c:102:mdt_req_from_lrd()) @@@ restoring transno req@ffff994f45d30700 x1859825904711808/t4294967560(0) o36->e4121254-1881-4044-b64c-77d9b653b80f@192.168.201.3@tcp:581/0 lens 488/3152 e 0 to 0 dl 1773669986 ref 1 fl Interpret:/202/0 rc 0/0 job:'touch.0' uid:0 gid:0 projid:4294967295 [ 1712.365047] Lustre: DEBUG MARKER: == recovery-small test 21f: drop both reply while close and open are both in flight ========================================================== 10:03:38 (1773669818) [ 1713.508430] Lustre: *** cfs_fail_loc=119, val=2147483648*** [ 1713.521486] LustreError: 6491:0:(ldlm_lib.c:3325:target_send_reply_msg()) @@@ dropping reply req@ffff99507fe80a80 x1859825904739328/t4294967579(0) o36->e4121254-1881-4044-b64c-77d9b653b80f@192.168.201.3@tcp:427/0 lens 488/456 e 0 to 0 dl 1773669832 ref 1 fl Interpret:/200/0 rc 0/0 job:'touch.0' uid:0 gid:0 projid:4294967295 [ 1729.719340] Lustre: 6493:0:(mdt_recovery.c:102:mdt_req_from_lrd()) @@@ restoring transno req@ffff994f460f7480 x1859825904740224/t4294967580(0) o35->e4121254-1881-4044-b64c-77d9b653b80f@192.168.201.3@tcp:443/0 lens 392/456 e 0 to 0 dl 1773669848 ref 1 fl Interpret:/602/0 rc 0/0 job:'multiop.0' uid:0 gid:0 projid:0 [ 1729.750096] Lustre: 6493:0:(mdt_recovery.c:102:mdt_req_from_lrd()) Skipped 1 previous similar message [ 1737.664225] Lustre: DEBUG MARKER: == recovery-small test 21g: drop open reply and close request while close and open are both in flight ========================================================== 10:04:04 (1773669844) [ 1738.723569] LustreError: 13715:0:(ldlm_lib.c:3325:target_send_reply_msg()) @@@ dropping reply req@ffff994f450df100 x1859825904750720/t4294967598(0) o36->e4121254-1881-4044-b64c-77d9b653b80f@192.168.201.3@tcp:452/0 lens 488/456 e 0 to 0 dl 1773669857 ref 1 fl Interpret:/200/0 rc 0/0 job:'touch.0' uid:0 gid:0 projid:4294967295 [ 1738.747339] LustreError: 13715:0:(ldlm_lib.c:3325:target_send_reply_msg()) Skipped 1 previous similar message [ 1741.177149] Lustre: *** cfs_fail_loc=115, val=2147483648*** [ 1741.191279] Lustre: Skipped 3 previous similar messages [ 1754.293733] Lustre: 13715:0:(mdt_recovery.c:102:mdt_req_from_lrd()) @@@ restoring transno req@ffff995076462d80 x1859825904750720/t4294967598(0) o36->e4121254-1881-4044-b64c-77d9b653b80f@192.168.201.3@tcp:468/0 lens 488/3152 e 0 to 0 dl 1773669873 ref 1 fl Interpret:/202/0 rc 0/0 job:'touch.0' uid:0 gid:0 projid:4294967295 [ 1760.887058] Lustre: DEBUG MARKER: == recovery-small test 21h: drop open request and close reply while close and open are both in flight ========================================================== 10:04:27 (1773669867) [ 1764.225442] Lustre: *** cfs_fail_loc=122, val=2147483648*** [ 1764.227622] Lustre: Skipped 2 previous similar messages [ 1783.331968] Lustre: DEBUG MARKER: == recovery-small test 22: drop close request and do mknod ========================================================== 10:04:50 (1773669890) [ 1804.325192] Lustre: DEBUG MARKER: == recovery-small test 23: client hang when close a file after mds crash ========================================================== 10:05:11 (1773669911) [ 1811.293743] Lustre: Failing over lustre-MDT0000 [ 1811.754749] Lustre: server umount lustre-MDT0000 complete [ 1811.938626] Lustre: lustre-MDT0000-lwp-OST0000: Connection to lustre-MDT0000 (at 0@lo) was lost; in progress operations using this service will wait for recovery to complete [ 1811.945287] Lustre: Skipped 3 previous similar messages [ 1817.057800] LustreError: 7931:0:(ldlm_lib.c:1178:target_handle_connect()) lustre-MDT0000: not available for connect from 0@lo (no target). If you are running an HA pair check that the target is mounted on the other server. [ 1817.068330] LustreError: 7931:0:(ldlm_lib.c:1178:target_handle_connect()) Skipped 9 previous similar messages [ 1820.838350] LustreError: 6491:0:(ldlm_lib.c:1178:target_handle_connect()) lustre-MDT0000: not available for connect from 192.168.201.3@tcp (no target). If you are running an HA pair check that the target is mounted on the other server. [ 1822.178217] LustreError: 6494:0:(ldlm_lib.c:1178:target_handle_connect()) lustre-MDT0000: not available for connect from 0@lo (no target). If you are running an HA pair check that the target is mounted on the other server. [ 1822.186903] LustreError: 6494:0:(ldlm_lib.c:1178:target_handle_connect()) Skipped 3 previous similar messages [ 1825.956986] LustreError: 9879:0:(ldlm_lib.c:1178:target_handle_connect()) lustre-MDT0000: not available for connect from 192.168.201.3@tcp (no target). If you are running an HA pair check that the target is mounted on the other server. [ 1826.123122] LDISKFS-fs (dm-0): mounted filesystem with ordered data mode. Opts: user_xattr,errors=remount-ro,no_mbcache,nodelalloc [ 1826.211731] LustreError: MGC192.168.201.103@tcp: Connection to MGS (at 0@lo) was lost; in progress operations using this service will fail [ 1826.402389] Lustre: lustre-MDT0000: Imperative Recovery not enabled, recovery window 60-180 [ 1826.435946] Lustre: lustre-MDT0000: in recovery but waiting for the first client to connect [ 1828.358098] Lustre: DEBUG MARKER: oleg103-server.virtnet: executing set_default_debug -1 all [ 1831.073813] Lustre: lustre-MDT0000: Will be in recovery for at least 1:00, or until 2 clients reconnect [ 1831.914351] Lustre: lustre-MDT0000-lwp-MDT0001: Connection restored to 0@lo (at 0@lo) [ 1831.929739] Lustre: lustre-MDT0000: Recovery over after 0:01, of 2 clients 2 recovered and 0 were evicted. [ 1831.957272] Lustre: lustre-OST0000: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x280000401:23 to 0x280000401:65) [ 1831.957274] Lustre: lustre-OST0001: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x2c0000401:21 to 0x2c0000401:65) [ 1833.890505] Lustre: DEBUG MARKER: oleg103-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid [ 1834.818562] Lustre: DEBUG MARKER: mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec [ 1839.749635] Lustre: DEBUG MARKER: == recovery-small test 24a: fsync error (should return error) ========================================================== 10:05:47 (1773669947) [ 1840.480062] Lustre: 27053:0:(genops.c:1793:obd_export_evict_by_uuid()) lustre-OST0000: evicting e4121254-1881-4044-b64c-77d9b653b80f at adminstrative request [ 1843.767279] Lustre: DEBUG MARKER: == recovery-small test 24b: test dirty page discard due to client eviction ========================================================== 10:05:51 (1773669951) [ 1844.602401] Lustre: 27296:0:(genops.c:1793:obd_export_evict_by_uuid()) lustre-OST0000: evicting e4121254-1881-4044-b64c-77d9b653b80f at adminstrative request [ 1848.865152] Lustre: DEBUG MARKER: == recovery-small test 26a: evict dead exports =========== 10:05:55 (1773669955) [ 1850.114341] Lustre: DEBUG MARKER: SKIP: recovery-small test_26a msg and ost1 are at the same node [ 1851.144801] Lustre: DEBUG MARKER: == recovery-small test 26b: evict dead exports =========== 10:05:58 (1773669958) [ 1852.135289] Lustre: DEBUG MARKER: SKIP: recovery-small test_26b msg and ost1 are at the same node [ 1853.010325] Lustre: DEBUG MARKER: == recovery-small test 27: fail LOV while using OSC's ==== 10:06:00 (1773669960) [ 1855.168230] Lustre: Failing over lustre-MDT0000 [ 1855.495464] Lustre: server umount lustre-MDT0000 complete [ 1856.674858] LustreError: 6490:0:(ldlm_lib.c:1178:target_handle_connect()) lustre-MDT0000: not available for connect from 192.168.201.3@tcp (no target). If you are running an HA pair check that the target is mounted on the other server. [ 1857.505898] LustreError: lustre-MDT0000-osp-MDT0001: operation mds_statfs to node 0@lo failed: rc = -107 [ 1857.506403] Lustre: lustre-MDT0000-osp-MDT0001: Connection to lustre-MDT0000 (at 0@lo) was lost; in progress operations using this service will wait for recovery to complete [ 1857.527803] Lustre: Skipped 2 previous similar messages [ 1866.914974] LustreError: 6489:0:(ldlm_lib.c:1178:target_handle_connect()) lustre-MDT0000: not available for connect from 192.168.201.3@tcp (no target). If you are running an HA pair check that the target is mounted on the other server. [ 1866.925309] LustreError: 6489:0:(ldlm_lib.c:1178:target_handle_connect()) Skipped 9 previous similar messages [ 1869.691659] LDISKFS-fs (dm-0): mounted filesystem with ordered data mode. Opts: user_xattr,errors=remount-ro,no_mbcache,nodelalloc [ 1869.776644] LustreError: MGC192.168.201.103@tcp: Connection to MGS (at 0@lo) was lost; in progress operations using this service will fail [ 1870.021944] Lustre: lustre-MDT0000: Imperative Recovery not enabled, recovery window 60-180 [ 1870.056434] Lustre: lustre-MDT0000: in recovery but waiting for the first client to connect [ 1871.834807] Lustre: DEBUG MARKER: oleg103-server.virtnet: executing set_default_debug -1 all [ 1872.040451] Lustre: lustre-MDT0000: Will be in recovery for at least 1:00, or until 2 clients reconnect [ 1875.429901] Lustre: lustre-MDT0000-lwp-OST0001: Connection restored to 0@lo (at 0@lo) [ 1875.433645] Lustre: Skipped 3 previous similar messages [ 1875.445888] Lustre: lustre-MDT0000: Recovery over after 0:03, of 2 clients 2 recovered and 0 were evicted. [ 1875.469197] Lustre: lustre-OST0000: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x280000401:101 to 0x280000401:129) [ 1875.469872] Lustre: lustre-OST0001: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x2c0000401:100 to 0x2c0000401:129) [ 1965.983022] Lustre: Failing over lustre-MDT0000 [ 1966.128805] LustreError: 3635:0:(client.c:1380:ptlrpc_import_delay_req()) @@@ IMP_CLOSED req@ffff99507fe31180 x1859825916462080/t0(0) o6->lustre-OST0000-osc-MDT0000@0@lo:28/4 lens 544/432 e 0 to 0 dl 0 ref 1 fl Rpc:QU/200/ffffffff rc 0/-1 job:'osp-syn-0-0.0' uid:0 gid:0 projid:4294967295 [ 1966.273309] Lustre: server umount lustre-MDT0000 complete [ 1967.585343] LustreError: lustre-MDT0000-osp-MDT0001: operation mds_statfs to node 0@lo failed: rc = -107 [ 1967.585581] Lustre: lustre-MDT0000-osp-MDT0001: Connection to lustre-MDT0000 (at 0@lo) was lost; in progress operations using this service will wait for recovery to complete [ 1967.589506] LustreError: 11311:0:(ldlm_lib.c:1178:target_handle_connect()) lustre-MDT0000: not available for connect from 0@lo (no target). If you are running an HA pair check that the target is mounted on the other server. [ 1967.589507] LustreError: 6489:0:(ldlm_lib.c:1178:target_handle_connect()) lustre-MDT0000: not available for connect from 0@lo (no target). If you are running an HA pair check that the target is mounted on the other server. [ 1967.589519] LustreError: 6489:0:(ldlm_lib.c:1178:target_handle_connect()) Skipped 4 previous similar messages [ 1967.594387] Lustre: Skipped 3 previous similar messages [ 1979.657415] LDISKFS-fs (dm-0): mounted filesystem with ordered data mode. Opts: user_xattr,errors=remount-ro,no_mbcache,nodelalloc [ 1979.708827] LustreError: MGC192.168.201.103@tcp: Connection to MGS (at 0@lo) was lost; in progress operations using this service will fail [ 1979.950684] Lustre: lustre-MDT0000: Imperative Recovery not enabled, recovery window 60-180 [ 1980.167625] Lustre: lustre-MDT0000: in recovery but waiting for the first client to connect [ 1982.045601] Lustre: DEBUG MARKER: oleg103-server.virtnet: executing set_default_debug -1 all [ 1984.676415] Lustre: lustre-MDT0000: Will be in recovery for at least 1:00, or until 2 clients reconnect [ 1985.000067] Lustre: lustre-MDT0000-lwp-MDT0001: Connection restored to 0@lo (at 0@lo) [ 1985.002881] Lustre: Skipped 3 previous similar messages [ 1985.017683] Lustre: lustre-MDT0000: Recovery over after 0:01, of 2 clients 2 recovered and 0 were evicted. [ 1985.043899] Lustre: lustre-OST0001: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x2c0000401:1867 to 0x2c0000401:1889) [ 1985.043917] Lustre: lustre-OST0000: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x280000401:1866 to 0x280000401:1889) [ 1987.913981] Lustre: DEBUG MARKER: == recovery-small test 28: handle error adding new clients (bug 6086) ========================================================== 10:08:15 (1773670095) [ 2004.447208] Lustre: 20832:0:(client.c:2478:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1773670096/real 1773670096] req@ffff99507f6e5880 x1859825916546560/t0(0) o104->lustre-MDT0000@192.168.201.3@tcp:15/16 lens 328/224 e 0 to 1 dl 1773670112 ref 1 fl Rpc:XQr/0/ffffffff rc 0/-1 job:'' uid:4294967295 gid:4294967295 projid:4294967295 [ 2007.708956] Lustre: Failing over lustre-MDT0000 [ 2008.055298] Lustre: server umount lustre-MDT0000 complete [ 2010.594039] LustreError: lustre-MDT0000-osp-MDT0001: operation mds_statfs to node 0@lo failed: rc = -107 [ 2010.596120] Lustre: lustre-MDT0000-lwp-MDT0001: Connection to lustre-MDT0000 (at 0@lo) was lost; in progress operations using this service will wait for recovery to complete [ 2010.605878] Lustre: Skipped 3 previous similar messages [ 2022.199913] LDISKFS-fs (dm-0): mounted filesystem with ordered data mode. Opts: user_xattr,errors=remount-ro,no_mbcache,nodelalloc [ 2022.260480] LustreError: MGC192.168.201.103@tcp: Connection to MGS (at 0@lo) was lost; in progress operations using this service will fail [ 2022.351404] Lustre: *** cfs_fail_loc=12f, val=0*** [ 2022.355416] LustreError: 7419:0:(tgt_lastrcvd.c:1090:tgt_client_new()) lustre-MDT0001: no room for 0 clients - fix LR_MAX_CLIENTS [ 2022.363397] LustreError: lustre-MDT0001-osp-MDT0000: operation mds_connect to node 0@lo failed: rc = -75 [ 2022.416187] Lustre: lustre-MDT0000: Imperative Recovery not enabled, recovery window 60-180 [ 2022.436831] Lustre: lustre-MDT0000: in recovery but waiting for the first client to connect [ 2024.081478] Lustre: DEBUG MARKER: oleg103-server.virtnet: executing set_default_debug -1 all [ 2025.635128] Lustre: lustre-MDT0000: Will be in recovery for at least 1:00, or until 2 clients reconnect [ 2027.497918] Lustre: lustre-MDT0000-lwp-OST0000: Connection restored to 0@lo (at 0@lo) [ 2027.500576] Lustre: Skipped 3 previous similar messages [ 2027.519162] Lustre: lustre-MDT0000: Recovery over after 0:02, of 2 clients 2 recovered and 0 were evicted. [ 2027.542394] Lustre: lustre-OST0001: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x2c0000401:1867 to 0x2c0000401:1921) [ 2027.542593] Lustre: lustre-OST0000: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x280000401:1891 to 0x280000401:1921) [ 2029.290166] Lustre: DEBUG MARKER: oleg103-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid [ 2030.063253] Lustre: DEBUG MARKER: mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec [ 2034.725422] Lustre: DEBUG MARKER: == recovery-small test 29a: error adding new clients doesn't cause LBUG (bug 22273) ========================================================== 10:09:01 (1773670141) [ 2036.266594] Lustre: Failing over lustre-MDT0000 [ 2036.719325] Lustre: server umount lustre-MDT0000 complete [ 2037.731925] Lustre: lustre-MDT0000-lwp-MDT0001: Connection to lustre-MDT0000 (at 0@lo) was lost; in progress operations using this service will wait for recovery to complete [ 2037.732729] LustreError: lustre-MDT0000-osp-MDT0001: operation mds_statfs to node 0@lo failed: rc = -107 [ 2037.733977] LustreError: 20832:0:(ldlm_lib.c:1178:target_handle_connect()) lustre-MDT0000: not available for connect from 0@lo (no target). If you are running an HA pair check that the target is mounted on the other server. [ 2037.733989] LustreError: 20832:0:(ldlm_lib.c:1178:target_handle_connect()) Skipped 28 previous similar messages [ 2037.739471] Lustre: Skipped 3 previous similar messages [ 2041.088571] LDISKFS-fs (dm-0): mounted filesystem with ordered data mode. Opts: user_xattr,errors=remount-ro,no_mbcache,nodelalloc [ 2041.188151] LustreError: MGC192.168.201.103@tcp: Connection to MGS (at 0@lo) was lost; in progress operations using this service will fail [ 2041.299994] Lustre: *** cfs_fail_loc=711, val=0*** [ 2041.396743] Lustre: lustre-MDT0000: Imperative Recovery not enabled, recovery window 60-180 [ 2041.431893] Lustre: lustre-MDT0000: in recovery but waiting for the first client to connect [ 2041.432805] Lustre: lustre-MDT0000: Aborting client recovery [ 2041.438531] LustreError: 34634:0:(ldlm_lib.c:2983:target_stop_recovery_thread()) lustre-MDT0000: Aborting recovery [ 2041.444229] Lustre: 34671:0:(ldlm_lib.c:2386:target_recovery_overseer()) recovery is aborted, evict exports in recovery [ 2046.442743] Lustre: lustre-MDT0000-lwp-MDT0001: Connection restored to 0@lo (at 0@lo) [ 2046.446705] Lustre: Skipped 3 previous similar messages [ 2046.461199] Lustre: 34671:0:(genops.c:1620:class_disconnect_stale_exports()) lustre-MDT0000: disconnect stale client e4121254-1881-4044-b64c-77d9b653b80f@192.168.201.3@tcp [ 2046.467356] Lustre: lustre-MDT0000: disconnecting 2 stale clients [ 2046.470147] LustreError: 34671:0:(tgt_grant.c:234:tgt_grant_sanity_check()) mdt_obd_disconnect: tot_granted 0 != fo_tot_granted 2097152 [ 2046.477889] LustreError: 34671:0:(ldlm_lib.c:1916:abort_lock_replay_queue()) @@@ aborted: req@ffff99507fec3800 x1859825908106624/t0(0) o101->e4121254-1881-4044-b64c-77d9b653b80f@192.168.201.3@tcp:44/0 lens 328/0 e 0 to 0 dl 1773670204 ref 1 fl Complete:/240/ffffffff rc 0/-1 job:'ldlm_lock_repla.0' uid:0 gid:0 projid:4294967295 [ 2046.492522] Lustre: 34671:0:(ldlm_lib.c:2386:target_recovery_overseer()) recovery is aborted, evict exports in recovery [ 2046.492699] LustreError: lustre-MDT0000-osp-MDT0001: operation ldlm_enqueue to node 0@lo failed: rc = -107 [ 2046.501525] Lustre: lustre-MDT0000-osd: cancel update llog [0x200000400:0x1:0x0] [ 2046.503689] Lustre: lustre-MDT0000: Denying connection for new client lustre-MDT0001-mdtlov_UUID (at 0@lo), waiting for 2 known clients (0 recovered, 0 in progress, and 2 evicted) already passed deadline 34:06 [ 2046.518264] Lustre: Skipped 1 previous similar message [ 2046.519270] Lustre: lustre-MDT0001-osp-MDT0000: cancel update llog [0x240000401:0x1:0x0] [ 2046.561668] Lustre: lustre-OST0001: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x2c0000401:1867 to 0x2c0000401:1953) [ 2046.562306] Lustre: lustre-OST0000: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x280000401:1891 to 0x280000401:1953) [ 2048.484649] Lustre: DEBUG MARKER: oleg103-server.virtnet: executing set_default_debug -1 all [ 2051.557148] LustreError: lustre-MDT0000-osp-MDT0001: This client was evicted by lustre-MDT0000; in progress operations using this service will fail. [ 2062.796126] Lustre: DEBUG MARKER: oleg103-server.virtnet: executing wait_import_state FULL os[cp].lustre-OST0000-osc-MDT0000.ost_server_uuid 50 [ 2062.924068] Lustre: DEBUG MARKER: os[cp].lustre-OST0000-osc-MDT0000.ost_server_uuid in FULL state after 0 sec [ 2066.038892] Lustre: DEBUG MARKER: == recovery-small test 29b: error adding new clients doesn't cause LBUG (bug 22273) ========================================================== 10:09:33 (1773670173) [ 2067.444648] Lustre: Failing over lustre-OST0000 [ 2067.529424] Lustre: server umount lustre-OST0000 complete [ 2068.960339] LustreError: lustre-OST0000-osc-MDT0001: operation ost_statfs to node 0@lo failed: rc = -107 [ 2068.963912] Lustre: lustre-OST0000-osc-MDT0001: Connection to lustre-OST0000 (at 0@lo) was lost; in progress operations using this service will wait for recovery to complete [ 2068.970676] Lustre: Skipped 1 previous similar message [ 2071.934224] LDISKFS-fs (dm-2): mounted filesystem with ordered data mode. Opts: errors=remount-ro,no_mbcache,nodelalloc [ 2072.076035] Lustre: lustre-OST0000: Imperative Recovery not enabled, recovery window 60-180 [ 2072.087079] Lustre: lustre-OST0000: in recovery but waiting for the first client to connect [ 2072.087234] Lustre: lustre-OST0000: Aborting recovery [ 2072.089402] Lustre: Skipped 2 previous similar messages [ 2072.094449] LustreError: 36198:0:(ldlm_lib.c:2983:target_stop_recovery_thread()) lustre-OST0000: Aborting recovery [ 2072.098660] Lustre: 36215:0:(ldlm_lib.c:2386:target_recovery_overseer()) recovery is aborted, evict exports in recovery [ 2072.102781] Lustre: 36215:0:(ldlm_lib.c:2386:target_recovery_overseer()) Skipped 1 previous similar message [ 2072.106152] Lustre: 36215:0:(genops.c:1620:class_disconnect_stale_exports()) lustre-OST0000: disconnect stale client lustre-MDT0000-mdtlov_UUID@ [ 2072.111599] Lustre: 36215:0:(genops.c:1620:class_disconnect_stale_exports()) Skipped 1 previous similar message [ 2072.115248] Lustre: lustre-OST0000: disconnecting 3 stale clients [ 2072.120698] LustreError: 36215:0:(ofd_obd.c:1325:ofd_iocontrol()) lustre-OST0000: iocontrol from 'tgt_recover_0' cmd=c00866c1 _IOWR('f', 193, 8) unrecognized: rc = -25 [ 2073.473445] Lustre: *** cfs_fail_loc=711, val=0*** [ 2073.846482] LustreError: lustre-OST0000-osc-MDT0000: This client was evicted by lustre-OST0000; in progress operations using this service will fail. [ 2073.855023] LustreError: lustre-OST0000-osc-MDT0001: This client was evicted by lustre-OST0000; in progress operations using this service will fail. [ 2073.855286] Lustre: lustre-OST0000-osc-MDT0000: Connection restored to 0@lo (at 0@lo) [ 2073.868142] Lustre: Skipped 4 previous similar messages [ 2074.882299] Lustre: DEBUG MARKER: oleg103-server.virtnet: executing set_default_debug -1 all [ 2089.427058] Lustre: DEBUG MARKER: == recovery-small test 50: failover MDS under load ======= 10:09:56 (1773670196) [ 2101.094474] Lustre: Failing over lustre-MDT0000 [ 2101.310510] Lustre: server umount lustre-MDT0000 complete [ 2102.755568] Lustre: lustre-MDT0000-lwp-OST0001: Connection to lustre-MDT0000 (at 0@lo) was lost; in progress operations using this service will wait for recovery to complete [ 2102.756693] LustreError: lustre-MDT0000-osp-MDT0001: operation mds_statfs to node 0@lo failed: rc = -107 [ 2102.763140] Lustre: Skipped 3 previous similar messages [ 2115.035600] LDISKFS-fs (dm-0): mounted filesystem with ordered data mode. Opts: user_xattr,errors=remount-ro,no_mbcache,nodelalloc [ 2115.098617] LustreError: MGC192.168.201.103@tcp: Connection to MGS (at 0@lo) was lost; in progress operations using this service will fail [ 2115.372701] Lustre: lustre-MDT0000: Imperative Recovery not enabled, recovery window 60-180 [ 2115.477982] Lustre: lustre-MDT0000: in recovery but waiting for the first client to connect [ 2115.483086] Lustre: Skipped 2 previous similar messages [ 2117.212556] Lustre: DEBUG MARKER: oleg103-server.virtnet: executing set_default_debug -1 all [ 2117.795106] Lustre: lustre-MDT0000: Will be in recovery for at least 1:00, or until 2 clients reconnect [ 2120.680654] Lustre: lustre-MDT0000-lwp-OST0000: Connection restored to 0@lo (at 0@lo) [ 2120.700837] Lustre: lustre-MDT0000: Recovery over after 0:03, of 2 clients 2 recovered and 0 were evicted. [ 2120.723263] Lustre: lustre-OST0001: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x2c0000401:2162 to 0x2c0000401:2177) [ 2120.723503] Lustre: lustre-OST0000: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x280000401:2163 to 0x280000401:2209) [ 2122.808256] Lustre: DEBUG MARKER: oleg103-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid [ 2123.770456] Lustre: DEBUG MARKER: mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec [ 2186.974192] Lustre: Failing over lustre-MDT0000 [ 2187.167763] Lustre: server umount lustre-MDT0000 complete [ 2187.235758] Lustre: lustre-MDT0000-osp-MDT0001: Connection to lustre-MDT0000 (at 0@lo) was lost; in progress operations using this service will wait for recovery to complete [ 2187.238465] LustreError: lustre-MDT0000-osp-MDT0001: operation mds_statfs to node 0@lo failed: rc = -107 [ 2187.240257] LustreError: 11311:0:(ldlm_lib.c:1178:target_handle_connect()) lustre-MDT0000: not available for connect from 0@lo (no target). If you are running an HA pair check that the target is mounted on the other server. [ 2187.240265] LustreError: 11311:0:(ldlm_lib.c:1178:target_handle_connect()) Skipped 21 previous similar messages [ 2187.245254] Lustre: Skipped 4 previous similar messages [ 2201.114923] LDISKFS-fs (dm-0): mounted filesystem with ordered data mode. Opts: user_xattr,errors=remount-ro,no_mbcache,nodelalloc [ 2201.189151] LustreError: MGC192.168.201.103@tcp: Connection to MGS (at 0@lo) was lost; in progress operations using this service will fail [ 2201.488406] Lustre: lustre-MDT0000: Imperative Recovery not enabled, recovery window 60-180 [ 2201.556394] Lustre: lustre-MDT0000: in recovery but waiting for the first client to connect [ 2203.194146] Lustre: DEBUG MARKER: oleg103-server.virtnet: executing set_default_debug -1 all [ 2204.835146] Lustre: lustre-MDT0000: Will be in recovery for at least 1:00, or until 2 clients reconnect [ 2206.699772] Lustre: lustre-MDT0000-lwp-OST0000: Connection restored to 0@lo (at 0@lo) [ 2206.703109] Lustre: Skipped 3 previous similar messages [ 2206.717840] Lustre: lustre-MDT0000: Recovery over after 0:02, of 2 clients 2 recovered and 0 were evicted. [ 2206.745895] Lustre: lustre-OST0000: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x280000401:3449 to 0x280000401:3489) [ 2206.746515] Lustre: lustre-OST0001: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x2c0000401:3417 to 0x2c0000401:3457) [ 2208.521362] Lustre: DEBUG MARKER: oleg103-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid [ 2209.425858] Lustre: DEBUG MARKER: mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec [ 2272.088095] Lustre: Failing over lustre-MDT0000 [ 2272.337390] Lustre: server umount lustre-MDT0000 complete [ 2273.250927] Lustre: lustre-MDT0000-lwp-MDT0001: Connection to lustre-MDT0000 (at 0@lo) was lost; in progress operations using this service will wait for recovery to complete [ 2273.256209] Lustre: Skipped 3 previous similar messages [ 2285.982395] LDISKFS-fs (dm-0): mounted filesystem with ordered data mode. Opts: user_xattr,errors=remount-ro,no_mbcache,nodelalloc [ 2286.046731] LustreError: MGC192.168.201.103@tcp: Connection to MGS (at 0@lo) was lost; in progress operations using this service will fail [ 2286.366575] Lustre: lustre-MDT0000: Imperative Recovery not enabled, recovery window 60-180 [ 2286.543264] Lustre: lustre-MDT0000: in recovery but waiting for the first client to connect [ 2286.755589] Lustre: lustre-MDT0000: Will be in recovery for at least 1:00, or until 2 clients reconnect [ 2288.617456] Lustre: DEBUG MARKER: oleg103-server.virtnet: executing set_default_debug -1 all [ 2291.686366] Lustre: lustre-MDT0000-lwp-OST0001: Connection restored to 0@lo (at 0@lo) [ 2291.688733] Lustre: Skipped 3 previous similar messages [ 2291.700962] Lustre: lustre-MDT0000: Recovery over after 0:05, of 2 clients 2 recovered and 0 were evicted. [ 2291.724051] Lustre: lustre-OST0000: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x280000401:4738 to 0x280000401:4769) [ 2291.724065] Lustre: lustre-OST0001: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x2c0000401:4706 to 0x2c0000401:4737) [ 2293.527134] Lustre: DEBUG MARKER: oleg103-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid [ 2294.582245] Lustre: DEBUG MARKER: mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec [ 2319.572663] Lustre: DEBUG MARKER: == recovery-small test 51: failover MDS during recovery == 10:13:46 (1773670426) [ 2321.910174] Lustre: Failing over lustre-MDT0000 [ 2322.150834] Lustre: server umount lustre-MDT0000 complete [ 2322.406834] LustreError: lustre-MDT0000-osp-MDT0001: operation mds_statfs to node 0@lo failed: rc = -107 [ 2322.410828] LustreError: Skipped 1 previous similar message [ 2335.955471] LDISKFS-fs (dm-0): mounted filesystem with ordered data mode. Opts: user_xattr,errors=remount-ro,no_mbcache,nodelalloc [ 2337.962063] Lustre: lustre-MDT0000: Will be in recovery for at least 1:00, or until 2 clients reconnect [ 2338.171961] Lustre: DEBUG MARKER: oleg103-server.virtnet: executing set_default_debug -1 all [ 2339.689800] Lustre: DEBUG MARKER: test_51: failover in 1 sec [ 2341.373853] Lustre: lustre-MDT0000: Recovery over after 0:04, of 2 clients 2 recovered and 0 were evicted. [ 2341.392448] Lustre: 6491:0:(mdt_recovery.c:102:mdt_req_from_lrd()) @@@ restoring transno req@ffff995080ee2d80 x1859825914209408/t38654709216(0) o101->e4121254-1881-4044-b64c-77d9b653b80f@192.168.201.3@tcp:339/0 lens 664/3488 e 0 to 0 dl 1773670499 ref 1 fl Interpret:/602/0 rc 0/0 job:'writemany.0' uid:0 gid:0 projid:0 [ 2341.403293] Lustre: lustre-OST0000: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x280000401:5354 to 0x280000401:5377) [ 2341.403349] Lustre: lustre-OST0001: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x2c0000401:5321 to 0x2c0000401:5345) [ 2341.408210] Lustre: 6491:0:(mdt_recovery.c:102:mdt_req_from_lrd()) Skipped 1 previous similar message [ 2341.549809] Lustre: Failing over lustre-MDT0000 [ 2341.808667] Lustre: server umount lustre-MDT0000 complete [ 2356.011254] LDISKFS-fs (dm-0): mounted filesystem with ordered data mode. Opts: user_xattr,errors=remount-ro,no_mbcache,nodelalloc [ 2356.119682] LustreError: MGC192.168.201.103@tcp: Connection to MGS (at 0@lo) was lost; in progress operations using this service will fail [ 2356.127508] LustreError: Skipped 1 previous similar message [ 2358.495943] Lustre: DEBUG MARKER: oleg103-server.virtnet: executing set_default_debug -1 all [ 2360.243960] Lustre: DEBUG MARKER: test_51: failover in 5 sec [ 2361.873903] Lustre: lustre-OST0000: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x280000401:5380 to 0x280000401:5409) [ 2361.874643] Lustre: lustre-OST0001: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x2c0000401:5349 to 0x2c0000401:5377) [ 2366.277921] Lustre: Failing over lustre-MDT0000 [ 2366.394498] LustreError: 3637:0:(client.c:1380:ptlrpc_import_delay_req()) @@@ IMP_CLOSED req@ffff994f451e4380 x1859825918276992/t0(0) o6->lustre-OST0001-osc-MDT0000@0@lo:28/4 lens 544/432 e 0 to 0 dl 0 ref 1 fl Rpc:QU/200/ffffffff rc 0/-1 job:'osp-syn-1-0.0' uid:0 gid:0 projid:4294967295 [ 2366.646984] Lustre: server umount lustre-MDT0000 complete [ 2380.545607] LDISKFS-fs (dm-0): mounted filesystem with ordered data mode. Opts: user_xattr,errors=remount-ro,no_mbcache,nodelalloc [ 2382.871421] Lustre: DEBUG MARKER: oleg103-server.virtnet: executing set_default_debug -1 all [ 2384.454405] Lustre: DEBUG MARKER: test_51: failover in 10 sec [ 2386.450449] Lustre: lustre-OST0000: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x280000401:5506 to 0x280000401:5537) [ 2386.450773] Lustre: lustre-OST0001: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x2c0000401:5473 to 0x2c0000401:5505) [ 2395.739677] Lustre: Failing over lustre-MDT0000 [ 2396.065696] Lustre: server umount lustre-MDT0000 complete [ 2410.235101] LDISKFS-fs (dm-0): mounted filesystem with ordered data mode. Opts: user_xattr,errors=remount-ro,no_mbcache,nodelalloc [ 2412.801462] Lustre: DEBUG MARKER: oleg103-server.virtnet: executing set_default_debug -1 all [ 2414.433877] Lustre: DEBUG MARKER: test_51: failover in 20 sec [ 2414.755380] Lustre: lustre-MDT0000: Will be in recovery for at least 1:00, or until 2 clients reconnect [ 2414.758828] Lustre: Skipped 2 previous similar messages [ 2415.606801] Lustre: lustre-MDT0000: Recovery over after 0:01, of 2 clients 2 recovered and 0 were evicted. [ 2415.611767] Lustre: Skipped 2 previous similar messages [ 2415.633494] Lustre: lustre-OST0000: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x280000401:5714 to 0x280000401:5761) [ 2415.634127] Lustre: lustre-OST0001: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x2c0000401:5682 to 0x2c0000401:5761) [ 2435.652088] Lustre: Failing over lustre-MDT0000 [ 2436.011444] Lustre: server umount lustre-MDT0000 complete [ 2436.067466] Lustre: lustre-MDT0000-lwp-OST0000: Connection to lustre-MDT0000 (at 0@lo) was lost; in progress operations using this service will wait for recovery to complete [ 2436.072741] Lustre: Skipped 18 previous similar messages [ 2445.475190] LustreError: 13715:0:(ldlm_lib.c:1178:target_handle_connect()) lustre-MDT0000: not available for connect from 192.168.201.3@tcp (no target). If you are running an HA pair check that the target is mounted on the other server. [ 2445.481632] LustreError: 13715:0:(ldlm_lib.c:1178:target_handle_connect()) Skipped 93 previous similar messages [ 2449.960080] LDISKFS-fs (dm-0): mounted filesystem with ordered data mode. Opts: user_xattr,errors=remount-ro,no_mbcache,nodelalloc [ 2450.318519] Lustre: lustre-MDT0000: Imperative Recovery not enabled, recovery window 60-180 [ 2450.326822] Lustre: Skipped 4 previous similar messages [ 2450.553200] Lustre: lustre-MDT0000: in recovery but waiting for the first client to connect [ 2450.557181] Lustre: Skipped 4 previous similar messages [ 2452.586497] Lustre: DEBUG MARKER: oleg103-server.virtnet: executing set_default_debug -1 all [ 2454.210543] Lustre: DEBUG MARKER: test_51: failover in 25 sec [ 2455.525361] Lustre: lustre-MDT0000-lwp-MDT0001: Connection restored to 0@lo (at 0@lo) [ 2455.529433] Lustre: Skipped 19 previous similar messages [ 2455.564680] Lustre: lustre-OST0000: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x280000401:6163 to 0x280000401:6209) [ 2455.565430] Lustre: lustre-OST0001: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x2c0000401:6164 to 0x2c0000401:6209) [ 2480.543743] Lustre: Failing over lustre-MDT0000 [ 2480.884299] Lustre: server umount lustre-MDT0000 complete [ 2494.705312] LDISKFS-fs (dm-0): mounted filesystem with ordered data mode. Opts: user_xattr,errors=remount-ro,no_mbcache,nodelalloc [ 2494.763155] LustreError: MGC192.168.201.103@tcp: Connection to MGS (at 0@lo) was lost; in progress operations using this service will fail [ 2494.768813] LustreError: Skipped 3 previous similar messages [ 2497.053283] Lustre: DEBUG MARKER: oleg103-server.virtnet: executing set_default_debug -1 all [ 2498.585784] Lustre: DEBUG MARKER: test_51: failover in 30 sec [ 2500.105081] Lustre: 6491:0:(mdt_recovery.c:102:mdt_req_from_lrd()) @@@ restoring transno req@ffff99507f6e9f80 x1859825916301056/t60129544844(0) o36->e4121254-1881-4044-b64c-77d9b653b80f@192.168.201.3@tcp:498/0 lens 512/2888 e 0 to 0 dl 1773670658 ref 1 fl Interpret:/202/0 rc 0/0 job:'writemany.0' uid:0 gid:0 projid:4294967295 [ 2500.106456] Lustre: lustre-OST0000: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x280000401:6653 to 0x280000401:6689) [ 2500.108350] Lustre: lustre-OST0001: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x2c0000401:6652 to 0x2c0000401:6689) [ 2529.978713] Lustre: Failing over lustre-MDT0000 [ 2530.257261] Lustre: server umount lustre-MDT0000 complete [ 2543.987155] LDISKFS-fs (dm-0): mounted filesystem with ordered data mode. Opts: user_xattr,errors=remount-ro,no_mbcache,nodelalloc [ 2546.200560] Lustre: DEBUG MARKER: oleg103-server.virtnet: executing set_default_debug -1 all [ 2547.874934] Lustre: lustre-MDT0000: Will be in recovery for at least 1:00, or until 2 clients reconnect [ 2547.880612] Lustre: Skipped 2 previous similar messages [ 2549.753618] Lustre: lustre-MDT0000: Recovery over after 0:02, of 2 clients 2 recovered and 0 were evicted. [ 2549.760436] Lustre: Skipped 2 previous similar messages [ 2549.783250] Lustre: lustre-OST0001: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x2c0000401:7250 to 0x2c0000401:7265) [ 2549.783308] Lustre: lustre-OST0000: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x280000401:7249 to 0x280000401:7265) [ 2570.935564] Lustre: DEBUG MARKER: == recovery-small test 52: failover OST under load ======= 10:17:58 (1773670678) [ 2582.898039] Lustre: Failing over lustre-OST0000 [ 2582.933926] Lustre: lustre-OST0000: Not available for connect from 192.168.201.3@tcp (stopping) [ 2582.987140] Lustre: server umount lustre-OST0000 complete [ 2583.755107] LustreError: lustre-OST0000-osc-MDT0001: operation ost_create to node 0@lo failed: rc = -107 [ 2583.759014] LustreError: Skipped 6 previous similar messages [ 2597.194308] LDISKFS-fs (dm-2): mounted filesystem with ordered data mode. Opts: errors=remount-ro,no_mbcache,nodelalloc [ 2600.379865] Lustre: DEBUG MARKER: oleg103-server.virtnet: executing set_default_debug -1 all [ 2604.640386] Lustre: DEBUG MARKER: oleg103-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid [ 2605.602701] Lustre: DEBUG MARKER: osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec [ 2913.994774] Lustre: Failing over lustre-OST0000 [ 2914.296543] Lustre: lustre-OST0000-osc-MDT0001: Connection to lustre-OST0000 (at 0@lo) was lost; in progress operations using this service will wait for recovery to complete [ 2914.305654] Lustre: Skipped 11 previous similar messages [ 2914.308504] Lustre: lustre-OST0000: Not available for connect from 0@lo (stopping) [ 2916.117180] Lustre: server umount lustre-OST0000 complete [ 2930.444296] LDISKFS-fs (dm-2): mounted filesystem with ordered data mode. Opts: errors=remount-ro,no_mbcache,nodelalloc [ 2930.637410] Lustre: lustre-OST0000: Imperative Recovery not enabled, recovery window 60-180 [ 2930.641755] Lustre: Skipped 3 previous similar messages [ 2930.653050] Lustre: lustre-OST0000: in recovery but waiting for the first client to connect [ 2930.656179] Lustre: Skipped 3 previous similar messages [ 2931.874990] Lustre: lustre-OST0000: Will be in recovery for at least 1:00, or until 3 clients reconnect [ 2931.881140] Lustre: Skipped 1 previous similar message [ 2932.734744] Lustre: lustre-OST0000: Recovery over after 0:01, of 3 clients 3 recovered and 0 were evicted. [ 2932.742140] Lustre: Skipped 1 previous similar message [ 2932.742277] Lustre: lustre-OST0000-osc-MDT0001: Connection restored to 0@lo (at 0@lo) [ 2932.750014] Lustre: Skipped 13 previous similar messages [ 2933.833899] Lustre: DEBUG MARKER: oleg103-server.virtnet: executing set_default_debug -1 all [ 2938.373168] Lustre: DEBUG MARKER: oleg103-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid [ 2939.270584] Lustre: DEBUG MARKER: osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec [ 3245.337536] Lustre: Failing over lustre-OST0000 [ 3245.388060] LustreError: 8371:0:(ldlm_lib.c:1178:target_handle_connect()) lustre-OST0000: not available for connect from 192.168.201.3@tcp (no target). If you are running an HA pair check that the target is mounted on the other server. [ 3245.400898] LustreError: 8371:0:(ldlm_lib.c:1178:target_handle_connect()) Skipped 53 previous similar messages [ 3245.429198] Lustre: server umount lustre-OST0000 complete [ 3245.537943] LustreError: lustre-OST0000-osc-MDT0001: operation ost_statfs to node 0@lo failed: rc = -107 [ 3245.543787] LustreError: Skipped 2 previous similar messages [ 3259.483929] LDISKFS-fs (dm-2): mounted filesystem with ordered data mode. Opts: errors=remount-ro,no_mbcache,nodelalloc [ 3259.635508] Lustre: lustre-OST0000: Imperative Recovery enabled, recovery window shrunk from 60-180 down to 60-180 [ 3262.357915] Lustre: DEBUG MARKER: oleg103-server.virtnet: executing set_default_debug -1 all [ 3266.816541] Lustre: DEBUG MARKER: oleg103-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid [ 3267.852853] Lustre: DEBUG MARKER: osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec [ 3537.655351] Lustre: DEBUG MARKER: == recovery-small test 53a: touch: drop rep ============== 10:34:04 (1773671644) [ 3538.318066] Lustre: *** cfs_fail_loc=157, val=2147483648*** [ 3538.320547] LustreError: 9879:0:(ldlm_lib.c:3325:target_send_reply_msg()) @@@ dropping reply req@ffff9950765b3b80 x1859825948723584/t0(0) o101->e4121254-1881-4044-b64c-77d9b653b80f@192.168.201.3@tcp:742/0 lens 576/688 e 0 to 0 dl 1773671657 ref 1 fl Interpret:/600/0 rc 0/0 job:'openfile.0' uid:0 gid:0 projid:0 [ 3538.332905] LustreError: 9879:0:(ldlm_lib.c:3325:target_send_reply_msg()) Skipped 1 previous similar message [ 3554.487355] Lustre: lustre-MDT0000: Client e4121254-1881-4044-b64c-77d9b653b80f (at 192.168.201.3@tcp) reconnecting [ 3554.492621] Lustre: Skipped 6 previous similar messages [ 3557.717481] Lustre: DEBUG MARKER: == recovery-small test 53b: touch: drop rep ============== 10:34:25 (1773671665) [ 3558.367619] Lustre: *** cfs_fail_loc=157, val=2147483648*** [ 3558.370309] LustreError: 11311:0:(ldlm_lib.c:3325:target_send_reply_msg()) @@@ dropping reply req@ffff99506ea00a80 x1859825948730496/t0(0) o101->e4121254-1881-4044-b64c-77d9b653b80f@192.168.201.3@tcp:7/0 lens 576/688 e 0 to 0 dl 1773671677 ref 1 fl Interpret:/600/0 rc 0/0 job:'openfile.0' uid:0 gid:0 projid:0 [ 3578.096897] Lustre: DEBUG MARKER: == recovery-small test 53c: touch: drop rep ============== 10:34:45 (1773671685) [ 3578.669597] Lustre: *** cfs_fail_loc=157, val=2147483648*** [ 3578.673258] LustreError: 11311:0:(ldlm_lib.c:3325:target_send_reply_msg()) @@@ dropping reply req@ffff99507fdfdc00 x1859825948734208/t68719478621(0) o101->e4121254-1881-4044-b64c-77d9b653b80f@192.168.201.3@tcp:27/0 lens 664/664 e 0 to 0 dl 1773671697 ref 1 fl Interpret:/600/0 rc 0/0 job:'openfile.0' uid:0 gid:0 projid:0 [ 3594.919611] Lustre: 13715:0:(mdt_recovery.c:102:mdt_req_from_lrd()) @@@ restoring transno req@ffff99507bccad80 x1859825948734208/t68719478621(0) o101->e4121254-1881-4044-b64c-77d9b653b80f@192.168.201.3@tcp:44/0 lens 664/3488 e 0 to 0 dl 1773671714 ref 1 fl Interpret:H/602/0 rc 0/0 job:'openfile.0' uid:0 gid:0 projid:0 [ 3598.347650] Lustre: DEBUG MARKER: == recovery-small test 54: back in time ================== 10:35:05 (1773671705) [ 3609.662891] Lustre: Failing over lustre-MDT0000 [ 3610.085341] Lustre: server umount lustre-MDT0000 complete [ 3613.154074] Lustre: lustre-MDT0000-osp-MDT0001: Connection to lustre-MDT0000 (at 0@lo) was lost; in progress operations using this service will wait for recovery to complete [ 3613.159975] Lustre: Skipped 6 previous similar messages [ 3623.324705] LDISKFS-fs (dm-0): mounted filesystem with ordered data mode. Opts: user_xattr,errors=remount-ro,no_mbcache,nodelalloc [ 3623.380287] LustreError: MGC192.168.201.103@tcp: Connection to MGS (at 0@lo) was lost; in progress operations using this service will fail [ 3623.384534] LustreError: Skipped 1 previous similar message [ 3623.507327] Lustre: lustre-MDT0000: Imperative Recovery not enabled, recovery window 60-180 [ 3623.541593] Lustre: lustre-MDT0000: in recovery but waiting for the first client to connect [ 3623.545466] Lustre: Skipped 1 previous similar message [ 3624.098454] Lustre: lustre-MDT0000: Will be in recovery for at least 1:00, or until 3 clients reconnect [ 3624.102942] Lustre: Skipped 1 previous similar message [ 3624.996096] Lustre: DEBUG MARKER: oleg103-server.virtnet: executing set_default_debug -1 all [ 3628.519529] Lustre: lustre-MDT0000-lwp-OST0001: Connection restored to 0@lo (at 0@lo) [ 3628.521909] Lustre: Skipped 3 previous similar messages [ 3628.531415] Lustre: lustre-MDT0000: Recovery over after 0:04, of 3 clients 3 recovered and 0 were evicted. [ 3628.534499] Lustre: Skipped 1 previous similar message [ 3628.553490] Lustre: lustre-OST0000: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x280000401:7574 to 0x280000401:7617) [ 3628.553491] Lustre: lustre-OST0001: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x2c0000401:7574 to 0x2c0000401:7617) [ 3630.011159] Lustre: DEBUG MARKER: oleg103-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid [ 3630.744517] Lustre: DEBUG MARKER: mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec [ 3634.828246] Lustre: DEBUG MARKER: == recovery-small test 55: ost_brw_read/write drops timed-out read/write request ========================================================== 10:35:42 (1773671742) [ 3638.735356] Lustre: *** cfs_fail_loc=21d, val=0*** [ 3638.737603] Lustre: Skipped 3 previous similar messages [ 3638.739208] LustreError: 8375:0:(tgt_handler.c:2808:tgt_brw_write()) lustre-OST0000: Dropping timed-out write from 12345-192.168.201.3@tcp because locking object 0x280000400:15840 took 0 seconds (limit was 11). [ 3638.745628] Lustre: lustre-OST0000: Bulk IO write error with e4121254-1881-4044-b64c-77d9b653b80f (at 192.168.201.3@tcp), client will retry: rc = -110 [ 3638.746598] LustreError: 8375:0:(tgt_handler.c:2808:tgt_brw_write()) Skipped 1 previous similar message [ 3654.819254] Lustre: lustre-OST0000: Client e4121254-1881-4044-b64c-77d9b653b80f (at 192.168.201.3@tcp) reconnecting [ 3654.823543] Lustre: Skipped 2 previous similar messages [ 3654.834270] LustreError: 8377:0:(tgt_handler.c:2808:tgt_brw_write()) lustre-OST0000: Dropping timed-out write from 12345-192.168.201.3@tcp because locking object 0x280000400:15840 took 0 seconds (limit was 11). [ 3654.835952] Lustre: lustre-OST0000: Bulk IO write error with e4121254-1881-4044-b64c-77d9b653b80f (at 192.168.201.3@tcp), client will retry: rc = -110 [ 3654.840475] LustreError: 8377:0:(tgt_handler.c:2808:tgt_brw_write()) Skipped 12 previous similar messages [ 3654.846348] Lustre: Skipped 12 previous similar messages [ 3655.841562] LustreError: 8376:0:(tgt_handler.c:2808:tgt_brw_write()) lustre-OST0000: Dropping timed-out write from 12345-192.168.201.3@tcp because locking object 0x280000400:15840 took 0 seconds (limit was 11). [ 3655.849804] Lustre: lustre-OST0000: Bulk IO write error with e4121254-1881-4044-b64c-77d9b653b80f (at 192.168.201.3@tcp), client will retry: rc = -110 [ 3655.855914] Lustre: Skipped 1 previous similar message [ 3670.185756] LustreError: 32545:0:(tgt_handler.c:2808:tgt_brw_write()) lustre-OST0000: Dropping timed-out write from 12345-192.168.201.3@tcp because locking object 0x280000400:15840 took 0 seconds (limit was 11). [ 3670.187926] Lustre: lustre-OST0000: Bulk IO write error with e4121254-1881-4044-b64c-77d9b653b80f (at 192.168.201.3@tcp), client will retry: rc = -110 [ 3670.192574] LustreError: 32545:0:(tgt_handler.c:2808:tgt_brw_write()) Skipped 7 previous similar messages [ 3670.197613] Lustre: Skipped 6 previous similar messages [ 3686.569127] LustreError: 53317:0:(tgt_handler.c:2808:tgt_brw_write()) lustre-OST0000: Dropping timed-out write from 12345-192.168.201.3@tcp because locking object 0x280000400:15840 took 0 seconds (limit was 11). [ 3686.570278] Lustre: lustre-OST0000: Bulk IO write error with e4121254-1881-4044-b64c-77d9b653b80f (at 192.168.201.3@tcp), client will retry: rc = -110 [ 3686.577036] LustreError: 53317:0:(tgt_handler.c:2808:tgt_brw_write()) Skipped 7 previous similar messages [ 3686.580844] Lustre: Skipped 7 previous similar messages [ 3702.952271] Lustre: *** cfs_fail_loc=21d, val=0*** [ 3702.952825] LustreError: 8376:0:(tgt_handler.c:2808:tgt_brw_write()) lustre-OST0000: Dropping timed-out write from 12345-192.168.201.3@tcp because locking object 0x280000400:15840 took 0 seconds (limit was 11). [ 3702.953371] Lustre: lustre-OST0000: Bulk IO write error with e4121254-1881-4044-b64c-77d9b653b80f (at 192.168.201.3@tcp), client will retry: rc = -110 [ 3702.953387] Lustre: Skipped 1 previous similar message [ 3702.954313] Lustre: Skipped 33 previous similar messages [ 3702.960714] LustreError: 8376:0:(tgt_handler.c:2808:tgt_brw_write()) Skipped 6 previous similar messages [ 3723.347027] Lustre: DEBUG MARKER: == recovery-small test 56: do not fail on getattr resend ========================================================== 10:37:10 (1773671830) [ 3723.753173] LustreError: 6491:0:(mdt_handler.c:2334:mdt_getattr_name_lock()) cfs_fail_timeout id 136 sleeping for 40000ms [ 3763.799139] LustreError: 6491:0:(mdt_handler.c:2334:mdt_getattr_name_lock()) cfs_fail_timeout id 136 awake [ 3766.597709] Lustre: DEBUG MARKER: == recovery-small test 57: read procfs entries causes kernel crash ========================================================== 10:37:54 (1773671874) [ 3768.761785] Lustre: Failing over lustre-MDT0000 [ 3768.932809] Lustre: server umount lustre-MDT0000 complete [ 3772.268282] LDISKFS-fs (dm-0): mounted filesystem with ordered data mode. Opts: user_xattr,errors=remount-ro,no_mbcache,nodelalloc [ 3772.317483] LustreError: MGC192.168.201.103@tcp: Connection to MGS (at 0@lo) was lost; in progress operations using this service will fail [ 3772.470806] Lustre: lustre-MDT0000: Aborting client recovery [ 3772.472169] LustreError: 54513:0:(ldlm_lib.c:2983:target_stop_recovery_thread()) lustre-MDT0000: Aborting recovery [ 3772.474688] Lustre: 54550:0:(ldlm_lib.c:2386:target_recovery_overseer()) recovery is aborted, evict exports in recovery [ 3772.478992] Lustre: 54550:0:(ldlm_lib.c:2386:target_recovery_overseer()) Skipped 2 previous similar messages [ 3772.481683] Lustre: 54550:0:(genops.c:1620:class_disconnect_stale_exports()) lustre-MDT0000: disconnect stale client lustre-MDT0001-mdtlov_UUID@ [ 3772.485307] Lustre: 54550:0:(genops.c:1620:class_disconnect_stale_exports()) Skipped 2 previous similar messages [ 3772.489693] Lustre: lustre-MDT0000: disconnecting 1 stale clients [ 3772.494203] Lustre: lustre-MDT0000-osd: cancel update llog [0x200002b10:0x1:0x0] [ 3772.501958] Lustre: lustre-MDT0001-osp-MDT0000: cancel update llog [0x240000404:0x1:0x0] [ 3772.531429] Lustre: lustre-OST0000: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x280000401:7619 to 0x280000401:7649) [ 3772.531540] Lustre: lustre-OST0001: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x2c0000401:7574 to 0x2c0000401:7649) [ 3774.004226] Lustre: DEBUG MARKER: oleg103-server.virtnet: executing set_default_debug -1 all [ 3777.518034] LustreError: lustre-MDT0000-osp-MDT0001: This client was evicted by lustre-MDT0000; in progress operations using this service will fail. [ 3786.370788] Lustre: DEBUG MARKER: == recovery-small test 58: Eviction in the middle of open RPC reply processing ========================================================== 10:38:13 (1773671893) [ 3803.103839] Lustre: 13715:0:(client.c:2478:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1773671895/real 1773671895] req@ffff994f439d5500 x1859825926767104/t0(0) o104->lustre-MDT0000@192.168.201.3@tcp:15/16 lens 328/224 e 0 to 1 dl 1773671911 ref 1 fl Rpc:XQr/0/ffffffff rc 0/-1 job:'' uid:4294967295 gid:4294967295 projid:4294967295 [ 3806.057768] Lustre: DEBUG MARKER: == recovery-small test 59: Read cancel race on client eviction ========================================================== 10:38:33 (1773671913) [ 3817.054935] LustreError: 28897:0:(ldlm_lockd.c:727:ldlm_handle_ast_error()) ### client (nid 192.168.201.3@tcp) returned error from blocking AST (req@00000000efdab930 x1859825926781312 status -107 rc -107), evict it ns: filter-lustre-OST0000_UUID lock: 000000006dc0d78b/0x441c89a01286b946 lrc: 4/0,0 mode: PW/PW res: [0x280000401:0x1de2:0x0].0x0 rrc: 3 type: EXT [0->18446744073709551615] (req 0->4095) gid 0 flags: 0x60000400000020 nid: 192.168.201.3@tcp remote: 0x2f77f5611380666c expref: 5 pid: 27759 timeout: 3917 lvb_type: 0 lru_score: 0 lru_type: 0 [ 3817.068242] LustreError: lustre-OST0000: A client on nid 192.168.201.3@tcp was evicted due to a lock blocking callback time out: rc -107 [ 3817.072433] LustreError: 6479:0:(ldlm_lockd.c:255:expired_lock_main()) ### lock callback timer expired after 0s: evicting client at 192.168.201.3@tcp ns: filter-lustre-OST0000_UUID lock: 000000006dc0d78b/0x441c89a01286b946 lrc: 3/0,0 mode: PW/PW res: [0x280000401:0x1de2:0x0].0x0 rrc: 3 type: EXT [0->18446744073709551615] (req 0->4095) gid 0 flags: 0x60000400000020 nid: 192.168.201.3@tcp remote: 0x2f77f5611380666c expref: 6 pid: 27759 timeout: 0 lvb_type: 0 lru_score: 0 lru_type: 0 [ 3819.788664] Lustre: DEBUG MARKER: == recovery-small test 60: Add Changelog entries during MDS failover ========================================================== 10:38:47 (1773671927) [ 3819.858675] LustreError: 6491:0:(ldlm_lockd.c:727:ldlm_handle_ast_error()) ### client (nid 192.168.201.3@tcp) returned error from blocking AST (req@000000000063c1dd x1859825926783616 status -107 rc -107), evict it ns: mdt-lustre-MDT0000_UUID lock: ffff995060468600/0x441c89a01286b962 lrc: 4/0,0 mode: PR/PR res: [0x200000007:0x1:0x0].0x0 bits 0x13/0x0 rrc: 3 type: IBT gid 0 flags: 0x60200400000020 nid: 192.168.201.3@tcp remote: 0x2f77f5611380667a expref: 6 pid: 6490 timeout: 3919 lvb_type: 0 lru_score: 0 lru_type: 0 [ 3819.873529] LustreError: lustre-MDT0000: A client on nid 192.168.201.3@tcp was evicted due to a lock blocking callback time out: rc -107 [ 3819.877619] LustreError: 6479:0:(ldlm_lockd.c:255:expired_lock_main()) ### lock callback timer expired after 0s: evicting client at 192.168.201.3@tcp ns: mdt-lustre-MDT0000_UUID lock: ffff995060468600/0x441c89a01286b962 lrc: 3/0,0 mode: PR/PR res: [0x200000007:0x1:0x0].0x0 bits 0x13/0x0 rrc: 3 type: IBT gid 0 flags: 0x60200400000020 nid: 192.168.201.3@tcp remote: 0x2f77f5611380667a expref: 7 pid: 6490 timeout: 0 lvb_type: 0 lru_score: 0 lru_type: 0 [ 3820.852179] Lustre: lustre-MDD0000: changelog on [ 3821.815605] Lustre: lustre-MDD0001: changelog on [ 3836.823202] Lustre: lustre-OST0001: haven't heard from client c1c12db9-3336-497b-8296-98ee4194d330 (at 192.168.201.3@tcp) in 31 seconds. I think it's dead, and I am evicting it. exp ffff994f46c10800, cur 1773671945 deadline 1773671944 last 1773671914 [ 3844.529981] Lustre: Failing over lustre-MDT0000 [ 3844.859460] Lustre: server umount lustre-MDT0000 complete [ 3847.328751] LustreError: 6489:0:(ldlm_lib.c:1178:target_handle_connect()) lustre-MDT0000: not available for connect from 192.168.201.3@tcp (no target). If you are running an HA pair check that the target is mounted on the other server. [ 3847.335814] LustreError: 6489:0:(ldlm_lib.c:1178:target_handle_connect()) Skipped 30 previous similar messages [ 3849.184081] LustreError: lustre-MDT0000-osp-MDT0001: operation mds_statfs to node 0@lo failed: rc = -107 [ 3849.188143] LustreError: Skipped 1 previous similar message [ 3858.483348] LDISKFS-fs (dm-0): mounted filesystem with ordered data mode. Opts: user_xattr,errors=remount-ro,no_mbcache,nodelalloc [ 3858.528241] LustreError: MGC192.168.201.103@tcp: Connection to MGS (at 0@lo) was lost; in progress operations using this service will fail [ 3858.663825] Lustre: lustre-MDD0000: changelog on [ 3860.430338] Lustre: DEBUG MARKER: oleg103-server.virtnet: executing set_default_debug -1 all [ 3864.131774] Lustre: lustre-OST0001: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x2c0000401:8935 to 0x2c0000401:8961) [ 3864.131862] Lustre: lustre-OST0000: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x280000401:8935 to 0x280000401:8961) [ 3907.851407] Lustre: lustre-MDT0001: haven't heard from client c1c12db9-3336-497b-8296-98ee4194d330 (at 192.168.201.3@tcp) in 102 seconds. I think it's dead, and I am evicting it. exp ffff99505968d800, cur 1773672016 deadline 1773672014 last 1773671914 [ 4010.233897] Lustre: lustre-MDD0000: changelog off [ 4016.325176] Lustre: lustre-MDD0001: changelog off [ 4037.276702] Lustre: DEBUG MARKER: == recovery-small test 61: Verify to not reuse orphan objects - bug 17025 ========================================================== 10:42:22 (1773672142) [ 4053.277392] Lustre: DEBUG MARKER: mds1 REPLAY BARRIER on lustre-MDT0000 [ 4058.820635] Lustre: Failing over lustre-MDT0000 [ 4059.643351] Lustre: server umount lustre-MDT0000 complete [ 4077.982292] LDISKFS-fs (dm-0): 3 truncates cleaned up [ 4077.985410] LDISKFS-fs (dm-0): recovery complete [ 4078.020522] LDISKFS-fs (dm-0): mounted filesystem with ordered data mode. Opts: user_xattr,errors=remount-ro,no_mbcache,nodelalloc [ 4078.311885] LustreError: MGC192.168.201.103@tcp: Connection to MGS (at 0@lo) was lost; in progress operations using this service will fail [ 4078.836729] Lustre: lustre-MDT0000: Aborting client recovery [ 4078.843576] LustreError: 59076:0:(ldlm_lib.c:2983:target_stop_recovery_thread()) lustre-MDT0000: Aborting recovery [ 4078.856794] Lustre: 59112:0:(ldlm_lib.c:2386:target_recovery_overseer()) recovery is aborted, evict exports in recovery [ 4078.864283] Lustre: 59112:0:(ldlm_lib.c:2386:target_recovery_overseer()) Skipped 2 previous similar messages [ 4078.875344] Lustre: 59112:0:(genops.c:1620:class_disconnect_stale_exports()) lustre-MDT0000: disconnect stale client bfd032c3-495f-406b-9a98-bd6e1b2b25d9@ [ 4078.904961] Lustre: lustre-MDT0000: disconnecting 2 stale clients [ 4078.907435] Lustre: lustre-MDT0000: Denying connection for new client bfd032c3-495f-406b-9a98-bd6e1b2b25d9 (at 192.168.201.3@tcp), waiting for 2 known clients (0 recovered, 0 in progress, and 1 evicted) already passed deadline 67:58 [ 4078.934505] Lustre: lustre-MDT0000-osd: cancel update llog [0x2000088d0:0x1:0x0] [ 4078.952895] Lustre: lustre-MDT0001-osp-MDT0000: cancel update llog [0x240000405:0x1:0x0] [ 4079.048816] Lustre: lustre-OST0001: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x2c0000401:8935 to 0x2c0000401:8993) [ 4079.062506] Lustre: lustre-OST0000: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x280000401:8935 to 0x280000401:8993) [ 4084.244502] LustreError: lustre-MDT0000-osp-MDT0001: This client was evicted by lustre-MDT0000; in progress operations using this service will fail. [ 4085.846158] Lustre: DEBUG MARKER: oleg103-server.virtnet: executing set_default_debug -1 all [ 4113.556771] Lustre: DEBUG MARKER: == recovery-small test 65: lock enqueue for destroyed export ========================================================== 10:43:39 (1773672219) [ 4115.742379] LustreError: 27758:0:(ldlm_lockd.c:1431:ldlm_handle_enqueue()) cfs_fail_timeout id 31e sleeping for 6000ms [ 4115.760960] Lustre: *** cfs_fail_loc=31e, val=0*** [ 4115.764936] Lustre: Skipped 3 previous similar messages [ 4117.756626] LustreError: 28896:0:(ldlm_lockd.c:1431:ldlm_handle_enqueue()) cfs_fail_timeout id 31e sleeping for 6000ms [ 4121.062852] Lustre: 59894:0:(genops.c:1793:obd_export_evict_by_uuid()) lustre-OST0000: evicting bfd032c3-495f-406b-9a98-bd6e1b2b25d9 at adminstrative request [ 4121.082246] LustreError: 6478:0:(ldlm_lockd.c:2933:ldlm_bl_thread_exports()) cfs_fail_timeout id 31e sleeping for 4000ms [ 4121.808089] LustreError: 27758:0:(ldlm_lockd.c:1431:ldlm_handle_enqueue()) cfs_fail_timeout id 31e awake [ 4121.825619] LustreError: 27758:0:(ldlm_lockd.c:1453:ldlm_handle_enqueue()) ### lock on destroyed export 0000000050618406 ns: filter-lustre-OST0000_UUID lock: 00000000d8b1e613/0x441c89a0128d1e3c lrc: 3/0,0 mode: --/PW res: [0x280000401:0x2323:0x0].0x0 rrc: 4 type: EXT [0->4095] (req 0->4095) gid 0 flags: 0x70000000020020 nid: 192.168.201.3@tcp remote: 0x2f77f5611381313b expref: 4 pid: 27758 timeout: 0 lvb_type: 0 lru_score: 0 lru_type: 0 [ 4123.760387] LustreError: 28896:0:(ldlm_lockd.c:1431:ldlm_handle_enqueue()) cfs_fail_timeout id 31e awake [ 4124.251627] LustreError: 6478:0:(ldlm_lockd.c:2933:ldlm_bl_thread_exports()) cfs_fail_timeout interrupted [ 4132.021683] Lustre: lustre-OST0000: Client 8c9ee295-608c-4e17-a681-d2965ac0b7fa (at 192.168.201.3@tcp) reconnecting [ 4132.033702] Lustre: Skipped 6 previous similar messages [ 4144.942095] Lustre: DEBUG MARKER: == recovery-small test 66: lock enqueue re-send vs client eviction ========================================================== 10:44:11 (1773672251) [ 4146.838300] Lustre: *** cfs_fail_loc=157, val=2147483648*** [ 4146.843930] LustreError: 11311:0:(ldlm_lib.c:3325:target_send_reply_msg()) @@@ dropping reply req@ffff9950755d6d80 x1859825953064320/t0(0) o101->bfd032c3-495f-406b-9a98-bd6e1b2b25d9@192.168.201.3@tcp:640/0 lens 576/688 e 0 to 0 dl 1773672310 ref 1 fl Interpret:/600/0 rc 0/0 job:'stat.0' uid:0 gid:0 projid:0 [ 4149.420553] LustreError: 11311:0:(mdt_handler.c:2334:mdt_getattr_name_lock()) cfs_fail_timeout id 136 sleeping for 40000ms [ 4152.782397] Lustre: 60305:0:(genops.c:1793:obd_export_evict_by_uuid()) lustre-MDT0000: evicting bfd032c3-495f-406b-9a98-bd6e1b2b25d9 at adminstrative request [ 4154.318844] LustreError: 11311:0:(mdt_handler.c:2334:mdt_getattr_name_lock()) cfs_fail_timeout interrupted [ 4164.325212] Lustre: DEBUG MARKER: == recovery-small test 67: connect vs import invalidate race ========================================================== 10:44:30 (1773672270) [ 4170.416406] Lustre: 60597:0:(genops.c:1793:obd_export_evict_by_uuid()) lustre-MDT0000: evicting bfd032c3-495f-406b-9a98-bd6e1b2b25d9 at adminstrative request [ 4192.512684] Lustre: DEBUG MARKER: == recovery-small test 100: IR: Make sure normal recovery still works w/o IR ========================================================== 10:44:57 (1773672297) [ 4198.722579] Lustre: Failing over lustre-OST0000 [ 4198.833300] Lustre: server umount lustre-OST0000 complete [ 4220.589460] LDISKFS-fs (dm-2): mounted filesystem with ordered data mode. Opts: errors=remount-ro,no_mbcache,nodelalloc [ 4226.037454] Lustre: lustre-OST0000: Will be in recovery for at least 1:00, or until 3 clients reconnect [ 4226.054584] Lustre: Skipped 1 previous similar message [ 4228.997386] Lustre: DEBUG MARKER: oleg103-server.virtnet: executing set_default_debug -1 all [ 4233.727539] Lustre: lustre-OST0000: Recovery over after 0:07, of 3 clients 3 recovered and 0 were evicted. [ 4233.728158] Lustre: lustre-OST0000-osc-MDT0001: Connection restored to 0@lo (at 0@lo) [ 4233.745555] Lustre: Skipped 1 previous similar message [ 4233.780369] Lustre: Skipped 15 previous similar messages [ 4238.771848] Lustre: DEBUG MARKER: oleg103-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid [ 4241.001680] Lustre: DEBUG MARKER: osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec [ 4254.299919] Lustre: DEBUG MARKER: == recovery-small test 101a: IR: Make sure IR works w/o normal recovery ========================================================== 10:46:00 (1773672360) [ 4260.197322] Lustre: Failing over lustre-OST0000 [ 4260.371211] Lustre: server umount lustre-OST0000 complete [ 4261.878432] Lustre: lustre-OST0000-osc-MDT0001: Connection to lustre-OST0000 (at 0@lo) was lost; in progress operations using this service will wait for recovery to complete [ 4261.907572] Lustre: Skipped 15 previous similar messages [ 4283.017776] LDISKFS-fs (dm-2): mounted filesystem with ordered data mode. Opts: errors=remount-ro,no_mbcache,nodelalloc [ 4283.526144] Lustre: lustre-OST0000: Imperative Recovery enabled, recovery window shrunk from 60-180 down to 60-180 [ 4283.559653] Lustre: lustre-OST0000: in recovery but waiting for the first client to connect [ 4283.586558] Lustre: Skipped 8 previous similar messages [ 4293.209796] Lustre: DEBUG MARKER: oleg103-server.virtnet: executing set_default_debug -1 all [ 4304.310813] Lustre: DEBUG MARKER: oleg103-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid [ 4306.974403] Lustre: DEBUG MARKER: osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec [ 4322.604737] Lustre: DEBUG MARKER: == recovery-small test 101b: IR: Make sure IR works w/o normal recovery and proceed EAGAIN ========================================================== 10:47:08 (1773672428) [ 4329.506452] Lustre: Failing over lustre-OST0000 [ 4329.701475] Lustre: server umount lustre-OST0000 complete [ 4353.813612] LDISKFS-fs (dm-2): mounted filesystem with ordered data mode. Opts: errors=remount-ro,no_mbcache,nodelalloc [ 4354.205675] Lustre: lustre-OST0000: Imperative Recovery enabled, recovery window shrunk from 60-180 down to 60-180 [ 4354.227870] LustreError: 64536:0:(ofd_dev.c:633:ofd_prepare()) cfs_fail_timeout id 247 sleeping for 25000ms [ 4379.287136] LustreError: 64536:0:(ofd_dev.c:633:ofd_prepare()) cfs_fail_timeout id 247 awake [ 4389.781054] Lustre: DEBUG MARKER: oleg103-server.virtnet: executing set_default_debug -1 all [ 4401.955278] Lustre: DEBUG MARKER: oleg103-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid [ 4404.643980] Lustre: DEBUG MARKER: osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec [ 4419.070917] Lustre: DEBUG MARKER: == recovery-small test 102: IR: New client gets updated nidtbl after MGS restart ========================================================== 10:48:44 (1773672524) [ 4424.948859] Lustre: Failing over lustre-OST0000 [ 4425.307195] Lustre: server umount lustre-OST0000 complete [ 4449.351948] LDISKFS-fs (dm-2): mounted filesystem with ordered data mode. Opts: errors=remount-ro,no_mbcache,nodelalloc [ 4449.885511] Lustre: lustre-OST0000: Imperative Recovery enabled, recovery window shrunk from 60-180 down to 60-180 [ 4459.881734] Lustre: DEBUG MARKER: oleg103-server.virtnet: executing set_default_debug -1 all [ 4472.425405] Lustre: DEBUG MARKER: oleg103-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid [ 4475.359603] Lustre: DEBUG MARKER: osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid in FULL state after 0 sec [ 4488.454304] Lustre: Failing over lustre-MDT0000 [ 4488.775623] Lustre: server umount lustre-MDT0000 complete [ 4489.187535] LustreError: lustre-MDT0000-osp-MDT0001: operation mds_statfs to node 0@lo failed: rc = -107 [ 4489.199066] LustreError: Skipped 4 previous similar messages [ 4489.212928] LustreError: 6494:0:(ldlm_lib.c:1178:target_handle_connect()) lustre-MDT0000: not available for connect from 0@lo (no target). If you are running an HA pair check that the target is mounted on the other server. [ 4489.264793] LustreError: 6494:0:(ldlm_lib.c:1178:target_handle_connect()) Skipped 67 previous similar messages [ 4501.504274] LDISKFS-fs (dm-0): mounted filesystem with ordered data mode. Opts: user_xattr,errors=remount-ro,no_mbcache,nodelalloc [ 4501.874972] LustreError: MGC192.168.201.103@tcp: Connection to MGS (at 0@lo) was lost; in progress operations using this service will fail [ 4502.593443] Lustre: lustre-MDT0000: Imperative Recovery not enabled, recovery window 60-180 [ 4502.608608] Lustre: Skipped 4 previous similar messages [ 4507.806400] Lustre: lustre-OST0001: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x2c0000401:8995 to 0x2c0000401:9025) [ 4507.807068] Lustre: lustre-OST0000: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x280000401:8997 to 0x280000401:9025) [ 4510.023983] Lustre: DEBUG MARKER: oleg103-server.virtnet: executing set_default_debug -1 all [ 4517.251087] Lustre: Failing over lustre-OST0000 [ 4517.521416] Lustre: server umount lustre-OST0000 complete [ 4544.108289] LDISKFS-fs (dm-2): mounted filesystem with ordered data mode. Opts: errors=remount-ro,no_mbcache,nodelalloc [ 4558.143302] Lustre: DEBUG MARKER: oleg103-server.virtnet: executing set_default_debug -1 all [ 4570.641251] Lustre: DEBUG MARKER: oleg103-client.virtnet: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid [ 4587.727776] Lustre: DEBUG MARKER: == recovery-small test 103: IR: MDS can start w/o MGS and get updated nidtbl later ========================================================== 10:51:32 (1773672692) [ 4592.631751] Lustre: DEBUG MARKER: SKIP: recovery-small test_103 needs separate mgs and mds [ 4596.227794] Lustre: DEBUG MARKER: == recovery-small test 104: IR: ost can disable IR voluntarily ========================================================== 10:51:41 (1773672701) [ 4603.171678] Lustre: Failing over lustre-OST0000 [ 4603.313477] Lustre: server umount lustre-OST0000 complete [ 4618.581848] LDISKFS-fs (dm-2): mounted filesystem with ordered data mode. Opts: errors=remount-ro,no_mbcache,nodelalloc [ 4631.490284] Lustre: DEBUG MARKER: oleg103-server.virtnet: executing set_default_debug -1 all [ 4653.211055] Lustre: DEBUG MARKER: == recovery-small test 105: IR: NON IR clients support === 10:52:38 (1773672758) [ 4655.701895] Lustre: DEBUG MARKER: SKIP: recovery-small test_105 Needs multiple clients [ 4659.622891] Lustre: DEBUG MARKER: == recovery-small test 106: lightweight connection support ========================================================== 10:52:44 (1773672764) [ 4676.897929] Lustre: DEBUG MARKER: mds1 REPLAY BARRIER on lustre-MDT0000 [ 4679.499831] Lustre: Failing over lustre-MDT0000 [ 4679.778478] Lustre: server umount lustre-MDT0000 complete [ 4696.049117] Lustre: 3638:0:(client.c:2478:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1773672788/real 1773672788] req@ffff99507fdfb800 x1859825927883520/t0(0) o400->MGC192.168.201.103@tcp@0@lo:26/25 lens 224/224 e 0 to 1 dl 1773672804 ref 1 fl Rpc:XNQr/200/ffffffff rc 0/-1 job:'kworker.0' uid:0 gid:0 projid:4294967295 [ 4710.930207] LDISKFS-fs (dm-0): 2 truncates cleaned up [ 4710.936787] LDISKFS-fs (dm-0): recovery complete [ 4710.950916] LDISKFS-fs (dm-0): mounted filesystem with ordered data mode. Opts: user_xattr,errors=remount-ro,no_mbcache,nodelalloc [ 4729.681157] Lustre: lustre-OST0001: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x2c0000401:9027 to 0x2c0000401:9057) [ 4729.682282] Lustre: lustre-OST0000: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x280000401:8997 to 0x280000401:9057) [ 4732.333739] Lustre: DEBUG MARKER: oleg103-server.virtnet: executing set_default_debug -1 all [ 4757.624980] Lustre: DEBUG MARKER: == recovery-small test 107: drop reint reply, then restart MDT ========================================================== 10:54:22 (1773672862) [ 4759.400440] Lustre: *** cfs_fail_loc=119, val=2147483648*** [ 4759.408449] LustreError: 6489:0:(ldlm_lib.c:3325:target_send_reply_msg()) @@@ dropping reply req@ffff99507f685500 x1859825953162624/t94489280517(0) o36->0dd72a9b-fa7c-4aef-9bbf-cbf02c000f62@192.168.201.3@tcp:453/0 lens 552/448 e 0 to 0 dl 1773672878 ref 1 fl Interpret:/200/0 rc 0/0 job:'mkdir.0' uid:0 gid:0 projid:4294967295 [ 4763.842664] Lustre: Failing over lustre-MDT0000 [ 4764.355291] Lustre: server umount lustre-MDT0000 complete [ 4780.999333] Lustre: 3636:0:(client.c:2478:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1773672873/real 1773672873] req@ffff994f46d84e00 x1859825927927680/t0(0) o400->MGC192.168.201.103@tcp@0@lo:26/25 lens 224/224 e 0 to 1 dl 1773672889 ref 1 fl Rpc:XNQr/200/ffffffff rc 0/-1 job:'kworker.0' uid:0 gid:0 projid:4294967295 [ 4789.634447] LDISKFS-fs (dm-0): mounted filesystem with ordered data mode. Opts: user_xattr,errors=remount-ro,no_mbcache,nodelalloc [ 4790.879173] LustreError: 72873:0:(import.c:337:ptlrpc_invalidate_import()) MGS: timeout waiting for callback (1 != 0) [ 4790.930469] LustreError: 72873:0:(import.c:361:ptlrpc_invalidate_import()) @@@ still on sending list req@ffff995076633480 x1859825927933568/t0(0) o250->MGC192.168.201.103@tcp@0@lo:26/25 lens 520/544 e 0 to 0 dl 1773672899 ref 1 fl Rpc:NQr/200/ffffffff rc 0/-1 job:'kworker.0' uid:0 gid:0 projid:4294967295 [ 4790.998068] LustreError: 72873:0:(import.c:371:ptlrpc_invalidate_import()) MGS: Unregistering RPCs found (0). Network is sluggish? Waiting for them to error out. [ 4791.272948] LustreError: 3634:0:(client.c:1390:ptlrpc_import_delay_req()) @@@ invalidate in flight req@ffff994f439d4000 x1859825927937280/t0(0) o250->MGC192.168.201.103@tcp@0@lo:26/25 lens 520/544 e 0 to 0 dl 0 ref 1 fl Rpc:NQU/200/ffffffff rc 0/-1 job:'kworker.0' uid:0 gid:0 projid:4294967295 [ 4791.482516] Lustre: lustre-MDT0000: Not available for connect from 192.168.201.3@tcp (not set up) [ 4791.491762] Lustre: Skipped 1 previous similar message [ 4796.994322] Lustre: 6490:0:(mdt_recovery.c:102:mdt_req_from_lrd()) @@@ restoring transno req@ffff995075b15c00 x1859825953162624/t94489280517(0) o36->0dd72a9b-fa7c-4aef-9bbf-cbf02c000f62@192.168.201.3@tcp:491/0 lens 552/2880 e 0 to 0 dl 1773672916 ref 1 fl Interpret:/202/0 rc 0/0 job:'mkdir.0' uid:0 gid:0 projid:4294967295 [ 4797.032857] Lustre: lustre-OST0001: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x2c0000401:9027 to 0x2c0000401:9089) [ 4797.033700] Lustre: lustre-OST0000: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x280000401:8997 to 0x280000401:9089) [ 4798.397585] Lustre: DEBUG MARKER: oleg103-server.virtnet: executing set_default_debug -1 all [ 4811.272645] Lustre: DEBUG MARKER: oleg103-client.virtnet: executing wait_import_state_mount (FULL|IDLE) mdc.lustre-MDT0000-mdc-*.mds_server_uuid [ 4813.732283] Lustre: DEBUG MARKER: mdc.lustre-MDT0000-mdc-*.mds_server_uuid in FULL state after 0 sec [ 4829.920779] Lustre: DEBUG MARKER: == recovery-small test 108: client eviction don't crash == 10:55:35 (1773672935) [ 4831.642292] Lustre: 73643:0:(genops.c:1793:obd_export_evict_by_uuid()) lustre-OST0000: evicting 0dd72a9b-fa7c-4aef-9bbf-cbf02c000f62 at adminstrative request [ 4847.721990] Lustre: DEBUG MARKER: == recovery-small test 110a: create remote directory: drop client req ========================================================== 10:55:52 (1773672952) [ 4853.600989] Lustre: *** cfs_fail_loc=123, val=2147483648*** [ 4869.802397] Lustre: lustre-MDT0000: Client 0dd72a9b-fa7c-4aef-9bbf-cbf02c000f62 (at 192.168.201.3@tcp) reconnecting [ 4869.836273] Lustre: Skipped 3 previous similar messages [ 4883.930456] Lustre: DEBUG MARKER: == recovery-small test 110b: create remote directory: drop Master rep ========================================================== 10:56:29 (1773672989) [ 4885.422198] LustreError: 13715:0:(ldlm_lib.c:3325:target_send_reply_msg()) @@@ dropping reply req@ffff994f46d85f80 x1859825953191424/t4295079609(0) o36->0dd72a9b-fa7c-4aef-9bbf-cbf02c000f62@192.168.201.3@tcp:579/0 lens 560/536 e 0 to 0 dl 1773673004 ref 1 fl Interpret:/200/0 rc 0/0 job:'lfs.0' uid:0 gid:0 projid:4294967295 [ 4901.582586] Lustre: 6491:0:(mdt_recovery.c:102:mdt_req_from_lrd()) @@@ restoring transno req@ffff99507c1f0e00 x1859825953191424/t4295079609(0) o36->0dd72a9b-fa7c-4aef-9bbf-cbf02c000f62@192.168.201.3@tcp:595/0 lens 560/2880 e 0 to 0 dl 1773673020 ref 1 fl Interpret:/202/0 rc 0/0 job:'lfs.0' uid:0 gid:0 projid:4294967295 [ 4914.844218] Lustre: DEBUG MARKER: == recovery-small test 110c: create remote directory: drop update rep on slave MDT ========================================================== 10:57:00 (1773673020) [ 4917.093152] Lustre: *** cfs_fail_loc=1701, val=2147483648*** [ 4917.097979] Lustre: Skipped 1 previous similar message [ 4933.599211] Lustre: 7416:0:(client.c:2478:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1773673025/real 1773673025] req@ffff99507f684380 x1859825928015360/t0(0) o1000->lustre-MDT0000-osp-MDT0001@0@lo:24/4 lens 264/4320 e 0 to 1 dl 1773673041 ref 2 fl Rpc:XQr/200/ffffffff rc 0/-1 job:'osp_up0-1.0' uid:0 gid:0 projid:4294967295 [ 4933.653512] Lustre: lustre-MDT0000-osp-MDT0001: Connection to lustre-MDT0000 (at 0@lo) was lost; in progress operations using this service will wait for recovery to complete [ 4933.675208] Lustre: Skipped 20 previous similar messages [ 4933.697491] Lustre: lustre-MDT0000: Received new MDS connection from 0@lo, keep former export from same NID [ 4933.750670] Lustre: lustre-MDT0000-osp-MDT0001: Connection restored to 0@lo (at 0@lo) [ 4933.764061] Lustre: Skipped 23 previous similar messages [ 4950.161799] Lustre: DEBUG MARKER: == recovery-small test 110d: remove remote directory: drop client req ========================================================== 10:57:34 (1773673054) [ 4952.111939] Lustre: *** cfs_fail_loc=123, val=2147483648*** [ 5021.442352] Lustre: DEBUG MARKER: == recovery-small test 110e: remove remote directory: drop master rep ========================================================== 10:58:47 (1773673127) [ 5023.540462] LustreError: 6490:0:(ldlm_lib.c:3325:target_send_reply_msg()) @@@ dropping reply req@ffff99507fdfca80 x1859825953221632/t4295079628(0) o36->0dd72a9b-fa7c-4aef-9bbf-cbf02c000f62@192.168.201.3@tcp:1/0 lens 496/456 e 0 to 0 dl 1773673181 ref 1 fl Interpret:/200/0 rc 0/0 job:'rm.0' uid:0 gid:0 projid:4294967295 [ 5023.600605] LustreError: 6490:0:(ldlm_lib.c:3325:target_send_reply_msg()) Skipped 1 previous similar message [ 5078.248529] Lustre: 20832:0:(mdt_recovery.c:102:mdt_req_from_lrd()) @@@ restoring transno req@ffff994f451e4000 x1859825953221632/t4295079628(0) o36->0dd72a9b-fa7c-4aef-9bbf-cbf02c000f62@192.168.201.3@tcp:56/0 lens 496/2888 e 0 to 0 dl 1773673236 ref 1 fl Interpret:/202/0 rc 0/0 job:'rm.0' uid:0 gid:0 projid:4294967295 [ 5094.783198] Lustre: DEBUG MARKER: == recovery-small test 110f: remove remote directory: drop slave rep ========================================================== 10:59:59 (1773673199) [ 5113.311267] Lustre: 7416:0:(client.c:2478:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1773673205/real 1773673205] req@ffff994f44462680 x1859825928118016/t0(0) o1000->lustre-MDT0000-osp-MDT0001@0@lo:24/4 lens 2592/4320 e 0 to 1 dl 1773673221 ref 2 fl Rpc:XQr/200/ffffffff rc 0/-1 job:'osp_up0-1.0' uid:0 gid:0 projid:4294967295 [ 5113.362861] Lustre: lustre-MDT0000: Received new MDS connection from 0@lo, keep former export from same NID [ 5128.795393] Lustre: DEBUG MARKER: == recovery-small test 110g: drop reply during migration ========================================================== 11:00:33 (1773673233) [ 5185.741146] Lustre: 6504:0:(mdt_recovery.c:102:mdt_req_from_lrd()) @@@ restoring transno req@ffff994f45cb5180 x1859825953238912/t4295079633(0) o36->0dd72a9b-fa7c-4aef-9bbf-cbf02c000f62@192.168.201.3@tcp:163/0 lens 704/2888 e 0 to 0 dl 1773673343 ref 1 fl Interpret:/202/0 rc 0/0 job:'lfs.0' uid:0 gid:0 projid:4294967295 [ 5198.999831] Lustre: DEBUG MARKER: == recovery-small test 110h: drop update reply during cross-MDT file rename ========================================================== 11:01:44 (1773673304) [ 5202.679451] Lustre: *** cfs_fail_loc=1701, val=2147483648*** [ 5202.690586] Lustre: Skipped 3 previous similar messages [ 5218.271204] Lustre: 7416:0:(client.c:2478:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1773673310/real 1773673310] req@ffff99507814df80 x1859825928185984/t0(0) o1000->lustre-MDT0000-osp-MDT0001@0@lo:24/4 lens 2584/4320 e 0 to 1 dl 1773673326 ref 2 fl Rpc:XQr/200/ffffffff rc 0/-1 job:'osp_up0-1.0' uid:0 gid:0 projid:4294967295 [ 5218.349032] Lustre: lustre-MDT0000: Received new MDS connection from 0@lo, keep former export from same NID [ 5232.274521] Lustre: DEBUG MARKER: == recovery-small test 110i: drop update reply during cross-MDT dir rename ========================================================== 11:02:17 (1773673337) [ 5251.551317] Lustre: 7416:0:(client.c:2478:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1773673343/real 1773673343] req@ffff99507bccaa00 x1859825928211712/t0(0) o1000->lustre-MDT0000-osp-MDT0001@0@lo:24/4 lens 2456/4320 e 0 to 1 dl 1773673359 ref 2 fl Rpc:XQr/200/ffffffff rc 0/-1 job:'osp_up0-1.0' uid:0 gid:0 projid:4294967295 [ 5251.621563] Lustre: lustre-MDT0000: Received new MDS connection from 0@lo, keep former export from same NID [ 5264.208702] Lustre: DEBUG MARKER: == recovery-small test 110j: drop update reply during cross-MDT ln ========================================================== 11:02:49 (1773673369) [ 5282.789025] Lustre: 7416:0:(client.c:2478:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1773673374/real 1773673374] req@ffff99507bcc8a80 x1859825928230784/t0(0) o1000->lustre-MDT0000-osp-MDT0001@0@lo:24/4 lens 2240/4320 e 0 to 1 dl 1773673390 ref 2 fl Rpc:XQr/200/ffffffff rc 0/-1 job:'osp_up0-1.0' uid:0 gid:0 projid:4294967295 [ 5282.824515] Lustre: lustre-MDT0000: Received new MDS connection from 0@lo, keep former export from same NID [ 5295.131551] Lustre: DEBUG MARKER: == recovery-small test 110k: FID_QUERY failed during recovery ========================================================== 11:03:20 (1773673400) [ 5298.567314] Lustre: Failing over lustre-MDT0001 [ 5298.659215] LustreError: lustre-MDT0001-osp-MDT0000: operation mds_statfs to node 0@lo failed: rc = -107 [ 5298.666569] LustreError: Skipped 1 previous similar message [ 5298.674679] Lustre: lustre-MDT0001: Not available for connect from 0@lo (stopping) [ 5299.122989] Lustre: server umount lustre-MDT0001 complete [ 5303.266622] LustreError: 7419:0:(ldlm_lib.c:1178:target_handle_connect()) lustre-MDT0001: not available for connect from 0@lo (no target). If you are running an HA pair check that the target is mounted on the other server. [ 5303.322805] LustreError: 7419:0:(ldlm_lib.c:1178:target_handle_connect()) Skipped 123 previous similar messages [ 5317.510939] LDISKFS-fs (dm-1): mounted filesystem with ordered data mode. Opts: user_xattr,errors=remount-ro,no_mbcache,nodelalloc [ 5318.584578] Lustre: lustre-MDT0001: Imperative Recovery enabled, recovery window shrunk from 60-180 down to 60-180 [ 5318.632425] Lustre: *** cfs_fail_loc=1103, val=0*** [ 5318.650764] Lustre: lustre-MDT0001: in recovery but waiting for the first client to connect [ 5318.652105] Lustre: lustre-MDT0001: Aborting client recovery [ 5318.660261] Lustre: Skipped 7 previous similar messages [ 5318.669855] LustreError: 77716:0:(ldlm_lib.c:2983:target_stop_recovery_thread()) lustre-MDT0001: Aborting recovery [ 5318.677095] Lustre: 77744:0:(ldlm_lib.c:2386:target_recovery_overseer()) recovery is aborted, evict exports in recovery [ 5318.682762] Lustre: 77744:0:(ldlm_lib.c:2386:target_recovery_overseer()) Skipped 2 previous similar messages [ 5320.805140] Lustre: 77744:0:(genops.c:1620:class_disconnect_stale_exports()) lustre-MDT0001: disconnect stale client lustre-MDT0000-mdtlov_UUID@ [ 5320.848138] Lustre: 77744:0:(genops.c:1620:class_disconnect_stale_exports()) Skipped 1 previous similar message [ 5320.873337] Lustre: lustre-MDT0001: disconnecting 1 stale clients [ 5320.896928] Lustre: 77744:0:(ldlm_lib.c:2386:target_recovery_overseer()) recovery is aborted, evict exports in recovery [ 5320.916478] Lustre: lustre-MDT0001-osd: cancel update llog [0x240000400:0x1:0x0] [ 5320.936095] Lustre: lustre-MDT0000-osp-MDT0001: cancel update llog [0x200000401:0x1:0x0] [ 5320.995378] Lustre: lustre-OST0000: new connection from lustre-MDT0001-mdtlov (cleaning up unused objects from 0x280000400:17057 to 0x280000400:17313) [ 5321.000578] Lustre: lustre-OST0001: new connection from lustre-MDT0001-mdtlov (cleaning up unused objects from 0x2c0000400:18985 to 0x2c0000400:19009) [ 5323.794488] LustreError: lustre-MDT0001-osp-MDT0000: This client was evicted by lustre-MDT0001; in progress operations using this service will fail. [ 5327.961495] Lustre: DEBUG MARKER: oleg103-server.virtnet: executing set_default_debug -1 all [ 5335.117038] Lustre: Failing over lustre-MDT0001 [ 5335.546711] Lustre: server umount lustre-MDT0001 complete [ 5348.291767] LDISKFS-fs (dm-1): mounted filesystem with ordered data mode. Opts: user_xattr,errors=remount-ro,no_mbcache,nodelalloc [ 5349.045632] Lustre: lustre-MDT0001: Imperative Recovery enabled, recovery window shrunk from 60-180 down to 60-180 [ 5354.501292] Lustre: lustre-MDT0001: Will be in recovery for at least 1:00, or until 1 client reconnects [ 5354.532215] Lustre: Skipped 8 previous similar messages [ 5354.587795] Lustre: lustre-MDT0001: Recovery over after 0:01, of 1 clients 1 recovered and 0 were evicted. [ 5354.620732] Lustre: Skipped 8 previous similar messages [ 5354.693826] Lustre: lustre-OST0001: new connection from lustre-MDT0001-mdtlov (cleaning up unused objects from 0x2c0000400:18985 to 0x2c0000400:19041) [ 5354.696153] Lustre: lustre-OST0000: new connection from lustre-MDT0001-mdtlov (cleaning up unused objects from 0x280000400:17057 to 0x280000400:17345) [ 5355.494129] Lustre: DEBUG MARKER: oleg103-server.virtnet: executing set_default_debug -1 all [ 5383.170843] Lustre: DEBUG MARKER: == recovery-small test 110m: update resent vs original RPC race ========================================================== 11:04:48 (1773673488) [ 5386.714138] LustreError: 6495:0:(out_handler.c:1175:out_handle()) cfs_race id 525 sleeping [ 5391.839749] LustreError: 6495:0:(out_handler.c:1175:out_handle()) cfs_fail_race id 525 awake: rc=0 [ 5391.902420] LustreError: 7931:0:(out_handler.c:1175:out_handle()) cfs_fail_race id 525 waking [ 5395.428842] Lustre: lustre-MDT0000: Received new MDS connection from 0@lo, keep former export from same NID [ 5395.619115] LustreError: 6495:0:(out_handler.c:1175:out_handle()) cfs_fail_race id 525 waking [ 5395.630686] LustreError: 6495:0:(out_handler.c:1175:out_handle()) Skipped 1 previous similar message [ 5406.321697] Lustre: DEBUG MARKER: == recovery-small test 111: mdd setup fail should not cause umount oops ========================================================== 11:05:12 (1773673512) [ 5410.778451] Lustre: Failing over lustre-MDT0000 [ 5410.825788] Lustre: lustre-MDT0000: Not available for connect from 0@lo (stopping) [ 5410.842190] Lustre: Skipped 2 previous similar messages [ 5411.351670] Lustre: server umount lustre-MDT0000 complete [ 5423.582859] LDISKFS-fs (dm-0): mounted filesystem with ordered data mode. Opts: user_xattr,errors=remount-ro,no_mbcache,nodelalloc [ 5423.877576] LustreError: MGC192.168.201.103@tcp: Connection to MGS (at 0@lo) was lost; in progress operations using this service will fail [ 5423.892627] LustreError: Skipped 2 previous similar messages [ 5424.503658] Lustre: lustre-MDT0000: Imperative Recovery not enabled, recovery window 60-180 [ 5424.517738] Lustre: Skipped 4 previous similar messages [ 5424.561488] Lustre: *** cfs_fail_loc=151, val=0*** [ 5424.583988] LustreError: 80403:0:(mdd_device.c:674:mdd_changelog_init()) lustre-MDD0000: changelog setup during init failed: rc = -5 [ 5424.613526] LustreError: 80403:0:(mdd_device.c:1406:mdd_prepare()) lustre-MDD0000: failed to initialize changelog: rc = -5 [ 5424.641471] LustreError: 80403:0:(tgt_mount.c:2566:server_fill_super()) Unable to start targets: -5 [ 5424.683293] Lustre: Failing over lustre-MDT0000 [ 5425.011331] Lustre: server umount lustre-MDT0000 complete [ 5425.014951] LustreError: 80403:0:(super25.c:186:lustre_fill_super()) llite: Unable to mount : rc = -5 [ 5436.537864] LDISKFS-fs (dm-0): mounted filesystem with ordered data mode. Opts: user_xattr,errors=remount-ro,no_mbcache,nodelalloc [ 5436.770863] LustreError: 80832:0:(ldlm_resource.c:1170:ldlm_resource_complain()) MGC192.168.201.103@tcp: namespace resource [0x65727473756c:0x0:0x0].0x0 (ffff995061e3fc00) refcount nonzero (1) after lock cleanup; forcing cleanup. [ 5436.849220] LustreError: 6486:0:(mgc_request.c:614:do_requeue()) failed processing log: -5 [ 5443.173201] Lustre: lustre-OST0001: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x2c0000401:9092 to 0x2c0000401:9121) [ 5443.178345] Lustre: lustre-OST0000: new connection from lustre-MDT0000-mdtlov (cleaning up unused objects from 0x280000401:9092 to 0x280000401:9121) [ 5444.506954] Lustre: DEBUG MARKER: oleg103-server.virtnet: executing set_default_debug -1 all [ 5459.703648] Lustre: DEBUG MARKER: == recovery-small test 112a: bulk resend while orignal request is in progress ========================================================== 11:06:05 (1773673565) [ 5463.136320] LustreError: 53318:0:(tgt_handler.c:2735:tgt_brw_write()) cfs_fail_timeout id 214 sleeping for 20000ms [ 5483.159258] LustreError: 53318:0:(tgt_handler.c:2735:tgt_brw_write()) cfs_fail_timeout id 214 awake [ 5498.313549] Lustre: DEBUG MARKER: == recovery-small test 115a: read: late REQ MDunlink and no bulk ========================================================== 11:06:43 (1773673603) [ 5514.388583] Lustre: DEBUG MARKER: == recovery-small test 115b: write: late REQ MDunlink and no bulk ========================================================== 11:06:59 (1773673619) [ 5519.140649] Lustre: *** cfs_fail_loc=215, val=2*** [ 5528.937658] Lustre: DEBUG MARKER: == recovery-small test 115c: read: late Reply MDunlink and no bulk ========================================================== 11:07:14 (1773673634) [ 5545.717532] Lustre: DEBUG MARKER: == recovery-small test 115d: write: late Reply MDunlink and no bulk ========================================================== 11:07:30 (1773673650) [ 5549.540621] Lustre: *** cfs_fail_loc=215, val=0*** [ 5559.829741] Lustre: DEBUG MARKER: == recovery-small test 115e: read: late Bulk MDunlink and no reply ========================================================== 11:07:45 (1773673665) [ 5564.899098] LustreError: 10599:0:(ldlm_lib.c:3325:target_send_reply_msg()) @@@ dropping reply req@ffff994f46d84000 x1859825928402304/t0(0) o13->lustre-MDT0001-mdtlov_UUID@0@lo:504/0 lens 224/368 e 0 to 0 dl 1773673684 ref 1 fl Interpret:/200/0 rc 0/0 job:'osp-pre-1-1.0' uid:0 gid:0 projid:4294967295 [ 5564.940592] LustreError: 10599:0:(ldlm_lib.c:3325:target_send_reply_msg()) Skipped 5 previous similar messages [ 5574.722212] Lustre: DEBUG MARKER: == recovery-small test 115f: read: late REQ MDunlink and no reply ========================================================== 11:08:00 (1773673680) [ 5582.372829] Lustre: 3638:0:(client.c:2478:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1773673673/real 1773673673] req@ffff99507fdf9c00 x1859825928402304/t0(0) o13->lustre-OST0001-osc-MDT0001@0@lo:7/4 lens 224/368 e 0 to 1 dl 1773673689 ref 1 fl Rpc:XQr/200/ffffffff rc 0/-1 job:'osp-pre-1-1.0' uid:0 gid:0 projid:4294967295 [ 5582.442130] Lustre: lustre-OST0001-osc-MDT0001: Connection to lustre-OST0001 (at 0@lo) was lost; in progress operations using this service will wait for recovery to complete [ 5582.453486] Lustre: Skipped 15 previous similar messages [ 5582.467548] Lustre: lustre-OST0001: Client lustre-MDT0001-mdtlov_UUID (at 0@lo) reconnecting [ 5582.487659] Lustre: Skipped 4 previous similar messages [ 5582.502752] Lustre: lustre-OST0001-osc-MDT0001: Connection restored to 0@lo (at 0@lo) [ 5582.513448] Lustre: Skipped 15 previous similar messages [ 5647.134985] Lustre: DEBUG MARKER: == recovery-small test 115g: read: late REQ MDunlink and Reply MDunlink ========================================================== 11:09:12 (1773673752) [ 5716.543537] Lustre: DEBUG MARKER: == recovery-small test 120: flock race: completion vs. evict ========================================================== 11:10:22 (1773673822) [ 5720.187215] Lustre: 83540:0:(genops.c:1793:obd_export_evict_by_uuid()) lustre-MDT0000: evicting 50b6f219-0167-4a18-8ed7-c655d95343ea at adminstrative request [ 5726.560207] Lustre: 83589:0:(genops.c:1793:obd_export_evict_by_uuid()) lustre-MDT0000: evicting 50b6f219-0167-4a18-8ed7-c655d95343ea at adminstrative request [ 5734.667961] Lustre: 83639:0:(genops.c:1793:obd_export_evict_by_uuid()) lustre-MDT0000: evicting 50b6f219-0167-4a18-8ed7-c655d95343ea at adminstrative request [ 5738.791211] Lustre: 83688:0:(genops.c:1793:obd_export_evict_by_uuid()) lustre-MDT0000: evicting 50b6f219-0167-4a18-8ed7-c655d95343ea at adminstrative request [ 5751.182039] Lustre: 83758:0:(genops.c:1793:obd_export_evict_by_uuid()) lustre-MDT0000: evicting 50b6f219-0167-4a18-8ed7-c655d95343ea at adminstrative request [ 5762.520347] Lustre: 83855:0:(genops.c:1793:obd_export_evict_by_uuid()) lustre-MDT0000: evicting 50b6f219-0167-4a18-8ed7-c655d95343ea at adminstrative request [ 5762.551425] Lustre: 83855:0:(genops.c:1793:obd_export_evict_by_uuid()) Skipped 1 previous similar message [ 5781.099684] Lustre: 83975:0:(genops.c:1793:obd_export_evict_by_uuid()) lustre-MDT0000: evicting 50b6f219-0167-4a18-8ed7-c655d95343ea at adminstrative request [ 5781.113700] Lustre: 83975:0:(genops.c:1793:obd_export_evict_by_uuid()) Skipped 1 previous similar message [ 5790.444701] Lustre: DEBUG MARKER: == recovery-small test 113: ldlm enqueue dropped reply should not cause deadlocks ========================================================== 11:11:36 (1773673896) [ 5792.331229] Lustre: *** cfs_fail_loc=157, val=2147483648*** [ 5792.337447] Lustre: Skipped 4 previous similar messages [ 5839.965352] Lustre: DEBUG MARKER: == recovery-small test 130a: enqueue resend on not existing file ========================================================== 11:12:25 (1773673945) [ 5842.425791] LustreError: 13715:0:(mdt_handler.c:5427:mdt_intent_opc()) cfs_fail_timeout id 160 sleeping for 10000ms [ 5852.479288] LustreError: 13715:0:(mdt_handler.c:5427:mdt_intent_opc()) cfs_fail_timeout id 160 awake [ 5882.795740] Lustre: DEBUG MARKER: == recovery-small test 130b: enqueue resend on a stale inode ========================================================== 11:13:08 (1773673988) [ 5885.148871] LustreError: 13715:0:(mdt_handler.c:5427:mdt_intent_opc()) cfs_fail_timeout id 160 sleeping for 10000ms [ 5895.193801] LustreError: 13715:0:(mdt_handler.c:5427:mdt_intent_opc()) cfs_fail_timeout id 160 awake [ 5915.347505] Lustre: *** cfs_fail_loc=217, val=0*** [ 5928.252416] Lustre: DEBUG MARKER: == recovery-small test 130c: layout intent resend on a stale inode ========================================================== 11:13:54 (1773674034) [ 5932.822293] LustreError: 6490:0:(mdt_handler.c:5427:mdt_intent_opc()) cfs_fail_timeout id 160 sleeping for 10000ms [ 5942.874500] LustreError: 6490:0:(mdt_handler.c:5427:mdt_intent_opc()) cfs_fail_timeout id 160 awake [ 5971.358647] Lustre: DEBUG MARKER: == recovery-small test 132: long punch =================== 11:14:36 (1773674076) [ 5973.611885] LustreError: 8375:0:(ofd_dev.c:2175:ofd_punch_hdl()) cfs_fail_timeout id 236 sleeping for 120000ms [ 6046.687168] Lustre: ll_ost_io00_000: service thread pid 8375 was inactive for 73.075 seconds. The thread might be hung, or it might only be slow and will resume later. Dumping the stack trace for debugging purposes: [ 6046.740284] task:ll_ost_io00_000 state:R stack:0 pid:8375 ppid:2 flags:0x80004000 [ 6046.760112] Call Trace: [ 6046.764757] __schedule+0x351/0xcb0 [ 6046.769444] schedule+0xc0/0x180 [ 6046.772918] schedule_timeout+0xb4/0x190 [ 6046.777836] ? __next_timer_interrupt+0x160/0x160 [ 6046.783175] ? kvm_clock_get_cycles+0x2c/0x50 [ 6046.790156] ? ktime_get+0x65/0x110 [ 6046.801542] schedule_timeout_uninterruptible+0x2d/0x40 [ 6046.814040] __cfs_fail_timeout_set+0x13b/0x240 [libcfs] [ 6046.824885] ofd_punch_hdl+0x4ec/0xbc0 [ofd] [ 6046.833847] tgt_handle_request0+0x137/0xaf0 [ptlrpc] [ 6046.843851] tgt_request_handle+0x573/0x1e70 [ptlrpc] [ 6046.855807] ptlrpc_server_handle_request+0x443/0x13b0 [ptlrpc] [ 6046.865546] ? lprocfs_counter_add+0x15b/0x210 [obdclass] [ 6046.871740] ptlrpc_main+0xce8/0x1400 [ptlrpc] [ 6046.880902] ? ptlrpc_wait_event+0x690/0x690 [ptlrpc] [ 6046.889528] kthread+0x1d1/0x200 [ 6046.891889] ? set_kthread_struct+0x70/0x70 [ 6046.896868] ret_from_fork+0x1f/0x30 [ 6093.695149] LustreError: 8375:0:(ofd_dev.c:2175:ofd_punch_hdl()) cfs_fail_timeout id 236 awake [ 6093.711911] Lustre: ll_ost_io00_000: service thread pid 8375 completed after 120.100s. This likely indicates the system was overloaded (too many service threads, or not enough hardware resources). [ 6110.836160] Lustre: DEBUG MARKER: == recovery-small test 131: IO vs evict results to IO under staled lock ========================================================== 11:16:56 (1773674216) [ 6116.644026] Lustre: 85956:0:(genops.c:1793:obd_export_evict_by_uuid()) lustre-OST0000: evicting 50b6f219-0167-4a18-8ed7-c655d95343ea at adminstrative request [ 6116.677331] LustreError: 13152:0:(ldlm_lockd.c:2933:ldlm_bl_thread_exports()) cfs_fail_timeout id 31e sleeping for 4000ms [ 6120.377949] Lustre: DEBUG MARKER: recovery-small test_131: @@@@@@ FAIL: dd succeeded [ 6120.746865] LustreError: 13152:0:(ldlm_lockd.c:2933:ldlm_bl_thread_exports()) cfs_fail_timeout id 31e awake