== sanity-dom test sanityn: Run sanityn with Data-on-MDT files ========================================================== 04:34:17 (1743496457) mgs: Rocky Linux release 8.10 (Green Obsidian) MGS_OS_ID_LIKE=rhel centos fedora rocky MGS_OS_VERSION_ID=8.10 MGS_OS_ID=rocky MGS_OS_VERSION_CODE=134873088 mds1: Rocky Linux release 8.10 (Green Obsidian) MDS1_OS_VERSION_ID=8.10 MDS1_OS_VERSION_CODE=134873088 MDS1_OS_ID_LIKE=rhel centos fedora rocky MDS1_OS_ID=rocky ost1: Rocky Linux release 8.10 (Green Obsidian) OST1_OS_VERSION_CODE=134873088 OST1_OS_ID_LIKE=rhel centos fedora rocky OST1_OS_VERSION_ID=8.10 OST1_OS_ID=rocky client: Rocky Linux release 8.10 (Green Obsidian) CLIENT_OS_ID=rocky CLIENT_OS_VERSION_CODE=134873088 CLIENT_OS_VERSION_ID=8.10 CLIENT_OS_ID_LIKE=rhel centos fedora rocky oleg606-server: ls: cannot access '/home/green/git/lustre-release/lustre/tests/except/sanityn.*ex': No such file or directory excepting tests: 28 102 skipping tests SLOW=no: 33a === sanityn: start setup 04:35:03 (1743496503) === Starting client oleg606-client.virtnet: -o user_xattr,flock 192.168.206.106@tcp:/lustre /mnt/lustre2 Started clients oleg606-client.virtnet: 192.168.206.106@tcp:/lustre on /mnt/lustre2 type lustre (rw,checksum,flock,user_xattr,lruresize,lazystatfs,nouser_fid2path,verbose,encrypt,statfs_project) oleg606-client.virtnet: executing check_config_client /mnt/lustre oleg606-client.virtnet: Checking config lustre mounted on /mnt/lustre Checking servers environments Checking clients oleg606-client.virtnet environments Using TIMEOUT=20 osc.lustre-OST0000-osc-ffff960e27d01000.idle_timeout=debug osc.lustre-OST0000-osc-ffff960e3850c000.idle_timeout=debug osc.lustre-OST0001-osc-ffff960e27d01000.idle_timeout=debug osc.lustre-OST0001-osc-ffff960e3850c000.idle_timeout=debug disable quota as required oleg606-server: oleg606-server.virtnet: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all === sanityn: finish setup 04:36:16 (1743496576) === 1+0 records in 1+0 records out 1048576 bytes (1.0 MB, 1.0 MiB) copied, 0.104002 s, 10.1 MB/s running as uid/gid/euid/egid 500/500/500/500, groups: 500 [true] running as uid/gid/euid/egid 500/500/500/500, groups: 500 [touch] [/mnt/lustre/d0_runas_test/f66397] == sanityn test 1: Check attribute updates on 2 mount points ========================================================== 04:36:21 (1743496581) /mnt/lustre/f1.sanityn has type file OK /mnt/lustre/f1.sanityn has perms 0777 OK /mnt/lustre/f1.sanityn has type file OK /mnt/lustre/f1.sanityn has perms 0666 OK /mnt/lustre/f1.sanityn: absent OK PASS 1 (16s) == sanityn test 2a: check cached attribute updates on 2 mtpt's ================================================================== 04:36:37 (1743496597) -rw-r--r-- 1 root root 0 Apr 1 04:36 /mnt/lustre2/f2a /mnt/lustre/f2a has type file OK /mnt/lustre/f2a has perms 0777 OK PASS 2a (16s) == sanityn test 2b: check cached attribute updates on 2 mtpt's ================================================================== 04:36:54 (1743496614) -rw-r--r-- 1 root root 0 Apr 1 04:36 /mnt/lustre2/f2b /mnt/lustre2/f2b has type file OK /mnt/lustre2/f2b has perms 0777 OK PASS 2b (14s) == sanityn test 2c: check cached attribute updates on 2 mtpt's root ============================================================= 04:37:08 (1743496628) /mnt/lustre2 has type dir OK /mnt/lustre2 has perms 0777 OK PASS 2c (15s) == sanityn test 2d: check cached attribute updates on 2 mtpt's root ============================================================= 04:37:23 (1743496643) /mnt/lustre2 has type dir OK /mnt/lustre2 has perms 0755 OK PASS 2d (17s) == sanityn test 2e: check chmod on root is propagated to others ========================================================== 04:37:41 (1743496661) total 338 -rwxrwxrwx 1 root root 0 Apr 1 04:36 f2a -rwxrwxrwx 1 root root 0 Apr 1 04:36 f2b -rw-r--r-- 1 root root 1376855 Apr 1 03:51 ffsx.sanity-dom -rw-r--r-- 1 root root 0 Apr 1 03:49 ffsx.sanity-dom.fsxgood -rw-r--r-- 1 root root 76937 Apr 1 03:51 ffsx.sanity-dom.fsxlog total 338 -rwxrwxrwx 1 root root 0 Apr 1 04:36 f2a -rwxrwxrwx 1 root root 0 Apr 1 04:36 f2b -rw-r--r-- 1 root root 1376855 Apr 1 03:51 ffsx.sanity-dom -rw-r--r-- 1 root root 0 Apr 1 03:49 ffsx.sanity-dom.fsxgood -rw-r--r-- 1 root root 76937 Apr 1 03:51 ffsx.sanity-dom.fsxlog running as uid/gid/euid/egid 500/500/500/500, groups: 500 [dd] [if=/dev/zero] [of=/mnt/lustre2/f2e.sanityn] [count=1] 1+0 records in 1+0 records out 512 bytes copied, 0.0116387 s, 44.0 kB/s PASS 2e (17s) == sanityn test 2f: check attr/owner updates on DNE with 2 mtpt's ========================================================== 04:37:58 (1743496678) SKIP: sanityn test_2f needs >= 2 MDTs SKIP 2f (7s) == sanityn test 2g: check blocks update on sync write ==== 04:38:06 (1743496686) 2+0 records in 2+0 records out 2097152 bytes (2.1 MB, 2.0 MiB) copied, 0.908694 s, 2.3 MB/s /mnt/lustre/f2g.sanityn has 4113 blocks /mnt/lustre2/f2g.sanityn has 4113 blocks PASS 2g (18s) == sanityn test 4: fstat validation on multiple mount points ==================================================================== 04:38:24 (1743496704) PASS 4 (17s) == sanityn test 5: create a file on one mount, truncate it on the other ========================================================== 04:38:41 (1743496721) /mnt/lustre/f5 has type file OK /mnt/lustre/f5 has size 100 OK PASS 5 (17s) == sanityn test 6: remove of open file on other node ============================================================================ 04:38:58 (1743496738) opening writing unlinking /mnt/lustre2/f6.sanityn accessing (1) seeking (1) accessing (2) fstat... reading comparing data truncating seeking (2) writing again seeking (3) reading again comparing data again closing SUCCESS - goto beer PASS 6 (16s) == sanityn test 7: remove of open directory on other node ======================================================================= 04:39:15 (1743496755) creating directory /mnt/lustre/d7 opening directory unlinking /mnt/lustre/d7 Ok, everything goes well. PASS 7 (19s) == sanityn test 8: remove of open special file on other node ==================================================================== 04:39:35 (1743496775) creating special file /mnt/lustre/f8.sanityn opening file unlinking /mnt/lustre/f8.sanityn Ok, everything goes well. PASS 8 (16s) == sanityn test 9a: append of file with sub-page size on multiple mounts ========================================================== 04:39:51 (1743496791) PASS 9a (19s) == sanityn test 9b: append to striped sparse file ======== 04:40:11 (1743496811) 1+0 records in 1+0 records out 1048576 bytes (1.0 MB, 1.0 MiB) copied, 0.11352 s, 9.2 MB/s 3+0 records in 3+0 records out 3 bytes copied, 0.0925855 s, 0.0 kB/s Data read (expecting 'foo'): foo PASS 9b (16s) == sanityn test 10a: write of file with sub-page size on multiple mounts ========================================================== 04:40:27 (1743496827) 1+0 records in 1+0 records out 1 byte copied, 0.0254695 s, 0.0 kB/s 1+0 records in 1+0 records out 1 byte copied, 0.0305855 s, 0.0 kB/s 1+0 records in 1+0 records out 1 byte copied, 0.0247448 s, 0.0 kB/s 1+0 records in 1+0 records out 1 byte copied, 0.0133207 s, 0.1 kB/s 1+0 records in 1+0 records out 1 byte copied, 0.0121127 s, 0.1 kB/s 1+0 records in 1+0 records out 1 byte copied, 0.0177778 s, 0.1 kB/s 1+0 records in 1+0 records out 1 byte copied, 0.0136848 s, 0.1 kB/s 1+0 records in 1+0 records out 1 byte copied, 0.0409474 s, 0.0 kB/s 1+0 records in 1+0 records out 1 byte copied, 0.0179783 s, 0.1 kB/s 1+0 records in 1+0 records out 1 byte copied, 0.0197836 s, 0.1 kB/s 1+0 records in 1+0 records out 1 byte copied, 0.0158384 s, 0.1 kB/s 1+0 records in 1+0 records out 1 byte copied, 0.0165737 s, 0.1 kB/s PASS 10a (18s) == sanityn test 10b: write of file with sub-page size on multiple mounts ========================================================== 04:40:46 (1743496846) 1+0 records in 1+0 records out 3072 bytes (3.1 kB, 3.0 KiB) copied, 0.012047 s, 255 kB/s 1+0 records in 1+0 records out 4096 bytes (4.1 kB, 4.0 KiB) copied, 0.0865255 s, 47.3 kB/s 1+0 records in 1+0 records out 3072 bytes (3.1 kB, 3.0 KiB) copied, 0.000425657 s, 7.2 MB/s PASS 10b (17s) == sanityn test 11: execution of file opened for write should return error ============================================================== 04:41:03 (1743496863) multiop /mnt/lustre/d11/f vO_c TMPPIPE=/tmp/multiop_open_wait_pipe.66397 /home/green/git/lustre-release/lustre/tests/sanityn.sh: line 301: /mnt/lustre2/d11/f: Text file busy PASS 11 (17s) == sanityn test 12: test lock ordering (link, stat, unlink) ========================================================== 04:41:20 (1743496880) warning: '-runtime' deprecated, use '-t runtime' instead - link 217 (time 1743496895.56 total 10.03 last 21.63) - link 508 (time 1743496905.57 total 20.04 last 29.07) - link 828 (time 1743496915.59 total 30.07 last 31.91) - link 1132 (time 1743496925.64 total 40.12 last 30.25) - link 1470 (time 1743496935.66 total 50.14 last 33.74) - link 1700 (time 1743496945.68 total 60.15 last 22.97) - link 1919 (time 1743496955.71 total 70.18 last 21.83) - link 2141 (time 1743496965.75 total 80.22 last 22.11) - link 2353 (time 1743496975.77 total 90.24 last 21.16) total: 2594 link in 100.02 seconds: 25.93 ops/second using seed 2934385853 running for 100 seconds - stat 240 (time 1743496897 ; total 11 ; last 11) - stat 2934 (time 1743496908 ; total 22 ; last 11) - stat 5263 (time 1743496919 ; total 33 ; last 11) - stat 7803 (time 1743496930 ; total 44 ; last 11) - stat 9538 (time 1743496941 ; total 55 ; last 11) - stat 10000 (time 1743496945 ; total 59 ; last 4) - stat 10868 (time 1743496956 ; total 70 ; last 11) - stat 11752 (time 1743496967 ; total 81 ; last 11) - stat 12855 (time 1743496978 ; total 92 ; last 11) total: 13743 stats in 100 seconds: 137.429993 stats/second - unlinked 0 (time 1743496936 ; total 0 ; last 0) unlink(/mnt/lustre2/lockdir/lockfile2594) error: No such file or directory total: 2593 unlinks in 104 seconds: 24.932692 unlinks/second /home/green/git/lustre-release/lustre/tests/lockorder.sh: line 77: kill: (78984) - No such process /home/green/git/lustre-release/lustre/tests/lockorder.sh: line 78: kill: (78986) - No such process PASS 12 (174s) == sanityn test 14aa: execution of file open for write returns -ETXTBSY ========================================================== 04:44:14 (1743497054) multiop /mnt/lustre/d14aa.sanityn/f14aa.sanityn vOw_c TMPPIPE=/tmp/multiop_open_wait_pipe.66397 /home/green/git/lustre-release/lustre/tests/sanityn.sh: line 336: /mnt/lustre2/d14aa.sanityn/f14aa.sanityn: Text file busy /home/green/git/lustre-release/lustre/tests/test-framework.sh: line 4866: 79593 Terminated $MULTIOP_PROG $FILE v$ARGS > $TMPPIPE (wd: ~) PASS 14aa (17s) == sanityn test 14ab: open(RDWR) of executing file returns -ETXTBSY ========================================================== 04:44:32 (1743497072) open(O_RDWR|O_CREAT): Text file busy /home/green/git/lustre-release/lustre/tests/test-framework.sh: line 4866: 80175 Terminated $DIR1/$tdir/sleep 60 (wd: ~) PASS 14ab (16s) == sanityn test 14b: truncate of executing file returns -ETXTBSY ================================================================ 04:44:47 (1743497087) truncate: cannot truncate '/mnt/lustre2/d14b.sanityn/sleep' to length 60: Text file busy /home/green/git/lustre-release/lustre/tests/sanityn.sh: line 351: 80754 Terminated $DIR1/$tdir/sleep 60 PASS 14b (17s) == sanityn test 14c: open(O_TRUNC) of executing file return -ETXTBSY ============================================================ 04:45:04 (1743497104) cp: cannot create regular file '/mnt/lustre2/d14c.sanityn/sleep': Text file busy /home/green/git/lustre-release/lustre/tests/sanityn.sh: line 363: 81337 Terminated $DIR1/$tdir/sleep 60 PASS 14c (15s) == sanityn test 14d: chmod of executing file is still possible ================================================================== 04:45:19 (1743497119) chmod /home/green/git/lustre-release/lustre/tests/sanityn.sh: line 374: 81922 Terminated $DIR1/$tdir/sleep 60 PASS 14d (18s) == sanityn test 17: resource creation/LVB creation race ========================================================================= 04:45:38 (1743497138) fail_loc=0x8000030a PASS 17 (20s) == sanityn test 19: test concurrent uncached read races ========================================================================= 04:45:58 (1743497158) oleg606-server: error: get_param: param_path 'osd-*/lustre-MDT*/read_cache_enable': No such file or directory pdsh@oleg606-client: oleg606-server: ssh exited with exit code 2 SKIP: sanityn test_19 not cache-capable obdfilter SKIP 19 (11s) == sanityn test 20: test extra readahead page left in cache ============================================================== 04:46:09 (1743497169) PASS 20 (16s) == sanityn test 23: others should see updated atime while another read============================================================== 04:46:25 (1743497185) now is 1743497191 starting reads multiop /mnt/lustre/f23.sanityn vor20_c TMPPIPE=/tmp/multiop_open_wait_pipe.66397 new atime is 1743497252 PASS 23 (81s) == sanityn test 27: align non-overlapping extent locks from request ============================================================= 04:47:47 (1743497267) dd 1 started 4+0 records in 4+0 records out 16793600 bytes (17 MB, 16 MiB) copied, 1.26728 s, 13.3 MB/s dd 2 started 1+0 records in 1+0 records out 15728640 bytes (16 MB, 15 MiB) copied, 4.11078 s, 3.8 MB/s 1+0 records in 1+0 records out 8192 bytes (8.2 kB, 8.0 KiB) copied, 0.0532942 s, 154 kB/s dd 3 finished PASS 27 (29s) == sanityn test 39a: file mtime does not change after rename ========================================================== 04:48:16 (1743497296) repeat after cancel_lru_locks PASS 39a (16s) == sanityn test 39b: file mtime the same on clients with/out lock ========================================================== 04:48:32 (1743497312) repeat after cancel_lru_locks PASS 39b (18s) == sanityn test 39c: check truncate mtime update ================================================================================ 04:48:50 (1743497330) repeat after cancel_lru_locks PASS 39c (19s) == sanityn test 39d: sync write should update mtime ====== 04:49:09 (1743497349) fail_loc=0x411 fail_loc=0 PASS 39d (15s) == sanityn test 51a: layout lock: refresh layout should work ========================================================== 04:49:26 (1743497366) 0+1 records in 0+1 records out 158 bytes copied, 0.0291942 s, 5.4 kB/s /home/green/git/lustre-release/lustre/tests/sanityn.sh: line 3519: kill: (88310) - No such process PASS 51a (17s) == sanityn test 51c: layout lock: IT_LAYOUT blocked and correct layout can be returned ========================================================== 04:49:43 (1743497383) fail_loc=0x172 Setting layout to have 2 stripes ... 1+0 records in 1+0 records out 1024 bytes (1.0 kB, 1.0 KiB) copied, 0.0435903 s, 23.5 kB/s PASS 51c (20s) == sanityn test 51d: layout lock: losing layout lock should clean up memory map region ========================================================== 04:50:04 (1743497404) 1+0 records in 1+0 records out 1048576 bytes (1.0 MB, 1.0 MiB) copied, 0.0483548 s, 21.7 MB/s Before revoking layout lock: 1024 KB mapped PASS 51d (15s) == sanityn test 107a: Basic grouplock conflict =========== 04:50:21 (1743497421) 10+0 records in 10+0 records out 10485760 bytes (10 MB, 10 MiB) copied, 1.0885 s, 9.6 MB/s /mnt/lustre/f107a.sanityn lcm_layout_gen: 3 lcm_mirror_count: 1 lcm_entry_count: 2 lcme_id: 1 lcme_mirror_id: 0 lcme_flags: init lcme_extent.e_start: 0 lcme_extent.e_end: 1048576 lmm_stripe_count: 0 lmm_stripe_size: 1048576 lmm_pattern: mdt lmm_layout_gen: 0 lmm_stripe_offset: 0 lcme_id: 2 lcme_mirror_id: 0 lcme_flags: init lcme_extent.e_start: 1048576 lcme_extent.e_end: EOF lmm_stripe_count: 1 lmm_stripe_size: 4194304 lmm_pattern: raid0 lmm_layout_gen: 0 lmm_stripe_offset: 1 lmm_objects: - 0: { l_ost_idx: 1, l_fid: [0x280000400:0x3d4:0x0] } multiop /mnt/lustre/f107a.sanityn vOG14091995_g14091995c TMPPIPE=/tmp/multiop_open_wait_pipe.66397 multiop /mnt/lustre2/f107a.sanityn vO_G16022000r10g16022000c TMPPIPE=/tmp/multiop_open_wait_pipe.66397 First grouplock blocks second one PASS 107a (19s) == sanityn test 107b: Grouplock is added to the head of waiting list ========================================================== 04:50:40 (1743497440) 10+0 records in 10+0 records out 10485760 bytes (10 MB, 10 MiB) copied, 0.95616 s, 11.0 MB/s /mnt/lustre/f107b.sanityn lcm_layout_gen: 3 lcm_mirror_count: 1 lcm_entry_count: 2 lcme_id: 1 lcme_mirror_id: 0 lcme_flags: init lcme_extent.e_start: 0 lcme_extent.e_end: 1048576 lmm_stripe_count: 0 lmm_stripe_size: 1048576 lmm_pattern: mdt lmm_layout_gen: 0 lmm_stripe_offset: 0 lcme_id: 2 lcme_mirror_id: 0 lcme_flags: init lcme_extent.e_start: 1048576 lcme_extent.e_end: EOF lmm_stripe_count: 1 lmm_stripe_size: 4194304 lmm_pattern: raid0 lmm_layout_gen: 0 lmm_stripe_offset: 0 lmm_objects: - 0: { l_ost_idx: 0, l_fid: [0x240000400:0x3d7:0x0] } multiop /mnt/lustre/f107b.sanityn vOG14091995_g14091995c TMPPIPE=/tmp/multiop_open_wait_pipe.66397 Grouplock blocks IO First grouplock blocks second one Second grouplock blocks IO PASS 107b (27s) cleanup: ====================================================== == sanityn test complete, duration 1011 sec ============== 04:51:11 (1743497471) === sanityn: start cleanup 04:51:15 (1743497475) === Stopping clients: oleg606-client.virtnet /mnt/lustre2 (opts:) Stopping client oleg606-client.virtnet /mnt/lustre2 opts: === sanityn: finish cleanup 04:51:27 (1743497487) ===