linux

korg/linux

mirror of https://mirrors.bfsu.edu.cn/git/linux.git synced 2024-12-04 01:24:12 +08:00

History

Naohiro Aota 9ce7466f37 btrfs: ensure pages are unlocked on cow_file_range() failure There is a hung_task report on zoned btrfs like below. https://github.com/naota/linux/issues/59 [726.328648] INFO: task rocksdb:high0:11085 blocked for more than 241 seconds. [726.329839] Not tainted 5.16.0-rc1+ #1 [726.330484] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message. [726.331603] task:rocksdb:high0 state:D stack: 0 pid:11085 ppid: 11082 flags:0x00000000 [726.331608] Call Trace: [726.331611] <TASK> [726.331614] __schedule+0x2e5/0x9d0 [726.331622] schedule+0x58/0xd0 [726.331626] io_schedule+0x3f/0x70 [726.331629] __folio_lock+0x125/0x200 [726.331634] ? find_get_entries+0x1bc/0x240 [726.331638] ? filemap_invalidate_unlock_two+0x40/0x40 [726.331642] truncate_inode_pages_range+0x5b2/0x770 [726.331649] truncate_inode_pages_final+0x44/0x50 [726.331653] btrfs_evict_inode+0x67/0x480 [726.331658] evict+0xd0/0x180 [726.331661] iput+0x13f/0x200 [726.331664] do_unlinkat+0x1c0/0x2b0 [726.331668] __x64_sys_unlink+0x23/0x30 [726.331670] do_syscall_64+0x3b/0xc0 [726.331674] entry_SYSCALL_64_after_hwframe+0x44/0xae [726.331677] RIP: 0033:0x7fb9490a171b [726.331681] RSP: 002b:00007fb943ffac68 EFLAGS: 00000246 ORIG_RAX: 0000000000000057 [726.331684] RAX: ffffffffffffffda RBX: 0000000000000000 RCX: 00007fb9490a171b [726.331686] RDX: 00007fb943ffb040 RSI: 000055a6bbe6ec20 RDI: 00007fb94400d300 [726.331687] RBP: 00007fb943ffad00 R08: 0000000000000000 R09: 0000000000000000 [726.331688] R10: 0000000000000031 R11: 0000000000000246 R12: 00007fb943ffb000 [726.331690] R13: 00007fb943ffb040 R14: 0000000000000000 R15: 00007fb943ffd260 [726.331693] </TASK> While we debug the issue, we found running fstests generic/551 on 5GB non-zoned null_blk device in the emulated zoned mode also had a similar hung issue. Also, we can reproduce the same symptom with an error injected cow_file_range() setup. The hang occurs when cow_file_range() fails in the middle of allocation. cow_file_range() called from do_allocation_zoned() can split the give region ([start, end]) for allocation depending on current block group usages. When btrfs can allocate bytes for one part of the split regions but fails for the other region (e.g. because of -ENOSPC), we return the error leaving the pages in the succeeded regions locked. Technically, this occurs only when @unlock == 0. Otherwise, we unlock the pages in an allocated region after creating an ordered extent. Considering the callers of cow_file_range(unlock=0) won't write out the pages, we can unlock the pages on error exit from cow_file_range(). So, we can ensure all the pages except @locked_page are unlocked on error case. In summary, cow_file_range now behaves like this: - page_started == 1 (return value) - All the pages are unlocked. IO is started. - unlock == 1 - All the pages except @locked_page are unlocked in any case - unlock == 0 - On success, all the pages are locked for writing out them - On failure, all the pages except @locked_page are unlocked Fixes: `42c0110009` ("btrfs: zoned: introduce dedicated data write path for zoned filesystems") CC: stable@vger.kernel.org # 5.12+ Reviewed-by: Filipe Manana <fdmanana@suse.com> Signed-off-by: Naohiro Aota <naohiro.aota@wdc.com> Signed-off-by: David Sterba <dsterba@suse.com>		2022-07-25 17:45:38 +02:00
..
tests	Revert "btrfs: turn fs_info member buffer_radix into XArray"	2022-07-15 19:14:33 +02:00
acl.c	btrfs: reserve correct number of items for inode creation	2022-05-16 17:03:08 +02:00
async-thread.c	btrfs: simplify WQ_HIGHPRI handling in struct btrfs_workqueue	2022-05-16 17:03:15 +02:00
async-thread.h	btrfs: remove unused typedefs get_extent_t and btrfs_work_func_t	2022-07-25 17:45:36 +02:00
backref.c	btrfs: sink iterator parameter to btrfs_ioctl_logical_to_ino	2022-07-25 17:45:36 +02:00
backref.h	btrfs: sink iterator parameter to btrfs_ioctl_logical_to_ino	2022-07-25 17:45:36 +02:00
block-group.c	btrfs: zoned: zone finish unused block group	2022-05-16 17:17:32 +02:00
block-group.h	btrfs: zoned: prevent allocation from previous data relocation BG	2022-06-21 14:43:48 +02:00
block-rsv.c	btrfs: reserve extra space for the free space tree	2022-01-07 14:18:25 +01:00
block-rsv.h	btrfs: init root block_rsv at init root time	2022-01-03 15:09:48 +01:00
btrfs_inode.h	btrfs: move struct btrfs_dio_private to inode.c	2022-05-16 17:17:32 +02:00
check-integrity.c	btrfs: check-integrity: simplify bio allocation in btrfsic_read_block	2022-05-16 17:03:12 +02:00
check-integrity.h	btrfs: check-integrity: split submit_bio from btrfsic checking	2022-05-16 17:03:12 +02:00
compression.c	btrfs: remove btrfs_end_io_wq	2022-07-25 17:45:33 +02:00
compression.h	btrfs: don't use btrfs_bio_wq_end_io for compressed writes	2022-07-25 17:45:33 +02:00
ctree.c	btrfs: sink parameter is_data to btrfs_set_disk_extent_flags	2022-05-16 17:17:31 +02:00
ctree.h	btrfs: collect commit stats, count, duration	2022-07-25 17:45:37 +02:00
delalloc-space.c	btrfs: avoid blocking on space revervation when doing nowait dio writes	2022-05-16 17:03:10 +02:00
delalloc-space.h
delayed-inode.c	btrfs: batch up release of reserved metadata for delayed items used for deletion	2022-07-25 17:45:37 +02:00
delayed-inode.h	btrfs: reduce amount of reserved metadata for delayed item insertion	2022-07-25 17:44:36 +02:00
delayed-ref.c	btrfs: remove btrfs_delayed_extent_op::is_data	2022-05-16 17:17:31 +02:00
delayed-ref.h	btrfs: remove btrfs_delayed_extent_op::is_data	2022-05-16 17:17:31 +02:00
dev-replace.c	btrfs: use a local variable for fs_devices pointer in btrfs_dev_replace_finishing	2022-05-16 17:03:08 +02:00
dev-replace.h	btrfs: zoned: mark block groups to copy for device-replace	2021-02-09 02:46:07 +01:00
dir-item.c	btrfs: use btrfs_for_each_slot in btrfs_search_dir_index_item	2022-05-16 17:03:07 +02:00
discard.c	btrfs: fix typos in comments	2021-06-22 14:11:57 +02:00
discard.h	btrfs: cleanup btrfs_discard_update_discardable usage	2020-12-08 15:54:02 +01:00
disk-io.c	btrfs: reject log replay if there is unsupported RO compat flag	2022-07-25 17:45:34 +02:00
disk-io.h	btrfs: remove btrfs_end_io_wq	2022-07-25 17:45:33 +02:00
export.c	btrfs: locking: rip out path->leave_spinning	2020-12-08 15:54:02 +01:00
export.h
extent_io.c	btrfs: remove extent writepage address space operation	2022-07-25 17:45:37 +02:00
extent_io.h	btrfs: remove extent writepage address space operation	2022-07-25 17:45:37 +02:00
extent_map.c	btrfs: assert we have a write lock when removing and replacing extent maps	2022-03-14 13:13:50 +01:00
extent_map.h	btrfs: defrag: don't use merged extent map for their generation check	2022-02-23 17:43:13 +01:00
extent-io-tree.h	btrfs: Convert from invalidatepage to invalidate_folio	2022-03-15 08:23:29 -04:00
extent-tree.c	btrfs: use named constant for reserved device space	2022-07-25 17:45:36 +02:00
file-item.c	btrfs: handle csum lookup errors properly on reads	2022-03-14 13:13:51 +01:00
file.c	btrfs: tree-log: make the return value for log syncing consistent	2022-07-25 17:45:34 +02:00
free-space-cache.c	Page cache changes for 5.19	2022-05-24 19:55:07 -07:00
free-space-cache.h	btrfs: change name and type of private member of btrfs_free_space_ctl	2022-01-03 15:09:50 +01:00
free-space-tree.c	btrfs: use rbtree with leftmost node cached for tracking lowest block group	2022-05-16 17:03:13 +02:00
free-space-tree.h
inode-item.c	btrfs: make should_throttle loop local in btrfs_truncate_inode_items	2022-01-07 14:18:25 +01:00
inode-item.h	btrfs: add inode to truncate control	2022-01-07 14:18:24 +01:00
inode.c	btrfs: ensure pages are unlocked on cow_file_range() failure	2022-07-25 17:45:38 +02:00
ioctl.c	btrfs: sink iterator parameter to btrfs_ioctl_logical_to_ino	2022-07-25 17:45:36 +02:00
Kconfig	btrfs: use generic Kconfig option for 256kB page size limit	2022-01-20 08:52:55 +02:00
locking.c	btrfs: don't set lock_owner when locking extent buffer for reading	2022-06-21 14:46:56 +02:00
locking.h	btrfs: assert that extent buffers are write locked instead of only locked	2021-10-26 19:08:02 +02:00
lzo.c	btrfs: replace kmap() with kmap_local_page() in lzo.c	2022-07-25 17:45:33 +02:00
Makefile	Kbuild: add -Wno-shift-negative-value where -Wextra is used	2022-03-13 17:30:31 +09:00
misc.h	btrfs: use correct header for div_u64 in misc.h	2021-09-07 14:29:50 +02:00
ordered-data.c	btrfs: remove the finish_func argument to btrfs_mark_ordered_io_finished	2022-07-25 17:45:37 +02:00
ordered-data.h	btrfs: remove the finish_func argument to btrfs_mark_ordered_io_finished	2022-07-25 17:45:37 +02:00
orphan.c
print-tree.c	btrfs: unify the error handling pattern for read_tree_block()	2022-03-14 13:13:53 +01:00
print-tree.h	btrfs: print the actual offset in btrfs_root_name	2021-01-07 17:25:05 +01:00
props.c	btrfs: move common inode creation code into btrfs_create_new_inode()	2022-05-16 17:03:08 +02:00
props.h	btrfs: move common inode creation code into btrfs_create_new_inode()	2022-05-16 17:03:08 +02:00
qgroup.c	btrfs: avoid blocking on space revervation when doing nowait dio writes	2022-05-16 17:03:10 +02:00
qgroup.h	btrfs: avoid blocking on space revervation when doing nowait dio writes	2022-05-16 17:03:10 +02:00
raid56.c	btrfs: raid56: don't trust any cached sector in __raid56_parity_recover()	2022-07-25 17:45:37 +02:00
raid56.h	btrfs: use btrfs_raid_array to calculate number of parity stripes	2022-07-25 17:45:36 +02:00
rcu-string.h
ref-verify.c	btrfs: stop accessing ->extent_root directly	2022-01-03 15:09:49 +01:00
ref-verify.h
reflink.c	btrfs: balance btree dirty pages and delayed items after clone and dedupe	2022-07-25 17:44:35 +02:00
reflink.h
relocation.c	Page cache changes for 5.19	2022-05-24 19:55:07 -07:00
root-tree.c	btrfs: avoid blocking on space revervation when doing nowait dio writes	2022-05-16 17:03:10 +02:00
scrub.c	btrfs: remove parameter dev_extent_len from scrub_stripe()	2022-07-25 17:45:36 +02:00
send.c	btrfs: send: use boolean types for current inode status	2022-07-25 17:45:37 +02:00
send.h	btrfs: send: simplify includes	2022-07-25 17:45:37 +02:00
space-info.c	btrfs: store chunk size in space-info struct	2022-07-25 17:45:32 +02:00
space-info.h	btrfs: store chunk size in space-info struct	2022-07-25 17:45:32 +02:00
struct-funcs.c	btrfs: remove redundant check in up check_setget_bounds	2022-07-25 17:45:33 +02:00
subpage.c	btrfs: remove extent writepage address space operation	2022-07-25 17:45:37 +02:00
subpage.h	btrfs: make nodesize >= PAGE_SIZE case to reuse the non-subpage routine	2022-05-16 17:03:11 +02:00
super.c	btrfs: use named constant for reserved device space	2022-07-25 17:45:36 +02:00
sysfs.c	btrfs: sysfs: export commit stats	2022-07-25 17:45:37 +02:00
sysfs.h	btrfs: split and refactor btrfs_sysfs_remove_devices_dir	2020-10-07 12:12:21 +02:00
transaction.c	btrfs: collect commit stats, count, duration	2022-07-25 17:45:37 +02:00
transaction.h	btrfs: pass btrfs_fs_info for deleting snapshots and cleaner	2022-03-14 13:13:52 +01:00
tree-checker.c	btrfs: tree-checker: check extent buffer owner against owner rootid	2022-05-16 17:03:09 +02:00
tree-checker.h	btrfs: tree-checker: check extent buffer owner against owner rootid	2022-05-16 17:03:09 +02:00
tree-defrag.c	btrfs: remove unnecessary extent root check in btrfs_defrag_leaves	2022-01-03 15:09:48 +01:00
tree-log.c	btrfs: tree-log: make the return value for log syncing consistent	2022-07-25 17:45:34 +02:00
tree-log.h	btrfs: tree-log: make the return value for log syncing consistent	2022-07-25 17:45:34 +02:00
tree-mod-log.c	btrfs: fix race when picking most recent mod log operation for an old root	2021-04-20 19:27:17 +02:00
tree-mod-log.h	btrfs: add and use helper to get lowest sequence number for the tree mod log	2021-04-19 17:25:17 +02:00
ulist.c
ulist.h
uuid-tree.c	btrfs: drop the _nr from the item helpers	2022-01-03 15:09:43 +01:00
verity.c	btrfs: drop the _nr from the item helpers	2022-01-03 15:09:43 +01:00
volumes.c	btrfs: warn about dev extents that are inside the reserved range	2022-07-25 17:45:36 +02:00
volumes.h	btrfs: increase direct io read size limit to 256 sectors	2022-07-25 17:45:37 +02:00
xattr.c	btrfs: use btrfs_for_each_slot in btrfs_listxattr	2022-05-16 17:03:08 +02:00
xattr.h
zlib.c	Revert "btrfs: compression: drop kmap/kunmap from zlib"	2021-10-29 13:03:05 +02:00
zoned.c	btrfs: zoned: fix comment description for sb_write_pointer logic	2022-07-25 17:44:33 +02:00
zoned.h	btrfs: zoned: prevent allocation from previous data relocation BG	2022-06-21 14:43:48 +02:00
zstd.c	btrfs: use non-bh spin_lock in zstd timer callback	2022-05-16 17:03:13 +02:00