korg/linux

mirror of https://mirrors.bfsu.edu.cn/git/linux.git synced 2024-11-20 10:44:23 +08:00

NeilBrown c340702ca2 md/raid10: don't clear bitmap bit when bad-block-list write fails.	2015-10-24 16:24:23 +11:00

When a write fails and a bad-block-list is present, we can update the bad-block-list instead of writing the data. If this succeeds then it is OK to clear the relevant bitmap-bit as no further 'sync' of the block is needed.

However if writing the bad-block-list fails then we need to treat the write as failed and particularly must not clear the bitmap bit. Otherwise the device can be re-added (after any hardware connection issues are resolved) and because the relevant bit in the bitmap is clear, that block will not be resynced. This leads to data corruption.

We already delay the final bio_endio() on the write until the bad-block-list is written so that when the write returns: either that data is safe, the bad-block record is safe, or the fact that the device is faulty is safe. However we don't delay the clearing of the bitmap, so the bitmap bit can be recorded as cleared before we know if the bad-block-list was written safely.

So: delay that until the write really is safe. i.e. move the call to close_write() until just before calling bio_endio(), and recheck the 'is array degraded' status before making that call.

This bug goes back to v3.1 when bad-block-lists were introduced, though it only affects arrays created with mdadm-3.3 or later as only those have bad-block lists.

Backports will require at least Commit: `95af587e95` ("md/raid10: ensure device failure recorded before write request returns.") as well. I'll send that to 'stable' separately.

Note that of the two tests of R10BIO_WriteError that this patch adds, the first is certain to fail and the second is certain to succeed. However doing it this way makes the patch more obviously correct. I will tidy the code up in a future merge window.

Reported-by: Nate Dailey <nate.dailey@stratus.com>
Fixes: `bd870a16c5` ("md/raid10: Handle write errors by updating badblock log.")
Signed-off-by: NeilBrown <neilb@suse.com>
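
The ordering rule described in the commit message can be illustrated with a small user-space model. This is only a sketch under stated assumptions, not the code in raid10.c: the types `toy_array` and `toy_write` and the function `toy_finish_write()` are hypothetical names invented for illustration, and the real patch works through close_write(), bio_endio() and the R10BIO_WriteError flag rather than these fields. The point it tries to show is that the bitmap bit is cleared only once the outcome of the bad-block-list write is known.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdio.h>

/* Hypothetical stand-ins for illustration; not the kernel's structures. */
struct toy_array {
    bool bitmap_bit_set;  /* true: the block still needs a resync */
    bool device_faulty;   /* true: the device has been marked faulty */
};

struct toy_write {
    bool write_error;     /* the data write to the device failed */
    bool badblock_saved;  /* the bad-block-list update was written safely */
};

/*
 * Complete a write. Called only after the bad-block-list write (if any)
 * has finished, so its result is known. Returns true if the bitmap bit
 * was cleared.
 */
static bool toy_finish_write(struct toy_array *a, const struct toy_write *w)
{
    if (w->write_error && !w->badblock_saved) {
        /* Recording the bad block failed: treat the write as failed
         * and keep the bitmap bit so the block is resynced later. */
        a->device_faulty = true;
        return false;
    }
    /* Either the data is safe or the bad-block record is safe. */
    a->bitmap_bit_set = false;
    return true;
}

/* Small demonstration of both outcomes of a failed data write. */
static void toy_demo(void)
{
    struct toy_array a = { .bitmap_bit_set = true, .device_faulty = false };
    struct toy_write recorded = { .write_error = true, .badblock_saved = true };
    struct toy_write lost = { .write_error = true, .badblock_saved = false };

    printf("bad block recorded: cleared=%d\n", toy_finish_write(&a, &recorded));

    a.bitmap_bit_set = true;
    printf("bad block not recorded: cleared=%d\n", toy_finish_write(&a, &lost));
}
```

In this model a failed write whose bad-block record was saved clears the bit, while a failed write whose record could not be saved leaves the bit set and marks the device faulty, which is the case the commit message says previously led to data corruption.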
bcache	bcache: remove driver private bio splitting code	2015-08-13 12:31:40 -06:00
persistent-data	dm: remove unlikely() before IS_ERR()	2015-08-12 11:32:21 -04:00
bitmap.c	md/bitmap: don't pass -1 to bitmap_storage_alloc.	2015-10-02 17:24:13 +10:00
bitmap.h	md-cluster: re-add capabilities	2015-04-22 07:59:39 +10:00
dm-bio-prison.c	block: add a bi_error field to struct bio	2015-07-29 08:55:15 -06:00
dm-bio-prison.h	dm bio prison: add dm_cell_promote_or_release()	2015-05-29 14:19:06 -04:00
dm-bio-record.h	dm: Refactor for new bio cloning/splitting	2013-11-23 22:33:55 -08:00
dm-bufio.c	block: add a bi_error field to struct bio	2015-07-29 08:55:15 -06:00
dm-bufio.h	dm snapshot: use dm-bufio prefetch	2014-01-14 23:23:03 -05:00
dm-builtin.c	dm sysfs: fix a module unload race	2014-01-14 23:23:04 -05:00
dm-cache-block-types.h	dm cache: revert "remove remainder of distinct discard block size"	2014-11-10 15:25:30 -05:00
dm-cache-metadata.c	dm cache: add fail io mode and needs_check flag	2015-06-11 17:13:00 -04:00
dm-cache-metadata.h	dm cache: add fail io mode and needs_check flag	2015-06-11 17:13:00 -04:00
dm-cache-policy-cleaner.c	dm cache: fix NULL pointer when switching from cleaner policy	2015-10-09 09:16:29 -04:00
dm-cache-policy-internal.h	dm cache: age and write back cache entries even without active IO	2015-06-11 17:13:01 -04:00
dm-cache-policy-mq.c	dm cache policy smq: move 'dm-cache-default' module alias to SMQ	2015-08-12 11:27:29 -04:00
dm-cache-policy-smq.c	dm cache policy smq: change the mutex to a spinlock	2015-08-12 11:32:19 -04:00
dm-cache-policy.c	dm cache: add policy name to status output	2014-01-16 13:44:11 -05:00
dm-cache-policy.h	dm cache: age and write back cache entries even without active IO	2015-06-11 17:13:01 -04:00
dm-cache-target.c	Merge tag 'dm-4.3-changes' of git://git.kernel.org/pub/scm/linux/kernel/git/device-mapper/linux-dm	2015-09-02 16:35:26 -07:00
dm-crypt.c	dm crypt: constrain crypt device's max_segment_size to PAGE_SIZE	2015-09-14 12:04:24 -04:00
dm-delay.c	dm: do not override error code returned from dm_get_device()	2015-08-12 11:32:21 -04:00
dm-era-target.c	block: kill merge_bvec_fn() completely	2015-08-13 12:31:57 -06:00
dm-exception-store.c	dm snapshot: add new persistent store option to support overflow	2015-10-09 16:57:03 -04:00
dm-exception-store.h	dm snapshot: add new persistent store option to support overflow	2015-10-09 16:57:03 -04:00
dm-flakey.c	Merge tag 'dm-4.3-changes' of git://git.kernel.org/pub/scm/linux/kernel/git/device-mapper/linux-dm	2015-09-02 16:35:26 -07:00
dm-io.c	block: remove bio_get_nr_vecs()	2015-08-13 12:32:04 -06:00
dm-ioctl.c	char: make misc_deregister a void function	2015-08-05 10:35:49 -07:00
dm-kcopyd.c	dm: stop using WQ_NON_REENTRANT	2013-08-23 09:02:13 -04:00
dm-linear.c	Merge tag 'dm-4.3-changes' of git://git.kernel.org/pub/scm/linux/kernel/git/device-mapper/linux-dm	2015-09-02 16:35:26 -07:00
dm-log-userspace-base.c	dm log userspace base: fix compile warning	2015-04-15 12:10:20 -04:00
dm-log-userspace-transfer.c	dm log userspace transfer: match wait_for_completion_timeout return type	2015-04-15 12:10:20 -04:00
dm-log-userspace-transfer.h
dm-log-writes.c	Merge tag 'dm-4.3-changes' of git://git.kernel.org/pub/scm/linux/kernel/git/device-mapper/linux-dm	2015-09-02 16:35:26 -07:00
dm-log.c
dm-mpath.c	dm-mpath, scsi_dh: request scsi_dh modules in scsi_dh, not dm-mpath	2015-08-28 13:14:55 -07:00
dm-mpath.h
dm-path-selector.c
dm-path-selector.h
dm-queue-length.c
dm-raid1.c	Merge tag 'dm-4.3-changes' of git://git.kernel.org/pub/scm/linux/kernel/git/device-mapper/linux-dm	2015-09-02 16:35:26 -07:00
dm-raid.c	dm raid: fix round up of default region size	2015-10-02 12:02:31 -04:00
dm-region-hash.c	block: Abstract out bvec iterator	2013-11-23 22:33:47 -08:00
dm-round-robin.c
dm-service-time.c
dm-snap-persistent.c	dm snapshot persistent: fix missing cleanup in persistent_ctr error path	2015-10-13 12:20:54 -04:00
dm-snap-transient.c	dm snapshot: add new persistent store option to support overflow	2015-10-09 16:57:03 -04:00
dm-snap.c	dm snapshot: add new persistent store option to support overflow	2015-10-09 16:57:03 -04:00
dm-stats.c	dm stats: report precise_timestamps and histogram in @stats_list output	2015-08-18 17:20:03 -04:00
dm-stats.h	dm stats: support precise timestamps	2015-06-17 12:40:40 -04:00
dm-stripe.c	Merge tag 'dm-4.3-changes' of git://git.kernel.org/pub/scm/linux/kernel/git/device-mapper/linux-dm	2015-09-02 16:35:26 -07:00
dm-switch.c	dm switch: efficiently support repetitive patterns	2014-08-01 12:30:37 -04:00
dm-sysfs.c	dm: add 'use_blk_mq' module param and expose in per-device ro sysfs attr	2015-04-15 12:10:17 -04:00
dm-table.c	block: Replace SG_GAPS with new queue limits mask	2015-08-19 14:26:02 -07:00
dm-target.c	dm: allocate requests in target when stacking on blk-mq devices	2015-02-09 13:06:47 -05:00
dm-thin-metadata.c	dm thin metadata: delete btrees when releasing metadata snapshot	2015-08-12 10:42:51 -04:00
dm-thin-metadata.h	dm thin metadata: add dm_thin_remove_range()	2015-06-11 17:13:04 -04:00
dm-thin.c	dm thin: fix missing pool reference count decrement in pool_ctr error path	2015-10-13 12:20:55 -04:00
dm-uevent.c
dm-uevent.h
dm-verity.c	Merge tag 'dm-4.3-changes' of git://git.kernel.org/pub/scm/linux/kernel/git/device-mapper/linux-dm	2015-09-02 16:35:26 -07:00
dm-zero.c	block: add a bi_error field to struct bio	2015-07-29 08:55:15 -06:00
dm.c	dm: fix request-based dm error reporting	2015-10-06 10:08:16 -04:00
dm.h	block: kill merge_bvec_fn() completely	2015-08-13 12:31:57 -06:00
faulty.c	block: add a bi_error field to struct bio	2015-07-29 08:55:15 -06:00
Kconfig	SCSI misc on 20150911	2015-09-11 18:15:18 -07:00
linear.c	block: kill merge_bvec_fn() completely	2015-08-13 12:31:57 -06:00
linear.h
Makefile	dm cache: add stochastic-multi-queue (smq) policy	2015-06-11 17:12:59 -04:00
md-cluster.c	md-cluster: remove inappropriate try_module_get from join()	2015-08-31 19:43:17 +02:00
md-cluster.h	Fix read-balancing during node failure	2015-07-24 13:37:59 +10:00
md.c	md: clear CHANGE_PENDING in readonly array	2015-10-02 17:23:44 +10:00
md.h	block: kill merge_bvec_fn() completely	2015-08-13 12:31:57 -06:00
multipath.c	md: drop null test before destroy functions	2015-10-02 17:23:44 +10:00
multipath.h
raid0.c	md/raid0: apply base queue limits before disk_stack_limits	2015-10-02 17:23:44 +10:00
raid0.h	block: kill merge_bvec_fn() completely	2015-08-13 12:31:57 -06:00
raid1.c	md/raid1: don't clear bitmap bit when bad-block-list write fails.	2015-10-24 16:24:22 +11:00
raid1.h	md/raid1: ensure device failure recorded before write request returns.	2015-08-31 19:43:23 +02:00
raid5.c	md: drop null test before destroy functions	2015-10-02 17:23:44 +10:00
raid5.h	md/raid5: ensure device failure recorded before write request returns.	2015-08-31 19:43:59 +02:00
raid10.c	md/raid10: don't clear bitmap bit when bad-block-list write fails.	2015-10-24 16:24:23 +11:00
raid10.h	md/raid10: ensure device failure recorded before write request returns.	2015-08-31 19:43:45 +02:00