mirrors/qemu

mirror of https://github.com/qemu/qemu.git synced 2024-11-25 11:53:39 +08:00

Author	SHA1	Message	Date
Stefan Hajnoczi	0e71be1932	qed: add .bdrv_co_write_zeroes() support Zero writes are a dedicated interface for writing regions of zeroes into the image file. If clusters are not yet allocated it is possible to use an efficient metadata representation which keeps the image file compact and does not store individual zero bytes. Implementing this for the QED image format is fairly straightforward. The only issue is that when a zero write touches an existing cluster we have to allocate a bounce buffer and perform a regular write. Signed-off-by: Stefan Hajnoczi <stefanha@linux.vnet.ibm.com> Signed-off-by: Kevin Wolf <kwolf@redhat.com>	2012-02-09 16:17:50 +01:00
Stefan Hajnoczi	6e4f59bd0d	qed: replace is_write with flags field Per-request attributes like read/write are currently implemented as bool fields in the QEDAIOCB struct. This becomes unwiedly as the number of attributes grows. For example, the qed_aio_setup() function would have to take multiple bool arguments and at call sites it would be hard to distinguish the meaning of each bool. Instead use a flags field with bitmask constants. This will be used when zero write support is added. Signed-off-by: Stefan Hajnoczi <stefanha@linux.vnet.ibm.com> Signed-off-by: Kevin Wolf <kwolf@redhat.com>	2012-02-09 16:17:50 +01:00
Li Zhi Hui	2b16c9ffb2	qcow: Use bdrv functions to replace file operation Since common file operation functions lack of error detection and use much more I/O syscalls, so change them to bdrv series functions and reduce I/O request. Signed-off-by: Li Zhi Hui <zhihuili@linux.vnet.ibm.com> Signed-off-by: Kevin Wolf <kwolf@redhat.com>	2012-01-26 14:49:18 +01:00
Li Zhi Hui	84b0ec020f	qcow: Return real error code in qcow_open Signed-off-by: Li Zhi Hui <zhihuili@linux.vnet.ibm.com> Signed-off-by: Kevin Wolf <kwolf@redhat.com>	2012-01-26 14:49:18 +01:00
Stefan Weil	641543b76b	block/vdi: Zero unused parts when allocating a new block (fix #919242 ) The new block was filled with zero when it was allocated by g_malloc0, but when it was reused later and only partially used, data from the previously allocated block were still present and written to the new block. This caused the problems reported by bug #919242 (https://bugs.launchpad.net/qemu/+bug/919242). Now the unused parts of the new block which are before and after the data are always filled with zero, so it is no longer necessary to zero the whole block with g_malloc0. I also updated the copyright comment. Signed-off-by: Stefan Weil <sw@weilnetz.de> Signed-off-by: Kevin Wolf <kwolf@redhat.com>	2012-01-26 14:49:18 +01:00
Marcelo Tosatti	c8c3080f4a	block: add support for partial streaming Add support for streaming data from an intermediate section of the image chain (see patch and documentation for details). Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com> Signed-off-by: Stefan Hajnoczi <stefanha@linux.vnet.ibm.com> Signed-off-by: Kevin Wolf <kwolf@redhat.com>	2012-01-26 14:49:18 +01:00
Stefan Hajnoczi	5094a6c016	block: rate-limit streaming operations This patch implements rate-limiting for image streaming. If we've exceeded the bandwidth quota for a 100 ms time slice we sleep the coroutine until the next slice begins. Signed-off-by: Stefan Hajnoczi <stefanha@linux.vnet.ibm.com> Signed-off-by: Kevin Wolf <kwolf@redhat.com>	2012-01-26 11:45:26 +01:00
Stefan Hajnoczi	4f1043b4ff	block: add image streaming block job Signed-off-by: Stefan Hajnoczi <stefanha@linux.vnet.ibm.com> Signed-off-by: Kevin Wolf <kwolf@redhat.com>	2012-01-26 11:45:26 +01:00
Stefan Hajnoczi	031380d877	block: replace unchecked strdup/malloc/calloc with glib Most of the codebase as been converted to use glib memory allocation functions. There are still a few instances of malloc/calloc in the block layer and qemu-io. Replace them, especially since they do not check the strdup/malloc/calloc return value. Reported-by: Dr David Alan Gilbert <davidagilbert@uk.ibm.com> Signed-off-by: Stefan Hajnoczi <stefanha@linux.vnet.ibm.com> Signed-off-by: Kevin Wolf <kwolf@redhat.com>	2012-01-26 11:39:03 +01:00
Gregory Farnum	bd60324706	rbd: wire up snapshot removal and rollback functionality Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com> Reviewed-by: Stefan Hajnoczi <stefanha@linux.vnet.ibm.com> Signed-off-by: Kevin Wolf <kwolf@redhat.com>	2012-01-26 11:39:03 +01:00
Paolo Bonzini	6b620ca3b0	prepare for future GPLv2+ relicensing All files under GPLv2 will get GPLv2+ changes starting tomorrow. event_notifier.c and exec-obsolete.h were only ever touched by Red Hat employees and can be relicensed now. Signed-off-by: Paolo Bonzini <pbonzini@redhat.com> Signed-off-by: Anthony Liguori <aliguori@us.ibm.com>	2012-01-13 10:55:56 -06:00
Stefan Hajnoczi	8d98734651	vvfat: avoid leaking file descriptor in commit_one_file() Reported-by: Dr David Alan Gilbert <davidagilbert@uk.ibm.com> Signed-off-by: Stefan Hajnoczi <stefanha@linux.vnet.ibm.com>	2012-01-13 10:36:59 +00:00
Paolo Bonzini	128aa58947	move corking functions to osdep.c Signed-off-by: Paolo Bonzini <pbonzini@redhat.com>	2011-12-22 11:53:58 +01:00
Paolo Bonzini	7a706633e9	nbd: add support for NBD_CMD_TRIM Signed-off-by: Paolo Bonzini <pbonzini@redhat.com>	2011-12-22 11:53:57 +01:00
Paolo Bonzini	1486d04a1b	nbd: add support for NBD_CMD_FLUSH Signed-off-by: Paolo Bonzini <pbonzini@redhat.com>	2011-12-22 11:53:57 +01:00
Paolo Bonzini	2c7989a9b1	nbd: add support for NBD_CMD_FLAG_FUA Signed-off-by: Paolo Bonzini <pbonzini@redhat.com>	2011-12-22 11:53:57 +01:00
Paolo Bonzini	ecda3447d1	nbd: allow multiple in-flight requests Allow sending up to 16 requests, and drive the replies to the coroutine that did the request. The code is written to be exactly the same as before this patch when MAX_NBD_REQUESTS == 1 (modulo the extra mutex and state). Signed-off-by: Paolo Bonzini <pbonzini@redhat.com>	2011-12-22 11:53:57 +01:00
Paolo Bonzini	d9b09f13ca	nbd: split requests qemu-nbd has a limit of slightly less than 1M per request. Work around this in the nbd block driver. Signed-off-by: Paolo Bonzini <pbonzini@redhat.com>	2011-12-22 11:53:57 +01:00
Paolo Bonzini	ae255e523c	nbd: switch to asynchronous operation Signed-off-by: Paolo Bonzini <pbonzini@redhat.com>	2011-12-22 11:53:57 +01:00
Paolo Bonzini	8c5135f90e	sheepdog: move coroutine send/recv function to generic code Outside coroutines, avoid busy waiting on EAGAIN by temporarily making the socket blocking. The API of qemu_recvv/qemu_sendv is slightly different from do_readv/do_writev because they do not handle coroutines. It returns the number of bytes written before encountering an EAGAIN. The specificity of yielding on EAGAIN is entirely in qemu-coroutine.c. Reviewed-by: MORITA Kazutaka <morita.kazutaka@lab.ntt.co.jp> Signed-off-by: Paolo Bonzini <pbonzini@redhat.com>	2011-12-22 11:53:53 +01:00
Li Zhi Hui	16d2fc002a	block/cow: Return real error code Signed-off-by: Li Zhi Hui <zhihuili@linux.vnet.ibm.com> Signed-off-by: Kevin Wolf <kwolf@redhat.com>	2011-12-15 12:40:33 +01:00
Kevin Wolf	c2c9a46609	qcow2: Allow >4 GB VM state This is a compatible extension to the snapshot header format that allows saving a 64 bit VM state size. Signed-off-by: Kevin Wolf <kwolf@redhat.com>	2011-12-15 12:40:33 +01:00
Josh Durgin	b9c532903f	rbd: always set out parameter in qemu_rbd_snap_list The caller expects psn_tab to be NULL when there are no snapshots or an error occurs. This results in calling g_free on an invalid address. Reported-by: Oliver Francke <Oliver@filoo.de> Signed-off-by: Josh Durgin <josh.durgin@dreamhost.com> Signed-off-by: Kevin Wolf <kwolf@redhat.com>	2011-12-15 12:40:08 +01:00
Li Zhi Hui	28c1202ba6	block/qcow2.c: call qcow2_free_snapshots in the function of qcow2_close Signed-off-by: Li Zhi Hui <zhihuili@linux.vnet.ibm.com> Reviewed-by: Stefan Hajnoczi <stefanha@linux.vnet.ibm.com> Signed-off-by: Kevin Wolf <kwolf@redhat.com>	2011-12-15 12:40:08 +01:00
Paolo Bonzini	91977c2e5f	block: qemu_aio_get does not return NULL Initially done with the following semantic patch: @ rule1 @ expression E; statement S; @@ E = qemu_aio_get (...); ( - if (E == NULL) { ... } \| - if (E) { <... S ...> } ) which however missed occurrences in linux-aio.c and posix-aio-compat.c. Those were done by hand. The change in vdi_aio_setup's caller was also done by hand. Signed-off-by: Paolo Bonzini <pbonzini@redhat.com> Signed-off-by: Kevin Wolf <kwolf@redhat.com>	2011-12-15 12:40:08 +01:00
Paolo Bonzini	ad54ae80c7	block: bdrv_aio_* do not return NULL Initially done with the following semantic patch: @ rule1 @ expression E; statement S; @@ E = ( bdrv_aio_readv \| bdrv_aio_writev \| bdrv_aio_flush \| bdrv_aio_discard \| bdrv_aio_ioctl ) (...); ( - if (E == NULL) { ... } \| - if (E) { <... S ...> } ) which however missed the occurrence in block/blkverify.c (as it should have done), and left behind some unused variables. Signed-off-by: Paolo Bonzini <pbonzini@redhat.com> Signed-off-by: Kevin Wolf <kwolf@redhat.com>	2011-12-15 12:40:07 +01:00
Dong Xu Wang	3a93113a00	fix typo: delete redundant semicolon Double semicolons should be single. Signed-off-by: Dong Xu Wang <wdongxu@linux.vnet.ibm.com> Signed-off-by: Stefan Hajnoczi <stefanha@linux.vnet.ibm.com>	2011-12-06 09:56:41 +00:00
Anthony Liguori	eb5d5beaeb	Merge remote-tracking branch 'kwolf/for-anthony' into staging	2011-12-05 09:39:25 -06:00
Stefan Hajnoczi	e94d138733	cow: use bdrv_co_is_allocated() Now that bdrv_co_is_allocated() is available we can use it instead of the synchronous bdrv_is_allocated() interface. This is a follow-up that Kevin Wolf <kwolf@redhat.com> pointed out after applying the series that introduces bdrv_co_is_allocated(). It is safe to make cow_read() a coroutine_fn because its only caller is a coroutine_fn. Signed-off-by: Stefan Hajnoczi <stefanha@linux.vnet.ibm.com> Signed-off-by: Kevin Wolf <kwolf@redhat.com>	2011-12-05 14:51:38 +01:00
Stefan Hajnoczi	e8ee5e4c47	coroutine: add qemu_co_queue_restart_all() It's common to wake up all waiting coroutines. Introduce the qemu_co_queue_restart_all() function to do this instead of looping over qemu_co_queue_next() in every caller. Signed-off-by: Stefan Hajnoczi <stefanha@linux.vnet.ibm.com> Signed-off-by: Kevin Wolf <kwolf@redhat.com>	2011-12-05 14:51:38 +01:00
Stefan Hajnoczi	81145834d3	cow: convert to .bdrv_co_is_allocated() The cow block driver does not keep internal state for cluster lookups. This means it is safe to perform cluster lookups in coroutine context without risk of race conditions that corrupt internal state. Signed-off-by: Stefan Hajnoczi <stefanha@linux.vnet.ibm.com> Signed-off-by: Kevin Wolf <kwolf@redhat.com>	2011-12-05 14:51:37 +01:00
Stefan Hajnoczi	e850b35a1f	vdi: convert to .bdrv_co_is_allocated() It is trivial to switch from the synchronous .bdrv_is_allocated() interface to .bdrv_co_is_allocated() since vdi_is_allocated() does not block. Signed-off-by: Stefan Hajnoczi <stefanha@linux.vnet.ibm.com> Signed-off-by: Kevin Wolf <kwolf@redhat.com>	2011-12-05 14:51:37 +01:00
Stefan Hajnoczi	73f703ca8f	vvfat: convert to .bdrv_co_is_allocated() It is trivial to switch from the synchronous .bdrv_is_allocated() interface to .bdrv_co_is_allocated() since vvfat_is_allocated() does not block. Signed-off-by: Stefan Hajnoczi <stefanha@linux.vnet.ibm.com> Signed-off-by: Kevin Wolf <kwolf@redhat.com>	2011-12-05 14:51:37 +01:00
Stefan Hajnoczi	f8a2e5e3ca	block: convert qcow2, qcow2, and vmdk to .bdrv_co_is_allocated() The qcow2, qcow, and vmdk block drivers are based on coroutines. They have a coroutine mutex which protects internal state. We can convert the .bdrv_is_allocated() function to .bdrv_co_is_allocated() by holding the mutex around the cluster lookup operation. Signed-off-by: Stefan Hajnoczi <stefanha@linux.vnet.ibm.com> Signed-off-by: Kevin Wolf <kwolf@redhat.com>	2011-12-05 14:51:37 +01:00
Stefan Hajnoczi	b7d5a5b8ae	qed: convert to .bdrv_co_is_allocated() The bdrv_qed_is_allocated() function is a synchronous wrapper around qed_find_cluster(), which performs the cluster lookup. In order to convert the synchronous function to a coroutine function we yield instead of using qemu_aio_wait(). Note that QED's cache is already safe for parallel requests so no locking is needed. Signed-off-by: Stefan Hajnoczi <stefanha@linux.vnet.ibm.com> Signed-off-by: Kevin Wolf <kwolf@redhat.com>	2011-12-05 14:51:37 +01:00
Kevin Wolf	e3f652b332	qcow2: Fix error path in qcow2_snapshot_load_tmp If the bdrv_read() of the snapshot's L1 table fails, return the right error code and make sure that the old L1 table is still loaded and we don't break the BlockDriverState completely. Signed-off-by: Kevin Wolf <kwolf@redhat.com> Reviewed-by: Stefan Hajnoczi <stefanha@linux.vnet.ibm.com>	2011-12-05 14:51:36 +01:00
Kevin Wolf	9a4767809f	qcow2: Fix order in qcow2_snapshot_delete First the snapshot must be deleted and only then the refcounts can be decreased. Signed-off-by: Kevin Wolf <kwolf@redhat.com> Reviewed-by: Stefan Hajnoczi <stefanha@linux.vnet.ibm.com>	2011-12-05 14:51:36 +01:00
Kevin Wolf	43a0cac465	qcow2: Fix order of refcount updates in qcow2_snapshot_goto The refcount updates must be moved so that in the worst case we can get cluster leaks, but refcounts may never be too low. Signed-off-by: Kevin Wolf <kwolf@redhat.com> Reviewed-by: Stefan Hajnoczi <stefanha@linux.vnet.ibm.com>	2011-12-05 14:51:36 +01:00
Kevin Wolf	589f284b76	qcow2: Return real error in qcow2_snapshot_goto Besides fixing the return code, this adds some comments that make clear how the code works and that it potentially breaks images if we fail in the wrong place. Actually fixing this is left for the next patch. Signed-off-by: Kevin Wolf <kwolf@redhat.com> Reviewed-by: Stefan Hajnoczi <stefanha@linux.vnet.ibm.com>	2011-12-05 14:51:36 +01:00
Kevin Wolf	d1ea98d56d	qcow2: Rework qcow2_snapshot_create error handling Increase refcounts only after allocating a new L1 table has succeeded in order to make leaks less likely. If writing the snapshot table fails, revert in-memory state to be consistent with that on disk. While at it, make it return the real error codes instead of -1. Signed-off-by: Kevin Wolf <kwolf@redhat.com> Reviewed-by: Stefan Hajnoczi <stefanha@linux.vnet.ibm.com>	2011-12-05 14:51:36 +01:00
Kevin Wolf	03343166f7	qcow2: Cleanups and memleak fix in qcow2_snapshot_create sn->id_str could be leaked before this. The rest of this patch changes comments, fixes coding style or removes checks that are unnecessary with g_malloc. Signed-off-by: Kevin Wolf <kwolf@redhat.com> Reviewed-by: Stefan Hajnoczi <stefanha@linux.vnet.ibm.com>	2011-12-05 14:51:36 +01:00
Kevin Wolf	d69969c404	qcow2: Update snapshot table information at once Failing in the middle wouldn't help with the integrity of the image, so doing everything in a single request seems better. Signed-off-by: Kevin Wolf <kwolf@redhat.com> Reviewed-by: Stefan Hajnoczi <stefanha@linux.vnet.ibm.com>	2011-12-05 14:51:36 +01:00
Kevin Wolf	07fd877900	qcow2: Return real error code in qcow2_write_snapshots Doesn't immediately fix anything as the callers don't use the return value, but they will be fixed next. Signed-off-by: Kevin Wolf <kwolf@redhat.com> Reviewed-by: Stefan Hajnoczi <stefanha@linux.vnet.ibm.com>	2011-12-05 14:51:35 +01:00
Kevin Wolf	42deb29fed	qcow2: Return real error code in qcow2_read_snapshots Signed-off-by: Kevin Wolf <kwolf@redhat.com> Reviewed-by: Stefan Hajnoczi <stefanha@linux.vnet.ibm.com>	2011-12-05 14:51:35 +01:00
Dong Xu Wang	a968168c58	block: Add coroutine_fn marker to coroutine functions Looks better when reviewing these source files. Signed-off-by: Dong Xu Wang <wdongxu@linux.vnet.ibm.com> Reviewed-by: Stefan Hajnoczi <stefanha@linux.vnet.ibm.com> Signed-off-by: Kevin Wolf <kwolf@redhat.com>	2011-12-05 14:51:35 +01:00
Li Zhi Hui	3535a9c6be	block: Use bdrv functions to replace file operation in cow.c Since common file operation functions lack of error detection, so change them to bdrv series functions. Signed-off-by: Li Zhi Hui <zhihuili@linux.vnet.ibm.com> Reviewed-by: Stefan Hajnoczi <stefanha@linux.vnet.ibm.com> Signed-off-by: Kevin Wolf <kwolf@redhat.com>	2011-12-05 14:51:35 +01:00
Zhi Yong Wu	23e9a39e7d	qed: adjust the way to get nb_sectors This patch is only to refactor some lines of codes to get better and more robust codes. As you have seen, in qed_read_table_cb() it's nice to use qiov->size because that function doesn't obviously use a single struct iovec. In other two functions, if qiov use more than one struct iovec, the existing way will get wrong nb_sectors. To make the code more robust, it will be nicer to refactor the existing way as below. Signed-off-by: Zhi Yong Wu <wuzhy@linux.vnet.ibm.com> Acked-by: Stefan Hajnoczi <stefanha@linux.vnet.ibm.com> Signed-off-by: Kevin Wolf <kwolf@redhat.com>	2011-12-05 14:51:34 +01:00
Stefan Hajnoczi	aef4acb661	qcow2: avoid reentrant bdrv_read() in copy_sectors() A BlockDriverState should not issue requests on itself through the public block layer interface. Nested, or reentrant, requests are problematic because they do I/O throttling and request tracking twice. Features like block layer copy-on-read use request tracking to avoid race conditions between concurrent requests. The reentrant request will have to "wait" for its parent request to complete. But the parent is waiting for the reentrant request to make progress so we have reached deadlock. The solution is for block drivers to avoid the public block layer interfaces for reentrant requests. Instead they should call their own internal functions if they wish to perform reentrant requests. This is also a good opportunity to make copy_sectors() a true coroutine_fn. That means calling bdrv_co_writev() instead of bdrv_write(). Behavior is unchanged but we're being explicit that this executes in coroutine context. Signed-off-by: Stefan Hajnoczi <stefanha@linux.vnet.ibm.com> Signed-off-by: Kevin Wolf <kwolf@redhat.com>	2011-12-05 14:49:47 +01:00
Kevin Wolf	1b9f1491f8	qcow2: Unlock during COW Unlocking during COW allows for more parallelism. One change it requires is that buffers are dynamically allocated instead of just using a per-image buffer. While touching the code, drop the synchronous qcow2_read() function and replace it by a bdrv_read() call. Signed-off-by: Kevin Wolf <kwolf@redhat.com>	2011-12-05 14:49:40 +01:00
Dong Xu Wang	9b2260cbd5	fix spelling in block sub directory Cc: Kevin Wolf <kwolf@redhat.com> Signed-off-by: Dong Xu Wang <wdongxu@linux.vnet.ibm.com> Signed-off-by: Stefan Hajnoczi <stefanha@linux.vnet.ibm.com>	2011-12-02 10:50:57 +00:00

1 2 3 4 5 ...

500 Commits