systemd

mirror of https://github.com/systemd/systemd.git synced 2024-11-24 18:53:33 +08:00

Author	SHA1	Message	Date
Lennart Poettering	6c47cd7d3b	execute: make StateDirectory= and friends compatible with DynamicUser=1 and RootDirectory=/RootImage= Let's clean up the interaction of StateDirectory= (and friends) to DynamicUser=1: instead of creating these directories directly below /var/lib, place them in /var/lib/private instead if DynamicUser=1 is set, making that directory 0700 and owned by root:root. This way, if a dynamic UID is later reused, access to the old run's state directory is prohibited for that user. Then, use file system namespacing inside the service to make /var/lib/private a readable tmpfs, hiding all state directories that are not listed in StateDirectory=, and making access to the actual state directory possible. Mount all directories listed in StateDirectory= to the same places inside the service (which means they'll now be mounted into the tmpfs instance). Finally, add a symlink from the state directory name in /var/lib/ to the one in /var/lib/private, so that both the host and the service can access the path under the same location. Here's an example: let's say a service runs with StateDirectory=foo. When DynamicUser=0 is set, it will get the following setup, and no difference between what the unit and what the host sees: /var/lib/foo (created as directory) Now, if DynamicUser=1 is set, we'll instead get this on the host: /var/lib/private (created as directory with mode 0700, root:root) /var/lib/private/foo (created as directory) /var/lib/foo → private/foo (created as symlink) And from inside the unit: /var/lib/private (a tmpfs mount with mode 0755, root:root) /var/lib/private/foo (bind mounted from the host) /var/lib/foo → private/foo (the same symlink as above) This takes inspiration from how container trees are protected below /var/lib/machines: they generally reuse UIDs/GIDs of the host, but because /var/lib/machines itself is set to 0700 host users cannot access files in the container tree even if the UIDs/GIDs are reused. However, for this commit we add one further trick: inside and outside of the unit /var/lib/private is a different thing: outside it is a plain, inaccessible directory, and inside it is a world-readable tmpfs mount with only the whitelisted subdirs below it, bind mounte din. This means, from the outside the dir acts as an access barrier, but from the inside it does not. And the symlink created in /var/lib/foo itself points across the barrier in both cases, so that root and the unit's user always have access to these dirs without knowing the details of this mounting magic. This logic resolves a major shortcoming of DynamicUser=1 units: previously they couldn't safely store persistant data. With this change they can have their own private state, log and data directories, which they can write to, but which are protected from UID recycling. With this change, if RootDirectory= or RootImage= are used it is ensured that the specified state/log/cache directories are always mounted in from the host. This change of semantics I think is much preferable since this means the root directory/image logic can be used easily for read-only resource bundling (as all writable data resides outside of the image). Note that this is a change of behaviour, but given that we haven't released any systemd version with StateDirectory= and friends implemented this should be a safe change to make (in particular as previously it wasn't clear what would actually happen when used in combination). Moreover, by making this change we can later add a "+" modifier to these setings too working similar to the same modifier in ReadOnlyPaths= and friends, making specified paths relative to the container itself.	2017-10-02 17:41:44 +02:00
Lennart Poettering	a227a4be48	namespace: if we can create the destination of bind and PrivateTmp= mounts When putting together the namespace, always create the file or directory we are supposed to bind mount on, the same way we do it for most other stuff, for example mount units or systemd-nspawn's --bind= option. This has the big benefit that we can use namespace bind mounts on dirs in /tmp or /var/tmp even in conjunction with PrivateTmp=.	2017-10-02 17:41:43 +02:00
Lennart Poettering	e908468b5b	namespace: properly handle bind mounts from the host Before this patch we had an ordering problem: if we have no namespacing enabled except for two bind mounts that intend to swap /a and /b via bind mounts, then we'd execute the bind mount binding /b to /a, followed by thebind mount from /a to /b, thus having the effect that /b is now visible in both /a and /b, which was not intended. With this change, as soon as any bind mount is configured we'll put together the service mount namespace in a temporary directory instead of operating directly in the root. This solves the problem in a straightforward fashion: the source of bind mounts will always refer to the host, and thus be unaffected from the bind mounts we already created.	2017-10-02 17:41:43 +02:00
Lennart Poettering	645767d6b5	namespace: create /dev, /proc, /sys when needed We already create /dev implicitly if PrivateTmp=yes is on, if it is missing. Do so too for the other two API VFS, as well as for /dev if PrivateTmp=yes is off but MountAPIVFS=yes is on (i.e. when /dev is bind mounted from the host).	2017-10-02 17:41:43 +02:00
Lennart Poettering	72fd17682d	core: usually our enum's _INVALID and _MAX special values are named after the full type In most cases we followed the rule that the special _INVALID and _MAX values we use in our enums use the full type name as prefix (in contrast to regular values that we often make shorter), do so for ExecDirectoryType as well. No functional changes, just a little bit of renaming to make this code more like the rest.	2017-10-02 17:41:43 +02:00
Lennart Poettering	a1164ae380	core: chown() StateDirectory= and friends recursively when starting a service This is particularly useful when used in conjunction with DynamicUser=1, where the UID might change for every invocation, but is useful in other cases too, for example, when these directories are shared between systems where the UID assignments differ slightly.	2017-10-02 17:41:43 +02:00
Lennart Poettering	64fbdc0f91	nspawn: properly report all kinds of changed UID/GID when patching things for userns We forgot to propagate one chmod().	2017-10-02 17:41:43 +02:00
Lennart Poettering	fd68e0f58c	Merge pull request #6960 from keszybz/hwdb-update Hwdb update and sorting	2017-10-02 17:12:58 +02:00
Jouke Witteveen	df66b93fe2	service: better detect when a Type=notify service cannot become active anymore (#6959 ) No need to wait for a timeout when we know things are not going to work out. When the main process goes away and only notifications from the main process are accepted, then we will not receive any notifications anymore.	2017-10-02 16:35:27 +02:00
Zbigniew Jędrzejewski-Szmek	b139c95bc4	Merge pull request #6941 from andir/use-in_set use IN_SET where possible	2017-10-02 15:08:10 +02:00
Zbigniew Jędrzejewski-Szmek	d29ad81e3b	Minor line wrapping adjustment	2017-10-02 14:52:12 +02:00
Zbigniew Jędrzejewski-Szmek	81473e6f20	Merge pull request #6932 from poettering/notify-pam fix handling of cgroup empty events for Type=notify services	2017-10-02 14:40:10 +02:00
Zbigniew Jędrzejewski-Szmek	53f4c5fa21	hwdb: sort usb classes and vendors No change, but let's keep the sorting in place as a safeguard for the future.	2017-10-02 13:19:23 +02:00
Zbigniew Jędrzejewski-Szmek	0a91fd3bd6	hwdb: sort pci classes and vendors No change, but let's keep the sorting in place as a safeguard for the future.	2017-10-02 13:19:23 +02:00
Zbigniew Jędrzejewski-Szmek	e979e502da	hwdb: sort sdio entries Nothing intersting here.	2017-10-02 13:19:23 +02:00
Zbigniew Jędrzejewski-Szmek	e4804bdae6	hwdb: de-duplicate and sort OUI entries Ignoring duplicate entry: 0001C8 = "THOMAS CONRAD CORP.", "CONRAD CORP." Ignoring duplicate entry: 080030 = "NETWORK RESEARCH CORPORATION", "ROYAL MELBOURNE INST OF TECH" Ignoring duplicate entry: 080030 = "NETWORK RESEARCH CORPORATION", "CERN" → we have two vendor prefixes with duplicate entries. For the first one, there are two entries with what appear to be the same company. In the second case, the same prefix is assigned to three different entities. I arbitrarily chose to prefer the first entry.	2017-10-02 13:19:23 +02:00
Zbigniew Jędrzejewski-Szmek	53d22844f0	ids_parser: allow specifying which parts to run	2017-10-02 13:19:22 +02:00
Zbigniew Jędrzejewski-Szmek	3a58a43085	hwdb: add a grammar-based generator for vendor/model and class tables This is rather slow (1 m 45 s on my laptop), but since it'd be only used once per release, maybe this doesn't matter that much. Output is identical to ids-update.pl with the set of source files committed in the grandparent.	2017-10-02 13:19:22 +02:00
Zbigniew Jędrzejewski-Szmek	c579960977	hwdb: fix invalid encoding in usb.ids 0xB4 is the code point for "accent acute", but when encoded in utf-8, is is a multibyte sequence. A single 0xB4 byte is invalid (seems to be latin1).	2017-10-02 13:19:22 +02:00
Zbigniew Jędrzejewski-Szmek	2a9ba96c67	hwdb: also add the source files to git Without the original files it's hard to see what changed "upstream", and what entries were added and removed. Upstream did not keep the entries sorted, and our processing scripts did not sort the output either, so from just looking at diffs it's hard to say what changed. So let's keep the original data, at least for a few update cycles, so get a better handle on the upstream changes. It's a few hundred kilobytes, so not that big, and text, so it should compresses well.	2017-10-02 13:19:22 +02:00
Zbigniew Jędrzejewski-Szmek	689f489812	hwdb: update http://www.uefi.org/uefi-pnp-export is returning an internal server error, so the previous version was used.	2017-10-02 13:19:21 +02:00
Zbigniew Jędrzejewski-Szmek	91dab98ffc	meson-hwdb-update.sh: add -n param to skip downloads Useful to experiment with the patching part w/o redownloading the files.	2017-10-02 13:19:21 +02:00
Zbigniew Jędrzejewski-Szmek	9e82570e73	meson: rename hwdb update target ninja -c build hwdb-update During the initial meson conversion, custom_target:s and run_target:s behaved the same, and the target name became a top-level command. Now custom_target:s require the subdir to be included, e.g. we have man/man target to build man pages, but run_target:s not. So I think this target got a name that is so generic because of the confusion caused by changing rules. Let's rename it.	2017-10-02 13:19:21 +02:00
Andreas Rammhold	ec2ce0c5d7	tree-wide: use `!IN_SET(..)` for `a != b && a != c && …` The included cocci was used to generate the changes. Thanks to @flo-wer for pointing this case out.	2017-10-02 13:09:56 +02:00
Andreas Rammhold	3742095b27	tree-wide: use IN_SET where possible In addition to the changes from #6933 this handles cases that could be matched with the included cocci file.	2017-10-02 13:09:54 +02:00
Lennart Poettering	5aaeeffb5f	man: document that PAMName= and NotifyAccess=all don't mix well. See: #6045	2017-10-02 12:58:42 +02:00
Lennart Poettering	b13ddbbcf3	service: accept the fact that the three xyz_good() functions return ints Currently, all three of cgroup_good(), main_pid_good(), control_pid_good() all return an "int" (two of them propagate errors). It's a good thing to keep the three functions similar, so let's leave it at that, but then let's clean up the invocation of the three functions so that they always clearly acknowledge that the return value is not a bool, but potentially negative.	2017-10-02 12:58:42 +02:00
Lennart Poettering	019be28676	service: drop _pure_ decorator on static function The compiler should be good enough to figure this out on its own if this is a static function, and it makes control_pid_good() an outlier anyway, and decorators like this tend to bitrot. Hence, to keep things simple and automatic, let's just drop the decorator.	2017-10-02 12:58:42 +02:00
Lennart Poettering	3c751b1bfa	service: a cgroup empty notification isn't reason enough to go down The processes associated with a service are not just the ones in its cgroup, but also the control and main processes, which might possibly live outside of it, for example if they transitioned into their own cgroups because they registered a PAM session of their own. Hence, if we get a cgroup empty notification always check if the main PID is still around before taking action too eagerly. Fixes: #6045	2017-10-02 12:58:42 +02:00
Lennart Poettering	07697d7ec5	service: add explanatory comments to control_pid_good() and cgroup_good() Let's add a similar comment to each as we already have for main_pid_good(), emphasizing that these functions are supposed to be have very similar.	2017-10-02 12:58:42 +02:00
Lennart Poettering	51894d706f	service: fix main_pid_good() comment We don't actually return -1, don't claim that.	2017-10-02 12:58:37 +02:00
Piotr Drąg	965d824031	catalog: update Polish translation (#6947 )	2017-09-30 14:39:29 +02:00
Lennart Poettering	894fcf91ed	Merge pull request #6942 from keszybz/remove-sys Remove sys if necessary	2017-09-29 15:48:23 +02:00
Zbigniew Jędrzejewski-Szmek	0bca795456	sys-script: remove output directory if it exists meson will occasionally call us even though the output directory exists. Let's just nuke and recreate in that case.	2017-09-29 12:28:25 +02:00
Zbigniew Jędrzejewski-Szmek	227ef9bc38	create-sys-script: adapt to separate build dir, modernize, add more checks The script wasn't apparently used since the switch to meson, because it required the sys subdirectory to be present in the same subdirectory where the output script is located. Let's use f-strings to make the whole thing more readable. Add some extra checks.	2017-09-29 12:27:21 +02:00
John Lin	01a65d4180	man: move non-target units together (#6934 )	2017-09-28 19:56:50 +02:00
Zbigniew Jędrzejewski-Szmek	56d50ab1d3	meson: move library version defines to the top (#6939 )	2017-09-28 19:24:16 +02:00
Lennart Poettering	e88fe88877	Merge pull request #6936 from poettering/news-235 News 235	2017-09-28 19:22:55 +02:00
Lennart Poettering	aecc6f6b34	Merge pull request #6933 from yuwata/use_in_set use IN_SET macro	2017-09-28 19:22:09 +02:00
Lennart Poettering	63950427ab	meson: bump release to 235	2017-09-28 11:29:52 +02:00
Lennart Poettering	fa991fb72b	add some more things to TODO	2017-09-28 11:26:11 +02:00
Lennart Poettering	fccf5419e8	prepare NEWS for 235	2017-09-28 11:26:02 +02:00
Yu Watanabe	945c2931bb	libsystemd: use IN_SET macro	2017-09-28 17:37:59 +09:00
Lennart Poettering	cd4826e0e6	Merge pull request #6924 from andir/vrf-dhcpv4 networkd: use VRFs routing table for DHCP routes	2017-09-28 09:46:03 +02:00
Franck Bui	7e760b79ad	udev-rules: all values can contain escaped double quotes now (#6890 ) This is primarly useful to support escaped double quotes in PROGRAM or IMPORT{program} directives. The only possibilty before this patch was to use an external shell script but this seems too cumbersome for trivial logics such as PROGRAM=="/bin/sh -c 'FOO=\"%s{model}\"; echo ${FOO:0:4}'" or any similar shell constructs that needs to deals with patterns including whitespaces. As it's the case for single quote and for directives running a program, words within escaped double quotes will be considered as a single argument. Fixes: #6835	2017-09-28 08:53:46 +02:00
Zbigniew Jędrzejewski-Szmek	9500b9209b	Merge pull request #6928 from poettering/cgroup-empty-race rework cgroup empty notification handling (i.e. a fix for #6608)	2017-09-28 08:48:21 +02:00
Yu Watanabe	74dead3461	networkd: use assert_not_reached()	2017-09-28 15:38:50 +09:00
Yu Watanabe	25a1bffc8f	networkd: use IN_SET macro	2017-09-28 15:37:38 +09:00
Andreas Rammhold	11d38b9043	man/systemd.network: Updated documentation about VRF traffic redirection Starting with kernel version 4.8 the kernel has a single `l3mdev` rule that handles this. This rule will be created when the first VRF device is added.	2017-09-27 20:02:15 +02:00
Andreas Rammhold	917b226024	man/systemd.network: Added documentation about the DHCP.RouteTable behaviour with VRFs	2017-09-27 20:02:15 +02:00

1 2 3 4 5 ...

30198 Commits