linux/Documentation/admin-guide/cgroup-v1
Johannes Weiner ef1fcad854 mm: memcontrol: deprecate charge moving
commit da34a8484d upstream.

Charge moving mode in cgroup1 allows memory to follow tasks as they
migrate between cgroups.  This is, and always has been, a questionable
thing to do - for several reasons.

First, it's expensive.  Pages need to be identified, locked and isolated
from various MM operations, and reassigned, one by one.

Second, it's unreliable.  Once pages are charged to a cgroup, there isn't
always a clear owner task anymore.  Cache isn't moved at all, for example.
Mapped memory is moved - but if trylocking or isolating a page fails,
it's arbitrarily left behind.  Frequent moving between domains may leave a
task's memory scattered all over the place.

Third, it isn't really needed.  Launcher tasks can kick off workload tasks
directly in their target cgroup.  Using dedicated per-workload groups
allows fine-grained policy adjustments - no need to move tasks and their
physical pages between control domains.  The feature was never
forward-ported to cgroup2, and it hasn't been missed.

Despite it being a niche usecase, the maintenance overhead of supporting
it is enormous.  Because pages are moved while they are live and subject
to various MM operations, the synchronization rules are complicated.
There are lock_page_memcg() in MM and FS code, which non-cgroup people
don't understand.  In some cases we've been able to shift code and cgroup
API calls around such that we can rely on native locking as much as
possible.  But that's fragile, and sometimes we need to hold MM locks for
longer than we otherwise would (pte lock e.g.).

Mark the feature deprecated. Hopefully we can remove it soon.

And backport into -stable kernels so that people who develop against
earlier kernels are warned about this deprecation as early as possible.

[akpm@linux-foundation.org: fix memory.rst underlining]
Link: https://lkml.kernel.org/r/Y5COd+qXwk/S+n8N@cmpxchg.org
Signed-off-by: Johannes Weiner <hannes@cmpxchg.org>
Acked-by: Shakeel Butt <shakeelb@google.com>
Acked-by: Hugh Dickins <hughd@google.com>
Acked-by: Michal Hocko <mhocko@suse.com>
Cc: Muchun Song <songmuchun@bytedance.com>
Cc: Roman Gushchin <roman.gushchin@linux.dev>
Cc: <stable@vger.kernel.org>
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
2023-03-10 09:40:09 +01:00
..
blkio-controller.rst docs/cgroup-v1/blkio: update for 5.x kernels 2021-06-16 11:32:03 -06:00
cgroups.rst
cpuacct.rst
cpusets.rst mm/doc: editorial pass on page migration 2020-09-09 11:39:46 -06:00
devices.rst
freezer-subsystem.rst
hugetlb.rst hugetlb_cgroup: add hugetlb_cgroup reservation docs 2020-04-02 09:35:32 -07:00
index.rst cgroup: Miscellaneous cgroup documentation. 2021-04-04 13:34:46 -04:00
memcg_test.rst mm/lru: revise the comments of lru_lock 2020-12-15 14:48:04 -08:00
memory.rst mm: memcontrol: deprecate charge moving 2023-03-10 09:40:09 +01:00
misc.rst cgroup: Miscellaneous cgroup documentation. 2021-04-04 13:34:46 -04:00
net_cls.rst
net_prio.rst
pids.rst
rdma.rst Documentation/admin-guide: cgroup-v1/rdma: drop doubled word 2020-07-05 14:01:48 -06:00