Commit 993c1aad authored by Wen Congyang's avatar Wen Congyang Committed by Linus Torvalds

memory-hotplug: try to offline the memory twice to avoid dependence

memory can't be offlined when CONFIG_MEMCG is selected.  For example:
there is a memory device on node 1.  The address range is [1G, 1.5G).
You will find 4 new directories memory8, memory9, memory10, and memory11
under the directory /sys/devices/system/memory/.

If CONFIG_MEMCG is selected, we will allocate memory to store page
cgroup when we online pages.  When we online memory8, the memory stored
page cgroup is not provided by this memory device.  But when we online
memory9, the memory stored page cgroup may be provided by memory8.  So
we can't offline memory8 now.  We should offline the memory in the
reversed order.

When the memory device is hotremoved, we will auto offline memory
provided by this memory device.  But we don't know which memory is
onlined first, so offlining memory may fail.  In such case, iterate
twice to offline the memory.  1st iterate: offline every non primary
memory block.  2nd iterate: offline primary (i.e.  first added) memory
block.

This idea is suggested by KOSAKI Motohiro.
Signed-off-by: default avatarWen Congyang <wency@cn.fujitsu.com>
Signed-off-by: default avatarTang Chen <tangchen@cn.fujitsu.com>
Cc: KOSAKI Motohiro <kosaki.motohiro@jp.fujitsu.com>
Cc: Jiang Liu <jiang.liu@huawei.com>
Cc: Jianguo Wu <wujianguo@huawei.com>
Cc: Kamezawa Hiroyuki <kamezawa.hiroyu@jp.fujitsu.com>
Cc: Lai Jiangshan <laijs@cn.fujitsu.com>
Cc: Wu Jianguo <wujianguo@huawei.com>
Cc: Yasuaki Ishimatsu <isimatu.yasuaki@jp.fujitsu.com>
Cc: Ingo Molnar <mingo@elte.hu>
Cc: Thomas Gleixner <tglx@linutronix.de>
Cc: "H. Peter Anvin" <hpa@zytor.com>
Signed-off-by: default avatarAndrew Morton <akpm@linux-foundation.org>
Signed-off-by: default avatarLinus Torvalds <torvalds@linux-foundation.org>
parent a864b9d0
...@@ -1387,10 +1387,13 @@ int remove_memory(u64 start, u64 size) ...@@ -1387,10 +1387,13 @@ int remove_memory(u64 start, u64 size)
unsigned long start_pfn, end_pfn; unsigned long start_pfn, end_pfn;
unsigned long pfn, section_nr; unsigned long pfn, section_nr;
int ret; int ret;
int return_on_error = 0;
int retry = 0;
start_pfn = PFN_DOWN(start); start_pfn = PFN_DOWN(start);
end_pfn = start_pfn + PFN_DOWN(size); end_pfn = start_pfn + PFN_DOWN(size);
repeat:
for (pfn = start_pfn; pfn < end_pfn; pfn += PAGES_PER_SECTION) { for (pfn = start_pfn; pfn < end_pfn; pfn += PAGES_PER_SECTION) {
section_nr = pfn_to_section_nr(pfn); section_nr = pfn_to_section_nr(pfn);
if (!present_section_nr(section_nr)) if (!present_section_nr(section_nr))
...@@ -1409,14 +1412,23 @@ int remove_memory(u64 start, u64 size) ...@@ -1409,14 +1412,23 @@ int remove_memory(u64 start, u64 size)
ret = offline_memory_block(mem); ret = offline_memory_block(mem);
if (ret) { if (ret) {
kobject_put(&mem->dev.kobj); if (return_on_error) {
return ret; kobject_put(&mem->dev.kobj);
return ret;
} else {
retry = 1;
}
} }
} }
if (mem) if (mem)
kobject_put(&mem->dev.kobj); kobject_put(&mem->dev.kobj);
if (retry) {
return_on_error = 1;
goto repeat;
}
return 0; return 0;
} }
#else #else
......
Markdown is supported
0%
or
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment