[1/3] topology: Represent clusters of CPUs within a die

From: Jonathan Cameron <Jonathan.Cameron@huawei.com>

From: Jonathan Cameron <Jonathan.Cameron@huawei.com>

Both ACPI and DT provide the ability to describe additional layers of
topology between that of individual cores and higher level constructs
such as the level at which the last level cache is shared.
In ACPI this can be represented in PPTT as a Processor Hierarchy
Node Structure [1] that is the parent of the CPU cores and in turn
has a parent Processor Hierarchy Nodes Structure representing
a higher level of topology.

For example Kunpeng 920 has 6 or 8 clusters in each NUMA node, and each
cluster has 4 cpus. All clusters share L3 cache data, but each cluster
has local L3 tag. On the other hand, each clusters will share some
internal system bus.

+-----------------------------------+                          +---------+
|  +------+    +------+            +---------------------------+         |
|  | CPU0 |    | cpu1 |             |    +-----------+         |         |
|  +------+    +------+             |    |           |         |         |
|                                   +----+    L3     |         |         |
|  +------+    +------+   cluster   |    |    tag    |         |         |
|  | CPU2 |    | CPU3 |             |    |           |         |         |
|  +------+    +------+             |    +-----------+         |         |
|                                   |                          |         |
+-----------------------------------+                          |         |
+-----------------------------------+                          |         |
|  +------+    +------+             +--------------------------+         |
|  |      |    |      |             |    +-----------+         |         |
|  +------+    +------+             |    |           |         |         |
|                                   |    |    L3     |         |         |
|  +------+    +------+             +----+    tag    |         |         |
|  |      |    |      |             |    |           |         |         |
|  +------+    +------+             |    +-----------+         |         |
|                                   |                          |         |
+-----------------------------------+                          |   L3    |
                                                               |   data  |
+-----------------------------------+                          |         |
|  +------+    +------+             |    +-----------+         |         |
|  |      |    |      |             |    |           |         |         |
|  +------+    +------+             +----+    L3     |         |         |
|                                   |    |    tag    |         |         |
|  +------+    +------+             |    |           |         |         |
|  |      |    |      |            ++    +-----------+         |         |
|  +------+    +------+            |---------------------------+         |
+-----------------------------------|                          |         |
+-----------------------------------|                          |         |
|  +------+    +------+            +---------------------------+         |
|  |      |    |      |             |    +-----------+         |         |
|  +------+    +------+             |    |           |         |         |
|                                   +----+    L3     |         |         |
|  +------+    +------+             |    |    tag    |         |         |
|  |      |    |      |             |    |           |         |         |
|  +------+    +------+             |    +-----------+         |         |
|                                   |                          |         |
+-----------------------------------+                          |         |
+-----------------------------------+                          |         |
|  +------+    +------+             +--------------------------+         |
|  |      |    |      |             |   +-----------+          |         |
|  +------+    +------+             |   |           |          |         |
|                                   |   |    L3     |          |         |
|  +------+    +------+             +---+    tag    |          |         |
|  |      |    |      |             |   |           |          |         |
|  +------+    +------+             |   +-----------+          |         |
|                                   |                          |         |
+-----------------------------------+                          |         |
+-----------------------------------+                         ++         |
|  +------+    +------+             +--------------------------+         |
|  |      |    |      |             |  +-----------+           |         |
|  +------+    +------+             |  |           |           |         |
|                                   |  |    L3     |           |         |
|  +------+    +------+             +--+    tag    |           |         |
|  |      |    |      |             |  |           |           |         |
|  +------+    +------+             |  +-----------+           |         |
|                                   |                          +---------+
+-----------------------------------+

That means spreading tasks among clusters will bring more bandwidth
while packing tasks within one cluster will lead to smaller cache
synchronization latency. So both kernel and userspace will have
a chance to leverage this topology to deploy tasks accordingly to
achieve either smaller cache latency within one cluster or an even
distribution of load among clusters for higher throughput.

This patch exposes cluster topology to both kernel and userspace.
Libraried like hwloc will know cluster by cluster_cpus and related
sysfs attributes. PoC of HWLOC support at [2].

Note this patch only handle the ACPI case.

Special consideration is needed for SMT processors, where it is
necessary to move 2 levels up the hierarchy from the leaf nodes
(thus skipping the processor core level).

Note that arm64 / ACPI does not provide any means of identifying
a die level in the topology but that may be unrelate to the cluster
level.

[1] ACPI Specification 6.3 - section 5.2.29.1 processor hierarchy node
    structure (Type 0)
[2] https://github.com/hisilicon/hwloc/tree/linux-cluster

Signed-off-by: Jonathan Cameron <Jonathan.Cameron@huawei.com>

Signed-off-by: Tian Tao <tiantao6@hisilicon.com>

Signed-off-by: Barry Song <song.bao.hua@hisilicon.com>

---
 .../ABI/stable/sysfs-devices-system-cpu       | 15 +++++
 Documentation/admin-guide/cputopology.rst     | 12 ++--
 arch/arm64/kernel/topology.c                  |  2 +
 drivers/acpi/pptt.c                           | 67 +++++++++++++++++++
 drivers/base/arch_topology.c                  | 14 ++++
 drivers/base/topology.c                       | 10 +++
 include/linux/acpi.h                          |  5 ++
 include/linux/arch_topology.h                 |  5 ++
 include/linux/topology.h                      |  6 ++
 9 files changed, 132 insertions(+), 4 deletions(-)

-- 
2.25.1

Message ID	20210820013008.12881-2-21cnbao@gmail.com
State	New
Headers	show Delivered-To: patch@linaro.org Received: by 2002:a02:6f15:0:0:0:0:0 with SMTP id x21csp1074299jab; Thu, 19 Aug 2021 18:31:10 -0700 (PDT) X-Google-Smtp-Source: ABdhPJz+85RyUV2cW224NbHKZQcgGK0+2592CZH7v+4cZDTT0Kdn++275xUs7+mU2aCAw+TqP2Sp X-Received: by 2002:a02:946d:: with SMTP id a100mr15137304jai.118.1629423069966; Thu, 19 Aug 2021 18:31:09 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1629423069; cv=none; d=google.com; s=arc-20160816; b=k/uSbq0v6ouHton0uI6SuangycOu/dgkrBN9Ip+mIBf4jvjGpEX6r+q6g+hWPDWbb3 nTyaYh2QYM6aGnCjSRpUQ17bP/LiBSj9i6wOjPbLqgjZ3hV4ekSKv6R7Ce4lJAPMTwHS xWGuYrEgSEfYvRsrSga4s6/rAhYkJk/TwK7iDkusCE56VmODKuDs9jVxUCcIa5FT3TGP 4AjYG8XxVKhWAl4Mlrc2o0UPPBCucovZV677K2KMdn3BULsj8yKtJgmn+YwSzalyoc1g 5XEhMjxv12uid54zLwuOt1Zf/FKTV9Xov1WATk0QhF+yPx6W7KCeW8gGcAqLn8XhkAdw dxLw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:mime-version :references:in-reply-to:message-id:date:subject:cc:to:from :dkim-signature; bh=7O2k6+cYDuSSHjxTEm0czGO0EPIJVmQZ/1JyW8Uu9YY=; b=KqOjQMvkSZsoXCGGNMHtSoSgkrxhK1b5Vj6qt91Ar5Ntv6US83sIwvBPK7CPMlocT8 IR/1z6kA3GWI4rgmAW19m44VknGMRxcRR0wqNB6RQ2jiYrAZw7tz8MUePS6g042eS2Jj mXsGUA8AvQKNqM8xed4xyFULveXYbZu7op0xuvNI5dmNMcfFsy0WybGBspFkfX85+TTF 5t+zLTAnyo8meqymt7nCivDpdpvtqKZAl2WMLAUigDunkEEDR8DH2T0h8ihvQb2T14oh 4eCGYc8OTcTnqGJX3KawouATDZajSnfa6xe+l/dX3eCGBZhwoGyq0cS+FwIzGKwTkUG0 +2Rw== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@gmail.com header.s=20161025 header.b=mPAL2jB8; spf=pass (google.com: domain of linux-acpi-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-acpi-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Return-Path: <linux-acpi-owner@vger.kernel.org> Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id 13si5377505ilt.16.2021.08.19.18.31.09; Thu, 19 Aug 2021 18:31:09 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-acpi-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@gmail.com header.s=20161025 header.b=mPAL2jB8; spf=pass (google.com: domain of linux-acpi-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-acpi-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S232191AbhHTBbp (ORCPT <rfc822;patch@linaro.org> + 3 others); Thu, 19 Aug 2021 21:31:45 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:55104 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S236013AbhHTBbp (ORCPT <rfc822; linux-acpi@vger.kernel.org>); Thu, 19 Aug 2021 21:31:45 -0400 Received: from mail-pj1-x102b.google.com (mail-pj1-x102b.google.com [IPv6:2607:f8b0:4864:20::102b]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 3EF50C061756; Thu, 19 Aug 2021 18:31:08 -0700 (PDT) Received: by mail-pj1-x102b.google.com with SMTP id w13-20020a17090aea0db029017897a5f7bcso6136118pjy.5; Thu, 19 Aug 2021 18:31:08 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=from:to:cc:subject:date:message-id:in-reply-to:references :mime-version:content-transfer-encoding; bh=7O2k6+cYDuSSHjxTEm0czGO0EPIJVmQZ/1JyW8Uu9YY=; b=mPAL2jB8xV1D3a2bqFDAnXEgnAB8Ny+6BQxbQiO9WNQaeYllNA5MzXgaQPUlDERJce EsS5w0k6tyyi4kwxwjvlVDw0u8Eg8NMscfGF4DH8pkaUf3Mt80ZvC8nPEN/ZKKvz/okg 7tkIeVoNWDNKECKZsyjEonbxrIcLMsWBKWaclIwncigkHR5B3KzpZZoDPPul+9FutZRn Fj9Weck7IfZEyvIedF+vMOr6IiKMe/NSQis33Y0HrpdWUF+BqskwWgejlzd6DskLsehi TTfwZ9mpDhIM+5Vz3jCB1QIKTIVEL88BP3pLyg6GQC8mXh3k/3R9/OO2NUW+jEaD2dVN e2Bg== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:from:to:cc:subject:date:message-id:in-reply-to :references:mime-version:content-transfer-encoding; bh=7O2k6+cYDuSSHjxTEm0czGO0EPIJVmQZ/1JyW8Uu9YY=; b=kA9m4tKRMUyf1OLOfE5TZA9mHGS0gocF2zm59Op1aFwnQhUoTt1ZKR0nySpsU6MuuO aFAa3ylYua8atfpn3x+thQELxqjJwUai5uoNPy3TNQ/Y1jleAKr1ygWph/N/TDfii/cx OrqrjXIy34emp7cU91pHwYuYtohwTVQCjszU43VbOj0wlqvIFW5zDrit6G96bovilpIg M+p7ea3G+x1FrcYQKwGpmetU33IlCdKOmsEF41u9+C//wCJsKhU3+ZUQgVjf/eLecyG6 rUkYb+Ghv/8ZtfmlKEeqg+mCxYavXr39sCiRN2cY31CLwsO3kISgLlmgwhjHaaJUGhj2 8cyw== X-Gm-Message-State: AOAM5302l5V3rBDu36aDats/ZMTKsHjPATK0VcthhRT9OpX1IjBHfs3y vDWJ7kaFcWPufH+Yt59QfO0= X-Received: by 2002:a17:90b:3805:: with SMTP id mq5mr1857519pjb.207.1629423067702; Thu, 19 Aug 2021 18:31:07 -0700 (PDT) Received: from localhost.localdomain ([2407:7000:8916:5000:d23:7118:c805:b5a5]) by smtp.gmail.com with ESMTPSA id 66sm4877950pfu.67.2021.08.19.18.30.54 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Thu, 19 Aug 2021 18:31:07 -0700 (PDT) From: Barry Song <21cnbao@gmail.com> To: bp@alien8.de, catalin.marinas@arm.com, dietmar.eggemann@arm.com, gregkh@linuxfoundation.org, hpa@zytor.com, juri.lelli@redhat.com, bristot@redhat.com, lenb@kernel.org, mgorman@suse.de, mingo@redhat.com, peterz@infradead.org, rjw@rjwysocki.net, sudeep.holla@arm.com, tglx@linutronix.de Cc: aubrey.li@linux.intel.com, bsegall@google.com, guodong.xu@linaro.org, jonathan.cameron@huawei.com, liguozhu@hisilicon.com, linux-acpi@vger.kernel.org, linux-arm-kernel@lists.infradead.org, linux-kernel@vger.kernel.org, mark.rutland@arm.com, msys.mizuma@gmail.com, prime.zeng@hisilicon.com, rostedt@goodmis.org, tim.c.chen@linux.intel.com, valentin.schneider@arm.com, vincent.guittot@linaro.org, will@kernel.org, x86@kernel.org, xuwei5@huawei.com, yangyicong@huawei.com, linuxarm@huawei.com, Jonathan Cameron <Jonathan.Cameron@huawei.com>, Tian Tao <tiantao6@hisilicon.com>, Barry Song <song.bao.hua@hisilicon.com> Subject: [PATCH 1/3] topology: Represent clusters of CPUs within a die Date: Fri, 20 Aug 2021 13:30:06 +1200 Message-Id: <20210820013008.12881-2-21cnbao@gmail.com> X-Mailer: git-send-email 2.25.1 In-Reply-To: <20210820013008.12881-1-21cnbao@gmail.com> References: <20210820013008.12881-1-21cnbao@gmail.com> MIME-Version: 1.0 Content-Transfer-Encoding: 8bit Precedence: bulk List-ID: <linux-acpi.vger.kernel.org> X-Mailing-List: linux-acpi@vger.kernel.org
Series	Represent cluster topology and enable load balance between clusters \| expand [0/3] Represent cluster topology and enable load balance between clusters [1/3] topology: Represent clusters of CPUs within a die [2/3] scheduler: Add cluster scheduler level in core and related Kconfig for ARM64

[1/3] topology: Represent clusters of CPUs within a die

Commit Message

Comments

Patch