cpufreq: scmi: Skip SCMI devices that aren't used by the CPUs

Message ID 20250411212941.1275572-1-quic_mdtipton@quicinc.com
State New

Commit Message

Mike Tipton April 11, 2025, 9:29 p.m. UTC
Currently, all SCMI devices with performance domains attempt to register
a cpufreq driver, even if their performance domains aren't used to
control the CPUs. The cpufreq framework only supports registering a
single driver, so only the first device will succeed. And if that device
isn't used for the CPUs, then cpufreq will scale the wrong domains.

To avoid this, return early from scmi_cpufreq_probe() if the probing
SCMI device isn't referenced by the CPU device phandles.

This keeps the existing assumption that all CPUs are controlled by a
single SCMI device.

Signed-off-by: Mike Tipton <quic_mdtipton@quicinc.com>
---
 drivers/cpufreq/scmi-cpufreq.c | 29 +++++++++++++++++++++++++++++
 1 file changed, 29 insertions(+)

Comments

Sudeep Holla April 14, 2025, 8:23 a.m. UTC | #1
Hi Peng,

On Mon, Apr 14, 2025 at 04:38:32PM +0800, Peng Fan wrote:
> Hi Mike,
> On Fri, Apr 11, 2025 at 02:29:41PM -0700, Mike Tipton wrote:
> >Currently, all SCMI devices with performance domains attempt to register
> >a cpufreq driver,
> 
> The scmi cpufreq device is created based on entry
> { SCMI_PROTOCOL_PERF, "cpufreq" },
> 
> So the scmi-cpufreq driver could only probe the upper single device.
> 
> How could the driver work with all SCMI devices with performance domains?
> 

IIUC, this is on a system with multiple SCMI servers/providers some of
which don't deal with CPU performance domains at all.
Peng Fan April 14, 2025, 8:38 a.m. UTC | #2
Hi Mike,
On Fri, Apr 11, 2025 at 02:29:41PM -0700, Mike Tipton wrote:
>Currently, all SCMI devices with performance domains attempt to register
>a cpufreq driver,

The scmi cpufreq device is created based on entry
{ SCMI_PROTOCOL_PERF, "cpufreq" },

So the scmi-cpufreq driver could only probe the upper single device.

How could the driver work with all SCMI devices with performance domains?

Thanks,
Peng

>even if their performance domains aren't used to
>control the CPUs. The cpufreq framework only supports registering a
>single driver, so only the first device will succeed. And if that device
>isn't used for the CPUs, then cpufreq will scale the wrong domains.
>
>To avoid this, return early from scmi_cpufreq_probe() if the probing
>SCMI device isn't referenced by the CPU device phandles.
>
>This keeps the existing assumption that all CPUs are controlled by a
>single SCMI device.
>
>Signed-off-by: Mike Tipton <quic_mdtipton@quicinc.com>
>---
> drivers/cpufreq/scmi-cpufreq.c | 29 +++++++++++++++++++++++++++++
> 1 file changed, 29 insertions(+)
>
>diff --git a/drivers/cpufreq/scmi-cpufreq.c b/drivers/cpufreq/scmi-cpufreq.c
>index 944e899eb1be..7981a879974b 100644
>--- a/drivers/cpufreq/scmi-cpufreq.c
>+++ b/drivers/cpufreq/scmi-cpufreq.c
>@@ -393,6 +393,32 @@ static struct cpufreq_driver scmi_cpufreq_driver = {
> 	.set_boost	= cpufreq_boost_set_sw,
> };
> 
>+static bool scmi_dev_used_by_cpus(struct device *scmi_dev)
>+{
>+	struct device_node *scmi_np = scmi_dev->of_node;
>+	struct device_node *np;
>+	struct device *cpu_dev;
>+	int cpu, idx;
>+
>+	for_each_possible_cpu(cpu) {
>+		cpu_dev = get_cpu_device(cpu);
>+		if (!cpu_dev)
>+			continue;
>+
>+		np = cpu_dev->of_node;
>+
>+		if (of_parse_phandle(np, "clocks", 0) == scmi_np)
>+			return true;
>+
>+		idx = of_property_match_string(np, "power-domain-names", "perf");
>+
>+		if (of_parse_phandle(np, "power-domains", idx) == scmi_np)
>+			return true;
>+	}
>+
>+	return false;
>+}
>+
> static int scmi_cpufreq_probe(struct scmi_device *sdev)
> {
> 	int ret;
>@@ -404,6 +430,9 @@ static int scmi_cpufreq_probe(struct scmi_device *sdev)
> 	if (!handle)
> 		return -ENODEV;
> 
>+	if (!scmi_dev_used_by_cpus(dev))
>+		return 0;
>+
> 	scmi_cpufreq_driver.driver_data = sdev;
> 
> 	perf_ops = handle->devm_protocol_get(sdev, SCMI_PROTOCOL_PERF, &ph);
>-- 
>2.34.1
>
Peng Fan April 14, 2025, 10:28 a.m. UTC | #3
Hi Sudeep,
On Mon, Apr 14, 2025 at 09:23:24AM +0100, Sudeep Holla wrote:
>Hi Peng,
>
>On Mon, Apr 14, 2025 at 04:38:32PM +0800, Peng Fan wrote:
>> Hi Mike,
>> On Fri, Apr 11, 2025 at 02:29:41PM -0700, Mike Tipton wrote:
>> >Currently, all SCMI devices with performance domains attempt to register
>> >a cpufreq driver,
>> 
>> The scmi cpufreq device is created based on entry
>> { SCMI_PROTOCOL_PERF, "cpufreq" },
>> 
>> So the scmi-cpufreq driver could only probe the upper single device.
>> 
>> How could the driver work with all SCMI devices with performance domains?
>> 
>
>IIUC, this is on a system with multiple SCMI servers/providers some of
>which don't deal with CPU performance domains at all.

Yeah. This sounds like a valid case.
CPU perf only needs to be managed by one server; the other server
also has performance domains, but they are only for peripherals.

Thanks,
Peng

>
>-- 
>Regards,
>Sudeep
Mike Tipton April 14, 2025, 3:34 p.m. UTC | #4
On Mon, Apr 14, 2025 at 06:28:14PM +0800, Peng Fan wrote:
> Hi Sudeep,
> On Mon, Apr 14, 2025 at 09:23:24AM +0100, Sudeep Holla wrote:
> >Hi Peng,
> >
> >On Mon, Apr 14, 2025 at 04:38:32PM +0800, Peng Fan wrote:
> >> Hi Mike,
> >> On Fri, Apr 11, 2025 at 02:29:41PM -0700, Mike Tipton wrote:
> >> >Currently, all SCMI devices with performance domains attempt to register
> >> >a cpufreq driver,
> >> 
> >> The scmi cpufreq device is created based on entry
> >> { SCMI_PROTOCOL_PERF, "cpufreq" },
> >> 
> >> So the scmi-cpufreq driver could only probe the upper single device.
> >> 
> >> How could the driver work with all SCMI devices with performance domains?
> >> 
> >
> >IIUC, this is on a system with multiple SCMI servers/providers some of
> >which don't deal with CPU performance domains at all.
> 
> Yeah. This sounds like a valid case.
> CPU perf only needs to be managed by one server; the other server
> also has performance domains, but they are only for peripherals.

Yeah, this is the case we're trying to fix.

> 
> Thanks,
> Peng
> 
> >
> >-- 
> >Regards,
> >Sudeep
Mike Tipton April 15, 2025, 4:44 p.m. UTC | #5
On Tue, Apr 15, 2025 at 05:06:55PM +0800, Peng Fan wrote:
> On Fri, Apr 11, 2025 at 02:29:41PM -0700, Mike Tipton wrote:
> >Currently, all SCMI devices with performance domains attempt to register
> >a cpufreq driver, even if their performance domains aren't used to
> >control the CPUs. The cpufreq framework only supports registering a
> >single driver, so only the first device will succeed. And if that device
> >isn't used for the CPUs, then cpufreq will scale the wrong domains.
> >
> >To avoid this, return early from scmi_cpufreq_probe() if the probing
> >SCMI device isn't referenced by the CPU device phandles.
> >
> >This keeps the existing assumption that all CPUs are controlled by a
> >single SCMI device.
> >
> >Signed-off-by: Mike Tipton <quic_mdtipton@quicinc.com>
> >---
> > drivers/cpufreq/scmi-cpufreq.c | 29 +++++++++++++++++++++++++++++
> > 1 file changed, 29 insertions(+)
> >
> >diff --git a/drivers/cpufreq/scmi-cpufreq.c b/drivers/cpufreq/scmi-cpufreq.c
> >index 944e899eb1be..7981a879974b 100644
> >--- a/drivers/cpufreq/scmi-cpufreq.c
> >+++ b/drivers/cpufreq/scmi-cpufreq.c
> >@@ -393,6 +393,32 @@ static struct cpufreq_driver scmi_cpufreq_driver = {
> > 	.set_boost	= cpufreq_boost_set_sw,
> > };
> > 
> >+static bool scmi_dev_used_by_cpus(struct device *scmi_dev)
> >+{
> >+	struct device_node *scmi_np = scmi_dev->of_node;
> >+	struct device_node *np;
> >+	struct device *cpu_dev;
> >+	int cpu, idx;
> >+
> >+	for_each_possible_cpu(cpu) {
> >+		cpu_dev = get_cpu_device(cpu);
> >+		if (!cpu_dev)
> >+			continue;
> >+
> >+		np = cpu_dev->of_node;
> >+
> >+		if (of_parse_phandle(np, "clocks", 0) == scmi_np)
> >+			return true;
> >+
> >+		idx = of_property_match_string(np, "power-domain-names", "perf");
> >+
> >+		if (of_parse_phandle(np, "power-domains", idx) == scmi_np)
> >+			return true;
> >+	}
> >+
> >+	return false;
> >+}
> >+
> > static int scmi_cpufreq_probe(struct scmi_device *sdev)
> > {
> > 	int ret;
> >@@ -404,6 +430,9 @@ static int scmi_cpufreq_probe(struct scmi_device *sdev)
> > 	if (!handle)
> > 		return -ENODEV;
> > 
> >+	if (!scmi_dev_used_by_cpus(dev))
> >+		return 0;
> 
> Should 'return -ENOTSUPP' be used here?
> There is no need to mark the probe success.

Returning -ENOTSUPP will add noise in the logs from probe failures, for
example:

    scmi-cpufreq scmi_dev.4: probe with driver scmi-cpufreq failed with error -524

These are "expected" failures, so this would be misleading. However, we
could return -ENODEV instead which doesn't log anything by default. It
uses a dev_dbg() in that case:

    scmi-cpufreq scmi_dev.4: probe with driver scmi-cpufreq rejects match -19

Returning -ENODEV seems more appropriate. I can make that change.
Patch

diff --git a/drivers/cpufreq/scmi-cpufreq.c b/drivers/cpufreq/scmi-cpufreq.c
index 944e899eb1be..7981a879974b 100644
--- a/drivers/cpufreq/scmi-cpufreq.c
+++ b/drivers/cpufreq/scmi-cpufreq.c
@@ -393,6 +393,32 @@  static struct cpufreq_driver scmi_cpufreq_driver = {
 	.set_boost	= cpufreq_boost_set_sw,
 };
 
+static bool scmi_dev_used_by_cpus(struct device *scmi_dev)
+{
+	struct device_node *scmi_np = scmi_dev->of_node;
+	struct device_node *np;
+	struct device *cpu_dev;
+	int cpu, idx;
+
+	for_each_possible_cpu(cpu) {
+		cpu_dev = get_cpu_device(cpu);
+		if (!cpu_dev)
+			continue;
+
+		np = cpu_dev->of_node;
+
+		if (of_parse_phandle(np, "clocks", 0) == scmi_np)
+			return true;
+
+		idx = of_property_match_string(np, "power-domain-names", "perf");
+
+		if (of_parse_phandle(np, "power-domains", idx) == scmi_np)
+			return true;
+	}
+
+	return false;
+}
+
 static int scmi_cpufreq_probe(struct scmi_device *sdev)
 {
 	int ret;
@@ -404,6 +430,9 @@  static int scmi_cpufreq_probe(struct scmi_device *sdev)
 	if (!handle)
 		return -ENODEV;
 
+	if (!scmi_dev_used_by_cpus(dev))
+		return 0;
+
 	scmi_cpufreq_driver.driver_data = sdev;
 
 	perf_ops = handle->devm_protocol_get(sdev, SCMI_PROTOCOL_PERF, &ph);