mbox series

[0/4] Fixup register offsets to support per core L3 DCVS

Message ID 1627581885-32165-1-git-send-email-sibis@codeaurora.org
Headers show
Series Fixup register offsets to support per core L3 DCVS | expand

Message

Sibi Sankar July 29, 2021, 6:04 p.m. UTC
Qualcomm SoCs (starting with SM8350) support per core voting for L3 cache
frequency. The patch series re-arranges the cpufreq register offsets to
allow access for the L3 interconnect to implement per core control i.e.
the first 0x100 is now accessed by the L3 interconnect driver instead.

L3 interconnect provider node on SC7280 SoC:
epss_l3: interconnect@18590000 {
	compatible = "qcom,sc7280-epss-l3";
        reg = <0 0x18590000 0 0x1000>, <0 0x18591000 0 0x100>,
	      <0 0x18592000 0 0x100>, <0 0x18593000 0 0x100>;
	...
};

CPUFREQ node on SC7280 SoC:
cpufreq_hw: cpufreq@18591000 {
	compatible = "qcom,cpufreq-epss";
	reg = <0 0x18591100 0 0x900>,
	      <0 0x18592100 0 0x900>,
	      <0 0x18593100 0 0x900>;
	...
};

The patch series also prevents binding breakage by using the
SM8250/SM8350 EPSS compatible.

Sibi Sankar (4):
  dt-bindings: cpufreq: cpufreq-qcom-hw: Add compatible for SM8250/8350
  cpufreq: qcom: Re-arrange register offsets to support per core L3 DCVS
  arm64: dts: qcom: sc7280: Fixup the cpufreq node
  arm64: dts: qcom: sm8350: Fixup the cpufreq node

 .../bindings/cpufreq/cpufreq-qcom-hw.txt           |  6 +++++-
 arch/arm64/boot/dts/qcom/sc7280.dtsi               |  6 +++---
 arch/arm64/boot/dts/qcom/sm8350.dtsi               |  9 ++++-----
 drivers/cpufreq/qcom-cpufreq-hw.c                  | 23 ++++++++++++++++++----
 4 files changed, 31 insertions(+), 13 deletions(-)

Comments

Rob Herring Aug. 3, 2021, 7:23 p.m. UTC | #1
On Thu, 29 Jul 2021 23:34:42 +0530, Sibi Sankar wrote:
> Re-arranging the register regions to support per core L3 DCVS would lead
> to bindings breakage when using an older dt with a newer kernel. So,
> document the EPSS compatible for SM8250/SM8350 SoCs and use them in the
> CPUFreq-hw driver to prevent such breakages.
> 
> Signed-off-by: Sibi Sankar <sibis@codeaurora.org>
> ---
>  Documentation/devicetree/bindings/cpufreq/cpufreq-qcom-hw.txt | 6 +++++-
>  1 file changed, 5 insertions(+), 1 deletion(-)
> 

Acked-by: Rob Herring <robh@kernel.org>
Stephen Boyd Aug. 4, 2021, 6:56 p.m. UTC | #2
Quoting Sibi Sankar (2021-07-29 11:04:42)
> Re-arranging the register regions to support per core L3 DCVS would lead

> to bindings breakage when using an older dt with a newer kernel. So,

> document the EPSS compatible for SM8250/SM8350 SoCs and use them in the

> CPUFreq-hw driver to prevent such breakages.

>

> Signed-off-by: Sibi Sankar <sibis@codeaurora.org>

> ---


Reviewed-by: Stephen Boyd <swboyd@chromium.org>
Bjorn Andersson Aug. 4, 2021, 11:11 p.m. UTC | #3
On Thu 29 Jul 13:04 CDT 2021, Sibi Sankar wrote:

> Qualcomm SoCs (starting with SM8350) support per core voting for L3 cache
> frequency. So, re-arrange the cpufreq register offsets to allow access for
> the L3 interconnect to implement per core control. Also prevent binding
> breakage caused by register offset shuffling by using the SM8250/SM8350
> EPSS compatible.
> 
> Fixes: 7dbd121a2c58 ("arm64: dts: qcom: sc7280: Add cpufreq hw node")
> Signed-off-by: Sibi Sankar <sibis@codeaurora.org>
> ---
>  drivers/cpufreq/qcom-cpufreq-hw.c | 23 +++++++++++++++++++----
>  1 file changed, 19 insertions(+), 4 deletions(-)
> 
> diff --git a/drivers/cpufreq/qcom-cpufreq-hw.c b/drivers/cpufreq/qcom-cpufreq-hw.c
> index f86859bf76f1..74ef3b38343b 100644
> --- a/drivers/cpufreq/qcom-cpufreq-hw.c
> +++ b/drivers/cpufreq/qcom-cpufreq-hw.c
> @@ -28,6 +28,7 @@ struct qcom_cpufreq_soc_data {
>  	u32 reg_volt_lut;
>  	u32 reg_perf_state;
>  	u8 lut_row_size;
> +	bool skip_enable;

This should probably be called "skip_enable_check".

>  };
>  
>  struct qcom_cpufreq_data {
> @@ -257,19 +258,31 @@ static const struct qcom_cpufreq_soc_data qcom_soc_data = {
>  	.reg_volt_lut = 0x114,
>  	.reg_perf_state = 0x920,
>  	.lut_row_size = 32,
> +	.skip_enable = false,
>  };
>  
>  static const struct qcom_cpufreq_soc_data epss_soc_data = {
> +	.reg_freq_lut = 0x0,
> +	.reg_volt_lut = 0x100,
> +	.reg_perf_state = 0x220,
> +	.lut_row_size = 4,
> +	.skip_enable = true,

This change is not compatible with existing DTBs.

Regards,
Bjorn

> +};
> +
> +static const struct qcom_cpufreq_soc_data epss_sm8250_soc_data = {
>  	.reg_enable = 0x0,
>  	.reg_freq_lut = 0x100,
>  	.reg_volt_lut = 0x200,
>  	.reg_perf_state = 0x320,
>  	.lut_row_size = 4,
> +	.skip_enable = false,
>  };
>  
>  static const struct of_device_id qcom_cpufreq_hw_match[] = {
>  	{ .compatible = "qcom,cpufreq-hw", .data = &qcom_soc_data },
>  	{ .compatible = "qcom,cpufreq-epss", .data = &epss_soc_data },
> +	{ .compatible = "qcom,sm8250-cpufreq-epss", .data = &epss_sm8250_soc_data },
> +	{ .compatible = "qcom,sm8350-cpufreq-epss", .data = &epss_sm8250_soc_data },
>  	{}
>  };
>  MODULE_DEVICE_TABLE(of, qcom_cpufreq_hw_match);
> @@ -334,10 +347,12 @@ static int qcom_cpufreq_hw_cpu_init(struct cpufreq_policy *policy)
>  	data->res = res;
>  
>  	/* HW should be in enabled state to proceed */
> -	if (!(readl_relaxed(base + data->soc_data->reg_enable) & 0x1)) {
> -		dev_err(dev, "Domain-%d cpufreq hardware not enabled\n", index);
> -		ret = -ENODEV;
> -		goto error;
> +	if (!data->soc_data->skip_enable) {
> +		if (!(readl_relaxed(base + data->soc_data->reg_enable) & 0x1)) {
> +			dev_err(dev, "Domain-%d cpufreq hardware not enabled\n", index);
> +			ret = -ENODEV;
> +			goto error;
> +		}
>  	}
>  
>  	qcom_get_related_cpus(index, policy->cpus);
> -- 
> The Qualcomm Innovation Center, Inc. is a member of the Code Aurora Forum,
> a Linux Foundation Collaborative Project
>
Stephen Boyd Aug. 5, 2021, 6:25 p.m. UTC | #4
Quoting Sibi Sankar (2021-08-05 10:47:20)
> Stephen,

>

> Thanks for taking time to review

> the series.

>

> On 2021-08-05 00:31, Stephen Boyd wrote:

> > Quoting Sibi Sankar (2021-07-29 11:04:43)

> >> Qualcomm SoCs (starting with SM8350) support per core voting for L3

> >> cache

> >> frequency.

> >

> > And the L3 cache frequency voting code can't be put into this cpufreq

> > driver?

>

> Yes, it could have gone either into

> the cpufreq driver or l3 interconnect

> provider driver. Taniya/Odelu preferred

> the latter, because of the need for other

> clients to vote for l3 frequencies in

> the future.


What other clients are those?

> The other option to prevent

> register re-arrangement would involve

> using syscons from the cpufreq node, which

> really wasn't necessary since there

> wasn't any register overlap between the

> two drivers.


Let's not do that.

>

> >

> >> So, re-arrange the cpufreq register offsets to allow access for

> >> the L3 interconnect to implement per core control. Also prevent

> >> binding

> >> breakage caused by register offset shuffling by using the

> >> SM8250/SM8350

> >> EPSS compatible.

> >>

> >> Fixes: 7dbd121a2c58 ("arm64: dts: qcom: sc7280: Add cpufreq hw node")

> >> Signed-off-by: Sibi Sankar <sibis@codeaurora.org>

> >> ---

> >>  drivers/cpufreq/qcom-cpufreq-hw.c | 23 +++++++++++++++++++----

> >>  1 file changed, 19 insertions(+), 4 deletions(-)

> >>

> >> diff --git a/drivers/cpufreq/qcom-cpufreq-hw.c

> >> b/drivers/cpufreq/qcom-cpufreq-hw.c

> >> index f86859bf76f1..74ef3b38343b 100644

> >> --- a/drivers/cpufreq/qcom-cpufreq-hw.c

> >> +++ b/drivers/cpufreq/qcom-cpufreq-hw.c

> >> @@ -28,6 +28,7 @@ struct qcom_cpufreq_soc_data {

> >>         u32 reg_volt_lut;

> >>         u32 reg_perf_state;

> >>         u8 lut_row_size;

> >> +       bool skip_enable;

> >>  };

> >>

> >>  struct qcom_cpufreq_data {

> >> @@ -257,19 +258,31 @@ static const struct qcom_cpufreq_soc_data

> >> qcom_soc_data = {

> >>         .reg_volt_lut = 0x114,

> >>         .reg_perf_state = 0x920,

> >>         .lut_row_size = 32,

> >> +       .skip_enable = false,

> >>  };

> >>

> >>  static const struct qcom_cpufreq_soc_data epss_soc_data = {

> >> +       .reg_freq_lut = 0x0,

> >> +       .reg_volt_lut = 0x100,

> >> +       .reg_perf_state = 0x220,

> >> +       .lut_row_size = 4,

> >> +       .skip_enable = true,

> >> +};

> >> +

> >> +static const struct qcom_cpufreq_soc_data epss_sm8250_soc_data = {

> >>         .reg_enable = 0x0,

> >>         .reg_freq_lut = 0x100,

> >>         .reg_volt_lut = 0x200,

> >>         .reg_perf_state = 0x320,

> >>         .lut_row_size = 4,

> >> +       .skip_enable = false,

> >>  };

> >>

> >>  static const struct of_device_id qcom_cpufreq_hw_match[] = {

> >>         { .compatible = "qcom,cpufreq-hw", .data = &qcom_soc_data },

> >>         { .compatible = "qcom,cpufreq-epss", .data = &epss_soc_data },

> >> +       { .compatible = "qcom,sm8250-cpufreq-epss", .data =

> >> &epss_sm8250_soc_data },

> >> +       { .compatible = "qcom,sm8350-cpufreq-epss", .data =

> >> &epss_sm8250_soc_data },

> >>         {}

> >>  };

> >>  MODULE_DEVICE_TABLE(of, qcom_cpufreq_hw_match);

> >> @@ -334,10 +347,12 @@ static int qcom_cpufreq_hw_cpu_init(struct

> >> cpufreq_policy *policy)

> >>         data->res = res;

> >>

> >>         /* HW should be in enabled state to proceed */

> >

> > It looks odd that we're no longer making sure that the clk domain is

> > enabled when we probe the driver. Why is that OK?

>

> On newer EPSS hw it's no longer

> required to perform the additional

> hw enable check. IIRC we don't do

> that on corresponding downstream

> kernels as well.


It's fairly clear that we no longer perform the additional check. The
question is why that's OK.
Sibi Sankar Aug. 6, 2021, 6:42 a.m. UTC | #5
On 2021-08-05 23:55, Stephen Boyd wrote:
> Quoting Sibi Sankar (2021-08-05 10:47:20)

>> Stephen,

>> 

>> Thanks for taking time to review

>> the series.

>> 

>> On 2021-08-05 00:31, Stephen Boyd wrote:

>> > Quoting Sibi Sankar (2021-07-29 11:04:43)

>> >> Qualcomm SoCs (starting with SM8350) support per core voting for L3

>> >> cache

>> >> frequency.

>> >

>> > And the L3 cache frequency voting code can't be put into this cpufreq

>> > driver?

>> 

>> Yes, it could have gone either into

>> the cpufreq driver or l3 interconnect

>> provider driver. Taniya/Odelu preferred

>> the latter, because of the need for other

>> clients to vote for l3 frequencies in

>> the future.

> 

> What other clients are those?


https://lore.kernel.org/lkml/20190814152116.GB28465@jcrouse1-lnx.qualcomm.com/

GPU was supposed to be one of the
other clients that would vote for
l3.

> 

>> The other option to prevent

>> register re-arrangement would involve

>> using syscons from the cpufreq node, which

>> really wasn't necessary since there

>> wasn't any register overlap between the

>> two drivers.

> 

> Let's not do that.

> 

>> 

>> >

>> >> So, re-arrange the cpufreq register offsets to allow access for

>> >> the L3 interconnect to implement per core control. Also prevent

>> >> binding

>> >> breakage caused by register offset shuffling by using the

>> >> SM8250/SM8350

>> >> EPSS compatible.

>> >>

>> >> Fixes: 7dbd121a2c58 ("arm64: dts: qcom: sc7280: Add cpufreq hw node")

>> >> Signed-off-by: Sibi Sankar <sibis@codeaurora.org>

>> >> ---

>> >>  drivers/cpufreq/qcom-cpufreq-hw.c | 23 +++++++++++++++++++----

>> >>  1 file changed, 19 insertions(+), 4 deletions(-)

>> >>

>> >> diff --git a/drivers/cpufreq/qcom-cpufreq-hw.c

>> >> b/drivers/cpufreq/qcom-cpufreq-hw.c

>> >> index f86859bf76f1..74ef3b38343b 100644

>> >> --- a/drivers/cpufreq/qcom-cpufreq-hw.c

>> >> +++ b/drivers/cpufreq/qcom-cpufreq-hw.c

>> >> @@ -28,6 +28,7 @@ struct qcom_cpufreq_soc_data {

>> >>         u32 reg_volt_lut;

>> >>         u32 reg_perf_state;

>> >>         u8 lut_row_size;

>> >> +       bool skip_enable;

>> >>  };

>> >>

>> >>  struct qcom_cpufreq_data {

>> >> @@ -257,19 +258,31 @@ static const struct qcom_cpufreq_soc_data

>> >> qcom_soc_data = {

>> >>         .reg_volt_lut = 0x114,

>> >>         .reg_perf_state = 0x920,

>> >>         .lut_row_size = 32,

>> >> +       .skip_enable = false,

>> >>  };

>> >>

>> >>  static const struct qcom_cpufreq_soc_data epss_soc_data = {

>> >> +       .reg_freq_lut = 0x0,

>> >> +       .reg_volt_lut = 0x100,

>> >> +       .reg_perf_state = 0x220,

>> >> +       .lut_row_size = 4,

>> >> +       .skip_enable = true,

>> >> +};

>> >> +

>> >> +static const struct qcom_cpufreq_soc_data epss_sm8250_soc_data = {

>> >>         .reg_enable = 0x0,

>> >>         .reg_freq_lut = 0x100,

>> >>         .reg_volt_lut = 0x200,

>> >>         .reg_perf_state = 0x320,

>> >>         .lut_row_size = 4,

>> >> +       .skip_enable = false,

>> >>  };

>> >>

>> >>  static const struct of_device_id qcom_cpufreq_hw_match[] = {

>> >>         { .compatible = "qcom,cpufreq-hw", .data = &qcom_soc_data },

>> >>         { .compatible = "qcom,cpufreq-epss", .data = &epss_soc_data },

>> >> +       { .compatible = "qcom,sm8250-cpufreq-epss", .data =

>> >> &epss_sm8250_soc_data },

>> >> +       { .compatible = "qcom,sm8350-cpufreq-epss", .data =

>> >> &epss_sm8250_soc_data },

>> >>         {}

>> >>  };

>> >>  MODULE_DEVICE_TABLE(of, qcom_cpufreq_hw_match);

>> >> @@ -334,10 +347,12 @@ static int qcom_cpufreq_hw_cpu_init(struct

>> >> cpufreq_policy *policy)

>> >>         data->res = res;

>> >>

>> >>         /* HW should be in enabled state to proceed */

>> >

>> > It looks odd that we're no longer making sure that the clk domain is

>> > enabled when we probe the driver. Why is that OK?

>> 

>> On newer EPSS hw it's no longer

>> required to perform the additional

>> hw enable check. IIRC we don't do

>> that on corresponding downstream

>> kernels as well.

> 

> It's fairly clear that we no longer perform the additional check. The

> question is why that's OK.


Taniya probably would know more
about the history behind the change.
I'll dig up more info regarding ^^
and update the thread.


-- 
Qualcomm Innovation Center, Inc. is a member of Code Aurora Forum,
a Linux Foundation Collaborative Project.