[v3,3/5] coresight: add support for debug module

Message ID	1488520809-31670-4-git-send-email-leo.yan@linaro.org
State	New
Headers	show Delivered-To: patch@linaro.org Received-SPF: pass (google.com: best guess record for domain of devicetree-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; From: Leo Yan <leo.yan@linaro.org> To: Rob Herring <robh+dt@kernel.org>, Mark Rutland <mark.rutland@arm.com>, Wei Xu <xuwei5@hisilicon.com>, Catalin Marinas <catalin.marinas@arm.com>, Will Deacon <will.deacon@arm.com>, Michael Turquette <mturquette@baylibre.com>, Stephen Boyd <sboyd@codeaurora.org>, Mathieu Poirier <mathieu.poirier@linaro.org>, John Stultz <john.stultz@linaro.org>, Guodong Xu <guodong.xu@linaro.org>, Haojian Zhuang <haojian.zhuang@linaro.org>, Greg Kroah-Hartman <gregkh@linuxfoundation.org>, devicetree@vger.kernel.org, linux-kernel@vger.kernel.org, linux-arm-kernel@lists.infradead.org, linux-clk@vger.kernel.org, mike.leach@linaro.org Cc: Leo Yan <leo.yan@linaro.org> Subject: [PATCH v3 3/5] coresight: add support for debug module Date: Fri, 3 Mar 2017 14:00:07 +0800 Message-Id: <1488520809-31670-4-git-send-email-leo.yan@linaro.org> In-Reply-To: <1488520809-31670-1-git-send-email-leo.yan@linaro.org> References: <1488520809-31670-1-git-send-email-leo.yan@linaro.org> Sender: devicetree-owner@vger.kernel.org Precedence: bulk
Series	[v3,1/5] coresight: bindings for debug module \| expand [v3,1/5] coresight: bindings for debug module [v3,2/5] coresight: refactor with function of_coresight_get_cpu [v3,3/5] coresight: add support for debug module [v3,5/5] arm64: dts: hi6220: register debug module

Leo Yan March 3, 2017, 6 a.m. UTC

Coresight includes debug module and usually the module connects with CPU
debug logic. ARMv8 architecture reference manual (ARM DDI 0487A.k) has
description for related info in "Part H: External Debug".

Chapter H7 "The Sample-based Profiling Extension" introduces several
sampling registers, e.g. we can check program counter value with
combined CPU exception level, secure state, etc. So this is helpful for
analysis CPU lockup scenarios, e.g. if one CPU has run into infinite
loop with IRQ disabled. In this case the CPU cannot switch context and
handle any interrupt (including IPIs), as the result it cannot handle
SMP call for stack dump.

This patch is to enable coresight debug module, so firstly this driver
is to bind apb clock for debug module and this is to ensure the debug
module can be accessed from program or external debugger. And the driver
uses sample-based registers for debug purpose, e.g. when system detects
the CPU lockup and trigger panic, the driver will dump program counter
and combined context registers (EDCIDSR, EDVIDSR); by parsing context
registers so can quickly get to know CPU secure state, exception level,
etc.

Some of the debug module registers are located in CPU power domain, so
in the driver it has checked the power state for CPU before accessing
registers within CPU power domain. For most safe way to use this driver,
it's suggested to disable CPU low power states, this can simply set
"nohlt" in kernel command line.

Signed-off-by: Leo Yan <leo.yan@linaro.org>

---
 drivers/hwtracing/coresight/Kconfig           |  10 +
 drivers/hwtracing/coresight/Makefile          |   1 +
 drivers/hwtracing/coresight/coresight-debug.c | 377 ++++++++++++++++++++++++++
 3 files changed, 388 insertions(+)
 create mode 100644 drivers/hwtracing/coresight/coresight-debug.c

-- 
2.7.4

--
To unsubscribe from this list: send the line "unsubscribe devicetree" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Leo Yan March 9, 2017, 5:59 p.m. UTC | #1

Hi Suziku,

Thanks for reviewing, please see some replying.

On Thu, Mar 09, 2017 at 04:53:05PM +0000, Suzuki K Poulose wrote:
> On 03/03/17 06:00, Leo Yan wrote:

> >Coresight includes debug module and usually the module connects with CPU

> >debug logic. ARMv8 architecture reference manual (ARM DDI 0487A.k) has

> >description for related info in "Part H: External Debug".

> >

> >Chapter H7 "The Sample-based Profiling Extension" introduces several

> >sampling registers, e.g. we can check program counter value with

> >combined CPU exception level, secure state, etc. So this is helpful for

> >analysis CPU lockup scenarios, e.g. if one CPU has run into infinite

> >loop with IRQ disabled. In this case the CPU cannot switch context and

> >handle any interrupt (including IPIs), as the result it cannot handle

> >SMP call for stack dump.

> >

> >This patch is to enable coresight debug module, so firstly this driver

> >is to bind apb clock for debug module and this is to ensure the debug

> >module can be accessed from program or external debugger. And the driver

> >uses sample-based registers for debug purpose, e.g. when system detects

> >the CPU lockup and trigger panic, the driver will dump program counter

> >and combined context registers (EDCIDSR, EDVIDSR); by parsing context

> >registers so can quickly get to know CPU secure state, exception level,

> >etc.

> 

> The problem is, it is not guaranteed that the EDPCSR_Hi, EDCIDSR & EDVIDSR are

> updated as a side effect of a memory mapped access (which is what we do here) to the

> EDPCSR_Lo.

> 

> Section H.7.1.2 : Reads of EDPCSRs (in ARM DDI 0487A.k) :

> 

> "The indirect writes to EDCIDSR, EDVIDSR, and EDPCSRhi might not occur for a memory-mapped access

> to the external debug interface. For more information, see Memory-mapped accesses to the external debug

> interface on page H8-4968."

> 

> So we cannot really rely on the values in EDVIDSR which we use to make further decisions. So I

> am wondering if this is really guranteed to be useful.


So this is caused by Software lock is locked?

Section H8.4.1: 

"Reads and writes have no side-effects. A side-effect is where a
direct read or a direct write of a register creates
an indirect write of the same or another register. When the Software
Lock is locked, the indirect write does
not occur."

> >Some of the debug module registers are located in CPU power domain, so

> >in the driver it has checked the power state for CPU before accessing

> >registers within CPU power domain. For most safe way to use this driver,

> >it's suggested to disable CPU low power states, this can simply set

> >"nohlt" in kernel command line.

> >

> >Signed-off-by: Leo Yan <leo.yan@linaro.org>

> >---

> > drivers/hwtracing/coresight/Kconfig           |  10 +

> > drivers/hwtracing/coresight/Makefile          |   1 +

> > drivers/hwtracing/coresight/coresight-debug.c | 377 ++++++++++++++++++++++++++

> > 3 files changed, 388 insertions(+)

> > create mode 100644 drivers/hwtracing/coresight/coresight-debug.c

> >

> >diff --git a/drivers/hwtracing/coresight/Kconfig b/drivers/hwtracing/coresight/Kconfig

> >index 130cb21..3ed651e 100644

> >--- a/drivers/hwtracing/coresight/Kconfig

> >+++ b/drivers/hwtracing/coresight/Kconfig

> >@@ -89,4 +89,14 @@ config CORESIGHT_STM

> > 	  logging useful software events or data coming from various entities

> > 	  in the system, possibly running different OSs

> >

> >+config CORESIGHT_DEBUG

> 

> To make it more specific, may be CORESIGHT_CPU_DEBUG ?


Will fix.

> >+	bool "CoreSight debug driver"

> 

> "Coresight CPU Debug driver"


Will fix.

> >+	depends on ARM || ARM64

> >+	help

> >+	  This driver provides support for coresight debugging module. This

> >+	  is primarily used to dump sample-based profiling registers for

> >+	  panic. To avoid lockups when accessing debug module registers,

> >+	  it is safer to disable CPU low power states (like "nohlt" on the

> >+	  kernel command line) when using this feature.

> >+

> 

> >+#define EDPCSR_THUMB			BIT(0)

> >+#define EDPCSR_ARM_INST_MASK		GENMASK(31, 2)

> >+#define EDPCSR_THUMB_INST_MASK		GENMASK(31, 1)

> 

> We don't need two different masks. {ED/DBG}PCSR has only bit 0 reserved

> for instruction set indication.


I think we need this two different masks. Please review below extra doc
for PC offset analysis in ARM ARM.

> >+#endif

> >+

> >+/* bits definition for EDPRSR */

> >+#define EDPRSR_DLK			BIT(6)

> >+#define EDPRSR_PU			BIT(0)

> >+

> >+

> >+static void debug_read_regs(struct debug_drvdata *drvdata)

> >+{

> >+	drvdata->edprsr = readl_relaxed(drvdata->base + EDPRSR);

> >+

> >+	if (!debug_access_permitted(drvdata))

> >+		return;

> >+

> >+	if (!drvdata->edpcsr_present)

> >+		return;

> >+

> >+	CS_UNLOCK(drvdata->base);

> >+

> >+	debug_os_unlock(drvdata);

> >+

> >+	drvdata->edpcsr = readl_relaxed(drvdata->base + EDPCSR);

> >+

> >+	/*

> >+	 * As described in ARM DDI 0487A.k, if the processing

> >+	 * element (PE) is in debug state, or sample-based

> >+	 * profiling is prohibited, EDPCSR reads as 0xFFFFFFFF;

> >+	 * EDCIDSR, EDVIDSR and EDPCSR_HI registers also become

> >+	 * UNKNOWN state. So directly bail out for this case.

> >+	 */

> >+	if (drvdata->edpcsr == EDPCSR_PROHIBITED) {

> >+		CS_LOCK(drvdata->base);

> >+		return;

> >+	}

> >+

> >+	/*

> >+	 * A read of the EDPCSR normally has the side-effect of

> >+	 * indirectly writing to EDCIDSR, EDVIDSR and EDPCSR_HI;

> >+	 * at this point it's safe to read value from them.

> >+	 */

> 

> See my comment above about the side effects of memory mapped access.


Yeah.

> >+	drvdata->edcidsr = readl_relaxed(drvdata->base + EDCIDSR);

> >+#ifdef CONFIG_64BIT

> >+	drvdata->edpcsr_hi = readl_relaxed(drvdata->base + EDPCSR_HI);

> >+#endif

> 

> >+

> >+	if (drvdata->edvidsr_present)

> >+		drvdata->edvidsr = readl_relaxed(drvdata->base + EDVIDSR);

> >+

> >+	CS_LOCK(drvdata->base);

> >+}

> >+

> 

> >+#ifndef CONFIG_64BIT

> 

> I guess this doesn't help for an ARMv8 32bit only core (e.g, Cortex-A32). And

> unfortunately, there are conflicting definitions for the values for PCSROffset w.r.t

> ARMv8 and ARMv7.

> 

> DBGDEVID1[3:0] For ARMv7 :

> 

> 0000 - Sample offset applies based on the instruction state.

> 0001 - No offset applies.

> 

> EDDEVID1[3:0] For ARMv8 :

> 0000 - EDPCSR not implemented

> 0010 - EDPCSR implemented without offsets, but do not use in AArch32 state!

> 

> So there is no easy way to make sense of the value, unless you know which version

> of the architecture is in use. Or may be we could co-relate it with the value from

> DEVID.

> 

> i.e, EDPCSR is not implemented do not register this device, see comments on debug_probe().

> ( And we should also include the following test for 32bit code to see if edpcsr is implemented.

> See comments on debug_init_arch_data() )

> 

> 

> That way, we could use the following inference from the PCSROffset value :

> 

> 0000 - Sample offset applies based on the instruction state (indicated by PCSR[0])

> 0001 - No offset applies.

> 0010 - No offset applies, but do not use in AArch32 mode


Just now I went through ARM ARM and ARMv8 ARM, this makes sense to me.
Thanks for good pointing for this.

> >+static bool debug_pc_has_offset(struct debug_drvdata *drvdata)

> >+{

> >+	u32 pcsr_offset;

> >+

> >+	pcsr_offset = drvdata->eddevid1 & EDDEVID1_PCSR_OFFSET_MASK;

> >+

> >+	return (pcsr_offset == EDDEVID1_PCSR_OFFSET_INS_SET);

> >+}

> >+

> >+static unsigned long debug_adjust_pc(struct debug_drvdata *drvdata,

> >+				     unsigned long pc)

> >+{

> >+	unsigned long arm_inst_offset = 0, thumb_inst_offset = 0;

> >+

> >+	if (debug_pc_has_offset(drvdata)) {

> >+		arm_inst_offset = 8;

> >+		thumb_inst_offset = 4;

> >+	}

> >+

> >+	/* Handle thumb instruction */

> >+	if (pc & EDPCSR_THUMB) {

> >+		pc = (pc & EDPCSR_THUMB_INST_MASK) - thumb_inst_offset;

> >+		return pc;

> >+	}

> >+

> >+	/*

> >+	 * Handle arm instruction offset, if the arm instruction

> >+	 * is not 4 byte alignment then it's possible the case

> >+	 * for implementation defined; keep original value for this

> >+	 * case and print info for notice.

> >+	 */

> >+	if (pc & BIT(1))

> >+		pr_emerg("Instruction offset is implementation defined\n");

> 

> I am struggling to find the any mention about this in the ARM ARM. Please could

> you point me to it.


Sure, please see ARM DDI 0406C.b, chapter C11.11.34 "
DBGPCSR, Program Counter Sampling Register":

A profiling tool can use the value of the T bit to calculate the
instruction address as follows:

When an offset is applied to the sampled address
- if T is 0 and DBGPCSR[1] is 0, ((DBGPCSR[31:2] << 2) - 8) is the
address of the sampled ARM instruction
- if T is 0 and DBGPCSR[1] is 1, the instruction address is
IMPLEMENTATION DEFINED
- if T is 1, ((DBGPCSR[31:1] << 1) - 4) is the address of the sampled
Thumb or ThumbEE instruction.

When no offset is applied to the sampled address
-  if T is 0 and DBGPCSR[1] is 0, (DBGPCSR[31:2] << 2) is the address
of the sampled ARM instruction
-  if T is 0 and DBGPCSR[1] is 1, the instruction address is
IMPLEMENTATION DEFINED
- if T is 1, (DBGPCSR[31:1] << 1) is the address of the sampled Thumb
or ThumbEE instruction.

> >+static void debug_init_arch_data(void *info)

> >+{

> >+	struct debug_drvdata *drvdata = info;

> >+	u32 mode;

> >+

> >+	CS_UNLOCK(drvdata->base);

> >+

> >+	debug_os_unlock(drvdata);

> >+

> >+	/* Read device info */

> >+	drvdata->eddevid  = readl_relaxed(drvdata->base + EDDEVID);

> >+	drvdata->eddevid1 = readl_relaxed(drvdata->base + EDDEVID1);

> >+

> >+	/* Parse implementation feature */

> >+	mode = drvdata->eddevid & EDDEVID_PCSAMPLE_MODE;

> >+	if (mode == EDDEVID_IMPL_FULL) {

> >+		drvdata->edpcsr_present  = true;

> >+		drvdata->edvidsr_present = true;

> >+	} else if (mode == EDDEVID_IMPL_EDPCSR_EDCIDSR) {

> >+		drvdata->edpcsr_present  = true;

> >+		drvdata->edvidsr_present = false;

> 

> As discussed above, we need to consult the DEVID1:PCSROffset for AArch32 to decide

> if we have the edpcsr implemented on ARMv8.


Yeah.

> >+	} else {

> >+		drvdata->edpcsr_present  = false;

> >+		drvdata->edvidsr_present = false;

> >+	}

> >+

> >+	CS_LOCK(drvdata->base);

> >+}

> >+

> >+static int debug_probe(struct amba_device *adev, const struct amba_id *id)

> >+{

> >+	void __iomem *base;

> >+	struct device *dev = &adev->dev;

> >+	struct debug_drvdata *drvdata;

> >+	struct resource *res = &adev->res;

> >+	struct device_node *np = adev->dev.of_node;

> >+	char buf[32];

> >+	static int debug_count;

> >+

> >+	drvdata = devm_kzalloc(dev, sizeof(*drvdata), GFP_KERNEL);

> >+	if (!drvdata)

> >+		return -ENOMEM;

> >+

> >+	drvdata->cpu = np ? of_coresight_get_cpu(np) : 0;

> >+	drvdata->dev = &adev->dev;

> >+

> >+	dev_set_drvdata(dev, drvdata);

> >+

> >+	/* Validity for the resource is already checked by the AMBA core */

> >+	base = devm_ioremap_resource(dev, res);

> >+	if (IS_ERR(base))

> >+		return PTR_ERR(base);

> >+

> >+	drvdata->base = base;

> >+

> >+	get_online_cpus();

> >+	per_cpu(debug_drvdata, drvdata->cpu) = drvdata;

> >+

> >+	if (smp_call_function_single(drvdata->cpu,

> >+				debug_init_arch_data, drvdata, 1))

> >+		dev_err(dev, "Debug arch init failed\n");

> 

> If this fails (say the CPU was offline), should we still return success ?

> And may be we should check if the drvdata->edpcsr_present to detect if the CPU

> implements the PC Sampling and return failure here if it doesn't.


Will fix.

> >+

> >+	put_online_cpus();

> >+

> >+	if (!debug_count++)

> >+		atomic_notifier_chain_register(&panic_notifier_list,

> >+					       &debug_notifier);

> >+

> 

> >+	sprintf(buf, (char *)id->data, drvdata->cpu);

> >+	dev_info(dev, "%s initialized\n", buf);

> 

> This could simply be :

> 	dev_info(dev, "Coresight debug-CPU%d initialized\n", drvdata->cpu);

> 

> and get rid of the static string and the buffer, see below.

> 

> >+	return 0;

> >+}

> >+

> >+static struct amba_id debug_ids[] = {

> >+	{       /* Debug for Cortex-A53 */

> >+		.id	= 0x000bbd03,

> >+		.mask	= 0x000fffff,

> 

> ...

> 

> >+		.data   = "Coresight debug-CPU%d",

> 

> I think this is pointless, as the debug area we are interested in is always associated

> with a CPU, we could as well figure out what to print from the drvdata->cpu above.


I prefer to follow your suggestion for upper two comments; but I'd like
check with Mathieu, due I followed up Mathieu's suggestion to write
current code.

Thanks,
Leo Yan
--
To unsubscribe from this list: send the line "unsubscribe devicetree" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Leo Yan March 13, 2017, 8:12 a.m. UTC | #2

Hi Suzuki,

On Fri, Mar 10, 2017 at 02:29:53PM +0000, Suzuki K Poulose wrote:

[...]

> >>So we cannot really rely on the values in EDVIDSR which we use to make further decisions. So I

> >>am wondering if this is really guranteed to be useful.

> >

> >So this is caused by Software lock is locked?

> >

> >Section H8.4.1:

> >

> >"Reads and writes have no side-effects. A side-effect is where a

> >direct read or a direct write of a register creates

> >an indirect write of the same or another register. When the Software

> >Lock is locked, the indirect write does

> >not occur."

> 

> Yes, thats correct, further :

> 

> Section H9.2.32: EDPCSR

> 

> "For a read of EDPCSRlo from the memory-mapped interface, if EDLSR.SLK == 1, meaning

> the Software Lock is locked, then the access has no side-effects. That is, EDCIDSR,

> EDVIDSR, and EDPCSRhi are unchanged."

> 

> And since we do a CS_UNLOCK, that should be fine. Please ignore my comment.


Thanks a lot for confirmation.

[...]

> >>>+

> >>>+	put_online_cpus();

> >>>+

> >>>+	if (!debug_count++)

> >>>+		atomic_notifier_chain_register(&panic_notifier_list,

> >>>+					       &debug_notifier);

> >>>+

> >>

> >>>+	sprintf(buf, (char *)id->data, drvdata->cpu);

> >>>+	dev_info(dev, "%s initialized\n", buf);

> >>

> >>This could simply be :

> >>	dev_info(dev, "Coresight debug-CPU%d initialized\n", drvdata->cpu);

> >>

> >>and get rid of the static string and the buffer, see below.

> 

> Also we need pm_runtime_put() here to balance the pm_runtime_get_ from AMBA

> device probe. More on that below.


[...]

> Btw, I don't see any PM calls to make sure the power domain (at least the debug domain)

> is up, which could cause problems with accesses to some of these registers (leave alone the

> ones in CPU power domain), especially the EDPRSR. We could also do pm_runtime_get on the

> CPU's power domain, if the CPU is online, before we access the pcsr.


I will add pm_runtime_get/pm_runtime_put for apb clock.

But for CPU power domain, AFAIK this part is managed by PSCI but is not
controlled by pm_runtime_{put|get} pairs. So at beginning, we suggest
to use "nohlt" to ensure CPU power domain is enabled.

Please let me know if I miss some thing for this?

Thanks,
Leo Yan
--
To unsubscribe from this list: send the line "unsubscribe devicetree" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Mathieu Poirier March 13, 2017, 4:29 p.m. UTC | #3

On Fri, Mar 10, 2017 at 01:59:15AM +0800, Leo Yan wrote:
> Hi Suziku,

> 

> Thanks for reviewing, please see some replying.

> 

> On Thu, Mar 09, 2017 at 04:53:05PM +0000, Suzuki K Poulose wrote:

> > On 03/03/17 06:00, Leo Yan wrote:

> > >Coresight includes debug module and usually the module connects with CPU

> > >debug logic. ARMv8 architecture reference manual (ARM DDI 0487A.k) has

> > >description for related info in "Part H: External Debug".

> > >

> > >Chapter H7 "The Sample-based Profiling Extension" introduces several

> > >sampling registers, e.g. we can check program counter value with

> > >combined CPU exception level, secure state, etc. So this is helpful for

> > >analysis CPU lockup scenarios, e.g. if one CPU has run into infinite

> > >loop with IRQ disabled. In this case the CPU cannot switch context and

> > >handle any interrupt (including IPIs), as the result it cannot handle

> > >SMP call for stack dump.

> > >

> > >This patch is to enable coresight debug module, so firstly this driver

> > >is to bind apb clock for debug module and this is to ensure the debug

> > >module can be accessed from program or external debugger. And the driver

> > >uses sample-based registers for debug purpose, e.g. when system detects

> > >the CPU lockup and trigger panic, the driver will dump program counter

> > >and combined context registers (EDCIDSR, EDVIDSR); by parsing context

> > >registers so can quickly get to know CPU secure state, exception level,

> > >etc.

> > 

> > The problem is, it is not guaranteed that the EDPCSR_Hi, EDCIDSR & EDVIDSR are

> > updated as a side effect of a memory mapped access (which is what we do here) to the

> > EDPCSR_Lo.

> > 

> > Section H.7.1.2 : Reads of EDPCSRs (in ARM DDI 0487A.k) :

> > 

> > "The indirect writes to EDCIDSR, EDVIDSR, and EDPCSRhi might not occur for a memory-mapped access

> > to the external debug interface. For more information, see Memory-mapped accesses to the external debug

> > interface on page H8-4968."

> > 

> > So we cannot really rely on the values in EDVIDSR which we use to make further decisions. So I

> > am wondering if this is really guranteed to be useful.

> 

> So this is caused by Software lock is locked?

> 

> Section H8.4.1: 

> 

> "Reads and writes have no side-effects. A side-effect is where a

> direct read or a direct write of a register creates

> an indirect write of the same or another register. When the Software

> Lock is locked, the indirect write does

> not occur."

> 

> > >Some of the debug module registers are located in CPU power domain, so

> > >in the driver it has checked the power state for CPU before accessing

> > >registers within CPU power domain. For most safe way to use this driver,

> > >it's suggested to disable CPU low power states, this can simply set

> > >"nohlt" in kernel command line.

> > >

> > >Signed-off-by: Leo Yan <leo.yan@linaro.org>

> > >---

> > > drivers/hwtracing/coresight/Kconfig           |  10 +

> > > drivers/hwtracing/coresight/Makefile          |   1 +

> > > drivers/hwtracing/coresight/coresight-debug.c | 377 ++++++++++++++++++++++++++

> > > 3 files changed, 388 insertions(+)

> > > create mode 100644 drivers/hwtracing/coresight/coresight-debug.c

> > >

> > >diff --git a/drivers/hwtracing/coresight/Kconfig b/drivers/hwtracing/coresight/Kconfig

> > >index 130cb21..3ed651e 100644

> > >--- a/drivers/hwtracing/coresight/Kconfig

> > >+++ b/drivers/hwtracing/coresight/Kconfig

> > >@@ -89,4 +89,14 @@ config CORESIGHT_STM

> > > 	  logging useful software events or data coming from various entities

> > > 	  in the system, possibly running different OSs

> > >

> > >+config CORESIGHT_DEBUG

> > 

> > To make it more specific, may be CORESIGHT_CPU_DEBUG ?

> 

> Will fix.

> 

> > >+	bool "CoreSight debug driver"

> > 

> > "Coresight CPU Debug driver"

> 

> Will fix.

> 

> > >+	depends on ARM || ARM64

> > >+	help

> > >+	  This driver provides support for coresight debugging module. This

> > >+	  is primarily used to dump sample-based profiling registers for

> > >+	  panic. To avoid lockups when accessing debug module registers,

> > >+	  it is safer to disable CPU low power states (like "nohlt" on the

> > >+	  kernel command line) when using this feature.

> > >+

> > 

> > >+#define EDPCSR_THUMB			BIT(0)

> > >+#define EDPCSR_ARM_INST_MASK		GENMASK(31, 2)

> > >+#define EDPCSR_THUMB_INST_MASK		GENMASK(31, 1)

> > 

> > We don't need two different masks. {ED/DBG}PCSR has only bit 0 reserved

> > for instruction set indication.

> 

> I think we need this two different masks. Please review below extra doc

> for PC offset analysis in ARM ARM.

> 

> > >+#endif

> > >+

> > >+/* bits definition for EDPRSR */

> > >+#define EDPRSR_DLK			BIT(6)

> > >+#define EDPRSR_PU			BIT(0)

> > >+

> > >+

> > >+static void debug_read_regs(struct debug_drvdata *drvdata)

> > >+{

> > >+	drvdata->edprsr = readl_relaxed(drvdata->base + EDPRSR);

> > >+

> > >+	if (!debug_access_permitted(drvdata))

> > >+		return;

> > >+

> > >+	if (!drvdata->edpcsr_present)

> > >+		return;

> > >+

> > >+	CS_UNLOCK(drvdata->base);

> > >+

> > >+	debug_os_unlock(drvdata);

> > >+

> > >+	drvdata->edpcsr = readl_relaxed(drvdata->base + EDPCSR);

> > >+

> > >+	/*

> > >+	 * As described in ARM DDI 0487A.k, if the processing

> > >+	 * element (PE) is in debug state, or sample-based

> > >+	 * profiling is prohibited, EDPCSR reads as 0xFFFFFFFF;

> > >+	 * EDCIDSR, EDVIDSR and EDPCSR_HI registers also become

> > >+	 * UNKNOWN state. So directly bail out for this case.

> > >+	 */

> > >+	if (drvdata->edpcsr == EDPCSR_PROHIBITED) {

> > >+		CS_LOCK(drvdata->base);

> > >+		return;

> > >+	}

> > >+

> > >+	/*

> > >+	 * A read of the EDPCSR normally has the side-effect of

> > >+	 * indirectly writing to EDCIDSR, EDVIDSR and EDPCSR_HI;

> > >+	 * at this point it's safe to read value from them.

> > >+	 */

> > 

> > See my comment above about the side effects of memory mapped access.

> 

> Yeah.

> 

> > >+	drvdata->edcidsr = readl_relaxed(drvdata->base + EDCIDSR);

> > >+#ifdef CONFIG_64BIT

> > >+	drvdata->edpcsr_hi = readl_relaxed(drvdata->base + EDPCSR_HI);

> > >+#endif

> > 

> > >+

> > >+	if (drvdata->edvidsr_present)

> > >+		drvdata->edvidsr = readl_relaxed(drvdata->base + EDVIDSR);

> > >+

> > >+	CS_LOCK(drvdata->base);

> > >+}

> > >+

> > 

> > >+#ifndef CONFIG_64BIT

> > 

> > I guess this doesn't help for an ARMv8 32bit only core (e.g, Cortex-A32). And

> > unfortunately, there are conflicting definitions for the values for PCSROffset w.r.t

> > ARMv8 and ARMv7.

> > 

> > DBGDEVID1[3:0] For ARMv7 :

> > 

> > 0000 - Sample offset applies based on the instruction state.

> > 0001 - No offset applies.

> > 

> > EDDEVID1[3:0] For ARMv8 :

> > 0000 - EDPCSR not implemented

> > 0010 - EDPCSR implemented without offsets, but do not use in AArch32 state!

> > 

> > So there is no easy way to make sense of the value, unless you know which version

> > of the architecture is in use. Or may be we could co-relate it with the value from

> > DEVID.

> > 

> > i.e, EDPCSR is not implemented do not register this device, see comments on debug_probe().

> > ( And we should also include the following test for 32bit code to see if edpcsr is implemented.

> > See comments on debug_init_arch_data() )

> > 

> > 

> > That way, we could use the following inference from the PCSROffset value :

> > 

> > 0000 - Sample offset applies based on the instruction state (indicated by PCSR[0])

> > 0001 - No offset applies.

> > 0010 - No offset applies, but do not use in AArch32 mode

> 

> Just now I went through ARM ARM and ARMv8 ARM, this makes sense to me.

> Thanks for good pointing for this.

> 

> > >+static bool debug_pc_has_offset(struct debug_drvdata *drvdata)

> > >+{

> > >+	u32 pcsr_offset;

> > >+

> > >+	pcsr_offset = drvdata->eddevid1 & EDDEVID1_PCSR_OFFSET_MASK;

> > >+

> > >+	return (pcsr_offset == EDDEVID1_PCSR_OFFSET_INS_SET);

> > >+}

> > >+

> > >+static unsigned long debug_adjust_pc(struct debug_drvdata *drvdata,

> > >+				     unsigned long pc)

> > >+{

> > >+	unsigned long arm_inst_offset = 0, thumb_inst_offset = 0;

> > >+

> > >+	if (debug_pc_has_offset(drvdata)) {

> > >+		arm_inst_offset = 8;

> > >+		thumb_inst_offset = 4;

> > >+	}

> > >+

> > >+	/* Handle thumb instruction */

> > >+	if (pc & EDPCSR_THUMB) {

> > >+		pc = (pc & EDPCSR_THUMB_INST_MASK) - thumb_inst_offset;

> > >+		return pc;

> > >+	}

> > >+

> > >+	/*

> > >+	 * Handle arm instruction offset, if the arm instruction

> > >+	 * is not 4 byte alignment then it's possible the case

> > >+	 * for implementation defined; keep original value for this

> > >+	 * case and print info for notice.

> > >+	 */

> > >+	if (pc & BIT(1))

> > >+		pr_emerg("Instruction offset is implementation defined\n");

> > 

> > I am struggling to find the any mention about this in the ARM ARM. Please could

> > you point me to it.

> 

> Sure, please see ARM DDI 0406C.b, chapter C11.11.34 "

> DBGPCSR, Program Counter Sampling Register":

> 

> A profiling tool can use the value of the T bit to calculate the

> instruction address as follows:

> 

> When an offset is applied to the sampled address

> - if T is 0 and DBGPCSR[1] is 0, ((DBGPCSR[31:2] << 2) - 8) is the

> address of the sampled ARM instruction

> - if T is 0 and DBGPCSR[1] is 1, the instruction address is

> IMPLEMENTATION DEFINED

> - if T is 1, ((DBGPCSR[31:1] << 1) - 4) is the address of the sampled

> Thumb or ThumbEE instruction.

> 

> When no offset is applied to the sampled address

> -  if T is 0 and DBGPCSR[1] is 0, (DBGPCSR[31:2] << 2) is the address

> of the sampled ARM instruction

> -  if T is 0 and DBGPCSR[1] is 1, the instruction address is

> IMPLEMENTATION DEFINED

> - if T is 1, (DBGPCSR[31:1] << 1) is the address of the sampled Thumb

> or ThumbEE instruction.

> 

> > >+static void debug_init_arch_data(void *info)

> > >+{

> > >+	struct debug_drvdata *drvdata = info;

> > >+	u32 mode;

> > >+

> > >+	CS_UNLOCK(drvdata->base);

> > >+

> > >+	debug_os_unlock(drvdata);

> > >+

> > >+	/* Read device info */

> > >+	drvdata->eddevid  = readl_relaxed(drvdata->base + EDDEVID);

> > >+	drvdata->eddevid1 = readl_relaxed(drvdata->base + EDDEVID1);

> > >+

> > >+	/* Parse implementation feature */

> > >+	mode = drvdata->eddevid & EDDEVID_PCSAMPLE_MODE;

> > >+	if (mode == EDDEVID_IMPL_FULL) {

> > >+		drvdata->edpcsr_present  = true;

> > >+		drvdata->edvidsr_present = true;

> > >+	} else if (mode == EDDEVID_IMPL_EDPCSR_EDCIDSR) {

> > >+		drvdata->edpcsr_present  = true;

> > >+		drvdata->edvidsr_present = false;

> > 

> > As discussed above, we need to consult the DEVID1:PCSROffset for AArch32 to decide

> > if we have the edpcsr implemented on ARMv8.

> 

> Yeah.

> 

> > >+	} else {

> > >+		drvdata->edpcsr_present  = false;

> > >+		drvdata->edvidsr_present = false;

> > >+	}

> > >+

> > >+	CS_LOCK(drvdata->base);

> > >+}

> > >+

> > >+static int debug_probe(struct amba_device *adev, const struct amba_id *id)

> > >+{

> > >+	void __iomem *base;

> > >+	struct device *dev = &adev->dev;

> > >+	struct debug_drvdata *drvdata;

> > >+	struct resource *res = &adev->res;

> > >+	struct device_node *np = adev->dev.of_node;

> > >+	char buf[32];

> > >+	static int debug_count;

> > >+

> > >+	drvdata = devm_kzalloc(dev, sizeof(*drvdata), GFP_KERNEL);

> > >+	if (!drvdata)

> > >+		return -ENOMEM;

> > >+

> > >+	drvdata->cpu = np ? of_coresight_get_cpu(np) : 0;

> > >+	drvdata->dev = &adev->dev;

> > >+

> > >+	dev_set_drvdata(dev, drvdata);

> > >+

> > >+	/* Validity for the resource is already checked by the AMBA core */

> > >+	base = devm_ioremap_resource(dev, res);

> > >+	if (IS_ERR(base))

> > >+		return PTR_ERR(base);

> > >+

> > >+	drvdata->base = base;

> > >+

> > >+	get_online_cpus();

> > >+	per_cpu(debug_drvdata, drvdata->cpu) = drvdata;

> > >+

> > >+	if (smp_call_function_single(drvdata->cpu,

> > >+				debug_init_arch_data, drvdata, 1))

> > >+		dev_err(dev, "Debug arch init failed\n");

> > 

> > If this fails (say the CPU was offline), should we still return success ?

> > And may be we should check if the drvdata->edpcsr_present to detect if the CPU

> > implements the PC Sampling and return failure here if it doesn't.

> 

> Will fix.

> 

> > >+

> > >+	put_online_cpus();

> > >+

> > >+	if (!debug_count++)

> > >+		atomic_notifier_chain_register(&panic_notifier_list,

> > >+					       &debug_notifier);

> > >+

> > 

> > >+	sprintf(buf, (char *)id->data, drvdata->cpu);

> > >+	dev_info(dev, "%s initialized\n", buf);

> > 

> > This could simply be :

> > 	dev_info(dev, "Coresight debug-CPU%d initialized\n", drvdata->cpu);

> > 

> > and get rid of the static string and the buffer, see below.

> > 

> > >+	return 0;

> > >+}

> > >+

> > >+static struct amba_id debug_ids[] = {

> > >+	{       /* Debug for Cortex-A53 */

> > >+		.id	= 0x000bbd03,

> > >+		.mask	= 0x000fffff,

> > 

> > ...

> > 

> > >+		.data   = "Coresight debug-CPU%d",

> > 

> > I think this is pointless, as the debug area we are interested in is always associated

> > with a CPU, we could as well figure out what to print from the drvdata->cpu above.

> 

> I prefer to follow your suggestion for upper two comments; but I'd like

> check with Mathieu, due I followed up Mathieu's suggestion to write

> current code.


The end result is the same - I'm good either way.

Thanks,
Mathieu

> 

> Thanks,

> Leo Yan

--
To unsubscribe from this list: send the line "unsubscribe devicetree" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Mathieu Poirier March 13, 2017, 4:56 p.m. UTC | #4

On Fri, Mar 10, 2017 at 02:29:53PM +0000, Suzuki K Poulose wrote:
> On 09/03/17 17:59, Leo Yan wrote:

> >Hi Suziku,

> >>The problem is, it is not guaranteed that the EDPCSR_Hi, EDCIDSR & EDVIDSR are

> >>updated as a side effect of a memory mapped access (which is what we do here) to the

> >>EDPCSR_Lo.

> >>

> >>Section H.7.1.2 : Reads of EDPCSRs (in ARM DDI 0487A.k) :

> >>

> >>"The indirect writes to EDCIDSR, EDVIDSR, and EDPCSRhi might not occur for a memory-mapped access

> >>to the external debug interface. For more information, see Memory-mapped accesses to the external debug

> >>interface on page H8-4968."

> >>

> >>So we cannot really rely on the values in EDVIDSR which we use to make further decisions. So I

> >>am wondering if this is really guranteed to be useful.

> >

> >So this is caused by Software lock is locked?

> >

> >Section H8.4.1:

> >

> >"Reads and writes have no side-effects. A side-effect is where a

> >direct read or a direct write of a register creates

> >an indirect write of the same or another register. When the Software

> >Lock is locked, the indirect write does

> >not occur."

> 

> Yes, thats correct, further :

> 

> Section H9.2.32: EDPCSR

> 

> "For a read of EDPCSRlo from the memory-mapped interface, if EDLSR.SLK == 1, meaning

> the Software Lock is locked, then the access has no side-effects. That is, EDCIDSR,

> EDVIDSR, and EDPCSRhi are unchanged."

> 

> And since we do a CS_UNLOCK, that should be fine. Please ignore my comment.

> 

> >>>diff --git a/drivers/hwtracing/coresight/Kconfig b/drivers/hwtracing/coresight/Kconfig

> >>>index 130cb21..3ed651e 100644

> >>>--- a/drivers/hwtracing/coresight/Kconfig

> >>>+++ b/drivers/hwtracing/coresight/Kconfig

> >>>@@ -89,4 +89,14 @@ config CORESIGHT_STM

> >>>	  logging useful software events or data coming from various entities

> >>>	  in the system, possibly running different OSs

> >>>

> >>>+config CORESIGHT_DEBUG

> >>

> >>To make it more specific, may be CORESIGHT_CPU_DEBUG ?

> >

> >Will fix.

> >

> >>>+	bool "CoreSight debug driver"

> >>

> >>"Coresight CPU Debug driver"

> >

> >Will fix.

> >

> >>>+	depends on ARM || ARM64

> >>>+	help

> >>>+	  This driver provides support for coresight debugging module. This

> >>>+	  is primarily used to dump sample-based profiling registers for

> >>>+	  panic. To avoid lockups when accessing debug module registers,

> >>>+	  it is safer to disable CPU low power states (like "nohlt" on the

> >>>+	  kernel command line) when using this feature.

> >>>+

> >>

> >>>+#define EDPCSR_THUMB			BIT(0)

> >>>+#define EDPCSR_ARM_INST_MASK		GENMASK(31, 2)

> >>>+#define EDPCSR_THUMB_INST_MASK		GENMASK(31, 1)

> >>

> >>We don't need two different masks. {ED/DBG}PCSR has only bit 0 reserved

> >>for instruction set indication.

> >

> >I think we need this two different masks. Please review below extra doc

> >for PC offset analysis in ARM ARM.

> 

> You're correct. Thanks for the pointer. I got confused, as there was no bit

> dedicated in the DBGPCSR bit assignment figure.

> 

> >>>+	/*

> >>>+	 * Handle arm instruction offset, if the arm instruction

> >>>+	 * is not 4 byte alignment then it's possible the case

> >>>+	 * for implementation defined; keep original value for this

> >>>+	 * case and print info for notice.

> >>>+	 */

> >>>+	if (pc & BIT(1))

> >>>+		pr_emerg("Instruction offset is implementation defined\n");

> >>

> >>I am struggling to find the any mention about this in the ARM ARM. Please could

> >>you point me to it.

> >

> >Sure, please see ARM DDI 0406C.b, chapter C11.11.34 "

> >DBGPCSR, Program Counter Sampling Register":

> >

> >A profiling tool can use the value of the T bit to calculate the

> >instruction address as follows:

> >

> >When an offset is applied to the sampled address

> >- if T is 0 and DBGPCSR[1] is 0, ((DBGPCSR[31:2] << 2) - 8) is the

> >address of the sampled ARM instruction

> >- if T is 0 and DBGPCSR[1] is 1, the instruction address is

> >IMPLEMENTATION DEFINED

> >- if T is 1, ((DBGPCSR[31:1] << 1) - 4) is the address of the sampled

> >Thumb or ThumbEE instruction.

> >

> >When no offset is applied to the sampled address

> >-  if T is 0 and DBGPCSR[1] is 0, (DBGPCSR[31:2] << 2) is the address

> >of the sampled ARM instruction

> >-  if T is 0 and DBGPCSR[1] is 1, the instruction address is

> >IMPLEMENTATION DEFINED

> >- if T is 1, (DBGPCSR[31:1] << 1) is the address of the sampled Thumb

> >or ThumbEE instruction.

> >

> 

> Ok.

> 

> 

> >>>+

> >>>+	put_online_cpus();

> >>>+

> >>>+	if (!debug_count++)

> >>>+		atomic_notifier_chain_register(&panic_notifier_list,

> >>>+					       &debug_notifier);

> >>>+

> >>

> >>>+	sprintf(buf, (char *)id->data, drvdata->cpu);

> >>>+	dev_info(dev, "%s initialized\n", buf);

> >>

> >>This could simply be :

> >>	dev_info(dev, "Coresight debug-CPU%d initialized\n", drvdata->cpu);

> >>

> >>and get rid of the static string and the buffer, see below.

> 

> Also we need pm_runtime_put() here to balance the pm_runtime_get_ from AMBA

> device probe.


Good point.

> More on that below.

> 

> >>

> >>>+	return 0;

> >>>+}

> >>>+

> >>>+static struct amba_id debug_ids[] = {

> >>>+	{       /* Debug for Cortex-A53 */

> >>>+		.id	= 0x000bbd03,

> >>>+		.mask	= 0x000fffff,

> >>

> >>...

> >>

> >>>+		.data   = "Coresight debug-CPU%d",

> >>

> >>I think this is pointless, as the debug area we are interested in is always associated

> >>with a CPU, we could as well figure out what to print from the drvdata->cpu above.

> >

> >I prefer to follow your suggestion for upper two comments; but I'd like

> >check with Mathieu, due I followed up Mathieu's suggestion to write

> >current code.

> 

> Btw, I don't see any PM calls to make sure the power domain (at least the debug domain)

> is up, which could cause problems with accesses to some of these registers (leave alone the

> ones in CPU power domain), especially the EDPRSR. We could also do pm_runtime_get on the

> CPU's power domain, if the CPU is online, before we access the pcsr.


I thought about PM runtime operations a little while back but wondered if it is
really a good thing to have them around.  When this code is called the system
has crashed and as such making PM runtimes call isn't a good idea.

One thing we could do is _not_ call pm_runtime_put() at the end of the probe()
operation.  That way we wouldn't have to mess around with PM runtime operations
on an unstable system.  This, of course, is costly in terms of power consumption
but the system is under test/debug anyway.

Thoughts?

> 

> Suzuki

> 

> 

> 

> 

> 

--
To unsubscribe from this list: send the line "unsubscribe devicetree" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Mathieu Poirier March 15, 2017, 8:41 p.m. UTC | #5

On 15 March 2017 at 10:44, Suzuki K Poulose <Suzuki.Poulose@arm.com> wrote:
> On 13/03/17 16:56, Mathieu Poirier wrote:

>>

>> On Fri, Mar 10, 2017 at 02:29:53PM +0000, Suzuki K Poulose wrote:

>>>>>>

>>>>>> +

>>>>>> +       put_online_cpus();

>>>>>> +

>>>>>> +       if (!debug_count++)

>>>>>> +               atomic_notifier_chain_register(&panic_notifier_list,

>>>>>> +                                              &debug_notifier);

>>>>>> +

>>>>>

>>>>>

>>>>>> +       sprintf(buf, (char *)id->data, drvdata->cpu);

>>>>>> +       dev_info(dev, "%s initialized\n", buf);

>>>>>

>>>>>

>>>>> This could simply be :

>>>>>         dev_info(dev, "Coresight debug-CPU%d initialized\n",

>>>>> drvdata->cpu);

>>>>>

>>>>> and get rid of the static string and the buffer, see below.

>>>

>>>

>>> Also we need pm_runtime_put() here to balance the pm_runtime_get_ from

>>> AMBA

>>> device probe.

>>

>>

>> Good point.

>>

>>> More on that below.

>>>

>>>>>

>>>>>> +       return 0;

>>>>>> +}

>>>>>> +

>>>>>> +static struct amba_id debug_ids[] = {

>>>>>> +       {       /* Debug for Cortex-A53 */

>>>>>> +               .id     = 0x000bbd03,

>>>>>> +               .mask   = 0x000fffff,

>>>>>

>>>>>

>>>>> ...

>>>>>

>>>>>> +               .data   = "Coresight debug-CPU%d",

>>>>>

>>>>>

>>>>> I think this is pointless, as the debug area we are interested in is

>>>>> always associated

>>>>> with a CPU, we could as well figure out what to print from the

>>>>> drvdata->cpu above.

>>>>

>>>>

>>>> I prefer to follow your suggestion for upper two comments; but I'd like

>>>> check with Mathieu, due I followed up Mathieu's suggestion to write

>>>> current code.

>>>

>>>

>>> Btw, I don't see any PM calls to make sure the power domain (at least the

>>> debug domain)

>>> is up, which could cause problems with accesses to some of these

>>> registers (leave alone the

>>> ones in CPU power domain), especially the EDPRSR. We could also do

>>> pm_runtime_get on the

>>> CPU's power domain, if the CPU is online, before we access the pcsr.

>>

>>

>> I thought about PM runtime operations a little while back but wondered if

>> it is

>> really a good thing to have them around.  When this code is called the

>> system

>> has crashed and as such making PM runtimes call isn't a good idea.

>

>

> You are right. It is not safe to make such calls when we have crashed.

> The other side effect is, if we don't have the debug power domain up,

> we could possibly hang the system and prevent other registered notifiers

> from running, which doesn't sound good either.

>

>>

>> One thing we could do is _not_ call pm_runtime_put() at the end of the

>> probe()

>> operation.  That way we wouldn't have to mess around with PM runtime

>> operations

>> on an unstable system.  This, of course, is costly in terms of power

>> consumption

>> but the system is under test/debug anyway.

>

>

> May be control the behavior via kernel command line ? Something like

> coresight_debug={on or 1} or

> even use the "nohlt" ?


We need to deal with the debug and CPU power domains.

For the former I suggest we do what coresight does and use the
"power-domains" binding[1].  For the CPU power domain we can re-use
the "nohlt" flag.  In the probe function if the "nohlt" cmd line flag
is not set the code bails out.  If it is set pm_runtime_put() is _not_
called and the driver can be used without worries of hanging the
system when the panic handler is invoked.

Am I forgetting something?

[1]. http://lxr.free-electrons.com/source/arch/arm64/boot/dts/arm/juno-base.dtsi#L137

>

> Suzuki

--
To unsubscribe from this list: send the line "unsubscribe devicetree" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Leo Yan March 17, 2017, 10:13 a.m. UTC | #6

On Wed, Mar 15, 2017 at 02:41:59PM -0600, Mathieu Poirier wrote:
> On 15 March 2017 at 10:44, Suzuki K Poulose <Suzuki.Poulose@arm.com> wrote:

> > On 13/03/17 16:56, Mathieu Poirier wrote:

> >> On Fri, Mar 10, 2017 at 02:29:53PM +0000, Suzuki K Poulose wrote:


[...]

> >>> Btw, I don't see any PM calls to make sure the power domain (at least the

> >>> debug domain)

> >>> is up, which could cause problems with accesses to some of these

> >>> registers (leave alone the

> >>> ones in CPU power domain), especially the EDPRSR. We could also do

> >>> pm_runtime_get on the

> >>> CPU's power domain, if the CPU is online, before we access the pcsr.

> >>

> >>

> >> I thought about PM runtime operations a little while back but wondered if

> >> it is

> >> really a good thing to have them around.  When this code is called the

> >> system

> >> has crashed and as such making PM runtimes call isn't a good idea.

> >

> >

> > You are right. It is not safe to make such calls when we have crashed.

> > The other side effect is, if we don't have the debug power domain up,

> > we could possibly hang the system and prevent other registered notifiers

> > from running, which doesn't sound good either.

> >

> >>

> >> One thing we could do is _not_ call pm_runtime_put() at the end of the

> >> probe()

> >> operation.  That way we wouldn't have to mess around with PM runtime

> >> operations

> >> on an unstable system.  This, of course, is costly in terms of power

> >> consumption

> >> but the system is under test/debug anyway.

> >

> >

> > May be control the behavior via kernel command line ? Something like

> > coresight_debug={on or 1} or

> > even use the "nohlt" ?

> 

> We need to deal with the debug and CPU power domains.

> 

> For the former I suggest we do what coresight does and use the

> "power-domains" binding[1].  For the CPU power domain we can re-use

> the "nohlt" flag.  In the probe function if the "nohlt" cmd line flag

> is not set the code bails out.  If it is set pm_runtime_put() is _not_

> called and the driver can be used without worries of hanging the

> system when the panic handler is invoked.

> 

> Am I forgetting something?


I tested this drvier on Hikey and DB410c. For Hikey we must pass
"nohlt" to disable low power states, otherwise the kernel will hang
during initialization. But for DB410c, this driver even can work well
without "nohlt" and I checked the CPUIdle has been really enabled with
below sysfs entries:

root@linaro-developer:~# cat /sys/devices/system/cpu/cpu*/cpuidle/state*/usage
5992
6988
4225
4547
2790
23696
4202
3899

And from my previous experience, I'm quite sure some SoCs can access
the debug module registers with CPUIdle enabled, and it will read
back 0xFFFF_FFFF_FFFF_FFFF when the CPU stays in low power state.
So I prefer we could keep current method to suggest to use "nohlt" in
Kconfig's help description but it's not mandotory to check this in
the code. How about you think for this?

Thanks,
Leo Yan
--
To unsubscribe from this list: send the line "unsubscribe devicetree" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Mathieu Poirier March 17, 2017, 3:50 p.m. UTC | #7

On Fri, Mar 17, 2017 at 06:13:28PM +0800, Leo Yan wrote:
> On Wed, Mar 15, 2017 at 02:41:59PM -0600, Mathieu Poirier wrote:

> > On 15 March 2017 at 10:44, Suzuki K Poulose <Suzuki.Poulose@arm.com> wrote:

> > > On 13/03/17 16:56, Mathieu Poirier wrote:

> > >> On Fri, Mar 10, 2017 at 02:29:53PM +0000, Suzuki K Poulose wrote:

> 

> [...]

> 

> > >>> Btw, I don't see any PM calls to make sure the power domain (at least the

> > >>> debug domain)

> > >>> is up, which could cause problems with accesses to some of these

> > >>> registers (leave alone the

> > >>> ones in CPU power domain), especially the EDPRSR. We could also do

> > >>> pm_runtime_get on the

> > >>> CPU's power domain, if the CPU is online, before we access the pcsr.

> > >>

> > >>

> > >> I thought about PM runtime operations a little while back but wondered if

> > >> it is

> > >> really a good thing to have them around.  When this code is called the

> > >> system

> > >> has crashed and as such making PM runtimes call isn't a good idea.

> > >

> > >

> > > You are right. It is not safe to make such calls when we have crashed.

> > > The other side effect is, if we don't have the debug power domain up,

> > > we could possibly hang the system and prevent other registered notifiers

> > > from running, which doesn't sound good either.

> > >

> > >>

> > >> One thing we could do is _not_ call pm_runtime_put() at the end of the

> > >> probe()

> > >> operation.  That way we wouldn't have to mess around with PM runtime

> > >> operations

> > >> on an unstable system.  This, of course, is costly in terms of power

> > >> consumption

> > >> but the system is under test/debug anyway.

> > >

> > >

> > > May be control the behavior via kernel command line ? Something like

> > > coresight_debug={on or 1} or

> > > even use the "nohlt" ?

> > 

> > We need to deal with the debug and CPU power domains.

> > 

> > For the former I suggest we do what coresight does and use the

> > "power-domains" binding[1].  For the CPU power domain we can re-use

> > the "nohlt" flag.  In the probe function if the "nohlt" cmd line flag

> > is not set the code bails out.  If it is set pm_runtime_put() is _not_

> > called and the driver can be used without worries of hanging the

> > system when the panic handler is invoked.

> > 

> > Am I forgetting something?

> 

> I tested this drvier on Hikey and DB410c. For Hikey we must pass

> "nohlt" to disable low power states, otherwise the kernel will hang

> during initialization. But for DB410c, this driver even can work well

> without "nohlt" and I checked the CPUIdle has been really enabled with

> below sysfs entries:

> 

> root@linaro-developer:~# cat /sys/devices/system/cpu/cpu*/cpuidle/state*/usage

> 5992

> 6988

> 4225

> 4547

> 2790

> 23696

> 4202

> 3899

> 

> And from my previous experience, I'm quite sure some SoCs can access

> the debug module registers with CPUIdle enabled, and it will read

> back 0xFFFF_FFFF_FFFF_FFFF when the CPU stays in low power state.

> So I prefer we could keep current method to suggest to use "nohlt" in

> Kconfig's help description but it's not mandotory to check this in

> the code. How about you think for this?


If we don't check for "nohlt" some platform may freeze, others may work.  If we
mandate that "nohlt" be present on the kernel cmd line it works in all cases.
As such mandating that "nohlt" be present is a better way to go.

Mathieu

> 

> Thanks,

> Leo Yan

--
To unsubscribe from this list: send the line "unsubscribe devicetree" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Leo Yan March 17, 2017, 4:28 p.m. UTC | #8

On Fri, Mar 17, 2017 at 09:50:07AM -0600, Mathieu Poirier wrote:

[...]

> If we don't check for "nohlt" some platform may freeze, others may work.  If we

> mandate that "nohlt" be present on the kernel cmd line it works in all cases.

> As such mandating that "nohlt" be present is a better way to go.


Sure, so I will add below checking code in the probe function, please
let me know if you have alter better way to implement this:

+       if (IS_ENABLED(CONFIG_CPU_IDLE) &&
+                       !strstr(boot_command_line, "nohlt")) {
+               dev_err(dev, "May not be accessible in CPU power domain.\n");
+               return -EPERM;
+       }

Thanks,
Leo Yan
--
To unsubscribe from this list: send the line "unsubscribe devicetree" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Leo Yan March 21, 2017, 11:47 a.m. UTC | #9

On Tue, Mar 21, 2017 at 10:16:45AM +0000, Suzuki K Poulose wrote:

[...]

> >>In my opinion booting with "nohlt" on the cmd line is sufficient to

> >>determine if we should use the driver or not.  That way we also avoid

> >>declaring yet another sysfs flag, something I really want to avoid.

> >

> >Agree.

> >

> >I did spend some time to implement coresight core framework to support

> >debug module, you could see it on:http://termbin.com/k2fj; this also

> >gives me more sense which is better choice. If declaring another sysfs

> >flag to support debug module in coresight framework, this lets the

> >codes and interfaces more complex. E.g. for best fit into coresight

> >framework, finally we can get 8 sysfs nodes for 8 CPUs in system; so

> >that means we need enable every CPU one by one.

> 

> Having a node for each debug area indeed doesn't look good. We could

> as will stick a single node under /sys/kernel/debug/ which would enable/disable

> the debug component.

> 

> I am OK with it being tied to nohlt. In that case we will have to add

> a Kconfig dependency on GENERIC_IDLE_POLL_SETUP (though it is selected

> by default on ARM/ARM64). Parsing the boot command line for nohlt doesn't

> look like a good idea. We may have to figure out a way to do that.


I remembered that we can use QoS constraint for CPUIdle:
        pm_qos_add_request(pm_qos_req, PM_QOS_CPU_DMA_LATENCY, 0);

Using this way we can disable all low power states and don't depend on
nohlt anymore.

> Also, please could you add support for building this as a module ? Since it

> doesn't depend on the coresight bus anyway, it should be pretty straight

> forward.


Yeah. Will support module building.

Thanks,
Leo Yan
--
To unsubscribe from this list: send the line "unsubscribe devicetree" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Mathieu Poirier March 21, 2017, 3:15 p.m. UTC | #10

On Tue, Mar 21, 2017 at 07:47:11PM +0800, Leo Yan wrote:
> On Tue, Mar 21, 2017 at 10:16:45AM +0000, Suzuki K Poulose wrote:

> 

> [...]

> 

> > >>In my opinion booting with "nohlt" on the cmd line is sufficient to

> > >>determine if we should use the driver or not.  That way we also avoid

> > >>declaring yet another sysfs flag, something I really want to avoid.

> > >

> > >Agree.

> > >

> > >I did spend some time to implement coresight core framework to support

> > >debug module, you could see it on:http://termbin.com/k2fj; this also

> > >gives me more sense which is better choice. If declaring another sysfs

> > >flag to support debug module in coresight framework, this lets the

> > >codes and interfaces more complex. E.g. for best fit into coresight

> > >framework, finally we can get 8 sysfs nodes for 8 CPUs in system; so

> > >that means we need enable every CPU one by one.

> > 

> > Having a node for each debug area indeed doesn't look good. We could

> > as will stick a single node under /sys/kernel/debug/ which would enable/disable

> > the debug component.

> > 

> > I am OK with it being tied to nohlt. In that case we will have to add

> > a Kconfig dependency on GENERIC_IDLE_POLL_SETUP (though it is selected

> > by default on ARM/ARM64). Parsing the boot command line for nohlt doesn't

> > look like a good idea. We may have to figure out a way to do that.

> 

> I remembered that we can use QoS constraint for CPUIdle:

>         pm_qos_add_request(pm_qos_req, PM_QOS_CPU_DMA_LATENCY, 0);

> 

> Using this way we can disable all low power states and don't depend on

> nohlt anymore.


The idea is to use "nohlt" as a trigger option.  If it is set then we use this
driver - if it isn't then it is kept under wrap.

> 

> > Also, please could you add support for building this as a module ? Since it

> > doesn't depend on the coresight bus anyway, it should be pretty straight

> > forward.

> 

> Yeah. Will support module building.

> 

> Thanks,

> Leo Yan

--
To unsubscribe from this list: send the line "unsubscribe devicetree" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Sudeep Holla March 22, 2017, 2:07 p.m. UTC | #11

On 22/03/17 12:54, Mike Leach wrote:
> 

> 

> On 21 March 2017 at 15:39, Sudeep Holla <sudeep.holla@arm.com 

> <mailto:sudeep.holla@arm.com>> wrote:

> 

[...]

> I disagree with this approach. One of the main usefulness of such

> self hosted debug feature is to debug issues around features like

> cpuidle. Adding constraints like "cpuidle needs to be disabled" is

> not good IMO. There are ways to make it work with cpuidle enabled.

> Please explore them. In particular refer H9.2.39 EDPRCR, External

> Debug Power/Reset Control Register.

> 

> So, "nohlt" option is not an option. I prefer some sysfs option like 

> Suzuki suggested to enable this feature on demand if power saving in 

> normal usecase is the concern. Using "nohlt" just disables idle and 

> doesn't ensure the debug power domain is ON. Using the flag directly

> in this driver to enable debug power domain also sounds misuse of

> that flag for me.

> 

> I think the key issue to remember here is that experience with

> external debug shows that CPU Idle means different things to

> different SoC designs / power management schemes. (and we are using

> external debug in a self hosted way here).

> 


Yes agreed on the point that meaning of "cpuidle" differs on each SoC.

> Some designs will power down an entire cluster if all CPUs on the 

> cluster are powered down - including the parts of the debug

> registers that should remain powered in the debug power domain.


Interesting, at-least ETMv4 or some other coresight specification
clearly classify the power domains and the register access. The actual
power domain itself may vary depending on implementation.

> The bits in EDPRCR are not respected in these cases - these designs

> do not really support debug over power down in the way that the

> CoreSight / Debug designers anticipated. This means that even

> checking EDPRSR has the potential to cause a bus hang if the target

> register is unpowered. (and if the debug power domain is unpowered

> then the PC data is also lost).

> 


Agreed, but can we start supporting the sane designs in sane way first.
We can always add compatible and handle deviations. I agree we may need
to support such deviations but starting with that seems setting a bad
example.

> In these cases, accessing to the debug registers while they are not 

> powered is a recipe for disaster - so preventing CPUIdle and the

> subsequent cluster power down  allows investigation on this class of

> system - and allowing the CPUs of interest be interrogated without

> hanging the crash log process.

> 


Agreed. But my point is that many issues are around cpuidle and some
usecase and just eliminating that use-case sounds bad. For me,
core-sight was most useful to debug issues around cpu power management
and lockups where we can't stop cores but examine these registers.
There are other alternatives for other use-cases IMO.

> 

> On systems that do behave correctly with respect to debug power 

> domains, then disabling CPUIdle is unnecessary - these can be

> controlled by EDPRCR - perhaps; per the specification it is

> "implementation defined" if writing bits to this register have an

> effect on the system anyway even if the debug domain is correctly

> powered.

> 


We can always do that unconditionally. If implementations don't honor
those bits, it's different. If they hang on accessing something which is
on debug power domain and not on core power domain, then you have much
bigger issue to solve. How can you even trust and make any other
register accesses that are in debug power domain then ?

> While it is true to say that disabling CPUIdle does not guarantee

> that the debug power domain is on, it does in a certain class of

> designs prevent it being powered off (Juno historically - not sure if

> that is still the case.).

> 


Again it's completely platform specific. All you need to care is that
the debug power domain is on or not. Disabling CPUIdle to achieve that
is simply wrong and may work only on few platforms.

> However, I do agree that the use of the driver should not be

> triggered _only_ on the existence of /nohlt on the command line - 

> there is a class of designs where this will not be required.

> 


Thanks

> When enabing the driver as a kernel config the user needs to

> decide:- 1) do I need this to debug the issue I am seeing 2) does the

> power management on my system require I use /nohlt as well.


Please don't *misuse* nohlt to disable idle. There are other ways to
do the same either from the user-space or from the driver.

> 

> I think that the use of /nohlt as an option, and the reasons why it 

> might be needed should be part of the configuration help in this

> case.

> 

> There is also a case for considering if there should be an option to 

> configure it to be enabled or disabled at boot time. It is easy to 

> imagine cases I want to have this running from the start as a crash 

> happens early - and cases I can enable it on demand later.

> 


Also consider with cpuidle enabled ;). I can help testing if needed.

-- 
Regards,
Sudeep
--
To unsubscribe from this list: send the line "unsubscribe devicetree" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Mike Leach March 22, 2017, 3:45 p.m. UTC | #12

On 22 March 2017 at 14:07, Sudeep Holla <sudeep.holla@arm.com> wrote:
>

>

> On 22/03/17 12:54, Mike Leach wrote:

>>

>>

>> On 21 March 2017 at 15:39, Sudeep Holla <sudeep.holla@arm.com

>> <mailto:sudeep.holla@arm.com>> wrote:

>>

> [...]

>

>> I disagree with this approach. One of the main usefulness of such

>> self hosted debug feature is to debug issues around features like

>> cpuidle. Adding constraints like "cpuidle needs to be disabled" is

>> not good IMO. There are ways to make it work with cpuidle enabled.

>> Please explore them. In particular refer H9.2.39 EDPRCR, External

>> Debug Power/Reset Control Register.

>>

>> So, "nohlt" option is not an option. I prefer some sysfs option like

>> Suzuki suggested to enable this feature on demand if power saving in

>> normal usecase is the concern. Using "nohlt" just disables idle and

>> doesn't ensure the debug power domain is ON. Using the flag directly

>> in this driver to enable debug power domain also sounds misuse of

>> that flag for me.

>>

>> I think the key issue to remember here is that experience with

>> external debug shows that CPU Idle means different things to

>> different SoC designs / power management schemes. (and we are using

>> external debug in a self hosted way here).

>>

>

> Yes agreed on the point that meaning of "cpuidle" differs on each SoC.

>

>> Some designs will power down an entire cluster if all CPUs on the

>> cluster are powered down - including the parts of the debug

>> registers that should remain powered in the debug power domain.

>

> Interesting, at-least ETMv4 or some other coresight specification

> clearly classify the power domains and the register access. The actual

> power domain itself may vary depending on implementation.

Yes - the ETMv4 spec defines what should be in the core / debug power
domains, but there is no architectural requirement for these to be
separate.
Most of power management is "implementation defined", & hw designers
seem to have different criteria than sw engineers wanting to debug
stuff.

>

>> The bits in EDPRCR are not respected in these cases - these designs

>> do not really support debug over power down in the way that the

>> CoreSight / Debug designers anticipated. This means that even

>> checking EDPRSR has the potential to cause a bus hang if the target

>> register is unpowered. (and if the debug power domain is unpowered

>> then the PC data is also lost).

>>

>

> Agreed, but can we start supporting the sane designs in sane way first.

> We can always add compatible and handle deviations. I agree we may need

> to support such deviations but starting with that seems setting a bad

> example.

>

From a pragmatic point of view, we have to support the designs that we
have and are currently using.
Sadly this might include some that do not behave in an ideal way.

I'm not saying disabling CPUIdle is right for all cases, or perhaps
many, but it has in the past been useful in specific instances - not
just for external debug, but to use CoreSight trace etc, were powering
down the a CPU / cluster takes out ETM accesses and breaks stuff.

The key is that to use this driver, the user has to be aware of the PM
implications on their specific system - the kernel may not take care
of it all for them - as SCP type power controllers are often external
and may have unique firmware and capabilites.

Historically CPUIdle disable has been used as a blunt instrument to
handle power management problems in real debug use cases - but it is
one that has been successful. Where there are better methods then I am
all for using these.

>> In these cases, accessing to the debug registers while they are not

>> powered is a recipe for disaster - so preventing CPUIdle and the

>> subsequent cluster power down allows investigation on this class of

>> system - and allowing the CPUs of interest be interrogated without

>> hanging the crash log process.

>>

>

> Agreed. But my point is that many issues are around cpuidle and some

> usecase and just eliminating that use-case sounds bad. For me,

> core-sight was most useful to debug issues around cpu power management

> and lockups where we can't stop cores but examine these registers.

> There are other alternatives for other use-cases IMO.

For your case, removing what you are interested in debugging is
evidently counter productive, so other techniques need to be used to
ensure the CS regs remain alive.
But equally there could be use cases where this might be just fine or
even the only way.

>

>>

>> On systems that do behave correctly with respect to debug power

>> domains, then disabling CPUIdle is unnecessary - these can be

>> controlled by EDPRCR - perhaps; per the specification it is

>> "implementation defined" if writing bits to this register have an

>> effect on the system anyway even if the debug domain is correctly

>> powered.

>>

>

> We can always do that unconditionally. If implementations don't honor

> those bits, it's different. If they hang on accessing something which is

> on debug power domain and not on core power domain, then you have much

> bigger issue to solve. How can you even trust and make any other

> register accesses that are in debug power domain then ?

>

It is difficult and highly platform dependent. For external debug we
might have per-platform rules built into the debugger on what can and
cannot be done and when, plus on occasion some power management
scripts. Those platforms that get closest to the "standard" CoreSight
power management are easiest to debug.

>> While it is true to say that disabling CPUIdle does not guarantee

>> that the debug power domain is on, it does in a certain class of

>> designs prevent it being powered off (Juno historically - not sure if

>> that is still the case.).

>>

>

> Again it's completely platform specific. All you need to care is that

> the debug power domain is on or not. Disabling CPUIdle to achieve that

> is simply wrong and may work only on few platforms.

>

>> However, I do agree that the use of the driver should not be

>> triggered _only_ on the existence of /nohlt on the command line -

>> there is a class of designs where this will not be required.

>>

>

> Thanks

>

>> When enabing the driver as a kernel config the user needs to

>> decide:- 1) do I need this to debug the issue I am seeing 2) does the

>> power management on my system require I use /nohlt as well.

>

> Please don't *misuse* nohlt to disable idle. There are other ways to

> do the same either from the user-space or from the driver.

I'm not advocating /nohlt above anything else - it's just what is
being discussed here. Furthermore, no one debug technique is ever
going to be appropriate in all circumstances - debug and trace are
always a compromise.

I initially raised the issue of clusters powering down, and
possibility that no CPUIdle might prevent this, to ensure that
awareness is built in to driver / config / help text /documentation
that these are real issues seen in the external debug world.

The key point is that the caveat in using this driver is that the
power management has to be considered on a platform specific basis
before it is configured; and appropriate actions may be needed for it
to work correctly. Without this then the driver could cause more
issues than it debugs. A user selecting this _must_ be told about
these issues

>

>>

>> I think that the use of /nohlt as an option, and the reasons why it

>> might be needed should be part of the configuration help in this

>> case.

>>

>> There is also a case for considering if there should be an option to

>> configure it to be enabled or disabled at boot time. It is easy to

>> imagine cases I want to have this running from the start as a crash

>> happens early - and cases I can enable it on demand later.

>>

>

> Also consider with cpuidle enabled ;). I can help testing if needed.

>

> --

> Regards,

> Sudeep

Regards

Mike

-- 
Mike Leach
Principal Engineer, ARM Ltd.
Blackburn Design Centre. UK
--
To unsubscribe from this list: send the line "unsubscribe devicetree" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Sudeep Holla March 22, 2017, 4:17 p.m. UTC | #13

On 22/03/17 15:45, Mike Leach wrote:
> On 22 March 2017 at 14:07, Sudeep Holla <sudeep.holla@arm.com> wrote:

>>

>>

>> On 22/03/17 12:54, Mike Leach wrote:

>>>

>>>

>>> On 21 March 2017 at 15:39, Sudeep Holla <sudeep.holla@arm.com

>>> <mailto:sudeep.holla@arm.com>> wrote:

>>>

>> [...]

>>

>>> I disagree with this approach. One of the main usefulness of such

>>> self hosted debug feature is to debug issues around features like

>>> cpuidle. Adding constraints like "cpuidle needs to be disabled" is

>>> not good IMO. There are ways to make it work with cpuidle enabled.

>>> Please explore them. In particular refer H9.2.39 EDPRCR, External

>>> Debug Power/Reset Control Register.

>>>

>>> So, "nohlt" option is not an option. I prefer some sysfs option like

>>> Suzuki suggested to enable this feature on demand if power saving in

>>> normal usecase is the concern. Using "nohlt" just disables idle and

>>> doesn't ensure the debug power domain is ON. Using the flag directly

>>> in this driver to enable debug power domain also sounds misuse of

>>> that flag for me.

>>>

>>> I think the key issue to remember here is that experience with

>>> external debug shows that CPU Idle means different things to

>>> different SoC designs / power management schemes. (and we are using

>>> external debug in a self hosted way here).

>>>

>>

>> Yes agreed on the point that meaning of "cpuidle" differs on each SoC.

>>

>>> Some designs will power down an entire cluster if all CPUs on the

>>> cluster are powered down - including the parts of the debug

>>> registers that should remain powered in the debug power domain.

>>

>> Interesting, at-least ETMv4 or some other coresight specification

>> clearly classify the power domains and the register access. The actual

>> power domain itself may vary depending on implementation.

> 

> Yes - the ETMv4 spec defines what should be in the core / debug power

> domains, but there is no architectural requirement for these to be

> separate.

> Most of power management is "implementation defined", & hw designers

> seem to have different criteria than sw engineers wanting to debug

> stuff.

> 


Yes I agree, no argument there.

>>> The bits in EDPRCR are not respected in these cases - these designs

>>> do not really support debug over power down in the way that the

>>> CoreSight / Debug designers anticipated. This means that even

>>> checking EDPRSR has the potential to cause a bus hang if the target

>>> register is unpowered. (and if the debug power domain is unpowered

>>> then the PC data is also lost).

>>>

>>

>> Agreed, but can we start supporting the sane designs in sane way first.

>> We can always add compatible and handle deviations. I agree we may need

>> to support such deviations but starting with that seems setting a bad

>> example.

>>

> 

> From a pragmatic point of view, we have to support the designs that we

> have and are currently using.

> Sadly this might include some that do not behave in an ideal way.

> 


We will have to support them. But I don't want that to be defacto
standard. They should be treated as deviations. Though specification
says the behavior is IMPDEF, it does provide standard interface(EDPRCR)
to use and we should have that in the driver IMO.

> I'm not saying disabling CPUIdle is right for all cases, or perhaps

> many, but it has in the past been useful in specific instances - not

> just for external debug, 


Yes I know but that's either issue with the firmware or the debugger
and we can work them around in user-space too instead of baking the
solution to kernel.

> but to use CoreSight trace etc, were powering

> down the a CPU / cluster takes out ETM accesses and breaks stuff.

> 


FYI, I added support in ETMv4 to emulate power-down during active trace
session and so far I have not seen any report on things breaking because
of that register access(may be no one has tested it on other platforms
yet :()

> The key is that to use this driver, the user has to be aware of the PM

> implications on their specific system - the kernel may not take care

> of it all for them - as SCP type power controllers are often external

> and may have unique firmware and capabilites.

> 


Sure

> Historically CPUIdle disable has been used as a blunt instrument to

> handle power management problems in real debug use cases - but it is

> one that has been successful. Where there are better methods then I am

> all for using these.

> 


Cool, thanks. Lets try out that and see if it helps first before bluntly
advertising CPUIdle disabling and that too in misleading ways like "nohlt"

>>> In these cases, accessing to the debug registers while they are not

>>> powered is a recipe for disaster - so preventing CPUIdle and the

>>> subsequent cluster power down allows investigation on this class of

>>> system - and allowing the CPUs of interest be interrogated without

>>> hanging the crash log process.

>>>

>>

>> Agreed. But my point is that many issues are around cpuidle and some

>> usecase and just eliminating that use-case sounds bad. For me,

>> core-sight was most useful to debug issues around cpu power management

>> and lockups where we can't stop cores but examine these registers.

>> There are other alternatives for other use-cases IMO.

> 

> For your case, removing what you are interested in debugging is

> evidently counter productive, so other techniques need to be used to

> ensure the CS regs remain alive.

> But equally there could be use cases where this might be just fine or

> even the only way.

>


Again not arguing on that. Just saying we can try out if that solves the
issue. As I said there are other ways to disable idle and we can
advertise those instead of such misuse. Also those alternative methods
are runtime and can be used when you need them.

>>>

>>> On systems that do behave correctly with respect to debug power

>>> domains, then disabling CPUIdle is unnecessary - these can be

>>> controlled by EDPRCR - perhaps; per the specification it is

>>> "implementation defined" if writing bits to this register have an

>>> effect on the system anyway even if the debug domain is correctly

>>> powered.

>>>

>>

>> We can always do that unconditionally. If implementations don't honor

>> those bits, it's different. If they hang on accessing something which is

>> on debug power domain and not on core power domain, then you have much

>> bigger issue to solve. How can you even trust and make any other

>> register accesses that are in debug power domain then ?

>>

> 

> It is difficult and highly platform dependent. For external debug we

> might have per-platform rules built into the debugger on what can and

> cannot be done and when, plus on occasion some power management

> scripts. Those platforms that get closest to the "standard" CoreSight

> power management are easiest to debug.

> 


Absolutely, so just mandating might just solve issue *accidentally*
not *intentionally*. So I don't want it to be advertised in that way
and that becomes defacto.

[...]
> 

> I'm not advocating /nohlt above anything else - it's just what is

> being discussed here. Furthermore, no one debug technique is ever

> going to be appropriate in all circumstances - debug and trace are

> always a compromise.

> 


True.

> I initially raised the issue of clusters powering down, and

> possibility that no CPUIdle might prevent this, to ensure that

> awareness is built in to driver / config / help text /documentation

> that these are real issues seen in the external debug world.

> 


Point taken. So we could just specify that all necessary power domains
need to be on for proper functionality for this feature and that it's
highly platform specific instead of mixing cpu/cluster idle details here.

> The key point is that the caveat in using this driver is that the

> power management has to be considered on a platform specific basis

> before it is configured; and appropriate actions may be needed for it

> to work correctly. Without this then the driver could cause more

> issues than it debugs. A user selecting this _must_ be told about

> these issues

> 


Exactly and the help text could be something on the similar lines as you
mention/summarize above.

-- 
Regards,
Sudeep
--
To unsubscribe from this list: send the line "unsubscribe devicetree" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Suzuki K Poulose March 22, 2017, 5:09 p.m. UTC | #14

On 22/03/17 16:17, Sudeep Holla wrote:
>

>

> On 22/03/17 15:45, Mike Leach wrote:

>> On 22 March 2017 at 14:07, Sudeep Holla <sudeep.holla@arm.com> wrote:

>>>

>>>

>>> On 22/03/17 12:54, Mike Leach wrote:

>>>>

>>>>

>>>> On 21 March 2017 at 15:39, Sudeep Holla <sudeep.holla@arm.com

>>>> <mailto:sudeep.holla@arm.com>> wrote:

>>>>

>>> [...]

>>>

>>>> I disagree with this approach. One of the main usefulness of such

>>>> self hosted debug feature is to debug issues around features like

>>>> cpuidle. Adding constraints like "cpuidle needs to be disabled" is

>>>> not good IMO. There are ways to make it work with cpuidle enabled.

>>>> Please explore them. In particular refer H9.2.39 EDPRCR, External

>>>> Debug Power/Reset Control Register.

>>>>

>>>> So, "nohlt" option is not an option. I prefer some sysfs option like

>>>> Suzuki suggested to enable this feature on demand if power saving in

>>>> normal usecase is the concern. Using "nohlt" just disables idle and

>>>> doesn't ensure the debug power domain is ON. Using the flag directly

>>>> in this driver to enable debug power domain also sounds misuse of

>>>> that flag for me.

>>>>

>>>> I think the key issue to remember here is that experience with

>>>> external debug shows that CPU Idle means different things to

>>>> different SoC designs / power management schemes. (and we are using

>>>> external debug in a self hosted way here).

>>>>

>>>

>>> Yes agreed on the point that meaning of "cpuidle" differs on each SoC.

>>>

>>>> Some designs will power down an entire cluster if all CPUs on the

>>>> cluster are powered down - including the parts of the debug

>>>> registers that should remain powered in the debug power domain.

>>>

>>> Interesting, at-least ETMv4 or some other coresight specification

>>> clearly classify the power domains and the register access. The actual

>>> power domain itself may vary depending on implementation.

>>

>> Yes - the ETMv4 spec defines what should be in the core / debug power

>> domains, but there is no architectural requirement for these to be

>> separate.

>> Most of power management is "implementation defined", & hw designers

>> seem to have different criteria than sw engineers wanting to debug

>> stuff.

>>

>

> Yes I agree, no argument there.

>

>>>> The bits in EDPRCR are not respected in these cases - these designs

>>>> do not really support debug over power down in the way that the

>>>> CoreSight / Debug designers anticipated. This means that even

>>>> checking EDPRSR has the potential to cause a bus hang if the target

>>>> register is unpowered. (and if the debug power domain is unpowered

>>>> then the PC data is also lost).

>>>>

>>>

>>> Agreed, but can we start supporting the sane designs in sane way first.

>>> We can always add compatible and handle deviations. I agree we may need

>>> to support such deviations but starting with that seems setting a bad

>>> example.

>>>

>>

>> From a pragmatic point of view, we have to support the designs that we

>> have and are currently using.

>> Sadly this might include some that do not behave in an ideal way.

>>

>

> We will have to support them. But I don't want that to be defacto

> standard. They should be treated as deviations. Though specification

> says the behavior is IMPDEF, it does provide standard interface(EDPRCR)

> to use and we should have that in the driver IMO.

>

>> I'm not saying disabling CPUIdle is right for all cases, or perhaps

>> many, but it has in the past been useful in specific instances - not

>> just for external debug,

>

> Yes I know but that's either issue with the firmware or the debugger

> and we can work them around in user-space too instead of baking the

> solution to kernel.

>

>> but to use CoreSight trace etc, were powering

>> down the a CPU / cluster takes out ETM accesses and breaks stuff.

>>

>

> FYI, I added support in ETMv4 to emulate power-down during active trace

> session and so far I have not seen any report on things breaking because

> of that register access(may be no one has tested it on other platforms

> yet :()

>

>> The key is that to use this driver, the user has to be aware of the PM

>> implications on their specific system - the kernel may not take care

>> of it all for them - as SCP type power controllers are often external

>> and may have unique firmware and capabilites.

>>

>

> Sure

>

>> Historically CPUIdle disable has been used as a blunt instrument to

>> handle power management problems in real debug use cases - but it is

>> one that has been successful. Where there are better methods then I am

>> all for using these.

>>

>

> Cool, thanks. Lets try out that and see if it helps first before bluntly

> advertising CPUIdle disabling and that too in misleading ways like "nohlt"

>

>>>> In these cases, accessing to the debug registers while they are not

>>>> powered is a recipe for disaster - so preventing CPUIdle and the

>>>> subsequent cluster power down allows investigation on this class of

>>>> system - and allowing the CPUs of interest be interrogated without

>>>> hanging the crash log process.

>>>>

>>>

>>> Agreed. But my point is that many issues are around cpuidle and some

>>> usecase and just eliminating that use-case sounds bad. For me,

>>> core-sight was most useful to debug issues around cpu power management

>>> and lockups where we can't stop cores but examine these registers.

>>> There are other alternatives for other use-cases IMO.

>>

>> For your case, removing what you are interested in debugging is

>> evidently counter productive, so other techniques need to be used to

>> ensure the CS regs remain alive.

>> But equally there could be use cases where this might be just fine or

>> even the only way.

>>

>

> Again not arguing on that. Just saying we can try out if that solves the

> issue. As I said there are other ways to disable idle and we can

> advertise those instead of such misuse. Also those alternative methods

> are runtime and can be used when you need them.

>

>>>>

>>>> On systems that do behave correctly with respect to debug power

>>>> domains, then disabling CPUIdle is unnecessary - these can be

>>>> controlled by EDPRCR - perhaps; per the specification it is

>>>> "implementation defined" if writing bits to this register have an

>>>> effect on the system anyway even if the debug domain is correctly

>>>> powered.

>>>>

>>>

>>> We can always do that unconditionally. If implementations don't honor

>>> those bits, it's different. If they hang on accessing something which is

>>> on debug power domain and not on core power domain, then you have much

>>> bigger issue to solve. How can you even trust and make any other

>>> register accesses that are in debug power domain then ?

>>>

>>

>> It is difficult and highly platform dependent. For external debug we

>> might have per-platform rules built into the debugger on what can and

>> cannot be done and when, plus on occasion some power management

>> scripts. Those platforms that get closest to the "standard" CoreSight

>> power management are easiest to debug.

>>

>

> Absolutely, so just mandating might just solve issue *accidentally*

> not *intentionally*. So I don't want it to be advertised in that way

> and that becomes defacto.

>

> [...]

>>

>> I'm not advocating /nohlt above anything else - it's just what is

>> being discussed here. Furthermore, no one debug technique is ever

>> going to be appropriate in all circumstances - debug and trace are

>> always a compromise.

>>

>

> True.

>

>> I initially raised the issue of clusters powering down, and

>> possibility that no CPUIdle might prevent this, to ensure that

>> awareness is built in to driver / config / help text /documentation

>> that these are real issues seen in the external debug world.

>>

>

> Point taken. So we could just specify that all necessary power domains

> need to be on for proper functionality for this feature and that it's

> highly platform specific instead of mixing cpu/cluster idle details here.

>

>> The key point is that the caveat in using this driver is that the

>> power management has to be considered on a platform specific basis

>> before it is configured; and appropriate actions may be needed for it

>> to work correctly. Without this then the driver could cause more

>> issues than it debugs. A user selecting this _must_ be told about

>> these issues

>>


So given all the possible caveats, I think we :

1) Shouldn't enable the driver by default at runtime even if it is built-in.
2) Should provide mechanisms to turn it on at boot (via kernel commandline)
    or anytime later (via sysfs), which kind of puts the responsibility back on
    the user : "You know what you are doing".
3) Shouldn't turn the driver on based on "nohlt" which the user could use it for
    some other purposes, without explicit intention of turning this driver on).
4) Should document the fact that, on some platforms, the user may have to disable
    CPUidle explicitly to get the driver working. But let us not make it the default.
    The user with a not so ideal platform could add "nohlt" and get it working.

Suzuki
    

--
To unsubscribe from this list: send the line "unsubscribe devicetree" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Sudeep Holla March 22, 2017, 5:25 p.m. UTC | #15

On 22/03/17 17:09, Suzuki K Poulose wrote:
> On 22/03/17 16:17, Sudeep Holla wrote:


[...]

>> 

>> Point taken. So we could just specify that all necessary power

>> domains need to be on for proper functionality for this feature and

>> that it's highly platform specific instead of mixing cpu/cluster

>> idle details here.

>> 

>>> The key point is that the caveat in using this driver is that

>>> the power management has to be considered on a platform specific

>>> basis before it is configured; and appropriate actions may be

>>> needed for it to work correctly. Without this then the driver

>>> could cause more issues than it debugs. A user selecting this

>>> _must_ be told about these issues

>>> 

> 

> So given all the possible caveats, I think we :

> 

> 1) Shouldn't enable the driver by default at runtime even if it is 

> built-in. 

> 2) Should provide mechanisms to turn it on at boot (via

> kernel commandline) or anytime later (via sysfs), which kind of puts

> the responsibility back on the user : "You know what you are doing". 

> 3) Shouldn't turn the driver on based on "nohlt" which the user

> could use it for some other purposes, without explicit intention of

> turning this driver on).

> 4) Should document the fact that, on some

> platforms, the user may have to disable CPUidle explicitly to get the

> driver working. But let us not make it the default. The user with a

> not so ideal platform could add "nohlt" and get it working.

> 


Agreed on all points and well summarized.
I would like to highlight (3) and (4) as it needs to be well understood.

"nohlt" has a *different* meaning already, so using that in this
driver for something else is simple wrong as it affects the system in
unintended ways. And yes if user (mis)uses it to get things working,
it's fine but shouldn't be recommended way.

-- 
Regards,
Sudeep
--
To unsubscribe from this list: send the line "unsubscribe devicetree" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Mike Leach March 23, 2017, 12:27 p.m. UTC | #16

Also agree with Suzuki's excellent summary.

A couple of notes on using EDPRCR....

The logic for this assuming a correctly implemented system should be
something like.....

if (prsr() == powered_down) { // if we are powered down request power up.
    sw_unlock()   // ensure that the sw lock register on the device is unlocked
    write(prcr(bit3=1))
    wait_with_timeout(prsr()==powered_up)   // must wait to see if the
system does power up - otherwise bad things happen!
    if(timed_out)
       fail("power up request for CPU N failed.")  // nothing to do
after this point though if a CPU can't be powered is should not be
included in the list to be checked on crash.
}
ensure_os_lock_unlocked()  // os lock has to be unlocked for bit 0 of
prcr to be writeable.
sw_unlock()   // ensure that the sw lock register on the device is unlocked
write(prcr(bit3=1 | bit0=1)) // the core is powered, set the
nopowerdown request bit so we don't lose it & emulate power down


I think that some of the above logic is already in the driver, so
needs to be adapted for the PRCR handling.


On the "bad" systems, the initial prsr check is likely to fail (crash
/ buslock) if the debug logic is not correctly powered. The user will
then have to use one or more of the PM mitigations previously
discussed that are appropriate for that specific platform and retry

Regards

Mike


On 23 March 2017 at 05:43, Leo Yan <leo.yan@linaro.org> wrote:
> On Wed, Mar 22, 2017 at 05:25:50PM +0000, Sudeep Holla wrote:

>>

>>

>> On 22/03/17 17:09, Suzuki K Poulose wrote:

>> > On 22/03/17 16:17, Sudeep Holla wrote:

>>

>> [...]

>>

>> >>

>> >> Point taken. So we could just specify that all necessary power

>> >> domains need to be on for proper functionality for this feature and

>> >> that it's highly platform specific instead of mixing cpu/cluster

>> >> idle details here.

>> >>

>> >>> The key point is that the caveat in using this driver is that

>> >>> the power management has to be considered on a platform specific

>> >>> basis before it is configured; and appropriate actions may be

>> >>> needed for it to work correctly. Without this then the driver

>> >>> could cause more issues than it debugs. A user selecting this

>> >>> _must_ be told about these issues

>> >>>

>> >

>> > So given all the possible caveats, I think we :

>> >

>> > 1) Shouldn't enable the driver by default at runtime even if it is

>> > built-in.

>> > 2) Should provide mechanisms to turn it on at boot (via

>> > kernel commandline) or anytime later (via sysfs), which kind of puts

>> > the responsibility back on the user : "You know what you are doing".

>> > 3) Shouldn't turn the driver on based on "nohlt" which the user

>> > could use it for some other purposes, without explicit intention of

>> > turning this driver on).

>> > 4) Should document the fact that, on some

>> > platforms, the user may have to disable CPUidle explicitly to get the

>> > driver working. But let us not make it the default. The user with a

>> > not so ideal platform could add "nohlt" and get it working.

>> >

>>

>> Agreed on all points and well summarized.

>> I would like to highlight (3) and (4) as it needs to be well understood.

>>

>> "nohlt" has a *different* meaning already, so using that in this

>> driver for something else is simple wrong as it affects the system in

>> unintended ways. And yes if user (mis)uses it to get things working,

>> it's fine but shouldn't be recommended way.

>

> Understand this point.

>

> I will try to use general way to constraint CPUIdle like other

> drivers.

>

> Thanks all for these good suggestions :)

>

> Thanks,

> Leo Yan




-- 
Mike Leach
Principal Engineer, ARM Ltd.
Blackburn Design Centre. UK
--
To unsubscribe from this list: send the line "unsubscribe devicetree" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

[v3,3/5] coresight: add support for debug module

Commit Message

Comments

Patch