mbox series

[v10,00/10] coresight: enable debug module

Message ID 1495167957-14923-1-git-send-email-leo.yan@linaro.org
Headers show
Series coresight: enable debug module | expand

Message

Leo Yan May 19, 2017, 4:25 a.m. UTC
ARMv8 architecture reference manual (ARM DDI 0487A.k) Chapter H7 "The
Sample-based Profiling Extension" has description for sampling
registers, we can utilize these registers to check program counter
value with combined CPU exception level, secure state, etc. So this is
helpful for CPU lockup bugs, e.g. if one CPU has run into infinite loop
with IRQ disabled; the 'hang' CPU cannot switch context and handle any
interrupt, so it cannot handle SMP call for stack dump, etc.

This patch series is to enable coresight debug module with sample-based
registers and register call back notifier for PCSR register dumping
when panic happens, so we can see below dumping info for panic; and
this patch series has considered the conditions for access permission
for debug registers self, so this can avoid access debug registers when
CPU power domain is off; the driver also try to figure out the CPU is
in secure or non-secure state.

Patch 0001 is to document the dt binding; patch 0002 adds one detailed
document to describe the Coresight debug module implementation, the
clock and power domain impaction on the driver, some examples for usage.

Patch 0003 is to document boot parameters used in kernel command line.

Patch 0004 is to add file entries for MAINTAINERS.

Patch 0005 is used to fix the func of_get_coresight_platform_data()
doesn't properly drop the reference to the CPU node pointer; and
patch 0006 is refactor to add new function of_coresight_get_cpu().

Patch 0007 is the driver for CPU debug module.

Patch 0008 in this series are to enable debug unit on 96boards Hikey,
Patch 0009 is to enable debug on 96boards DB410c. Have verified on both
two boards.
Patch 0010 is to add Coresight CPU debug nodes for Juno r0, r1 & r2.

We can enable debugging with two methods, adding parameters into kernel
command line for build-in module:
  coresight_cpu_debug.enable=1

Or we can wait the system has booted up to use debugfs nodes to enable
debugging:
  # echo 1 > /sys/kernel/debug/coresight_cpu_debug/enable

As result we can get below log after input command:
echo c > /proc/sysrq-trigger:

ARM external debug module:
coresight-cpu-debug 850000.debug: CPU[0]:
coresight-cpu-debug 850000.debug:  EDPRSR:  00000001 (Power:On DLK:Unlock)
coresight-cpu-debug 850000.debug:  EDPCSR:  [<ffff00000808e9bc>] handle_IPI+0x174/0x1d8
coresight-cpu-debug 850000.debug:  EDCIDSR: 00000000
coresight-cpu-debug 850000.debug:  EDVIDSR: 90000000 (State:Non-secure Mode:EL1/0 Width:64bits VMID:0)
coresight-cpu-debug 852000.debug: CPU[1]:
coresight-cpu-debug 852000.debug:  EDPRSR:  00000001 (Power:On DLK:Unlock)
coresight-cpu-debug 852000.debug:  EDPCSR:  [<ffff0000087fab34>] debug_notifier_call+0x23c/0x358
coresight-cpu-debug 852000.debug:  EDCIDSR: 00000000
coresight-cpu-debug 852000.debug:  EDVIDSR: 90000000 (State:Non-secure Mode:EL1/0 Width:64bits VMID:0)


[...]

Changes from v9:
* Used dev_xyz() to replace pr_xyz() for print log.
* Added DT binding patch for Juno shared by Suzuki.

Changes from v8:
* According to Mathieu suggestions to split the doc into two patches,
  one is for kernel parameter and another is for driver documentation.
* Add file entries to MAINTAINERS.
* According to Mathieu suggestions, refined functions
  debug_enable_func()/debug_disable_func().

Changes from v7:
* Fix operator priority bug.
* Minor sequence adjustment for function debug_func_exit().

Changes from v6:
* According to Suzuki and Mathieu suggestions, refined debug module
  driver to install panic notifier when insmod module; refined function
  debug_force_cpu_powered_up() for CPU power state checking; some minor
  fixing for output log, adding comments for memory barrier, code
  alignment.

Changes from v5:
* According to Suzuki and Mathieu suggestions, refined debug module
  driver to drop unused structure members, refactored initialization
  code to distinguish hardware implementation features, refactored
  flow for forcing CPU powered up, supported pm_runtime operations.
* Added one new doc file: Documentation/trace/coresight-cpu-debug.txt,
  which is used to describe detailed info for implementation, clock
  and power domain impaction on debug module, and exmaples for common
  usage.
* Removed "idle constraints" from debug driver.

Changes from v4:
* This version is mainly credit to ARM colleagues many contribution
  ideas for better quality (Thanks a lot Suzuki, Mike and Sudeep!).
* According to Suzuki suggestion, refined debug module driver to avoid
  memory leak for drvdata struct, handle PCSAMPLE_MODE=1, use flag
  drvdata.pc_has_offset to indicate if PCSR has offset, minor fixes.
* According to Mathieu suggestion, refined dt binding description.
* Changed driver to support module mode;
* According to Mike suggestion and very appreciate the pseudo code,
  added support to force CPU powered up with register EDPRCR;
* According to discussions, added command line and debugfs nodes to
  support enabling debugging for boot time, or later can dynamically
  enable/disable debugging by debugfs.
* According to Rob Herring suggestion, one minor fixes in DT binding.
* According to Stephen Boyd suggestion, add const quality to structure
  device_node. And used use of_cpu_device_node_get() to replace
  of_get_cpu_node() in patch 0003.

Changes from v3:
* Added Suzuki K Poulose's patch to fix issue for the func
  of_get_coresight_platform_data() doesn't properly drop the reference
  to the CPU node pointer.
* According to Suzuki suggestion, added code to handl the corner case
  for ARMv8 CPU with aarch32 mode.
* According to Suzuki suggestion, changed compatible string to
  "arm,coresight-cpu-debug".
* According to Mathieu suggestion, added "power-domains" as optional
  properties.

Changes from v2:
* According to Mathieu Poirier suggestion, applied some minor fixes.
* Added two extra patches for enabling debug module on Hikey.

Changes from v1:
* According to Mike Leach suggestion, removed the binding for debug
  module clocks which have been directly provided by CPU clocks.
* According to Mathieu Poirier suggestion, added function
  of_coresight_get_cpu() and some minor refactors for debug module
  driver.

Changes from RFC:
* According to Mike Leach suggestion, added check for EDPRSR to avoid
  lockup; added supporting EDVIDSR and EDCIDSR registers.
* According to Mark Rutland and Mathieu Poirier suggestion, rewrote
  the documentation for DT binding.
* According to Mark and Mathieu suggestion, refined debug driver.

Leo Yan (8):
  coresight: bindings for CPU debug module
  doc: Add documentation for Coresight CPU debug
  doc: Add coresight_cpu_debug.enable to kernel-parameters.txt
  MAINTAINERS: update file entries for Coresight subsystem
  coresight: refactor with function of_coresight_get_cpu
  coresight: add support for CPU debug module
  arm64: dts: hi6220: register debug module
  arm64: dts: qcom: msm8916: Add debug unit

Suzuki K Poulose (2):
  coresight: of_get_coresight_platform_data: Add missing of_node_put
  arm64: dts: juno: Add Coresight CPU debug nodes

 Documentation/admin-guide/kernel-parameters.txt    |   7 +
 .../bindings/arm/coresight-cpu-debug.txt           |  49 ++
 Documentation/trace/coresight-cpu-debug.txt        | 174 ++++++
 MAINTAINERS                                        |   2 +
 arch/arm64/boot/dts/arm/juno-base.dtsi             |  54 ++
 arch/arm64/boot/dts/arm/juno-r1.dts                |  24 +
 arch/arm64/boot/dts/arm/juno-r2.dts                |  24 +
 arch/arm64/boot/dts/arm/juno.dts                   |  24 +
 arch/arm64/boot/dts/hisilicon/hi6220.dtsi          |  64 ++
 arch/arm64/boot/dts/qcom/msm8916.dtsi              |  32 +
 drivers/hwtracing/coresight/Kconfig                |  14 +
 drivers/hwtracing/coresight/Makefile               |   1 +
 drivers/hwtracing/coresight/coresight-cpu-debug.c  | 696 +++++++++++++++++++++
 drivers/hwtracing/coresight/of_coresight.c         |  40 +-
 include/linux/coresight.h                          |   3 +
 15 files changed, 1196 insertions(+), 12 deletions(-)
 create mode 100644 Documentation/devicetree/bindings/arm/coresight-cpu-debug.txt
 create mode 100644 Documentation/trace/coresight-cpu-debug.txt
 create mode 100644 drivers/hwtracing/coresight/coresight-cpu-debug.c

-- 
2.7.4

--
To unsubscribe from this list: send the line "unsubscribe devicetree" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Comments

Sudeep Holla May 19, 2017, 5:44 p.m. UTC | #1
Hi Suzuki, Leo,

On 19/05/17 05:25, Leo Yan wrote:
> From: Suzuki K Poulose <suzuki.poulose@arm.com>

> 

> Add Coresight CPU debug nodes for Juno r0, r1 & r2. The CPU

> debug areas are mapped at the same address for all revisions,

> like the ETM, even though the CPUs have changed from r1 to r2.

> 

> Cc: Sudeep Holla <sudeep.holla@arm.com>

> Cc: Leo Yan <leo.yan@linaro.org>

> Cc: Mathieu Poirier <mathieu.porier@linaro.org>

> Cc: Liviu Dudau <liviu.dudau@arm.com>

> Signed-off-by: Suzuki K Poulose <suzuki.poulose@arm.com>


ARM-SoC team expect DTS to be routed via platform tree.
So I have queued this for v4.13[1]

-- 
Regards,
Sudeep

[1] https://git.kernel.org/sudeep.holla/linux/h/for-next/juno-updates
--
To unsubscribe from this list: send the line "unsubscribe devicetree" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Suzuki K Poulose May 19, 2017, 5:53 p.m. UTC | #2
On 19/05/17 18:44, Sudeep Holla wrote:
> Hi Suzuki, Leo,

>

> On 19/05/17 05:25, Leo Yan wrote:

>> From: Suzuki K Poulose <suzuki.poulose@arm.com>

>>

>> Add Coresight CPU debug nodes for Juno r0, r1 & r2. The CPU

>> debug areas are mapped at the same address for all revisions,

>> like the ETM, even though the CPUs have changed from r1 to r2.

>>

>> Cc: Sudeep Holla <sudeep.holla@arm.com>

>> Cc: Leo Yan <leo.yan@linaro.org>

>> Cc: Mathieu Poirier <mathieu.porier@linaro.org>

>> Cc: Liviu Dudau <liviu.dudau@arm.com>

>> Signed-off-by: Suzuki K Poulose <suzuki.poulose@arm.com>

>

> ARM-SoC team expect DTS to be routed via platform tree.

> So I have queued this for v4.13[1]


Sudeep

Thanks for picking that up.

Suzuki

--
To unsubscribe from this list: send the line "unsubscribe devicetree" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Liviu Dudau May 22, 2017, 10:16 a.m. UTC | #3
On Fri, May 19, 2017 at 12:25:49PM +0800, Leo Yan wrote:
> Add detailed documentation for Coresight CPU debug driver, which

> contains the info for driver implementation, Mike Leach excellent

> summary for "clock and power domain". At the end some examples on how

> to enable the debugging functionality are provided.


Hi Leo,

Below are my minor suggestions to improve readability of the documentation.

Thanks,
Liviu

> 

> Suggested-by: Mike Leach <mike.leach@linaro.org>

> Reviewed-by: Mathieu Poirier <mathieu.poirier@linaro.org>

> Signed-off-by: Leo Yan <leo.yan@linaro.org>

> ---

>  Documentation/trace/coresight-cpu-debug.txt | 174 ++++++++++++++++++++++++++++

>  1 file changed, 174 insertions(+)

>  create mode 100644 Documentation/trace/coresight-cpu-debug.txt

> 

> diff --git a/Documentation/trace/coresight-cpu-debug.txt b/Documentation/trace/coresight-cpu-debug.txt

> new file mode 100644

> index 0000000..f0c3f0f

> --- /dev/null

> +++ b/Documentation/trace/coresight-cpu-debug.txt

> @@ -0,0 +1,174 @@

> +		Coresight CPU Debug Module

> +		==========================

> +

> +   Author:   Leo Yan <leo.yan@linaro.org>

> +   Date:     April 5th, 2017

> +

> +Introduction

> +------------

> +

> +Coresight CPU debug module is defined in ARMv8-a architecture reference manual

> +(ARM DDI 0487A.k) Chapter 'Part H: External debug', the CPU can integrate

> +debug module and it is mainly used for two modes: self-hosted debug and

> +external debug. Usually the external debug mode is well known as the external

> +debugger connects with SoC from JTAG port; on the other hand the program can

> +explore debugging method which rely on self-hosted debug mode, this document

> +is to focus on this part.

> +

> +The debug module provides sample-based profiling extension, which can be used

> +to sample CPU program counter, secure state and exception level, etc; usually

> +every CPU has one dedicated debug module to be connected. Based on self-hosted

> +debug mechanism, Linux kernel can access these related registers from mmio

> +region when the kernel panic happens. The callback notifier for kernel panic

> +will dump related registers for every CPU; finally this is good for assistant

> +analysis for panic.

> +

> +

> +Implementation

> +--------------

> +

> +- During driver registration, use EDDEVID and EDDEVID1 two device ID


During driver registration, it uses EDDEVID and EDDEVID1 - two device ID

> +  registers to decide if sample-based profiling is implemented or not. On some

> +  platforms this hardware feature is fully or partialy implemented; and if

> +  this feature is not supported then registration will fail.

> +

> +- When write this doc, the debug driver mainly relies on three sampling

> +  registers. The kernel panic callback notifier gathers info from EDPCSR

> +  EDVIDSR and EDCIDSR; from EDPCSR we can get program counter, EDVIDSR has


At the time this documentation was writen, the debug driver mainly relies on
information gathered by the kernel panic callback notifier from three sampling
registers: EDPCSR, EDVIDSR and EDCIDSR: ....

> +  information for secure state, exception level, bit width, etc; EDCIDSR is

> +  context ID value which contains the sampled value of CONTEXTIDR_EL1.

> +

> +- The driver supports CPU running mode with either AArch64 or AArch32. The


... supports a CPU running in either AArch64 or AArch32 mode. The

> +  registers naming convention is a bit different between them, AArch64 uses

> +  'ED' for register prefix (ARM DDI 0487A.k, chapter H9.1) and AArch32 uses

> +  'DBG' as prefix (ARM DDI 0487A.k, chapter G5.1). The driver is unified to

> +  use AArch64 naming convention.

> +

> +- ARMv8-a (ARM DDI 0487A.k) and ARMv7-a (ARM DDI 0406C.b) have different

> +  register bits definition. So the driver consolidates two difference:

> +

> +  If PCSROffset=0b0000, on ARMv8-a the feature of EDPCSR is not implemented;

> +  but ARMv7-a defines "PCSR samples are offset by a value that depends on the

> +  instruction set state". For ARMv7-a, the driver checks furthermore if CPU

> +  runs with ARM or thumb instruction set and calibrate PCSR value, the

> +  detailed description for offset is in ARMv7-a ARM (ARM DDI 0406C.b) chapter

> +  C11.11.34 "DBGPCSR, Program Counter Sampling Register".

> +

> +  If PCSROffset=0b0010, ARMv8-a defines "EDPCSR implemented, and samples have

> +  no offset applied and do not sample the instruction set state in AArch32

> +  state". So on ARMv8 if EDDEVID1.PCSROffset is 0b0010 and the CPU operates

> +  in AArch32 state, EDPCSR is not sampled; when the CPU operates in AArch64

> +  state EDPCSR is sampled and no offset are applied.

> +

> +

> +Clock and power domain

> +----------------------

> +

> +Before accessing debug registers, we should ensure the clock and power domain

> +have been enabled properly. In ARMv8-a ARM (ARM DDI 0487A.k) chapter 'H9.1

> +Debug registers', the debug registers are spread into two domains: the debug

> +domain and the CPU domain.

> +

> +                                +---------------+

> +                                |               |

> +                                |               |

> +                     +----------+--+            |

> +          dbg_clk -->|          |**|            |<-- cpu_clk

> +                     |    Debug |**|   CPU      |

> +           dbg_pd -->|          |**|            |<-- cpu_pd

> +                     +----------+--+            |

> +                                |               |

> +                                |               |

> +                                +---------------+

> +

> +For debug domain, the user uses DT binding "clocks" and "power-domains" to

> +specify the corresponding clock source and power supply for the debug logic.

> +The driver calls the pm_runtime_{put|get} operations as needed to handle the

> +debug power domain.

> +

> +For CPU domain, the different SoC designs have different power management

> +schemes and finally this heavily impacts external debug module. So we can

> +divide into below cases:

> +

> +- On systems with a sane power controller which can behave correctly with

> +  respect to CPU power domain, the CPU power domain can be controlled by

> +  register EDPRCR in driver. The driver firstly writes bit EDPRCR.COREPURQ

> +  to power up the CPU, and then writes bit EDPRCR.CORENPDRQ for emulation

> +  of CPU power down. As result, this can ensure the CPU power domain is

> +  powered on properly during the period when access debug related registers;

> +

> +- Some designs will power down an entire cluster if all CPUs on the cluster

> +  are powered down - including the parts of the debug registers that should

> +  remain powered in the debug power domain. The bits in EDPRCR are not

> +  respected in these cases, so these designs do not support debug over

> +  power down in the way that the CoreSight / Debug designers anticipated.

> +  This means that even checking EDPRSR has the potential to cause a bus hang

> +  if the target register is unpowered.

> +

> +  In this case, accessing to the debug registers while they are not powered

> +  is a recipe for disaster; so we need preventing CPU low power states at boot

> +  time or when user enable module at the run time. Please see chapter

> +  "How to use the module" for detailed usage info for this.

> +

> +

> +Device Tree Bindings

> +--------------------

> +

> +See Documentation/devicetree/bindings/arm/coresight-cpu-debug.txt for details.

> +

> +

> +How to use the module

> +---------------------

> +

> +If you want to enable debugging functionality at boot time, you can add

> +"coresight_cpu_debug.enable=1" to the kernel command line parameter.

> +

> +The driver also can work as module, so can enable the debugging when insmod

> +module:

> +# insmod coresight_cpu_debug.ko debug=1

> +

> +When boot time or insmod module you have not enabled the debugging, the driver

> +uses the debugfs file system to provide a knob to dynamically enable or disable

> +debugging:

> +

> +To enable it, write a '1' into /sys/kernel/debug/coresight_cpu_debug/enable:

> +# echo 1 > /sys/kernel/debug/coresight_cpu_debug/enable

> +

> +To disable it, write a '0' into /sys/kernel/debug/coresight_cpu_debug/enable:

> +# echo 0 > /sys/kernel/debug/coresight_cpu_debug/enable

> +

> +As explained in chapter "Clock and power domain", if you are working on one

> +platform which has idle states to power off debug logic and the power

> +controller cannot work well for the request from EDPRCR, then you should

> +firstly constraint CPU idle states before enable CPU debugging feature; so can

> +ensure the accessing to debug logic.

> +

> +If you want to limit idle states at boot time, you can use "nohlt" or

> +"cpuidle.off=1" in the kernel command line.

> +

> +At the runtime you can disable idle states with below methods:

> +

> +Set latency request to /dev/cpu_dma_latency to disable all CPUs specific idle

> +states (if latency = 0uS then disable all idle states):

> +# echo "what_ever_latency_you_need_in_uS" > /dev/cpu_dma_latency

> +

> +Disable specific CPU's specific idle state:

> +# echo 1 > /sys/devices/system/cpu/cpu$cpu/cpuidle/state$state/disable

> +

> +

> +Output format

> +-------------

> +

> +Here is an example of the debugging output format:

> +

> +ARM external debug module:

> +coresight-cpu-debug 850000.debug: CPU[0]:

> +coresight-cpu-debug 850000.debug:  EDPRSR:  00000001 (Power:On DLK:Unlock)

> +coresight-cpu-debug 850000.debug:  EDPCSR:  [<ffff00000808e9bc>] handle_IPI+0x174/0x1d8

> +coresight-cpu-debug 850000.debug:  EDCIDSR: 00000000

> +coresight-cpu-debug 850000.debug:  EDVIDSR: 90000000 (State:Non-secure Mode:EL1/0 Width:64bits VMID:0)

> +coresight-cpu-debug 852000.debug: CPU[1]:

> +coresight-cpu-debug 852000.debug:  EDPRSR:  00000001 (Power:On DLK:Unlock)

> +coresight-cpu-debug 852000.debug:  EDPCSR:  [<ffff0000087fab34>] debug_notifier_call+0x23c/0x358

> +coresight-cpu-debug 852000.debug:  EDCIDSR: 00000000

> +coresight-cpu-debug 852000.debug:  EDVIDSR: 90000000 (State:Non-secure Mode:EL1/0 Width:64bits VMID:0)

> -- 

> 2.7.4

> 


-- 
====================
| I would like to |
| fix the world,  |
| but they're not |
| giving me the   |
 \ source code!  /
  ---------------
    ¯\_(ツ)_/¯
--
To unsubscribe from this list: send the line "unsubscribe devicetree" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html