diff mbox series

PCI/ACPI: do not reference a pci device after it has been released

Message ID 20220428142854.1065953-1-gregkh@linuxfoundation.org
State New
Headers show
Series PCI/ACPI: do not reference a pci device after it has been released | expand

Commit Message

Greg Kroah-Hartman April 28, 2022, 2:28 p.m. UTC
In acpi_get_pci_dev(), the debugging message for when a PCI bridge is
not found uses a pointer to a pci device whose reference has just been
dropped.  The chance that this really is a device that is now been
removed from the system is almost impossible to happen, but to be safe,
let's print out the debugging message based on the acpi root device
which we do have a valid reference to at the moment.

Cc: Bjorn Helgaas <bhelgaas@google.com>
Cc: "Rafael J. Wysocki" <rafael@kernel.org>
Cc: Len Brown <lenb@kernel.org>
Cc: linux-pci@vger.kernel.org
Cc: linux-acpi@vger.kernel.org
Reported-by: whitehat002 <hackyzh002@gmail.com>
Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
---
 drivers/acpi/pci_root.c | 3 ++-
 1 file changed, 2 insertions(+), 1 deletion(-)

Comments

Bjorn Helgaas April 28, 2022, 3:58 p.m. UTC | #1
On Thu, Apr 28, 2022 at 04:28:53PM +0200, Greg Kroah-Hartman wrote:
> In acpi_get_pci_dev(), the debugging message for when a PCI bridge is
> not found uses a pointer to a pci device whose reference has just been
> dropped.  The chance that this really is a device that is now been
> removed from the system is almost impossible to happen, but to be safe,
> let's print out the debugging message based on the acpi root device
> which we do have a valid reference to at the moment.

This code was added by 497fb54f578e ("ACPI / PCI: Fix NULL pointer
dereference in acpi_get_pci_dev() (rev. 2)").  Not sure if it's worth
a Fixes: tag.

acpi_get_pci_dev() is used by only five callers, three of which are
video/backlight related.  I'm always skeptical of one-off interfaces
like this, but I don't know enough to propose any refactoring or other
alternatives.

I'll leave this for Rafael, but if I were applying I would silently
touch up the subject to match convention:

  PCI/ACPI: Do not reference PCI device after it has been released

> Cc: Bjorn Helgaas <bhelgaas@google.com>
> Cc: "Rafael J. Wysocki" <rafael@kernel.org>
> Cc: Len Brown <lenb@kernel.org>
> Cc: linux-pci@vger.kernel.org
> Cc: linux-acpi@vger.kernel.org
> Reported-by: whitehat002 <hackyzh002@gmail.com>
> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
> ---
>  drivers/acpi/pci_root.c | 3 ++-
>  1 file changed, 2 insertions(+), 1 deletion(-)
> 
> diff --git a/drivers/acpi/pci_root.c b/drivers/acpi/pci_root.c
> index 6f9e75d14808..ecda378dbc09 100644
> --- a/drivers/acpi/pci_root.c
> +++ b/drivers/acpi/pci_root.c
> @@ -303,7 +303,8 @@ struct pci_dev *acpi_get_pci_dev(acpi_handle handle)
>  		 * case pdev->subordinate will be NULL for the parent.
>  		 */
>  		if (!pbus) {
> -			dev_dbg(&pdev->dev, "Not a PCI-to-PCI bridge\n");
> +			dev_dbg(&root->device->dev,
> +				"dev %d, function %d is not a PCI-to-PCI bridge\n", dev, fn);

This should use "%02x.%d" to be consistent with the dev_set_name() in
pci_setup_device().

>  			pdev = NULL;
>  			break;
>  		}
> -- 
> 2.36.0
>
Rafael J. Wysocki April 28, 2022, 8:30 p.m. UTC | #2
On Thu, Apr 28, 2022 at 10:15 PM Rafael J. Wysocki <rafael@kernel.org> wrote:
>
> On Thu, Apr 28, 2022 at 6:22 PM Greg Kroah-Hartman
> <gregkh@linuxfoundation.org> wrote:
> >
> > On Thu, Apr 28, 2022 at 10:58:58AM -0500, Bjorn Helgaas wrote:
> > > On Thu, Apr 28, 2022 at 04:28:53PM +0200, Greg Kroah-Hartman wrote:
> > > > In acpi_get_pci_dev(), the debugging message for when a PCI bridge is
> > > > not found uses a pointer to a pci device whose reference has just been
> > > > dropped.  The chance that this really is a device that is now been
> > > > removed from the system is almost impossible to happen, but to be safe,
> > > > let's print out the debugging message based on the acpi root device
> > > > which we do have a valid reference to at the moment.
> > >
> > > This code was added by 497fb54f578e ("ACPI / PCI: Fix NULL pointer
> > > dereference in acpi_get_pci_dev() (rev. 2)").  Not sure if it's worth
> > > a Fixes: tag.
> >
> > Can't hurt, I'll add it for the v2 based on this review.
> >
> > >
> > > acpi_get_pci_dev() is used by only five callers, three of which are
> > > video/backlight related.  I'm always skeptical of one-off interfaces
> > > like this, but I don't know enough to propose any refactoring or other
> > > alternatives.
> > >
> > > I'll leave this for Rafael, but if I were applying I would silently
> > > touch up the subject to match convention:
> > >
> > >   PCI/ACPI: Do not reference PCI device after it has been released
> >
> > Much simpler, thanks.
> >
> > >
> > > > Cc: Bjorn Helgaas <bhelgaas@google.com>
> > > > Cc: "Rafael J. Wysocki" <rafael@kernel.org>
> > > > Cc: Len Brown <lenb@kernel.org>
> > > > Cc: linux-pci@vger.kernel.org
> > > > Cc: linux-acpi@vger.kernel.org
> > > > Reported-by: whitehat002 <hackyzh002@gmail.com>
> > > > Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
> > > > ---
> > > >  drivers/acpi/pci_root.c | 3 ++-
> > > >  1 file changed, 2 insertions(+), 1 deletion(-)
> > > >
> > > > diff --git a/drivers/acpi/pci_root.c b/drivers/acpi/pci_root.c
> > > > index 6f9e75d14808..ecda378dbc09 100644
> > > > --- a/drivers/acpi/pci_root.c
> > > > +++ b/drivers/acpi/pci_root.c
> > > > @@ -303,7 +303,8 @@ struct pci_dev *acpi_get_pci_dev(acpi_handle handle)
> > > >              * case pdev->subordinate will be NULL for the parent.
> > > >              */
> > > >             if (!pbus) {
> > > > -                   dev_dbg(&pdev->dev, "Not a PCI-to-PCI bridge\n");
> > > > +                   dev_dbg(&root->device->dev,
> > > > +                           "dev %d, function %d is not a PCI-to-PCI bridge\n", dev, fn);
> > >
> > > This should use "%02x.%d" to be consistent with the dev_set_name() in
> > > pci_setup_device().
> >
> > Ah, missed that, will change it and send out a new version tomorrow.
>
> I would make the change below (modulo the gmail-induced wthite space
> breakage), though.

That said ->

> ---
>  drivers/acpi/pci_root.c |    5 +++--
>  1 file changed, 3 insertions(+), 2 deletions(-)
>
> Index: linux-pm/drivers/acpi/pci_root.c
> ===================================================================
> --- linux-pm.orig/drivers/acpi/pci_root.c
> +++ linux-pm/drivers/acpi/pci_root.c
> @@ -295,8 +295,6 @@ struct pci_dev *acpi_get_pci_dev(acpi_ha
>              break;
>
>          pbus = pdev->subordinate;
> -        pci_dev_put(pdev);
> -
>          /*
>           * This function may be called for a non-PCI device that has a
>           * PCI parent (eg. a disk under a PCI SATA controller).  In that
> @@ -304,9 +302,12 @@ struct pci_dev *acpi_get_pci_dev(acpi_ha
>           */
>          if (!pbus) {
>              dev_dbg(&pdev->dev, "Not a PCI-to-PCI bridge\n");
> +            pci_dev_put(pdev);
>              pdev = NULL;
>              break;
>          }
> +
> +        pci_dev_put(pdev);

-> we are going to use pbus after this and it is pdev->subordinate
which cannot survive without pdev AFAICS.

Are we not concerned about this case?

>      }
>  out:
>      list_for_each_entry_safe(node, tmp, &device_list, node)
Greg Kroah-Hartman June 27, 2022, 3:07 p.m. UTC | #3
On Thu, Apr 28, 2022 at 10:30:38PM +0200, Rafael J. Wysocki wrote:
> On Thu, Apr 28, 2022 at 10:15 PM Rafael J. Wysocki <rafael@kernel.org> wrote:
> >
> > On Thu, Apr 28, 2022 at 6:22 PM Greg Kroah-Hartman
> > <gregkh@linuxfoundation.org> wrote:
> > >
> > > On Thu, Apr 28, 2022 at 10:58:58AM -0500, Bjorn Helgaas wrote:
> > > > On Thu, Apr 28, 2022 at 04:28:53PM +0200, Greg Kroah-Hartman wrote:
> > > > > In acpi_get_pci_dev(), the debugging message for when a PCI bridge is
> > > > > not found uses a pointer to a pci device whose reference has just been
> > > > > dropped.  The chance that this really is a device that is now been
> > > > > removed from the system is almost impossible to happen, but to be safe,
> > > > > let's print out the debugging message based on the acpi root device
> > > > > which we do have a valid reference to at the moment.
> > > >
> > > > This code was added by 497fb54f578e ("ACPI / PCI: Fix NULL pointer
> > > > dereference in acpi_get_pci_dev() (rev. 2)").  Not sure if it's worth
> > > > a Fixes: tag.
> > >
> > > Can't hurt, I'll add it for the v2 based on this review.
> > >
> > > >
> > > > acpi_get_pci_dev() is used by only five callers, three of which are
> > > > video/backlight related.  I'm always skeptical of one-off interfaces
> > > > like this, but I don't know enough to propose any refactoring or other
> > > > alternatives.
> > > >
> > > > I'll leave this for Rafael, but if I were applying I would silently
> > > > touch up the subject to match convention:
> > > >
> > > >   PCI/ACPI: Do not reference PCI device after it has been released
> > >
> > > Much simpler, thanks.
> > >
> > > >
> > > > > Cc: Bjorn Helgaas <bhelgaas@google.com>
> > > > > Cc: "Rafael J. Wysocki" <rafael@kernel.org>
> > > > > Cc: Len Brown <lenb@kernel.org>
> > > > > Cc: linux-pci@vger.kernel.org
> > > > > Cc: linux-acpi@vger.kernel.org
> > > > > Reported-by: whitehat002 <hackyzh002@gmail.com>
> > > > > Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
> > > > > ---
> > > > >  drivers/acpi/pci_root.c | 3 ++-
> > > > >  1 file changed, 2 insertions(+), 1 deletion(-)
> > > > >
> > > > > diff --git a/drivers/acpi/pci_root.c b/drivers/acpi/pci_root.c
> > > > > index 6f9e75d14808..ecda378dbc09 100644
> > > > > --- a/drivers/acpi/pci_root.c
> > > > > +++ b/drivers/acpi/pci_root.c
> > > > > @@ -303,7 +303,8 @@ struct pci_dev *acpi_get_pci_dev(acpi_handle handle)
> > > > >              * case pdev->subordinate will be NULL for the parent.
> > > > >              */
> > > > >             if (!pbus) {
> > > > > -                   dev_dbg(&pdev->dev, "Not a PCI-to-PCI bridge\n");
> > > > > +                   dev_dbg(&root->device->dev,
> > > > > +                           "dev %d, function %d is not a PCI-to-PCI bridge\n", dev, fn);
> > > >
> > > > This should use "%02x.%d" to be consistent with the dev_set_name() in
> > > > pci_setup_device().
> > >
> > > Ah, missed that, will change it and send out a new version tomorrow.
> >
> > I would make the change below (modulo the gmail-induced wthite space
> > breakage), though.
> 
> That said ->
> 
> > ---
> >  drivers/acpi/pci_root.c |    5 +++--
> >  1 file changed, 3 insertions(+), 2 deletions(-)
> >
> > Index: linux-pm/drivers/acpi/pci_root.c
> > ===================================================================
> > --- linux-pm.orig/drivers/acpi/pci_root.c
> > +++ linux-pm/drivers/acpi/pci_root.c
> > @@ -295,8 +295,6 @@ struct pci_dev *acpi_get_pci_dev(acpi_ha
> >              break;
> >
> >          pbus = pdev->subordinate;
> > -        pci_dev_put(pdev);
> > -
> >          /*
> >           * This function may be called for a non-PCI device that has a
> >           * PCI parent (eg. a disk under a PCI SATA controller).  In that
> > @@ -304,9 +302,12 @@ struct pci_dev *acpi_get_pci_dev(acpi_ha
> >           */
> >          if (!pbus) {
> >              dev_dbg(&pdev->dev, "Not a PCI-to-PCI bridge\n");
> > +            pci_dev_put(pdev);
> >              pdev = NULL;
> >              break;
> >          }
> > +
> > +        pci_dev_put(pdev);
> 
> -> we are going to use pbus after this and it is pdev->subordinate
> which cannot survive without pdev AFAICS.
> 
> Are we not concerned about this case?

Good point.

whitehat002, any ideas?  You found this issue but it really looks like
it is not anything that can ever be hit, so how far do you want to go to
unwind it?

thanks,

greg k-h
Rafael J. Wysocki June 27, 2022, 4:37 p.m. UTC | #4
On Mon, Jun 27, 2022 at 5:07 PM Greg Kroah-Hartman
<gregkh@linuxfoundation.org> wrote:
>
> On Thu, Apr 28, 2022 at 10:30:38PM +0200, Rafael J. Wysocki wrote:
> > On Thu, Apr 28, 2022 at 10:15 PM Rafael J. Wysocki <rafael@kernel.org> wrote:
> > >
> > > On Thu, Apr 28, 2022 at 6:22 PM Greg Kroah-Hartman
> > > <gregkh@linuxfoundation.org> wrote:
> > > >
> > > > On Thu, Apr 28, 2022 at 10:58:58AM -0500, Bjorn Helgaas wrote:
> > > > > On Thu, Apr 28, 2022 at 04:28:53PM +0200, Greg Kroah-Hartman wrote:
> > > > > > In acpi_get_pci_dev(), the debugging message for when a PCI bridge is
> > > > > > not found uses a pointer to a pci device whose reference has just been
> > > > > > dropped.  The chance that this really is a device that is now been
> > > > > > removed from the system is almost impossible to happen, but to be safe,
> > > > > > let's print out the debugging message based on the acpi root device
> > > > > > which we do have a valid reference to at the moment.
> > > > >
> > > > > This code was added by 497fb54f578e ("ACPI / PCI: Fix NULL pointer
> > > > > dereference in acpi_get_pci_dev() (rev. 2)").  Not sure if it's worth
> > > > > a Fixes: tag.
> > > >
> > > > Can't hurt, I'll add it for the v2 based on this review.
> > > >
> > > > >
> > > > > acpi_get_pci_dev() is used by only five callers, three of which are
> > > > > video/backlight related.  I'm always skeptical of one-off interfaces
> > > > > like this, but I don't know enough to propose any refactoring or other
> > > > > alternatives.
> > > > >
> > > > > I'll leave this for Rafael, but if I were applying I would silently
> > > > > touch up the subject to match convention:
> > > > >
> > > > >   PCI/ACPI: Do not reference PCI device after it has been released
> > > >
> > > > Much simpler, thanks.
> > > >
> > > > >
> > > > > > Cc: Bjorn Helgaas <bhelgaas@google.com>
> > > > > > Cc: "Rafael J. Wysocki" <rafael@kernel.org>
> > > > > > Cc: Len Brown <lenb@kernel.org>
> > > > > > Cc: linux-pci@vger.kernel.org
> > > > > > Cc: linux-acpi@vger.kernel.org
> > > > > > Reported-by: whitehat002 <hackyzh002@gmail.com>
> > > > > > Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
> > > > > > ---
> > > > > >  drivers/acpi/pci_root.c | 3 ++-
> > > > > >  1 file changed, 2 insertions(+), 1 deletion(-)
> > > > > >
> > > > > > diff --git a/drivers/acpi/pci_root.c b/drivers/acpi/pci_root.c
> > > > > > index 6f9e75d14808..ecda378dbc09 100644
> > > > > > --- a/drivers/acpi/pci_root.c
> > > > > > +++ b/drivers/acpi/pci_root.c
> > > > > > @@ -303,7 +303,8 @@ struct pci_dev *acpi_get_pci_dev(acpi_handle handle)
> > > > > >              * case pdev->subordinate will be NULL for the parent.
> > > > > >              */
> > > > > >             if (!pbus) {
> > > > > > -                   dev_dbg(&pdev->dev, "Not a PCI-to-PCI bridge\n");
> > > > > > +                   dev_dbg(&root->device->dev,
> > > > > > +                           "dev %d, function %d is not a PCI-to-PCI bridge\n", dev, fn);
> > > > >
> > > > > This should use "%02x.%d" to be consistent with the dev_set_name() in
> > > > > pci_setup_device().
> > > >
> > > > Ah, missed that, will change it and send out a new version tomorrow.
> > >
> > > I would make the change below (modulo the gmail-induced wthite space
> > > breakage), though.
> >
> > That said ->
> >
> > > ---
> > >  drivers/acpi/pci_root.c |    5 +++--
> > >  1 file changed, 3 insertions(+), 2 deletions(-)
> > >
> > > Index: linux-pm/drivers/acpi/pci_root.c
> > > ===================================================================
> > > --- linux-pm.orig/drivers/acpi/pci_root.c
> > > +++ linux-pm/drivers/acpi/pci_root.c
> > > @@ -295,8 +295,6 @@ struct pci_dev *acpi_get_pci_dev(acpi_ha
> > >              break;
> > >
> > >          pbus = pdev->subordinate;
> > > -        pci_dev_put(pdev);
> > > -
> > >          /*
> > >           * This function may be called for a non-PCI device that has a
> > >           * PCI parent (eg. a disk under a PCI SATA controller).  In that
> > > @@ -304,9 +302,12 @@ struct pci_dev *acpi_get_pci_dev(acpi_ha
> > >           */
> > >          if (!pbus) {
> > >              dev_dbg(&pdev->dev, "Not a PCI-to-PCI bridge\n");
> > > +            pci_dev_put(pdev);
> > >              pdev = NULL;
> > >              break;
> > >          }
> > > +
> > > +        pci_dev_put(pdev);
> >
> > -> we are going to use pbus after this and it is pdev->subordinate
> > which cannot survive without pdev AFAICS.
> >
> > Are we not concerned about this case?
>
> Good point.
>
> whitehat002, any ideas?  You found this issue but it really looks like
> it is not anything that can ever be hit, so how far do you want to go to
> unwind it?

I have an idea, sorry for the delay here.

I should be ready to post something tomorrow.

Cheers!
Greg Kroah-Hartman Sept. 9, 2022, 7:42 a.m. UTC | #5
On Mon, Jun 27, 2022 at 06:37:06PM +0200, Rafael J. Wysocki wrote:
> On Mon, Jun 27, 2022 at 5:07 PM Greg Kroah-Hartman
> <gregkh@linuxfoundation.org> wrote:
> >
> > On Thu, Apr 28, 2022 at 10:30:38PM +0200, Rafael J. Wysocki wrote:
> > > On Thu, Apr 28, 2022 at 10:15 PM Rafael J. Wysocki <rafael@kernel.org> wrote:
> > > >
> > > > On Thu, Apr 28, 2022 at 6:22 PM Greg Kroah-Hartman
> > > > <gregkh@linuxfoundation.org> wrote:
> > > > >
> > > > > On Thu, Apr 28, 2022 at 10:58:58AM -0500, Bjorn Helgaas wrote:
> > > > > > On Thu, Apr 28, 2022 at 04:28:53PM +0200, Greg Kroah-Hartman wrote:
> > > > > > > In acpi_get_pci_dev(), the debugging message for when a PCI bridge is
> > > > > > > not found uses a pointer to a pci device whose reference has just been
> > > > > > > dropped.  The chance that this really is a device that is now been
> > > > > > > removed from the system is almost impossible to happen, but to be safe,
> > > > > > > let's print out the debugging message based on the acpi root device
> > > > > > > which we do have a valid reference to at the moment.
> > > > > >
> > > > > > This code was added by 497fb54f578e ("ACPI / PCI: Fix NULL pointer
> > > > > > dereference in acpi_get_pci_dev() (rev. 2)").  Not sure if it's worth
> > > > > > a Fixes: tag.
> > > > >
> > > > > Can't hurt, I'll add it for the v2 based on this review.
> > > > >
> > > > > >
> > > > > > acpi_get_pci_dev() is used by only five callers, three of which are
> > > > > > video/backlight related.  I'm always skeptical of one-off interfaces
> > > > > > like this, but I don't know enough to propose any refactoring or other
> > > > > > alternatives.
> > > > > >
> > > > > > I'll leave this for Rafael, but if I were applying I would silently
> > > > > > touch up the subject to match convention:
> > > > > >
> > > > > >   PCI/ACPI: Do not reference PCI device after it has been released
> > > > >
> > > > > Much simpler, thanks.
> > > > >
> > > > > >
> > > > > > > Cc: Bjorn Helgaas <bhelgaas@google.com>
> > > > > > > Cc: "Rafael J. Wysocki" <rafael@kernel.org>
> > > > > > > Cc: Len Brown <lenb@kernel.org>
> > > > > > > Cc: linux-pci@vger.kernel.org
> > > > > > > Cc: linux-acpi@vger.kernel.org
> > > > > > > Reported-by: whitehat002 <hackyzh002@gmail.com>
> > > > > > > Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
> > > > > > > ---
> > > > > > >  drivers/acpi/pci_root.c | 3 ++-
> > > > > > >  1 file changed, 2 insertions(+), 1 deletion(-)
> > > > > > >
> > > > > > > diff --git a/drivers/acpi/pci_root.c b/drivers/acpi/pci_root.c
> > > > > > > index 6f9e75d14808..ecda378dbc09 100644
> > > > > > > --- a/drivers/acpi/pci_root.c
> > > > > > > +++ b/drivers/acpi/pci_root.c
> > > > > > > @@ -303,7 +303,8 @@ struct pci_dev *acpi_get_pci_dev(acpi_handle handle)
> > > > > > >              * case pdev->subordinate will be NULL for the parent.
> > > > > > >              */
> > > > > > >             if (!pbus) {
> > > > > > > -                   dev_dbg(&pdev->dev, "Not a PCI-to-PCI bridge\n");
> > > > > > > +                   dev_dbg(&root->device->dev,
> > > > > > > +                           "dev %d, function %d is not a PCI-to-PCI bridge\n", dev, fn);
> > > > > >
> > > > > > This should use "%02x.%d" to be consistent with the dev_set_name() in
> > > > > > pci_setup_device().
> > > > >
> > > > > Ah, missed that, will change it and send out a new version tomorrow.
> > > >
> > > > I would make the change below (modulo the gmail-induced wthite space
> > > > breakage), though.
> > >
> > > That said ->
> > >
> > > > ---
> > > >  drivers/acpi/pci_root.c |    5 +++--
> > > >  1 file changed, 3 insertions(+), 2 deletions(-)
> > > >
> > > > Index: linux-pm/drivers/acpi/pci_root.c
> > > > ===================================================================
> > > > --- linux-pm.orig/drivers/acpi/pci_root.c
> > > > +++ linux-pm/drivers/acpi/pci_root.c
> > > > @@ -295,8 +295,6 @@ struct pci_dev *acpi_get_pci_dev(acpi_ha
> > > >              break;
> > > >
> > > >          pbus = pdev->subordinate;
> > > > -        pci_dev_put(pdev);
> > > > -
> > > >          /*
> > > >           * This function may be called for a non-PCI device that has a
> > > >           * PCI parent (eg. a disk under a PCI SATA controller).  In that
> > > > @@ -304,9 +302,12 @@ struct pci_dev *acpi_get_pci_dev(acpi_ha
> > > >           */
> > > >          if (!pbus) {
> > > >              dev_dbg(&pdev->dev, "Not a PCI-to-PCI bridge\n");
> > > > +            pci_dev_put(pdev);
> > > >              pdev = NULL;
> > > >              break;
> > > >          }
> > > > +
> > > > +        pci_dev_put(pdev);
> > >
> > > -> we are going to use pbus after this and it is pdev->subordinate
> > > which cannot survive without pdev AFAICS.
> > >
> > > Are we not concerned about this case?
> >
> > Good point.
> >
> > whitehat002, any ideas?  You found this issue but it really looks like
> > it is not anything that can ever be hit, so how far do you want to go to
> > unwind it?
> 
> I have an idea, sorry for the delay here.
> 
> I should be ready to post something tomorrow.

Was this ever posted?

thanks,

greg k-h
Rafael J. Wysocki Sept. 9, 2022, 9:18 p.m. UTC | #6
On Friday, September 9, 2022 9:42:53 AM CEST Greg Kroah-Hartman wrote:
> On Mon, Jun 27, 2022 at 06:37:06PM +0200, Rafael J. Wysocki wrote:
> > On Mon, Jun 27, 2022 at 5:07 PM Greg Kroah-Hartman
> > <gregkh@linuxfoundation.org> wrote:
> > >
> > > On Thu, Apr 28, 2022 at 10:30:38PM +0200, Rafael J. Wysocki wrote:
> > > > On Thu, Apr 28, 2022 at 10:15 PM Rafael J. Wysocki <rafael@kernel.org> wrote:
> > > > >
> > > > > On Thu, Apr 28, 2022 at 6:22 PM Greg Kroah-Hartman
> > > > > <gregkh@linuxfoundation.org> wrote:
> > > > > >
> > > > > > On Thu, Apr 28, 2022 at 10:58:58AM -0500, Bjorn Helgaas wrote:
> > > > > > > On Thu, Apr 28, 2022 at 04:28:53PM +0200, Greg Kroah-Hartman wrote:
> > > > > > > > In acpi_get_pci_dev(), the debugging message for when a PCI bridge is
> > > > > > > > not found uses a pointer to a pci device whose reference has just been
> > > > > > > > dropped.  The chance that this really is a device that is now been
> > > > > > > > removed from the system is almost impossible to happen, but to be safe,
> > > > > > > > let's print out the debugging message based on the acpi root device
> > > > > > > > which we do have a valid reference to at the moment.
> > > > > > >
> > > > > > > This code was added by 497fb54f578e ("ACPI / PCI: Fix NULL pointer
> > > > > > > dereference in acpi_get_pci_dev() (rev. 2)").  Not sure if it's worth
> > > > > > > a Fixes: tag.
> > > > > >
> > > > > > Can't hurt, I'll add it for the v2 based on this review.
> > > > > >
> > > > > > >
> > > > > > > acpi_get_pci_dev() is used by only five callers, three of which are
> > > > > > > video/backlight related.  I'm always skeptical of one-off interfaces
> > > > > > > like this, but I don't know enough to propose any refactoring or other
> > > > > > > alternatives.
> > > > > > >
> > > > > > > I'll leave this for Rafael, but if I were applying I would silently
> > > > > > > touch up the subject to match convention:
> > > > > > >
> > > > > > >   PCI/ACPI: Do not reference PCI device after it has been released
> > > > > >
> > > > > > Much simpler, thanks.
> > > > > >
> > > > > > >
> > > > > > > > Cc: Bjorn Helgaas <bhelgaas@google.com>
> > > > > > > > Cc: "Rafael J. Wysocki" <rafael@kernel.org>
> > > > > > > > Cc: Len Brown <lenb@kernel.org>
> > > > > > > > Cc: linux-pci@vger.kernel.org
> > > > > > > > Cc: linux-acpi@vger.kernel.org
> > > > > > > > Reported-by: whitehat002 <hackyzh002@gmail.com>
> > > > > > > > Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
> > > > > > > > ---
> > > > > > > >  drivers/acpi/pci_root.c | 3 ++-
> > > > > > > >  1 file changed, 2 insertions(+), 1 deletion(-)
> > > > > > > >
> > > > > > > > diff --git a/drivers/acpi/pci_root.c b/drivers/acpi/pci_root.c
> > > > > > > > index 6f9e75d14808..ecda378dbc09 100644
> > > > > > > > --- a/drivers/acpi/pci_root.c
> > > > > > > > +++ b/drivers/acpi/pci_root.c
> > > > > > > > @@ -303,7 +303,8 @@ struct pci_dev *acpi_get_pci_dev(acpi_handle handle)
> > > > > > > >              * case pdev->subordinate will be NULL for the parent.
> > > > > > > >              */
> > > > > > > >             if (!pbus) {
> > > > > > > > -                   dev_dbg(&pdev->dev, "Not a PCI-to-PCI bridge\n");
> > > > > > > > +                   dev_dbg(&root->device->dev,
> > > > > > > > +                           "dev %d, function %d is not a PCI-to-PCI bridge\n", dev, fn);
> > > > > > >
> > > > > > > This should use "%02x.%d" to be consistent with the dev_set_name() in
> > > > > > > pci_setup_device().
> > > > > >
> > > > > > Ah, missed that, will change it and send out a new version tomorrow.
> > > > >
> > > > > I would make the change below (modulo the gmail-induced wthite space
> > > > > breakage), though.
> > > >
> > > > That said ->
> > > >
> > > > > ---
> > > > >  drivers/acpi/pci_root.c |    5 +++--
> > > > >  1 file changed, 3 insertions(+), 2 deletions(-)
> > > > >
> > > > > Index: linux-pm/drivers/acpi/pci_root.c
> > > > > ===================================================================
> > > > > --- linux-pm.orig/drivers/acpi/pci_root.c
> > > > > +++ linux-pm/drivers/acpi/pci_root.c
> > > > > @@ -295,8 +295,6 @@ struct pci_dev *acpi_get_pci_dev(acpi_ha
> > > > >              break;
> > > > >
> > > > >          pbus = pdev->subordinate;
> > > > > -        pci_dev_put(pdev);
> > > > > -
> > > > >          /*
> > > > >           * This function may be called for a non-PCI device that has a
> > > > >           * PCI parent (eg. a disk under a PCI SATA controller).  In that
> > > > > @@ -304,9 +302,12 @@ struct pci_dev *acpi_get_pci_dev(acpi_ha
> > > > >           */
> > > > >          if (!pbus) {
> > > > >              dev_dbg(&pdev->dev, "Not a PCI-to-PCI bridge\n");
> > > > > +            pci_dev_put(pdev);
> > > > >              pdev = NULL;
> > > > >              break;
> > > > >          }
> > > > > +
> > > > > +        pci_dev_put(pdev);
> > > >
> > > > -> we are going to use pbus after this and it is pdev->subordinate
> > > > which cannot survive without pdev AFAICS.
> > > >
> > > > Are we not concerned about this case?
> > >
> > > Good point.
> > >
> > > whitehat002, any ideas?  You found this issue but it really looks like
> > > it is not anything that can ever be hit, so how far do you want to go to
> > > unwind it?
> > 
> > I have an idea, sorry for the delay here.
> > 
> > I should be ready to post something tomorrow.
> 
> Was this ever posted?

No, it wasn't.  Sorry for the glacial pace here.

So the idea is based on the observation that the PCI device returned by the current
code in acpi_get_pci_dev() needs to be registered, so if it corresponds to an ACPI
device object, the struct acpi_device representing it must be registered too and,
moreover, it should be the ACPI companion of that PCI device.  Thus it should be
sufficient to look for it in the ACPI device object's list of physical nodes
corresponding to it.  Hence, the patch below.

I actually can't test it right now (or even compile it for that matter), but
I'll put it in order tomorrow.

---
 drivers/acpi/pci_root.c |   82 +++++++++---------------------------------------
 1 file changed, 16 insertions(+), 66 deletions(-)

Index: linux-pm/drivers/acpi/pci_root.c
===================================================================
--- linux-pm.orig/drivers/acpi/pci_root.c
+++ linux-pm/drivers/acpi/pci_root.c
@@ -312,76 +312,26 @@ struct acpi_handle_node {
  */
 struct pci_dev *acpi_get_pci_dev(acpi_handle handle)
 {
-	int dev, fn;
-	unsigned long long adr;
-	acpi_status status;
-	acpi_handle phandle;
-	struct pci_bus *pbus;
-	struct pci_dev *pdev = NULL;
-	struct acpi_handle_node *node, *tmp;
-	struct acpi_pci_root *root;
-	LIST_HEAD(device_list);
-
-	/*
-	 * Walk up the ACPI CA namespace until we reach a PCI root bridge.
-	 */
-	phandle = handle;
-	while (!acpi_is_root_bridge(phandle)) {
-		node = kzalloc(sizeof(struct acpi_handle_node), GFP_KERNEL);
-		if (!node)
-			goto out;
-
-		INIT_LIST_HEAD(&node->node);
-		node->handle = phandle;
-		list_add(&node->node, &device_list);
-
-		status = acpi_get_parent(phandle, &phandle);
-		if (ACPI_FAILURE(status))
-			goto out;
-	}
-
-	root = acpi_pci_find_root(phandle);
-	if (!root)
-		goto out;
-
-	pbus = root->bus;
-
-	/*
-	 * Now, walk back down the PCI device tree until we return to our
-	 * original handle. Assumes that everything between the PCI root
-	 * bridge and the device we're looking for must be a P2P bridge.
-	 */
-	list_for_each_entry(node, &device_list, node) {
-		acpi_handle hnd = node->handle;
-		status = acpi_evaluate_integer(hnd, "_ADR", NULL, &adr);
-		if (ACPI_FAILURE(status))
-			goto out;
-		dev = (adr >> 16) & 0xffff;
-		fn  = adr & 0xffff;
-
-		pdev = pci_get_slot(pbus, PCI_DEVFN(dev, fn));
-		if (!pdev || hnd == handle)
-			break;
-
-		pbus = pdev->subordinate;
-		pci_dev_put(pdev);
-
-		/*
-		 * This function may be called for a non-PCI device that has a
-		 * PCI parent (eg. a disk under a PCI SATA controller).  In that
-		 * case pdev->subordinate will be NULL for the parent.
-		 */
-		if (!pbus) {
-			dev_dbg(&pdev->dev, "Not a PCI-to-PCI bridge\n");
-			pdev = NULL;
+	struct acpi_device *adev = acpi_fetch_acpi_dev(handle);
+	struct acpi_device_physical_node *pn;
+	struct device *pci_dev = NULL;
+
+	if (!adev)
+		return NULL;
+
+	mutex_lock(&adev->physical_node_lock);
+
+	list_for_each_entry(pn, &acpi_dev->physical_node_list, node) {
+		if (dev_is_pci(pn->dev)) {
+			pci_dev = to_pci_dev(pn->dev);
 			break;
 		}
+			
 	}
-out:
-	list_for_each_entry_safe(node, tmp, &device_list, node)
-		kfree(node);
 
-	return pdev;
+	mutex_unlock(&adev->physical_node_lock);
+
+	return pci_dev;
 }
 EXPORT_SYMBOL_GPL(acpi_get_pci_dev);
Greg Kroah-Hartman Sept. 10, 2022, 5:42 a.m. UTC | #7
On Fri, Sep 09, 2022 at 11:18:46PM +0200, Rafael J. Wysocki wrote:
> On Friday, September 9, 2022 9:42:53 AM CEST Greg Kroah-Hartman wrote:
> > On Mon, Jun 27, 2022 at 06:37:06PM +0200, Rafael J. Wysocki wrote:
> > > On Mon, Jun 27, 2022 at 5:07 PM Greg Kroah-Hartman
> > > <gregkh@linuxfoundation.org> wrote:
> > > >
> > > > On Thu, Apr 28, 2022 at 10:30:38PM +0200, Rafael J. Wysocki wrote:
> > > > > On Thu, Apr 28, 2022 at 10:15 PM Rafael J. Wysocki <rafael@kernel.org> wrote:
> > > > > >
> > > > > > On Thu, Apr 28, 2022 at 6:22 PM Greg Kroah-Hartman
> > > > > > <gregkh@linuxfoundation.org> wrote:
> > > > > > >
> > > > > > > On Thu, Apr 28, 2022 at 10:58:58AM -0500, Bjorn Helgaas wrote:
> > > > > > > > On Thu, Apr 28, 2022 at 04:28:53PM +0200, Greg Kroah-Hartman wrote:
> > > > > > > > > In acpi_get_pci_dev(), the debugging message for when a PCI bridge is
> > > > > > > > > not found uses a pointer to a pci device whose reference has just been
> > > > > > > > > dropped.  The chance that this really is a device that is now been
> > > > > > > > > removed from the system is almost impossible to happen, but to be safe,
> > > > > > > > > let's print out the debugging message based on the acpi root device
> > > > > > > > > which we do have a valid reference to at the moment.
> > > > > > > >
> > > > > > > > This code was added by 497fb54f578e ("ACPI / PCI: Fix NULL pointer
> > > > > > > > dereference in acpi_get_pci_dev() (rev. 2)").  Not sure if it's worth
> > > > > > > > a Fixes: tag.
> > > > > > >
> > > > > > > Can't hurt, I'll add it for the v2 based on this review.
> > > > > > >
> > > > > > > >
> > > > > > > > acpi_get_pci_dev() is used by only five callers, three of which are
> > > > > > > > video/backlight related.  I'm always skeptical of one-off interfaces
> > > > > > > > like this, but I don't know enough to propose any refactoring or other
> > > > > > > > alternatives.
> > > > > > > >
> > > > > > > > I'll leave this for Rafael, but if I were applying I would silently
> > > > > > > > touch up the subject to match convention:
> > > > > > > >
> > > > > > > >   PCI/ACPI: Do not reference PCI device after it has been released
> > > > > > >
> > > > > > > Much simpler, thanks.
> > > > > > >
> > > > > > > >
> > > > > > > > > Cc: Bjorn Helgaas <bhelgaas@google.com>
> > > > > > > > > Cc: "Rafael J. Wysocki" <rafael@kernel.org>
> > > > > > > > > Cc: Len Brown <lenb@kernel.org>
> > > > > > > > > Cc: linux-pci@vger.kernel.org
> > > > > > > > > Cc: linux-acpi@vger.kernel.org
> > > > > > > > > Reported-by: whitehat002 <hackyzh002@gmail.com>
> > > > > > > > > Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
> > > > > > > > > ---
> > > > > > > > >  drivers/acpi/pci_root.c | 3 ++-
> > > > > > > > >  1 file changed, 2 insertions(+), 1 deletion(-)
> > > > > > > > >
> > > > > > > > > diff --git a/drivers/acpi/pci_root.c b/drivers/acpi/pci_root.c
> > > > > > > > > index 6f9e75d14808..ecda378dbc09 100644
> > > > > > > > > --- a/drivers/acpi/pci_root.c
> > > > > > > > > +++ b/drivers/acpi/pci_root.c
> > > > > > > > > @@ -303,7 +303,8 @@ struct pci_dev *acpi_get_pci_dev(acpi_handle handle)
> > > > > > > > >              * case pdev->subordinate will be NULL for the parent.
> > > > > > > > >              */
> > > > > > > > >             if (!pbus) {
> > > > > > > > > -                   dev_dbg(&pdev->dev, "Not a PCI-to-PCI bridge\n");
> > > > > > > > > +                   dev_dbg(&root->device->dev,
> > > > > > > > > +                           "dev %d, function %d is not a PCI-to-PCI bridge\n", dev, fn);
> > > > > > > >
> > > > > > > > This should use "%02x.%d" to be consistent with the dev_set_name() in
> > > > > > > > pci_setup_device().
> > > > > > >
> > > > > > > Ah, missed that, will change it and send out a new version tomorrow.
> > > > > >
> > > > > > I would make the change below (modulo the gmail-induced wthite space
> > > > > > breakage), though.
> > > > >
> > > > > That said ->
> > > > >
> > > > > > ---
> > > > > >  drivers/acpi/pci_root.c |    5 +++--
> > > > > >  1 file changed, 3 insertions(+), 2 deletions(-)
> > > > > >
> > > > > > Index: linux-pm/drivers/acpi/pci_root.c
> > > > > > ===================================================================
> > > > > > --- linux-pm.orig/drivers/acpi/pci_root.c
> > > > > > +++ linux-pm/drivers/acpi/pci_root.c
> > > > > > @@ -295,8 +295,6 @@ struct pci_dev *acpi_get_pci_dev(acpi_ha
> > > > > >              break;
> > > > > >
> > > > > >          pbus = pdev->subordinate;
> > > > > > -        pci_dev_put(pdev);
> > > > > > -
> > > > > >          /*
> > > > > >           * This function may be called for a non-PCI device that has a
> > > > > >           * PCI parent (eg. a disk under a PCI SATA controller).  In that
> > > > > > @@ -304,9 +302,12 @@ struct pci_dev *acpi_get_pci_dev(acpi_ha
> > > > > >           */
> > > > > >          if (!pbus) {
> > > > > >              dev_dbg(&pdev->dev, "Not a PCI-to-PCI bridge\n");
> > > > > > +            pci_dev_put(pdev);
> > > > > >              pdev = NULL;
> > > > > >              break;
> > > > > >          }
> > > > > > +
> > > > > > +        pci_dev_put(pdev);
> > > > >
> > > > > -> we are going to use pbus after this and it is pdev->subordinate
> > > > > which cannot survive without pdev AFAICS.
> > > > >
> > > > > Are we not concerned about this case?
> > > >
> > > > Good point.
> > > >
> > > > whitehat002, any ideas?  You found this issue but it really looks like
> > > > it is not anything that can ever be hit, so how far do you want to go to
> > > > unwind it?
> > > 
> > > I have an idea, sorry for the delay here.
> > > 
> > > I should be ready to post something tomorrow.
> > 
> > Was this ever posted?
> 
> No, it wasn't.  Sorry for the glacial pace here.
> 
> So the idea is based on the observation that the PCI device returned by the current
> code in acpi_get_pci_dev() needs to be registered, so if it corresponds to an ACPI
> device object, the struct acpi_device representing it must be registered too and,
> moreover, it should be the ACPI companion of that PCI device.  Thus it should be
> sufficient to look for it in the ACPI device object's list of physical nodes
> corresponding to it.  Hence, the patch below.
> 
> I actually can't test it right now (or even compile it for that matter), but
> I'll put it in order tomorrow.

The idea looks sane to me, let me know if testing works or not, thanks!

greg k-h
Rafael J. Wysocki Sept. 10, 2022, 1:33 p.m. UTC | #8
On Saturday, September 10, 2022 7:42:03 AM CEST Greg Kroah-Hartman wrote:
> On Fri, Sep 09, 2022 at 11:18:46PM +0200, Rafael J. Wysocki wrote:
> > On Friday, September 9, 2022 9:42:53 AM CEST Greg Kroah-Hartman wrote:
> > > On Mon, Jun 27, 2022 at 06:37:06PM +0200, Rafael J. Wysocki wrote:
> > > > On Mon, Jun 27, 2022 at 5:07 PM Greg Kroah-Hartman
> > > > <gregkh@linuxfoundation.org> wrote:
> > > > >
> > > > > On Thu, Apr 28, 2022 at 10:30:38PM +0200, Rafael J. Wysocki wrote:
> > > > > > On Thu, Apr 28, 2022 at 10:15 PM Rafael J. Wysocki <rafael@kernel.org> wrote:
> > > > > > >
> > > > > > > On Thu, Apr 28, 2022 at 6:22 PM Greg Kroah-Hartman
> > > > > > > <gregkh@linuxfoundation.org> wrote:
> > > > > > > >
> > > > > > > > On Thu, Apr 28, 2022 at 10:58:58AM -0500, Bjorn Helgaas wrote:
> > > > > > > > > On Thu, Apr 28, 2022 at 04:28:53PM +0200, Greg Kroah-Hartman wrote:
> > > > > > > > > > In acpi_get_pci_dev(), the debugging message for when a PCI bridge is
> > > > > > > > > > not found uses a pointer to a pci device whose reference has just been
> > > > > > > > > > dropped.  The chance that this really is a device that is now been
> > > > > > > > > > removed from the system is almost impossible to happen, but to be safe,
> > > > > > > > > > let's print out the debugging message based on the acpi root device
> > > > > > > > > > which we do have a valid reference to at the moment.
> > > > > > > > >
> > > > > > > > > This code was added by 497fb54f578e ("ACPI / PCI: Fix NULL pointer
> > > > > > > > > dereference in acpi_get_pci_dev() (rev. 2)").  Not sure if it's worth
> > > > > > > > > a Fixes: tag.
> > > > > > > >
> > > > > > > > Can't hurt, I'll add it for the v2 based on this review.
> > > > > > > >
> > > > > > > > >
> > > > > > > > > acpi_get_pci_dev() is used by only five callers, three of which are
> > > > > > > > > video/backlight related.  I'm always skeptical of one-off interfaces
> > > > > > > > > like this, but I don't know enough to propose any refactoring or other
> > > > > > > > > alternatives.
> > > > > > > > >
> > > > > > > > > I'll leave this for Rafael, but if I were applying I would silently
> > > > > > > > > touch up the subject to match convention:
> > > > > > > > >
> > > > > > > > >   PCI/ACPI: Do not reference PCI device after it has been released
> > > > > > > >
> > > > > > > > Much simpler, thanks.
> > > > > > > >
> > > > > > > > >
> > > > > > > > > > Cc: Bjorn Helgaas <bhelgaas@google.com>
> > > > > > > > > > Cc: "Rafael J. Wysocki" <rafael@kernel.org>
> > > > > > > > > > Cc: Len Brown <lenb@kernel.org>
> > > > > > > > > > Cc: linux-pci@vger.kernel.org
> > > > > > > > > > Cc: linux-acpi@vger.kernel.org
> > > > > > > > > > Reported-by: whitehat002 <hackyzh002@gmail.com>
> > > > > > > > > > Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
> > > > > > > > > > ---
> > > > > > > > > >  drivers/acpi/pci_root.c | 3 ++-
> > > > > > > > > >  1 file changed, 2 insertions(+), 1 deletion(-)
> > > > > > > > > >
> > > > > > > > > > diff --git a/drivers/acpi/pci_root.c b/drivers/acpi/pci_root.c
> > > > > > > > > > index 6f9e75d14808..ecda378dbc09 100644
> > > > > > > > > > --- a/drivers/acpi/pci_root.c
> > > > > > > > > > +++ b/drivers/acpi/pci_root.c
> > > > > > > > > > @@ -303,7 +303,8 @@ struct pci_dev *acpi_get_pci_dev(acpi_handle handle)
> > > > > > > > > >              * case pdev->subordinate will be NULL for the parent.
> > > > > > > > > >              */
> > > > > > > > > >             if (!pbus) {
> > > > > > > > > > -                   dev_dbg(&pdev->dev, "Not a PCI-to-PCI bridge\n");
> > > > > > > > > > +                   dev_dbg(&root->device->dev,
> > > > > > > > > > +                           "dev %d, function %d is not a PCI-to-PCI bridge\n", dev, fn);
> > > > > > > > >
> > > > > > > > > This should use "%02x.%d" to be consistent with the dev_set_name() in
> > > > > > > > > pci_setup_device().
> > > > > > > >
> > > > > > > > Ah, missed that, will change it and send out a new version tomorrow.
> > > > > > >
> > > > > > > I would make the change below (modulo the gmail-induced wthite space
> > > > > > > breakage), though.
> > > > > >
> > > > > > That said ->
> > > > > >
> > > > > > > ---
> > > > > > >  drivers/acpi/pci_root.c |    5 +++--
> > > > > > >  1 file changed, 3 insertions(+), 2 deletions(-)
> > > > > > >
> > > > > > > Index: linux-pm/drivers/acpi/pci_root.c
> > > > > > > ===================================================================
> > > > > > > --- linux-pm.orig/drivers/acpi/pci_root.c
> > > > > > > +++ linux-pm/drivers/acpi/pci_root.c
> > > > > > > @@ -295,8 +295,6 @@ struct pci_dev *acpi_get_pci_dev(acpi_ha
> > > > > > >              break;
> > > > > > >
> > > > > > >          pbus = pdev->subordinate;
> > > > > > > -        pci_dev_put(pdev);
> > > > > > > -
> > > > > > >          /*
> > > > > > >           * This function may be called for a non-PCI device that has a
> > > > > > >           * PCI parent (eg. a disk under a PCI SATA controller).  In that
> > > > > > > @@ -304,9 +302,12 @@ struct pci_dev *acpi_get_pci_dev(acpi_ha
> > > > > > >           */
> > > > > > >          if (!pbus) {
> > > > > > >              dev_dbg(&pdev->dev, "Not a PCI-to-PCI bridge\n");
> > > > > > > +            pci_dev_put(pdev);
> > > > > > >              pdev = NULL;
> > > > > > >              break;
> > > > > > >          }
> > > > > > > +
> > > > > > > +        pci_dev_put(pdev);
> > > > > >
> > > > > > -> we are going to use pbus after this and it is pdev->subordinate
> > > > > > which cannot survive without pdev AFAICS.
> > > > > >
> > > > > > Are we not concerned about this case?
> > > > >
> > > > > Good point.
> > > > >
> > > > > whitehat002, any ideas?  You found this issue but it really looks like
> > > > > it is not anything that can ever be hit, so how far do you want to go to
> > > > > unwind it?
> > > > 
> > > > I have an idea, sorry for the delay here.
> > > > 
> > > > I should be ready to post something tomorrow.
> > > 
> > > Was this ever posted?
> > 
> > No, it wasn't.  Sorry for the glacial pace here.
> > 
> > So the idea is based on the observation that the PCI device returned by the current
> > code in acpi_get_pci_dev() needs to be registered, so if it corresponds to an ACPI
> > device object, the struct acpi_device representing it must be registered too and,
> > moreover, it should be the ACPI companion of that PCI device.  Thus it should be
> > sufficient to look for it in the ACPI device object's list of physical nodes
> > corresponding to it.  Hence, the patch below.
> > 
> > I actually can't test it right now (or even compile it for that matter), but
> > I'll put it in order tomorrow.
> 
> The idea looks sane to me, let me know if testing works or not, thanks!

The patch sent previously had a few build issues, so I've just officially
posted a version of it that builds:

https://patchwork.kernel.org/project/linux-acpi/patch/2661914.mvXUDI8C0e@kreacher/

To test it, I've applied the appended extra debug patch and checked that the output
from it is the same before and after the change above.  It is for me.

---
 drivers/acpi/scan.c |   15 +++++++++++++++
 1 file changed, 15 insertions(+)

Index: linux-pm/drivers/acpi/scan.c
===================================================================
--- linux-pm.orig/drivers/acpi/scan.c
+++ linux-pm/drivers/acpi/scan.c
@@ -2537,6 +2537,19 @@ static void __init acpi_get_spcr_uart_ad
 
 static bool acpi_scan_initialized;
 
+#include <linux/pci.h>
+
+static int pci_dev_check(struct device *dev, void *not_used)
+{
+	struct acpi_device *adev = to_acpi_device(dev);
+	struct pci_dev *pci_dev = acpi_get_pci_dev(adev->handle);
+
+	if (pci_dev)
+		dev_info(&adev->dev, "%s: %s\n", __func__, dev_name(&pci_dev->dev));
+
+	return 0;
+}
+
 void __init acpi_scan_init(void)
 {
 	acpi_status status;
@@ -2603,6 +2616,8 @@ void __init acpi_scan_init(void)
 
 	acpi_scan_initialized = true;
 
+	acpi_bus_for_each_dev(pci_dev_check, NULL);
+
 unlock:
 	mutex_unlock(&acpi_scan_lock);
 }
Greg Kroah-Hartman Sept. 10, 2022, 2:06 p.m. UTC | #9
On Sat, Sep 10, 2022 at 03:33:15PM +0200, Rafael J. Wysocki wrote:
> On Saturday, September 10, 2022 7:42:03 AM CEST Greg Kroah-Hartman wrote:
> > On Fri, Sep 09, 2022 at 11:18:46PM +0200, Rafael J. Wysocki wrote:
> > > On Friday, September 9, 2022 9:42:53 AM CEST Greg Kroah-Hartman wrote:
> > > > On Mon, Jun 27, 2022 at 06:37:06PM +0200, Rafael J. Wysocki wrote:
> > > > > On Mon, Jun 27, 2022 at 5:07 PM Greg Kroah-Hartman
> > > > > <gregkh@linuxfoundation.org> wrote:
> > > > > >
> > > > > > On Thu, Apr 28, 2022 at 10:30:38PM +0200, Rafael J. Wysocki wrote:
> > > > > > > On Thu, Apr 28, 2022 at 10:15 PM Rafael J. Wysocki <rafael@kernel.org> wrote:
> > > > > > > >
> > > > > > > > On Thu, Apr 28, 2022 at 6:22 PM Greg Kroah-Hartman
> > > > > > > > <gregkh@linuxfoundation.org> wrote:
> > > > > > > > >
> > > > > > > > > On Thu, Apr 28, 2022 at 10:58:58AM -0500, Bjorn Helgaas wrote:
> > > > > > > > > > On Thu, Apr 28, 2022 at 04:28:53PM +0200, Greg Kroah-Hartman wrote:
> > > > > > > > > > > In acpi_get_pci_dev(), the debugging message for when a PCI bridge is
> > > > > > > > > > > not found uses a pointer to a pci device whose reference has just been
> > > > > > > > > > > dropped.  The chance that this really is a device that is now been
> > > > > > > > > > > removed from the system is almost impossible to happen, but to be safe,
> > > > > > > > > > > let's print out the debugging message based on the acpi root device
> > > > > > > > > > > which we do have a valid reference to at the moment.
> > > > > > > > > >
> > > > > > > > > > This code was added by 497fb54f578e ("ACPI / PCI: Fix NULL pointer
> > > > > > > > > > dereference in acpi_get_pci_dev() (rev. 2)").  Not sure if it's worth
> > > > > > > > > > a Fixes: tag.
> > > > > > > > >
> > > > > > > > > Can't hurt, I'll add it for the v2 based on this review.
> > > > > > > > >
> > > > > > > > > >
> > > > > > > > > > acpi_get_pci_dev() is used by only five callers, three of which are
> > > > > > > > > > video/backlight related.  I'm always skeptical of one-off interfaces
> > > > > > > > > > like this, but I don't know enough to propose any refactoring or other
> > > > > > > > > > alternatives.
> > > > > > > > > >
> > > > > > > > > > I'll leave this for Rafael, but if I were applying I would silently
> > > > > > > > > > touch up the subject to match convention:
> > > > > > > > > >
> > > > > > > > > >   PCI/ACPI: Do not reference PCI device after it has been released
> > > > > > > > >
> > > > > > > > > Much simpler, thanks.
> > > > > > > > >
> > > > > > > > > >
> > > > > > > > > > > Cc: Bjorn Helgaas <bhelgaas@google.com>
> > > > > > > > > > > Cc: "Rafael J. Wysocki" <rafael@kernel.org>
> > > > > > > > > > > Cc: Len Brown <lenb@kernel.org>
> > > > > > > > > > > Cc: linux-pci@vger.kernel.org
> > > > > > > > > > > Cc: linux-acpi@vger.kernel.org
> > > > > > > > > > > Reported-by: whitehat002 <hackyzh002@gmail.com>
> > > > > > > > > > > Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
> > > > > > > > > > > ---
> > > > > > > > > > >  drivers/acpi/pci_root.c | 3 ++-
> > > > > > > > > > >  1 file changed, 2 insertions(+), 1 deletion(-)
> > > > > > > > > > >
> > > > > > > > > > > diff --git a/drivers/acpi/pci_root.c b/drivers/acpi/pci_root.c
> > > > > > > > > > > index 6f9e75d14808..ecda378dbc09 100644
> > > > > > > > > > > --- a/drivers/acpi/pci_root.c
> > > > > > > > > > > +++ b/drivers/acpi/pci_root.c
> > > > > > > > > > > @@ -303,7 +303,8 @@ struct pci_dev *acpi_get_pci_dev(acpi_handle handle)
> > > > > > > > > > >              * case pdev->subordinate will be NULL for the parent.
> > > > > > > > > > >              */
> > > > > > > > > > >             if (!pbus) {
> > > > > > > > > > > -                   dev_dbg(&pdev->dev, "Not a PCI-to-PCI bridge\n");
> > > > > > > > > > > +                   dev_dbg(&root->device->dev,
> > > > > > > > > > > +                           "dev %d, function %d is not a PCI-to-PCI bridge\n", dev, fn);
> > > > > > > > > >
> > > > > > > > > > This should use "%02x.%d" to be consistent with the dev_set_name() in
> > > > > > > > > > pci_setup_device().
> > > > > > > > >
> > > > > > > > > Ah, missed that, will change it and send out a new version tomorrow.
> > > > > > > >
> > > > > > > > I would make the change below (modulo the gmail-induced wthite space
> > > > > > > > breakage), though.
> > > > > > >
> > > > > > > That said ->
> > > > > > >
> > > > > > > > ---
> > > > > > > >  drivers/acpi/pci_root.c |    5 +++--
> > > > > > > >  1 file changed, 3 insertions(+), 2 deletions(-)
> > > > > > > >
> > > > > > > > Index: linux-pm/drivers/acpi/pci_root.c
> > > > > > > > ===================================================================
> > > > > > > > --- linux-pm.orig/drivers/acpi/pci_root.c
> > > > > > > > +++ linux-pm/drivers/acpi/pci_root.c
> > > > > > > > @@ -295,8 +295,6 @@ struct pci_dev *acpi_get_pci_dev(acpi_ha
> > > > > > > >              break;
> > > > > > > >
> > > > > > > >          pbus = pdev->subordinate;
> > > > > > > > -        pci_dev_put(pdev);
> > > > > > > > -
> > > > > > > >          /*
> > > > > > > >           * This function may be called for a non-PCI device that has a
> > > > > > > >           * PCI parent (eg. a disk under a PCI SATA controller).  In that
> > > > > > > > @@ -304,9 +302,12 @@ struct pci_dev *acpi_get_pci_dev(acpi_ha
> > > > > > > >           */
> > > > > > > >          if (!pbus) {
> > > > > > > >              dev_dbg(&pdev->dev, "Not a PCI-to-PCI bridge\n");
> > > > > > > > +            pci_dev_put(pdev);
> > > > > > > >              pdev = NULL;
> > > > > > > >              break;
> > > > > > > >          }
> > > > > > > > +
> > > > > > > > +        pci_dev_put(pdev);
> > > > > > >
> > > > > > > -> we are going to use pbus after this and it is pdev->subordinate
> > > > > > > which cannot survive without pdev AFAICS.
> > > > > > >
> > > > > > > Are we not concerned about this case?
> > > > > >
> > > > > > Good point.
> > > > > >
> > > > > > whitehat002, any ideas?  You found this issue but it really looks like
> > > > > > it is not anything that can ever be hit, so how far do you want to go to
> > > > > > unwind it?
> > > > > 
> > > > > I have an idea, sorry for the delay here.
> > > > > 
> > > > > I should be ready to post something tomorrow.
> > > > 
> > > > Was this ever posted?
> > > 
> > > No, it wasn't.  Sorry for the glacial pace here.
> > > 
> > > So the idea is based on the observation that the PCI device returned by the current
> > > code in acpi_get_pci_dev() needs to be registered, so if it corresponds to an ACPI
> > > device object, the struct acpi_device representing it must be registered too and,
> > > moreover, it should be the ACPI companion of that PCI device.  Thus it should be
> > > sufficient to look for it in the ACPI device object's list of physical nodes
> > > corresponding to it.  Hence, the patch below.
> > > 
> > > I actually can't test it right now (or even compile it for that matter), but
> > > I'll put it in order tomorrow.
> > 
> > The idea looks sane to me, let me know if testing works or not, thanks!
> 
> The patch sent previously had a few build issues, so I've just officially
> posted a version of it that builds:
> 
> https://patchwork.kernel.org/project/linux-acpi/patch/2661914.mvXUDI8C0e@kreacher/
> 
> To test it, I've applied the appended extra debug patch and checked that the output
> from it is the same before and after the change above.  It is for me.

Looks good, thanks for doing this!

greg k-h
diff mbox series

Patch

diff --git a/drivers/acpi/pci_root.c b/drivers/acpi/pci_root.c
index 6f9e75d14808..ecda378dbc09 100644
--- a/drivers/acpi/pci_root.c
+++ b/drivers/acpi/pci_root.c
@@ -303,7 +303,8 @@  struct pci_dev *acpi_get_pci_dev(acpi_handle handle)
 		 * case pdev->subordinate will be NULL for the parent.
 		 */
 		if (!pbus) {
-			dev_dbg(&pdev->dev, "Not a PCI-to-PCI bridge\n");
+			dev_dbg(&root->device->dev,
+				"dev %d, function %d is not a PCI-to-PCI bridge\n", dev, fn);
 			pdev = NULL;
 			break;
 		}