[5.6,061/254] block, bfq: fix use-after-free in bfq_idle_slice_timer_body

From: Zhiqiang Liu <liuzhiqiang26@huawei.com>

From: Zhiqiang Liu <liuzhiqiang26@huawei.com>

[ Upstream commit 2f95fa5c955d0a9987ffdc3a095e2f4e62c5f2a9 ]

In bfq_idle_slice_timer func, bfqq = bfqd->in_service_queue is
not in bfqd-lock critical section. The bfqq, which is not
equal to NULL in bfq_idle_slice_timer, may be freed after passing
to bfq_idle_slice_timer_body. So we will access the freed memory.

In addition, considering the bfqq may be in race, we should
firstly check whether bfqq is in service before doing something
on it in bfq_idle_slice_timer_body func. If the bfqq in race is
not in service, it means the bfqq has been expired through
__bfq_bfqq_expire func, and wait_request flags has been cleared in
__bfq_bfqd_reset_in_service func. So we do not need to re-clear the
wait_request of bfqq which is not in service.

KASAN log is given as follows:
[13058.354613] ==================================================================
[13058.354640] BUG: KASAN: use-after-free in bfq_idle_slice_timer+0xac/0x290
[13058.354644] Read of size 8 at addr ffffa02cf3e63f78 by task fork13/19767
[13058.354646]
[13058.354655] CPU: 96 PID: 19767 Comm: fork13
[13058.354661] Call trace:
[13058.354667]  dump_backtrace+0x0/0x310
[13058.354672]  show_stack+0x28/0x38
[13058.354681]  dump_stack+0xd8/0x108
[13058.354687]  print_address_description+0x68/0x2d0
[13058.354690]  kasan_report+0x124/0x2e0
[13058.354697]  __asan_load8+0x88/0xb0
[13058.354702]  bfq_idle_slice_timer+0xac/0x290
[13058.354707]  __hrtimer_run_queues+0x298/0x8b8
[13058.354710]  hrtimer_interrupt+0x1b8/0x678
[13058.354716]  arch_timer_handler_phys+0x4c/0x78
[13058.354722]  handle_percpu_devid_irq+0xf0/0x558
[13058.354731]  generic_handle_irq+0x50/0x70
[13058.354735]  __handle_domain_irq+0x94/0x110
[13058.354739]  gic_handle_irq+0x8c/0x1b0
[13058.354742]  el1_irq+0xb8/0x140
[13058.354748]  do_wp_page+0x260/0xe28
[13058.354752]  __handle_mm_fault+0x8ec/0x9b0
[13058.354756]  handle_mm_fault+0x280/0x460
[13058.354762]  do_page_fault+0x3ec/0x890
[13058.354765]  do_mem_abort+0xc0/0x1b0
[13058.354768]  el0_da+0x24/0x28
[13058.354770]
[13058.354773] Allocated by task 19731:
[13058.354780]  kasan_kmalloc+0xe0/0x190
[13058.354784]  kasan_slab_alloc+0x14/0x20
[13058.354788]  kmem_cache_alloc_node+0x130/0x440
[13058.354793]  bfq_get_queue+0x138/0x858
[13058.354797]  bfq_get_bfqq_handle_split+0xd4/0x328
[13058.354801]  bfq_init_rq+0x1f4/0x1180
[13058.354806]  bfq_insert_requests+0x264/0x1c98
[13058.354811]  blk_mq_sched_insert_requests+0x1c4/0x488
[13058.354818]  blk_mq_flush_plug_list+0x2d4/0x6e0
[13058.354826]  blk_flush_plug_list+0x230/0x548
[13058.354830]  blk_finish_plug+0x60/0x80
[13058.354838]  read_pages+0xec/0x2c0
[13058.354842]  __do_page_cache_readahead+0x374/0x438
[13058.354846]  ondemand_readahead+0x24c/0x6b0
[13058.354851]  page_cache_sync_readahead+0x17c/0x2f8
[13058.354858]  generic_file_buffered_read+0x588/0xc58
[13058.354862]  generic_file_read_iter+0x1b4/0x278
[13058.354965]  ext4_file_read_iter+0xa8/0x1d8 [ext4]
[13058.354972]  __vfs_read+0x238/0x320
[13058.354976]  vfs_read+0xbc/0x1c0
[13058.354980]  ksys_read+0xdc/0x1b8
[13058.354984]  __arm64_sys_read+0x50/0x60
[13058.354990]  el0_svc_common+0xb4/0x1d8
[13058.354994]  el0_svc_handler+0x50/0xa8
[13058.354998]  el0_svc+0x8/0xc
[13058.354999]
[13058.355001] Freed by task 19731:
[13058.355007]  __kasan_slab_free+0x120/0x228
[13058.355010]  kasan_slab_free+0x10/0x18
[13058.355014]  kmem_cache_free+0x288/0x3f0
[13058.355018]  bfq_put_queue+0x134/0x208
[13058.355022]  bfq_exit_icq_bfqq+0x164/0x348
[13058.355026]  bfq_exit_icq+0x28/0x40
[13058.355030]  ioc_exit_icq+0xa0/0x150
[13058.355035]  put_io_context_active+0x250/0x438
[13058.355038]  exit_io_context+0xd0/0x138
[13058.355045]  do_exit+0x734/0xc58
[13058.355050]  do_group_exit+0x78/0x220
[13058.355054]  __wake_up_parent+0x0/0x50
[13058.355058]  el0_svc_common+0xb4/0x1d8
[13058.355062]  el0_svc_handler+0x50/0xa8
[13058.355066]  el0_svc+0x8/0xc
[13058.355067]
[13058.355071] The buggy address belongs to the object at ffffa02cf3e63e70#012 which belongs to the cache bfq_queue of size 464
[13058.355075] The buggy address is located 264 bytes inside of#012 464-byte region [ffffa02cf3e63e70, ffffa02cf3e64040)
[13058.355077] The buggy address belongs to the page:
[13058.355083] page:ffff7e80b3cf9800 count:1 mapcount:0 mapping:ffff802db5c90780 index:0xffffa02cf3e606f0 compound_mapcount: 0
[13058.366175] flags: 0x2ffffe0000008100(slab|head)
[13058.370781] raw: 2ffffe0000008100 ffff7e80b53b1408 ffffa02d730c1c90 ffff802db5c90780
[13058.370787] raw: ffffa02cf3e606f0 0000000000370023 00000001ffffffff 0000000000000000
[13058.370789] page dumped because: kasan: bad access detected
[13058.370791]
[13058.370792] Memory state around the buggy address:
[13058.370797]  ffffa02cf3e63e00: fc fc fc fc fc fc fc fc fc fc fc fc fc fc fb fb
[13058.370801]  ffffa02cf3e63e80: fb fb fb fb fb fb fb fb fb fb fb fb fb fb fb fb
[13058.370805] >ffffa02cf3e63f00: fb fb fb fb fb fb fb fb fb fb fb fb fb fb fb fb
[13058.370808]                                                                 ^
[13058.370811]  ffffa02cf3e63f80: fb fb fb fb fb fb fb fb fb fb fb fb fb fb fb fb
[13058.370815]  ffffa02cf3e64000: fb fb fb fb fb fb fb fb fc fc fc fc fc fc fc fc
[13058.370817] ==================================================================
[13058.370820] Disabling lock debugging due to kernel taint

Here, we directly pass the bfqd to bfq_idle_slice_timer_body func.
--
V2->V3: rewrite the comment as suggested by Paolo Valente
V1->V2: add one comment, and add Fixes and Reported-by tag.

Fixes: aee69d78d ("block, bfq: introduce the BFQ-v0 I/O scheduler as an extra scheduler")
Acked-by: Paolo Valente <paolo.valente@linaro.org>
Reported-by: Wang Wang <wangwang2@huawei.com>
Signed-off-by: Zhiqiang Liu <liuzhiqiang26@huawei.com>
Signed-off-by: Feilong Lin <linfeilong@huawei.com>
Signed-off-by: Jens Axboe <axboe@kernel.dk>
Signed-off-by: Sasha Levin <sashal@kernel.org>
---
 block/bfq-iosched.c | 16 ++++++++++++----
 1 file changed, 12 insertions(+), 4 deletions(-)

Message ID	20200416131333.568693190@linuxfoundation.org
State	Superseded
Headers	show Return-Path: <SRS0=D6dW=6A=vger.kernel.org=stable-owner@kernel.org> From: Greg Kroah-Hartman <gregkh@linuxfoundation.org> To: linux-kernel@vger.kernel.org Cc: Greg Kroah-Hartman <gregkh@linuxfoundation.org>, stable@vger.kernel.org, Paolo Valente <paolo.valente@linaro.org>, Wang Wang <wangwang2@huawei.com>, Zhiqiang Liu <liuzhiqiang26@huawei.com>, Feilong Lin <linfeilong@huawei.com>, Jens Axboe <axboe@kernel.dk>, Sasha Levin <sashal@kernel.org> Subject: [PATCH 5.6 061/254] block, bfq: fix use-after-free in bfq_idle_slice_timer_body Date: Thu, 16 Apr 2020 15:22:30 +0200 Message-Id: <20200416131333.568693190@linuxfoundation.org> In-Reply-To: <20200416131325.804095985@linuxfoundation.org> References: <20200416131325.804095985@linuxfoundation.org> User-Agent: quilt/0.66 MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit Sender: stable-owner@vger.kernel.org Precedence: bulk
Series	None \| expand [5.6,002/254] cpufreq: imx6q: Fixes unwanted cpu overclocking on i.MX6ULL [5.6,004/254] usb: ucsi: ccg: disable runtime pm during fw flashing [5.6,007/254] media: hantro: fix extra MV/MC sync space calculation [5.6,008/254] media: staging: rkisp1: use consistent bus_info string for media_dev [5.6,009/254] media: staging: rkisp1: isp: do not set invalid mbus code for pad [5.6,011/254] firmware: arm_sdei: fix double-lock on hibernate with shared events [5.6,013/254] usb: phy: tegra: Include proper GPIO consumer header to fix compile testing [5.6,015/254] sched/vtime: Prevent unstable evaluation of WARN(vtime->state) [5.6,021/254] null_blk: fix spurious IO errors after failed past-wp access [5.6,023/254] media: imx: imx7-media-csi: Fix video field handling [5.6,025/254] ACPI: EC: Do not clear boot_ec_is_ecdt in acpi_ec_add() [5.6,027/254] x86: Dont let pgprot_modify() change the page encryption bit [5.6,028/254] dma-mapping: Fix dma_pgprot() for unencrypted coherent pages [5.6,031/254] spi: spi-fsl-dspi: Avoid NULL pointer in dspi_slave_abort for non-DMA mode [5.6,032/254] irqchip/versatile-fpga: Handle chained IRQs properly [5.6,033/254] time/sched_clock: Expire timer in hardirq context [5.6,035/254] media: allegro: fix type of gop_length in channel_create message [5.6,038/254] selftests/x86/ptrace_syscall_32: Fix no-vDSO segfault [5.6,041/254] media: i2c: video-i2c: fix build errors due to imply hwmon [5.6,042/254] libata: Remove extra scsi_host_put() in ata_scsi_add_hosts() [5.6,043/254] pstore/platform: fix potential mem leak if pstore_init_fs failed [5.6,045/254] gfs2: Dont demote a glock until its revokes are written [5.6,047/254] x86/boot: Use unsigned comparison for addresses [5.6,048/254] efi/x86: Ignore the memory attributes table on i386 [5.6,049/254] genirq/irqdomain: Check pointer in irq_domain_alloc_irqs_hierarchy() [5.6,055/254] irqchip/gic-v4: Provide irq_retrigger to avoid circular locking dependency [5.6,057/254] firmware: fix a double abort case with fw_load_sysfs_fallback [5.6,058/254] spi: spi-fsl-dspi: Replace interruptible wait queue with a simple completion [5.6,061/254] block, bfq: fix use-after-free in bfq_idle_slice_timer_body [5.6,062/254] btrfs: qgroup: ensure qgroup_rescan_running is only set when the worker is at least... [5.6,064/254] btrfs: restart relocate_tree_blocks properly [5.6,065/254] btrfs: track reloc roots based on their commit root bytenr [5.6,067/254] ASoC: dapm: connect virtual mux with default value [5.6,068/254] ASoC: dpcm: allow start or stop during pause for backend [5.6,071/254] usb: gadget: composite: Inform controller driver of self-powered [5.6,073/254] ALSA: hda: Add driver blacklist [5.6,075/254] ALSA: ice1724: Fix invalid access for enumerated ctl items [5.6,076/254] ALSA: pcm: oss: Fix regression by buffer overflow fix [5.6,077/254] ALSA: hda/realtek: Enable mute LED on an HP system [5.6,083/254] ALSA: hda/realtek - Add quirk for MSI GL63 [5.6,084/254] media: venus: cache vb payload to be used by clock scaling [5.6,085/254] media: venus: firmware: Ignore secure call error on first resume [5.6,086/254] media: hantro: Read be32 words starting at every fourth byte [5.6,090/254] ACPI: EC: Avoid printing confusing messages in acpi_ec_setup() [5.6,092/254] ACPICA: Allow acpi_any_gpe_status_set() to skip one GPE [5.6,093/254] ACPI: PM: s2idle: Refine active GPEs check [5.6,096/254] nvmet-tcp: fix maxh2cdata icresp parameter [5.6,097/254] nvme-fc: Revert "add module to ops template to allow module references" [5.6,100/254] PCI/ASPM: Clear the correct bits when enabling L1 substates [5.6,102/254] PCI: qcom: Fix the fixup of PCI_VENDOR_ID_QCOM [5.6,104/254] erofs: correct the remaining shrink objects [5.6,107/254] tpm: tpm1_bios_measurements_next should increase position index [5.6,108/254] tpm: tpm2_bios_measurements_next should increase position index [5.6,109/254] KEYS: reaching the keys quotas correctly [5.6,111/254] rcu: Make rcu_barrier() account for offline no-CBs CPUs [5.6,112/254] cpu/hotplug: Ignore pm_wakeup_pending() for disable_nonboot_cpus() [5.6,116/254] io_uring: remove bogus RLIMIT_NOFILE check in file registration [5.6,119/254] MIPS/tlbex: Fix LDDIR usage in setup_pw() for Loongson-3 [5.6,121/254] PM / Domains: Allow no domain-idle-states DT property in genpd when parsing [5.6,123/254] ath9k: Handle txpower changes even when TPC is disabled [5.6,124/254] signal: Extend exec_id to 64bits [5.6,125/254] x86/tsc_msr: Use named struct initializers [5.6,127/254] x86/tsc_msr: Make MSR derived TSC frequency more accurate [5.6,131/254] KVM: nVMX: Properly handle userspace interrupt window request [5.6,132/254] KVM: s390: vsie: Fix region 1 ASCE sanity shadow address checks [5.6,134/254] KVM: x86: Allocate new rmap and large page tracking when moving memslot [5.6,135/254] KVM: VMX: Always VMCLEAR in-use VMCSes during crash with kexec support [5.6,136/254] KVM: x86: Gracefully handle __vmalloc() failure during VM allocation [5.6,139/254] smb3: fix performance regression with setting mtime [5.6,141/254] CIFS: check new file size when extending file by fallocate [5.6,142/254] mtd: spinand: Stop using spinand->oobbuf for buffering bad block markers [5.6,144/254] mtd: rawnand: cadence: fix the calculation of the avaialble OOB size [5.6,145/254] mtd: rawnand: cadence: change bad block marker size [5.6,147/254] drm/i915/gen12: Disable preemption timeout [5.6,149/254] btrfs: fix btrfs_calc_reclaim_metadata_size calculation [5.6,150/254] Btrfs: fix crash during unmount due to race with delayed inode workers [5.6,154/254] btrfs: fix missing file extent item for hole after ranged fsync [5.6,155/254] btrfs: unset reloc control if we fail to recover [5.6,157/254] btrfs: use nofs allocations for running delayed items [5.6,159/254] remoteproc: qcom_q6v5_mss: Reload the mba region on coredump [5.6,161/254] time/namespace: Fix time_for_children symlink [5.6,162/254] time/namespace: Add max_time_namespaces ucount [5.6,165/254] io_uring: honor original task RLIMIT_FSIZE [5.6,167/254] tools: gpio: Fix out-of-tree build regression [5.6,168/254] net: qualcomm: rmnet: Allow configuration updates to existing devices [5.6,171/254] arm64: dts: allwinner: h5: Fix PMU compatible [5.6,173/254] dm writecache: add cond_resched to avoid CPU hangs [5.6,175/254] dm verity fec: fix memory leak in verity_fec_dtr [5.6,178/254] dm clone: Add overflow check for number of regions [5.6,180/254] dm clone metadata: Fix return type of dm_clone_nr_of_hydrated_regions() [5.6,184/254] crypto: caam - update xts sector size for large input length [5.6,186/254] crypto: ccree - only try to map auth tag if needed [5.6,188/254] scsi: zfcp: fix missing erp_lock in port recovery trigger for point-to-point [5.6,189/254] scsi: ufs: fix Auto-Hibern8 error detection [5.6,191/254] scsi: lpfc: Fix broken Credit Recovery after driver load [5.6,192/254] ARM: dts: exynos: Fix polarity of the LCD SPI bus on UniversalC210 board [5.6,193/254] arm64: dts: ti: k3-am65: Add clocks to dwc3 nodes [5.6,195/254] selftests: vm: drop dependencies on page flags from mlock2 tests [5.6,197/254] selftests/powerpc: Add tlbie_test in .gitignore [5.6,199/254] vfio: platform: Switch to platform_get_irq_optional() [5.6,203/254] drm: Remove PageReserved manipulation from drm_pci_alloc [5.6,205/254] drm/amd/powerplay: implement the is_dpm_running() [5.6,207/254] drm/amd/display: Check for null fclk voltage when parsing clock table [5.6,208/254] drm/prime: fix extracting of the DMA addresses from a scatterlist [5.6,211/254] drm/vboxvideo: Add missing remove_conflicting_pci_framebuffers call, v2 [5.6,212/254] nfsd: fsnotify on rmdir under nfsd/clients/ [5.6,214/254] NFS: Fix a page leak in nfs_destroy_unlinked_subrequests() [5.6,215/254] NFS: finish_automount() requires us to hold 2 refs to the mount record [5.6,216/254] NFS: Fix a few constant_table array definitions [5.6,218/254] drm/i915/gt: Treat idling as a RPS downclock event [5.6,220/254] fs/filesystems.c: downgrade user-reachable WARN_ONCE() to pr_warn_once() [5.6,225/254] ftrace/kprobe: Show the maxactive number on kprobe_events [5.6,226/254] clk: ingenic/jz4770: Exit with error if CGU init failed [5.6,228/254] kmod: make request_module() return an error when autoloading is disabled [5.6,230/254] hfsplus: fix crash and filesystem corruption when deleting files [5.6,231/254] libata: Return correct status in sata_pmp_eh_recover_pm() when ATA_DFLAG_DETACH is set [5.6,232/254] ipmi: fix hung processes in __get_guid() [5.6,233/254] xen/blkfront: fix memory allocation flags in blkfront_setup_indirect() [5.6,235/254] scsi: sr: Fix sr_block_release() [5.6,238/254] powerpc/fsl_booke: Avoid creating duplicate tlb1 entry [5.6,239/254] powerpc/hash64/devmap: Use H_PAGE_THP_HUGE when setting up huge devmap PTE entries [5.6,242/254] powerpc/xive: Fix xmon support on the PowerNV platform [5.6,244/254] powerpc/64: Prevent stack protection in early boot [5.6,246/254] Revert "drm/dp_mst: Remove VCPI while disabling topology mgr" [5.6,249/254] drm/i915/ggtt: do not set bits 1-11 in gen12 ptes [5.6,250/254] drm/i915/gt: Fill all the unused space in the GGTT [5.6,252/254] perf/core: Fix event cgroup tracking

[5.6,061/254] block, bfq: fix use-after-free in bfq_idle_slice_timer_body

Commit Message

Patch