[5.4,141/309] crypto: pcrypt - Avoid deadlock by using per-instance padata queues

Message ID	20200210122419.890687777@linuxfoundation.org
State	New
Headers	show Return-Path: <SRS0=fEgN=36=vger.kernel.org=stable-owner@kernel.org> From: Greg Kroah-Hartman <gregkh@linuxfoundation.org> To: linux-kernel@vger.kernel.org Cc: Greg Kroah-Hartman <gregkh@linuxfoundation.org>, stable@vger.kernel.org, syzbot+56c7151cad94eec37c521f0e47d2eee53f9361c4@syzkaller.appspotmail.com, Herbert Xu <herbert@gondor.apana.org.au>, Eric Biggers <ebiggers@kernel.org> Subject: [PATCH 5.4 141/309] crypto: pcrypt - Avoid deadlock by using per-instance padata queues Date: Mon, 10 Feb 2020 04:31:37 -0800 Message-Id: <20200210122419.890687777@linuxfoundation.org> In-Reply-To: <20200210122406.106356946@linuxfoundation.org> References: <20200210122406.106356946@linuxfoundation.org> User-Agent: quilt/0.66 MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit Sender: stable-owner@vger.kernel.org Precedence: bulk
Series	None \| expand [5.4,004/309] gtp: use __GFP_NOWARN to avoid memalloc warning [5.4,005/309] l2tp: Allow duplicate session creation with UDP [5.4,006/309] net: hsr: fix possible NULL deref in hsr_handle_frame() [5.4,007/309] net_sched: fix an OOB access in cls_tcindex [5.4,009/309] bnxt_en: Fix TC queue mapping. [5.4,012/309] rxrpc: Fix missing active use pinning of rxrpc_local object [5.4,013/309] rxrpc: Fix NULL pointer deref due to call->conn being cleared on disconnect [5.4,016/309] tcp: clear tp->data_segs{in\|out} in tcp_disconnect() [5.4,019/309] MAINTAINERS: correct entries for ISDN/mISDN section [5.4,021/309] bnxt_en: Fix logic that disables Bus Master during firmware reset. [5.4,022/309] media: uvcvideo: Avoid cyclic entity chains due to malformed USB descriptors [5.4,024/309] netfilter: ipset: fix suspicious RCU usage in find_set_and_id [5.4,026/309] tracing/kprobes: Have uname use __get_str() in print_fmt [5.4,028/309] rcu: Use _ONCE() to protect lockless ->expmask accesses [5.4,030/309] srcu: Apply _ONCE() to ->srcu_last_gp_end [5.4,031/309] rcu: Use READ_ONCE() for ->expmask in rcu_read_unlock_special() [5.4,033/309] nvmet: Fix controller use after free [5.4,035/309] Bluetooth: btusb: Disable runtime suspend on Realtek devices [5.4,039/309] usb: typec: tcpci: mask event interrupts when remove driver [5.4,042/309] usb: gadget: legacy: set max_speed to super-speed [5.4,043/309] usb: gadget: f_ncm: Use atomic_t to track in-flight request [5.4,045/309] ALSA: usb-audio: Fix endianess in descriptor validation [5.4,048/309] memcg: fix a crash in wb_workfn when a device disappears [5.4,049/309] mm/sparse.c: reset sections mem_map when fully deactivated [5.4,050/309] mmc: sdhci-pci: Make function amd_sdhci_reset static [5.4,052/309] mm/memory_hotplug: fix remove_memory() lockdep splat [5.4,057/309] media: v4l2-rect.h: fix v4l2_rect_map_inside() top/left adjustments [5.4,058/309] lib/test_kasan.c: fix memory leak in kmalloc_oob_krealloc_more() [5.4,059/309] irqdomain: Fix a memory leak in irq_domain_push_irq() [5.4,060/309] x86/cpu: Update cached HLE state on write to TSX_CTRL_CPUID_CLEAR [5.4,063/309] ALSA: hda: Add Clevo W65_67SB the power_save blacklist [5.4,064/309] ALSA: hda: Add JasperLake PCI ID and codec vid [5.4,067/309] KVM: arm/arm64: Correct CPSR on exception entry [5.4,071/309] MIPS: fix indentation of the RELOCS message [5.4,072/309] MIPS: boot: fix typo in vmlinux.lzma.its target [5.4,074/309] powerpc/mmu_gather: enable RCU_TABLE_FREE even for !SMP case [5.4,077/309] powerpc/pseries: Advance pfn if section is not present in lmb_is_removable() [5.4,079/309] powerpc/32s: Fix CPU wake-up from sleep mode [5.4,080/309] tracing: Fix now invalid var_ref_vals assumption in trace action [5.4,081/309] PCI: tegra: Fix return value check of pm_runtime_get_sync() [5.4,082/309] PCI: keystone: Fix outbound region mapping [5.4,084/309] PCI: keystone: Fix error handling when "num-viewport" DT property is not populated [5.4,086/309] ACPI: video: Do not export a non working backlight interface on MSI MS-7721 boards [5.4,087/309] ACPI / battery: Deal with design or full capacity being reported as -1 [5.4,093/309] ubifs: Fix wrong memory allocation [5.4,095/309] ubifs: Fix deadlock in concurrent bulk-read and writepage [5.4,097/309] ASoC: SOF: core: free trace on errors [5.4,100/309] nvmem: core: fix memory abort in cleanup path [5.4,102/309] crypto: ccree - fix backlog memory leak [5.4,105/309] crypto: ccree - fix FDE descriptor sequence [5.4,107/309] padata: Remove broken queue flushing [5.4,110/309] erofs: fix out-of-bound read for shifted uncompressed block [5.4,111/309] scsi: megaraid_sas: Do not initiate OCR if controller is not in ready state [5.4,114/309] power: supply: axp20x_ac_power: Fix reporting online status [5.4,116/309] ovl: fix wrong WARN_ON() in ovl_cache_update_ino() [5.4,118/309] f2fs: choose hardlimit when softlimit is larger than hardlimit in f2fs_statfs_proje... [5.4,120/309] f2fs: code cleanup for f2fs_statfs_project() [5.4,121/309] f2fs: fix dcache lookup of !casefolded directories [5.4,123/309] PM: core: Fix handling of devices deleted during system-wide resume [5.4,125/309] of: Add OF_DMA_DEFAULT_COHERENT & select it on powerpc [5.4,127/309] dm zoned: support zone sizes smaller than 128MiB [5.4,128/309] dm space map common: fix to ensure new block isnt already in use [5.4,129/309] dm writecache: fix incorrect flush sequence when doing SSD mode commit [5.4,130/309] dm crypt: fix GFP flags passed to skcipher_request_alloc() [5.4,134/309] scsi: qla2xxx: Fix stuck login session using prli_pend_timer [5.4,135/309] ASoC: SOF: Introduce state machine for FW boot [5.4,139/309] ftrace: Add comment to why rcu_dereference_sched() is open coded [5.4,141/309] crypto: pcrypt - Avoid deadlock by using per-instance padata queues [5.4,143/309] btrfs: Handle another split brain scenario with metadata uuid feature [5.4,145/309] selftests/bpf: Fix perf_buffer test on systems w/ offline CPUs [5.4,148/309] tc-testing: fix eBPF tests failure on linux fresh clones [5.4,149/309] samples/bpf: Dont try to remove users homedir on clean [5.4,150/309] samples/bpf: Xdp_redirect_cpu fix missing tracepoint attach [5.4,154/309] selftests: bpf: Ignore FIN packets for reuseport tests [5.4,155/309] crypto: api - fix unexpectedly getting generic implementation [5.4,156/309] crypto: hisilicon - Use the offset fields in sqe to avoid need to split scatterlists [5.4,157/309] crypto: ccp - set max RSA modulus size for v3 platform devices as well [5.4,159/309] crypto: pcrypt - Do not clear MAY_SLEEP flag in original request [5.4,162/309] crypto: picoxcell - adjust the position of tasklet_init and fix missed tasklet_kill [5.4,164/309] scsi: qla2xxx: Fix unbound NVME response length [5.4,165/309] NFS: Fix memory leaks and corruption in readdir [5.4,168/309] jbd2_seq_info_next should increase position index [5.4,169/309] ext4: fix deadlock allocating crypto bounce page from mempool [5.4,170/309] ext4: fix race conditions in ->d_compare() and ->d_hash() [5.4,172/309] Btrfs: make deduplication with range including the last block work [5.4,174/309] btrfs: set trans->drity in btrfs_commit_transaction [5.4,178/309] btrfs: Correctly handle empty trees in find_first_clear_extent_bit [5.4,181/309] mwifiex: fix unbalanced locking in mwifiex_process_country_ie() [5.4,183/309] gfs2: fix gfs2_find_jhead that returns uninitialized jhead with seq 0 [5.4,184/309] gfs2: move setting current->backing_dev_info [5.4,186/309] drm: atmel-hlcdc: use double rate for pixel clock only if supported [5.4,188/309] drm: atmel-hlcdc: prefer a lower pixel-clock than requested [5.4,190/309] media: iguanair: fix endpoint sanity check [5.4,191/309] media: rc: ensure lirc is initialized before registering input device [5.4,193/309] xen/balloon: Support xend-based toolstack take two [5.4,195/309] bcache: add readahead cache policy options via sysfs interface [5.4,196/309] eventfd: track eventfd_signal() recursion depth [5.4,197/309] aio: prevent potential eventfd recursion on poll [5.4,201/309] KVM: x86: Protect DR-based index computations from Spectre-v1/L1TF attacks [5.4,203/309] KVM: x86: Protect kvm_hv_msr_[get\|set]_crash_data() from Spectre-v1/L1TF attacks [5.4,204/309] KVM: x86: Protect ioapic_write_indirect() from Spectre-v1/L1TF attacks [5.4,207/309] KVM: x86: Protect MSR-based index computations from Spectre-v1/L1TF attacks in x86.c [5.4,209/309] KVM: x86: Protect MSR-based index computations in fixed_msr_to_seg_unit() from Spec... [5.4,210/309] KVM: x86: Fix potential put_fpu() w/o load_fpu() on MPX platform [5.4,211/309] KVM: PPC: Book3S HV: Uninit vCPU if vcore creation fails [5.4,214/309] x86/kvm: Be careful not to clear KVM_VCPU_FLUSH_TLB bit [5.4,217/309] x86/kvm: Cache gfn to pfn translation [5.4,218/309] x86/KVM: Clean up hosts steal time structure [5.4,222/309] KVM: x86: Handle TIF_NEED_FPU_LOAD in kvm_{load,put}_guest_fpu() [5.4,225/309] KVM: s390: do not clobber registers during guest reset/store status [5.4,227/309] mm/page_alloc.c: fix uninitialized memmaps on a partially populated last section [5.4,230/309] clk: tegra: Mark fuse clock as critical [5.4,231/309] drm/amd/dm/mst: Ignore payload update failures [5.4,235/309] broken ping to ipv6 linklocal addresses on debian buster [5.4,236/309] percpu: Separate decrypted varaibles anytime encryption can be enabled [5.4,238/309] scsi: qla2xxx: Fix the endianness of the qla82xx_get_fw_size() return type [5.4,241/309] scsi: ufs: Recheck bkops level if bkops is disabled [5.4,242/309] mtd: spi-nor: Split mt25qu512a (n25q512a) entry into two [5.4,243/309] phy: qualcomm: Adjust indentation in read_poll_timeout [5.4,244/309] ext2: Adjust indentation in ext2_fill_super [5.4,245/309] powerpc/44x: Adjust indentation in ibm4xx_denali_fixup_memsize [5.4,250/309] net: tulip: Adjust indentation in {dmfe, uli526x}_init_module [5.4,252/309] IB/core: Fix ODP get user pages flow [5.4,255/309] nfsd: Return the correct number of bytes written to the file [5.4,256/309] virtio-balloon: Fix memory leak when unloading while hinting is in progress [5.4,259/309] ubi: Fix an error pointer dereference in error handling code [5.4,260/309] ubifs: Fix memory leak from c->sup_node [5.4,261/309] regulator: core: Add regulator_is_equal() helper [5.4,264/309] devlink: report 0 after hitting end in region read [5.4,265/309] dpaa_eth: support all modes with rate adapting PHYs [5.4,269/309] net: mvneta: move rx_dropped and rx_errors in per-cpu stats [5.4,271/309] net: stmmac: fix a possible endless loop [5.4,272/309] net: systemport: Avoid RBUF stuck in Wake-on-LAN mode [5.4,273/309] net/mlx5: IPsec, Fix esp modify function attribute [5.4,275/309] net: macb: Remove unnecessary alignment check for TSO [5.4,278/309] taprio: Fix still allowing changing the flags during runtime [5.4,279/309] taprio: Add missing policy validation for flags [5.4,280/309] taprio: Use taprio_reset_tc() to reset Traffic Classes configuration [5.4,283/309] qed: Fix timestamping issue for L2 unicast ptp packets. [5.4,287/309] ASoC: Intel: skl_hda_dsp_common: Fix global-out-of-bounds bug [5.4,289/309] mfd: rn5t618: Mark ADC control register volatile [5.4,290/309] mfd: bd70528: Fix hour register mask [5.4,292/309] btrfs: use bool argument in free_root_pointers() [5.4,293/309] btrfs: free block groups after freeing fs trees [5.4,294/309] drm/dp_mst: Remove VCPI while disabling topology mgr [5.4,298/309] KVM: x86: fix overlap between SPTE_MMIO_MASK and generation [5.4,299/309] KVM: nVMX: vmread should not set rflags to specify success in case of #PF [5.4,300/309] KVM: Use vcpu-specific gva->hva translation when querying host page size [5.4,302/309] cifs: fail i/o on soft mounts if sessionsetup errors out [5.4,304/309] x86/apic/msi: Plug non-maskable MSI affinity race [5.4,306/309] perf/core: Fix mlock accounting in perf_mmap() [5.4,308/309] regulator fix for "regulator: core: Add regulator_is_equal() helper"

--- a/crypto/pcrypt.c +++ b/crypto/pcrypt.c @@ -24,6 +24,8 @@ static struct kset *pcrypt_kse struct pcrypt_instance_ctx { struct crypto_aead_spawn spawn; + struct padata_shell *psenc; + struct padata_shell *psdec; atomic_t tfm_count; }; @@ -32,6 +34,12 @@ struct pcrypt_aead_ctx { unsigned int cb_cpu; }; +static inline struct pcrypt_instance_ctx *pcrypt_tfm_ictx( + struct crypto_aead *tfm) +{ + return aead_instance_ctx(aead_alg_instance(tfm)); +} + static int pcrypt_aead_setkey(struct crypto_aead *parent, const u8 *key, unsigned int keylen) { @@ -90,6 +98,9 @@ static int pcrypt_aead_encrypt(struct ae struct crypto_aead *aead = crypto_aead_reqtfm(req); struct pcrypt_aead_ctx *ctx = crypto_aead_ctx(aead); u32 flags = aead_request_flags(req); + struct pcrypt_instance_ctx *ictx; + + ictx = pcrypt_tfm_ictx(aead); memset(padata, 0, sizeof(struct padata_priv)); @@ -103,7 +114,7 @@ static int pcrypt_aead_encrypt(struct ae req->cryptlen, req->iv); aead_request_set_ad(creq, req->assoclen); - err = padata_do_parallel(pencrypt, padata, &ctx->cb_cpu); + err = padata_do_parallel(ictx->psenc, padata, &ctx->cb_cpu); if (!err) return -EINPROGRESS; @@ -132,6 +143,9 @@ static int pcrypt_aead_decrypt(struct ae struct crypto_aead *aead = crypto_aead_reqtfm(req); struct pcrypt_aead_ctx *ctx = crypto_aead_ctx(aead); u32 flags = aead_request_flags(req); + struct pcrypt_instance_ctx *ictx; + + ictx = pcrypt_tfm_ictx(aead); memset(padata, 0, sizeof(struct padata_priv)); @@ -145,7 +159,7 @@ static int pcrypt_aead_decrypt(struct ae req->cryptlen, req->iv); aead_request_set_ad(creq, req->assoclen); - err = padata_do_parallel(pdecrypt, padata, &ctx->cb_cpu); + err = padata_do_parallel(ictx->psdec, padata, &ctx->cb_cpu); if (!err) return -EINPROGRESS; @@ -192,6 +206,8 @@ static void pcrypt_free(struct aead_inst struct pcrypt_instance_ctx *ctx = aead_instance_ctx(inst); crypto_drop_aead(&ctx->spawn); + padata_free_shell(ctx->psdec); + padata_free_shell(ctx->psenc); kfree(inst); } @@ -233,12 +249,22 @@ static int pcrypt_create_aead(struct cry if (!inst) return -ENOMEM; + err = -ENOMEM; + ctx = aead_instance_ctx(inst); + ctx->psenc = padata_alloc_shell(pencrypt); + if (!ctx->psenc) + goto out_free_inst; + + ctx->psdec = padata_alloc_shell(pdecrypt); + if (!ctx->psdec) + goto out_free_psenc; + crypto_set_aead_spawn(&ctx->spawn, aead_crypto_instance(inst)); err = crypto_grab_aead(&ctx->spawn, name, 0, 0); if (err) - goto out_free_inst; + goto out_free_psdec; alg = crypto_spawn_aead_alg(&ctx->spawn); err = pcrypt_init_instance(aead_crypto_instance(inst), &alg->base); @@ -271,6 +297,10 @@ out: out_drop_aead: crypto_drop_aead(&ctx->spawn); +out_free_psdec: + padata_free_shell(ctx->psdec); +out_free_psenc: + padata_free_shell(ctx->psenc); out_free_inst: kfree(inst); goto out; --- a/include/linux/padata.h +++ b/include/linux/padata.h @@ -9,6 +9,7 @@ #ifndef PADATA_H #define PADATA_H +#include <linux/compiler_types.h> #include <linux/workqueue.h> #include <linux/spinlock.h> #include <linux/list.h> @@ -98,7 +99,7 @@ struct padata_cpumask { * struct parallel_data - Internal control structure, covers everything * that depends on the cpumask in use. * - * @pinst: padata instance. + * @sh: padata_shell object. * @pqueue: percpu padata queues used for parallelization. * @squeue: percpu padata queues used for serialuzation. * @reorder_objects: Number of objects waiting in the reorder queues. @@ -111,7 +112,7 @@ struct padata_cpumask { * @lock: Reorder lock. */ struct parallel_data { - struct padata_instance *pinst; + struct padata_shell *ps; struct padata_parallel_queue __percpu *pqueue; struct padata_serial_queue __percpu *squeue; atomic_t reorder_objects; @@ -125,13 +126,32 @@ struct parallel_data { }; /** + * struct padata_shell - Wrapper around struct parallel_data, its + * purpose is to allow the underlying control structure to be replaced + * on the fly using RCU. + * + * @pinst: padat instance. + * @pd: Actual parallel_data structure which may be substituted on the fly. + * @opd: Pointer to old pd to be freed by padata_replace. + * @list: List entry in padata_instance list. + */ +struct padata_shell { + struct padata_instance *pinst; + struct parallel_data __rcu *pd; + struct parallel_data *opd; + struct list_head list; +}; + +/** * struct padata_instance - The overall control structure. * * @cpu_notifier: cpu hotplug notifier. * @parallel_wq: The workqueue used for parallel work. * @serial_wq: The workqueue used for serial work. - * @pd: The internal control structure. + * @pslist: List of padata_shell objects attached to this instance. * @cpumask: User supplied cpumasks for parallel and serial works. + * @rcpumask: Actual cpumasks based on user cpumask and cpu_online_mask. + * @omask: Temporary storage used to compute the notification mask. * @cpumask_change_notifier: Notifiers chain for user-defined notify * callbacks that will be called when either @pcpu or @cbcpu * or both cpumasks change. @@ -143,8 +163,10 @@ struct padata_instance { struct hlist_node node; struct workqueue_struct *parallel_wq; struct workqueue_struct *serial_wq; - struct parallel_data *pd; + struct list_head pslist; struct padata_cpumask cpumask; + struct padata_cpumask rcpumask; + cpumask_var_t omask; struct blocking_notifier_head cpumask_change_notifier; struct kobject kobj; struct mutex lock; @@ -156,7 +178,9 @@ struct padata_instance { extern struct padata_instance *padata_alloc_possible(const char *name); extern void padata_free(struct padata_instance *pinst); -extern int padata_do_parallel(struct padata_instance *pinst, +extern struct padata_shell *padata_alloc_shell(struct padata_instance *pinst); +extern void padata_free_shell(struct padata_shell *ps); +extern int padata_do_parallel(struct padata_shell *ps, struct padata_priv *padata, int *cb_cpu); extern void padata_do_serial(struct padata_priv *padata); extern int padata_set_cpumask(struct padata_instance *pinst, int cpumask_type, --- a/kernel/padata.c +++ b/kernel/padata.c @@ -89,7 +89,7 @@ static void padata_parallel_worker(struc /** * padata_do_parallel - padata parallelization function * - * @pinst: padata instance + * @ps: padatashell * @padata: object to be parallelized * @cb_cpu: pointer to the CPU that the serialization callback function should * run on. If it's not in the serial cpumask of @pinst @@ -100,16 +100,17 @@ static void padata_parallel_worker(struc * Note: Every object which is parallelized by padata_do_parallel * must be seen by padata_do_serial. */ -int padata_do_parallel(struct padata_instance *pinst, +int padata_do_parallel(struct padata_shell *ps, struct padata_priv *padata, int *cb_cpu) { + struct padata_instance *pinst = ps->pinst; int i, cpu, cpu_index, target_cpu, err; struct padata_parallel_queue *queue; struct parallel_data *pd; rcu_read_lock_bh(); - pd = rcu_dereference_bh(pinst->pd); + pd = rcu_dereference_bh(ps->pd); err = -EINVAL; if (!(pinst->flags & PADATA_INIT) || pinst->flags & PADATA_INVALID) @@ -212,10 +213,10 @@ static struct padata_priv *padata_find_n static void padata_reorder(struct parallel_data *pd) { + struct padata_instance *pinst = pd->ps->pinst; int cb_cpu; struct padata_priv *padata; struct padata_serial_queue *squeue; - struct padata_instance *pinst = pd->pinst; struct padata_parallel_queue *next_queue; /* @@ -349,36 +350,39 @@ void padata_do_serial(struct padata_priv } EXPORT_SYMBOL(padata_do_serial); -static int padata_setup_cpumasks(struct parallel_data *pd, - const struct cpumask *pcpumask, - const struct cpumask *cbcpumask) +static int padata_setup_cpumasks(struct padata_instance *pinst) { struct workqueue_attrs *attrs; + int err; + + attrs = alloc_workqueue_attrs(); + if (!attrs) + return -ENOMEM; + + /* Restrict parallel_wq workers to pd->cpumask.pcpu. */ + cpumask_copy(attrs->cpumask, pinst->cpumask.pcpu); + err = apply_workqueue_attrs(pinst->parallel_wq, attrs); + free_workqueue_attrs(attrs); + + return err; +} + +static int pd_setup_cpumasks(struct parallel_data *pd, + const struct cpumask *pcpumask, + const struct cpumask *cbcpumask) +{ int err = -ENOMEM; if (!alloc_cpumask_var(&pd->cpumask.pcpu, GFP_KERNEL)) goto out; - cpumask_and(pd->cpumask.pcpu, pcpumask, cpu_online_mask); - if (!alloc_cpumask_var(&pd->cpumask.cbcpu, GFP_KERNEL)) goto free_pcpu_mask; - cpumask_and(pd->cpumask.cbcpu, cbcpumask, cpu_online_mask); - - attrs = alloc_workqueue_attrs(); - if (!attrs) - goto free_cbcpu_mask; - /* Restrict parallel_wq workers to pd->cpumask.pcpu. */ - cpumask_copy(attrs->cpumask, pd->cpumask.pcpu); - err = apply_workqueue_attrs(pd->pinst->parallel_wq, attrs); - free_workqueue_attrs(attrs); - if (err < 0) - goto free_cbcpu_mask; + cpumask_copy(pd->cpumask.pcpu, pcpumask); + cpumask_copy(pd->cpumask.cbcpu, cbcpumask); return 0; -free_cbcpu_mask: - free_cpumask_var(pd->cpumask.cbcpu); free_pcpu_mask: free_cpumask_var(pd->cpumask.pcpu); out: @@ -422,12 +426,16 @@ static void padata_init_pqueues(struct p } /* Allocate and initialize the internal cpumask dependend resources. */ -static struct parallel_data *padata_alloc_pd(struct padata_instance *pinst, - const struct cpumask *pcpumask, - const struct cpumask *cbcpumask) +static struct parallel_data *padata_alloc_pd(struct padata_shell *ps) { + struct padata_instance *pinst = ps->pinst; + const struct cpumask *cbcpumask; + const struct cpumask *pcpumask; struct parallel_data *pd; + cbcpumask = pinst->rcpumask.cbcpu; + pcpumask = pinst->rcpumask.pcpu; + pd = kzalloc(sizeof(struct parallel_data), GFP_KERNEL); if (!pd) goto err; @@ -440,8 +448,8 @@ static struct parallel_data *padata_allo if (!pd->squeue) goto err_free_pqueue; - pd->pinst = pinst; - if (padata_setup_cpumasks(pd, pcpumask, cbcpumask) < 0) + pd->ps = ps; + if (pd_setup_cpumasks(pd, pcpumask, cbcpumask)) goto err_free_squeue; padata_init_pqueues(pd); @@ -490,32 +498,64 @@ static void __padata_stop(struct padata_ } /* Replace the internal control structure with a new one. */ -static void padata_replace(struct padata_instance *pinst, - struct parallel_data *pd_new) +static int padata_replace_one(struct padata_shell *ps) { - struct parallel_data *pd_old = pinst->pd; - int notification_mask = 0; + struct parallel_data *pd_new; - pinst->flags |= PADATA_RESET; + pd_new = padata_alloc_pd(ps); + if (!pd_new) + return -ENOMEM; - rcu_assign_pointer(pinst->pd, pd_new); + ps->opd = rcu_dereference_protected(ps->pd, 1); + rcu_assign_pointer(ps->pd, pd_new); - synchronize_rcu(); + return 0; +} + +static int padata_replace(struct padata_instance *pinst, int cpu) +{ + int notification_mask = 0; + struct padata_shell *ps; + int err; + + pinst->flags |= PADATA_RESET; - if (!cpumask_equal(pd_old->cpumask.pcpu, pd_new->cpumask.pcpu)) + cpumask_copy(pinst->omask, pinst->rcpumask.pcpu); + cpumask_and(pinst->rcpumask.pcpu, pinst->cpumask.pcpu, + cpu_online_mask); + if (cpu >= 0) + cpumask_clear_cpu(cpu, pinst->rcpumask.pcpu); + if (!cpumask_equal(pinst->omask, pinst->rcpumask.pcpu)) notification_mask |= PADATA_CPU_PARALLEL; - if (!cpumask_equal(pd_old->cpumask.cbcpu, pd_new->cpumask.cbcpu)) + + cpumask_copy(pinst->omask, pinst->rcpumask.cbcpu); + cpumask_and(pinst->rcpumask.cbcpu, pinst->cpumask.cbcpu, + cpu_online_mask); + if (cpu >= 0) + cpumask_clear_cpu(cpu, pinst->rcpumask.cbcpu); + if (!cpumask_equal(pinst->omask, pinst->rcpumask.cbcpu)) notification_mask |= PADATA_CPU_SERIAL; - if (atomic_dec_and_test(&pd_old->refcnt)) - padata_free_pd(pd_old); + list_for_each_entry(ps, &pinst->pslist, list) { + err = padata_replace_one(ps); + if (err) + break; + } + + synchronize_rcu(); + + list_for_each_entry_continue_reverse(ps, &pinst->pslist, list) + if (atomic_dec_and_test(&ps->opd->refcnt)) + padata_free_pd(ps->opd); if (notification_mask) blocking_notifier_call_chain(&pinst->cpumask_change_notifier, notification_mask, - &pd_new->cpumask); + &pinst->cpumask); pinst->flags &= ~PADATA_RESET; + + return err; } /** @@ -568,7 +608,7 @@ static int __padata_set_cpumasks(struct cpumask_var_t cbcpumask) { int valid; - struct parallel_data *pd; + int err; valid = padata_validate_cpumask(pinst, pcpumask); if (!valid) { @@ -581,19 +621,15 @@ static int __padata_set_cpumasks(struct __padata_stop(pinst); out_replace: - pd = padata_alloc_pd(pinst, pcpumask, cbcpumask); - if (!pd) - return -ENOMEM; - cpumask_copy(pinst->cpumask.pcpu, pcpumask); cpumask_copy(pinst->cpumask.cbcpu, cbcpumask); - padata_replace(pinst, pd); + err = padata_setup_cpumasks(pinst) ?: padata_replace(pinst, -1); if (valid) __padata_start(pinst); - return 0; + return err; } /** @@ -676,46 +712,32 @@ EXPORT_SYMBOL(padata_stop); static int __padata_add_cpu(struct padata_instance *pinst, int cpu) { - struct parallel_data *pd; + int err = 0; if (cpumask_test_cpu(cpu, cpu_online_mask)) { - pd = padata_alloc_pd(pinst, pinst->cpumask.pcpu, - pinst->cpumask.cbcpu); - if (!pd) - return -ENOMEM; - - padata_replace(pinst, pd); + err = padata_replace(pinst, -1); if (padata_validate_cpumask(pinst, pinst->cpumask.pcpu) && padata_validate_cpumask(pinst, pinst->cpumask.cbcpu)) __padata_start(pinst); } - return 0; + return err; } static int __padata_remove_cpu(struct padata_instance *pinst, int cpu) { - struct parallel_data *pd = NULL; + int err = 0; if (cpumask_test_cpu(cpu, cpu_online_mask)) { - if (!padata_validate_cpumask(pinst, pinst->cpumask.pcpu) || !padata_validate_cpumask(pinst, pinst->cpumask.cbcpu)) __padata_stop(pinst); - pd = padata_alloc_pd(pinst, pinst->cpumask.pcpu, - pinst->cpumask.cbcpu); - if (!pd) - return -ENOMEM; - - padata_replace(pinst, pd); - - cpumask_clear_cpu(cpu, pd->cpumask.cbcpu); - cpumask_clear_cpu(cpu, pd->cpumask.pcpu); + err = padata_replace(pinst, cpu); } - return 0; + return err; } /** @@ -798,8 +820,12 @@ static void __padata_free(struct padata_ cpuhp_state_remove_instance_nocalls(hp_online, &pinst->node); #endif + WARN_ON(!list_empty(&pinst->pslist)); + padata_stop(pinst); - padata_free_pd(pinst->pd); + free_cpumask_var(pinst->omask); + free_cpumask_var(pinst->rcpumask.cbcpu); + free_cpumask_var(pinst->rcpumask.pcpu); free_cpumask_var(pinst->cpumask.pcpu); free_cpumask_var(pinst->cpumask.cbcpu); destroy_workqueue(pinst->serial_wq); @@ -946,7 +972,6 @@ static struct padata_instance *padata_al const struct cpumask *cbcpumask) { struct padata_instance *pinst; - struct parallel_data *pd = NULL; pinst = kzalloc(sizeof(struct padata_instance), GFP_KERNEL); if (!pinst) @@ -974,14 +999,22 @@ static struct padata_instance *padata_al !padata_validate_cpumask(pinst, cbcpumask)) goto err_free_masks; - pd = padata_alloc_pd(pinst, pcpumask, cbcpumask); - if (!pd) + if (!alloc_cpumask_var(&pinst->rcpumask.pcpu, GFP_KERNEL)) goto err_free_masks; + if (!alloc_cpumask_var(&pinst->rcpumask.cbcpu, GFP_KERNEL)) + goto err_free_rcpumask_pcpu; + if (!alloc_cpumask_var(&pinst->omask, GFP_KERNEL)) + goto err_free_rcpumask_cbcpu; - rcu_assign_pointer(pinst->pd, pd); + INIT_LIST_HEAD(&pinst->pslist); cpumask_copy(pinst->cpumask.pcpu, pcpumask); cpumask_copy(pinst->cpumask.cbcpu, cbcpumask); + cpumask_and(pinst->rcpumask.pcpu, pcpumask, cpu_online_mask); + cpumask_and(pinst->rcpumask.cbcpu, cbcpumask, cpu_online_mask); + + if (padata_setup_cpumasks(pinst)) + goto err_free_omask; pinst->flags = 0; @@ -997,6 +1030,12 @@ static struct padata_instance *padata_al return pinst; +err_free_omask: + free_cpumask_var(pinst->omask); +err_free_rcpumask_cbcpu: + free_cpumask_var(pinst->rcpumask.cbcpu); +err_free_rcpumask_pcpu: + free_cpumask_var(pinst->rcpumask.pcpu); err_free_masks: free_cpumask_var(pinst->cpumask.pcpu); free_cpumask_var(pinst->cpumask.cbcpu); @@ -1035,6 +1074,61 @@ void padata_free(struct padata_instance } EXPORT_SYMBOL(padata_free); +/** + * padata_alloc_shell - Allocate and initialize padata shell. + * + * @pinst: Parent padata_instance object. + */ +struct padata_shell *padata_alloc_shell(struct padata_instance *pinst) +{ + struct parallel_data *pd; + struct padata_shell *ps; + + ps = kzalloc(sizeof(*ps), GFP_KERNEL); + if (!ps) + goto out; + + ps->pinst = pinst; + + get_online_cpus(); + pd = padata_alloc_pd(ps); + put_online_cpus(); + + if (!pd) + goto out_free_ps; + + mutex_lock(&pinst->lock); + RCU_INIT_POINTER(ps->pd, pd); + list_add(&ps->list, &pinst->pslist); + mutex_unlock(&pinst->lock); + + return ps; + +out_free_ps: + kfree(ps); +out: + return NULL; +} +EXPORT_SYMBOL(padata_alloc_shell); + +/** + * padata_free_shell - free a padata shell + * + * @ps: padata shell to free + */ +void padata_free_shell(struct padata_shell *ps) +{ + struct padata_instance *pinst = ps->pinst; + + mutex_lock(&pinst->lock); + list_del(&ps->list); + padata_free_pd(rcu_dereference_protected(ps->pd, 1)); + mutex_unlock(&pinst->lock); + + kfree(ps); +} +EXPORT_SYMBOL(padata_free_shell); + #ifdef CONFIG_HOTPLUG_CPU static __init int padata_driver_init(void)

[5.4,141/309] crypto: pcrypt - Avoid deadlock by using per-instance padata queues

Commit Message

Patch