From patchwork Wed Apr 28 19:36:48 2021 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: Rob Clark X-Patchwork-Id: 428640 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org X-Spam-Level: X-Spam-Status: No, score=-15.7 required=3.0 tests=BAYES_00,DKIM_SIGNED, DKIM_VALID,DKIM_VALID_AU,FREEMAIL_FORGED_FROMDOMAIN,FREEMAIL_FROM, HEADER_FROM_DIFFERENT_DOMAINS,INCLUDES_CR_TRAILER,INCLUDES_PATCH, MAILING_LIST_MULTI, SPF_HELO_NONE, SPF_PASS, USER_AGENT_GIT autolearn=ham autolearn_force=no version=3.4.0 Received: from mail.kernel.org (mail.kernel.org [198.145.29.99]) by smtp.lore.kernel.org (Postfix) with ESMTP id DEECEC433ED for ; Wed, 28 Apr 2021 19:33:35 +0000 (UTC) Received: from vger.kernel.org (vger.kernel.org [23.128.96.18]) by mail.kernel.org (Postfix) with ESMTP id B359061186 for ; Wed, 28 Apr 2021 19:33:35 +0000 (UTC) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S243499AbhD1TeT (ORCPT ); Wed, 28 Apr 2021 15:34:19 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:45058 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S243990AbhD1TeT (ORCPT ); Wed, 28 Apr 2021 15:34:19 -0400 Received: from mail-pj1-x1031.google.com (mail-pj1-x1031.google.com [IPv6:2607:f8b0:4864:20::1031]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 990DBC061573; Wed, 28 Apr 2021 12:33:32 -0700 (PDT) Received: by mail-pj1-x1031.google.com with SMTP id t2-20020a17090a0242b0290155433387beso3652060pje.1; Wed, 28 Apr 2021 12:33:32 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=from:to:cc:subject:date:message-id:in-reply-to:references :mime-version:content-transfer-encoding; bh=tHyKA/c627E+DwGyeUfIZ1QItPIfz0VDyJzjzlVcfiM=; b=pID33Lw/tKPo6F7e2OUh0qUiAHOKaPkeMJJBqyKac+TpqtuROmy6A+2Cn4i7xmNuHo 11gZvwbHI9mcQqgMqQiVt/gzsTmk77n+grXj0DZvvVIiaN2pTNohh8uJKj4VF19YtgWK SavFQPTQXlF2MEOzCoKZZ4Kb2ep2jIVnQczHSstnYgQUu84jcFbnVK/TsC9A1SzVkh8W 0zV2uDyKv0WZOzWVvln0/fAf3ZnK1L3EQe17k8l1C+6zObJDrdKLZWqykSO/qn2E96Al pELUG+S3eUIsIN8WHWl+cbPNV2D+N2ld/f1wGwIWUPbxj35BQwFzNJatW8pEFhavki49 AXjg== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:from:to:cc:subject:date:message-id:in-reply-to :references:mime-version:content-transfer-encoding; bh=tHyKA/c627E+DwGyeUfIZ1QItPIfz0VDyJzjzlVcfiM=; b=UftnSU9xrjYwiFnOpHeMclnDBhjms6ZlQer7lUOzegNvRpZ4IziujaeV4hGkm8axcC NwVWy2xMkruap51mgw7OIMcvXhNoie0a928qV1+t87GAEGjoQ3WP4IGO0+57KozWWvpS mjt0VD2dsWAfCKraX6efmx2gX65tyNOZPmQCD5wFl5to1VQ8eOCNQeJwosi9TcfZ01ci Ego6+dBt+kdbJ/v3gqHICV7EIIfm74J3YgsJIjuvLE+QIqGY4TdpjppXfOg5EL9EujiX uVJFTKM5AoKlEVVPCDLe2/wcNjnFLpyhPM+UXT5d2Lm+dWLT181ZbXlkiFfqRyLaHFPl UChg== X-Gm-Message-State: AOAM5302GrtO0UrN5dV86bQmCo6UL6D6cIf1ny3XZ5CzfSaaSOg3Vk0K m9tTym4mLc4iQJD3MartHnA= X-Google-Smtp-Source: ABdhPJxaQuJh3H+yrpkJC7lrVgzdxfKzMBDfOSlK7PNpjK0FjYtIdAYiv9DBO7Mzm9706DDfVRLATg== X-Received: by 2002:a17:903:248e:b029:ec:b399:7d75 with SMTP id p14-20020a170903248eb02900ecb3997d75mr30868529plw.35.1619638412162; Wed, 28 Apr 2021 12:33:32 -0700 (PDT) Received: from localhost (c-73-25-156-94.hsd1.or.comcast.net. [73.25.156.94]) by smtp.gmail.com with ESMTPSA id r3sm414934pfl.159.2021.04.28.12.33.31 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Wed, 28 Apr 2021 12:33:31 -0700 (PDT) From: Rob Clark To: dri-devel@lists.freedesktop.org Cc: Jordan Crouse , Rob Clark , Rob Clark , Sean Paul , David Airlie , Daniel Vetter , AngeloGioacchino Del Regno , Konrad Dybcio , "Kristian H. Kristensen" , Marijn Suijten , Jonathan Marek , Akhil P Oommen , Eric Anholt , Sai Prakash Ranjan , Sharat Masetty , Douglas Anderson , Dave Airlie , linux-arm-msm@vger.kernel.org (open list:DRM DRIVER FOR MSM ADRENO GPU), freedreno@lists.freedesktop.org (open list:DRM DRIVER FOR MSM ADRENO GPU), linux-kernel@vger.kernel.org (open list) Subject: [PATCH 1/2] drm/msm: Handle ringbuffer overflow Date: Wed, 28 Apr 2021 12:36:48 -0700 Message-Id: <20210428193654.1498482-2-robdclark@gmail.com> X-Mailer: git-send-email 2.30.2 In-Reply-To: <20210428193654.1498482-1-robdclark@gmail.com> References: <20210428193654.1498482-1-robdclark@gmail.com> MIME-Version: 1.0 Precedence: bulk List-ID: X-Mailing-List: linux-arm-msm@vger.kernel.org From: Rob Clark Currently if userspace manages to fill up the ring faster than the GPU can consume we (a) spin for up to 1sec, and then (b) overwrite the ringbuffer contents from previous submits that the GPU is still busy executing. Which predictably goes rather badly. Instead, just skip flushing (updating WPTR) and reset ring->next back to where it was before we tried writing the submit into the ringbuffer, and return an error to userspace (which can then try again). Signed-off-by: Rob Clark --- drivers/gpu/drm/msm/adreno/a5xx_gpu.c | 3 +++ drivers/gpu/drm/msm/adreno/a6xx_gpu.c | 3 +++ drivers/gpu/drm/msm/adreno/adreno_gpu.c | 24 +++++++++++++++++- drivers/gpu/drm/msm/msm_gem_submit.c | 7 +++++- drivers/gpu/drm/msm/msm_gpu.c | 33 +++++++++++++++++++++++-- drivers/gpu/drm/msm/msm_gpu.h | 2 +- drivers/gpu/drm/msm/msm_ringbuffer.h | 5 ++++ 7 files changed, 72 insertions(+), 5 deletions(-) diff --git a/drivers/gpu/drm/msm/adreno/a5xx_gpu.c b/drivers/gpu/drm/msm/adreno/a5xx_gpu.c index ce13d49e615b..0c8faad3b328 100644 --- a/drivers/gpu/drm/msm/adreno/a5xx_gpu.c +++ b/drivers/gpu/drm/msm/adreno/a5xx_gpu.c @@ -36,6 +36,9 @@ void a5xx_flush(struct msm_gpu *gpu, struct msm_ringbuffer *ring, OUT_RING(ring, upper_32_bits(shadowptr(a5xx_gpu, ring))); } + if (unlikely(ring->overflow)) + return; + spin_lock_irqsave(&ring->preempt_lock, flags); /* Copy the shadow to the actual register */ diff --git a/drivers/gpu/drm/msm/adreno/a6xx_gpu.c b/drivers/gpu/drm/msm/adreno/a6xx_gpu.c index d553f62f4eeb..4a4728a774c0 100644 --- a/drivers/gpu/drm/msm/adreno/a6xx_gpu.c +++ b/drivers/gpu/drm/msm/adreno/a6xx_gpu.c @@ -68,6 +68,9 @@ static void a6xx_flush(struct msm_gpu *gpu, struct msm_ringbuffer *ring) OUT_RING(ring, upper_32_bits(shadowptr(a6xx_gpu, ring))); } + if (unlikely(ring->overflow)) + return; + spin_lock_irqsave(&ring->preempt_lock, flags); /* Copy the shadow to the actual register */ diff --git a/drivers/gpu/drm/msm/adreno/adreno_gpu.c b/drivers/gpu/drm/msm/adreno/adreno_gpu.c index 0f184c3dd9d9..a658777e07b1 100644 --- a/drivers/gpu/drm/msm/adreno/adreno_gpu.c +++ b/drivers/gpu/drm/msm/adreno/adreno_gpu.c @@ -467,6 +467,9 @@ void adreno_flush(struct msm_gpu *gpu, struct msm_ringbuffer *ring, u32 reg) { uint32_t wptr; + if (unlikely(ring->overflow)) + return; + /* Copy the shadow to the actual register */ ring->cur = ring->next; @@ -788,12 +791,31 @@ static uint32_t ring_freewords(struct msm_ringbuffer *ring) return (rptr + (size - 1) - wptr) % size; } +static bool space_avail(struct msm_ringbuffer *ring, uint32_t ndwords) +{ + if (ring_freewords(ring) >= ndwords) + return true; + + /* We don't have a good way to know in general when the RPTR has + * advanced.. newer things that use CP_WHERE_AM_I to update the + * shadow rptr could possibly insert a packet to generate an irq. + * But that doesn't cover older GPUs. But if the ringbuffer is + * full, it could take a while before it is empty again, so just + * insert a blind sleep to avoid a busy loop. + */ + msleep(1); + + return false; +} + void adreno_wait_ring(struct msm_ringbuffer *ring, uint32_t ndwords) { - if (spin_until(ring_freewords(ring) >= ndwords)) + if (spin_until(space_avail(ring, ndwords))) { DRM_DEV_ERROR(ring->gpu->dev->dev, "timeout waiting for space in ringbuffer %d\n", ring->id); + ring->overflow = true; + } } /* Get legacy powerlevels from qcom,gpu-pwrlevels and populate the opp table */ diff --git a/drivers/gpu/drm/msm/msm_gem_submit.c b/drivers/gpu/drm/msm/msm_gem_submit.c index 5480852bdeda..4bc669460fda 100644 --- a/drivers/gpu/drm/msm/msm_gem_submit.c +++ b/drivers/gpu/drm/msm/msm_gem_submit.c @@ -683,6 +683,9 @@ int msm_ioctl_gem_submit(struct drm_device *dev, void *data, submitid = atomic_inc_return(&ident) - 1; ring = gpu->rb[queue->prio]; + + GEM_WARN_ON(ring->overflow); + trace_msm_gpu_submit(pid_nr(pid), ring->id, submitid, args->nr_bos, args->nr_cmds); @@ -829,7 +832,9 @@ int msm_ioctl_gem_submit(struct drm_device *dev, void *data, } } - msm_gpu_submit(gpu, submit); + ret = msm_gpu_submit(gpu, submit); + if (ret) + goto out; args->fence = submit->fence->seqno; diff --git a/drivers/gpu/drm/msm/msm_gpu.c b/drivers/gpu/drm/msm/msm_gpu.c index ab7c167b0623..7655ad9108c8 100644 --- a/drivers/gpu/drm/msm/msm_gpu.c +++ b/drivers/gpu/drm/msm/msm_gpu.c @@ -787,7 +787,7 @@ void msm_gpu_retire(struct msm_gpu *gpu) } /* add bo's to gpu's ring, and kick gpu: */ -void msm_gpu_submit(struct msm_gpu *gpu, struct msm_gem_submit *submit) +int msm_gpu_submit(struct msm_gpu *gpu, struct msm_gem_submit *submit) { struct drm_device *dev = gpu->dev; struct msm_drm_private *priv = dev->dev_private; @@ -834,9 +834,38 @@ void msm_gpu_submit(struct msm_gpu *gpu, struct msm_gem_submit *submit) spin_unlock(&ring->submit_lock); gpu->funcs->submit(gpu, submit); - priv->lastctx = submit->queue->ctx; hangcheck_timer_reset(gpu); + + if (unlikely(ring->overflow)) { + /* + * Reset the ptr back to before the submit, so the GPU + * doesn't see a partial submit: + */ + ring->next = ring->cur; + + /* + * Clear the overflow flag, hopefully the next submit on + * the ring actually fits + */ + ring->overflow = false; + + /* + * One might be tempted to remove the submit from the + * submits list, and drop it's reference (and drop the + * active reference for all the bos). But we can't + * really signal the fence attached to obj->resv without + * disturbing other fences on the timeline. So instead + * just leave it and let it retire normally when a + * later submit completes. + */ + + return -ENOSPC; + } + + priv->lastctx = submit->queue->ctx; + + return 0; } /* diff --git a/drivers/gpu/drm/msm/msm_gpu.h b/drivers/gpu/drm/msm/msm_gpu.h index d7cd02cd2109..2dd2ef1f8328 100644 --- a/drivers/gpu/drm/msm/msm_gpu.h +++ b/drivers/gpu/drm/msm/msm_gpu.h @@ -302,7 +302,7 @@ int msm_gpu_perfcntr_sample(struct msm_gpu *gpu, uint32_t *activetime, uint32_t *totaltime, uint32_t ncntrs, uint32_t *cntrs); void msm_gpu_retire(struct msm_gpu *gpu); -void msm_gpu_submit(struct msm_gpu *gpu, struct msm_gem_submit *submit); +int msm_gpu_submit(struct msm_gpu *gpu, struct msm_gem_submit *submit); int msm_gpu_init(struct drm_device *drm, struct platform_device *pdev, struct msm_gpu *gpu, const struct msm_gpu_funcs *funcs, diff --git a/drivers/gpu/drm/msm/msm_ringbuffer.h b/drivers/gpu/drm/msm/msm_ringbuffer.h index fe55d4a1aa16..d8ad9818c389 100644 --- a/drivers/gpu/drm/msm/msm_ringbuffer.h +++ b/drivers/gpu/drm/msm/msm_ringbuffer.h @@ -40,6 +40,8 @@ struct msm_ringbuffer { struct drm_gem_object *bo; uint32_t *start, *end, *cur, *next; + bool overflow; + /* * List of in-flight submits on this ring. Protected by submit_lock. */ @@ -69,6 +71,9 @@ void msm_ringbuffer_destroy(struct msm_ringbuffer *ring); static inline void OUT_RING(struct msm_ringbuffer *ring, uint32_t data) { + if (ring->overflow) + return; + /* * ring->next points to the current command being written - it won't be * committed as ring->cur until the flush