[v3,18/28] tcg: Tidy tcg_n_regions

Message ID	20210502231844.1977630-19-richard.henderson@linaro.org
State	Superseded
Headers	show Delivered-To: patch@linaro.org Received-SPF: pass (google.com: domain of qemu-devel-bounces+patch=linaro.org@nongnu.org designates 209.51.188.17 as permitted sender) client-ip=209.51.188.17; From: Richard Henderson <richard.henderson@linaro.org> To: qemu-devel@nongnu.org Subject: [PATCH v3 18/28] tcg: Tidy tcg_n_regions Date: Sun, 2 May 2021 16:18:34 -0700 Message-Id: <20210502231844.1977630-19-richard.henderson@linaro.org> In-Reply-To: <20210502231844.1977630-1-richard.henderson@linaro.org> References: <20210502231844.1977630-1-richard.henderson@linaro.org> MIME-Version: 1.0 Content-Transfer-Encoding: 8bit Received-SPF: pass client-ip=2607:f8b0:4864:20::1032; envelope-from=richard.henderson@linaro.org; helo=mail-pj1-x1032.google.com X-Spam_score_int: -20 X-Spam_score: -2.1 X-Spam_bar: -- X-Spam_report: (-2.1 / 5.0 requ) BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, RCVD_IN_DNSWL_NONE=-0.0001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001 autolearn=ham autolearn_force=no X-Spam_action: no action Precedence: list Errors-To: qemu-devel-bounces+patch=linaro.org@nongnu.org Sender: "Qemu-devel" <qemu-devel-bounces+patch=linaro.org@nongnu.org>
Series	tcg: Clean up code_gen_buffer allocation \| expand [v3,00/28] tcg: Clean up code_gen_buffer allocation [v3,01/28] meson: Split out tcg/meson.build [v3,02/28] meson: Split out fpu/meson.build [v3,03/28] tcg: Re-order tcg_region_init vs tcg_prologue_init [v3,04/28] tcg: Remove error return from tcg_region_initial_alloc__locked [v3,05/28] tcg: Split out tcg_region_initial_alloc [v3,06/28] tcg: Split out tcg_region_prologue_set [v3,07/28] tcg: Split out region.c [v3,08/28] accel/tcg: Inline cpu_gen_init [v3,09/28] accel/tcg: Move alloc_code_gen_buffer to tcg/region.c [v3,10/28] accel/tcg: Rename tcg_init to tcg_init_machine [v3,11/28] tcg: Create tcg_init [v3,12/28] accel/tcg: Merge tcg_exec_init into tcg_init_machine [v3,13/28] accel/tcg: Pass down max_cpus to tcg_init [v3,14/28] tcg: Introduce tcg_max_ctxs [v3,15/28] tcg: Move MAX_CODE_GEN_BUFFER_SIZE to tcg-target.h [v3,16/28] tcg: Replace region.end with region.total_size [v3,17/28] tcg: Rename region.start to region.after_prologue [v3,18/28] tcg: Tidy tcg_n_regions [v3,19/28] tcg: Tidy split_cross_256mb [v3,20/28] tcg: Move in_code_gen_buffer and tests to region.c [v3,21/28] tcg: Allocate code_gen_buffer into struct tcg_region_state [v3,22/28] tcg: Return the map protection from alloc_code_gen_buffer [v3,23/28] tcg: Sink qemu_madvise call to common code [v3,24/28] util/osdep: Add qemu_mprotect_rw [v3,25/28] tcg: Round the tb_size default from qemu_get_host_physmem [v3,26/28] tcg: Merge buffer protection and guard page protection [v3,27/28] tcg: When allocating for !splitwx, begin with PROT_NONE [v3,28/28] tcg: Move tcg_init_ctx and tcg_ctx from accel/tcg/

Message ID

20210502231844.1977630-19-richard.henderson@linaro.org

State

Superseded

Headers

Received-SPF: pass (google.com: domain of
	qemu-devel-bounces+patch=linaro.org@nongnu.org designates
	209.51.188.17 as permitted sender) client-ip=209.51.188.17; 
From: Richard Henderson <richard.henderson@linaro.org>
To: qemu-devel@nongnu.org
Subject: [PATCH v3 18/28] tcg: Tidy tcg_n_regions
Date: Sun,  2 May 2021 16:18:34 -0700
Message-Id: <20210502231844.1977630-19-richard.henderson@linaro.org>
In-Reply-To: <20210502231844.1977630-1-richard.henderson@linaro.org>
References: <20210502231844.1977630-1-richard.henderson@linaro.org>
MIME-Version: 1.0
Content-Transfer-Encoding: 8bit
Received-SPF: pass client-ip=2607:f8b0:4864:20::1032;
	envelope-from=richard.henderson@linaro.org;
	helo=mail-pj1-x1032.google.com
X-Spam_score_int: -20
X-Spam_score: -2.1
X-Spam_bar: --
X-Spam_report: (-2.1 / 5.0 requ) BAYES_00=-1.9, DKIM_SIGNED=0.1,
	DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1,
	RCVD_IN_DNSWL_NONE=-0.0001, SPF_HELO_NONE=0.001,
	SPF_PASS=-0.001 autolearn=ham autolearn_force=no
X-Spam_action: no action
X-BeenThere: qemu-devel@nongnu.org
X-Mailman-Version: 2.1.23
Precedence: list
List-Id: <qemu-devel.nongnu.org>
List-Unsubscribe: <https://lists.nongnu.org/mailman/options/qemu-devel>,
	<mailto:qemu-devel-request@nongnu.org?subject=unsubscribe>
List-Archive: <https://lists.nongnu.org/archive/html/qemu-devel>
List-Post: <mailto:qemu-devel@nongnu.org>
List-Help: <mailto:qemu-devel-request@nongnu.org?subject=help>
List-Subscribe: <https://lists.nongnu.org/mailman/listinfo/qemu-devel>,
	<mailto:qemu-devel-request@nongnu.org?subject=subscribe>
Errors-To: qemu-devel-bounces+patch=linaro.org@nongnu.org
Sender: "Qemu-devel" <qemu-devel-bounces+patch=linaro.org@nongnu.org>

Series

tcg: Clean up code_gen_buffer allocation | expand

Commit Message

Richard Henderson May 2, 2021, 11:18 p.m. UTC

Compute the value using straight division and bounds,
rather than a loop.  Pass in tb_size rather than reading
from tcg_init_ctx.code_gen_buffer_size,

Signed-off-by: Richard Henderson <richard.henderson@linaro.org>

---
 tcg/region.c | 29 ++++++++++++-----------------
 1 file changed, 12 insertions(+), 17 deletions(-)

-- 
2.25.1

Comments

Alex Bennée June 8, 2021, 4:06 p.m. UTC | #1

Richard Henderson <richard.henderson@linaro.org> writes:

> Compute the value using straight division and bounds,

> rather than a loop.  Pass in tb_size rather than reading

> from tcg_init_ctx.code_gen_buffer_size,

>

> Signed-off-by: Richard Henderson <richard.henderson@linaro.org>

> ---

>  tcg/region.c | 29 ++++++++++++-----------------

>  1 file changed, 12 insertions(+), 17 deletions(-)

>

> diff --git a/tcg/region.c b/tcg/region.c

> index bd81b35359..b44246e1aa 100644

> --- a/tcg/region.c

> +++ b/tcg/region.c

> @@ -363,38 +363,33 @@ void tcg_region_reset_all(void)

>      tcg_region_tree_reset_all();

>  }

>  

> -static size_t tcg_n_regions(unsigned max_cpus)

> +static size_t tcg_n_regions(size_t tb_size, unsigned max_cpus)

>  {

>  #ifdef CONFIG_USER_ONLY

>      return 1;

>  #else

> +    size_t n_regions;

> +

>      /*

>       * It is likely that some vCPUs will translate more code than others,

>       * so we first try to set more regions than max_cpus, with those regions

>       * being of reasonable size. If that's not possible we make do by evenly

>       * dividing the code_gen_buffer among the vCPUs.

>       */

> -    size_t i;

> -

>      /* Use a single region if all we have is one vCPU thread */

>      if (max_cpus == 1 || !qemu_tcg_mttcg_enabled()) {

>          return 1;

>      }

>  

> -    /* Try to have more regions than max_cpus, with each region being >= 2 MB */

> -    for (i = 8; i > 0; i--) {

> -        size_t regions_per_thread = i;

> -        size_t region_size;

> -

> -        region_size = tcg_init_ctx.code_gen_buffer_size;

> -        region_size /= max_cpus * regions_per_thread;

> -

> -        if (region_size >= 2 * 1024u * 1024) {

> -            return max_cpus * regions_per_thread;

> -        }

> +    /*

> +     * Try to have more regions than max_cpus, with each region being >= 2 MB.

> +     * If we can't, then just allocate one region per vCPU thread.

> +     */

> +    n_regions = tb_size / (2 * MiB);

> +    if (n_regions <= max_cpus) {

> +        return max_cpus;

>      }

> -    /* If we can't, then just allocate one region per vCPU thread */

> -    return max_cpus;

> +    return MIN(n_regions, max_cpus * 8);

>  #endif

>  }


This is so much easier to follow now ;-)

Reviewed-by: Alex Bennée <alex.bennee@linaro.org>


-- 
Alex Bennée

Luis Fernando Fujita Pires June 9, 2021, 2:58 p.m. UTC | #2

From: Richard Henderson <richard.henderson@linaro.org>

> Compute the value using straight division and bounds, rather than a loop.  Pass

> in tb_size rather than reading from tcg_init_ctx.code_gen_buffer_size,

> 

> Signed-off-by: Richard Henderson <richard.henderson@linaro.org>

> ---

>  tcg/region.c | 29 ++++++++++++-----------------

>  1 file changed, 12 insertions(+), 17 deletions(-)


Reviewed-by: Luis Pires <luis.pires@eldorado.org.br>


--
Luis Pires
Instituto de Pesquisas ELDORADO
Aviso Legal - Disclaimer <https://www.eldorado.org.br/disclaimer.html>

diff --git a/tcg/region.c b/tcg/region.c
index bd81b35359..b44246e1aa 100644
--- a/tcg/region.c
+++ b/tcg/region.c
@@ -363,38 +363,33 @@  void tcg_region_reset_all(void)
     tcg_region_tree_reset_all();
 }
 
-static size_t tcg_n_regions(unsigned max_cpus)
+static size_t tcg_n_regions(size_t tb_size, unsigned max_cpus)
 {
 #ifdef CONFIG_USER_ONLY
     return 1;
 #else
+    size_t n_regions;
+
     /*
      * It is likely that some vCPUs will translate more code than others,
      * so we first try to set more regions than max_cpus, with those regions
      * being of reasonable size. If that's not possible we make do by evenly
      * dividing the code_gen_buffer among the vCPUs.
      */
-    size_t i;
-
     /* Use a single region if all we have is one vCPU thread */
     if (max_cpus == 1 || !qemu_tcg_mttcg_enabled()) {
         return 1;
     }
 
-    /* Try to have more regions than max_cpus, with each region being >= 2 MB */
-    for (i = 8; i > 0; i--) {
-        size_t regions_per_thread = i;
-        size_t region_size;
-
-        region_size = tcg_init_ctx.code_gen_buffer_size;
-        region_size /= max_cpus * regions_per_thread;
-
-        if (region_size >= 2 * 1024u * 1024) {
-            return max_cpus * regions_per_thread;
-        }
+    /*
+     * Try to have more regions than max_cpus, with each region being >= 2 MB.
+     * If we can't, then just allocate one region per vCPU thread.
+     */
+    n_regions = tb_size / (2 * MiB);
+    if (n_regions <= max_cpus) {
+        return max_cpus;
     }
-    /* If we can't, then just allocate one region per vCPU thread */
-    return max_cpus;
+    return MIN(n_regions, max_cpus * 8);
 #endif
 }
 
@@ -828,7 +823,7 @@  void tcg_region_init(size_t tb_size, int splitwx, unsigned max_cpus)
     buf = tcg_init_ctx.code_gen_buffer;
     total_size = tcg_init_ctx.code_gen_buffer_size;
     page_size = qemu_real_host_page_size;
-    n_regions = tcg_n_regions(max_cpus);
+    n_regions = tcg_n_regions(total_size, max_cpus);
 
     /* The first region will be 'aligned - buf' bytes larger than the others */
     aligned = QEMU_ALIGN_PTR_UP(buf, page_size);

[v3,18/28] tcg: Tidy tcg_n_regions

Commit Message

Comments

Patch