diff mbox series

sched: fix sched_feat for !SCHED_DEBUG builds

Message ID 20171107160658.4839-1-patrick.bellasi@arm.com
State Superseded
Headers show
Series sched: fix sched_feat for !SCHED_DEBUG builds | expand

Commit Message

Patrick Bellasi Nov. 7, 2017, 4:06 p.m. UTC
When the kernel is compiled with !SCHED_DEBUG support, we expect that
all SCHED_FEAT are turned into compile time constants being propagated
to support compiler optimizations.
Specifically, we expect that code blocks like this:

   if (sched_feat(FEATURE_NAME) [&& <other_conditions>]) {
	/* FEATURE CODE */
   }

are turned into dead-code in case FEATURE_NAME defaults to FALSE, and thus
being removed by the compiler from the finale image.

For this mechanism to properly work it's required for the compiler to
have full access, from each translation unit, to whatever is the value
defined by the sched_feat macro. This macro is defined as:

   #define sched_feat(x) (sysctl_sched_features & (1UL << __SCHED_FEAT_##x))

and thus, the compiler can optimize that code only if the value of
sysctl_sched_features is visible within each translation unit.

Since:

   029632fbb sched: Make separate sched*.c translation units

the scheduler code has been split into separate translation units
however the definition of sysctl_sched_features is part of
kernel/sched/core.c while, for all the other scheduler modules, it is
visible only via kernel/sched/sched.h as an:

   extern const_debug unsigned int sysctl_sched_features

Unfortunately, an extern reference does not allow the compiler to apply
constants propagation. Thus, on !SCHED_DEBUG kernel we still end up
with code to load a memory reference and (eventually) doing an unconditional
jump of a chunk of code.

This mechanism is unavoidable when sched_features can be turned on and off at
run-time. However, this is not the case for "production" kernels compiled with
!SCHED_DEBUG. In this case, sysctl_sched_features is just a constant value
which cannot be changed at run-time and thus memory loads and jumps can be
avoided altogether.

This patch fixes the case of !SCHED_DEBUG kernel by declaring a local version
of the sysctl_sched_features constant for each translation unit. This will
ultimately allow the compiler to perform constants propagation and dead-code
pruning.

Tests have been done, with !SCHED_DEBUG on a v4.14-rc8 with and without
the patch, by running 30 iterations of:

   perf bench sched messaging --pipe --thread --group 4 --loop 50000

on a 40 cores Intel(R) Xeon(R) CPU E5-2690 v2 @ 3.00GHz using the
powersave governor to rule out variations due to frequency scaling.

Statistics on the reported completion time:

                 count     mean       std     min       99%     max
v4.14-rc8         30.0  15.7831  0.176032  15.442  16.01226  16.014
v4.14-rc8+patch   30.0  15.5033  0.189681  15.232  15.93938  15.962

show a 1.8% speedup on average completion time and 0.5% speedup in the
99 percentile.

Signed-off-by: Patrick Bellasi <patrick.bellasi@arm.com>

Signed-off-by: Chris Redpath <chris.redpath@arm.com>

Reviewed-by: Dietmar Eggemenn <dietmar.eggemann@arm.com>

Reviewed-by: Brendan Jackman <brendan.jackman@arm.com>

Cc: Ingo Molnar <mingo@redhat.com>
Cc: Peter Zijlstra <peterz@infradead.org>
Cc: linux-kernel@vger.kernel.org
---
 kernel/sched/core.c  |  9 ++++++---
 kernel/sched/sched.h | 25 ++++++++++++++++++++++---
 2 files changed, 28 insertions(+), 6 deletions(-)

-- 
2.14.1

Comments

Peter Zijlstra Nov. 7, 2017, 7:05 p.m. UTC | #1
On Tue, Nov 07, 2017 at 04:06:58PM +0000, Patrick Bellasi wrote:
> For this mechanism to properly work it's required for the compiler to

> have full access, from each translation unit, to whatever is the value

> defined by the sched_feat macro. 


> Statistics on the reported completion time:

> 

>                  count     mean       std     min       99%     max

> v4.14-rc8         30.0  15.7831  0.176032  15.442  16.01226  16.014

> v4.14-rc8+patch   30.0  15.5033  0.189681  15.232  15.93938  15.962

> 

> show a 1.8% speedup on average completion time and 0.5% speedup in the

> 99 percentile.

> 


Ack
Ingo Molnar Nov. 8, 2017, 10:03 a.m. UTC | #2
* tip-bot for Patrick Bellasi <tipbot@zytor.com> wrote:

> Commit-ID:  692ee9a79c14c9f707eeb03754a26b9427c0e005

> Gitweb:     https://git.kernel.org/tip/692ee9a79c14c9f707eeb03754a26b9427c0e005

> Author:     Patrick Bellasi <patrick.bellasi@arm.com>

> AuthorDate: Tue, 7 Nov 2017 16:06:58 +0000

> Committer:  Ingo Molnar <mingo@kernel.org>

> CommitDate: Wed, 8 Nov 2017 10:17:29 +0100

> 

> sched/core: Optimize sched_feat() for !SCHED_DEBUG builds


Hm, I noticed this too late, but lots of architecture defconfig builds now produce 
a ton of warnings:

 In file included from /home/mingo/tip/kernel/sched/idle.c:19:0:
 /home/mingo/tip/kernel/sched/sched.h:1275:33: warning: 'sysctl_sched_features' defined but not used [-Wunused-variable]
  static const_debug unsigned int sysctl_sched_features =
                                  ^

I'm dropping this patch for now.

Thanks,

	Ingo
Patrick Bellasi Nov. 8, 2017, 11:02 a.m. UTC | #3
On 08-Nov 11:03, Ingo Molnar wrote:
> 

> * tip-bot for Patrick Bellasi <tipbot@zytor.com> wrote:

> 

> > Commit-ID:  692ee9a79c14c9f707eeb03754a26b9427c0e005

> > Gitweb:     https://git.kernel.org/tip/692ee9a79c14c9f707eeb03754a26b9427c0e005

> > Author:     Patrick Bellasi <patrick.bellasi@arm.com>

> > AuthorDate: Tue, 7 Nov 2017 16:06:58 +0000

> > Committer:  Ingo Molnar <mingo@kernel.org>

> > CommitDate: Wed, 8 Nov 2017 10:17:29 +0100

> > 

> > sched/core: Optimize sched_feat() for !SCHED_DEBUG builds

> 

> Hm, I noticed this too late, but lots of architecture defconfig builds now produce 

> a ton of warnings:

> 

>  In file included from /home/mingo/tip/kernel/sched/idle.c:19:0:

>  /home/mingo/tip/kernel/sched/sched.h:1275:33: warning: 'sysctl_sched_features' defined but not used [-Wunused-variable]

>   static const_debug unsigned int sysctl_sched_features =

>                                   ^

> 

> I'm dropping this patch for now.


Ok Ingo, thanks.
Actually, my fault  I did not checked archs other then arm64 and x86_64.
I'll look into this and respin a clean new version.

> Thanks,

> 

> 	Ingo


Cheers Patrick

-- 
#include <best/regards.h>

Patrick Bellasi
Patrick Bellasi Nov. 8, 2017, 11:24 a.m. UTC | #4
On 08-Nov 11:02, Patrick Bellasi wrote:
> On 08-Nov 11:03, Ingo Molnar wrote:

> > 

> > * tip-bot for Patrick Bellasi <tipbot@zytor.com> wrote:

> > 

> > > Commit-ID:  692ee9a79c14c9f707eeb03754a26b9427c0e005

> > > Gitweb:     https://git.kernel.org/tip/692ee9a79c14c9f707eeb03754a26b9427c0e005

> > > Author:     Patrick Bellasi <patrick.bellasi@arm.com>

> > > AuthorDate: Tue, 7 Nov 2017 16:06:58 +0000

> > > Committer:  Ingo Molnar <mingo@kernel.org>

> > > CommitDate: Wed, 8 Nov 2017 10:17:29 +0100

> > > 

> > > sched/core: Optimize sched_feat() for !SCHED_DEBUG builds

> > 

> > Hm, I noticed this too late, but lots of architecture defconfig builds now produce 

> > a ton of warnings:

> > 

> >  In file included from /home/mingo/tip/kernel/sched/idle.c:19:0:

> >  /home/mingo/tip/kernel/sched/sched.h:1275:33: warning: 'sysctl_sched_features' defined but not used [-Wunused-variable]

> >   static const_debug unsigned int sysctl_sched_features =

> >                                   ^


Perhaps just using a "__maybe_unused" attribute should fix the problem.

Going to compile test with some more archs, can you tell me a
defconfig failing for you?

> > I'm dropping this patch for now.

> 

> Ok Ingo, thanks.

> Actually, my fault  I did not checked archs other then arm64 and x86_64.

> I'll look into this and respin a clean new version.

> 

> > Thanks,

> > 

> > 	Ingo

> 

> Cheers Patrick

> 

> -- 

> #include <best/regards.h>

> 

> Patrick Bellasi


-- 
#include <best/regards.h>

Patrick Bellasi
Ingo Molnar Nov. 8, 2017, 1:05 p.m. UTC | #5
* Patrick Bellasi <patrick.bellasi@arm.com> wrote:

> On 08-Nov 11:02, Patrick Bellasi wrote:

> > On 08-Nov 11:03, Ingo Molnar wrote:

> > > 

> > > * tip-bot for Patrick Bellasi <tipbot@zytor.com> wrote:

> > > 

> > > > Commit-ID:  692ee9a79c14c9f707eeb03754a26b9427c0e005

> > > > Gitweb:     https://git.kernel.org/tip/692ee9a79c14c9f707eeb03754a26b9427c0e005

> > > > Author:     Patrick Bellasi <patrick.bellasi@arm.com>

> > > > AuthorDate: Tue, 7 Nov 2017 16:06:58 +0000

> > > > Committer:  Ingo Molnar <mingo@kernel.org>

> > > > CommitDate: Wed, 8 Nov 2017 10:17:29 +0100

> > > > 

> > > > sched/core: Optimize sched_feat() for !SCHED_DEBUG builds

> > > 

> > > Hm, I noticed this too late, but lots of architecture defconfig builds now produce 

> > > a ton of warnings:

> > > 

> > >  In file included from /home/mingo/tip/kernel/sched/idle.c:19:0:

> > >  /home/mingo/tip/kernel/sched/sched.h:1275:33: warning: 'sysctl_sched_features' defined but not used [-Wunused-variable]

> > >   static const_debug unsigned int sysctl_sched_features =

> > >                                   ^

> 

> Perhaps just using a "__maybe_unused" attribute should fix the problem.


Yeah, that would be OK in this case.

> Going to compile test with some more archs, can you tell me a

> defconfig failing for you?


Alpha was one of them - don't have the logs of the other warnings anymore.

Thanks,

	Ingo
diff mbox series

Patch

diff --git a/kernel/sched/core.c b/kernel/sched/core.c
index d17c5da523a0..0edb08716555 100644
--- a/kernel/sched/core.c
+++ b/kernel/sched/core.c
@@ -42,18 +42,21 @@ 
 
 DEFINE_PER_CPU_SHARED_ALIGNED(struct rq, runqueues);
 
+#if defined(CONFIG_SCHED_DEBUG) && defined(HAVE_JUMP_LABEL)
 /*
  * Debugging: various feature bits
+ *
+ * If SCHED_DEBUG is disabled, each compilation unit has its own copy of
+ * sysctl_sched_features, defined in sched.h, to allow constants propagation
+ * at compile time and compiler optimization based on features default.
  */
-
 #define SCHED_FEAT(name, enabled)	\
 	(1UL << __SCHED_FEAT_##name) * enabled |
-
 const_debug unsigned int sysctl_sched_features =
 #include "features.h"
 	0;
-
 #undef SCHED_FEAT
+#endif
 
 /*
  * Number of tasks to iterate in a single balance run.
diff --git a/kernel/sched/sched.h b/kernel/sched/sched.h
index 3b448ba82225..e1666b3e2fb2 100644
--- a/kernel/sched/sched.h
+++ b/kernel/sched/sched.h
@@ -1219,8 +1219,6 @@  static inline void __set_task_cpu(struct task_struct *p, unsigned int cpu)
 # define const_debug const
 #endif
 
-extern const_debug unsigned int sysctl_sched_features;
-
 #define SCHED_FEAT(name, enabled)	\
 	__SCHED_FEAT_##name ,
 
@@ -1232,6 +1230,13 @@  enum {
 #undef SCHED_FEAT
 
 #if defined(CONFIG_SCHED_DEBUG) && defined(HAVE_JUMP_LABEL)
+
+/*
+ * To support run-time toggling of sched features, all the translation units
+ * (but core.c) reference the sysctl_sched_features defined in core.c.
+ */
+extern const_debug unsigned int sysctl_sched_features;
+
 #define SCHED_FEAT(name, enabled)					\
 static __always_inline bool static_branch_##name(struct static_key *key) \
 {									\
@@ -1239,13 +1244,27 @@  static __always_inline bool static_branch_##name(struct static_key *key) \
 }
 
 #include "features.h"
-
 #undef SCHED_FEAT
 
 extern struct static_key sched_feat_keys[__SCHED_FEAT_NR];
 #define sched_feat(x) (static_branch_##x(&sched_feat_keys[__SCHED_FEAT_##x]))
+
 #else /* !(SCHED_DEBUG && HAVE_JUMP_LABEL) */
+
+/*
+ * Each translation unit has its own copy of sysctl_sched_features to allow
+ * constants propagation at compile time and compiler optimization based on
+ * features default.
+ */
+#define SCHED_FEAT(name, enabled)	\
+	(1UL << __SCHED_FEAT_##name) * enabled |
+static const_debug unsigned int sysctl_sched_features =
+#include "features.h"
+	0;
+#undef SCHED_FEAT
+
 #define sched_feat(x) (sysctl_sched_features & (1UL << __SCHED_FEAT_##x))
+
 #endif /* SCHED_DEBUG && HAVE_JUMP_LABEL */
 
 extern struct static_key_false sched_numa_balancing;