Re: [RFC PATCH v2 0/7] Tunable sched_mc_power_savings=n

Previous message: [thread] [date] [author]
Next message: [thread] [date] [author]
From: Peter Zijlstra
Date: Monday, September 8, 2008 - 11:18 pm

On Mon, 2008-09-08 at 18:20 -0700, Suresh Siddha wrote:

Sure, and those comments only tell me what it does, not why it does it.
Also, 1/6-th is still a significant amount.


But then why not add a domain level that represents the packages and
power schedule on that? That way you dont have to change the domain
structure depending on the load balance goals.

Also, that justification is just wrong - AMD has similar constructs in
its cpus, and god knows what other architectures do, so hiding this in
arch/x86 is wrong too.


Yes I understand, but we really need to do something about it - and
everybody with interests in it should be looking at ways to reduce the
nightmare, not add to it.


You and me both then :-)

I've been looking at the history of that function - it started out quite
readable - but has, over the years, grown into a monstrosity.

One of the things I'm currently looking at is getting rid of the
small_imbalance stuff - that made sense when we balanced based on
nr_running, but now that we're weight based and willing to over-balance
a little I'm not sure that it still does.

Also, it looks to me that Christoph Lameters fix
0a2966b48fb784e437520e400ddc94874ddbd4e8 introduced a bug. By skipping
the cpu in the group accumulation the group average computation below
needs to be adjusted, because it uses the group power stuff, which
assumes all cpus contributed.

Then there is this whole sched_group stuff, which I intent to have a
hard look at, afaict its unneeded and we can iterate over the
sub-domains just as well.

Finally, we should move all this stuff into sched_fair and get rid of
that iterator interface and fix up all nr_running etc.. usages to refer
to cfs.nr_running and similar.

Then there is the idea Andi proposed, splitting up the performance and
power balancer into two separate functions, something that is worth
looking into imho.

--
Previous message: [thread] [date] [author]
Next message: [thread] [date] [author]

Messages in current thread:
[RFC PATCH v2 0/7] Tunable sched_mc_power_savings=n, Vaidyanathan Srinivasan, (Mon Sep 8, 6:14 am)
[RFC PATCH v2 1/7] sched: arch_reinit_sched_domains() must ..., Vaidyanathan Srinivasan, (Mon Sep 8, 6:16 am)
[RFC PATCH v2 2/7] sched: Fix __load_balance_iterator() fo ..., Vaidyanathan Srinivasan, (Mon Sep 8, 6:17 am)
[RFC PATCH v2 3/7] sched: Framework for sched_mc/smt_power ..., Vaidyanathan Srinivasan, (Mon Sep 8, 6:18 am)
[RFC PATCH v2 4/7] sched: favour lower logical cpu number ..., Vaidyanathan Srinivasan, (Mon Sep 8, 6:20 am)
[RFC PATCH v2 5/7] sched: nominate preferred wakeup cpu, Vaidyanathan Srinivasan, (Mon Sep 8, 6:21 am)
[RFC PATCH v2 6/7] sched: bias task wakeups to preferred s ..., Vaidyanathan Srinivasan, (Mon Sep 8, 6:22 am)
[RFC PATCH v2 7/7] sched: activate active load balancing i ..., Vaidyanathan Srinivasan, (Mon Sep 8, 6:23 am)
Re: [RFC PATCH v2 0/7] Tunable sched_mc_power_savings=n, Peter Zijlstra, (Mon Sep 8, 6:25 am)
Re: [RFC PATCH v2 5/7] sched: nominate preferred wakeup cpu, Vaidyanathan Srinivasan, (Mon Sep 8, 6:43 am)
Re: [RFC PATCH v2 0/7] Tunable sched_mc_power_savings=n, Vaidyanathan Srinivasan, (Mon Sep 8, 6:48 am)
Re: [RFC PATCH v2 0/7] Tunable sched_mc_power_savings=n, Peter Zijlstra, (Mon Sep 8, 6:56 am)
Re: [RFC PATCH v2 0/7] Tunable sched_mc_power_savings=n, Suresh Siddha, (Mon Sep 8, 6:20 pm)
Re: [RFC PATCH v2 0/7] Tunable sched_mc_power_savings=n, Peter Zijlstra, (Mon Sep 8, 11:18 pm)
Re: [RFC PATCH v2 0/7] Tunable sched_mc_power_savings=n, Peter Zijlstra, (Mon Sep 8, 11:54 pm)
Re: [RFC PATCH v2 0/7] Tunable sched_mc_power_savings=n, Peter Zijlstra, (Tue Sep 9, 1:25 am)
Re: [RFC PATCH v2 0/7] Tunable sched_mc_power_savings=n, Vaidyanathan Srinivasan, (Wed Sep 10, 6:45 am)