linux-kernel mailing list

FromSubjectsort iconDate
Trent Piepho
[PATCH] sysfs: Make dir and name args to sysfs_notify() const
Because they can be, and because code like this produces a warning if they're not: struct device_attribute dev_attr; sysfs_notify(&kobj, NULL, dev_attr.attr.name); Signed-off-by: Trent Piepho <tpiepho@freescale.com> CC: NeilBrown <neilb@suse.de> --- fs/sysfs/file.c | 2 +- include/linux/sysfs.h | 5 +++-- 2 files changed, 4 insertions(+), 3 deletions(-) diff --git a/fs/sysfs/file.c b/fs/sysfs/file.c index c9e4e50..cabfd0f 100644 --- a/fs/sysfs/file.c +++ ...
Sep 25, 4:45 pm 2008
Milton Miller
Re: [PATCH HACK] powerpc: quick hack to get a functional ...
No, not an oversight. The point is, don't mask/unmask between ack/eoi while handling the interrupt. For many irq controllers, the eoi must be done from the same cpu, hence the mask and eoi before actually handling the interrupt in the general case. Its a feature of xics that we don't have to play that game, but can do the "PowerPC External Interrupt Architecture" is defined in appendix A of "Power.org™ Standard for Power Architecture™ Platform Requirements (Workstation, Server)", ...
Sep 25, 4:40 pm 2008
Greg KH
[PATCH] Input: add mmio xi driver
From: Greg Kroah-Hartman <gregkh@suse.de> This patch adds the Mimio Xi interactive whiteboard driver to the tree. It was originally written by mwilder@cs.nmsu.edu, but cleaned up and forward ported by me to the latest kernel version. Cc: Phil Hannent <phil@hannent.co.uk> Cc: <mwilder@cs.nmsu.edu> Signed-off-by: Greg Kroah-Hartman <gregkh@suse.de> --- drivers/input/misc/Kconfig | 11 drivers/input/misc/Makefile | 1 drivers/input/misc/mimio.c | 913 ...
Sep 25, 4:22 pm 2008
Andres Freund
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM
--Boundary-01=_HFB3I9mohiun9Ep Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: quoted-printable Content-Disposition: inline Hi, On Friday 26 September 2008, you wrote in "Re: [Bug #11382] e1000e: 2.6.27-= rc1=20 I attached the current ubuntu 2.6.27 .config for 64bit x86. As another datapoint, my system wasnt effected although having loaded e1000= e=20 both with the ubuntu (without PAT) and the vanilla kernel (PAT enabled). I did not run the -32bit version on ...
Sep 25, 3:31 pm 2008
Roland Dreier
[PATCH] IPoIB: Fix crash when path record fails after pa ...
From: Roland Dreier <rolandd@cisco.com> Commit ee1e2c82 ("IPoIB: Refresh paths instead of flushing them on SM change events") changed how paths are flushed on an SM event. This change introduces a problem if the path record query triggered by fails, causing path->ah to become NULL. A later successful path query will then trigger WARN_ON() in path_rec_completion(), and crash because path->ah has already been freed, so the ipoib_put_ah() inside the lock in path_rec_completion() may actually ...
Sep 25, 3:28 pm 2008
Michael Davidson
[PATCH] x86: TSC resync
This patch is a slightly cleaned up version of one which has been in use at for some time now at Google. It uses an HPET based time source to resynchronize the TSC on systems where it would otherwise be unsynchronized - eg early AMD Opteron based systems where the TSC rate drifts when going in and out of the C1E halt state. While the approach is quite crude it has been effective for systems where user space code relies on the TSC advancing at a constant rate and being reasonably well ...
Sep 25, 3:02 pm 2008
stephane eranian
perfmon3 interface overview
Hello, A few months ago, I started posting on this list a highly simplified version of the perfmon2 version which was providing only per-thread counting on X86 processors. The feedback I got was generally positive but people raised two more issues: - too many system calls for a single subsystem (12 calls) - how to ensure syscalls could be extended without necessarily adding new ones. Since then, I have been working hard on addressing those two issues, in other words ...
Sep 25, 2:48 pm 2008
james toy
20080923-mmotm arch/x86/kvm/built-in.o: In function `kvm ...
Noticed this break with virtualization enabled on the latest mmotm. CC net/wireless/util.o CC [M] net/sunrpc/auth_gss/gss_krb5_unseal.o CC [M] drivers/usb/storage/freecom.o CC [M] net/sunrpc/auth_gss/gss_krb5_seqnum.o CC [M] drivers/usb/storage/dpcm.o CC net/wireless/reg.o CC [M] net/sunrpc/auth_gss/gss_krb5_wrap.o CC [M] drivers/usb/storage/isd200.o CC [M] net/sunrpc/auth_gss/gss_krb5_crypto.o CC net/wireless/nl80211.o CC [M] ...
Sep 25, 2:29 pm 2008
Singaravelan Nallasellan
Acceptable Interrupt Latency in Linux kernel
What is the tolerable interrupt latency for the Linux interrupt handler? I need to write a driver which will require to invoke a function from other driver to read the hardware register. That function will send a message to the firmware in the some other hardware. The firmware will go into a bus to read the register and return the value. This firmware generates an interrupt of the completion of the register read. So the whole process of register read will take time to know the interrupt ...
Sep 25, 2:12 pm 2008
BERTRAND Joel
2.6.26.x hangs on amd64/smp
Hello, System : debian/testing, tested kernels 2.6.26, 2.6.26.3, 2.6.26.5. Hardware : core2duo, 4 GB, raid1 software, CFQ scheduler. I have written a program that work on cartographic data. This program is started as a daemon and does some fork() (and pthread_create()). I have seen that it requires 6 GB to work, each process takes 1,5 GB. The same program works fine under FreeBSD or Solaris (on of course the same hardware). When it starts, I can see disk activity (swap), and after 2 ...
Sep 25, 2:02 pm 2008
Ma, Chinang
RE: 2.6.27-rc5 OLTP performance regression
We also tested sched_shares_ratelimit = 100 ms and reduce the OLTP regression to 0.2%. (very small return in performance with 10x the value). Can we count on this sched_shares_ratelimit tunable be always accessible for workload that does not depend on group fairness? Chinang -----Original Message----- From: Ma, Chinang Sent: Wednesday, September 10, 2008 5:08 PM To: Ingo Molnar Cc: Peter Zijlstra; Srivatsa Vaddagiri; Mike Galbraith; Gregory Haskins; Steven Rostedt; Nick Piggin; Siddha, Suresh ...
Sep 25, 2:04 pm 2008
Sitsofe Wheeler
Re: Turning off camera also kills card reader on EeePC 900
And of course, having tried Ingo Molnar's linux-tip (whereas before I was using 2.6.27) I have now found a modern kernel that doesn't exhibit the problem... --
Sep 25, 1:49 pm 2008
Yinghai Lu
[PATCH] x86: print out irq nr for msi/ht v3
v2: fix hpet compiling error v3: Bjorn want to use dev_printk instead Signed-off-by: Yinghai Lu <yhlu.kernel@gmail.com> --- arch/x86/kernel/hpet.c | 3 +++ arch/x86/kernel/io_apic.c | 5 +++++ 2 files changed, 8 insertions(+) Index: linux-2.6/arch/x86/kernel/io_apic.c =================================================================== --- linux-2.6.orig/arch/x86/kernel/io_apic.c +++ linux-2.6/arch/x86/kernel/io_apic.c @@ -3348,6 +3348,8 @@ static int setup_msi_irq(struct ...
Sep 25, 11:53 am 2008
Steven Rostedt
[RFC PATCH 2/2 v3] ftrace: make work with new ring buffer
Note: This patch is a proof of concept, and breaks a lot of functionality of ftrace. This patch simply makes ftrace work with the developmental ring buffer. Signed-off-by: Steven Rostedt <srostedt@redhat.com> --- kernel/trace/trace.c | 776 ++++++++------------------------------ kernel/trace/trace.h | 22 - kernel/trace/trace_functions.c | 2 kernel/trace/trace_irqsoff.c | 6 kernel/trace/trace_mmiotrace.c | 10 ...
Sep 25, 11:51 am 2008
Steven Rostedt
[RFC PATCH 1/2 v3] Unified trace buffer
This is probably very buggy. I ran it as a back end for ftrace but only tested the irqsoff and ftrace tracers. The selftests are busted with it. But this is an attempt to get a unified buffering system that was talked about at the LPC meeting. Now that it boots and runs (albeit, a bit buggy), I decided to post it. This is some idea that I had to handle this. I tried to make it as simple as possible. I'm not going to explain all the stuff I'm doing here, since this code is under a lot of ...
Sep 25, 11:51 am 2008
Steven Rostedt
[RFC PATCH 0/2 v3] Unified trace buffer
[ NOTE function comments have not been updated. Comments within the code has.] This version I change the event header to what Peter Zijlstra requested. The buffer alignment is now 4 from 8. The minimum event is 8 bytes. The event header is now: struct event_header { u32 type:2, len:3, time_delta:27; u32 array[]; }; The length of the record is determined as: if (data size > 28 bytes) lenght = event->array[0] + sizeof(event_header); else length = event->len << 4 + ...
Sep 25, 11:51 am 2008
Lin Tan
[PATCH git latest] drivers/scsi: fixing wrong comment be ...
(I would wish to be personally CC'ed the answers/comments posted to the list in response to my posting.) --- Removing the wrong comment. The lock is needed before calling new_tape_buffer(), at least in some cases. So the comment above new_tape_buffer() is inconsistent with the code and may mislead developers. I simply removed the wrong comment, as I am not sure if the lock is required in all situations. If so, we should add "Caller must hold os_scsi_tapes_lock". Signed-off-by: Lin Tan ...
Sep 25, 10:49 am 2008
Anton Vorontsov
[RFC PATCH 0/2] QE Pin Multiplexing API
Here is the sketch, only for RFC. It is tested to work, but there could be silly thinkos anyway. I'll review the changes once again later... Anyhow... David, is this approach looks ok? Or I'm hopeless..? ;-) -- Anton Vorontsov email: cbouatmailru@gmail.com irc://irc.freenode.net/bd2 --
Sep 25, 11:36 am 2008
Anton Vorontsov
[PATCH 1/2] OF: new helper: of_parse_phandles_with_args()
Factored out of of_get_gpio(). Will be used by the QE pin multiplexing functions (they need to parse the gpios = <> too). Signed-off-by: Anton Vorontsov <avorontsov@ru.mvista.com> --- drivers/of/base.c | 111 ++++++++++++++++++++++++++++++++++++++++++++++++++++ drivers/of/gpio.c | 77 +++++++---------------------------- include/linux/of.h | 3 + 3 files changed, 130 insertions(+), 61 deletions(-) diff --git a/drivers/of/base.c b/drivers/of/base.c index ad8ac1a..51e0fce 100644 --- ...
Sep 25, 11:37 am 2008
Anton Vorontsov
[PATCH 2/2] powerpc/QE: implement QE Pin Multiplexing API
Today the API is still based on a fact that QE gpio controllers are registered. If they aren't, the API won't work (gracefully though). There is one caveat though: if anybody occupied the node->data before us, or overwrote it, then bad things will happen. Luckily this is all in the platform code that we fully control, so this should never happen. I could implement more checks (for example we could create a list of successfully registered QE controllers, and compare the node->data in the ...
Sep 25, 11:37 am 2008
H. Peter Anvin
Re: [RFC PATCH v2 -tip 0/4] x86: signal handler improvement
This is a gcc failure, and should be reported to the gcc people. -hpa --
Sep 25, 11:14 am 2008
Hiroshi Shimamoto
Re: [RFC PATCH v2 -tip 0/4] x86: signal handler improvement
OK, I'll test with some gcc versions. It's necessary to check outputs of the latest gcc, I think. thanks, Hiroshi Shimamoto --
Sep 25, 12:38 pm 2008
H. Peter Anvin
Re: [RFC PATCH v2 -tip 0/4] x86: signal handler improvement
Yes, it should be able to process this in a register, instead of storing to the stack and then merging later. It's possible it's trying to do that to hide latency, but that's clearly a lose in this case. I'd hate to obfuscate the code, and I'd *really* hate to obfuscate the code to work around gcc brokenness, and then not even bothering to tell the gcc folks. -hpa --
Sep 25, 11:33 am 2008
Hiroshi Shimamoto
Re: [RFC PATCH v2 -tip 0/4] x86: signal handler improvement
Does this gcc failure mean unnecessary storing into stack? thanks, Hiroshi Shimamoto --
Sep 25, 11:29 am 2008
Hiroshi Shimamoto
[RFC PATCH v2 -tip 1/4] x86: uaccess: rename __put_user_ ...
From: Hiroshi Shimamoto <h-shimamoto@ct.jp.nec.com> Use same rule to name of __put_user and __get_user. Signed-off-by: Hiroshi Shimamoto <h-shimamoto@ct.jp.nec.com> --- include/asm-x86/uaccess.h | 6 +++--- 1 files changed, 3 insertions(+), 3 deletions(-) diff --git a/include/asm-x86/uaccess.h b/include/asm-x86/uaccess.h index 414b116..c098dfe 100644 --- a/include/asm-x86/uaccess.h +++ b/include/asm-x86/uaccess.h @@ -186,7 +186,7 @@ extern int __get_user_bad(void); #ifdef ...
Sep 25, 11:18 am 2008
Hiroshi Shimamoto
[RFC PATCH v2 -tip 4/4] x86: signal: use __{put|get}_user_cerr
From: Hiroshi Shimamoto <h-shimamoto@ct.jp.nec.com> Use __{put|get}_user_cerr for cumulative error handling in x86 signal code. This makes stack usage and code size small. The line err |= __put_user(x, ptr); is comiled to like this; a0: 89 fa mov %edi,%edx a2: 8b 41 20 mov 0x20(%ecx),%eax a5: 89 46 08 mov %eax,0x8(%esi) a8: 89 55 b0 mov %edx,-0x50(%ebp) and the line __put_user_cerr(x, ptr, &err); is comiled to ...
Sep 25, 11:18 am 2008
Hiroshi Shimamoto
[RFC PATCH v2 -tip 3/4] x86: uaccess: introduce __{put|g ...
From: Hiroshi Shimamoto <h-shimamoto@ct.jp.nec.com> Introduce __{put|get}_user_cerr for cumulative error handling. The following 2 lines are same. __{put|get}_user_cerr(x, ptr, &err); err |= __{put|get}_user(x, ptr); Introduce __{put|get}_user_size_cerr for internal use from __{put|get}_user_cerr. Signed-off-by: Hiroshi Shimamoto <h-shimamoto@ct.jp.nec.com> --- include/asm-x86/uaccess.h | 74 +++++++++++++++++++++++++++++++++++++++++++++ 1 files changed, 74 insertions(+), 0 ...
Sep 25, 11:18 am 2008
Hiroshi Shimamoto
[RFC PATCH v2 -tip 2/4] x86: uaccess: introduce __{put|g ...
From: Hiroshi Shimamoto <h-shimamoto@ct.jp.nec.com> Introduce __{put|get}_user_asm_eop which receives eop for error opecode. Define __{put|get}_user_asm as __{put|get}_user_asm_eop with "mov". Signed-off-by: Hiroshi Shimamoto <h-shimamoto@ct.jp.nec.com> --- include/asm-x86/uaccess.h | 27 +++++++++++++++++++++------ 1 files changed, 21 insertions(+), 6 deletions(-) diff --git a/include/asm-x86/uaccess.h b/include/asm-x86/uaccess.h index c098dfe..84b0600 100644 --- ...
Sep 25, 11:18 am 2008
Hiroshi Shimamoto
[RFC PATCH v2 -tip 0/4] x86: signal handler improvement
This patch series is experimental. This is a second version of the series. This update is for further testing. Changes v1->v2 - passing the error variable as a reference on __{put|get}_user_cerr. ======== I noticed there are inefficient codes in x86 signals. For example, disassembled 32-bit setup_sigcontext(); 0000007c <setup_sigcontext>: 7c: 55 push %ebp 7d: 89 e5 mov %esp,%ebp 7f: 57 push %edi 80: 31 ff ...
Sep 25, 11:08 am 2008
Tilman Baumann
Re: SMACK netfilter smacklabel socket match
Wow, now it strikes me that i was running around blind all the time. SECMARK is a target not a match. I always thought i would implementing much the same thing. I guess there would be in fact currently not way to set a MARK or CONNMAK based on a SECMARK. Most of the *MARK targets have a --restore-mark option to restore a mark into the packet mark. But since the SECMARK is not numeric/bitmask there is nothing to restore. They however can do the same in regard do CONNSECMARK and ...
Sep 25, 1:32 pm 2008
Paul Moore
Re: SMACK netfilter smacklabel socket match
Most Smack development has taken place on the LSM mailing list. At the very least I would recommend you CC the LSM list when sending Smack patches. I've taken the liberty of CC'ing both the LSM list and Casey It appears the simplest option would be to provide the necessary SECMARK support in Smack. SECMARK has provisions for supporting different types of LSMs and adding Smack support should be relatively trivial. In fact, it is possible for SECMARK to be made entirely LSM agnostic ...
Sep 25, 11:26 am 2008
Paul Moore
Re: SMACK netfilter smacklabel socket match
With SELinux the packet's CIPSO label (called the packet's peer label) is different from the SECMARK label. Assuming you take a similar approach in Smack, you should be able to implement SECMARK without Yes, in the absence of the sending socket to obtain the packet's peer label you need to examine the packet itself and any labeling Well, if you are accepting or dropping packets you are applying some form of access control. I thought that was the point of your patch? Hmmm, the ...
Sep 25, 12:57 pm 2008
Tilman Baumann
Re: SMACK netfilter smacklabel socket match
Sounds like a good idea. When i looked at the SECMARK code i could not get my head around the SELinux specific stuff, so i discarded the idea as to complex. For this to be complete i guess the CIPSO labels for SMACK would need to be taken into account. Far more than my quick and dirty approach, and probably more than i'm the right person to do it. Il try to understand the inner workings of the SECMARK stuff tough. I have not investigated further into that, but if there is some ...
Sep 25, 12:26 pm 2008
Tilman Baumann
SMACK netfilter smacklabel socket match
Hi all, i made some SMACK related patches. I hope this list is the right place to post them. The intention behind this patch is that i needed a way to (firewall) match for packets originating from specific processes. The existing owner match did not work well enough, especially since the cmd-owner part is removed. Then i thought about a way to tag processes and somehow match this tag in the firewall. I recalled that SELinux can do this (SECMARK) but SELinux would have been way to ...
Sep 25, 10:25 am 2008
David Brownell
[patch 2.6.27-rc7] omap drivers switch to standard GPIO calls
From: David Brownell <dbrownell@users.sourceforge.net> This updates most of the OMAP drivers which are in mainline to switch to using the cross-platform GPIO calls instead of the older OMAP-specific ones. This is all fairly brainless/obvious stuff. Probably the most interesting bit is to observe that the omap-keypad code seems to now have a portable core that could work with non-OMAP matrix keypads. (That would improve with hardware IRQ debouncing enabled, of course...) Signed-off-by: ...
Sep 25, 10:14 am 2008
Tilman Baumann
SMACK startproc patch
Sorry if i pollute the wrong list with my stuff. But SMACK does not seem to have it's own list and now lives in the kernel. Though this is userspace related... The smack howto mentions a not yet implemented smack option for start-stop-daemon. We mainly use startproc. So i made a patch which adds this functionality to startproc. It adds the option [-S LABEL] to startproc, which brings the called process up with /proc/self/attr/current = LABEL. I figured setting the security context ...
Sep 25, 9:27 am 2008
Lukas Hejtmanek
CONFIG_SECURITY_ROOTPLUG [was: Re: 2.6.27-rc7 no init fo ...
Hello, I finally found my problems with no init found. It was caused by the CONFIG_SECURITY_ROOTPLUG option. I wonder why this option in 2.6.26.3 kernel allows the kernel boot up, while with 2.6.27-rc7, the kernel is unable to run even init. -- Lukáš Hejtmánek --
Sep 25, 9:54 am 2008
Martin Schwidefsky
[patch 4/6] kmsg: convert xpram messages to kmsg api.
From: Martin Schwidefsky <schwidefsky@de.ibm.com> Signed-off-by: Martin Schwidefsky <schwidefsky@de.ibm.com> --- Documentation/kmsg/s390/xpram | 61 ++++++++++++++++++++++++++++++++++++++++++ drivers/s390/block/xpram.c | 41 +++++++++++++--------------- 2 files changed, 80 insertions(+), 22 deletions(-) Index: kmsg-2.6/Documentation/kmsg/s390/xpram =================================================================== --- /dev/null +++ kmsg-2.6/Documentation/kmsg/s390/xpram @@ -0,0 ...
Sep 25, 9:28 am 2008
Martin Schwidefsky
[patch 2/6] kmsg: tagged device messages.
From: Martin Schwidefsky <schwidefsky@de.ibm.com> From: Michael Holzheu <holzheu@de.ibm.com> Add CONFIG_MSG_IDS support to the dev_xxx printk family. The message tag for a device printk consists of the driver name and the 24 bit hash over the message text. The hash is included in the printed line if the KMSG_COMPONENT macro is defined and CONFIG_MSG_IDS=y. For source files that do not define KMSG_COMPONENT or CONFIG_MSG_IDS=n the dev_xxx printks use the old-style format. To make it possible ...
Sep 25, 9:28 am 2008
Martin Schwidefsky
[patch 1/6] kmsg: tagged kernel messages.
From: Martin Schwidefsky <schwidefsky@de.ibm.com> From: Michael Holzheu <holzheu@de.ibm.com> Introduce a new family of printk macros which prefixes each kmsg message with a component name and allows to tag the message with a 24 bit hash of the message text. The kmsg component name is defined per source file with the KMSG_COMPONENT macro. In order to use the kmsg_xxx macros KMSG_COMPONENT has to be defined. If the message hash will be printed to the console / syslog at all depends on ...
Sep 25, 9:28 am 2008
Martin Schwidefsky
[patch 6/6] kmsg: convert lcs printk messages to kmsg api.
From Klaus-D. Wacker <kdwacker@de.ibm.com> Signed-off-by: Klaus-D. Wacker <kdwacker@de.ibm.com> Signed-off-by: Martin Schwidefsky <schwidefsky@de.ibm.com> --- Documentation/kmsg/s390/lcs | 170 ++++++++++++++++++++++++++++++++++++++++++++ drivers/s390/net/lcs.c | 78 +++++++++++--------- 2 files changed, 216 insertions(+), 32 deletions(-) Index: kmsg-2.6/Documentation/kmsg/s390/lcs =================================================================== --- /dev/null +++ ...
Sep 25, 9:28 am 2008
Martin Schwidefsky
[patch 0/6] [RFC] kmsg macros, take x+3.
Greetings, after a short chat with Greg on the kernel summit here is now the hopefully final version of the kernel message catalog patches. As version x+2 this uses automatically generated message hashes. The new thing is that there are no more special kmsg_dev_xxx macros, this version uses the standard dev_xxx macros. If a KMSG_COMPONENT is defined then dev_xxx will include the message hash after the driver name in the format: <driver-name>.<message hash>: <device name>: <message> The ...
Sep 25, 9:28 am 2008
Martin Schwidefsky
[patch 3/6] kmsg: Kernel message catalog script.
From: Martin Schwidefsky <schwidefsky@de.ibm.com> From: Michael Holzheu <holzheu@de.ibm.com> Add a script and the calls to the make process that allows to check the kmsg_xxx and dev_xxx printk message and to format man pages from the message descriptions. The kmsg message description is a comment with the following format: /*? * Tag: <component>.<hash> * Text: "<kmsg message text>" * Severity: <severity> * Parameter: * @1: <description of the first message parameter> * @2: ...
Sep 25, 9:28 am 2008
Martin Schwidefsky
[patch 5/6] kmsg: convert vmcp to kmsg api.
From: Christian Borntraeger <borntraeger@de.ibm.com> Signed-off-by: Christian Borntraeger <borntraeger@de.ibm.com> Signed-off-by: Martin Schwidefsky <schwidefsky@de.ibm.com> --- drivers/s390/char/vmcp.c | 23 ++++++++++++++++++++--- 1 file changed, 20 insertions(+), 3 deletions(-) Index: kmsg-2.6/drivers/s390/char/vmcp.c =================================================================== --- kmsg-2.6.orig/drivers/s390/char/vmcp.c +++ kmsg-2.6/drivers/s390/char/vmcp.c @@ -11,12 +11,15 ...
Sep 25, 9:28 am 2008
Steven Rostedt
Re: [RFC PATCH 1/2 v2] Unified trace buffer
One answer is that your counter wrap problem is extended. That is, you have 27 bits of time between each event to not worry about wraps. But if you go against the page itself, the last event on that page is more likely to suffer. -- Steve --
Sep 25, 10:48 am 2008
Linus Torvalds
Re: [RFC PATCH 1/2 v2] Unified trace buffer
Overwrite things on page at a time. Don't you already do that? (I didn't check that closely, I just assumed you would do the _much_ simpler "move the head to the next page" thing rather than trying to mix head and tail Use the page start date for the first event in a page. But within pages, make everything depend on previous event. Linus --
Sep 25, 10:25 am 2008
Mathieu Desnoyers
Re: [RFC PATCH 1/2 v2] Unified trace buffer
You can do the exact same thing and manage to keep the absolute time. You just have to adapt the reader like this : (this would be for per-event cycle count in the 32 LSBs, slight bitmask adaptation needed for 27 bits only). keep a 64 bits TSC value in a per-buffer variable. The previous value is always re-used for the next read. let's call it : tf->buffer.tsc (u64) read_event() would look like : u32 timetamp = read_event_timetamp(); if(timestamp < (0xFFFFFFFFULL & ...
Sep 25, 11:25 am 2008
Linus Torvalds
Re: [RFC PATCH 1/2 v2] Unified trace buffer
Your timestamp handling seems odd. You do it per-event, but I think it should happen for all events, ie just do *ts += event->time_delta; _outside_ the case statement, and then in RB_TYPE_TIME_EXTENT you'd do either - relative: *ts += event->data << TS_SHIFT; - absolute timestamp events: *ts = (event->data << TS_SHIFT) + event->time_delta; but the bigger issue is that I think the timestamp should be relative to the _previous_ event, not relative to the page start. ...
Sep 25, 10:02 am 2008
Steven Rostedt
Re: [RFC PATCH 1/2 v2] Unified trace buffer
hehe, this code is in so much flux, that I gave up on keeping the The problem with this is overwrite mode, which is the only mode ftace currently offers. What happens when your writer starts overwriting the ring buffer and there is no reader? What happens is that the start value is gone. You do not have a way to use all the deltas to catch up to the remaining events. -- Steve --
Sep 25, 10:16 am 2008
Steven Rostedt
[RFC PATCH 1/2 v2] Unified trace buffer
This is probably very buggy. I ran it as a back end for ftrace but only tested the irqsoff and ftrace tracers. The selftests are busted with it. But this is an attempt to get a unified buffering system that was talked about at the LPC meeting. Now that it boots and runs (albeit, a bit buggy), I decided to post it. This is some idea that I had to handle this. I tried to make it as simple as possible. I'm not going to explain all the stuff I'm doing here, since this code is under a lot of ...
Sep 25, 8:58 am 2008
Mathieu Desnoyers
Re: [RFC PATCH 1/2 v2] Unified trace buffer
* Linus Torvalds (torvalds@linux-foundation.org) wrote: How about keeping the timestamps absolute ? (but just keep 27 LSBs) It would help resynchronizing the timestamps if an event is lost and would not accumulate error over and over. It would event help detecting bugs in the tracer by checking if timestamps go backward. Also, it would remove inter-dependency between consecutive events; we would not have to know "for sure" what the previous timestamp was when we write the current event. ...
Sep 25, 10:35 am 2008
Steven Rostedt
Re: [RFC PATCH 1/2 v2] Unified trace buffer
Yeah, I decided to do the blast the page instead of just blast the entry. Hmm, I'm confused. I do this already. I put the page start time on each page already. The events on the page are based on the page start time itself. The iterator, or writer, keeps track of the last event. When it reaches a new page, it reads the new time stamp of the page and starts incrementing against that. -- Steve --
Sep 25, 10:46 am 2008
Steven Rostedt
[RFC PATCH 0/2 v2] Unified trace buffer (take two)
Again: this is a proof of concept, just spitting out code for comments. Here's my second attempt. Changes since version 1: - Ripped away all the debugfs and event registration from ring buffers. - Removed the mergesort from the ringbuffer and pushed that up to the tracer. - Changed the event header to what Linus suggested (we can discuss this and try other suggestions for v3, namely Peter Zijlstras ideas). struct { u32 time_delta:27, type:5; u32 data; u64 ...
Sep 25, 8:58 am 2008
Steven Rostedt
[RFC PATCH 2/2 v2] ftrace: make work with new ring buffer
Note: This patch is a proof of concept, and breaks a lot of functionality of ftrace. This patch simply makes ftrace work with the developmental ring buffer. Signed-off-by: Steven Rostedt <srostedt@redhat.com> --- kernel/trace/trace.c | 776 ++++++++------------------------------ kernel/trace/trace.h | 22 - kernel/trace/trace_functions.c | 2 kernel/trace/trace_irqsoff.c | 6 kernel/trace/trace_mmiotrace.c | 10 ...
Sep 25, 8:58 am 2008
Kiyoshi Ueda
[RFC PATCH 0/2] Export lld busy state for request stacki ...
Hi Andrew, Jens, James, The following patches are the revised version to use lld's private data instead of backing_dev_info.state for exporting lld busy state. Please review and give me your opinion. If all of you have no problem, I'll re-send the patches separately to each maintainer, Jens and James. The patches are created on the following commit of for-2.6.28 in linux-2.6-block, but the scsi part can be also applied ...
Sep 25, 8:53 am 2008
Kiyoshi Ueda
[RFC PATCH 2/2] scsi: export busy state via q->lld_busy_fn()
This patch implements q->lld_busy_fn() for scsi mid layer to export its busy state. Please note that it checks the cached information (sdev->lld_busy) for efficiency, instead of checking the actual state of sdev/starget/shost everytime. So the care must be taken not to leave sdev->lld_busy = 1 for the following cases: - when there is no request in the sdev queue - when scsi can't dispatch I/Os anymore and needs to kill I/Os Otherwise, request stacking drivers may not submit any ...
Sep 25, 8:55 am 2008
Kiyoshi Ueda
[RFC PATCH 1/2] block: add lld busy state exporting interface
This patch adds an new interface, blk_lld_busy(), to check lld's busy state from the block layer. blk_lld_busy() calls down into low-level drivers for the checking if the drivers set q->lld_busy_fn() using blk_queue_lld_busy(). This resolves a performance problem on request stacking devices below. Some drivers like scsi mid layer stop dispatching request when they detect busy state on its low-level device like host/target/device. It allows other requests to stay in the I/O scheduler's ...
Sep 25, 8:54 am 2008
Stephen Clark
2.6.26.5 hangs on boot
Hello, Vanilla kernel 2.6.26.5 hangs on boot right after displaying isapnp: Scanning For PnP cards ... isapnp: No Plug & Play device found I have to power off my laptop to get out of this state. I never had this problem with previous kernels. Redhat Bugzilla has other people with this problem. I had it too so I compiled a vanilla kernel to make sure it wasn't vendor specific. https://bugzilla.redhat.com/show_bug.cgi?id=462156 If I append clocksource=hpet then the kernel will boot - but ...
Sep 25, 8:25 am 2008
Stephen Clark
Re: 2.6.26.5 hangs on boot
This seems to be fixed in 2.6.27-rc7 Steve -- "They that give up essential liberty to obtain temporary safety, deserve neither liberty nor safety." (Ben Franklin) "The course of history shows that as a government grows, liberty decreases." (Thomas Jefferson) --
Sep 25, 10:43 am 2008
David Howells
[PATCH] AFS: Convert to new write aops
From: Nick Piggin <npiggin@suse.de> Convert AFS to the new write address space operations. This cannot assume that writes will fully complete, so this conversion goes the easy way and always brings the page uptodate before the write. Signed-off-by: Nick Piggin <npiggin@suse.de> Signed-off-by: David Howells <dhowells@redhat.com> --- fs/afs/file.c | 4 +- fs/afs/internal.h | 8 ++- fs/afs/write.c | 131 +++++++++++++++++------------------------------------ 3 files changed, ...
Sep 25, 8:01 am 2008
Eric Sandeen
Re: ext3 read only handling
Is the block device recognized as readonly? ext3_load_journal() does things like: really_read_only = bdev_read_only(sb->s_bdev); ... printk(KERN_INFO "EXT3-fs: INFO: recovery " "required on readonly filesystem.\n"); if (really_read_only) { printk(KERN_ERR "EXT3-fs: write access " "unavailable, cannot proceed.\n"); ...
Sep 25, 8:15 am 2008
Pavel Machek
ext3 read only handling
Inserting read-only ext3 SD card into a machine and trying to mount ext3 volume on it is very unnice :-(. Is there easy way to "fix" -oro so that it does not write to the media? (Machine died after I played with that SD card a bit more, but I can't easily reproduce that...) Pavel Sep 25 11:51:38 amd kernel: mmcblk0: mmc0:0002 SD1GB 975872KiB (ro) Sep 25 11:51:38 amd kernel: mmcblk0: p1 Sep 25 11:51:47 amd kernel: JBD: Clearing recovery information on journal Sep 25 11:51:47 ...
Sep 25, 8:03 am 2008
Cyrill Gorcunov
Re: Kernel PANIC with 2.6.27-rc6
[Tej - Fri, Sep 26, 2008 at 12:04:30AM +0530] ... Btw Tej, could you please apply this patch and check if it help. http://lkml.org/lkml/2008/9/22/250 Aristeu Rozanski fixed one race window which take place on unpredicted nmi's. - Cyrill - --
Sep 25, 12:39 pm 2008
Cyrill Gorcunov
Re: Kernel PANIC with 2.6.27-rc6
[Tej - Fri, Sep 26, 2008 at 12:04:30AM +0530] ... | > | > If assume that git bisect is right - it's hardly possible | | i have bisected it twice n jason mentioned that it looks like some | kind of nmi race condition. | | but i will bisect it again in rc7 and let you know. | | > that proc_nmi_enabled() related changes introduced this | > error (ie this commit) - kgdb seems to be not | > touching /proc/sys/kernel/nmi while doing self-tests. | > | > - Cyrill - | > | There was a ...
Sep 25, 12:27 pm 2008
Tej
Re: Kernel PANIC with 2.6.27-rc6
i have bisected it twice n jason mentioned that it looks like some kind of nmi race condition. --
Sep 25, 11:34 am 2008
Cyrill Gorcunov
Re: Kernel PANIC with 2.6.27-rc6
[Cyrill Gorcunov - Thu, Sep 25, 2008 at 06:56:35PM +0400] | [Tej - Thu, Sep 25, 2008 at 08:05:23PM +0530] | | On 9/11/08, Jason Wessel <jason.wessel@windriver.com> wrote: | | > Tej wrote: | | >> observed a panic on 2.6.27-rc6 caused by enabling | | >> "CONFIG_KGDB_TESTS_ON_BOOT" option | | >> | | >> panic logged using serial console and .config are attached. | | >> | | >> Linux version 2.6.27-rc6 (root@luser-desktop) (gcc version 4.2.3 | | > (Ubuntu 4.2.3-8 | | > [clip] | | >> kgdb: ...
Sep 25, 8:47 am 2008
Tej
Re: Kernel PANIC with 2.6.27-rc6
i have obsvered this bug from 2.6.27-rc1 to rc7 ok so i did git bisection which pointed:- commit 3ed3f06295e69700fa808396f7b350bff2b69de0 Author: Cyrill Gorcunov <gorcunov@gmail.com> Date: Wed Jun 4 01:00:47 2008 +0400 x86: nmi - consolidate nmi_watchdog_default for 32bit mode 64bit mode bootstrap code does set nmi_watchdog to NMI_NONE by default and doing the same on 32bit mode is safe too. Such an action saves us from several #ifdef. git -bisection ...
Sep 25, 7:35 am 2008
Cyrill Gorcunov
Re: Kernel PANIC with 2.6.27-rc6
[Tej - Thu, Sep 25, 2008 at 08:05:23PM +0530] | On 9/11/08, Jason Wessel <jason.wessel@windriver.com> wrote: | > Tej wrote: | >> observed a panic on 2.6.27-rc6 caused by enabling | >> "CONFIG_KGDB_TESTS_ON_BOOT" option | >> | >> panic logged using serial console and .config are attached. | >> | >> Linux version 2.6.27-rc6 (root@luser-desktop) (gcc version 4.2.3 | > (Ubuntu 4.2.3-8 | > [clip] | >> kgdb: Registered I/O driver kgdbts. | >> kgdbts:RUN plant and detach test | >> kgdbts:RUN sw ...
Sep 25, 7:56 am 2008
Todor Gyumyushev
bug report
Sep 25 16:21:09 macmini kernel: ------------[ cut here ]------------ Sep 25 16:21:09 macmini kernel: WARNING: at drivers/net/wireless/ath5k/hw.c:3795 ath5k_hw_setup_4word_tx_desc+0x1a3/0x2 50 [ath5k]() Sep 25 16:21:09 macmini kernel: Modules linked in: ipt_MASQUERADE iptable_nat nf_nat nf_conntrack_ipv4 nf_conntrack ip_ tables x_tables vboxdrv i915 drm tun kvm acpi_cpufreq hci_usb rfcomm l2cap pktcdvd nfsd lockd nfs_acl auth_rpcgss sunrp c exportfs aes_generic fan ac battery microcode ...
Sep 25, 7:08 am 2008
James Morris
Re: [PATCH] CRED: Use correct task_struct members in cre ...
Applied to: git://git.kernel.org/pub/scm/linux/kernel/git/jmorris/security-testing-2.6#next-creds-subsys -- James Morris <jmorris@namei.org> --
Sep 25, 2:56 pm 2008
David Howells
[PATCH] CRED: Use correct task_struct members in cred.h ...
From: Marc Dionne <marc.c.dionne@gmail.com> Adjust comments in linux/cred.h - use the correct names for the struct cred pointers in the task_struct structure. Signed-off-by: Marc Dionne <marc.c.dionne@gmail.com> Signed-off-by: David Howells <dhowells@redhat.com> --- include/linux/cred.h | 12 ++++++------ 1 files changed, 6 insertions(+), 6 deletions(-) diff --git a/include/linux/cred.h b/include/linux/cred.h index 1f8d8d0..26c1ab1 100644 --- a/include/linux/cred.h +++ ...
Sep 25, 7:03 am 2008
John Gumb
Re: D945GCLF intel atom board & r8169 ethernet broken i ...
My fault linux-2.6.27-rc7 fixes this. I should have tested this. Thanks to all who responded with fixes. John -----Original Message----- From: John Gumb Sent: 24 September 2008 12:26 To: linux-kernel@vger.kernel.org Cc: 'netdev@vger.kernel.org' Subject: D945GCLF intel atom board & r8169 ethernet broken in linux-2.6.27-rc5?? Hi I have an Intel Atom D945GCLF based system. It has a realtek RTL8101E PCI network interface. With linux-2.6.27-rc4 - everything seems fine. I think ...
Sep 25, 6:58 am 2008
Andres Freund
bad DMAR interaction with iwlagn and SATA
--Boundary-01=_z342IWcS2hda0oi Content-Type: text/plain; charset="utf-8" Content-Transfer-Encoding: quoted-printable Content-Disposition: inline Hi,=09 in some accident caused by wanting to create the .config/compile the kernel= =20 for my new laptop (thinkpad t500) before the desperately needed sleeping I= =20 activated DMAR... I don't know if this is relevant, but I though i better report it. This was on fb478da5ba69ecf40729ae8ab37ca406b1e5be48 - sometime after ...
Sep 25, 6:11 am 2008
Brice Goglin
[RESEND][PATCH] mm: make do_move_pages() complexity linear
Page migration is currently very slow because its overhead is quadratic with the number of pages. This is caused by each single page migration doing a linear lookup in the page array in new_page_node(). Since pages are stored in the array order in the pagelist and do_move_pages process this list in order, new_page_node() can cache the last "pm" pointer to the page array. This way, the next iteration will find the next page in usually 1 while loop (or 0 if called multiple times on the same page, ...
Sep 25, 6:00 am 2008
Jack Steiner
[PATCH] - UV fix for size of hub mappings
Fix the size of the mappings of UV hub registers. Size must be a function of the maximum node number within the SSI. Signed-off-by: Jack Steiner <steiner@sgi.com> --- arch/x86/kernel/genx2apic_uv_x.c | 13 +++++++------ 1 file changed, 7 insertions(+), 6 deletions(-) Index: linux/arch/x86/kernel/genx2apic_uv_x.c =================================================================== --- linux.orig/arch/x86/kernel/genx2apic_uv_x.c 2008-08-17 01:21:43.000000000 -0500 +++ ...
Sep 25, 5:52 am 2008
Chirag Jog
[BUG][PPC64] BUG in 2.6.26.5-rt9 causing Hang
Hi Gregory, We see the following BUG followed by a hang on the latest kernel 2.6.26.5-rt9 on a Power6 blade (PPC64) It is easily recreated by running the async_handler or sbrk_mutex (realtime tests from ltp) tests. login: cpu 0x2: Vector: 700 (Program Check) at [c0000000e8e875d0] pc: c00000000005110c: .pick_next_pushable_task+0x54/0x9c lr: c000000000059f50: .push_rt_task+0x44/0x2b4 sp: c0000000e8e87850 msr: 8000000000021032 current = 0xc0000000ea5bb2e0 paca = ...
Sep 25, 5:32 am 2008
Avi Kivity
[PATCH 01/39] KVM: VMX: Clean up magic number 0x66 in in ...
From: Sheng Yang <sheng.yang@intel.com> Signed-off-by: Sheng Yang <sheng.yang@intel.com> Signed-off-by: Avi Kivity <avi@qumranet.com> --- arch/x86/kvm/vmx.c | 3 ++- 1 files changed, 2 insertions(+), 1 deletions(-) diff --git a/arch/x86/kvm/vmx.c b/arch/x86/kvm/vmx.c index ddb49e3..229e2d0 100644 --- a/arch/x86/kvm/vmx.c +++ b/arch/x86/kvm/vmx.c @@ -1732,7 +1732,8 @@ static int init_rmode_tss(struct kvm *kvm) if (r < 0) goto out; data = TSS_BASE_SIZE + ...
Sep 25, 4:54 am 2008
Avi Kivity
[PATCH 32/39] KVM: MMU: Account for npt/ept/realmode pag ...
From: Avi Kivity <avi@qumranet.com> Now that two-dimensional paging is becoming common, account for tdp page faults. Signed-off-by: Avi Kivity <avi@qumranet.com> --- arch/x86/kvm/mmu.c | 1 + 1 files changed, 1 insertions(+), 0 deletions(-) diff --git a/arch/x86/kvm/mmu.c b/arch/x86/kvm/mmu.c index a1ca4ff..a24da8f 100644 --- a/arch/x86/kvm/mmu.c +++ b/arch/x86/kvm/mmu.c @@ -1283,6 +1283,7 @@ static int direct_map_entry(struct kvm_shadow_walk *_walk, mmu_set_spte(vcpu, sptep, ...
Sep 25, 4:55 am 2008
Avi Kivity
[PATCH 34/39] KVM: MMU: Flush tlbs after clearing write ...
From: Avi Kivity <avi@qumranet.com> Otherwise, the cpu may allow writes to the tracked pages, and we lose some display bits or fail to migrate correctly. Signed-off-by: Avi Kivity <avi@qumranet.com> --- arch/x86/kvm/mmu.c | 1 + 1 files changed, 1 insertions(+), 0 deletions(-) diff --git a/arch/x86/kvm/mmu.c b/arch/x86/kvm/mmu.c index 5052acd..853a288 100644 --- a/arch/x86/kvm/mmu.c +++ b/arch/x86/kvm/mmu.c @@ -2111,6 +2111,7 @@ void kvm_mmu_slot_remove_write_access(struct kvm *kvm, ...
Sep 25, 4:55 am 2008
Avi Kivity
[PATCH 30/39] KVM: Don't call get_user_pages(.force = 1)
From: Avi Kivity <avi@qumranet.com> This is esoteric and only needed to break COW on MAP_SHARED mappings. Since KVM no longer does these sorts of mappings, breaking COW on them is no longer necessary. Signed-off-by: Avi Kivity <avi@qumranet.com> --- virt/kvm/kvm_main.c | 2 +- 1 files changed, 1 insertions(+), 1 deletions(-) diff --git a/virt/kvm/kvm_main.c b/virt/kvm/kvm_main.c index 0309571..de3b029 100644 --- a/virt/kvm/kvm_main.c +++ b/virt/kvm/kvm_main.c @@ -734,7 +734,7 @@ ...
Sep 25, 4:55 am 2008
Avi Kivity
[PATCH 33/39] KVM: MMU: Add locking around kvm_mmu_slot_ ...
From: Avi Kivity <avi@qumranet.com> It was generally safe due to slots_lock being held for write, but it wasn't very nice. Signed-off-by: Avi Kivity <avi@qumranet.com> --- arch/x86/kvm/mmu.c | 2 ++ 1 files changed, 2 insertions(+), 0 deletions(-) diff --git a/arch/x86/kvm/mmu.c b/arch/x86/kvm/mmu.c index a24da8f..5052acd 100644 --- a/arch/x86/kvm/mmu.c +++ b/arch/x86/kvm/mmu.c @@ -2097,6 +2097,7 @@ void kvm_mmu_slot_remove_write_access(struct kvm *kvm, int slot) { struct ...
Sep 25, 4:55 am 2008
Avi Kivity
[PATCH 37/39] KVM: add MC5_MISC msr read support
From: Joerg Roedel <joerg.roedel@amd.com> Currently KVM implements MC0-MC4_MISC read support. When booting Linux this results in KVM warnings in the kernel log when the guest tries to read MC5_MISC. Fix this warnings with this patch. Signed-off-by: Joerg Roedel <joerg.roedel@amd.com> Signed-off-by: Avi Kivity <avi@qumranet.com> --- arch/x86/kvm/x86.c | 1 + 1 files changed, 1 insertions(+), 0 deletions(-) diff --git a/arch/x86/kvm/x86.c b/arch/x86/kvm/x86.c index 675d010..e3b8966 ...
Sep 25, 4:55 am 2008
Avi Kivity
[PATCH 08/39] KVM: Device assignment: Check for privileg ...
From: Amit Shah <amit.shah@qumranet.com> Even though we don't share irqs at the moment, we should ensure regular user processes don't try to allocate system resources. We check for capability to access IO devices (CAP_SYS_RAWIO) before we request_irq on behalf of the guest. Noticed by Avi. Signed-off-by: Amit Shah <amit.shah@qumranet.com> Signed-off-by: Avi Kivity <avi@qumranet.com> --- arch/x86/kvm/x86.c | 5 +++++ 1 files changed, 5 insertions(+), 0 deletions(-) diff --git ...
Sep 25, 4:54 am 2008
Avi Kivity
[PATCH 22/39] KVM: MMU: Move SHADOW_PT_INDEX to mmu.c
From: Avi Kivity <avi@qumranet.com> It is not specific to the paging mode, so can be made global (and reusable). Signed-off-by: Avi Kivity <avi@qumranet.com> --- arch/x86/kvm/mmu.c | 2 ++ arch/x86/kvm/paging_tmpl.h | 3 --- 2 files changed, 2 insertions(+), 3 deletions(-) diff --git a/arch/x86/kvm/mmu.c b/arch/x86/kvm/mmu.c index 171bcea..51d4cd7 100644 --- a/arch/x86/kvm/mmu.c +++ b/arch/x86/kvm/mmu.c @@ -135,6 +135,8 @@ module_param(dbg, bool, 0644); #define ...
Sep 25, 4:54 am 2008
Avi Kivity
[PATCH 35/39] KVM: MMU: Fix setting the accessed bit on ...
From: Avi Kivity <avi@qumranet.com> The accessed bit was accidentally turned on in a random flag word, rather than, the spte itself, which was lucky, since it used the non-EPT compatible PT_ACCESSED_MASK. Fix by turning the bit on in the spte and changing it to use the portable accessed mask. Signed-off-by: Avi Kivity <avi@qumranet.com> --- arch/x86/kvm/mmu.c | 2 +- 1 files changed, 1 insertions(+), 1 deletions(-) diff --git a/arch/x86/kvm/mmu.c b/arch/x86/kvm/mmu.c index ...
Sep 25, 4:55 am 2008
Avi Kivity
[PATCH 07/39] KVM: Handle spurious acks for PIT interrupts
From: Avi Kivity <avi@qumranet.com> Spurious acks can be generated, for example if the PIC is being reset. Handle those acks gracefully rather than flooding the log with warnings. Signed-off-by: Avi Kivity <avi@qumranet.com> --- arch/x86/kvm/i8254.c | 2 +- 1 files changed, 1 insertions(+), 1 deletions(-) diff --git a/arch/x86/kvm/i8254.c b/arch/x86/kvm/i8254.c index 7d04dd3..c842060 100644 --- a/arch/x86/kvm/i8254.c +++ b/arch/x86/kvm/i8254.c @@ -228,7 +228,7 @@ void ...
Sep 25, 4:54 am 2008
Avi Kivity
[PATCH 23/39] KVM: MMU: Unify direct map 4K and large pa ...
From: Avi Kivity <avi@qumranet.com> The two paths are equivalent except for one argument, which is already available. Merge the two codepaths. Signed-off-by: Avi Kivity <avi@qumranet.com> --- arch/x86/kvm/mmu.c | 11 +++-------- 1 files changed, 3 insertions(+), 8 deletions(-) diff --git a/arch/x86/kvm/mmu.c b/arch/x86/kvm/mmu.c index 51d4cd7..3ee856f 100644 --- a/arch/x86/kvm/mmu.c +++ b/arch/x86/kvm/mmu.c @@ -1240,15 +1240,10 @@ static int __direct_map(struct kvm_vcpu *vcpu, gpa_t ...
Sep 25, 4:54 am 2008
Avi Kivity
[PATCH 28/39] KVM: MMU: Convert the paging mode shadow w ...
From: Avi Kivity <avi@qumranet.com> Signed-off-by: Avi Kivity <avi@qumranet.com> --- arch/x86/kvm/paging_tmpl.h | 158 ++++++++++++++++++++++++-------------------- 1 files changed, 86 insertions(+), 72 deletions(-) diff --git a/arch/x86/kvm/paging_tmpl.h b/arch/x86/kvm/paging_tmpl.h index ebb26a0..b7064e1 100644 --- a/arch/x86/kvm/paging_tmpl.h +++ b/arch/x86/kvm/paging_tmpl.h @@ -25,6 +25,7 @@ #if PTTYPE == 64 #define pt_element_t u64 #define guest_walker ...
Sep 25, 4:55 am 2008
Avi Kivity
[PATCH 39/39] KVM: s390: change help text of guest Kconfig
From: Christian Borntraeger <borntraeger@de.ibm.com> The current help text for CONFIG_S390_GUEST is not very helpful. Lets add more text. Signed-off-by: Christian Borntraeger <borntraeger@de.ibm.com> Signed-off-by: Avi Kivity <avi@qumranet.com> --- arch/s390/Kconfig | 7 +++++-- 1 files changed, 5 insertions(+), 2 deletions(-) diff --git a/arch/s390/Kconfig b/arch/s390/Kconfig index 8d41908..c9bfed9 100644 --- a/arch/s390/Kconfig +++ b/arch/s390/Kconfig @@ -564,13 +564,16 @@ config ...
Sep 25, 4:55 am 2008
Avi Kivity
[PATCH 26/39] KVM: MMU: Add generic shadow walker
From: Avi Kivity <avi@qumranet.com> We currently walk the shadow page tables in two places: direct map (for real mode and two dimensional paging) and paging mode shadow. Since we anticipate requiring a third walk (for invlpg), it makes sense to have a generic facility for shadow walk. This patch adds such a shadow walker, walks the page tables and calls a method for every spte encountered. The method can examine the spte, modify it, or even instantiate it. The walk can be aborted by ...
Sep 25, 4:54 am 2008
Avi Kivity
[PATCH 11/39] KVM: VMX: Add invalid guest state handler
From: Mohammed Gamal <m.gamal005@gmail.com> This adds the invalid guest state handler function which invokes the x86 emulator until getting the guest to a VMX-friendly state. [avi: leave atomic context if scheduling] [guillaume: return to atomic context correctly] Signed-off-by: Laurent Vivier <laurent.vivier@bull.net> Signed-off-by: Guillaume Thouvenin <guillaume.thouvenin@ext.bull.net> Signed-off-by: Mohammed Gamal <m.gamal005@gmail.com> Signed-off-by: Avi Kivity ...
Sep 25, 4:54 am 2008
Avi Kivity
[PATCH 14/39] KVM: Use kvm_set_irq to inject interrupts
From: Amit Shah <amit.shah@qumranet.com> ... instead of using the pic and ioapic variants Signed-off-by: Amit Shah <amit.shah@qumranet.com> Signed-off-by: Avi Kivity <avi@qumranet.com> --- arch/x86/kvm/i8254.c | 6 ++---- arch/x86/kvm/x86.c | 8 +------- 2 files changed, 3 insertions(+), 11 deletions(-) diff --git a/arch/x86/kvm/i8254.c b/arch/x86/kvm/i8254.c index c842060..fdaa0f0 100644 --- a/arch/x86/kvm/i8254.c +++ b/arch/x86/kvm/i8254.c @@ -596,10 +596,8 @@ void ...
Sep 25, 4:54 am 2008
Avi Kivity
[PATCH 10/39] KVM: VMX: Add module parameter and emulati ...
From: Mohammed Gamal <m.gamal005@gmail.com> The patch adds the module parameter required to enable emulating invalid guest state, as well as the emulation_required flag used to drive emulation whenever needed. Signed-off-by: Mohammed Gamal <m.gamal005@gmail.com> Signed-off-by: Avi Kivity <avi@qumranet.com> --- arch/x86/kvm/vmx.c | 4 ++++ 1 files changed, 4 insertions(+), 0 deletions(-) diff --git a/arch/x86/kvm/vmx.c b/arch/x86/kvm/vmx.c index e889b76..7c5f611 100644 --- ...
Sep 25, 4:54 am 2008
Avi Kivity
[PATCH 20/39] KVM: x86 emulator: remove duplicate SrcImm
From: roel kluin <roel.kluin@gmail.com> Signed-off-by: Roel Kluin <roel.kluin@gmail.com> Signed-off-by: Avi Kivity <avi@qumranet.com> --- arch/x86/kvm/x86_emulate.c | 2 +- 1 files changed, 1 insertions(+), 1 deletions(-) diff --git a/arch/x86/kvm/x86_emulate.c b/arch/x86/kvm/x86_emulate.c index d5da7f1..5d6c144 100644 --- a/arch/x86/kvm/x86_emulate.c +++ b/arch/x86/kvm/x86_emulate.c @@ -269,7 +269,7 @@ static u16 group_table[] = { ByteOp | DstMem | SrcNone | ModRM, ByteOp | DstMem ...
Sep 25, 4:54 am 2008
Avi Kivity
[PATCH 06/39] KVM: fix i8259 reset irq acking
From: Marcelo Tosatti <mtosatti@redhat.com> The irq ack during pic reset has three problems: - Ignores slave/master PIC, using gsi 0-8 for both. - Generates an ACK even if the APIC is in control. - Depends upon IMR being clear, which is broken if the irq was masked at the time it was generated. The last one causes the BIOS to hang after the first reboot of Windows installation, since PIT interrupts stop. [avi: fix check whether pic interrupts are seen by cpu] Signed-off-by: Marcelo ...
Sep 25, 4:54 am 2008
Avi Kivity
[PATCH 24/39] KVM: ia64: Enable virtio driver for ia64 i ...
From: Xiantao Zhang <xiantao.zhang@intel.com> kvm/ia64 uses the virtio drivers to optimize its I/O subsytem. Signed-off-by: Xiantao Zhang <xiantao.zhang@intel.com> Signed-off-by: Avi Kivity <avi@qumranet.com> --- arch/ia64/kvm/Kconfig | 2 ++ 1 files changed, 2 insertions(+), 0 deletions(-) diff --git a/arch/ia64/kvm/Kconfig b/arch/ia64/kvm/Kconfig index 7914e48..8e99fed 100644 --- a/arch/ia64/kvm/Kconfig +++ b/arch/ia64/kvm/Kconfig @@ -46,4 +46,6 @@ config KVM_INTEL config ...
Sep 25, 4:54 am 2008
Avi Kivity
[PATCH 02/39] KVM: remove unused field from the assigned ...
From: Ben-Ami Yassour <benami@il.ibm.com> Remove unused field: struct kvm_assigned_pci_dev assigned_dev from struct: struct kvm_assigned_dev_kernel Signed-off-by: Ben-Ami Yassour <benami@il.ibm.com> Signed-off-by: Avi Kivity <avi@qumranet.com> --- include/asm-x86/kvm_host.h | 1 - 1 files changed, 0 insertions(+), 1 deletions(-) diff --git a/include/asm-x86/kvm_host.h b/include/asm-x86/kvm_host.h index fb7d7b7..1161af1 100644 --- a/include/asm-x86/kvm_host.h +++ ...
Sep 25, 4:54 am 2008
Avi Kivity
[PATCH 29/39] KVM: Allocate guest memory as MAP_PRIVATE, ...
From: Avi Kivity <avi@qumranet.com> There is no reason to share internal memory slots with fork()ed instances. Signed-off-by: Avi Kivity <avi@qumranet.com> --- arch/x86/kvm/x86.c | 2 +- 1 files changed, 1 insertions(+), 1 deletions(-) diff --git a/arch/x86/kvm/x86.c b/arch/x86/kvm/x86.c index bfc7c33..675d010 100644 --- a/arch/x86/kvm/x86.c +++ b/arch/x86/kvm/x86.c @@ -4296,7 +4296,7 @@ int kvm_arch_set_memory_region(struct kvm *kvm, userspace_addr = do_mmap(NULL, 0, ...
Sep 25, 4:55 am 2008
Avi Kivity
[PATCH 36/39] KVM: SVM: No need to unprotect memory duri ...
From: Avi Kivity <avi@qumranet.com> No memory is protected anyway. Signed-off-by: Avi Kivity <avi@qumranet.com> --- arch/x86/kvm/svm.c | 2 +- 1 files changed, 1 insertions(+), 1 deletions(-) diff --git a/arch/x86/kvm/svm.c b/arch/x86/kvm/svm.c index be86c09..6022888 100644 --- a/arch/x86/kvm/svm.c +++ b/arch/x86/kvm/svm.c @@ -1021,7 +1021,7 @@ static int pf_interception(struct vcpu_svm *svm, struct kvm_run *kvm_run) if (npt_enabled) svm_flush_tlb(&svm->vcpu); - if ...
Sep 25, 4:55 am 2008
Avi Kivity
[PATCH 13/39] KVM: SVM: Fix typo
From: Amit Shah <amit.shah@qumranet.com> Fix typo in as-yet unused macro definition. Signed-off-by: Amit Shah <amit.shah@qumranet.com> Signed-off-by: Avi Kivity <avi@qumranet.com> --- arch/x86/kvm/svm.c | 2 +- 1 files changed, 1 insertions(+), 1 deletions(-) diff --git a/arch/x86/kvm/svm.c b/arch/x86/kvm/svm.c index 179c2e0..be86c09 100644 --- a/arch/x86/kvm/svm.c +++ b/arch/x86/kvm/svm.c @@ -44,7 +44,7 @@ MODULE_LICENSE("GPL"); #define SVM_FEATURE_NPT (1 << 0) #define ...
Sep 25, 4:54 am 2008
Avi Kivity
[PATCH 12/39] KVM: VMX: Modify mode switching and vmentr ...
From: Mohammed Gamal <m.gamal005@gmail.com> This patch modifies mode switching and vmentry function in order to drive invalid guest state emulation. Signed-off-by: Mohammed Gamal <m.gamal005@gmail.com> Signed-off-by: Avi Kivity <avi@qumranet.com> --- arch/x86/kvm/vmx.c | 20 ++++++++++++++++++++ 1 files changed, 20 insertions(+), 0 deletions(-) diff --git a/arch/x86/kvm/vmx.c b/arch/x86/kvm/vmx.c index eae1f2c..9840f37 100644 --- a/arch/x86/kvm/vmx.c +++ b/arch/x86/kvm/vmx.c @@ ...
Sep 25, 4:54 am 2008
Avi Kivity
[PATCH 18/39] KVM: VMX: Change segment dpl at reset to 3
From: Avi Kivity <avi@qumranet.com> This is more emulation friendly, if not 100% correct. Signed-off-by: Avi Kivity <avi@qumranet.com> --- arch/x86/kvm/vmx.c | 2 +- 1 files changed, 1 insertions(+), 1 deletions(-) diff --git a/arch/x86/kvm/vmx.c b/arch/x86/kvm/vmx.c index 6aa305a..71e57ae 100644 --- a/arch/x86/kvm/vmx.c +++ b/arch/x86/kvm/vmx.c @@ -1991,7 +1991,7 @@ static void seg_setup(int seg) vmcs_write16(sf->selector, 0); vmcs_writel(sf->base, 0); ...
Sep 25, 4:54 am 2008
Avi Kivity
[PATCH 38/39] KVM: s390: Make facility bits future-proof
From: Christian Borntraeger <borntraeger@de.ibm.com> Heiko Carstens pointed out, that its safer to activate working facilities instead of disabling problematic facilities. The new code uses the host facility bits and masks it with known good ones. Signed-off-by: Christian Borntraeger <borntraeger@de.ibm.com> Signed-off-by: Avi Kivity <avi@qumranet.com> --- arch/s390/kvm/priv.c | 4 ++-- 1 files changed, 2 insertions(+), 2 deletions(-) diff --git a/arch/s390/kvm/priv.c ...
Sep 25, 4:55 am 2008
Avi Kivity
[PATCH 31/39] KVM: x86 emulator: Add mov r, imm instruct ...
From: Mohammed Gamal <m.gamal005@gmail.com> The emulator only supported one instance of mov r, imm instruction (opcode 0xb8), this adds the rest of these instructions. Signed-off-by: Mohammed Gamal <m.gamal005@gmail.com> Signed-off-by: Avi Kivity <avi@qumranet.com> --- arch/x86/kvm/x86_emulate.c | 15 +++++++++++---- 1 files changed, 11 insertions(+), 4 deletions(-) diff --git a/arch/x86/kvm/x86_emulate.c b/arch/x86/kvm/x86_emulate.c index ae30435..66e0bd6 100644 --- ...
Sep 25, 4:55 am 2008
Avi Kivity
[PATCH 00/39] KVM Updates for 2.6.28 merge window (part ...
Here is the second batch of the KVM updates for the 2.6.28 merge window. Linux 2.6.28 KVM will introduce support for pci device assignment and will improve overall emulation accuracy. Amit Shah (3): KVM: Device assignment: Check for privileges before assigning irq KVM: SVM: Fix typo KVM: Use kvm_set_irq to inject interrupts Avi Kivity (20): KVM: VMX: Use interrupt queue for !irqchip_in_kernel KVM: Simplify exception entries by using __ASM_SIZE and _ASM_PTR KVM: Handle ...
Sep 25, 4:54 am 2008
Avi Kivity
[PATCH 09/39] KVM: VMX: Add Guest State Validity Checks
From: Mohammed Gamal <m.gamal005@gmail.com> This patch adds functions to check whether guest state is VMX compliant. Signed-off-by: Mohammed Gamal <m.gamal005@gmail.com> Signed-off-by: Avi Kivity <avi@qumranet.com> --- arch/x86/kvm/vmx.c | 180 ++++++++++++++++++++++++++++++++++++++++++++++++++++ 1 files changed, 180 insertions(+), 0 deletions(-) diff --git a/arch/x86/kvm/vmx.c b/arch/x86/kvm/vmx.c index 81db7d4..e889b76 100644 --- a/arch/x86/kvm/vmx.c +++ b/arch/x86/kvm/vmx.c @@ ...
Sep 25, 4:54 am 2008
Avi Kivity
[PATCH 04/39] KVM: VMX: Use interrupt queue for !irqchip ...
From: Avi Kivity <avi@qumranet.com> Signed-off-by: Avi Kivity <avi@qumranet.com> --- arch/x86/kvm/vmx.c | 11 +++++------ 1 files changed, 5 insertions(+), 6 deletions(-) diff --git a/arch/x86/kvm/vmx.c b/arch/x86/kvm/vmx.c index 229e2d0..81db7d4 100644 --- a/arch/x86/kvm/vmx.c +++ b/arch/x86/kvm/vmx.c @@ -2173,7 +2173,7 @@ static void kvm_do_inject_irq(struct kvm_vcpu *vcpu) clear_bit(bit_index, &vcpu->arch.irq_pending[word_index]); if (!vcpu->arch.irq_pending[word_index]) ...
Sep 25, 4:54 am 2008
Avi Kivity
[PATCH 21/39] KVM: x86 emulator: remove bad ByteOp speci ...
From: Avi Kivity <avi@qumranet.com> Signed-off-by: Avi Kivity <avi@qumranet.com> --- arch/x86/kvm/x86_emulate.c | 2 +- 1 files changed, 1 insertions(+), 1 deletions(-) diff --git a/arch/x86/kvm/x86_emulate.c b/arch/x86/kvm/x86_emulate.c index 5d6c144..ae30435 100644 --- a/arch/x86/kvm/x86_emulate.c +++ b/arch/x86/kvm/x86_emulate.c @@ -270,7 +270,7 @@ static u16 group_table[] = { 0, 0, 0, 0, [Group3*8] = DstMem | SrcImm | ModRM, 0, - DstMem | SrcNone | ModRM, ByteOp | DstMem | ...
Sep 25, 4:54 am 2008
Avi Kivity
[PATCH 16/39] KVM: ia64: add a dummy irq ack notification
From: Xiantao Zhang <xiantao.zhang@intel.com> Before enabling notify_acked_irq for ia64, leave the related APIs as nop-op first. Signed-off-by: Xiantao Zhang <xiantao.zhang@intel.com> Signed-off-by: Avi Kivity <avi@qumranet.com> --- arch/ia64/kvm/irq.h | 32 ++++++++++++++++++++++++++++++++ virt/kvm/ioapic.c | 2 +- 2 files changed, 33 insertions(+), 1 deletions(-) create mode 100644 arch/ia64/kvm/irq.h diff --git a/arch/ia64/kvm/irq.h b/arch/ia64/kvm/irq.h new file mode ...
Sep 25, 4:54 am 2008
Avi Kivity
[PATCH 17/39] KVM: VMX: Change cs reset state to be a da ...
From: Avi Kivity <avi@qumranet.com> Real mode cs is a data segment, not a code segment. Signed-off-by: Avi Kivity <avi@qumranet.com> --- arch/x86/kvm/vmx.c | 3 +-- 1 files changed, 1 insertions(+), 2 deletions(-) diff --git a/arch/x86/kvm/vmx.c b/arch/x86/kvm/vmx.c index 9840f37..6aa305a 100644 --- a/arch/x86/kvm/vmx.c +++ b/arch/x86/kvm/vmx.c @@ -2239,6 +2239,7 @@ static int vmx_vcpu_reset(struct kvm_vcpu *vcpu) fx_init(&vmx->vcpu); + seg_setup(VCPU_SREG_CS); /* * ...
Sep 25, 4:54 am 2008
Avi Kivity
[PATCH 03/39] KVM: set debug registers after "schedulabl ...
From: Marcelo Tosatti <mtosatti@redhat.com> The vcpu thread can be preempted after the guest_debug_pre() callback, resulting in invalid debug registers on the new vcpu. Move it inside the non-preemptable section. Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com> Signed-off-by: Avi Kivity <avi@qumranet.com> --- arch/x86/kvm/x86.c | 9 ++++----- 1 files changed, 4 insertions(+), 5 deletions(-) diff --git a/arch/x86/kvm/x86.c b/arch/x86/kvm/x86.c index f1b0223..4a03375 100644 --- ...
Sep 25, 4:54 am 2008
Avi Kivity
[PATCH 05/39] KVM: Simplify exception entries by using _ ...
From: Avi Kivity <avi@qumranet.com> Signed-off-by: Avi Kivity <avi@qumranet.com> --- include/asm-x86/kvm_host.h | 13 ++----------- 1 files changed, 2 insertions(+), 11 deletions(-) diff --git a/include/asm-x86/kvm_host.h b/include/asm-x86/kvm_host.h index 1161af1..982b6b2 100644 --- a/include/asm-x86/kvm_host.h +++ b/include/asm-x86/kvm_host.h @@ -734,15 +734,6 @@ enum { TASK_SWITCH_GATE = 3, }; - -#ifdef CONFIG_64BIT -# define KVM_EX_ENTRY ".quad" -# define KVM_EX_PUSH ...
Sep 25, 4:54 am 2008
Avi Kivity
[PATCH 15/39] KVM: make irq ack notifier functions static
From: Harvey Harrison <harvey.harrison@gmail.com> sparse says: arch/x86/kvm/x86.c:107:32: warning: symbol 'kvm_find_assigned_dev' was not declared. Should it be static? arch/x86/kvm/i8254.c:225:6: warning: symbol 'kvm_pit_ack_irq' was not declared. Should it be static? Signed-off-by: Harvey Harrison <harvey.harrison@gmail.com> Signed-off-by: Andrew Morton <akpm@linux-foundation.org> Signed-off-by: Avi Kivity <avi@qumranet.com> --- arch/x86/kvm/i8254.c | 2 +- arch/x86/kvm/x86.c | ...
Sep 25, 4:54 am 2008
Avi Kivity
[PATCH 19/39] KVM: Load real mode segments correctly
From: Avi Kivity <avi@qumranet.com> Real mode segments to not reference the GDT or LDT; they simply compute base = selector * 16. Signed-off-by: Avi Kivity <avi@qumranet.com> --- arch/x86/kvm/x86.c | 22 ++++++++++++++++++++++ 1 files changed, 22 insertions(+), 0 deletions(-) diff --git a/arch/x86/kvm/x86.c b/arch/x86/kvm/x86.c index 22edd95..bfc7c33 100644 --- a/arch/x86/kvm/x86.c +++ b/arch/x86/kvm/x86.c @@ -3588,11 +3588,33 @@ static int load_segment_descriptor_to_kvm_desct(struct ...
Sep 25, 4:54 am 2008
Avi Kivity
[PATCH 25/39] KVM: MMU: Infer shadow root level in direc ...
From: Avi Kivity <avi@qumranet.com> In all cases the shadow root level is available in mmu.shadow_root_level, so there is no need to pass it as a parameter. Signed-off-by: Avi Kivity <avi@qumranet.com> --- arch/x86/kvm/mmu.c | 9 ++++----- 1 files changed, 4 insertions(+), 5 deletions(-) diff --git a/arch/x86/kvm/mmu.c b/arch/x86/kvm/mmu.c index 3ee856f..72f739a 100644 --- a/arch/x86/kvm/mmu.c +++ b/arch/x86/kvm/mmu.c @@ -1227,11 +1227,11 @@ static void nonpaging_new_cr3(struct ...
Sep 25, 4:54 am 2008
Avi Kivity
[PATCH 27/39] KVM: MMU: Convert direct maps to use the g ...
From: Avi Kivity <avi@qumranet.com> Signed-off-by: Avi Kivity <avi@qumranet.com> --- arch/x86/kvm/mmu.c | 93 ++++++++++++++++++++++++++++++--------------------- 1 files changed, 55 insertions(+), 38 deletions(-) diff --git a/arch/x86/kvm/mmu.c b/arch/x86/kvm/mmu.c index 8b95cf7..a1ca4ff 100644 --- a/arch/x86/kvm/mmu.c +++ b/arch/x86/kvm/mmu.c @@ -1260,49 +1260,66 @@ static void nonpaging_new_cr3(struct kvm_vcpu *vcpu) { } -static int __direct_map(struct kvm_vcpu *vcpu, gpa_t v, ...
Sep 25, 4:54 am 2008
KOSAKI Motohiro
[PATCH 2/2] hugepage: support ZERO_PAGE()
Changelog: v1 -> v3 o Coding style fix (Thanks mel, adam) ========================================== Subject: [PATCH v3] hugepage: support ZERO_PAGE() Now, hugepage doesn't use zero page at all because almost zero page is only used for coredumping and hugepage can't core dump ago. However now, we implemented hugepage coredumping. therefore we should implement the zero page of hugepage. This patch do it. Implementation ...
Sep 25, 4:54 am 2008
KOSAKI Motohiro
[PATCH 1/2] coredump_filter: add hugepage dumping v3
Changelog ----------------- v2 -> v3 - separated /proc/[pid]/core_dump_filter bits into shared and private mapping pages. - updated document v1 -> v2 - updated document ============================================ Subject: [PATCH v3] coredump_filter: add hugepage core dumping Now, hugepage's vma has VM_RESERVED flag in order not to being swapped. But VM_RESERVED vma isn't core dumped because this flag is often used for some kernel vmas (e.g. vmalloc, sound ...
Sep 25, 4:52 am 2008
Frédéric Weisbecker
Re: [Patch -tip 3/3] Tracing/ftrace: Don't consume entri ...
Hi Pekka. It's up to you. I just guessed that one would trace IO and stack for example. Or why not IO and functions: This sounds to me very useful (who performed this io..?) As you want.... Regards... --
Sep 25, 7:44 am 2008
Pekka Paalanen
Re: [Patch -tip 3/3] Tracing/ftrace: Don't consume entri ...
On Thu, 25 Sep 2008 13:28:56 +0100 NACK. mmiotrace's log output is stricly specified, the standard/default printing functions may NOT be used. The output is supposed to be machine readable in addition to human readable. The ftrace infrastructure assumes there is only one active tracer at a time, therefore destroying unhandled entries is not a problem. -- Pekka Paalanen http://www.iki.fi/pq/ --
Sep 25, 7:34 am 2008
Frédéric Weisbecker
[Patch -tip 3/3] Tracing/ftrace: Don't consume entries u ...
When mmiotrace can't handle an entry output, it returns 1. It should return 0 to relay on other output functions. Signed-off-by: Frederic Weisbecker <fweisbec@gmail.com> ---
Sep 25, 5:28 am 2008
Pekka Paalanen
Re: [Patch -tip 3/3] Tracing/ftrace: Don't consume entri ...
On Thu, 25 Sep 2008 16:44:08 +0200 We can (must) return to this if/when multiple tracers are allowed to be active simultaneously. But in the current situation, I do not see a way to trace more than one thing, but I haven't really looked into the other tracers. OTOH, mmiotrace log format has a pid field, which is pretty much unused currently. It could be used to record the current process, if one exists. That can be added, should the need arise, but so far the field has been reserved for ...
Sep 25, 8:36 am 2008
Frédéric Weisbecker
Re: [Patch -tip 3/3] Tracing/ftrace: Don't consume entri ...
You can enable another tracer simultaneously. I never tried it but you can launch one from debugfs and another from kernel code for example. The new boot tracer will have to launch the sched_switch tracer. And since the functions which display the sched_switch entries are already inside trace.c I wanted to relay the display to the "default display functions". That's one of the reason of my patch (+ the bug). But I could proceed in a another way than default relaying. Why not using a function ...
Sep 25, 9:13 am 2008
Ingo Molnar
Re: [Patch -tip 3/3] Tracing/ftrace: Don't consume entri ...
applied to tip/tracing/mmiotrace, thanks Frédéric! Ingo --
Sep 25, 5:56 am 2008
Frédéric Weisbecker
[Patch -tip 2/3] Tracing/ftrace: Don't consume unhandled ...
When the boot tracer can't handle an entry output, it returns 1. It should return 0 to relay on other output functions. Signed-off-by: Frederic Weisbecker <fweisbec@gmail.com> --
Sep 25, 5:25 am 2008
Ingo Molnar
Re: [Patch -tip 2/3] Tracing/ftrace: Don't consume unhan ...
applied to tip/tracing/fastboot, thanks Frédéric! Ingo --
Sep 25, 5:53 am 2008
Ingo Molnar
Re: [Patch -tip 1/3] Tracing/ftrace: Relay unhandled ent ...
ah, nice! Applied to tip/tracing/ftrace. Small nit: could you please send another patch on top of this patch that cleans up the checkpatch failures below? checkpatch usage: you can run it like this: scripts/checkpatch.pl --file kernel/trace/*.c and/or you can run it over patches as well before you submit them. Ingo -------------> ERROR: do not use assignment in if condition #43: FILE: kernel/trace/trace.c:1910: + if ((ret = iter->trace->print_line(iter))) ERROR: do not ...
Sep 25, 5:38 am 2008
Pekka Paalanen
Re: [Patch -tip 1/3] Tracing/ftrace: Relay unhandled ent ...
On Thu, 25 Sep 2008 17:33:02 +0200 My understanding is that the pipe will be broken only, if the trace framework believes tracing is disabled. Recall the discussion about tracer_enabled = 0. Which is a bug, and I hoped Steven would have commented on how to fix that properly. Or was it fixed already, and this is a different issue? I didn't notice. -- Pekka Paalanen http://www.iki.fi/pq/ --
Sep 25, 8:49 am 2008
Pekka Paalanen
Re: [Patch -tip 1/3] Tracing/ftrace: Relay unhandled ent ...
On Thu, 25 Sep 2008 17:56:32 +0200 Oh cool, sounds like I should try linux-next when I can. -- Pekka Paalanen http://www.iki.fi/pq/ --
Sep 25, 9:01 am 2008
Frédéric Weisbecker
[Patch -tip 1/3] Tracing/ftrace: Relay unhandled entry output
Hi! I tried to figure out the origin of the bug reported by Pekka Paalanen about the broken pipe: http://kerneltrap.org/mailarchive/linux-kernel/2008/9/15/3305224 When I add a trace_mark with the boot tracer, I had this same problem but this time it was easy to reproduce. When it calls a tracer's print_line callback, the print_trace_line function in trace.c returns whithout verifying if it could handle the entry properly. And actually the seq could be empty. For example the boot_tracer ...
Sep 25, 5:19 am 2008
Frédéric Weisbecker Sep 25, 7:56 am 2008
Frédéric Weisbecker
Re: [Patch -tip 1/3] Tracing/ftrace: Relay unhandled ent ...
It is on -tip if you want :) So, unless someone is opposed to. I will send a patch which define three possible values for the print_line callback. I will define these values on trace.h Why not: #define TRACE_TYPE_PARTIAL_LINE 0 #define TRACE_TYPE_HANDLED 1 #define TRACE_TYPE_UNHANDLED 2 => will relay on default functions Hmm I'm questionning about the relevance of this kind of patch since the tracing API is in refactoring... --
Sep 25, 9:40 am 2008
Pekka Paalanen
Re: [Patch -tip 1/3] Tracing/ftrace: Relay unhandled ent ...
On Thu, 25 Sep 2008 15:56:03 +0100 Frederic, in the future, could you just copy the patch into the email body (watch out for line wraps and other damage), attachments are not usually included in "reply with quote", NACK. IMHO this breaks the trace_seq handling. trace_seq may contain the output of several entries, as far as they fit in it as a whole. E.g. trace_seq_printf() does not print partial lines but returns 0, so that the entry is not consumed right now. The user space reader must ...
Sep 25, 8:09 am 2008
Frédéric Weisbecker
Re: [Patch -tip 1/3] Tracing/ftrace: Relay unhandled ent ...
Actually there was two bugs which broke the pipe: _ As you say the old one when none tracer put tracer_enabled = 0 _ And the new one I described here. I think the first is fixed since none tracer has been replaced by "nop tracer" (a recently implemented) which doesn't touch tracer_enabled at all. --
Sep 25, 8:56 am 2008
Frédéric Weisbecker
Re: [Patch -tip 1/3] Tracing/ftrace: Relay unhandled ent ...
Ok. I think I have to change my email client. I'm starting to get rid of all these blank lines or other issues with the patches... Hmm you're right. I didn't thought about the partial line which must not be printed. The problem is that with this convention, 0 means two things: "I will handle this entry later" or "I can't handle it". But if you return 0 because you can't handle it, and if the current len of the seq is 0, the I think it's a good solution. Thanks to you! I didn't see ...
Sep 25, 8:33 am 2008
Heiko Carstens
[PATCH] iucv: Fix mismerge again.
Subject: [PATCH] iucv: Fix mismerge again. From: Heiko Carstens <heiko.carstens@de.ibm.com> fb65a7c091529bfffb1262515252c0d0f6241c5c ("iucv: Fix bad merging.") fixed a merge error, but in a wrong way. We now end up with the bug below. This patch corrects the mismerge like it was intended. BUG: scheduling while atomic: swapper/1/0x00000000 Modules linked in: CPU: 1 Not tainted 2.6.27-rc7-00094-gc0f4d6d #9 Process swapper (pid: 1, task: 000000003fe7d988, ksp: ...
Sep 25, 4:09 am 2008
Dmitry Baryshkov
lockdep warning inside ide/bio
Hi, I got this when running the tree based on commit c0f4d6d4b14a75a341d972ff73fb9740e1ceb634. It's arm (tosa) emulated in qemu. ------------[ cut here ]------------ WARNING: at /home/lumag/tosa-tree/kernel/lockdep.c:2195 trace_hardirqs_on_caller+0xf0/0x170() Modules linked in: [<c0029fe4>] (dump_stack+0x0/0x14) from [<c003fc40>] (warn_on_slowpath+0x4c/0x84) [<c003fbf4>] (warn_on_slowpath+0x0/0x84) from [<c00629ac>] (trace_hardirqs_on_caller+0xf0/0x170) r6:c0062a40 r5:00000001 ...
Sep 25, 3:53 am 2008
Joerg Roedel
[PATCH 1/3] x86/iommu: make GART driver checkpatch clean
Signed-off-by: Joerg Roedel <joerg.roedel@amd.com> --- arch/x86/kernel/pci-gart_64.c | 17 +++++++++-------- include/asm-x86/gart.h | 2 ++ 2 files changed, 11 insertions(+), 8 deletions(-) diff --git a/arch/x86/kernel/pci-gart_64.c b/arch/x86/kernel/pci-gart_64.c index b85a2f9..aa569db 100644 --- a/arch/x86/kernel/pci-gart_64.c +++ b/arch/x86/kernel/pci-gart_64.c @@ -27,8 +27,8 @@ #include <linux/scatterlist.h> #include <linux/iommu-helper.h> #include ...
Sep 25, 3:13 am 2008
Joerg Roedel
[PATCH 0/3] some small GART cleanups
Hi, this small patch series contains some cleanups for the K8 GART driver in the x86 architecture. diffstat: arch/x86/kernel/pci-gart_64.c | 38 ++++++++++++++++++-------------------- include/asm-x86/gart.h | 2 ++ 2 files changed, 20 insertions(+), 20 deletions(-) --
Sep 25, 3:13 am 2008
Ingo Molnar Sep 25, 3:20 am 2008
Joerg Roedel
Re: [PATCH 3/3] x86/iommu: use __GFP_ZERO instead of mem ...
Argh, yes. Please try this one: = From c402fd7f5fe930d45a02ef863bf3fb61851e6ad0 Mon Sep 17 00:00:00 2001 From: Joerg Roedel <joerg.roedel@amd.com> Date: Thu, 25 Sep 2008 12:39:06 +0200 Subject: [PATCH] x86/iommu: use __GFP_ZERO instead of memset for GART Signed-off-by: Joerg Roedel <joerg.roedel@amd.com> --- arch/x86/kernel/pci-gart_64.c | 13 +++++-------- 1 files changed, 5 insertions(+), 8 deletions(-) diff --git a/arch/x86/kernel/pci-gart_64.c ...
Sep 25, 3:42 am 2008
Joerg Roedel
[PATCH 2/3] x86/iommu: convert GART need_flush to bool
Signed-off-by: Joerg Roedel <joerg.roedel@amd.com> --- arch/x86/kernel/pci-gart_64.c | 10 +++++----- 1 files changed, 5 insertions(+), 5 deletions(-) diff --git a/arch/x86/kernel/pci-gart_64.c b/arch/x86/kernel/pci-gart_64.c index aa569db..aecea06 100644 --- a/arch/x86/kernel/pci-gart_64.c +++ b/arch/x86/kernel/pci-gart_64.c @@ -80,7 +80,7 @@ AGPEXTERN int agp_memory_reserved; AGPEXTERN __u32 *agp_gatt_table; static unsigned long next_bit; /* protected by iommu_bitmap_lock ...
Sep 25, 3:13 am 2008
Joerg Roedel
[PATCH 3/3] x86/iommu: use __GFP_ZERO instead of memset ...
Signed-off-by: Joerg Roedel <joerg.roedel@amd.com> --- arch/x86/kernel/pci-gart_64.c | 11 ++++------- 1 files changed, 4 insertions(+), 7 deletions(-) diff --git a/arch/x86/kernel/pci-gart_64.c b/arch/x86/kernel/pci-gart_64.c index aecea06..84d541f 100644 --- a/arch/x86/kernel/pci-gart_64.c +++ b/arch/x86/kernel/pci-gart_64.c @@ -674,13 +674,13 @@ static __init int init_k8_gatt(struct agp_kern_info *info) info->aper_size = aper_size >> 20; gatt_size = (aper_size >> PAGE_SHIFT) * ...
Sep 25, 3:13 am 2008
Yi Yang
[PATCH] USB: improve ehci_watchdog's side effect in CPU ...
ehci_watchdog will wake up CPU very frequently so that CPU stays at C3 very short, average residence time is about 50 ms on Aspire One, but we expect it should be about 1 second or more, so this kind of periodic timer is very bad for power saving. We can't remove this timer because of some bad USB controller chipset, but at least we should reduce its side effect to as possible as low. This patch can make CPU stay at C3 longer, average residence time is about twice as long as original. ...
Sep 25, 2:25 am 2008
el es
Re: [kernel-abi] symbol versioning vs. out-of-tree modules.
+ http://www.mcentral.de (there is lots of discussion on linux.drivers.dvb, the maintainer has his own voes but... the driver actually works very good for me and I believe it should be mainline regardless) el es --
Sep 25, 1:52 am 2008
Borislav Petkov
[PATCH] ide-cd: fix another NULL ptr in debug statement
Hi Bart, here's another null ptr fix in the debugging code. I'll be auditing all the debug fragments in case I've missed some. --- From: Borislav Petkov <petkovbb@gmail.com> Date: Thu, 25 Sep 2008 09:36:26 +0200 Subject: [PATCH] ide-cd: fix another NULL ptr in debug statement There should be no functionality change resulting from this patch. Signed-off-by: Borislav Petkov <petkovbb@gmail.com> --- drivers/ide/ide-cd.c | 11 +++++++---- 1 files changed, 7 insertions(+), 4 ...
Sep 25, 12:51 am 2008
David Woodhouse
Re: [PATCH] x86/pci: fix dmar_tbl early_ioremap leak v2
I own the code in theory, but am not yet sufficiently competent. My VT-d hardware arrived just before I left the country for KS+LPC, so is gathering dust in my absence. I get home this weekend. _Next_ week, if you see me thinking about _anything_ other than the IOMMU, please slap me. -- David Woodhouse Open Source Technology Centre David.Woodhouse@intel.com Intel Corporation --
Sep 25, 7:59 am 2008
mark gross
Re: [PATCH] x86/pci: fix dmar_tbl early_ioremap leak v2
Sorry for the dumb question but where is acpi_get_table_with_size defined? Its not in linus's git. /me goes and gets linux-next... --mgross --
Sep 25, 9:24 am 2008
Suresh Siddha
Re: [PATCH] x86/pci: fix dmar_tbl early_ioremap leak v2
Acked-by: Suresh Siddha <suresh.b.siddha@intel.com> --
Sep 25, 11:56 am 2008
Ingo Molnar
Re: [PATCH] x86/pci: fix dmar_tbl early_ioremap leak v2
since this depends on other x2apic/sparseirq changes in -tip i've applied it to tip/irq/sparseirq. Jesse, any objections? Ingo --
Sep 25, 1:11 am 2008
Yinghai Lu Sep 25, 10:01 am 2008
Jesse Barnes
Re: [PATCH] x86/pci: fix dmar_tbl early_ioremap leak v2
Nope, but it would be good to get Dave or Mark's ack (Dave owns this code now). Jesse --
Sep 25, 7:43 am 2008
Yinghai Lu
[PATCH] x86/pci: fix dmar_tbl early_ioremap leak v2
use early_acpi_os_unmap_memory v2: move unmap_memory early... Signed-off-by: Yinghai Lu <yhlu.kernel@gmail.com> --- drivers/pci/dmar.c | 9 +++++++-- 1 file changed, 7 insertions(+), 2 deletions(-) Index: linux-2.6/drivers/pci/dmar.c =================================================================== --- linux-2.6.orig/drivers/pci/dmar.c +++ linux-2.6/drivers/pci/dmar.c @@ -42,6 +42,7 @@ LIST_HEAD(dmar_drhd_units); static struct acpi_table_header * __initdata ...
Sep 25, 12:28 am 2008
Matthew Garrett
Re: CONFIG_ACPI_VIDEO on Thinkpad blanks display.
Yes. X assumed that any ACPI key press was a request to switch the video output. Upgrade X. -- Matthew Garrett | mjg59@srcf.ucam.org --
Sep 25, 3:07 am 2008
carbonated beverage
Re: CONFIG_ACPI_VIDEO on Thinkpad blanks display.
Okay, I'll see what happens. -- DN Daniel --
Sep 25, 1:16 pm 2008
carbonated beverage
Re: CONFIG_ACPI_VIDEO on Thinkpad blanks display.
Since CONFIG_ACPI_VIDEO wasn't required before 2.6.26.x to change the LCD brightness, I'd rather the kernel go back to whatever it was doing for 2.6.25.x. Upgrading X past what's currently packaged by the distribution just to get a new kernel working is somewhat of a pain. What change linked the brightness keys to CONFIG_ACPI_VIDEO? -- DN Daniel --
Sep 25, 12:09 pm 2008
Henrique de Moraes H ...
Re: CONFIG_ACPI_VIDEO on Thinkpad blanks display.
Well, we do give you a way out: do not load ACPI video, and load thinkpad-acpi with the proper parameters to enable its backlight control (if Your X is broken, and in an extremely hideous way. You could try to get it to disable ACPI key support, or you can make sure nothing EVER sends ACPI video events to it. That means you must not use the ACPI video module in your kernel until you install a fixed X.org. -- "One disk to rule them all, One disk to find them. One disk to bring them ...
Sep 25, 12:58 pm 2008
carbonated beverage
Thinkpad brightness keys kill X on 2.6.26.5
Hi all, I just upgraded from 2.6.25.17 to 2.6.26.5 on my Thinkpad Z61t, and found the brightness keys will cause the X server to die. When using the brightness up/down keys, the screen will go blank -- Control+Alt+F# will still switch to a different virtual console, but switching back will not restore the video state. After killing the window manager, I was able to bring up X again without a problem -- but using the brightness keys still causes it to go blank again within X, while the ...
Sep 24, 11:02 pm 2008
carbonated beverage
CONFIG_ACPI_VIDEO on Thinkpad blanks display.
(Subject changed to match the observed cause) Figured out what the change was -- in 2.6.25.x, the brightness keys worked without setting CONFIG_ACPI_VIDEO. However, in 2.6.26.x, the brightness keys do not work on the console without that enabled. If 2.6.25.x has that config option enabled, I still get the same blank screen symptoms -- with the following in the X logs: (II) PM Event received: Capability Changed I830PMEvent: Capability change (II) I810(0): Next ACPI _DGS [0] ...
Sep 25, 12:07 am 2008
Yinghai Lu Sep 24, 11:48 pm 2008
Yinghai Lu
[PATCH] x86/pci: fix dmar_tbl early_ioremap leak
use early_acpi_os_unmap_memory Signed-off-by: Yinghai Lu <yhlu.kernel@gmail.com> --- drivers/pci/dmar.c | 9 +++++++-- 1 file changed, 7 insertions(+), 2 deletions(-) Index: linux-2.6/drivers/pci/dmar.c =================================================================== --- linux-2.6.orig/drivers/pci/dmar.c +++ linux-2.6/drivers/pci/dmar.c @@ -42,6 +42,7 @@ LIST_HEAD(dmar_drhd_units); static struct acpi_table_header * __initdata dmar_tbl; +static acpi_size __initdata ...
Sep 24, 11:27 pm 2008
Jing Huang
RE: [PATCH 1/6] bfa: Brocade BFA FC SCSI Driver submission
Thanks James. I am trying unix mail command. It seems to be working fine. I will resubmit once I make sure everything is ok. Again, sorry about the trouble. Jing -----Original Message----- From: James Bottomley [mailto:James.Bottomley@HansenPartnership.com] Sent: Thursday, September 25, 2008 8:35 AM To: Jing Huang Cc: Jeremy Higdon; Greg KH; linux-scsi@vger.kernel.org; linux-kernel@vger.kernel.org; Ramkumar Vadivelu; Vinodh Ravindran; Srikanth Rayas (CW) Subject: RE: [PATCH 1/6] bfa: ...
Sep 25, 9:56 am 2008
Jing Huang
RE: [PATCH 1/6] bfa: Brocade BFA FC SCSI Driver submission
Hi James, I looked at Documentation/email-clients.txt. There is no info about outlook. So I believe Jeremy is right. Is attachment acceptable for my first submission? Meanwhile I will find out if I can use other email client in my company. Jing -----Original Message----- From: Jeremy Higdon [mailto:jeremy@sgi.com] Sent: Wednesday, September 24, 2008 10:57 PM To: Jing Huang Cc: James Bottomley; Greg KH; linux-scsi@vger.kernel.org; linux-kernel@vger.kernel.org; Ramkumar Vadivelu; ...
Sep 24, 11:09 pm 2008
James Bottomley
RE: [PATCH 1/6] bfa: Brocade BFA FC SCSI Driver submission
Yes ... attachments can be tolerated for people who are forced to use outlook. We had a similar issue with Adaptec engineers. There are rumours that outlook can be persuaded not to mangle text, but everyone who claims to be able to do it has been unable to reproduce under laboratory conditions. James --
Sep 25, 8:34 am 2008
Ingo Molnar
Re: [PATCH 7/7] x86: print out irq nr for msi/ht -v2
still bad: arch/x86/kernel/hpet.c: In function ‘hpet_late_init’: arch/x86/kernel/hpet.c:828: error: ‘dev’ undeclared (first use in this function) arch/x86/kernel/hpet.c:828: error: (Each undeclared identifier is reported only once arch/x86/kernel/hpet.c:828: error: for each function it appears in.) we've got some changes in this area (per cpu hpet). Ingo --
Sep 25, 2:12 am 2008
Bjorn Helgaas
Re: [PATCH 7/7] x86: print out irq nr for msi/ht -v2
Why do we want to print the irq as hex? Do we do that anywhere else? I think we should pick one format and stick with it, and I think decimal is the logical choice because it's the most common. --
Sep 25, 8:00 am 2008
Yinghai Lu
[PATCH 7/7] x86: print out irq nr for msi/ht -v2
v2: fix hpet compiling Signed-off-by: Yinghai Lu <yhlu.kernel@gmail.com> --- arch/x86/kernel/hpet.c | 3 +++ arch/x86/kernel/io_apic.c | 7 +++++++ 2 files changed, 10 insertions(+), 0 deletions(-) diff --git a/arch/x86/kernel/hpet.c b/arch/x86/kernel/hpet.c index 422c577..686505a 100644 --- a/arch/x86/kernel/hpet.c +++ b/arch/x86/kernel/hpet.c @@ -467,6 +467,9 @@ static int hpet_setup_irq(struct hpet_dev *dev) irq_set_affinity(dev->irq, cpumask_of_cpu(dev->cpu)); ...
Sep 24, 11:13 pm 2008
Yinghai Lu
Re: [PATCH 7/7] x86: print out irq nr for msi/ht -v2
looks there is some merge problem. Index: linux-2.6/arch/x86/kernel/hpet.c =================================================================== --- linux-2.6.orig/arch/x86/kernel/hpet.c +++ linux-2.6/arch/x86/kernel/hpet.c @@ -467,6 +467,9 @@ static int hpet_setup_irq(struct hpet_de irq_set_affinity(dev->irq, cpumask_of_cpu(dev->cpu)); enable_irq(dev->irq); + printk(KERN_DEBUG "hpet: %s is using irq %#x aka %d for MSI\n", + dev->name, dev->irq, ...
Sep 25, 10:20 am 2008
Rolf Eike Beer
Re: [PATCH 7/7] x86: print out irq nr for msi/ht -v2
Are you sure you sent the right patch? It only adds some printk()'s and=20 unrelated newlines. Eike
Sep 25, 4:30 am 2008
Yinghai Lu Sep 25, 10:03 am 2008
Ingo Molnar
Re: [PATCH 7/7] x86: print out irq nr for msi/ht -v2
applied to tip/irq/sparseirq, thanks Yinghai. Ingo --
Sep 25, 1:42 am 2008
Yinghai Lu
Re: [PATCH 7/7] x86: print out irq nr for msi/ht -v2
when sparseirq is enabled, irq for msi will be bus_nr/dev/func + 12bits. so show it as hex is more straight. but we can not use hex in /proc/interrupts --ABI want decimal. just try to print out...in log, so user could get some idea that who is using it... YH --
Sep 25, 10:07 am 2008
Bjorn Helgaas
Re: [PATCH 7/7] x86: print out irq nr for msi/ht -v2
I think it'd be better to just print the MSI IRQ number in decimal using a dev_printk(). Then the connection is obvious and we don't need to make everything ugly by having IRQs in both decimal and hex. I know it's a nuisance to make hpet use dev_printk because it's pretty muddled as far as getting a struct device. But I'd rather fix that than add confusion like this. Bjorn --
Sep 25, 10:52 am 2008
KAMEZAWA Hiroyuki
[PATCH 2/12] memcg move charege() call to swapped-in pag ...
While page-cache's charge/uncharge is done under page_lock(), swap-cache isn't. (anonymous page is charged when it's newly allocated.) This patch moves do_swap_page()'s charge() call under lock. This helps us to avoid to charge already mapped one, unnecessary calls. Signed-off-by: KAMEZAWA Hiroyuki <kamezawa.hiroyu@jp.fujitsu.com> mm/memory.c | 7 +++---- 1 file changed, 3 insertions(+), 4 deletions(-) Index: ...
Sep 24, 11:14 pm 2008
KAMEZAWA Hiroyuki
[PATCH 4/12] memcg make page->mapping NULL before callin ...
This patch tries to make page->mapping to be NULL before mem_cgroup_uncharge_cache_page() is called. "page->mapping == NULL" is a good check for "whether the page is still radix-tree or not". This patch also adds BUG_ON() to mem_cgroup_uncharge_cache_page(); Signed-off-by: KAMEZAWA Hiroyuki <kamezawa.hiroyu@jp.fujitsu.com> mm/filemap.c | 2 +- mm/memcontrol.c | 1 + mm/migrate.c | 12 +++++++++--- 3 files changed, 11 insertions(+), 4 deletions(-) Index: ...
Sep 24, 11:16 pm 2008
KAMEZAWA Hiroyuki
[PATCH 3/12] memcg make root cgroup unlimited.
Make root cgroup of memory resource controller to have no limit. By this, users cannot set limit to root group. This is for making root cgroup as a kind of trash-can. For accounting pages which has no owner, which are created by force_empty, we need some cgroup with no_limit. A patch for rewriting force_empty will will follow this one. Signed-off-by: KAMEZAWA Hiroyuki <kamezawa.hiroyu@jp.fujitsu.com> Documentation/controllers/memory.txt | 4 ++++ mm/memcontrol.c ...
Sep 24, 11:15 pm 2008
Dave Hansen
Re: [PATCH 9/12] memcg allocate all page_cgroup at boot
I thought the use of this variable was under the +#ifdef CONFIG_FLAT_NODE_MEM_MAP options. Otherwise, we unconditionally bloat mem_section, right? -- Dave --
Sep 25, 11:40 am 2008
KAMEZAWA Hiroyuki
[PATCH 12/12] memcg: fix race at charging swap-in
There is a small race in do_swap_page(). When the page swapped-in is charged, the mapcount can be greater than 0. But, at the same time some process (shares it ) call unmap and make mapcount 1->0 and the page is uncharged. For fixing this, I added a new interface. - precharge account to res_counter by PAGE_SIZE and try to free pages if necessary. - commit register page_cgroup and add to LRU if necessary. - cancel uncharge PAGE_SIZE because of do_swap_page failure. This ...
Sep 24, 11:36 pm 2008
KAMEZAWA Hiroyuki
[PATCH 10/12] memcg free page_cgroup from LRU in lazy
Free page_cgroup from its LRU in batched manner. When uncharge() is called, page is pushed onto per-cpu vector and removed from LRU, later.. This routine resembles to global LRU's pagevec. This patch is half of the whole patch and a set with following lazy LRU add patch. Signed-off-by: KAMEZAWA Hiroyuki <kamezawa.hiroyu@jp.fujitsu.com> mm/memcontrol.c | 164 +++++++++++++++++++++++++++++++++++++++++++++++++++----- 1 file changed, 152 insertions(+), 12 deletions(-) Index: ...
Sep 24, 11:33 pm 2008
KAMEZAWA Hiroyuki
[PATCH 0/12] memcg updates v5
Hi, I updated the stack and reflected comments. Against the latest mmotm. (rc7-mm1) Major changes from previous one is - page_cgroup allocation/lookup manner is changed. all FLATMEM/DISCONTIGMEM/SPARSEMEM and MEMORY_HOTPLUG is supported. - force_empty is totally rewritten. and a problem that "force_empty takes long time" in previous version is fixed (I think...) - reordered patches. - first half are easy ones. - second half are big ones. I'm still testing with full ...
Sep 24, 11:11 pm 2008
KAMEZAWA Hiroyuki
[PATCH 1/12] memcg avoid accounting special mappings not ...
There are not-on-LRU pages which can be mapped and they are not worth to be accounted. (becasue we can't shrink them and need dirty codes to handle specical case) We'd like to make use of usual objrmap/radix-tree's protcol and don't want to account out-of-vm's control pages. When special_mapping_fault() is called, page->mapping is tend to be NULL and it's charged as Anonymous page. insert_page() also handles some special pages from drivers. This patch is for avoiding to account special ...
Sep 24, 11:13 pm 2008
KAMEZAWA Hiroyuki
[PATCH 8/12] memcg rewrite force empty to move account to root
Current force_empty of memory resource controller just removes page_cgroup. This maans the page is never accounted at all and create an in-use page which has no page_cgroup. This patch tries to move account to "root" cgroup. By this patch, force_empty doesn't leak an account but move account to "root" cgroup. Maybe someone can think of other enhancements as moving account to its parent. (But moving to the parent means we have to handle "limit" of pages. Need more complicated work to do ...
Sep 24, 11:29 pm 2008
KAMEZAWA Hiroyuki
[PATCH 11/12] memcg add to LRU in lazy
Delaying add_to_lru() and do it in batched manner like page_vec. For doing that 2 flags PCG_USED and PCG_LRU. If PCG_LRU is set, page is on LRU. It safe to access LRU via page_cgroup. (under some lock.) For avoiding race, this patch uses TestSetPageCgroupUsed(). and checking PCG_USED bit and PCG_LRU bit in add/free vector. By this, lock_page_cgroup() in mem_cgroup_charge() is removed. (I don't want to call lock_page_cgroup() under mz->lru_lock when add/free vector core logic. So, ...
Sep 24, 11:35 pm 2008
KAMEZAWA Hiroyuki
[PATCH 9/12] memcg allocate all page_cgroup at boot
Allocate all page_cgroup at boot and remove page_cgroup poitner from struct page. This patch adds an interface as struct page_cgroup *lookup_page_cgroup(struct page*) All FLATMEM/DISCONTIGMEM/SPARSEMEM and MEMORY_HOTPLUG is supported. Remove page_cgroup pointer reduces the amount of memory by - 4 bytes per PAGE_SIZE. - 8 bytes per PAGE_SIZE if memory controller is disabled. (even if configured.) meta data usage of this is no problem in FLATMEM/DISCONTIGMEM. On SPARSEMEM, this makes ...
Sep 24, 11:32 pm 2008
KAMEZAWA Hiroyuki
[PATCH 6/12] memcg optimize percpu stat
Some obvious optimization to memcg. I found mem_cgroup_charge_statistics() is a little big (in object) and does unnecessary address calclation. This patch is for optimization to reduce the size of this function. And res_counter_charge() is 'likely' to success. Changelog v3->v4: - merged with an other leaf patch. Signed-off-by: KAMEZAWA Hiroyuki <kamezawa.hiroyu@jp.fujitsu.com> mm/memcontrol.c | 18 ++++++++++-------- 1 file changed, 10 insertions(+), 8 deletions(-) Index: ...
Sep 24, 11:18 pm 2008
KAMEZAWA Hiroyuki
[PATCH 7/12] memcg add function to move account
This patch provides a function to move account information of a page between mem_cgroups. This moving of page_cgroup is done under - lru_lock of source/destination mem_cgroup is held. - lock_page_cgroup() is held. Then, a routine which touches pc->mem_cgroup without lock_page_cgroup() should confirm pc->mem_cgroup is still valid or not. Typlical code can be following. (while page is not under lock_page()) mem = pc->mem_cgroup; mz = ...
Sep 24, 11:27 pm 2008
KAMEZAWA Hiroyuki
[PATCH 5/12] memcg make page_cgroup->flags atomic
This patch makes page_cgroup->flags to be atomic_ops and define functions (and macros) to access it. This patch itself makes memcg slow but this patch's final purpose is to remove lock_page_cgroup() and allowing fast access to page_cgroup. (And total performance will increase after all patches applied.) Before trying to modify memory resource controller, this atomic operation on flags is necessary. Most of flags in this patch is for LRU and modfied under mz->lru_lock but we'll add another ...
Sep 24, 11:17 pm 2008
Bharata B Rao
sched:
Hi, sched: Maintain only task entities in cfs_rq->tasks list. cfs_rq->tasks list is used by the load balancer to iterate over all the tasks. Currently it holds all the entities (both task and group entities) because of which there is a need to check for group entities explicitly during load balancing. This patch changes the cfs_rq->tasks list to hold only task entities. Signed-off-by: Bharata B Rao <bharata@linux.vnet.ibm.com> CC: Srivatsa Vaddagiri <vatsa@linux.vnet.ibm.com> CC: Peter ...
Sep 24, 9:21 pm 2008
Peter Zijlstra Sep 25, 2:14 am 2008
Ingo Molnar
Re: sched:
applied to tip/sched/devel, thanks! Ingo --
Sep 25, 2:19 am 2008
Bharata B Rao
sched: Maintain only task entities in cfs_rq->tasks list.
Oops, missed the subject line in the last post :( [repost] sched: Maintain only task entities in cfs_rq->tasks list. cfs_rq->tasks list is used by the load balancer to iterate over all the tasks. Currently it holds all the entities (both task and group entities) because of which there is a need to check for group entities explicitly during load balancing. This patch changes the cfs_rq->tasks list to hold only task entities. Signed-off-by: Bharata B Rao <bharata@linux.vnet.ibm.com> CC: ...
Sep 24, 9:23 pm 2008
Jing Huang
[PATCH 3/6] bfa: Brocade BFA FC SCSI Driver submission
From: Jing Huang <huangj@brocade.com> This patch contains common header files for linux driver and hardware/firmeare interface. It is created using 2.6.27-rc7 kernel. Signed-off-by: Jing Huang <huangj@brocade.com> --- drivers/scsi/bfa/include/aen/bfa_aen.h | 73 + drivers/scsi/bfa/include/aen/bfa_aen_adapter.h | 31 drivers/scsi/bfa/include/aen/bfa_aen_audit.h | 31 drivers/scsi/bfa/include/aen/bfa_aen_ioc.h | 35 ...
Sep 24, 9:04 pm 2008
Jing Huang
[PATCH 6/6] bfa: Brocade BFA FC SCSI Driver submission
From: Jing Huang <huangj@brocade.com> This patch contains updated MAINTAINERS file. It is created using 2.6.27-rc7 kernel. Signed-off-by: Jing Huang <huangj@brocade.com> --- MAINTAINERS | 6 ++++++ 1 files changed, 6 insertions(+) diff -urpN orig/MAINTAINERS patch/MAINTAINERS --- orig/MAINTAINERS 2008-09-24 13:40:12.000000000 -0700 +++ patch/MAINTAINERS 2008-09-24 13:44:25.000000000 -0700 @@ -995,6 +995,12 @@ M: mchan@broadcom.com L: netdev@vger.kernel.org S: Supported ...
Sep 24, 9:12 pm 2008
Jing Huang
[PATCH 2/6] bfa: Brocade BFA FC SCSI Driver submission
From: Jing Huang <huangj@brocade.com> This patch contains code that interfaces to Brocade Fibre channel HBA firmware/hardware. It is created using 2.6.27-rc7 kernel. Signed-off-by: Jing Huang <huangj@brocade.com> --- drivers/scsi/bfa/bfa_callback_priv.h | 55 + drivers/scsi/bfa/bfa_cb_ioim_macros.h | 215 ++++ drivers/scsi/bfa/bfa_core.c | 421 ++++++++ drivers/scsi/bfa/bfa_csdebug.c | 60 + drivers/scsi/bfa/bfa_fcpim.c | 158 +++ ...
Sep 24, 9:01 pm 2008
Greg KH
Re: [PATCH 5/6] bfa: Brocade BFA FC SCSI Driver submission
A lot of these (like the statistics) look like they should be debugfs entries instead of sysfs. Why do they need to be sysfs files, who/what is going to use these entries? thanks, greg k-h --
Sep 24, 9:50 pm 2008
James Bottomley
Re: [PATCH 5/6] bfa: Brocade BFA FC SCSI Driver submission
This is completely the wrong thing to do. The driver needs to bind to the Fibre Channel transport class which provides all of these features in a large measure through infrastructure shareable with the other FC drivers. Any other pieces that are brocade specific rather than FC general can go in host attributes in the same way as the rest of the FC drivers do it. James --
Sep 24, 10:18 pm 2008
Jing Huang
[PATCH 5/6] bfa: Brocade BFA FC SCSI Driver submission
From: Jing Huang <huangj@brocade.com> This patch contains document of brocade specific sysfs interface. It is created using 2.6.27-rc7 kernel. Signed-off-by: Jing Huang <huangj@brocade.com> --- Documentation/ABI/testing/sysfs-devices-bfa | 600 ++++++++++++++++++++++++++++ 1 files changed, 600 insertions(+) diff -urpN orig/Documentation/ABI/testing/sysfs-devices-bfa patch/Documentation/ABI/testing/sysfs-devices-bfa --- ...
Sep 24, 9:09 pm 2008
Jing Huang
[PATCH 4/6] bfa: Brocade BFA FC SCSI Driver submission
From: Jing Huang <huangj@brocade.com> This patch contains Makefile and Kconfig file for scsi and bfa. It is created using 2.6.27-rc7 kernel. Signed-off-by: Jing Huang <huangj@brocade.com> --- drivers/scsi/Kconfig | 6 ++++++ drivers/scsi/Makefile | 1 + drivers/scsi/bfa/Kconfig | 6 ++++++ drivers/scsi/bfa/Makefile | 25 +++++++++++++++++++++++++ 4 files changed, 38 insertions(+) diff -urpN orig/drivers/scsi/bfa/Kconfig patch/drivers/scsi/bfa/Kconfig --- ...
Sep 24, 9:07 pm 2008
James Bottomley
RE: [PATCH 1/6] bfa: Brocade BFA FC SCSI Driver submission
Unfortunately, the remedy tends to be fairly specific to the email tool you're using. There's a helpful list of what to do here: Documentation/email-clients.txt but it's not exhaustive. James --
Sep 24, 10:41 pm 2008
Greg KH
Re: [PATCH 1/6] bfa: Brocade BFA FC SCSI Driver submission
Your patch is line-wrapped and can't be applied :( Also, did I miss a [0/6] patch that describes this driver and what it is all about? Care to try again? thanks, greg k-h --
Sep 24, 9:49 pm 2008
James Bottomley
Re: [PATCH 1/6] bfa: Brocade BFA FC SCSI Driver submission
Actually, it's more than just linewrapping. The patch to the Makefile looks to be actively mangled: +++ patch/drivers/scsi/bfa/Makefile 2008-09-24 12:08:24.000000000 -0700 @@ -0,0 +1,25 @@ +# +# Copyright (c) 2005-2008 Brocade Communications Systems, Inc. +# All rights reserved +# www.brocade.com +# +# Linux driver for Brocade Fibre Channel Host Bus Adapter. +# +# This program is free software; you can redistribute it and/or modify +it # under the terms of the GNU General Public ...
Sep 24, 10:14 pm 2008
Jing Huang
[PATCH 1/6] bfa: Brocade BFA FC SCSI Driver submission
From: Jing Huang <huangj@brocade.com> This patch contains code that interfaces to upper layer linux kernel, such as PCI, SCSI mid-layer and sysfs etc. It is created using 2.6.27-rc7 kernel. Signed-off-by: Jing Huang <huangj@brocade.com> --- drivers/scsi/bfa/bfad.c | 1339 ++++++++++++++++++ drivers/scsi/bfa/bfad_attr.c | 660 +++++++++ drivers/scsi/bfa/bfad_attr.h | 75 + drivers/scsi/bfa/bfad_drv.h | 340 ++++ drivers/scsi/bfa/bfad_fwimg.c | 86 + ...
Sep 24, 8:56 pm 2008
Jing Huang
RE: [PATCH 1/6] bfa: Brocade BFA FC SCSI Driver submission
Thanks James and Greg for the quick check on the patches. I check the original patch, it is correct. I need to check the outlook setting to see how to prevent this issue. This is my first time sending a patch, is there any trick to prevent line-wrap? I did send it using plain text and I tried to send to myself before sending to lkml, and I did't see this problem. Sorry about the trouble. Jing -----Original Message----- From: James Bottomley ...
Sep 24, 10:33 pm 2008
Oleg Nesterov
Re: wait_event_interruptible_timeout
Something like unsigned long now = jiffies; ret = wait_event_interruptible_timeout(..., timeout); if (ret < 0) { next_timeout = now + timeout - jiffies; if (next_timeout < 0) next_timeout = 0; } We can even add another wait_ helper which treats "timeout" as lvalue I'm afraid it would be a pain to modify this API. But please do not hesitate to make the patch if you think the current API can be improved. At worst, the patch will be nacked ;) Oleg. --
Sep 25, 9:45 am 2008
Ani Sinha
wait_event_interruptible_timeout
I have noticed an issue with wait_event_interruptible_timeout() API which I will try to explain below: wait_event_interruptible_timeout() is supposed to wait until one of the following happens: (a) Timeout occurs (with no signals or the event of interest happening), in which case it returns 0 (b) The process receives a signal and wakes up prematurely (i.e., before its timeout expired or the event of interest occurred). In this case, it returns –ERESTARTSYS. I will come back to this ...
Sep 24, 9:05 pm 2008
David Gibson
Fix PCI in Holly device tree
The PCI bridge on the Holly board is current incorrectly represented in the device tree. The current device tree node for the PCI bridge sits under the tsi-bridge node. That's not obviously wrong, but the PCI bridge translated some PCI spaces into CPU address ranges which were not translated by the tsi-bridge node. We used to get away with this problem because the PCI bridge discovery code was also buggy, assuming incorrectly that PCI host bridge nodes were always directly under the root bus ...
Sep 24, 7:39 pm 2008
Josh Boyer
Re: Fix PCI in Holly device tree
On Thu, 25 Sep 2008 12:39:04 +1000 Acked-by: Josh Boyer <jwboyer@linux.vnet.ibm.com> Paul, I can include this in my 'next' branch if you aren't opposed. I'll have another set of patches going in there today/tomorrow. josh --
Sep 25, 5:11 am 2008
Josh Boyer
Re: Fix PCI in Holly device tree
Er... on second thought, this actually fixes a regression on Holly. So I'll amend my offer to put it in my 'next' branch to be contingent on you not wanting to get it into 2.6.27 this late. josh --
Sep 25, 5:22 am 2008
David Miller
Re: [git patches] further net driver updates for .28
From: Jeff Garzik <jeff@garzik.org> Pulled and pushed back out to net-next-2.6, thanks Jeff! --
Sep 24, 7:57 pm 2008
Jeff Garzik
[git patches] further net driver updates for .28
Please pull from 'davem-next' branch of master.kernel.org:/pub/scm/linux/kernel/git/jgarzik/netdev-2.6.git davem-next to receive the following updates: arch/arm/mach-kirkwood/db88f6281-bp-setup.c | 2 +- arch/arm/mach-kirkwood/rd88f6192-nas-setup.c | 2 +- arch/arm/mach-kirkwood/rd88f6281-setup.c | 2 +- arch/arm/mach-loki/lb88rc8480-setup.c | 2 +- arch/arm/mach-mv78xx0/common.c | 6 +- arch/arm/mach-mv78xx0/db78x00-bp-setup.c | 8 +- ...
Sep 24, 7:27 pm 2008
Daniel Phillips
Tux3 gets a high speed atom smasher
Hi filesystem aficionados, The Large Hadron Collider no longer has a monopoly on smashing atoms, now that Tux3 has weighed in with a bit of its own atom smashing. Probably the coolest idea that has seen the light of day so far in Tux3 is the concept of xattr atoms: numeric tokens that stand for xattr names. In contrast to pretty much every other filesystem out there, Tux3 looks up an xattr in two stages: 1) Look up the attribute name in the atom table 2) Search for the atom in the ...
Sep 24, 7:22 pm 2008
Hiroshi Shimamoto
[PATCH 3/4] x86: signal: introduce helper macro is_ia32
From: Hiroshi Shimamoto <h-shimamoto@ct.jp.nec.com> Introduce new macro is_ia32 for unification of setup_rt_frame(). No affect in binary, compiler will optimize. Signed-off-by: Hiroshi Shimamoto <h-shimamoto@ct.jp.nec.com> --- arch/x86/kernel/signal_32.c | 13 +++++++++---- arch/x86/kernel/signal_64.c | 13 +++++++++---- 2 files changed, 18 insertions(+), 8 deletions(-) diff --git a/arch/x86/kernel/signal_32.c b/arch/x86/kernel/signal_32.c index b1bc90f..cf62f70 100644 --- ...
Sep 24, 7:13 pm 2008
Hiroshi Shimamoto
Re: [PATCH 1/4] x86: signal_32.c: introduce signr_convert()
Hi, Ingo. Thanks for that information and testing my patches! It's very important to test patches. thanks, Hiroshi Shimamoto --
Sep 25, 9:59 am 2008
Hiroshi Shimamoto
[PATCH 4/4] x86: signal_32.c: introduce macro ia32_setup ...
From: Hiroshi Shimamoto <h-shimamoto@ct.jp.nec.com> Make 32-bit setup_rt_frame() look like 64-bit version for unification. Signed-off-by: Hiroshi Shimamoto <h-shimamoto@ct.jp.nec.com> --- arch/x86/kernel/signal_32.c | 6 ++++-- 1 files changed, 4 insertions(+), 2 deletions(-) diff --git a/arch/x86/kernel/signal_32.c b/arch/x86/kernel/signal_32.c index cf62f70..4337cd5 100644 --- a/arch/x86/kernel/signal_32.c +++ b/arch/x86/kernel/signal_32.c @@ -492,6 +492,8 @@ static int ...
Sep 24, 7:13 pm 2008
Hiroshi Shimamoto
[PATCH 1/4] x86: signal_32.c: introduce signr_convert()
From: Hiroshi Shimamoto <h-shimamoto@ct.jp.nec.com> Introduce signr_convert(). This function will help unification of setup_rt_frame(). Signed-off-by: Hiroshi Shimamoto <h-shimamoto@ct.jp.nec.com> --- arch/x86/kernel/signal_32.c | 17 ++++++++++------- 1 files changed, 10 insertions(+), 7 deletions(-) diff --git a/arch/x86/kernel/signal_32.c b/arch/x86/kernel/signal_32.c index bb05917..b1bc90f 100644 --- a/arch/x86/kernel/signal_32.c +++ b/arch/x86/kernel/signal_32.c @@ -482,18 ...
Sep 24, 7:10 pm 2008
Ingo Molnar
Re: [PATCH 1/4] x86: signal_32.c: introduce signr_convert()
applied your patches to tip/x86/signal: 4694d23: x86: signal_32.c: introduce macro ia32_setup_frame and ia32_setup_rt_frame 455edbc: x86: signal: introduce helper macro is_ia32 b94fd69: x86: signal_64.c: introduce helper function signr_convert() 8d8c13b: x86: signal_32.c: introduce signr_convert() thanks! FYI, your other 13 x86 signal code unification commits and other x86-signals enhancements passed testing just fine so far. Ingo --
Sep 25, 1:14 am 2008
Hiroshi Shimamoto
[PATCH 2/4] x86: signal_64.c: introduce helper function ...
From: Hiroshi Shimamoto <h-shimamoto@ct.jp.nec.com> This helper function is for unification of setup_rt_frame(). No affect in binary. Signed-off-by: Hiroshi Shimamoto <h-shimamoto@ct.jp.nec.com> --- arch/x86/kernel/signal_64.c | 10 ++++++++-- 1 files changed, 8 insertions(+), 2 deletions(-) diff --git a/arch/x86/kernel/signal_64.c b/arch/x86/kernel/signal_64.c index 963236f..c5d8002 100644 --- a/arch/x86/kernel/signal_64.c +++ b/arch/x86/kernel/signal_64.c @@ -281,18 +281,24 @@ ...
Sep 24, 7:12 pm 2008
Yinghai Lu
[PATCH 1/7] acpi: don't load acpi_cpufreq if acpi=off
Signed-off-by: Yinghai Lu <yhlu.kernel@gmail.com> --- arch/x86/kernel/cpu/cpufreq/acpi-cpufreq.c | 3 +++ 1 files changed, 3 insertions(+), 0 deletions(-) diff --git a/arch/x86/kernel/cpu/cpufreq/acpi-cpufreq.c b/arch/x86/kernel/cpu/cpufreq/acpi-cpufreq.c index dd097b8..9943b4c 100644 --- a/arch/x86/kernel/cpu/cpufreq/acpi-cpufreq.c +++ b/arch/x86/kernel/cpu/cpufreq/acpi-cpufreq.c @@ -779,6 +779,9 @@ static int __init acpi_cpufreq_init(void) { int ret; + if ...
Sep 24, 7:04 pm 2008
Bjorn Helgaas
Re: [PATCH 2/7] acpi: remove have_arch_parse_srat in acpi.h
I don't understand the connection between this commit log and --
Sep 25, 8:01 am 2008
Ingo Molnar
Re: [PATCH 4/7] mm: print out meminit for memmap
rationale: improve debuggability of memory setup problems. Ingo --
Sep 25, 1:35 am 2008
Ingo Molnar
Re: [PATCH 5/7] x86: fix typo in irq_desc array
applied to tip/irq/sparseirq, thanks Yinghai! Ingo --
Sep 25, 1:36 am 2008
Yinghai Lu
[PATCH 4/7] mm: print out meminit for memmap
Signed-off-by: Yinghai Lu <yhlu.kernel@gmail.com> --- mm/page_alloc.c | 7 +++---- 1 files changed, 3 insertions(+), 4 deletions(-) diff --git a/mm/page_alloc.c b/mm/page_alloc.c index e293c58..db30af6 100644 --- a/mm/page_alloc.c +++ b/mm/page_alloc.c @@ -3425,8 +3425,8 @@ static void __paginginit free_area_init_core(struct pglist_data *pgdat, PAGE_ALIGN(size * sizeof(struct page)) >> PAGE_SHIFT; if (realsize >= memmap_pages) { realsize -= ...
Sep 24, 7:04 pm 2008
Ingo Molnar
Re: [PATCH 6/7] x86: irq no should not use hex in /proc/ ...
applied to tip/irq/sparseirq, thanks Yinghai! Ingo --
Sep 25, 1:39 am 2008
Yinghai Lu
[PATCH 6/7] x86: irq no should not use hex in /proc/interrupts
Signed-off-by: Yinghai Lu <yhlu.kernel@gmail.com> --- arch/x86/kernel/irq_32.c | 2 +- arch/x86/kernel/irq_64.c | 2 +- 2 files changed, 2 insertions(+), 2 deletions(-) diff --git a/arch/x86/kernel/irq_32.c b/arch/x86/kernel/irq_32.c index b2e1082..001772f 100644 --- a/arch/x86/kernel/irq_32.c +++ b/arch/x86/kernel/irq_32.c @@ -307,7 +307,7 @@ int show_interrupts(struct seq_file *p, void *v) action = desc->action; if (!action && !any_count) goto skip; - seq_printf(p, ...
Sep 24, 7:04 pm 2008
Jesse Barnes
Re: [PATCH 3/7] pci: using %pF in quirks.c
Applied to linux-next, thanks. -- Jesse Barnes, Intel Open Source Technology Center --
Sep 25, 7:52 am 2008
Bjorn Helgaas
Re: [PATCH 3/7] pci: using %pF in quirks.c
I posted a similar patch already: http://lkml.org/lkml/2008/8/22/178 Maybe mine was too complicated because it touched more files at once. Regardless, we need to remove the #ifdef DEBUG because it's no longer necessary. --
Sep 25, 7:56 am 2008
Jesse Barnes
Re: [PATCH 3/7] pci: using %pF in quirks.c
Yeah you sent it to Andrew and it touched several files; maybe I shouldn't have ignored it. :) Anyway I'll fix up the #ifdefs in my linux-next branch. Thanks, Jesse --
Sep 25, 9:20 am 2008
Yinghai Lu
[PATCH 5/7] x86: fix typo in irq_desc array
when SPARSE_IRQ is not used, should still use irq_desc->lock Signed-off-by: Yinghai Lu <yhlu.kernel@gmail.com> --- kernel/irq/handle.c | 2 +- 1 files changed, 1 insertions(+), 1 deletions(-) diff --git a/kernel/irq/handle.c b/kernel/irq/handle.c index 7f625fb..fb6bdb6 100644 --- a/kernel/irq/handle.c +++ b/kernel/irq/handle.c @@ -253,7 +253,7 @@ struct irq_desc irq_desc[NR_IRQS] __cacheline_aligned_in_smp = { .chip = &no_irq_chip, .handle_irq = handle_bad_irq, .depth = ...
Sep 24, 7:04 pm 2008
Yinghai Lu
Re: [PATCH 2/7] acpi: remove have_arch_parse_srat in acpi.h
there was some workaround for 32bit numa (srat processing), and we removed those workaround, and make 32bit more like 64bit. now HAVE_ARCH_PARSE_SRAT is not defined in any place. YH --
Sep 25, 10:10 am 2008
Yinghai Lu
[PATCH 2/7] acpi: remove have_arch_parse_srat in acpi.h
32bit don't need that anymore Signed-off-by: Yinghai Lu <yhlu.kernel@gmail.com> --- include/linux/acpi.h | 8 -------- 1 files changed, 0 insertions(+), 8 deletions(-) diff --git a/include/linux/acpi.h b/include/linux/acpi.h index 4d5ec43..8eefaa9 100644 --- a/include/linux/acpi.h +++ b/include/linux/acpi.h @@ -95,18 +95,10 @@ int acpi_parse_mcfg (struct acpi_table_header *header); void acpi_table_print_madt_entry (struct acpi_subtable_header *madt); /* the following four ...
Sep 24, 7:04 pm 2008
Yinghai Lu
[PATCH 7/7] x86: print out irq nr for msi/ht
Signed-off-by: Yinghai Lu <yhlu.kernel@gmail.com> --- arch/x86/kernel/hpet.c | 3 +++ arch/x86/kernel/io_apic.c | 7 +++++++ 2 files changed, 10 insertions(+), 0 deletions(-) diff --git a/arch/x86/kernel/hpet.c b/arch/x86/kernel/hpet.c index 422c577..686505a 100644 --- a/arch/x86/kernel/hpet.c +++ b/arch/x86/kernel/hpet.c @@ -467,6 +467,9 @@ static int hpet_setup_irq(struct hpet_dev *dev) irq_set_affinity(dev->irq, cpumask_of_cpu(dev->cpu)); enable_irq(dev->irq); ...
Sep 24, 7:04 pm 2008
Yinghai Lu
[PATCH 3/7] pci: using %pF in quirks.c
Signed-off-by: Yinghai Lu <yhlu.kernel@gmail.com> --- drivers/pci/quirks.c | 3 +-- 1 files changed, 1 insertions(+), 2 deletions(-) diff --git a/drivers/pci/quirks.c b/drivers/pci/quirks.c index 1f26712..addf2db 100644 --- a/drivers/pci/quirks.c +++ b/drivers/pci/quirks.c @@ -1689,8 +1689,7 @@ static void pci_do_fixups(struct pci_dev *dev, struct pci_fixup *f, struct pci_f if ((f->vendor == dev->vendor || f->vendor == (u16) PCI_ANY_ID) && (f->device == dev->device || ...
Sep 24, 7:04 pm 2008
Greg KH
Re: PATCH 10/4]linux-usb: To support more Huawei data ca ...
Ok, that sounds reasonable. How about resending the patch, and adding this information to the changelog area so that others also understand what is going on here. thanks, greg k-h --
Sep 24, 7:10 pm 2008
Franko Fang
Re: PATCH 10/4]linux-usb: To support more Huawei data ca ...
1. In this patch, we want to do one thing: add more Huawei product IDs into the USB driver. Then it can support more Huawei data card devices. So I make them in one patch. 2. To change usb_stor_huawei_e220_init function as follows, that's because in the USB standard, while sending SET_FETURE_D to the device, it requires the corresponding data to be zero, and its sending length also must be zero. In our old solution, it can be compatible with our WCDMA data card devices, but can not ...
Sep 24, 6:36 pm 2008
Jesse Barnes
Re: [patch] pci: document the pcie_aspm kernel parameter
Applied to linux-next, thanks Chuck. -- Jesse Barnes, Intel Open Source Technology Center --
Sep 25, 9:16 am 2008
Chuck Ebbert
[patch] pci: document the pcie_aspm kernel parameter
From: Chuck Ebbert <cebbert@redhat.com> pci: document the pcie_aspm kernel parameter Index: linux-2.6.26.noarch/Documentation/kernel-parameters.txt =================================================================== --- linux-2.6.26.noarch.orig/Documentation/kernel-parameters.txt +++ linux-2.6.26.noarch/Documentation/kernel-parameters.txt @@ -1636,6 +1636,12 @@ and is between 256 and 4096 characters. reserved for the CardBus bridge's memory window. The default value is 64 ...
Sep 24, 5:40 pm 2008
Chris Snook
Re: [PATCH] net: remove LLTX in atl2 driver
I'm all in favor of removing legacy cruft. I want to know a little more about it so we can remove it from the atl1 code as well. Ultimately most of this will be shared code. -- Chris --
Sep 25, 3:34 pm 2008
Chris Snook
Re: [PATCH] net: remove LLTX in atl2 driver
Can you explain a bit more concretely the problem this solves, and your testing? Ultimately we'll want to merge this code with the atl1 code, so we need to be confident that we can and should make the same change there. --
Sep 25, 1:30 pm 2008
Kevin Hao
[PATCH] net: remove LLTX in atl2 driver
When NETIF_F_LLTX is set, the atlx driver will use a private lock. But in recent kernels this implementation seems redundant and can cause problems where AF_PACKET sees things twice. Since NETIF_F_LLTX is marked as deprecated and shouldn't be used in new driver, this patch removes NETIF_F_LLTX and adds a mmiowb before sending packet. Signed-off-by: Kevin Hao <kexin.hao@windriver.com> --- drivers/net/atlx/atl2.c | 24 +----------------------- drivers/net/atlx/atl2.h | 1 - 2 files ...
Sep 24, 5:35 pm 2008
Jay Cliburn
Re: [PATCH] net: remove LLTX in atl2 driver
On Thu, 25 Sep 2008 18:34:29 -0400 Yes, LLTX can (and should) be removed from the atl1 code. I had an exchange with Herbert Xu and David Miller on LKML a few weeks ago, and they gave me a few tips on how to do it, but I haven't yet had the time. What takes an experienced netdev warrior about a half hour to implement unfortunately takes me days of after-dayjob time as I track through all the calls that that are affected and test the modified driver. Now back to atl2... I'd like to hear ...
Sep 25, 4:58 pm 2008
Jeff Garzik
Re: [PATCH] net: remove LLTX in atl2 driver
Not directly addressing your question, but LLTX is indeed deprecated and in general we want to move away from it. Jeff --
Sep 25, 1:55 pm 2008
Zhu Yi
Re: pull request: wireless-2.6 2008-09-24
Can you also add this one? It doesn't panic the kernel when a frame from firmware is invalid. http://marc.info/?l=linux-wireless&m=122219037706528&w=2 Thanks, -yi --
Sep 24, 10:47 pm 2008
Marcel Holtmann
Re: pull request: wireless-2.6 2008-09-24
I was hitting this one a frequent basis on a 64-bit Linux running on an off-the-shelf X61. Regards Marcel --
Sep 25, 12:09 pm 2008
Tomas Winkler
Re: pull request: wireless-2.6 2008-09-24
On Thu, Sep 25, 2008 at 2:41 PM, John W. Linville I'm just exporting the major bugs from our internal database to buzilla, 5000 is new so we are hitting the problems during testing before users report them. Now I don't know if it was good idea to push the driver upstream same time as it hits the market. Tomas --
Sep 25, 7:57 am 2008
Johannes Berg
Re: pull request: wireless-2.6 2008-09-24
Actually, I now think that we don't want the patch. It appears that when this situation happens, the hardware has scribbled over memory elsewhere (most likely because of wrong DMA programming) and in that case I suppose we'd rather fail than corrupt disk buffers et. al. johannes
Sep 25, 5:27 am 2008
John W. Linville
Re: pull request: wireless-2.6 2008-09-24
Is there an open bug report for that anywhere? Is this something that real users (no offense to Johannes) are likely to hit? John -- John W. Linville Linux should be at the core linville@tuxdriver.com of your literate lifestyle. --
Sep 25, 4:41 am 2008
Tomas Winkler
Re: pull request: wireless-2.6 2008-09-24
On Thu, Sep 25, 2008 at 6:47 PM, John W. Linville I think it was pushed on time it would not bring the fixes sooner. But we are getting again to the same arguments Tomas --
Sep 25, 8:52 am 2008
John W. Linville
Re: pull request: wireless-2.6 2008-09-24
You're right -- pushing it sooner rather than sitting on it internally probably would have helped to give it more visibility sooner. John -- John W. Linville Linux should be at the core linville@tuxdriver.com of your literate lifestyle. --
Sep 25, 8:47 am 2008
David Miller
Re: [git patches] net driver updates for .28
From: Jeff Garzik <jeff@garzik.org> Done. --
Sep 25, 1:07 pm 2008
Jeff Garzik
Re: [git patches] net driver updates for .28
David, please revert 2eefbd63d0c85daa1317636474c226e236beba81 by request of author, maintainer, and maintainer's maintainer :) Jeff --
Sep 25, 10:38 am 2008
Sebastien Dugue
Re: [git patches] net driver updates for .28
Hi Jeff, ^^^^^^ That last one was NACKed by Thomas Klein (http://lkml.org/lkml/2008/9/15/81) who asked for it to be reverted. Thanks, Sebastien. --
Sep 24, 11:55 pm 2008
Randy Dunlap
Re: [patch 00/04] RFC: Staging tree (drivers/staging)
I don't disagree with the CRAP name... fwiw. I think that we have enough quality problems without adding crap. -- ~Randy --
Sep 25, 2:04 pm 2008
Stefan Richter
Re: [patch 00/04] RFC: Staging tree (drivers/staging)
True. OTOH many if not most of the -staging drivers are ones which are already in use. Their users already deal with whatever quality problems these drivers have, in addition to having to fight with the installation hassles that are inherent to out-of-tree drivers. -- Stefan Richter -=====-==--- =--= ==--= http://arcgraph.de/sr/ --
Sep 25, 2:51 pm 2008
Randy Dunlap
Re: [patch 00/04] RFC: Staging tree (drivers/staging)
then it would be better if Greg/someone cleaned up the current tree's problems instead of introducing more CRAP under a different name. Oh well, his mind is already made up and I know how difficult it is to --- ~Randy --
Sep 25, 7:49 am 2008
Greg KH
Re: [patch 00/04] RFC: Staging tree (drivers/staging)
I disagree, see my response to Parag's comments :) thanks, greg k-h --
Sep 24, 7:06 pm 2008
Paul Mundt
Re: [patch 00/04] RFC: Staging tree (drivers/staging)
Uhm.. not quite. As the one that proposed the flag in the first place, perhaps it helps to cover the rationale (although Greg seems to have mostly covered that already). EXPERIMENTAL today is pretty damn meaningless. What it tends to mean in practice is that somethings needs some more testing, someone wants to be able to pull out the EXPERIMENTAL card when someone enables their option and their kernel blows up, the option/feature hasn't been around in the kernel for that long, or someone has ...
Sep 24, 10:27 pm 2008
Greg KH
Re: [patch 00/04] RFC: Staging tree (drivers/staging)
Heh, no problem, normally my problem is that I change my mind too often, The whole EXPERIMENTAL issue hasn't come up in years, I'm supprised that people even consider it a valid option these days. I'm all for fixing it up, but as Paul so well described, the code I'm talking about is WAY worse than a mere "experimental" marking, it needs to be explicitly pointed out that this is not even up to that level at all. And as was also pointed out, the EXPERIMENTAL marking cleanup is ...
Sep 25, 1:48 pm 2008
Parag Warudkar
Re: [patch 00/04] RFC: Staging tree (drivers/staging)
You don't tell me why you want to do something as absurd and incoherent as wanting users to load the modules easily and automatically, then printing a warning about it even when user may not have loaded it on his/her own, then taint the kernel when there is no need to - I have repeatedly said it is very easy to see from the module list for people to decide - not rocket science, definitely not for developers. The whole TAINT drama is unnecessary. Why would people run with even 24 more lines ...
Sep 25, 3:22 pm 2008
Parag Warudkar
Re: [patch 00/04] RFC: Staging tree (drivers/staging)
I don't think I got my point across - let me try again. My disagreement was with getting crap code in mainline, re-classifying it as something other than experimental when it is not, expecting users to use it, printing warning when it is used, tainting the kernel for even more harassment and then having developers not support it as a bonus - all cumulative. The naming part was also something that I did not like but that like I said was just my preference and does not matter because I ...
Sep 25, 2:40 pm 2008
Greg KH
Re: [patch 00/04] RFC: Staging tree (drivers/staging)
I'm sorry we disagree here. Also note that this goes against what a number of us decided at the kernel summit, we want this code into the So you now agree with the idea, just the implementation of the area A name doesn't really matter here, the modinfo tag is just as Um, no, I'm not going to force all userspace users to upgrade their version of module-init-tools, JUST to prevent them from automatically So you have to load the modules by "hand"? Ick, no, that too is ...
Sep 25, 3:04 pm 2008
Greg KH
Re: [patch 00/04] RFC: Staging tree (drivers/staging)
Sorry, no, the EXPERIMENTAL horse has left the barn. If you want to run after it and round it up and bring it back, that's great, but that is Again, no, the current EXPERMENTAL marking is pointless and a mess. If It provides a fast way into the kernel for companies who do not stick around to take the time to merge things in. That's what the -staging tree has been doing quite well for the past 6 months or so, with lots of drivers moving into mainline, and 15 drivers currently residing in ...
Sep 24, 9:21 pm 2008
Greg KH
Re: [patch 00/04] RFC: Staging tree (drivers/staging)
Heh, ok, so the basic premise of getting code that is not currently in mergable shape into the tree earlier to get wider usage and testing is something that you agree with? If so, we can arm-wrestle over what to call it, one name is as good as another, as long as we don't overload a currently used name like EXPERIMENTAL, which Paul has so well explained is a mess right now. And I don't have the inclination to clean up that mess right now, I'm more worried about bigger messes like these 15 ...
Sep 25, 1:53 pm 2008
Randy Dunlap
Re: [patch 00/04] RFC: Staging tree (drivers/staging)
Thanks. I agree with Parag's comments. Looks like overkill/duplication. --- ~Randy --
Sep 24, 6:03 pm 2008
Parag Warudkar
Re: [patch 00/04] RFC: Staging tree (drivers/staging)
The whole concept is a bike shed - disproportionate importance to labeling same things with different name. So it should come as no surprise that we are discussing the need for Again - sure move the staging directory to the mainline tree, group all those drivers under CONFIG_HALFBAKED and default the whole category to N. Name those driver modules with a _stg prefix and be done with it. No need for the insanely useless module loading crap because - 1) You will happily load from that directory ...
Sep 25, 4:02 am 2008
Randy Dunlap
Re: [patch 00/04] RFC: Staging tree (drivers/staging)
I shouldn't have said that last sentence. I apologize to Greg. ISTM that the real problems are (a) it's easier to introduce new staging/crap than it is to fix EXPERIMENTAL and (b) no one wants to try to fix EXPERIMENTAL. --- ~Randy --
Sep 25, 10:53 am 2008
Parag Warudkar
Re: [patch 00/04] RFC: Staging tree (drivers/staging)
How? TAINT_EXPERIMENTAL (I'll stick to that, thanks :) and CONFIG_EXPERIMENTAL are no different - neither to users nor to developers. Here is why - Both try to do the same thing - let people use the drivers on their own risk (as if the stable ones are developer's risk - but let's keep it aside for the moment) and give developers a chance to keep the code in sync with mainline and improve it per user problem reports or generally make it better. You might try to differentiate between them on ...
Sep 24, 7:59 pm 2008
Greg KH
Re: [patch 00/04] RFC: Staging tree (drivers/staging)
No, this is much different from EXPERIMENTAL. That flag is pretty much useless right now. This is for a temporary landing place for drivers that are not good enough to be merged, yet are useful enough for some people to use. Examples of this is the USB driver I posted, some network drivers that are in -staging that only work on x86 boxes, and drivers that have userspace interfaces that are "wrong" and need to be changed (reading files from /etc is one example.) All of those types of ...
Sep 24, 7:06 pm 2008
Timur Tabi
Re: [PATCH v2] fsl-dma: allow Freescale Elo DMA driver t ...
Re-read my explanation, please. Technically, it doesn't *matter* in that nothing will break, but so what? It's nicer if the DMA driver is already available when the client drivers load, so that they can use the DMA facilities right away. -- Timur Tabi Linux kernel developer at Freescale --
Sep 25, 12:09 pm 2008
Timur Tabi
Re: [PATCH v2] fsl-dma: allow Freescale Elo DMA driver t ...
This was intentional. When compiled as a module, subsys_initcall becomes module_init(). When compiled in-kernel, this code is initialized before most Thanks. -- Timur Tabi Linux kernel developer at Freescale --
Sep 25, 6:54 am 2008
Scott Wood
Re: [PATCH v2] fsl-dma: allow Freescale Elo DMA driver t ...
If there's a dependency there, how will it work when this is built as a module? -Scott --
Sep 25, 11:40 am 2008
Scott Wood
Re: [PATCH v2] fsl-dma: allow Freescale Elo DMA driver t ...
It's not nicer to people reading the code and wondering why, or to people who use it as a module and execute less-well-tested code paths, and I doubt it's a significant addition to boot time to do things in the normal way. I'm not particularly worried about the code going on strike because we're not being "nice" to it. -Scott --
Sep 25, 1:16 pm 2008
Timur Tabi
Re: [PATCH v2] fsl-dma: allow Freescale Elo DMA driver t ...
There are no dependencies. fsldma registers with the DMA engine, which is always built in-kernel. The DMA engine is what handles linking DMA clients to DMA drivers. The DMA clients get a callback whenever a DMA driver registers with the DMA engine. If the DMA driver is already registered when the client registers, then the client will get a callback immediately after it registers. I chose subsys_initcall() to increase the probability that fsldma is already present when DMA clients are ...
Sep 25, 11:47 am 2008
Li Yang
Re: [PATCH v2] fsl-dma: allow Freescale Elo DMA driver t ...
Acked-by: Li Yang <leoli@freescale.com> - Leo --
Sep 24, 11:54 pm 2008
Scott Wood
Re: [PATCH v2] fsl-dma: allow Freescale Elo DMA driver t ...
If there's no dependency, why does it matter whether fsldma is already present? -Scott --
Sep 25, 12:00 pm 2008
Tino Keitel
Re: Differend udev names with different kernels
It has the same name with 2.6.26.3 and with 2.6.27-rc7-something with fastboot merged. So why would this be a regression? The different name (scsi-...) happended with a vanilla 2.6.27-rc7. Initially, my intention was to find a unique name for the hard disk, that stays even if the disk is changed from Firewire to USB. However, this doesn't seem to be possible with a default udev. Regards, Tino --
Sep 24, 10:47 pm 2008
Kay Sievers
Re: Differend udev names with different kernels
Yeah, I have more reports from people, that with recent kernels, the ieee1394-* links showed up and replaced the scsi-* links, which they say never happended before. We ship these rules for a long time now, so I just guessed, they never really worked before, but as all depends on timing, it's probably not predictable what happens. Regardless of the needed fix for scsi sysfs, we should provide both links, I guess. Thanks, Kay --
Sep 24, 11:09 pm 2008
Kay Sievers
Re: Differend udev names with different kernels
Seems, for some reason, the "ieee1394_id" attribute becomes readable I guess, it's just a not-easy-to-reproduce timing issue with the sysfs Yeah, sure they should not change like this. We could just drop the ieee1394-* link creation entirely, if they never worked as expected. Or we can provide it as an additional link instead of making it skip the scsi-* link creation. Tino, care to try if something like the attached works for your setup and creates at least the scsi-links, and, if ...
Sep 24, 8:12 pm 2008
Tino Keitel
Re: Differend udev names with different kernels
On Thu, Sep 25, 2008 at 23:13:19 +0200, Stefan Richter wrote: Not really. My initial problem was that a volume sometimes came up very late, so that the lvm2 initscript didn't activate that volume group. So I wanted to wait for this specific disk during boot. Meanwhile, I created an udev rule that uses the vendor and device name, which also has the advantage of being the same when the disk is connected via Firewire and USB (or, should be the same at least, I didn't test ...
Sep 25, 2:23 pm 2008
Stefan Richter
Re: Differend udev names with different kernels
Tino Keitel wrote: Tino, when you reply on LKML, please keep _all_ responders in a thread in the Cc list. I for one am not subscribed to linux-hotplug, and I may easily miss a response on LKML with its hundreds of messages per day if I misunderstood. (Comprehension difficulties at 0700 AM.) It is of course intended that FireWire disks get the /dev/disk/by-id/ieee1394-* link. Only this one is reliably unique. The /dev/disk/scsi-* link, which I presume is generated from SCSI INQUIRY ...
Sep 25, 2:13 pm 2008
Stefan Richter
Re: Differend udev names with different kernels
Then this is a regression of the fastboot patch or whatever. They currently work. I never heard of them not working. Even if there were already problems now, then the necessary course of action would be to fix them instead of dropping them. Or fix the fine SCSI stack already to properly export target identifiers and LU identifiers. (Not going to happen because the SCSI folks just The scsi-* link may not be unique because it may be based on unsuitable I will look into it. -- ...
Sep 24, 10:08 pm 2008
Stefan Richter Sep 24, 10:59 pm 2008
Dave Chinner
Re: 2.6.27-rc7 no init found on the root partition?
Hmmm - the filesystem mounted without errors, but init was not Same again, it's unable to run the init process. Of course, kernel_execve() throws away any error that might have occurred, so without hacking the init code we can't find out Ok, is the 2.6.27 kernel based on the same config as the 2.6.26 Does it boot if you take away the "ro" option? Cheers, -- Dave Chinner david@fromorbit.com --
Sep 25, 4:41 pm 2008
Eric Sandeen
Re: 2.6.27-rc7 no init found on the root partition?
and try mounting root by label? (or UUID ...) -Eric --
Sep 25, 6:48 am 2008
Lukas Hejtmanek
Re: 2.6.27-rc7 no init found on the root partition?
http://undomiel.ics.muni.cz/tmp/IMG_2962.JPG this one is if I pass init=/bin/bash indeed. this is a grub record: title Linux 2.6.26.3 root (hd0,0) kernel /boot/vmlinuz-2.6.26.3 root=/dev/sda1 ro console=tty0 title Linux 2.6.27-rc7 root (hd0,0) kernel /boot/vmlinuz-2.6.27-rc7 root=/dev/sda1 ro -- Lukáš Hejtmánek --
Sep 25, 4:12 am 2008
Lukas Hejtmanek
Re: 2.6.27-rc7 no init found on the root partition?
no, using printk's I found that search_binary_handler is the one that fails. Dunno why. -- Lukáš Hejtmánek --
Sep 25, 9:30 am 2008
H. Peter Anvin
Re: [PATCH v2] fbdev: ignore VESA modes if framebuffer d ...
I'm *REALLY* skeptical to burying this in screen_info like this, it would probably be better to put it in Kconfig, or it is unlikely to get updated properly. -hpa --
Sep 25, 2:12 am 2008
H. Peter Anvin
Re: v2.6.27-rc7: x86: #GP on panic?
Yes, but there shouldn't be any external interrupts that could turn into a divide error. It really smells like a Qemu problem -- possibly even a Qemu miscompile -- to me. Does it reproduce in KVM? -hpa --
Sep 25, 1:49 pm 2008
Ingo Molnar
Re: v2.6.27-rc7: x86: #GP on panic?
hm, 0x5a is a simple pop %rdx. A #GP there means the stack segment is bust? so it's preceded by a popfq and on the next instruction we #GP. but the stack and flags state looks good: [ 4.523641] RSP: 0018:ffff880007867d70 EFLAGS: 00000286 weird. Ingo --
Sep 25, 1:04 am 2008
Vegard Nossum
Re: v2.6.27-rc7: x86: #GP on panic?
I have no computer that can do KVM, sorry :-( Stack trace contains IO_APIC functions, so it seems that maybe the emulated IOAPIC is trying to (erroneously) deliver an int 0 (for some reason)? But I don't know, that's just speculation which can be done better by others, so I will stop now :-) Vegard -- "The animistic metaphor of the bug that maliciously sneaked in while the programmer was not looking is intellectually dishonest as it disguises that the error is the programmer's own ...
Sep 25, 2:02 pm 2008
Vegard Nossum
Re: v2.6.27-rc7: x86: #GP on panic?
I'm sorry for the false alarm. I discovered that it did not happen on a clean kernel. My kernel was using this patch. diff --git a/arch/x86/kernel/cpu/common_64.c b/arch/x86/kernel/cpu/common_64.c index a11f5d4..abf5bc8 100644 --- a/arch/x86/kernel/cpu/common_64.c +++ b/arch/x86/kernel/cpu/common_64.c @@ -261,6 +261,8 @@ void __init early_cpu_init(void) cpu_devs[cvdev->vendor] = cvdev->cpu_dev; early_cpu_support_print(); ...
Sep 25, 7:07 am 2008
Vegard Nossum
Re: v2.6.27-rc7: x86: #GP on panic?
Keeping it going also found this bootup failure: [ 0.321423] Freeing SMP alternatives: 39k freed [ 0.323950] ACPI: Core revision 20080609 [ 0.360390] divide error: 0000 [1] SMP [ 0.360944] CPU 0 [ 0.360944] Modules linked in: [ 0.360944] Pid: 1, comm: swapper Tainted: G W 2.6.27-rc7 #9 [ 0.360944] RIP: 0010:[<ffffffff81039193>] [<ffffffff81039193>] __do_softirq+0x49/0xc5 [ 0.360944] RSP: 0018:ffffffff81792f00 EFLAGS: 00000206 [ 0.360944] RAX: ...
Sep 25, 1:46 pm 2008
Vegard Nossum
Re: v2.6.27-rc7: x86: #GP on panic?
No, I was wrong! It *does* happen for vanilla as well, but it doesn't happen reliably. [ 4.043370] Kernel panic - not syncing: VFS: Unable to mount root fs on unknown-block(2,0) [ 4.048765] general protection fault: fff2 [1] SMP [ 4.048765] CPU 0 [ 4.048765] Modules linked in: [ 4.048765] Pid: 1, comm: swapper Tainted: G W 2.6.27-rc7 #8 [ 4.048765] RIP: 0010:[<ffffffff81019d27>] [<ffffffff81019d27>] native_smp_send_stop+0x29/0x2d [ 4.048765] RSP: ...
Sep 25, 8:20 am 2008
H. Peter Anvin
Re: v2.6.27-rc7: x86: #GP on panic?
No, that would be #SS (and segments don't really exist in 64-bit mode anyway.) In 32-bit mode it could mean a code segment overrun. *However*... [ 4.523477] general protection fault: fff2 [1] SMP There is an error code attached to the #GP, which is supposed to mean that somehow a segment selector was involved. This doesn't look like a My guess is that the popfq enables interrupts, and we try to take an interrupt through an IDT entry which isn't set up correctly. -hpa --
Sep 25, 1:53 am 2008
H. Peter Anvin
Re: v2.6.27-rc7: x86: #GP on panic?
I suspect it's a problem in Qemu's IOAPIC model, but it's hard to know for sure. -hpa --
Sep 25, 2:53 pm 2008
Bill Nottingham
Re: [PATCH] x86_64: be less annoying on boot
Does the 'kernel alive' right before it serve any purpose either? Bill --
Sep 25, 9:13 am 2008
Ingo Molnar
Re: [PATCH] x86_64: be less annoying on boot
LOL :) Applied the commit below to tip/x86/debug, thanks Bill! Ingo --------------------> From f6476774f1fe32593d3d71903b1e98514efbf685 Mon Sep 17 00:00:00 2001 From: Bill Nottingham <notting@redhat.com> Date: Wed, 24 Sep 2008 14:35:17 -0400 Subject: [PATCH] x86_64: be less annoying on boot Remove mostly useless message on every boot. Signed-off-by: Bill Nottingham <notting@redhat.com> Signed-off-by: Ingo Molnar <mingo@elte.hu> --- arch/x86/kernel/head64.c | 2 -- 1 files ...
Sep 25, 2:15 am 2008
Corey Minyard
Re: [PATCH 1/1] Use RCU for the UDP hash lock
It also needs sk_next_rcu(). sk_unhashed() is only used on the update side, so an rcu version is not needed there. Thanks for pointing this out. -corey --
Sep 25, 12:14 pm 2008
Jarek Poplawski
Re: [PATCH 1/1] Use RCU for the UDP hash lock
On 24-09-2008 19:28, Corey Minyard wrote: ... Aren't other functions like sk_next() or sk_unhashed() used on the read side and need _rcu versions? Jarek P. --
Sep 25, 1:45 am 2008
Corey Minyard
Re: [PATCH 1/1] Use RCU for the UDP hash lock
Ok, will do. I read more on this, and I think I understand the issues Do you mean synchronize_rcu(), too? It seems to be used in the net code. To avoid that I'd need to add a struct rcu_head to struct sock. Would that be preferable? Thanks, -corey --
Sep 25, 12:21 pm 2008
Paul E. McKenney
Re: [PATCH 1/1] Use RCU for the UDP hash lock
You do indeed need to match the update-side and read-side primitives: Update-side Read-side synchronize_rcu() rcu_read_lock() call_rcu() rcu_read_unlock() call_rcu_bh() rcu_read_lock_bh() rcu_read_unlock_bh() synchronize_sched() preempt_disable() preempt_enable() [and anything else that disables either preemption or irqs] synchronize_srcu() srcu_read_lock() srcu_read_unlock() Mixing RCU or RCU-SCHED with RCU-BH will ...
Sep 25, 8:29 am 2008
Stephen Hemminger
Re: [PATCH 1/1] Use RCU for the UDP hash lock
On Thu, 25 Sep 2008 14:21:55 -0500 synchhonize_rcu or call_rcu_bh is fine. But I worry that if the other stricter types are used, then we would have to audit all the other RCU usage in networking to make sure nesting was correct. --
Sep 25, 1:34 pm 2008
Stephen Hemminger
Re: [PATCH 1/1] Use RCU for the UDP hash lock
On Thu, 25 Sep 2008 08:29:36 -0700 Also, for consistency with other parts of networking code, don't introduce the synchronize_sched() or synchronize_srcu() pattern to network protocols unless there is a no other way to achieve the desired result. --
Sep 25, 8:34 am 2008
Gautham R Shenoy
Re: [RFC PATCH v1 3/5] Collect statistics required for p ...
Are there groups other than local and non-local ? If not, this comment can be condensed so that it can be applied -- Thanks and Regards gautham --
Sep 25, 5:24 am 2008
Sitsofe Wheeler
Re: [kernel-abi] symbol versioning vs. out-of-tree modules.
> - http://sourceforge.net/projects/bluetooth-alsa Allegedly deprecated in favour of a userspace ALSA plugin: http://www.mail-archive.com/debian-devel@lists.debian.org/msg253253.html > - http://www.unav-micro.com/Drivers.aspx -> AR81Family-linux-v1.0.1.0.tar.gz > (e.g. asus-p5q-pro m/b has such ethernet port, pciid=1969:1026) Isn't this the atl1e gigabit ethernet driver? Isn't this being merged into 2.6.27? - ...
Sep 25, 1:26 am 2008
Greg KH
Re: [kernel-abi] symbol versioning vs. out-of-tree modules.
"we" are working hard to remove any and all barriers for modules to be out-of-tree. See the recent drivers/staging/ announcement, I can now take pretty much any code that at least builds and works for someone, and get it into the kernel tree where it can be properly cleaned up and cared for over time before it moves to the "correct" location within the kernel. What kernel modules do you currently use/rely on that are not in the main kernel tree? Do you have links to them? I'd be glad to ...
Sep 24, 7:48 pm 2008
Paweł Sep 25, 6:48 am 2008
Greg KH
Re: [kernel-abi] symbol versioning vs. out-of-tree modules.
This looks almost sane, I'll suck it in and contact the author. It does look like they are suggesting that a core network change is also needed, This is a bit odder, but might just be needed for one brand of laptop. I'll ask to find out, at the least it needs to be changed to not use /proc/ anymore, which is easily done. thanks for the pointers, if anyone has any others, please let me know. greg k-h --
Sep 25, 6:54 am 2008
Paweł Sikora
Re: [kernel-abi] symbol versioning vs. out-of-tree modules.
here's a quick list: - http://www.lirc.org - http://sourceforge.net/projects/bluetooth-alsa - http://rt2x00.serialmonkey.com - http://www.unav-micro.com/Drivers.aspx -> AR81Family-linux-v1.0.1.0.tar.gz (e.g. asus-p5q-pro m/b has such ethernet port, pciid=1969:1026) - http://aufs.sourceforge.net - http://linux-uvc.berlios.de - http://www.misdn.org - toshiba bluetooth module [http://0bits.com/toshbt/toshbt-1.0.tar.gz] - zaptel telephony driver [http://www.asterisk.org] with firmware ...
Sep 24, 11:29 pm 2008
Greg KH
Re: [kernel-abi] symbol versioning vs. out-of-tree modules.
I think Sitsofe has answered all of these links as to if they are already merged, working on being merged, or refuse to be merged (like zaptel.) Are there any other out-of-tree modules that I need to be aware of to help integrate? If not, then there really isn't an issue here, right? :) thanks, greg k-h --
Sep 25, 5:54 am 2008
Sitsofe Wheeler
Re: [kernel-abi] symbol versioning vs. out-of-tree modules.
^^^^ People are taking steps to adapt this to be included in kernel. See http://lwn.net/Articles/297770/ . I could have sworn I saw someone posting an input layer patch for lirc the other day too... Aha - Isn't this one already in the wireless tree? I believe it was merged in 2.6.24 - http://kernelnewbies.org/Linux_2_6_24#head-62e9ebf067c978bbf70898986c0aa3904d1a3543 I've read rumblings about one of the stacking FSes being merged but thus USB video has already been merged into ...
Sep 25, 12:49 am 2008
Ingo Molnar
Re: [PATCH] x86 gart: remove unnecessary initialization
applied to tip/x86/iommu, thanks! Ingo --
Sep 25, 2:05 am 2008
Yan Li
Re: [PATCH 2/2] Suppress false "mtrr all empty" warning ...
Sorry, do you mean we also need a warning for this in VMware? -- Li, Yan --
Sep 24, 7:52 pm 2008
Yan Li Sep 24, 7:51 pm 2008
Yan Li
Re: [PATCH 2/2] Suppress false "mtrr all empty" warning ...
I checked your patch 1c6e5503 that moved dmi_scan_machine() earlier, but it's still behind mtrr_trim_uncached_memory(), so we still can't use DMI to check for VMware there. -- Li, Yan --
Sep 25, 7:18 am 2008
Yinghai Lu
Re: [PATCH 2/2] Suppress false "mtrr all empty" warning ...
you must be kidding! ... dmi_scan_machine(); dmi_check_system(bad_bios_dmi_table); #ifdef CONFIG_X86_32 probe_roms(); #endif /* after parse_early_param, so could debug it */ insert_resource(&iomem_resource, &code_resource); insert_resource(&iomem_resource, &data_resource); insert_resource(&iomem_resource, &bss_resource); if (efi_enabled) efi_init(); #ifdef CONFIG_X86_32 if ...
Sep 25, 10:13 am 2008
Ingo Molnar
Re: [PATCH 0/3] x86: restore old GART alloc_coherent
applied your fixes to tip/x86/iommu: 1615965: x86 gart: remove unnecessary initialization 1d99088: x86: restore old GART alloc_coherent behavior ecef533: revert "x86: make GART to respect device's dma_mask about virtual mappings" 9f6ac57: x86: export pci-nommu's alloc_coherent thanks Fujita! Ingo --
Sep 25, 2:04 am 2008
Hirokazu Takahashi
Re: [dm-devel] Re: [PATCH 4/8] bio-cgroup: Split the cgr ...
I have a plan on getting rid of the blist after your work is done, whose design will depend on that all page_cgroups are preallocated. I also think the size of bio_cgroup can be reduced if making bio_cgroup contain a bio-cgroup ID instead of the pointer. --
Sep 25, 1:46 am 2008
KAMEZAWA Hiroyuki
Re: [PATCH 4/8] bio-cgroup: Split the cgroup memory subs ...
On Wed, 24 Sep 2008 19:13:09 +0900 (JST) I posted new ones. - http://lkml.org/lkml/2008/9/25/33 (changes in page_cgroup) http://lkml.org/lkml/2008/9/25/56 (I'm not sure this gets Ack or Nack. but direction will not change.) Then, please tell me if you have new troubles with new ones. Or if you have requests. Major changes are - page_cgroup.h is added. - lookup_page_cgroup(struct page*), lock_page_cgroup() etc.. is exported. - All page_cgroup are allocated at boot. - you can use ...
Sep 25, 12:14 am 2008
Joakim Tjernlund
Re: UIO device name
Thanks for your kind words. It is true that I am not an expert in this area, just trying to understand and improve. Obviously I still lack some knowledge but now I am done with uio's interface naming, I will just scan sysfs for it. Jocke --
Sep 25, 5:36 am 2008
Ben Nizette
Re: UIO device name
UIO drivers certainly aren't first class citizens like kernel mode UIO is an interface type, not a bus type. UIO isn't a subsystem as such, it's a user interface. If the interface is consistent (even if the backing device is different) I don't see the problem with consistent naming. Anyway, I don't really see the point arguing here - the interface is what it is, it does everything it needs to to allow you to identify the device nodes. The kernel boys have spent a lot of effort over ...
Sep 25, 3:48 am 2008
Joakim Tjernlund
Re: UIO device name
My problem is this, uio is a generic container for any user space device and by itself it doesn't mean much. You put some protocol driver on top of uio, such as uio_smx, to make it mean something. Comparing uio with hdX is wrong as hdX means something, it is a block device for a disk. A better comparison would be if all kernel devices were named kio%d and you had to scan /sys to find the name hdX. Look at the spi subsystem, the protocol drivers name them self. Jocke --
Sep 25, 3:05 am 2008
Paul Mundt
Re: UIO device name
This thread is still going? Amazing. Anyways, your protocol driver argument doesn't make any sense. Take the case of uio_pdrv or the genirq variant. This is the name it hands off to the core, while the devices that register underneath it all have their own names set. Go grep for all instances of uio_pdrv platform data in the architecture code. I'm getting the impression you haven't actually even bothered to look at the name entries. Breaking the uio%d stuff is unacceptable, and if you need ...
Sep 25, 4:53 am 2008
Joakim Tjernlund
Re: UIO device name
Do you see a problem with letting the protocol driver choose another The changes needed to make it possible to select a different name are really minimal(include below for 2.6.25). So I ask again, why not let From: Joakim Tjernlund <Joakim.Tjernlund@transmode.se> Date: Thu, 25 Sep 2008 13:22:53 +0200 Subject: [PATCH] [UIO] Let UIO protocol drivers name them self. Currently all UIO based drivers will be named "uio%d" in user space. Make it possible for protocol drivers to use a different ...
Sep 25, 4:41 am 2008
Serge E. Hallyn
Re: [TOMOYO #9 (2.6.27-rc7-mm1) 1/6] LSM adapter functions.
So IMO there is some major badness here in the form of copying all of those functions out of fs/namei.c. I think we need to discuss case-by-case whether using the functions is appropriate (and hence they should be made non-static in fs/namei.c), or whether the intended likewise... (except in the case of fifo/sock, devcgroup should not be consulted as I'm not sure it'll handle that properly - have you tested -serge --
Sep 25, 9:59 am 2008
Tino Keitel
Re: Regression in 2.6.27-rc7: Wake On LAN with sky2 broken
On Wed, Sep 24, 2008 at 23:10:22 +0200, Rafael J. Wysocki wrote: It didn't hang. The dmesg output from the suspend attempt is here: dmesg output after boot: http://tikei.de/dmesg Regards, Tino --
Sep 25, 12:42 pm 2008
Tino Keitel
Re: Regression in 2.6.27-rc7: Wake On LAN with sky2 broken
Hi, I just found a method how I can enable WOL again: echo enabled > /sys/devices/pci0000:00/0000:00:1c.0/0000:01:00.0/power/wakeup This device is the NIC device, of cause. I already had another case (EHCI) where I needed to do this to be able to wake the computer up with the USB keyboard. It seems like 2.6.27 behaves different here compared to 2.6.26, and things that worked with 2.6.26 need manual fixing to work again. Or did I make something wrong? Regards, Tino --
Sep 25, 1:03 pm 2008
Rafael J. Wysocki
Re: Regression in 2.6.27-rc7: Wake On LAN with sky2 broken
I guess the box didn't hang during suspend with this setting? No, you didn't. The behavior was changed by the PCI wake-up patches that fixed a couple of problems and I'd like to fix all of the cases when the new behavior causes issues like this to happen. I'll send you a patch for the EHCI case later today. Thanks, Rafael --- sky2: Fix WOL regression Since dev->power.should_wakeup bit is used by the PCI core to decide whether the device should wake up the system from ...
Sep 25, 1:38 pm 2008
Mathieu Desnoyers
Re: [RFC PATCH 1/3] Unified trace buffer
I actually do use a lockless algorithm in LTTng and don't have to disable interrupts around tracing. This is how I get the kind of performance the Google folks expect. I would recommend to stay with interrupt disable + per-cpu spinlock (slow and heavy locking) for v1, but to keep in mind that we might want to go for a more lightweight LTTng does it, no disaster happened in the past 2-3 years. :) I guess we could manage to deal with NMI tracing specfically using the NMI tracing is a ...
Sep 25, 10:42 am 2008
Linus Torvalds
Re: [RFC PATCH 1/3] Unified trace buffer
Please don't normalize to ns. It's really quite hard, and it's rather _expensive_ on many CPU's. It involves a non-constant 64-bit divide, after all. I bet it can be optimized to be a multiply-by-inverse instead, but it would be a 128-bit (or maybe just 96-bit?) multiply, and the code would be nasty, and likely rather more expensive than the TSC reading itself. Sure, you have to normalize at _some_ point, and normalizing early might make some things simpler, but the main thing that ...
Sep 25, 8:05 am 2008
Steven Rostedt
Re: [RFC PATCH 1/3] Unified trace buffer
Slight correction. You can annotate the function with "notrace" and that function will not be traced. So the "only be disabled on a per-file Currently my code calls "ring_buffer_time_stamp" to get the time stamp, whatever it will be. Currently it is using sched_clock, but since I have it as a wrapper, it shouldn't be too hard to modify later. I'll also add a "ring_buffer_time_stamp_normalize(ts)" function to be called on reading of the trace. This will normalize whatever time stamp ...
Sep 25, 9:53 am 2008
Mathieu Desnoyers
Re: [RFC PATCH 1/3] Unified trace buffer
Hi Ingo, The problem with sched_clock is that it gives a 1 HZ timestamp accuracy for events happening across different CPUs. Within this 1 HZ range, it uses the TSC and clip when it reaches a max. Good enough for scheduler or for tracing events on a single CPU, but I think it is not exactly what we need to reorder events happening across CPUs. Mathieu -- Mathieu Desnoyers OpenPGP key fingerprint: 8CD5 52C3 8E3C 4140 715F BA06 3F25 A8FE 3BAE 9A68 --
Sep 25, 9:23 am 2008
Ingo Molnar
Re: [RFC PATCH 1/3] Unified trace buffer
hm, i'd really hope hw makers see the light and actually make the hw do it all. Signs are that they are cozying up to these ideas. Good and fast timestamps are important, and it is _infinitely_ more easy to do it in hw than in sw. Firstly they need a low-frequency (10khz-100khz) shared clock line across all CPUs. A single line - and since it's low frequency it could be overlaid on some existing data line and filtered out. That works across NUMA nodes as well and physics allows it to ...
Sep 25, 3:25 pm 2008
Steven Rostedt
Re: [RFC PATCH 1/3] Unified trace buffer
You'll also find in the lib Makefile: ifdef CONFIG_FTRACE ORIG_CFLAGS := $(KBUILD_CFLAGS) KBUILD_CFLAGS = $(subst -pg,,$(ORIG_CFLAGS)) endif Which removes all -pg flags from all in the lib directory. The reason is that a lot of archs (well, I know PPC for sure) use these functions on early boot up, where simply calling mcount will produce a page fault. -- Steve --
Sep 25, 2:01 pm 2008
Ingo Molnar
Re: [RFC PATCH 1/3] Unified trace buffer
i do not understand this argument of yours. (really) 1) is your point that we might lock up? 2) or perhaps that the timestamps update only once every jiffy, and are in essence useless because they show the same value again and again? the latter is true, and that's why we were pushed hard in the past by tracer users towards using GTOD timestamps. Everyone's favorite suggestion was: "why dont you use gettimeofday internally in the tracer???". We resisted that because GTOD ...
Sep 25, 3:14 pm 2008
Ingo Molnar
Re: [RFC PATCH 1/3] Unified trace buffer
yes. And there are radio telescope arrays that are synced up to do delta interferometry, over thousands of kilometers. Syncing up time over a few dozen meters is no challenge - and the reason for that ease is that physical time is neatly and uniformly broadcasted by nature in a pretty dependable way, at around 300 thousand kilometers per second. the challenge is to make it cheap enough for commodity hw. I.e. no extra CPU pins or lines in critical parts of the board, no extra power, low ...
Sep 25, 4:25 pm 2008
Ingo Molnar
Re: [RFC PATCH 1/3] Unified trace buffer
here is the observations from my second blind attempt - this time to show that we can trace even lockdep internals just fine. I applied the patch attached below: it removes all the -pg removals from kernel/Makefile and restores the notrace markings in kernel/lockdep.c that commit 1d09daa5 ("ftrace: use Makefile to remove tracing from lockdep") removed. then i enabled the whole lockdep machinery (to insert as much extra code as possible): CONFIG_DEBUG_SPINLOCK=y ...
Sep 25, 2:56 pm 2008
Ingo Molnar
Re: [RFC PATCH 1/3] Unified trace buffer
heh, and i even have a link for a latency tracing patch for 2005 that is still alive that proves it: http://people.redhat.com/mingo/latency-tracing-patches/patches/latency-tracing.patch (dont look at the quality of that code too much) It has this line for timestamp generation: + timestamp = get_cycles(); i.e. we used the raw TSC, we used RDTSC straight away, and we used that for _years_, literally. So i can tell you my direct experience with it: i had far more problems ...
Sep 25, 1:12 pm 2008
Mathieu Desnoyers
Re: [RFC PATCH 1/3] Unified trace buffer
Hi Ingo, I completely agree with both Linus and you that accuracy utterly matters. I currently provide a time source meant to meant the tracing requirements and support architectures lacking synchronized TSC (or tsc at all) in my lttng tree. Feel free to have a look. I've had statisfied users relying on these time sources for about 3 years. See the lttng-timestamp-* commits in git://git.kernel.org/pub/scm/linux/kernel/git/compudj/linux-2.6-lttng.git The one in question here (x86) is ...
Sep 25, 1:29 pm 2008
Ingo Molnar
Re: [RFC PATCH 1/3] Unified trace buffer
i'd prefer Peter's simplification and not pass event_id along. Since static types are lost anyway (which is the biggest cost and risk of any such abstraction), we have to convert between types early on. Whether event_id is visible in the API is no big difference. (It might be cheaper to not pass it along even if everyone ends up using it - as it has no semantic meaning anyway.) pretty much the only generic tracing information is time and payload size. ( but even a time key is ...
Sep 25, 3:38 am 2008
Steven Rostedt
Re: [RFC PATCH 1/3] Unified trace buffer
Can this possibly be true? I mean, light travels only one foot every nanosecond. Can it really keep nanosecond accuracy up to dozens of meters away? If you send the same signal to CPU1 that is 1 foot away, as well as send it to CPU2 that is 2 feet away. CPU2 will get that signal at least 1 nanosec after CPU1 receives it. Of course if the hardware is smart enough to know this topology, then it could account for these delays in traffic. -- Steve --
Sep 25, 3:45 pm 2008
Peter Zijlstra
Re: [RFC PATCH 1/3] Unified trace buffer
Hmm, you've got a point there, then it would be 3 package types: - regular - full time - nop Right - if you use raw tsc you're dependent on clock speed, if we'd normalize that on ns instead you'd need at least: l(10000000)/l(2) 23.25349666421153643532 bits to handle HZ=100, leaving us with 32-2-24 = 6 bits for size. Sounds doable (unless I mis-counted on the 0's). Also, I agree on the 4byte alignment, rather than the 8byte Steve seems to favour. --
Sep 25, 7:53 am 2008
Ingo Molnar
Re: [RFC PATCH 1/3] Unified trace buffer
Steve got the _worst-case_ cpu_clock() difference down to 60 usecs not so long ago. It might have regressed since then, it's really hard to do it without cross-CPU synchronization. ( But it's not impossible, as Steve has proven it, because physical time goes on linearly on each CPU so we have a chance to do it: by accurately correlating the GTOD timestamps we get at to-idle/from-idle times to the TSC. ) And note that i'm not only talking about cross-CPU synchronization, i'm also ...
Sep 25, 1:52 pm 2008
Mathieu Desnoyers
Re: [RFC PATCH 1/3] Unified trace buffer
We could use a page header instead to contain the "unused_size" information. It does not need to be an event per se. Putting this information in the page header makes it easy for a consumer to just read the amount of bytes needed, excluding the padding (turns up to be useful for network streaming of trace data). Also, it frees up one event ID for other uses. I think the event ID real estate is pretty important, because every event ID we don't keep for "internal uses" could be used I would ...
Sep 25, 10:15 am 2008
Ingo Molnar
Re: [RFC PATCH 1/3] Unified trace buffer
firstly, for the sake of full disclosure, the very first versions of the latency tracer (which, through hundreds of revisions, morphed into ftrace), used raw TSC timestamps. I stuck to that simple design for a _long_ time because i shared your exact views about robustness and simplicity. But it was pure utter nightmare to get the timings right after the fact, and i got a _lot_ of complaints about the quality of timings, and i could never _trust_ the timings myself for certain types of ...
Sep 25, 12:55 pm 2008
Linus Torvalds
Re: [RFC PATCH 1/3] Unified trace buffer
First off, that's simply not true. Yes, it happens to be true on modern x86-64 CPU's. But in very few other places. Doing even just 64-bit multiples is _expensive_. It's not even _near_ single-cycle. Total and utter bullshit, all of it. Have you forgotten all the oopses due to divide-by-zero because sched_clock() was called early? All that early code that we might well want to trace through? Not only that, but have you forgotten about FTRACE and -pg? Which means that every ...
Sep 25, 9:40 am 2008
Steven Rostedt
Re: [RFC PATCH 1/3] Unified trace buffer
Pretty much all CPUs word align on a 8 byte bounder (until we get those 128bit boxes running), but not all can word align on 4 bytes. I was hoping to make the buffer output somewhat the same across archs. Otherwise, this is going to be quite a complex mess IMHO. -- Steve --
Sep 25, 8:20 am 2008
Martin Bligh
Re: [RFC PATCH 1/3] Unified trace buffer
Agree with you on doing the expensive stuff later. If we wanted to get something that'd pack down to a couple fewer bits, and approximate ns, we could always >> 1 if you were > 2GHz, and >> 2 if you where > 4GHz, etc. which is at least cheap. But do we really need more than 3 bits for size anyway? 28 bytes would fit most events such as system calls, interrupts, page faults, all the common stuff.. Longer than that starts to be expensive in memory consumption anyway, and if you're logging 32 ...
Sep 25, 8:25 am 2008
Steven Rostedt
Re: [RFC PATCH 1/3] Unified trace buffer
Yep, even on those ;-) -- Steve --
Sep 25, 10:32 am 2008
Linus Torvalds
Re: [RFC PATCH 1/3] Unified trace buffer
Now do the same on a CPU that doesn't have TSC. And notice how useless the timestamps are. Linus --
Sep 25, 2:58 pm 2008
Linus Torvalds
Re: [RFC PATCH 1/3] Unified trace buffer
Umm. cpu_clock() isn't even cross-cpu synchronized, and has actually thrown away all the information that can make it so, afaik. At least the comments say "never more than 2 jiffies difference"). You do realize that if you want to order events across CPU's, we're not talking about "jiffies" here, we're talking about 50-100 CPU _cycles_. You also ignore the early trace issues, and have apparently not used it for FTRACE. You also ignore the fact that without TSC, it goes into the same ...
Sep 25, 1:24 pm 2008
Ingo Molnar
Re: [RFC PATCH 1/3] Unified trace buffer
hm, i'm not sure you've thought through this delta record idea. Take a system that goes idle thousands of times a second. (it's easy - just some networking workload) Now take a tracer that just wants to emit a trace entry every now and then. Once or twice a second or so. Note that suddenly you have thousands of totally artificial 'delta' time records between two real events, and have to post-process all your way up between these events to get to the real timeline. ... it is ...
Sep 25, 3:55 pm 2008
Steven Rostedt
Re: [RFC PATCH 1/3] Unified trace buffer
Hmm, sched_clock gives ns accuracy unless the tsc is disabled. And in that case, we don't have any CPU clock :-/ -- Steve --
Sep 25, 9:32 am 2008
Jeremy Fitzhardinge
Re: [RFC PATCH 1/3] Unified trace buffer
Well, you don't need the tsc at the precise moment of the frequency change. You just need to emit the current tsc+frequency+wallclock time before you emit any more delta records after the frequency change. You Yeah. If you ever mention wallclock time in the event stream, you have to tie it to your local timebase (tsc+frequency) to make the whole thing Yeah. Unfortunately, in the virtual case - unless you're virtualizing the tsc itself, which is horrible - you can't really control or ...
Sep 25, 3:39 pm 2008
Linus Torvalds
Re: [RFC PATCH 1/3] Unified trace buffer
The log entries should be reserved with interrupts disabled anyway, and they are per-CPU, so there are no atomicity issues. For NMI's, things get more exciting. I'd really prefer NMI's to go to a separate ring buffer entirely, because otherwise consistency gets really hard. Using lockless algorithms for a variable-sized pool of pages is a disaster waiting to happen. I don't think we can currently necessarily reasonably trace NMI's, but it's something to keep in mind as required ...
Sep 25, 10:29 am 2008
Linus Torvalds
Re: [RFC PATCH 1/3] Unified trace buffer
Why would any of this be "generic"? Quite the reverse. It should be as trace-buffer specific as possible, so that we do *not* share any code or any constraints with other people. Just do rdtsc at first, and make it depend on x86. If the thing is made simple enough, it will be a couple of lines of code for architectures to read their own timestamp counters. And since the normalization is then no longer in the critical part, _that_ can be architecture-independent, but obviously ...
Sep 25, 10:22 am 2008
Linus Torvalds
Re: [RFC PATCH 1/3] Unified trace buffer
No, no. The timestamp code is all in the ring buffer code. That was why I refused to have the layering without it. And hell no, nobody should *ever* read the "tsc_delta" fields etc. Those are entirely internal to the buffering. If any user _ever_ reads or writes those on its own, it's a bug, plain and simple. So when you read trace events, you should get the event data and the timestamp from the trace buffer routines. Nobody should ever even _see_ the internal trace buffer ...
Sep 25, 11:14 am 2008
Jeremy Fitzhardinge
Re: [RFC PATCH 1/3] Unified trace buffer
Sure. NTP keeps machines within 1ms (or better) of each other even though the network latency is much higher and jittery. J --
Sep 25, 4:04 pm 2008
Mathieu Desnoyers
Re: [RFC PATCH 1/3] Unified trace buffer
Even on architectures with non-synchronized TSCs ? -- Mathieu Desnoyers OpenPGP key fingerprint: 8CD5 52C3 8E3C 4140 715F BA06 3F25 A8FE 3BAE 9A68 --
Sep 25, 10:20 am 2008
Steven Rostedt
Re: [RFC PATCH 1/3] Unified trace buffer
Note: RFC v2 implements this. --
Sep 25, 10:02 am 2008
Martin Bligh
Re: [RFC PATCH 1/3] Unified trace buffer
There is part of the type stuff that belongs in the lower layer, it seems - the padding events for the up-to-end-of-page buffering, and the timestamp extensions. It seems wrong to split those across two layers. But perhaps we can keep a couple of bits for this, and three of the bits to represent the length of the data payload (maybe in 4 byte multiples rather than bytes?) That'd let up to 28 bytes as a payload in a short event. --
Sep 25, 7:33 am 2008
Peter Zijlstra
Re: [RFC PATCH 1/3] Unified trace buffer
I rather like this idea, as it gives small entries (the common case) the least overhead but does allow for larger ones. By also putting the time in there you can do the merge sort iterator, Linus was right that everybody wants this anyway. As for delta encoding the time, we could make the tick log the absolute time packet, that's at least 100Hz and it already has to compute the full gtod thing anyway. I don't much like Linus' idea of bringing type information back into the primitive ...
Sep 25, 3:41 am 2008
Steven Rostedt
Re: [RFC PATCH 1/3] Unified trace buffer
Even in my last version I added a "TIME_STAMP" type that can be used in the future to add some kind of synchronization into the trace, that reads GTOD or whatnot. But as you can see, I've been trying to implement these various ideas, since the devil is in the details and the code is the details. How do you get this GTOD read in the ring buffer? If the ring buffer does it without any knowledge from the tracer, it may be doing it a inappropriate times. This would also imply that the GTOD ...
Sep 25, 3:59 pm 2008
Mathieu Desnoyers
Re: [RFC PATCH 1/3] Unified trace buffer
I remembered other concerns about 27 vs 32 bits TSC decision, which are rather important. First, if we have a 27 bits TSC, with overflow every 33ms at 4GHz, we assume the kernel will _never_ have an interrupt latency longer than this for correct heartbeat behavior. The sad thing is that it is not uncommon to see such interrupt latencies once in a while. It's especially bad if the correctness of the timestamps gathered by the one tool that would be helpful to debug such interrupt latency problem ...
Sep 25, 9:37 am 2008
Linus Torvalds
Re: [RFC PATCH 1/3] Unified trace buffer
We do no such thing. Guys, the heartbeat is a _separate_ thing from overflow handling. You don't handle overflow by having a heartbeat that beats fifty times a second just to insert events, just so that the TSC delta would always fit in 27 bits. That would work, but be stupid. It would mean that you fill up your event buffer with uninteresting crud just because nothing happens. Yes, many people want to have a heartbeat (a "Mark" event) every once in a while, but what I suggest is ...
Sep 25, 9:49 am 2008
Linus Torvalds
Re: [RFC PATCH 1/3] Unified trace buffer
Ok. It's still true that we absolutely don't want to add random notrace markers to code just because it's shared with the scheduler. And "sched_clock()" is not a single function with just a few well-defined places, nor are all versions of it at all appropriate for tracing (the non-TSC ones are a total joke - it works for scheduling, but not tracing. Yes. The code looked fine, and had a FIXME. I have no objection to using it as a known buggy approximation for TSC in order to not force ...
Sep 25, 10:07 am 2008
Steven Rostedt
Re: [RFC PATCH 1/3] Unified trace buffer
The above I added because they just made the function tracer output quite ugly and bloated. The lockdep got messing when it was debugging the locks used in the tracer that was tracing lockdep. The above had nothing to do with what kind of clock we used. We added recursion protection, but it was still producing ugly output. --
Sep 25, 1:47 pm 2008
Ingo Molnar
Re: [RFC PATCH 1/3] Unified trace buffer
to prove it, i just applied this patch: Index: linux/kernel/Makefile =================================================================== --- linux.orig/kernel/Makefile +++ linux/kernel/Makefile @@ -21,7 +21,6 @@ CFLAGS_REMOVE_mutex-debug.o = -pg CFLAGS_REMOVE_rtmutex-debug.o = -pg CFLAGS_REMOVE_cgroup-debug.o = -pg CFLAGS_REMOVE_sched_clock.o = -pg -CFLAGS_REMOVE_sched.o = -mno-spe -pg endif obj-$(CONFIG_PROFILING) += profile.o and sched.o was fully traced again. For example ...
Sep 25, 2:41 pm 2008
Ingo Molnar
Re: [RFC PATCH 1/3] Unified trace buffer
note, CONFIG_LOCKSTAT - which hooks deep inside lockdep, uses cpu_clock() too for timings, for similar reasons. Lockdep and lockstat both have a very robust design and a very good track record to prove it. you'd be correct in pointing out that we _do_ have a relatively high regression count in cpu_clock()/sched_clock(), but the reason for that is that its implementation balances on the narrow edge of doability. It implements a very unstable set of requirements: "absolutely fast" pitted ...
Sep 25, 1:20 pm 2008
Ingo Molnar
Re: [RFC PATCH 1/3] Unified trace buffer
... which is exactly what sched_clock() does, combined with a multiplication. (which is about as expensive as normal linear arithmetics on most CPUs - i.e. in the 1 cycle range) Normalizing has the advantage that we dont have to worry about it ever again. Not about a changing scale due to cpufreq, slowing down or speeding up TSCs due to C2/C3. We have so much TSC breakage all across the spectrum that post-processing it is a nightmare in practice. Plus we want sched_clock() to be fast ...
Sep 25, 8:36 am 2008
Steven Rostedt
Re: [RFC PATCH 1/3] Unified trace buffer
I've been just using sched_clock() which already normalizes to ns. But I use a wrapper (ring_buffer_time_stamp) so we can decide on how to keep track later. If we do not normalize, then we must come up yet another generic way to read the CPU clock for all archs. And then we also need to come up with another generic way to normalize it later for output. If I'm missing that this already exists, then I'll go and use it, but I do not think that tracing is worthy enough to implement this ...
Sep 25, 8:26 am 2008
Linus Torvalds
Re: [RFC PATCH 1/3] Unified trace buffer
Have you at all followed the discussion about the people who asked for cross-CPU ordering? They wanted not timestamps at all, but global atomic counter updates. Which is very reasonable, if timestamps don't work (and jiffies certainly doesn't, especially in a NOHZ environment). IOW, my whole argument is that what tracers want is _not_ the same thing as what "sched_clock()" wants. Linus --
Sep 25, 4:33 pm 2008
Martin Bligh
Re: [RFC PATCH 1/3] Unified trace buffer
Simple solution: turn off cpufreq whilst tracing is on ;-) Harder: Keep a timebase and frequency divisor on a per-cpu basis and calculate your offsets from there. This brings you down to HPET resolution though --
Sep 25, 2:15 pm 2008
Ingo Molnar
Re: [RFC PATCH 1/3] Unified trace buffer
and note that there's another pragmatic argument: often we notice cpu_clock() bugs by looking at traces. I.e. people fixing trace timestamps _fix the scheduler_. Sometimes it is very hard to notice scheduling artifacts that happen due to small inaccuracies in cpu_clock(). so there's continuous coupling between precise scheduling and good trace timestamps. I'd be willing to pay a lot more for that than the few (rather obvious...) robustness problems we had with sched_clock() in the ...
Sep 25, 2:16 pm 2008
Jeremy Fitzhardinge
Re: [RFC PATCH 1/3] Unified trace buffer
That suggests that frequency changes should be recorded at a lower layer as well, along with full timestamps and deltas, so that a log parser can correctly generate the timestamps without having to parse higher-level trace records. Maybe the simplest thing to do is make the full timestamp records also include frequency and offset information (to deal with discontinuities caused by things like a vcpu switching between physical cpus with unsynced tscs). J --
Sep 25, 2:02 pm 2008
Steven Rostedt
Re: [RFC PATCH 1/3] Unified trace buffer
generic as in, could be implement in architecture dependent ways but with a "generic" interface. IOW, I don't want the trace to be dependent on any arch. ftrace already runs on x86, ppc, sparc64, mips, arm, sh, and I could do a HAVE_RING_BUFFER_TIMESTAMP config option for archs that implement it, and just use something dumb for those that don't. For now I'll keep to sched_clock, just because it makes it easy for me. But with The one thing that seemed to me most apparent from talking to ...
Sep 25, 10:39 am 2008
Ingo Molnar
Re: [RFC PATCH 1/3] Unified trace buffer
Really, we traced all these files for ages. I can restore it if it's worthwile - but lockdep totally kills the usability of function traces, it inserts thousands of uninteresting events over and over again. Note commit c349e0a0, there i added -no-pg for robustness reasons (we locked up) - back then the scheduler clock was within sched.c. Now it is all separated out cleanly in kernel/sched_clock.o and i think we can add yes, that's true. And that's why we absolutely want to have ...
Sep 25, 2:10 pm 2008
Jeremy Fitzhardinge
Re: [RFC PATCH 1/3] Unified trace buffer
The "full timestamp" records should include: * absolute tsc * absolute monotonic timestamp * new tsc freqency If you then make sure that all the cpufreq/idle/suspend-resume code emits appropriate records when changing the tsc frequency, then you should always be able to fully regenerate an absolute timestamp. If you generate the monotonic timestamp with a good clocksource, then you should be able to correlate the timestamps between cpus. Oddly enough, this is identical to ...
Sep 25, 2:14 pm 2008
Linus Torvalds
Re: [RFC PATCH 1/3] Unified trace buffer
Oh, and I didn't notice (because Steven pointed out "notrace" and I didn't see any of them), that in order to get things to work you had just added CFLAGS_REMOVE_lockdep.o = -pg CFLAGS_REMOVE_lockdep_proc.o = -pg CFLAGS_REMOVE_mutex-debug.o = -pg CFLAGS_REMOVE_rtmutex-debug.o = -pg CFLAGS_REMOVE_cgroup-debug.o = -pg CFLAGS_REMOVE_sched_clock.o = -pg CFLAGS_REMOVE_sched.o = -mno-spe -pg all ovr the place, which was part of my argument against this crap in the first place. Yes, by ...
Sep 25, 1:29 pm 2008
Linus Torvalds
Re: [RFC PATCH 1/3] Unified trace buffer
Yes and no. The reason I say "and no" is that it's not technically really possible to atomically give the exact TSC at which the frequency change took place. We just don't have the information, and I doubt we will ever have it. As such, there is no point in trying to make it a low-level special op, because we'd _still_ end up being totally equivalent with just doing as regular trace-event, with a regular TSC field, and then just fill the data field with the new frequency. But ...
Sep 25, 2:55 pm 2008
Ingo Molnar
Re: [PATCH 1/3] corruption check: move the corruption ch ...
yes, that's natural. Anything that isnt super-important to backmerge should go into a separate patch. Ingo --
Sep 25, 3:40 am 2008
Andreas Gruenbacher
Re: [PATCH 2/2] file capabilities: remove CONFIG_SECURIT ...
Real file capability support in RPM seems important to me; hacking this into %post scripts is not a reasonable approach. Thanks, Andreas --
Sep 24, 6:36 pm 2008
Serge E. Hallyn
Re: [PATCH 2/2] file capabilities: remove CONFIG_SECURIT ...
That surprises me - I thought a reasonable amount of code was cut as Fedora 9 and ubuntu intrepid already have full capabilities support and modern libcap. Sles is set to ship with a modern libcap, and according to what Andreas is saying, if we can provide them with the no_file_caps boot option then suse is willing to have a kernel with capabilities turned on. I think gentoo still comes with libcap-1. Need to look into changing that. I suppose the next baby-step will be to do get rid ...
Sep 24, 6:02 pm 2008
Chris Wright
Re: [PATCH 2/2] file capabilities: remove CONFIG_SECURIT ...
The baby step including simple things like setuid ping was the step I was thinking of. That w/ embedded and bloatwatch in mind is why I asked. thanks, -chris --
Sep 24, 6:19 pm 2008
David Brownell
Re: [PATCH 2/4] powerpc/qe: new call to revert a gpio to ...
I think you have enough pieces in place to get that "something better" _very_ quickly. Or I'd feel worse about abusing those poor little grey cells. ;) --
Sep 24, 9:13 pm 2008
Jeff Garzik Sep 24, 5:51 pm 2008
Jeff Garzik Sep 24, 6:29 pm 2008
Jiri Kosina
Re: [PATCH 3/3] e1000e: remove failed request for sw/fw/ ...
BTW are you going to take the whole series, or just this fix? I don't think that the other patches should be really applied to upstream (they could be very handy for debugging this horrible issue, once we could do any debugging of it, thoug), as they don't protect against anyone creating a new rw mapping, which can then be freely used to still overwrite the mmio area without any problems. -- Jiri Kosina SUSE Labs --
Sep 24, 6:00 pm 2008
Drew Moseley
Re: [PATCH] Create PNP device attributes via dev_attrs f ...
Ack. Sorry about that. Wrong tree. Here's the right one. ;-) Signed-off-by: Drew Moseley <dmoseley@mvista.com> diff --git a/drivers/pnp/base.h b/drivers/pnp/base.h index 9fd7bb9..3532984 100644 --- a/drivers/pnp/base.h +++ b/drivers/pnp/base.h @@ -4,6 +4,7 @@ */ extern spinlock_t pnp_lock; +extern struct device_attribute pnp_interface_attrs[]; void *pnp_alloc(long size); int pnp_register_protocol(struct pnp_protocol *protocol); @@ -16,7 +17,6 @@ struct pnp_card ...
Sep 25, 9:42 am 2008
Kay Sievers Sep 25, 3:13 am 2008
KOSAKI Motohiro
Re: [PATCH 1/3] Report the pagesize backing a VMA in /pr ...
looks good to me. and, I tested this patch on x86_64 mmotm 0923 and it works well. Reviewed-by: KOSAKI Motohiro <kosaki.motohiro@jp.fujitsu.com> --
Sep 25, 7:51 am 2008
KOSAKI Motohiro
Re: [PATCH 3/3] Report the pagesize backing a VMA in /pr ...
looks good to me. However, this patch can't build on x86_64 mmotm 0923 because it conflict against fix-vma-display-mismatch-between-proc-pid-mapssmaps.patch (by Joe Korty <joe.korty@ccur.com>). instead, I tested following rebased version and I confirmed it works well. =============================================================== From: Mel Gorman <mel@csn.ul.ie> This patch adds a new field for hugepage-backed memory regions to show the pagesize in /proc/pid/maps. While the ...
Sep 25, 7:57 am 2008
Jens Axboe
Re: [PATCH] BUG: ll_merge_requests_fn() updates req->nr_ ...
This is totally broken, I gather you are thinking the bug is fixed since the counts always match up. Well that's mainly because you know don't do Hmm? I think you should try and describe the problem you are seeing instead, then we can perhaps find the real issue. -- Jens Axboe --
Sep 25, 6:55 am 2008
FUJITA Tomonori
Re: [PATCH] BUG: ll_merge_requests_fn() updates req->nr_ ...
On Tue, 23 Sep 2008 23:58:02 +0530 Yeah, in fact, blk_hw_contig_segment() is always false on the majority of architectures (on only PARISC and Alpha, it could be true). Your patch doesn't look correct. Virtually, the patch always disables I have no idea how BUG_ON() in scsi_init_sgtable() is triggered. Can you give more information, HBA, IOMMU (if you use), and the values of req->nr_phys_segment, req->nr_hw_segment, count, etc in in scsi_init_sgtable() when you hit the bug? BTW, ...
Sep 25, 12:35 am 2008
Serge E. Hallyn
Re: [PATCH 3/4] TPM: rcu locking
Hey Paul, curious, why do they not look like the same list? -serge --
Sep 25, 6:36 am 2008
Serge E. Hallyn
Re: [PATCH 3/4][resubmit] TPM: rcu locking
Oops. So ignore my other comment :) Sorry, I'm behind on some email. --
Sep 25, 6:39 am 2008
Paul E. McKenney
Re: [PATCH 3/4] TPM: rcu locking
Because one of them is named &tpm_chip_list, a global variable, and the other seemed to be returned from a function taking a struct device as an argument. This is indeed consistent with an element in this list being hung off of the struct device, so perhaps I was just being insufficiently persistent in tracking things down. Thanx, Paul --
Sep 25, 7:33 am 2008
Serge E. Hallyn
Re: [PATCH 3/4] TPM: rcu locking
Right, tpm_register_hardware both does dev_set_drvdata(dev, chip) to set dev->driver_data = chip, and list_add(chip->list, &tpm_chip_list). The tpm_remove_hardware(), then, gets the device, gets the chip from dev->driver_data, and removes it from the tpm_chip_list. It does look kosher. Though once this is all applied to some public git tree, I want to take some time to look at the full result. -serge --
Sep 25, 7:59 am 2008
Serge E. Hallyn
Re: [PATCH 3/4] TPM: rcu locking
And so chip->data_buffer (and data_pending) may *only* be accessed by the owner of the chip, who has set chip->is_open? Could you comment that in tpm.h for future reference? I've been trying to push for better commenting of precisely how the locking is expected to work, but I think that was misunderstood as asking for email --
Sep 25, 6:29 am 2008
Keith Packard
Re: [patch] mm: pageable memory allocator (for DRM-GEM?)
Well, the goal is to "eventually" get to use fds so that at least some of our common operations can use regular sys calls. But, not having those trapped in the shmem layer may end up being a feature as we'll get to watch more closely, although dealing with the actual pread/pwrite One does what one can. Of course, in this case, 'cross platform' is just x86/x86_64 as we're talking Intel integrated graphics. When (if?) we figure out how to create a common interface across multiple cards for some ...
Sep 24, 6:20 pm 2008
Nick Piggin
Re: [patch] mm: pageable memory allocator (for DRM-GEM?)
That might be the cleanest logical way to do it actually. But for the moment I'm happy not to pull tmpfs apart :) Even if it seems like the wrong way around, at least it is insulated to within mm/ --
Sep 24, 8:07 pm 2008
Keith Packard
Re: [patch] mm: pageable memory allocator (for DRM-GEM?)
Sure; no sense changing that before we've gotten some experience with the new API anyway. Would we consider modifying sysv IPC as well? It currently uses the shmem_file_setup function although it lives a long ways from mm... --=20 keith.packard@intel.com
Sep 24, 11:16 pm 2008
Keith Packard
Re: [patch] mm: pageable memory allocator (for DRM-GEM?)
It seems like once you've done that you might consider extracting the page allocator from shmem so that drm, tmpfs and sysv IPC would share the same underlying memory manager API. --=20 keith.packard@intel.com
Sep 24, 7:43 pm 2008
Dave Airlie
Re: [patch] mm: pageable memory allocator (for DRM-GEM?)
On Fri, Sep 26, 2008 at 1:39 AM, Thomas Hellström You can't fail suspend, it just doesn't work like that. The use case is close laptop shove in bag, walk away. Having my bag heat up and the laptop inside not suspended isn't the answer ever. So with that in mind, I think we either a) keep some backing pages around, or b) make object file backed so if the swap space fills up we can back out to the file objects. --
Sep 25, 3:41 pm 2008
Nick Piggin
Re: [patch] mm: pageable memory allocator (for DRM-GEM?)
OK, that's all that can be asked I guess. Low level object / memory OK. I will have to add some facilities to allow mmaps that go back through to tmpfs and be swappable... Thanks for the data point. --
Sep 24, 7:30 pm 2008
KAMEZAWA Hiroyuki
Re: [patch] mm: pageable memory allocator (for DRM-GEM?)
On Tue, 23 Sep 2008 11:10:17 +0200 Some questions.. - could you use GFP_HIGHUSER rather than GFP_HIGHUSER_MOVABLE ? I think setting mapping_gfp_mask() (address_space->flags) to appropriate value is enough. - Can we mlock pages while it's vmapped ? (or Reserve and remove from LRU) Then, new split-lru can ignore these pages while there are mapped. over-killing ? - Doesn't we need to increase page->mapcount ? - memory resource contorller should account these pages ? ...
Sep 25, 1:45 am 2008
Thomas Hellström
Re: [patch] mm: pageable memory allocator (for DRM-GEM?)
Actually, I think the pages must be allowed to be freed, and that we don't put a requirement on "pageable" to keep swap-space slots for these pages. If we hit an OOM-condition during suspend-to-memory that's bad, but let's say we required "pageable" to keep swap space slots for us, the result would perhaps be that another device wasn't able to suspend, or a user-space program was killed due to lack of swap-space prior to suspend. I'm not really sure what's the worst situation, but my ...
Sep 25, 8:39 am 2008
Nick Piggin
Re: [patch] mm: pageable memory allocator (for DRM-GEM?)
Ah, interesting... freeing/recreating isn't _too_ expensive, but it is going to have to allocate a lot of pages (for a big object) and copy a lot of memory. It's strange to say "cleaned", in a sense, because the allocator itself doesn't know it is being used as a writeback cache ;) (and it might get confusing with the shmem implementation because your cleaned != shmem cleaned!). I understand the operation you need, but it's tricky to make it work in the existing shmem / vm infrastructure I ...
Sep 24, 5:18 pm 2008
Nick Piggin
Re: [patch] mm: pageable memory allocator (for DRM-GEM?)
Pity. Anyway, I accept that, let's move on. I guess so. A big problem of ioctls is just that they had been easier to add so they got less thought and review ;) If your ioctls are stable, Well, no not a seperate filesystem to do the pageable backing store, but a filesystem to do your object management. If there was a need for pageable You can map them to userspace if you just take a page at a time and insert them into the page tables at fault time (or mmap time if you ...
Sep 24, 5:30 pm 2008
Thomas Hellström
Re: [patch] mm: pageable memory allocator (for DRM-GEM?)
Well, the typical usage pattern is: 1) User creates a texture object, the data of which lives in a pageable object. 2) DRM decides it needs to go into V(ideo)RAM, and doesn't need a backing store. It indicates "dontneed" status on the object. 3) Data is evicted from VRAM. If it's not dirtied in VRAM and the "dontneed" pages are still around in the old backing object, fine. We can and should reuse them. If data is dirtied in VRAM or the page(s) got discarded we need new pages and to set ...
Sep 25, 12:19 am 2008
Keith Packard
Re: [patch] mm: pageable memory allocator (for DRM-GEM?)
Note that this can occur as a result of a suspend-to-memory transition at which point *all* of the objects in VRAM will need to be preserved in main memory, and so the pages aren't really 'freed', they just don't need to have valid contents, but the system should be aware that the space may be needed at some point in the future. --=20 keith.packard@intel.com
Sep 25, 7:38 am 2008
Andreas Dilger
Re: [PATCH, RFC] ext4: Use preallocation when reading fr ...
It looks like moving from 64kB to 128kB readahead might be a loss for "unknown" workloads, since that increases latency by 7.67% for the random inode case, but we only get 4.5% improvement in the sequential inode case. Also recall that at large scale "htree" breaks down to random inode lookup so that isn't exactly a fringe case (though readahead may still help if the cache is large enough). Cheers, Andreas -- Andreas Dilger Sr. Staff Engineer, Lustre Group Sun Microsystems of Canada, ...
Sep 25, 4:40 pm 2008
Rafael J. Wysocki
Re: Wakeup from suspend via keyboard and mouse doesn't w ...
This actually is a regression for which I'm sorry. Can you please check if the appended patch restores the previous behavior? Thanks, Rafael --- From: Rafael J. Wysocki <rjw@sisk.pl> ACPI: Make /proc/acpi/wakeup interface handle PCI devices Make the ACPI /proc/acpi/wakeup interface set the appropriate wake-up bits of physical devices corresponding to the ACPI devices and make those bits be set initially for devices that are enabled to wake up by default. This is needed to restore ...
Sep 25, 3:35 pm 2008
Grant Grundler
Re: Notes from LPC PCI/MSI BoF session
That's sort of what I proposed at the end of my email: | A second solution I thought of later might be for the device driver to | export (sysfs?) to irqbalanced which MSIs the driver instance owns and | how many "domains" those MSIs can serve. irqbalanced can then write | back into the same (sysfs?) the mapping of MSI to domains and update I agree. The discussion will need more though. thanks, --
Sep 25, 8:53 am 2008
Grant Grundler
Re: Notes from LPC PCI/MSI BoF session
I would entirely agree with this but we have the "N:1" case that I described. (multiple vectors which by design should target one CPU.) That sounds remarkable close saying the sysadmin has to know about each devices attributes. If interpreted that way, I'll argue that's not realistic in 99% of the cases and certainly not how sysadmins Yes, good point. That's certainly a better approach and could precede the "second proposal" above. ie driver queries how many domains it should "plan" for, ...
Sep 25, 9:15 am 2008
Haavard Skinnemoen
Re: [PATCH 3/4] atmel-mci: support multiple mmc slots
Ok, but how _do_ I deal with that? If the clock is turned off due to power management, it should be safe to leave it running at the same rate. But how about initialization? Should I stall the queue until the Ok, good. A change like that needs quite extensive testing. Btw, I just noticed that other thread about pxamci clock management. Will it cause problems if the clock stops briefly from time to time? Currently, the controller stops the clock if the FIFO gets full/empty during transfer ...
Sep 25, 6:51 am 2008
Pierre Ossman
Re: [PATCH 3/4] atmel-mci: support multiple mmc slots
On Wed, 24 Sep 2008 16:35:27 +0200 I'd say always. I'd never dare overclock something that's powered, even if we're not currently communicating with it. Remember to deal with the case of a powered but unclocked card. This will primarily happen during card init, but it could should up later as Agreed. --=20 -- Pierre Ossman Linux kernel, MMC maintainer http://www.kernel.org rdesktop, core developer http://www.rdesktop.org WARNING: This correspondence is ...
Sep 25, 12:05 am 2008
Bill Davidsen
Re: exception Emask 0x0 SAct 0x1 / SErr 0x0 action 0x2 frozen
You certainly called the SMART issue, I was wondering why a new distribution install on some older hardware was getting all the errors, clearly the Fedora "smartd" doesn't check SMART capability before trying to enable the feature. Oddly the drive on which I see this does reply to SMART requests, so the firmware must be "semi-functional." Not a problem, in my case the drive is just used for testing handling of hot swap, and has no data of any value. -- Bill Davidsen ...
Sep 25, 8:17 am 2008
Eric Sandeen
Re: [PATCH 4/10] xfs: Fix error handling in write_super_ ...
Here's that part in case it's helpful :) Feel free to roll it into your patch. -Eric Make xfs_fs_log_dummy() return errors which can then be returned by xfs_fs_lockfs(), once it is capable of doing so. Signed-off-by: Eric Sandeen <sandeen@sandeen.net> --- Index: linux-2.6/fs/xfs/xfs_fsops.c =================================================================== --- linux-2.6.orig/fs/xfs/xfs_fsops.c 2008-08-04 15:30:31.000000000 -0500 +++ linux-2.6/fs/xfs/xfs_fsops.c 2008-09-25 ...
Sep 25, 2:22 pm 2008
Takashi Iwai
Re: [alsa-devel] [PATCH] add the nvidia HDMI codec drive ...
At Thu, 25 Sep 2008 15:14:50 +0800, Thanks for patches. It's good that you are using git now. The code changes look good, but there are small issues in the patches: - The author line is bogus. Edit your ~/.gitconfig and add like the following: [user] name = Wei Ni email = wni@nvidia.com Then reset and commit patches again. - Missing sign-off. Add --sign option when you do git-commit. - No proper change log. Only a subject line isn't enough for many (most) cases, ...
Sep 25, 2:55 am 2008
Wei Ni
RE: [alsa-devel] [PATCH] add the nvidia HDMI codec drive ...
Hi, Takashi I use git to generate patches. (I run git-fetch origin, git-fetch-rebase origin, and git-format-patch origin) There are two patch, one for the addition of HDMI, and one for the fix for our aza controller. Please check it. Thanks -----Original Message----- From: Takashi Iwai [mailto:tiwai@suse.de]=20 Sent: Monday, September 22, 2008 5:58 PM To: Wei Ni Cc: peerchen; Pavel Hofman; Peer Chen; alsa-devel; linux-kernel; akpm Subject: Re: [alsa-devel] [PATCH] add the nvidia HDMI ...
Sep 25, 12:14 am 2008
Wei Ni
RE: [alsa-devel] [PATCH] add the nvidia HDMI codec drive ...
Thanks for your comments, I will fix and repost again. -----Original Message----- From: Takashi Iwai [mailto:tiwai@suse.de] Sent: Thursday, September 25, 2008 5:55 PM To: Wei Ni Cc: peerchen; Pavel Hofman; Peer Chen; alsa-devel; linux-kernel; akpm Subject: Re: [alsa-devel] [PATCH] add the nvidia HDMI codec driverfor MCP77/79 At Thu, 25 Sep 2008 15:14:50 +0800, Thanks for patches. It's good that you are using git now. The code changes look good, but there are small issues in the ...
Sep 25, 4:29 am 2008
KOSAKI Motohiro
Re: [PATCH 1/2] Report the pagesize backing a VMA in /pr ...
OK. I'll review and test your latest patch without MMUPageSize part. (maybe today's midnight or tommorow) Thanks! --
Sep 25, 5:23 am 2008
H. Peter Anvin
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM
That looks like the device disappeared completely. -hpa --
Sep 25, 11:39 am 2008
Jiri Kosina
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM
Yes, I think that xorg/xorg i915 driver/libdrm/GEM/whatever are the biggest suspect currently, according to the data that has been gathered so far. Still, what confuses me a little bit -- the EEPROM of the card is set to all 0xff, once the corruption happens. Isn't that a quite a coincidence, that bytes representing "nothing" in this context are used? If being set to 0 (it's so easy to call memset(0) on a bogus pointer, there are usually lots of them in the code) or to random garbage, ...
Sep 25, 10:24 am 2008
Jiri Kosina
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM
I wasn't able to rule out PAT from the suse bugreports POV, as we have PAT enabled both for 32bit and 64bit x86. If Ubuntu has it disabled also for 64bit x86 (where do these guys have .config files to check?), I think we can definitely rule out PAT, as there has been at least one report from Ubuntu user on this very issue. -- Jiri Kosina SUSE Labs --
Sep 25, 3:45 pm 2008
Dave Airlie
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM
Well the non-GEM paths are really the old codepaths we used in the older drivers.. So unless we do something really dumb... I'd target three areas PAT, pciaccess and e1000e itself. Dave. --
Sep 25, 2:06 pm 2008
David Miller
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM
From: "Dave Airlie" <airlied@gmail.com> All PCI device drivers in the kernel first do pci_enable_device() which essentially enables all BARs. The flash lives in BAR 1 of the E1000E, for example. --
Sep 24, 9:00 pm 2008
Jiri Kosina
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM
But the xorg intel driver shipped with xorg 7.4 already has support for GEM, right? So there could still be some bug in the GEM-aware driver Yes, booting with 'nopat' is on my list to try immediately after we are This we could catch easily even with strace, right? -- Jiri Kosina SUSE Labs --
Sep 25, 1:22 pm 2008
Chuck Ebbert
Re: [Bug #11608] 2.6.27-rc6 BUG: unable to handle kernel ...
On Sun, 21 Sep 2008 20:54:23 +0200 (CEST) As I said in the bugzilla entry: Oops: 000b Bit 3 is set -- the processor detected 1's in reserved bits of the page directory. That can't be good... --
Sep 24, 5:46 pm 2008
Jesse Barnes
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM
I should be able to test the mmap fix independently of the e1000 breakage at least... lemme try it out now... -- Jesse Barnes, Intel Open Source Technology Center --
Sep 25, 9:08 am 2008
Jiri Kosina
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM
Uh oh. Shouldn't we put something like the patch below in Linus' tree unless we get this sorted out? Otherwise more and more people who use -rc kernels will run into this, and will get their hardware [hopefully temporarily, but not all users are able to re-flash their network card EEPROMs, right] bricked. I know that it is quite aggressive and is going to disable wired networking on a lot of systems that have been functioning properly, therefore RFC ... From: Jiri Kosina ...
Sep 24, 6:27 pm 2008
H. Peter Anvin
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM
Okay, I just had a scary and hopefully stupid thought. Especially Intel often has backchannels between the chipset and the Ethernet controller for management functions -- anything from WoL to IPMI -- generally over some kind of low-speed serial bus. We're not in a situation where the EEPROM can be touched from the chipset via the SMBus or some other non-CPU channel? -hpa --
Sep 25, 3:57 pm 2008
Jesse Barnes
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM
Here's a patch that adds range checking to the sysfs mappings at least. This patch should catch the case where X (or some other process) tries to map beyond the specific BAR it's (supposedly) trying to access, making things safer in general. FWIW both my F9 and development versions of X start up fine with this patch applied. DaveM, will this work for you on sparc? It looked like your code was allowing bridge window mappings, but that behavior should be preserved as long as your ...
Sep 25, 12:43 pm 2008
Krzysztof Halasa
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM
I guess it's more about the E1000's serial configuration EEPROM, the registers seem to live in BAR0 (EECD and for reading perhaps EERD). Corrupted EEPROM (and thus PCI config registers) can easily result in a dead machine. I will be writing a tool for writing 82541PI EEPROMs on a custom board soon (unless there is one available, for Linux, of course), I'm not sure it's the flash that is corrupted. Anyway booting the laptop should be quite easy (physically disabling the EEPROM on ...
Sep 25, 9:26 am 2008
Jesse Barnes
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM
Moreover, we don't actually do any writing (that I know of) of the ROM image from the X drivers or the kernel. In fact, in many cases X should be accessing the RAM copy of the ROM at 0xc0000 rather than via the ROM BAR. That said, adding a check to the x86 code would be a good thing to do; I'll hack up a patch tomorrow unless someone beats me to it. -- Jesse Barnes, Intel Open Source Technology Center --
Sep 24, 5:26 pm 2008
Dave Airlie
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM
On Fri, Sep 26, 2008 at 7:42 AM, Jesse Brandeburg It rules out PAT I suppose, they have seen the issue as well. Dave. --
Sep 25, 2:45 pm 2008
Dave Airlie
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM
No we actually are more likely unable to do anything from the kernel, if its happening from userspace firstly we need a reflash utility that is safe, otherwise people who have the issue can't reproduce it, and people who don't have the issue don't want to play with it. I think e1000e may enable a BAR or something that causes the issue to break this hw., I haven't seen it broken on any machine where e1000e wasn't loaded yet. Again the r8169 might be the same issue, but it maybe because the ...
Sep 24, 8:51 pm 2008
Dave Airlie
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM
Yup on my laptop these were far away and I wondered what could mangle things that badly. Well I'm out of the race, my attempts to re-write my eeprom using an eeprom from an equivalent laptop have totally failed and my BIOS won't boot anymore - so my laptop is == a brick. Dave. --
Sep 24, 5:22 pm 2008
Nick Piggin
Re: [Bug #11608] 2.6.27-rc6 BUG: unable to handle kernel ...
54384.988151] BUG: unable to handle kernel paging request at ffff8800601dd000 [54384.992095] IP: [<ffffffff80375457>] clear_page_c+0x7/0x10 [54384.992095] PGD 202063 PUD 8067 PMD 65d54163 PTE 80002020601dd163 [54384.992095] Oops: 000b [1] SMP DEBUG_PAGEALLOC I initially suspect PAT (maybe via DEBUG_PAGEALLOC)... but let's see if the 3rd line here is useful. xRRRRRRRRRRRRRRRRRRRRRRR|40b|<--MAXPHYS PHYS-->|...RR.actuwp PGD: ...
Sep 24, 8:03 pm 2008
Jiri Kosina
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM
That was exactly the point I was trying to make, that these error paths will probably also need auditing, once we rule out the possibility of NVRAM being overwritten from kernelspace. Thanks, -- Jiri Kosina SUSE Labs --
Sep 25, 1:35 pm 2008
H. Peter Anvin
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM
Typical card EEPROMs are serial - either I2C or SPI. I believe the Intel cards use SPI EEPROMs, but I'm not sure. [Disclaimer: I don't actually know SPI all that well; I know I2C better. However, I'm pretty sure the following argument does apply to both.] Consider a corruption which turns a read command into a write command -- often just a single bit difference. Now, the EEPROM will expect data in to write, but nothing will be driving the data line, so it will typically be a 1. As ...
Sep 25, 11:46 am 2008
Jiri Kosina
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM
Actually there is no way of not shipping GEM when shipping xorg 7.4, isn't it? So definitely GEM could be potential cause here, I think. -- Jiri Kosina SUSE Labs --
Sep 25, 5:24 am 2008
Jesse Brandeburg
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM
on my ich9 based system the e1000e BAR1 regions are back to back with both the vga memory map and the audio mem, either of which could be the mangler, but more likely vga device (say X maybe) since it is mapped I'm really sorry to hear that, I wonder if the laptop has an "emergency bios update" mode like many PCs used to through a jumper. Dave A., let us know if you make any recovery progress. I plan to try some random writes tomorrow to my BAR1 space and see if my flash gets ...
Sep 24, 9:25 pm 2008
Jesse Barnes
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM
We have confirmation that this isn't GEM related; according to the Novell bug at https://bugzilla.novell.com/show_bug.cgi?id=425480 people have hit the problem with kernels w/o GEM. That doesn't rule out i915 (though I don't think any changes have gone in since 2.6.26 that would have caused this) or xf86-video-intel. It's possible that X is getting confused about BAR mappings somehow, resulting in a clobbered e1000e NVRAM, but why would the kernel version matter in that case? The only ...
Sep 25, 11:56 am 2008
Jesse Barnes
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM
X.Org 7.4 came with xf86-video-intel 2.4.2 right? That doesn't have any GEM bits in it either. However, the "Factory" log at #425480 *does* indicate that a GEM aware 2D driver was loaded (the "[drm:i915_getparam] *ERROR* Unknown parameter 5" message indicates as much), but the kernel was definitely not GEM aware otherwise the call would have succeeded. So that rules out GEM proper, but it could still be a bug in one of the non-GEM paths in the experimental Yep, that one's easy to ...
Sep 25, 12:36 pm 2008
Jiri Kosina
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM
Good. We will use this on affected machines after we start some real At least for debugging purposes I'd propose to put a printk() there with process name, and the range it tries to map. Thanks, -- Jiri Kosina SUSE Labs --
Sep 25, 1:45 pm 2008
Jiri Kosina
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM
If any of you guys has Lenovo thinkpad (T60p ideally) with 8086:104b revision 3 card, could you please send me the respective "ethtool -e" dump? Thanks, -- Jiri Kosina SUSE Labs --
Sep 25, 12:01 pm 2008
Jeff Garzik
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM
That seems a bit drastic, particularly when the debugging was beginning to point to another culprit. We have equal case at this point to disable r8169 and i915_drm, no? Jeff --
Sep 24, 7:28 pm 2008
Jesse Brandeburg
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM
ubuntu has CONFIG_X86_PAT disabled for at least i386 arch, maybe that is relevant. --
Sep 25, 2:42 pm 2008
Frans Pop
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM
Something else to worry about is bisections. People seeing an unrelated issue with .27 after release may well be asked to do a bisection and could then run into the issue even if it is fixed before the release. Guess we'll need to wait and see what the root cause is to know if that's Extra datapoint. As far as I've seen this problem has not yet been reported by any people running Debian. This could point to X.Org as Debian currently has 7.3 while I think the reports so far have been ...
Sep 24, 7:01 pm 2008
David Miller
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM
From: Jiri Kosina <jkosina@suse.cz> Setting framebuffer bytes to 0xff is pretty common, for example for color keys and anti-aliasing pixel values. --
Sep 25, 1:06 pm 2008
Krzysztof Halasa
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM
Perhaps the entire chip has been erased with the "ERAL" (erase all) command. Requires previously issued EWEN (erase/write enable). Each command seems to require several writes to the EEPROM control register. -- Krzysztof Halasa --
Sep 25, 12:23 pm 2008
Kok, Auke
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM
I asked that person and he said reverting to an older kernel made it work again, also there is a fix out as romieu pointed out as well, so this seems to be a different bug for now. Auke --
Sep 25, 1:45 pm 2008
Jiri Kosina
Re: [Bug #11382] e1000e: 2.6.27-rc1 corrupts EEPROM/NVM
The problem here is that what we desperately need first is a method to restore the original EEPROM contents after it gets corrupted (David Airlie has, sadly, apparently bricked his notebook while trying to do so). Without this, we can put a lot of debugging/protecting patches into the kernel, but we won't be able to succesfully verify anything, because testing wouldn't be possible. Added Jesse and Karsten to CC, as they are working on such a tool right now, as far as I know. -- ...
Sep 24, 5:33 pm 2008
chri
Re: [spi-devel-general] [PATCH RESEND] max3100 driver
I will have a look at kerneldoc and do the change. I'm waiting for a minor number in "Low-density serial ports" before resending the modifications asked by Andrew and Alan, so I will do this too. I just I (wrongly!) was assuming that probing of devices is serialized. I it's freezeabe and (looking at include/linux/workqueue.) it implies that it's single-threaded. This is important because I don't do locking since I presume all the I/O to the MAX3100 is done in just one workqueue. When I do ...
Sep 25, 12:07 am 2008
David Brownell
Re: [spi-devel-general] [PATCH RESEND] max3100 driver
You got this working ... congrats! :) Small suggestion: next time you resend this, make sure that $SUBJECT mentions it's a UART driver. Maybe even that it's a SPI UART driver. That should help get more comments. This is a bit picky, but it's the first thing I noticed when scanning the patch ... wierd comment layout! Either indent those all, or (better) convert to kerneldoc style. Potentially less picky: probe() doesn't lock max3100s[], neither does remove(), and in fact there ...
Sep 24, 9:56 pm 2008
Jay Cliburn
Re: [PATCH] net: add net poll support for atl2 driver
On Sat, 20 Sep 2008 15:56:44 +0800 Acked-by: Jay Cliburn <jacliburn@bellsouth.net> --
Sep 25, 4:45 pm 2008
Tom Zanussi
[RFC PATCH 5/8] relay - Map the first sub-buffer at the ...
Map the first sub-buffer at the end of the buffer, for temporary convenience. Make relay buffers 'circular' for writing by mapping the first subbuf at end of last subbuf. This is so we can do writes across last subbuf boundary without adding special write logic. This is a temporary state of affairs and it all goes away in a future patch, but it's here now so things will still work. --- kernel/relay.c | 26 +++++++++++++++----------- 1 files changed, 15 insertions(+), 11 ...
Sep 24, 11:07 pm 2008
Tom Zanussi
[RFC PATCH 4/8] relay - Add reserved param to switch-sub ...
Add reserved param to switch-subbuf, in preparation for non-pad write/reserve. Because a write/reserve can now cross sub-buffer boundaries, we use the length returned as a remainder for the new sub-buffer, and use the reserved param to return a pointer to the reserved space, or NULL if it couldn't be reserved. This patch also changes write/reserve to preserve their current behavior despite that change. This all goes away in a future patch, but is here now so things don't break. --- ...
Sep 24, 11:07 pm 2008
Tom Zanussi
[RFC PATCH 3/8] relay - Add channel flags to relay, remo ...
relay should probably have had a flags param from the beginning; it wasn't originally added because it wasn't originally needed - it probably would have helped avoid some of the callback contortions that were added due to a lack of flags. This adds them and does a small amount of low-hanging cleanup, and is also in preparation for some new flags in future patches. --- block/blktrace.c | 5 ++--- include/linux/relay.h | 19 ++++++++++--------- kernel/relay.c | 20 ...
Sep 24, 11:07 pm 2008
Tom Zanussi
[RFC PATCH 0/8] current relay cleanup patchset
Here's the current relay cleanup patchset. The first two patches make the write path completely replaceable, the third adds flags along with some related cleanup, and the next 5 remove the padding in several stages. It's a work in progress, but because I wanted the intermediate stages to actually work and not break anything, some of these patches, especially 05, are just temporary and will be removed in the next iteration. I didn't have time to clean up the first 3 either - I'll also do ...
Sep 24, 11:07 pm 2008
Tom Zanussi
[RFC PATCH 8/8] relay - Clean up remaining padding-relat ...
Clean up remaining padding-related junk. Removes the rest of the padding-related junk. Also simplifies the subbuf_start callback a bit. --- block/blktrace.c | 5 +++-- include/linux/relay.h | 12 ++---------- kernel/relay.c | 19 ++++--------------- virt/kvm/kvm_trace.c | 7 ++++--- 4 files changed, 13 insertions(+), 30 deletions(-) diff --git a/block/blktrace.c b/block/blktrace.c index 150c5f7..271b7b7 100644 --- a/block/blktrace.c +++ b/block/blktrace.c @@ ...
Sep 24, 11:08 pm 2008
Tom Zanussi
[RFC PATCH 6/8] relay - Replace relay_reserve/relay_writ ...
Replace relay_reserve/relay_write with non-padded versions. The old versions of relay_reserve/relay_write would write/reserve an event only if the whole thing could fit in the remaining space of the current sub-buffer; if it couldn't it would add padding to the current sub-buffer and reserve in the next. The new versions don't add padding but use up all the space in a sub-buffer and write the remainder in the next sub-buffer. They won't however write a partial event - if there's not enough ...
Sep 24, 11:07 pm 2008
Tom Zanussi
[RFC PATCH 7/8] relay - Remove padding-related code from ...
Remove padding-related code from relay_read()/relay_splice_read() et al. Because we no longer write padding, we no longer have to read it or account for it anywhere else, greatly simplifying the related code. --- kernel/relay.c | 149 ++++++++------------------------------------------------ 1 files changed, 20 insertions(+), 129 deletions(-) diff --git a/kernel/relay.c b/kernel/relay.c index 15e4de2..21b3e19 100644 --- a/kernel/relay.c +++ b/kernel/relay.c @@ -966,72 +966,13 @@ static ...
Sep 24, 11:07 pm 2008
Tom Zanussi
[RFC PATCH 2/8] relay - Make the relay sub-buffer switch ...
Make the relay sub-buffer switch code replaceable. With this patch, tracers now have complete control over the relay write (or reserve) path if they choose to do so, by implementing their own version of the sub-buffer switch function (switch_subbuf()), in addition to their own local write/reserve functions. Tracers who choose not to do so automatically default to the normal behavior. --- include/linux/relay.h | 22 +++++++++++++++++----- kernel/relay.c | 13 ++++++++----- 2 files ...
Sep 24, 11:07 pm 2008
Tom Zanussi
[RFC PATCH 1/8] relay - Clean up relay_switch_subbuf() a ...
Clean up relay_switch_subbuf() and make waking up consumers optional. Over time, relay_switch_subbuf() has accumulated some cruft - this patch cleans it up and at the same time makes available some of it available as common functions that any subbuf-switch implementor would need (this is partially in preparation for the next patch, which makes the subbuf-switch function completely replaceable). It also removes the hard-coded reader wakeup and moves it into a replaceable callback called ...
Sep 24, 11:07 pm 2008
Benjamin Herrenschmidt
Re: PTE access rules & abstraction
Not necessarily .. depends how you factor out the interface to it. Anyway, not a big deal now. I'll do a patch to fix the hole on powerpc, and if my brain clicks, over the next few weeks, I'll see if I can come up with an overall nicer API covering all usages. In many case might just be a matter of giving a saner name to existing calls and documenting them properly tho :-) Cheers, Ben. --
Sep 25, 4:02 pm 2008
Jeremy Fitzhardinge
Re: PTE access rules & abstraction
I think we need to be a bit clearer about the terminology: A batch is a bunch of things that can be optionally deferred so they can be done in chunks. A transaction is something that must either complete successfully, or have no effect at all. Doing multiple things in a transaction means that they must all complete or none. (In general we assume there's nothing about these low-level pagetable operations which can fail, so we can ignore the failure part of transactions.) In this case, a ...
Sep 25, 11:15 am 2008
Jeremy Fitzhardinge
Re: PTE access rules & abstraction
Yeah, that would work too; that's pretty much how Xen implements it anyway. The main advantage of the start/commit pair is that the resulting code was completely unchanged from the old code. The mprotect sequence using ptep_modify_protection would end up reading the pte twice before writing it. J --
Sep 25, 3:27 pm 2008
Benjamin Herrenschmidt
Re: PTE access rules & abstraction
I still can't see the point of having now 3 functions instead of just one such as ptep_modify_protection(). I don't see what it buys you other than adding gratuituous new interfaces. Ben;. --
Sep 25, 2:44 pm 2008
Benjamin Herrenschmidt
Re: PTE access rules & abstraction
Yeah. Not that I don't quite understand what the point of the start/modify/commit thing the way it's currently used in mprotect since we are doing the whole transaction for a single PTE change, ie how does that help with hypervisors vs. a single ptep_modify_protection() for example is beyond me :-) When I think about transactions, I think about starting a transaction, changing a -bunch- of PTEs, then commiting... Essentially I see the PTE lock thing as being a ...
Sep 24, 6:04 pm 2008
Jeremy Fitzhardinge
Re: Populating multiple ptes at fault time
Perhaps this helps: Context switches between guest<->hypervisor are relatively expensive. The more work we can make each context switch perform the better, because we can amortize the cost. Rather than synchronously switching between the two every time one wants to express a state change to the other, we batch those changes up and only sync when its important. While there are batched outstanding changes in one, the other will have a somewhat out of date view of the state. At this level, ...
Sep 25, 11:32 am 2008
Greg KH
Re: [PATCH] PM: use pm_op methods for device types
I agree, I think we would have seen more bugs if mainline can't suspend with a USB device attached, right? confused, greg k-h --
Sep 25, 3:06 pm 2008
Alan Stern
[PATCH] PM: use pm_op methods for device types
This patch (as1141) adds code to use the device type's pm_op methods, if they are defined. It fixes a regression in the USB PM code; the various suspend and resume methods are defined in the device type rather than in the bus, because USB devices have to be handled differently from USB interfaces. Without the patch, those methods never get called. The patch also fixes a couple of spelling errors. Signed-off-by: Alan Stern <stern@rowland.harvard.edu> Tested-by: Jiri Slaby ...
Sep 25, 2:07 pm 2008
Jiri Slaby Sep 25, 1:51 pm 2008
Rafael J. Wysocki
Re: [PATCH] PM: use pm_op methods for device types
Hm, these changes are not needed in the current mainline, so there's a patch in -next that removes the code added by this patch. --
Sep 25, 2:27 pm 2008
Pascal Terjan
Re: Turning off camera also kills card reader on EeePC 900
On Mon, Sep 15, 2008 at 3:37 PM, Alan Jenkins The issue also exist on 701 on 2.6.27-rc7 so the regression is in the kernel not in the hardware Disabling camera : ehci_hcd 0000:00:1d.7: HC died; cleaning up hub 1-0:1.0: hub_port_status failed (err = -19) hub 1-0:1.0: connect-debounce failed, port 8 disabled usb 1-8: USB disconnect, address 3 usb 1-5: USB disconnect, address 2 Enabling it again : irq 23: nobody cared (try booting with the "irqpoll" option) Pid: 0, comm: swapper Not ...
Sep 25, 1:27 pm 2008
Alan Jenkins
Re: Turning off camera also kills card reader on EeePC 900
On the bright side, that means more people should have the hardware to test it on (including me :-). I've certainly used the camera enable/disable at some point. But I could have missed the error message, and frankly I don't use the camera or cardreader very often. I'll have a bash at it tomorrow. Sitsofe says it reproduced on kernel.org 2.6.21 (presumably with an out-of-tree eeepc module). So I guess this isn't a simple git-bisect job - more thinking required... Thanks Alan --
Sep 25, 2:21 pm 2008
Sebastien Dugue
Re: [PATCH HACK] powerpc: quick hack to get a functional ...
That's already the case as the irq fetch (xx_xirr_info_get()) and eoi (xx_xirr_info_set()) are both done in interrupt context, therefore on This is what is already done in the threaded case: - fetch + mask + eoi in interrupt context - unmask in the thread when processing is complete. Sebastien. --
Sep 25, 12:31 am 2008
Benjamin Herrenschmidt
Re: [PATCH HACK] powerpc: quick hack to get a functional ...
We probably need to make a special xics flow handler for -rt that does what Milton suggested, ie, bring down the CPU priority right away and only EOI later or something like that, instead of masking/unmasking. I don't know what are the other potential issues with the HEA though. Ben. --
Sep 25, 12:22 am 2008
Milton Miller
Re: [PATCH HACK] powerpc: quick hack to get a functional ...
Ben and I had a online chat, and he pointed out I needed to be more cpu takes interrupt, checks soft disabled if so, set hard disabled else call get_irq if threaded write cppr to restore this cpu irq dispatch state to non-interrupt mark irq thread as irq pending else handle interrupt eoi (cppr = base) irq thread will handle interrupt eoi wait for marked pending again The part Ben did not follow was that the cppr write to base priority is done by the ...
Sep 24, 8:56 pm 2008
Sebastien Dugue
Re: [PATCH HACK] powerpc: quick hack to get a functional ...
Do you mean creating a custom fasteoi handler into xics.c? Also, it's not clear to me from looking at the code how you go about changing the Don't know either, but that I can test. Thanks, Sebastien. --
Sep 25, 12:42 am 2008
Sebastien Dugue
Re: [PATCH HACK] powerpc: quick hack to get a functional ...
Thanks Ben, will look into this. Nite Sebastien. --
Sep 25, 1:39 am 2008
Sebastien Dugue
Re: [PATCH HACK] powerpc: quick hack to get a functional ...
Yep, except as it behaves in way that the current -rt fasteoi flow That's what I gathered from looking at the sources. Thanks, Sebastien. --
Sep 25, 12:18 am 2008
Benjamin Herrenschmidt
Re: [PATCH HACK] powerpc: quick hack to get a functional ...
Yup. I think the priority is the CPPR.. Milton can give you more details, if not, I'll pick it up tomorrow when at the office. Ben. --
Sep 25, 1:36 am 2008
Sebastien Dugue
Re: [PATCH HACK] powerpc: quick hack to get a functional ...
So do I, it was just to make sure I was not hit by another interrupt while handling the previous one and thus reduce the number of hypothesis. I sure do not say that it cannot happen, just that that path is not taken Whoops, my bad, in the non threaded case, there's no mask at all, only an Looks like it is, but I cannot say for sure, the only observable effect That may be, but I'm only looking at the code (read no specifications at hand) Now, that is quite interesting ...
Sep 25, 1:45 am 2008
Cedric Le Goater
Re: [RFC v5][PATCH 0/9] Kernel based checkpoint/restart
sure. the user tools are in CVS for the moment : http://lxc.cvs.sourceforge.net/lxc/ but Daniel will probably release something soon. the lxc-checkpoint and lxc-restart tools are a bit too optimistic on the availability of the checkpoint and restart syscalls, a cleanup might be needed. the patchset is here : http://legoater.free.fr/patches/2.6.27/2.6.27-rc7-lxc2/ it's based on top of current mainline and includes : . sysfs patches for net namespace . freezer subsystem . ...
Sep 25, 5:58 am 2008
Brice Goglin
Re: [PATCH] mm: make do_move_pages() complexity linear
Actually, this "pm+1" breaks the case where migrate_pages() calls unmap_and_move() multiple times on the same page. In this case, we need the while loop to look at pm instead of pm+1 first. So we can't cache pm+1 in private, but caching pm is ok. There will be 1 while loop instead of 0 in the regular case. Updated patch (with more comments) coming soon. Brice --
Sep 25, 5:58 am 2008
Jarod Wilson
Re: [PATCH 07/18] lirc driver for the CommandIR USB Tran ...
The kernel-space drivers for CommandIR devices are being dropped entirely, in favor of the new userspace driver for it, which is included in the lirc 0.8.4pre1 userspace. -- Jarod Wilson jarod@redhat.com --
Sep 25, 8:21 am 2008
Ric Wheeler
Re: [PATCH 3/3] Add timeout feature
I agree with Christoph here, I think that the timeout is unneeded. Regards, Ric --
Sep 25, 2:06 pm 2008
Michał Mirosław
RFC: Driver for CB710/720 memory card reader (MMC part) - v2
Hello, Thanks for your comments, Pierre - I have applied most of your suggestions and replied to some below. Changes in this version: - power management functions; - IRQ handler modifications: core driver uses RCU, MMC-specific handler uses spinlock only in the uncommon case of 'test' interrupt; - debugging printks converted to dev_dbg() and friends; - code split to files such that MMC specific parts are self-contained This should be almost ready. mmc-test passes up to a point ...
Sep 24, 11:29 pm 2008
David Howells
Re: Building Kernel with -O0
When 2.4 was the cutting edge kernel. I don't remember the dates or versions exactly. David --
Sep 25, 6:51 am 2008
Adrian Bunk
Re: Building Kernel with -O0
You can call me surprised. cu Adrian -- "Is there not promise of rain?" Ling Tan asked suddenly out of the darkness. There had been need of rain for many days. "Only a promise," Lao Er said. Pearl S. Buck - Dragon Seed --
Sep 25, 6:31 am 2008
Bjorn Helgaas
Re: [Bug 11603] Re: ACPI PnP on Intel MU440EX
> Aren't parport still probing it's own port if pnp failed ? Could be; I haven't looked. I just assumed parport didn't do the legacy probe because Martin reported that the port didn't work when PNPACPI was enabled. And I retract my "parport.nopnp" idea. A positive statement like "parport.legacy" or something along the lines of "look here" would be better. It's safe to use PNP to probe; it just won't find anything. All we want is something to tell parport to do the legacy probe ...
Sep 25, 12:53 pm 2008
matthieu castet
Re: [Bug 11603] Re: ACPI PnP on Intel MU440EX
Yes I was curious of the value before the mask : some of the mask and Aren't parport still probing it's own port if pnp failed ? Matthieu --
Sep 25, 11:52 am 2008
Konstantin Kletschke
Re: SATA Cold Boot problems on >2.6.25 with NV
Tejun, sorry for misspelling your name as Tijo :-/ Do I hit your spamfilter with my KSYMOOPS enabled debug outputs or some sort of that? If not, sorry for the inconvenience and take your time. Regards, Konsti -- GPG KeyID EF62FCEF Fingerprint: 13C9 B16B 9844 EC15 CC2E A080 1E69 3FDA EF62 FCEF --
Sep 25, 1:18 am 2008
Konstantin Kletschke
Re: SATA Cold Boot problems on >2.6.25 with NV
Replying to myself... Indeed my HArddisk was plugged into SATA2 and SATA1 was left empty, I moved it to SATA1 now. Now: Sep 25 08:35:29 zappa ata1: SATA max UDMA/133 cmd 0xf80 ctl 0xf00 bmdma 0xd800 irq 21 Sep 25 08:35:29 zappa ata2: SATA max UDMA/133 cmd 0xe80 ctl 0xe00 bmdma 0xd808 irq 21 Sep 25 08:35:29 zappa ata1: SATA link up 1.5 Gbps (SStatus 113 SControl 300) Sep 25 08:35:29 zappa ata1.00: ATA-7: SAMSUNG HD753LJ, 1AA01106, max UDMA7 Sep 25 08:35:29 zappa ata1.00: 1465149168 ...
Sep 25, 1:15 am 2008
Jeff Garzik Sep 24, 7:14 pm 2008
Rusty Russell
Re: [Bug #11342] Linux 2.6.27-rc3: kernel BUG at mm/vmal ...
Hi Linus, This turns out to be awful in practice, mainly due to const. Consider: #ifdef CONFIG_CPUMASK_OFFSTACK typedef unsigned long *cpumask_t; #else typedef unsigned long cpumask_t[1]; #endif cpumask_t returns_cpumask(void); That's obviously illegal if cpumask_t is an array. So we need a typedef which says "really always a pointer". typedef unsigned long *cpumask_return_t; cpumask_return_t returns_cpumask(void); But we usually want it to return a const ptr, ...
Sep 24, 6:50 pm 2008
Ingo Molnar
Re: [Bug #11342] Linux 2.6.27-rc3: kernel BUG at mm/vmal ...
since most of the important cpumasks in high-perf codepaths are already pre-constructed and embedded in some existing object (say task_struct), i think a variant of #3 is the best approach: - get rid of cpumask_t and use 'struct cpumask' everywhere. - do not expose normal kernel code to struct cpumask's definition, only declare the type via 'struct cpumask;' in sched.h - a'la kmem_cache_t. - even hide the structure from sched.h - use an extra indirection for struct ...
Sep 25, 1:55 am 2008
Mike Travis
Subject: [RFC 1/1] cpumask: Provide new cpumask API
Subject: [RFC 1/1] cpumask: Provide new cpumask API Provide new cpumask interface API. The relevant change is basically cpumask becomes an opaque object. I believe this results in the minimum amount of editing while still allowing the inline cpumask functions, and the ability to declare static cpumask objects. /* raw declaration */ struct __cpumask_data_s { DECLARE_BITMAP(bits, NR_CPUS); }; /* cpumask_map_t used for declaring static cpumask maps */ typedef struct ...
Sep 25, 1:59 pm 2008
Linus Torvalds
Re: [Bug #11342] Linux 2.6.27-rc3: kernel BUG at mm/vmal ...
No. That's already broken. You cannot return a cpumask_t, regardless of interface. We must not do it regardless of how we pass those things around, since it generates _yet_ another temporary on the stack for the return slot for any kind of structure. So all cpumask functions should always return pointers and/or take pointers to be filled in. That's true *regardless* of how we actually are to then allocate them. So forget returning cpumasks. It's irrelevant. What _is_ relevant is ...
Sep 25, 8:42 am 2008
Ingo Molnar
Re: [PATCH, RFC] v6 scalable classic RCU implementation
i'm wondering about those timekeeping bugs. Do you have an idea what's it about and does it affect mainline? Ingo --
Sep 25, 12:29 am 2008
Paul E. McKenney
Re: [PATCH, RFC] v6 scalable classic RCU implementation
Good point -- I guess I do need to implement the CLASSIC_RCU piece of this in any case. Thanx, Paul --
Sep 25, 7:05 am 2008
Ingo Molnar
Re: [PATCH, RFC] v6 scalable classic RCU implementation
could you please send this bit separately, even if rcutree isnt finished yet? Seems like a quite useful debug feature. Ingo --
Sep 25, 12:26 am 2008
Paul E. McKenney
Re: [PATCH, RFC] v6 scalable classic RCU implementation
Sad to say, they do affect mainline -- I can reproduce these problems in 2.6.27-rc7. Thomas has given me a couple of fixes that got rid of earlier problems in which jiffies would stop counting, and I believe has located the cause of at least one other problem. Thanx, Paul --
Sep 25, 7:18 am 2008
Miller, Mike (OS Dev) Sep 25, 1:56 pm 2008
Randy Dunlap
Re: in 2.6.23-rc3-git7 in do_cciss_intr
Mike, also notice this: it's always during driver init, as indicated by the (+) in the dump ('+' means that the module is in the process of being loaded, but module load has not completed): calling cciss_init+0x0/0x2e [cciss] HP CISS Driver (v 3.6.20) ACPI: PCI Interrupt Link [LNKA] enabled at IRQ 54 cciss 0000:42:08.0: PCI INT A -> Link[LNKA] -> GSI 54 (level, high) -> IRQ 54 cciss0: <0x3238> at PCI 0000:42:08.0 IRQ 503 using DAC BUG: unable to handle kernel NULL pointer dereference at ...
Sep 25, 1:40 pm 2008
Randy Dunlap
Re: in 2.6.23-rc3-git7 in do_cciss_intr
Yes, correct IMO. I checked my daily test logs and I have had this problem in do_cciss_intr() 3 times, all at the same location, which appears to be in removeQ(), as Jens says. -- ~Randy --
Sep 25, 1:33 pm 2008
Robert Richter
Re: [PATCH] oprofile: document undocumented oprofile.p4f ...
We should rework/remove this (maybe later), if it makes no longer sense to keep it. If we had have a force_cpu_type implementation this could be thrown away. But as long as it is in it's better to have it documented. -- Advanced Micro Devices, Inc. Operating System Research Center email: robert.richter@amd.com --
Sep 25, 1:10 pm 2008
Robert Richter
Re: [PATCH] oprofile: Don't report Nehalem as core_2
I will send this patch upstream together with the architectural perfmon implementation and when the userland part is upstream. -- Advanced Micro Devices, Inc. Operating System Research Center email: robert.richter@amd.com --
Sep 25, 1:05 pm 2008
Robert Richter
Re: [PATCH] oprofile: Implement Intel architectural perf ...
Could you create a separate patch that introduces this new kernel parameter? This would make it easier to send all other changes upstream. We already discussed the need of this parameter. Maybe it would fit better to have a more generalized paramater for this that could be reused then by other archs/models as well. Something like force_pmu_detection that could be used for all new CPUs (also other models) that do not yet have a specific kernel implementation. Even better would a sysfs entry ...
Sep 25, 1:00 pm 2008
Robert Richter
Re: oprofile: Implement Intel architectural perfmon
Andi, I have uploaded all pending OProfile patches to my kernel.org repository. As we already talked about this, there are changes in that implement model specific init/exit functions. Please change your patch in a way that it uses these functions. This will make your implementation cleaner. I will also send some more comments to the patches itself. It would help me if you could send the new patches relative to my tree. Thanks a lot, -Robert -- Advanced Micro Devices, ...
Sep 25, 12:32 pm 2008
Adrian Bunk
Re: [2.6 patch] m68k: remove the dead PCI code
Sorry for the late answer. cu Adrian -- "Is there not promise of rain?" Ling Tan asked suddenly out of the darkness. There had been need of rain for many days. "Only a promise," Lao Er said. Pearl S. Buck - Dragon Seed --
Sep 25, 6:31 am 2008
Rusty Russell
Re: linux-next: Tree for August 14 (sysfs/acpi errors)
There's a proper fix in linux-next for 2.6.28. I was unclear about whether this ACPI change was going into 2.6.27; since it's not, there's no real issue. Hope that helps, Rusty. --
Sep 24, 7:39 pm 2008
Fei Liu
Re: A question to Frank Kingswood about ch340/341 suppor ...
Hi Jan, thanks for the info. I didn't find this gem until today when I am cleaning up my emails... Cheers, Fei --
Sep 25, 1:53 pm 2008
Greg KH
Re: [PATCH 1/2] VMware guest detection for x86 and x86-64
Well, having a config option like this isn't the way to go as it will be forced on for all distros and users anyway. A simple cpuid test is the easier way to do this, that's what the userspace tools do, if it's really needed in the kernel. But hopefully, such things shouldn't be needed within the kernel as it's not Linux's fault that the hypervisor has bugs in it :) We wouldn't be wanting to work around bugs in Microsoft's hypervisor, would we? thanks, greg k-h --
Sep 24, 7:55 pm 2008
Yan Li
Re: [PATCH 1/2] VMware guest detection for x86 and x86-64
Hi Greg, For me this is used in the next patch (for mtrr/main.c) to suppress an unnecessary warning when running as a VMware guest: http://lkml.org/lkml/2008/9/24/144 We already have code to suppress warning under KVM so the above patch suppress warnings for VMware guest also. H. Peter Anvin and Alok kataria are also proposing we may need a more general approach for detecting hypervisors that can be used for some other quirks. Thanks. -- Li, Yan --
Sep 24, 7:47 pm 2008
David Sanders
Re: [PATCH 1/2] VMware guest detection for x86 and x86-64
From: i8042-x86ia64io.h { .ident = "Microsoft Virtual Machine", .matches = { DMI_MATCH(DMI_SYS_VENDOR, "Microsoft Corporation"), DMI_MATCH(DMI_PRODUCT_NAME, "Virtual Machine"), DMI_MATCH(DMI_PRODUCT_VERSION, "VS2005R2"), }, }, --
Sep 25, 2:56 am 2008
Yan Li
Re: [PATCH 1/2] VMware guest detection for x86 and x86-64
I think if Virtual PC has similar problems we should add codes to detect Virtual PC to be used by mtrr/main.c. A general interface might not be good for this specific problem (false MTRR blank warning) since we have no way to know all VMs has MTRR set to blank thus handling KVM, VMware and Virtual PC here should be enough for now. If you can tell me what manufacture vendor string is in Virtual PC I can make another similar patch using dmi_name_in_vendors(). So at first I'd like to see the ...
Sep 24, 5:32 pm 2008
Yan Li
Re: [PATCH 1/2] VMware guest detection for x86 and x86-64
I think it's a common practice for VM to blank the MTRRs rather than a bug. Many hypervisors (KVM, VMware, Virtual PC) are doing this since long before. Therefore I think issuing a warning here complaining My idea is that this should be included in all general purpose kernels or the vendors may have to cope with flood questions about boot time warnings when using under VMware/KVM/Virtual PC. It's configurable so good for vendors who wish to provide different kernels for using with A simple ...
Sep 24, 8:29 pm 2008
Yan Li Sep 25, 3:23 am 2008
H. Peter Anvin
Re: [PATCH 1/2] VMware detection support for x86 and x86-64
To be fair, SMM sometimes also play these kinds of games -- even though it is equally frowned upon there. However, it is the particular use of this for detection use that is utterly damning. Using random I/O port probes for hardware detect should have disappeared in the early 1990's, and it's really disturbing that virtualization vendors -- not just VMWare -- are, in effect, re-making all the mistakes hardware vendors did in the 1980's. Fortunately, we can usually use DMI to bail us ...
Sep 25, 2:59 pm 2008
Alok Kataria
Re: [PATCH 1/2] VMware detection support for x86 and x86-64
Hmm...what can a IN on an unknown port cause on native hardware, if a port is not being used it would return 0xFFFFFFFF in eax, and if you have a real device there (a sane one), what can IN result in apart from reading some IO register/counter value in eax ? If there is anything apart from the above 2 outcomes, please let me know exactly what you mean. Thanks, Alok --
Sep 24, 10:23 pm 2008
H. Peter Anvin
Re: [PATCH 1/2] VMware detection support for x86 and x86-64
You have no idea what you just did to a real piece of hardware. -hpa --
Sep 24, 9:38 pm 2008
David Sanders Sep 25, 3:17 pm 2008
Bernd Eckenfels
Re: [PATCH 1/2] VMware guest detection for x86 and x86-64
Having a high HZ slows down VMs and also leads to tick loss (time drift). HZ 100 or 250 is recommended by most Vendors. I think some distributions use the reduced HZ value by default anyway, so you might never had a problem with this. It is also only a problem with system load and timekeeping, which is not always obvious. When running a lot of nearly idle VM Guests you might see it. Gruss Bernd --
Sep 24, 6:28 pm 2008
H. Peter Anvin
Re: [PATCH 1/2] VMware detection support for x86 and x86-64
You accessed a bloody I/O port! If you think it's harmless because it was an IN, you're sorely mistaken. -hpa --
Sep 24, 9:54 pm 2008
Yan Li
Re: [PATCH 1/2] VMware guest detection for x86 and x86-64
Thanks, so we can begin with this. -- Li, Yan --
Sep 24, 7:48 pm 2008
Greg KH
Re: [PATCH 1/2] VMware guest detection for x86 and x86-64
Why do we need to do this within the kernel, what is that going to achieve? People can do this easily in userspace if they need to detect this, I think there's a patch for util-linux-ng adding such a simple utility that handles almost all of the known virtualization engines right now. thanks, greg k-h --
Sep 24, 7:23 pm 2008
Zachary Amsden
Re: [PATCH 1/2] VMware detection support for x86 and x86-64
It's not disturbing, it's expected. Re-using old broken solutions happens all the time, they can be perfectly valid in some contexts. The problem is that they tend to live on and evolve into a larger context where they break again. Surely we can do better, but how to do that isn't always clear-cut. DMI is a pretty good standard for this, but it still doesn't solve the problem in all contexts (userspace apps). Zach --
Sep 25, 3:20 pm 2008
H. Peter Anvin
Re: [PATCH 1/2] VMware detection support for x86 and x86-64
This, of course, is what CPUID is for. -hpa --
Sep 25, 3:27 pm 2008
Zachary Amsden
Re: [PATCH 1/2] VMware detection support for x86 and x86-64
Peter's right. The hardware device simply sees an I/O request to a port and a read / write. The actual internal implementation may not even see whether it is a R/W, and may do anything. Some of our virtual hardware is activated in strange ways, reads where you would expect writes, etc. For example, a device made by Exploder Technologies has two ports, 3686, and 3687. Read access to port 3686 returns the status register and any access at all to port 3687 moves the robot arm, activating ...
Sep 25, 1:48 pm 2008
Alok kataria
Re: [PATCH 1/2] VMware detection support for x86 and x86-64
Sorry for joining the discussion this late. But i only noticed this after somebody pointed me to it. Even if there is anything on that port on native hardware it would work perfectly well and is _safe_. First let me post the code to access this backdoor port (the way it should really be done ) ------------------------------------------------------------------------------- #define VMWARE_BDOOR_MAGIC 0x564D5868 #define VMWARE_BDOOR_PORT 0x5658 #define VMWARE_BDOOR_CMD_GETVERSION ...
Sep 24, 7:28 pm 2008
Yan Li
Re: [PATCH 1/2] VMware guest detection for x86 and x86-64
That sounds great but technically this centralized test can only be done after dmi_scan_machine(), so it can't help the detection code in mtrr_trim_uncached_memory() which is ran very early before dmi_scan_machine(). So I think my patch is still necessary unless we want to live with the warning message in all VMware guest. -- Li, Yan --
Sep 25, 7:38 am 2008
H. Peter Anvin
Re: [PATCH 1/2] VMware guest detection for x86 and x86-64
It's not a false warning. It's a true warning. -hpa --
Sep 24, 5:26 pm 2008
Yan Li
Re: [PATCH 1/2] VMware guest detection for x86 and x86-64
Oh I never heard about this. I've been using several VMware VMs (combined RHEL and SLES and Debian) but haven't seen such issue. What's the symptom? -- Li, Yan --
Sep 24, 5:23 pm 2008
H. Peter Anvin
Re: [PATCH 1/2] VMware detection support for x86 and x86-64
First, you are assuming all devices are "sane". This is obviously wrong -- you're poking in hyperspace, and you don't know if you're going to hit someone's ancient controller card that perhaps drives a medical accelerator for all you know. Second, you are assuming that devices you call "sane" don't have I/O ports with read side effects. Many, if not most, devices have some I/O ports with read side effects, especially read-clear semantics and/or queue drain operations. Third, in the ...
Sep 24, 10:30 pm 2008
Yan Li
Re: [PATCH 1/2] VMware guest detection for x86 and x86-64
Hi Alok, Thanks for your comments. Sure, it's good to add a common interface My motivation behind this patch is to serve the MTRR codes to fix a false warning, so I'd like to see it in 2.6.28 as soon as possible. The latest 2.6.27-rc7 is issuing false warning when running under the VMware Server 1.0.7, complaining that MTRR's all blank. Currently the false warning has been confirmed under both KVM and VMware so the detection for these two VMs are added in my [PATCH 2/2]. For this specific ...
Sep 24, 5:15 pm 2008
H. Peter Anvin
Re: [PATCH 1/2] VMware detection support for x86 and x86-64
You accessed an unknown I/O port. This means you caused an unknown action in an unknown peripheral device. This could cause ANYTHING to happen. -hpa --
Sep 24, 10:04 pm 2008
Alok Kataria
Re: [PATCH 1/2] VMware detection support for x86 and x86-64
Why ? what do you mean ? ebx is a local variable in the code above that i posted. Only when on hypervisor will we write the magic value over there. How can this affect native hardware, i fail to understand. Please explain. Thanks, --
Sep 24, 9:46 pm 2008
Greg KH
Re: [PATCH 1/2] VMware guest detection for x86 and x86-64
Ah, I was hoping they were all doing this, as it seems the most "sane" manner. Good luck :) greg k-h --
Sep 25, 5:56 am 2008
Alan Cox
Re: [PATCH 1/2] VMware detection support for x86 and x86-64
Changing the status of the device that is actually at the port, losing IRQs, hanging the bus solid, causing data corruption (eg if you probe the address of the data port of something like a disk doing a block transfer) Alan --
Sep 25, 1:45 am 2008
H. Peter Anvin
Re: [PATCH 1/2] VMware guest detection for x86 and x86-64
We pretty much have to, just as we have to work around bugs in, say, AMD's microcode. We have avoided it so far, but it's gotten to a breaking point, and rather than having ad hoc hacks scattered all over the place I want a centralized test site setting a single global variable. Unfortunately, hypervisor vendors haven't adopted a uniform detection scheme (CPUID level 0x40000000 is sometimes mentioned as a pseudo-standard, but it's not universal, and not all virtualization solutions ...
Sep 24, 9:54 pm 2008
Yan Li
Re: [PATCH 1/2] VMware guest detection for x86 and x86-64
I'd like to see it in 2.6.28 to fix the false warning here. I'll take several comments here and post a improved patch soon (changing code is fast but testing them on VMs here with different configurations are time-consuming). But I think people could just start to test this version of patch since further change will be mostly cosmetic, FWIW. -- Li, Yan --
Sep 24, 5:19 pm 2008
Yan Li
Re: [PATCH 1/2] VMware guest detection for x86 and x86-64
You can say that warning is justified because it's true that the MTRR's are all blank in VMware. But that warning is no good to VMware users since it's by-design all MTRR's are blank. For example, we've scripts watching the log for lines containing "warning" and "error" on all production servers. So this warning actually is "false" to VMware guest users. A KERN_INFO message should be enough here. We already have a checking to suppress warning on KVM so I think we should also suppress the ...
Sep 24, 7:34 pm 2008
H. Peter Anvin
Re: [PATCH 1/2] VMware guest detection for x86 and x86-64
Supposedly "Microsoft Corporation", "Virtual Machine". No idea what pre-MS versions of VPC return. -hpa --
Sep 24, 5:37 pm 2008
Alok Kataria
Re: [PATCH 1/2] VMware detection support for x86 and x86-64
Hi Peter, It would be really helpful if you could explain me when can this go wrong or what kinds of problems can this cause on native hardware. Thanks, --
Sep 24, 10:02 pm 2008
previous daytodaynext day
September 24, 2008September 25, 2008September 26, 2008