login
Header Space

 
 

linux-netdev mailing list

FromSubjectsort iconDate
Ayaz Abdulla
[PATCH] forcedeth: new backoff implementation
This patch adds support for a new backoff algorithm for half duplex supported in newer hardware. The old method is will be designated as legacy mode. Re-seeding random values for the backoff algorithms are performed when a transmit has failed due to a maximum retry count (1 to 15, where max is considered the wraparound case of 0). Signed-off-by: Ayaz Abdulla <aabdulla@nvidia.com>
Apr 7, 5:40 pm 2008
Paul Moore
[RFC PATCH 0/2] Labeled networking tweaks for 2.6.26
Pretty small pile of patches (are two patches even a pile) this time around, but they should make things a little faster. The first patch addresses a gripe Casey had about an extra allocation/copy for the NetLabel secattr and the second tries to address the Labeled IPsec stack ugliness pointed out by Jamal and Dave. I'm posting these as RFC patches for right now because I haven't had a chance to really beat on these changes, however, with 2.6.26 getting close I wanted to get these out so people co...
Apr 7, 7:16 pm 2008
Paul Moore
[RFC PATCH 2/2] LSM: Make the Labeled IPsec hooks more stack...
The xfrm_get_policy() and xfrm_add_pol_expire() put some rather large structs on the stack to work around the LSM API. This patch attempts to fix that problem by changing the LSM API to require only the relevant "security" pointers instead of the entire SPD entry; we do this for all of the security_xfrm_policy*() functions to keep things consistent. Signed-off-by: Paul Moore <paul.moore@hp.com> --- include/linux/security.h | 48 ++++++++++++++++++++------------------- net/xfrm/x...
Apr 7, 7:16 pm 2008
Paul Moore
[RFC PATCH 1/2] NetLabel: Allow passing the LSM domain as a ...
Smack doesn't have the need to create a private copy of the LSM "domain" when setting NetLabel security attributes like SELinux, however, the current NetLabel code requires a private copy of the LSM "domain". This patches fixes that by letting the LSM determine how it wants to pass the domain value. * NETLBL_SECATTR_DOMAIN_CPY The current behavior, NetLabel assumes that the domain value is a copy and frees it when done * NETLBL_SECATTR_DOMAIN New, Smack-friendly behavior, NetLabel as...
Apr 7, 7:16 pm 2008
Juliusz Chroboczek
[PATCH] Stochastic Fair Blue queue discipline
Hello, The attached is an implementation of the Stochastic Fair Blue queue discipline for Linux. I would like to request a review of the following code, and whether it is worthwile to add the Kconfig bits and submit it for inclusion into the mainline kernel. You will find a ready to build tarball on http://www.pps.jussieu.fr/~jch/software/files/sch_sfb-20080407.tar.gz http://www.pps.jussieu.fr/~jch/software/files/sch_sfb-20080407.tar.gz.asc and a short description on [ message continues ]
" title="http://www.p...">http://www.p...
Apr 7, 6:37 pm 2008
Andrew Brampton
Re: sock_get_timestamp() ktime_to_timeval returns -2?
Thanks Eric, I didn't expect someone to identify and fix this bug so quickly!. I have tested your patch and my problems have gone away. thanks Andrew --
Apr 7, 5:57 pm 2008
Anthony Liguori
[PATCH] virtio_net: remove overzealous printk
The 'disable_cb' is really just a hint and as such, it's possible for more work to get queued up while callbacks are disabled. Under stress with an SMP guest, this printk triggers very frequently. There is no race here, this is how things are designed to work so let's just remove the printk. Signed-off-by: Anthony Liguori <aliguori@us.ibm.com> Acked-by: Rusty Russell <rusty@rustcorp.com.au> diff --git a/drivers/net/virtio_net.c b/drivers/net/virtio_net.c index b58472c..d1a200f 10064...
Apr 7, 4:33 pm 2008
Ben Hutchings
[PATCH] Fix kernel-doc for skb_segment
The kernel-doc comment for skb_segment is clearly wrong. This states what it actually does. Signed-off-by: Ben Hutchings <bhutchings@solarflare.com> diff --git a/net/core/skbuff.c b/net/core/skbuff.c index 0d0fd28..6087013 100644 --- a/net/core/skbuff.c +++ b/net/core/skbuff.c @@ -2131,8 +2131,8 @@ EXPORT_SYMBOL_GPL(skb_pull_rcsum); * @features: features for the output path (see dev->features) * * This function performs segmentation on the given skb. It returns - * the segment ...
Apr 7, 2:28 pm 2008
Pavel Emelyanov
[PATCH net-2.6.26 2/2] Do not allocate unneeded memory for d...
The alloc_netdev_mq() tries to produce 32-bytes alignment for both the net_device itself and its private data. The second alignment is achieved by adding the NETDEV_ALIGN_CONST to the whole size of the memory to be allocated. However, for those devices that do not need the private area, this addition just makes the net_device weight 1024 + 32 = 1068 bytes, i.e. consume twice as much memory. Since loopback device is such (sizeof_priv == 0 for it), and each net namespace creates one, this can save...
Apr 7, 12:31 pm 2008
Jarek Poplawski
Re: [PATCH net-2.6.26 2/2] Do not allocate unneeded memory f...
Pavel Emelyanov wrote, On 04/07/2008 06:31 PM: IMHO this second "+ NETDEV_ALIGN_CONST" is needed here because of "~NETDEV_ALIGN_CONST". Regards, Jarek P. --
Apr 7, 1:44 pm 2008
Pavel Emelyanov
[PATCH net-2.6.26 1/2] Shrink size of net_device by filling ...
I've found a much easier way to shrink the net_device structure rather that moving all the operations out of it. However, since the net_device may grow further, moving the operations into a separate place may look reasonable. The pahole tool showed, that there are a 124 and 80 bytes holes before the queue_lock and the _xmit_lock respectively. Moving most of the devices callbacks into the 2nd hole makes the sizeof of the structure be 1024 bytes. The hard_start_xmit callback is not moved to keep ...
Apr 7, 12:25 pm 2008
Eric Dumazet
Re: [PATCH net-2.6.26 1/2] Shrink size of net_device by fill...
On 32 bits platform and CONFIG_X86_L1_CACHE_SHIFT=7 I presume :) Could you check if x86_64 machines with X86_L1_CACHE_SHIFT = 7 or 8 dont suffer from this patch ? At first glance I would say it seems OK, but this net_device is really --
Apr 7, 1:04 pm 2008
Eric Dumazet
Re: [PATCH net-2.6.26 1/2] Shrink size of net_device by fill...
I meant : X86_L1_CACHE_SHIFT = 6 (MK8 | MCORE2) or 7 (others) --
Apr 7, 1:17 pm 2008
Arnaldo Carvalho de Melo
Re: [PATCH net-2.6.26 1/2] Shrink size of net_device by fill...
Info _before_ Pavel's patch, not that big holes. [acme@doppio linux-2.6]$ getconf LEVEL1_DCACHE_LINESIZE 64 model name : Intel(R) Core(TM)2 CPU T7200 @ 2.00GHz [acme@doppio linux-2.6]$ grep X86_L1_CACHE_SHIFT /boot/config-`uname -r` CONFIG_X86_L1_CACHE_SHIFT=6 [acme@doppio linux-2.6]$ pahole -C net_device ../build/linux-2.6/doppio/net/core/dev.o struct net_device { char name[16]; /* 0 16 */ struct hlist_node name_hlist; ...
Apr 7, 1:47 pm 2008
Gerrit Renker
[INET]: sk_reuse is valbool
I found this while checking how DCCP handles duplicate bind operations; it seems it is a remainder of earlier code and now defunct. -----------------------------------> Patch <--------------------------------------- [INET]: sk_reuse is valbool sk_reuse is declared as "unsigned char", but is set as type valbool in net/core/sock.c. There is no other place in net/ where sk->sk_reuse is set to a value > 1, so the test "sk_reuse > 1" can not be true. Signed-off-by: Gerrit Renker <...
Apr 7, 11:01 am 2008
Dmitry Butskoy
[PATCH] for reopen bug #8747 -- complete the fix
There was a bug #8747, http://bugzilla.kernel.org/show_bug.cgi?id=8747, "MSG_ERRQUEUE messages do not pass to connected raw sockets", which still is not fixed, due to absence of proper tests. I've made an additional patch (attached), against the 2.6.24.4 tree. The patch fixes the issue completely for me. Tested. Sorry that I have not checked up the previous attempt in real tests. I assumed that the "typo" which I had found at that time (saddr/daddr swapped) is just "a typo", and things "ob...
Apr 7, 8:11 am 2008
Ilpo Järvinen
[PATCHv2 net-2.6 0/4 (was 0/3)]: TCP fixes
Hi Dave, Ok, here's the v2 series which tries hard to avoid opening loopholes to if condition in tcp_mark_head_lost which again would make the added tcp_fragment code to not execute :-). The rest (patches 3-4 are identical to previous series 2-3). ...I repost the whole series to make it easy for you if you were fast enough to deleted them from your inbox already, others please bear me :-). --- v1 intro follows --- Here are some TCP fixes that resulted from the last weeks reports & debug. ...
Apr 7, 7:55 am 2008
Ilpo Järvinen Apr 7, 5:59 pm 2008
Ilpo Järvinen
[PATCHv2 net-2.6 1/4] [TCP]: Restore 2.6.24 mark_head_lost b...
The fast retransmission can be forced locally to the rfc3517 branch in tcp_update_scoreboard instead of making such fragile constructs deeper in tcp_mark_head_lost. This is necessary for the next patch which must not have loopholes for cnt > packets check. As one can notice, readability got some improvements too because of this :-). Signed-off-by: Ilpo Järvinen <ilpo.jarvinen@helsinki.fi> --- net/ipv4/tcp_input.c | 16 ++++++++-------- 1 files changed, 8 insertions(+), 8 deletions(...
Apr 7, 7:55 am 2008
Ilpo Järvinen
[PATCHv2 net-2.6 2/4] [TCP]: Fix NewReno's fast rexmit/recov...
Fixes a long-standing bug which makes NewReno recovery crippled. With GSO the whole head skb was marked as LOST which is in violation of NewReno procedure that only wants to mark one packet and ended up breaking our TCP code by causing counter overflow because our code was built on top of assumption about valid NewReno procedure. This manifested as triggering a WARN_ON for the overflow in a number of places. It seems relatively safe alternative to just do nothing if tcp_fragment fails due to oom b...
Apr 7, 7:55 am 2008
Ilpo Järvinen
[PATCHv2 net-2.6 3/4] [TCP]: tcp_simple_retransmit can cause...
This fixes Bugzilla #10384 tcp_simple_retransmit does L increment without any checking whatsoever for overflowing S+L when Reno is in use. The simplest scenario I can currently think of is rather complex in practice (there might be some more straightforward cases though). Ie., if mss is reduced during mtu probing, it may end up marking everything lost and if some duplicate ACKs arrived prior to that sacked_out will be non-zero as well, leading to S+L > packets_out, tcp_clean_rtx_queue on the ...
Apr 7, 7:55 am 2008
Ilpo Järvinen
[PATCHv2 net-2.6 4/4] [TCP]: Don't allow FRTO to take place ...
MTU probe can cause some remedies for FRTO because the normal packet ordering may be violated allowing FRTO to make a wrong decision (it might not be that serious threat for anything though). Thus it's safer to not run FRTO while MTU probe is underway. It seems that the basic FRTO variant should also look for an skb at probe_seq.start to check if that's retransmitted one but I didn't implement it now (plain seqno in window check isn't robust against wraparounds). Signed-off-by: Ilpo Järvinen &...
Apr 7, 7:55 am 2008
Ilpo Järvinen
[PATCH net-2.6 0/3]: TCP fixes
Hi Dave, Here are some TCP fixes that resulted from the last weeks reports & debug. The first one is quite likely to hit but it's considerably harder to get it print an overflow warning, while I've seen two reports about the second one (one of them is for 2.6.24.y), please ignore the earlier version of the tcp_simple_retransmit patch. The FRTO patch is once again result of code review rather than some report but seems necessary to avoid detection of some not that likely cases as spuriou...
Apr 7, 7:25 am 2008
Ilpo Järvinen
[PATCH net-2.6 1/3] [TCP]: Fix NewReno's fast rexmit/recover...
Fixes a long-standing bug which makes NewReno recovery crippled. With GSO the whole head skb was marked as LOST which is in violation of NewReno procedure that only wants to mark one packet and ended up breaking our TCP code by causing counter overflow because our code was built on top of assumption about valid NewReno procedure. This manifested as triggering a WARN_ON for the overflow in a number of places. It seems relatively safe alternative to just do nothing if tcp_fragment fails due to oom b...
Apr 7, 7:25 am 2008
Ilpo Järvinen Apr 7, 7:33 am 2008
Ilpo Järvinen
[PATCH net-2.6 2/3] [TCP]: tcp_simple_retransmit can cause S+L
This fixes Bugzilla #10384 tcp_simple_retransmit does L increment without any checking whatsoever for overflowing S+L when Reno is in use. The simplest scenario I can currently think of is rather complex in practice (there might be some more straightforward cases though). Ie., if mss is reduced during mtu probing, it may end up marking everything lost and if some duplicate ACKs arrived prior to that sacked_out will be non-zero as well, leading to S+L > packets_out, tcp_clean_rtx_queue on the ...
Apr 7, 7:25 am 2008
Ilpo Järvinen
[PATCH net-2.6 3/3] [TCP]: Don't allow FRTO to take place wh...
MTU probe can cause some remedies for FRTO because the normal packet ordering may be violated allowing FRTO to make a wrong decision (it might not be that serious threat for anything though). Thus it's safer to not run FRTO while MTU probe is underway. It seems that the basic FRTO variant should also look for an skb at probe_seq.start to check if that's retransmitted one but I didn't implement it now (plain seqno in window check isn't robust against wraparounds). Signed-off-by: Ilpo Järvinen &...
Apr 7, 7:25 am 2008
Kasper Sandberg
Re: Realtek 8111c weirdness problems, apic/msi, and normal bug
Hello. I apologize, but it will have to wait a few days, as the fan on the graphics card just broke, and it overheats almost instantly. I will post as soon as possible, sorry for the inconvenience. --
Apr 7, 12:09 am 2008
Jonathan Corbet
Re: [PATCH RFC 1/5] vringfd syscall
I'm *sure* you meant to document that somewhat non-trivial proposed new kernel API as soon as you got a moment. But, since I went to the trouble of reverse engineering it, I decided not to wait and to make a little unreliable guide of my own. For those who are curious about how vringfd() is meant to work (or about how badly I misunderstood it), the info is at: http://lwn.net/SubscriberLink/276856/8c927025c53ad7ce/ Hopefully it will be helpful, jon --
Apr 7, 1:54 pm 2008
Rusty Russell
Re: [PATCH RFC 1/5] vringfd syscall
Actually, yes. But I wanted to get it out there before I start the treck across to the virtualization summit. A few points: 'The page alignment for the used array is important - that array might be mapped separately into kernel space.' Well, the used array is written by one side only, so it's possible to split the ring here and make each part r/o to the other side. More importantly, a page boundary is almost certainly a cacheline boundary, and we already have a userspace interface for ...
Apr 7, 6:34 pm 2008
Herbert Xu
Re: [PATCH RFC 4/5] tun: vringfd xmit support.
On second thought, this is not going to work. The network stack can clone individual pages out of this skb and put it into a new skb. Therefore whatever scheme we come up with will either need to be page-based, or add a flag to tell the network stack that it can't clone those pages. Cheers, -- Visit Openswan at http://www.openswan.org/ Email: Herbert Xu ~{PmV>HI~} <herbert@gondor.apana.org.au> Home Page: http://gondor.apana.org.au/~herbert/ PGP Key: [ message continues ]
" title="http://gondor.apana.org.au/~herb...">http://gondor.apana.org.au/~herb...
Apr 7, 1:13 am 2008
Rusty Russell
Re: [PATCH RFC 4/5] tun: vringfd xmit support.
Erk... I'll put in the latter for now. A page-level solution is not really an option: if userspace hands us mmaped pages for example. Thanks, Rusty. --
Apr 7, 3:24 am 2008
David Miller
Re: [PATCH RFC 4/5] tun: vringfd xmit support.
From: Rusty Russell <rusty@rustcorp.com.au> Keep in mind that the core of the TCP stack really depends upon being able to slice and dice paged SKBs as is pleases in order to send packets out. In fact, it also does such splitting during SACK processing. It really is a base requirement for efficient TSO support. Otherwise the above operations would be so incredibly expensive we might as well rip all of the TSO support out. --
Apr 7, 3:35 am 2008
Paul Moore
Re: [TOMOYO #7 30/30] Hooks for SAKURA and TOMOYO.
Hello, I understand your frustration with the existing LSM hooks/API and your reasoning for abandoning LSM in favor of a new set of hooks, however, I think this sets a dangerous precedence which could result in an abundance of security related hooks scattered throughout the kernel. I would much rather see the LSM API extended/tweaked to support the needs of SAKURA and TOMOYO than ignored and duplicated; I suspect several others will say the same. You have made good progress with TOMOYO ...
Apr 7, 11:40 am 2008
Casey Schaufler Apr 7, 6:57 pm 2008
Tetsuo Handa
Re: [TOMOYO #7 30/30] Hooks for SAKURA and TOMOYO.
Filesystem part to fsdevel, socket operation part to netdev. That's OK. But I wonder where to post the rest part. Since the patches for passing "struct vfsmount" to LSM has been rejected, TOMOYO has been unable to use LSM. Thus, I proposed this patch set as a non-LSM version, but it bothers me I see. Thank you. --
Apr 7, 9:56 am 2008
Daniel Walker
Re: [TOMOYO #7 30/30] Hooks for SAKURA and TOMOYO.
Not sure .. You might look through the MAINTAINERS file and see if anyone maintains the sections of code your changing. Usually LKML and Andrew are pretty good bets especially when your not sure who the patches should go to. Daniel --
Apr 7, 11:39 am 2008
Wenji Wu
RE: A Linux TCP SACK Question
https://plone3.fnal.gov/P0/WAN/Members/wenji/tcp_dump_files_sack/ Two tcpdump files: one with SACK on, the other with SACK off. The test configures described in my previous emails. Best, wenji --
Apr 7, 10:56 am 2008
Meelis Roos
Re: [patch] NET: remove support for Davicom 9102 from the Tu...
There was one patch (for getting the MAC from OpenFirmware) but as people reported that the Tx timeouts still happen, I didn't try it. Any reason to try? -- Meelis Roos (mroos@linux.ee) --
Apr 7, 10:28 am 2008
Andy Johnson
Re: [GIT PULL net-2.6.26] [IPV6] MROUTE: Support multicast r...
Hello, I had compilation errors when trying to build pim6sd soleley , and it seems to me that the code should be changed for it to compile. I found another PIM-sm open source project, which **does** build cleanly on my Linux box: It is mrd6: http://fivebits.net/proj/mrd6/ Perhaps you have any ideas: Should it work with multicast routing with this kernel patch ? are there specific requirements which mcast-tools fulfiils (and maybe mrd6 does not) ? Regards, AJ On Thu, Apr 3, 2008 at 10:31 AM, YOSHI...
Apr 7, 9:13 am 2008
YOSHIFUJI Hideaki /
Re: [GIT PULL net-2.6.26] [IPV6] MROUTE: Support multicast r...
It SHOULD compile and work... MRD6 seems independent from this kernel patch so far. The kernel patch is regarding in-kernel multicast forwarding. --yoshfuji --
Apr 7, 10:37 am 2008
Eric Dumazet
Re: [RFC] fib_trie: memory waste solutions
Hum... I prefer my patch Stephen, as your version enlarges every tnode with an embedded "struct work_struct" which can be larger than a "struct rcu_head" And as your 2nd patch doesnt use vmalloc() at all, we only can gain one order for the max bits in root node (19 instead of 18) : We will hit the 'bug' again in a couple of months, or if router memory is somehow fragmented. --
Apr 7, 12:46 pm 2008
Stephen Hemminger
Re: [RFC] fib_trie: memory waste solutions
On Mon, 07 Apr 2008 18:46:29 +0200 The number of tnode's is small and the size growth of 2*(unsigned long) is not worth worrying about. Also theoretically, my version could have multiple work elements processed at once. --
Apr 7, 6:48 pm 2008
Robert Olsson
[RFC] fib_trie: memory waste solutions
Stephen Hemminger writes: > Eric wisely pointed out that for larger sizes of nodes, the > current allocation in fib_trie wastes lots of memory. For a sample > of routes extracted from the bugzilla bug the largest node grows > to 2M bytes on 64 bit system. This leads to 2044K of wasted memory. > > There are two possible solutions (see attached). One uses vmalloc() > rather than alloc_pages, but has to add complexity on freeing. > The other adds a layer of indirecti...
Apr 7, 2:55 am 2008
Andi Kleen
Re: [RFC] fib_trie: memory waste solutions
In some cases it might even go faster because a lot of x86 CPUs have far more 4K TLBs than 2M TLBs. vmalloc is just quite expensive in setup/free time, but that shouldn't be a big issue here. The memory savings are impressive. -Andi --
Apr 7, 3:58 am 2008
Robert Olsson
Re: [RFC] fib_trie: memory waste solutions
Andi Kleen writes: > > Do we get slower with vmalloc due to TLB-lookups etc? Guess this > > should be investigated. > > In some cases it might even go faster because a lot of x86 CPUs > have far more 4K TLBs than 2M TLBs. vmalloc is just quite expensive > in setup/free time, but that shouldn't be a big issue here. I've did some rDoS testing and the lookup performance is the same or slightly better. So it should be fine. Cheers --ro ...
Apr 7, 10:42 am 2008
Andi Kleen
Re: [RFC] fib_trie: memory waste solutions
If you want more realistic worst case numbers run something in user space in the background that thrashes the TLBs constantly and then see how the numbers change. The main advantage of using large pages is that they tend to be separated from 4K TLBs and since most user space doesn't use large pages it gives the kernel an effective private TLB pool. With vmalloc it will now compete with whatever other TLB pigs are active. -Andi --
Apr 7, 11:15 am 2008
Eric Dumazet
Re: [RFC] fib_trie: memory waste solutions
Yes, but with vmalloc(), NUMA machines have some chance to distribute this big area on several nodes (only if the process currently expanding the fib_trie root node has an appropriate numa_policy.... ah well :) :) ) --
Apr 7, 11:36 am 2008
Jarek Poplawski
Re: [Bugme-new] [Bug 10375] New: IPSec tunnel kernel panic
Probably this new Thomas Graf's patch to esp_input() should help with this problem: http://git.kernel.org/?p=linux/kernel/git/torvalds/linux-2.6.git;a=commit;h=920fc941a9... Regards, Jarek P. --
Apr 7, 2:37 am 2008
Wang Chen
Re: [2.6 PATCH] IPV6: Check length of optval provided by use...
Check length of setsockopt's optval, which provided by user, before copy it from user space. For POSIX compliant, return -EINVAL for setsockopt of short lengths. Signed-off-by: Wang Chen <wangchen@cn.fujitsu.com> --- net/ipv6/ipv6_sockglue.c | 10 ++++++++-- 1 files changed, 8 insertions(+), 2 deletions(-) diff --git a/net/ipv6/ipv6_sockglue.c b/net/ipv6/ipv6_sockglue.c index bf2a686..a5ac121 100644 --- a/net/ipv6/ipv6_sockglue.c +++ b/net/ipv6/ipv6_sockglue.c @@ -557,6 +557,9 @@ d...
Apr 6, 9:42 pm 2008
previous daytodaynext day
April 6, 2008April 7, 2008April 8, 2008
speck-geostationary