David responding to pj:Sorry ... my response was ambiguous, so not surprisingly you read it the other way than I intended it and missed my point. Let me try this again. Yes, if we drop that MPOL_FLAG_SHIFT, we can't tell if exactly the low eight bits are valid, -however- we can still tell if the low bits not used by our MPOL_F_* flag bits are valid, and that's sufficient. I included a PATCH in my last reply, in order to demonstrate this. No. That is not an implementation detail that must be explicitly defined in order to separate the mode from the flags. One can separate out the high order flag bits by simply masking them off. I agree that we must return -EINVAL if the user specifies an invalid flag. I have always agreed with this. ====== Bingo!! That's what I've been saying all along. I'm not quite sure how we got here; this seems to me such a sharp turn from what went before that I fear I've misread you, but I suppose I should just be glad we're in agreement, despite the strange journey. You go on to suggest a couple of variations of this check, by first masking off the high order flag bits: if ((mode & MPOL_MODE_FLAGS) >= MPOL_MAX) return -EINVAL; and then, in a follow-on message, refining that line of thought with: unsigned short flags; flags = mode & MPOL_MODE_FLAGS; mode &= ~MPOL_MODE_FLAGS; if (mode < 0 || mode >= MPOL_MAX) return -EINVAL; Actually, I don't think you need to add either variation, as I think you -already- have that check, in both sys_mbind() and sys_set_mempolicy(), as the check: if (mpol_mode(mode) >= MPOL_MAX) return -EINVAL; With one minor tweak (changing the return type of mpol_mode() from uchar to ushort), here again is the PATCH of my last reply, with this check present (actually, it's a check of yours, unchanged from what has been in your patch for days now). Does this PATCH do what you have in mind? --- include/linux/mempolicy.h | 30 +++++++++++++++++------------- mm/mempolicy.c | 6 ------ 2 files changed, 17 insertions(+), 19 deletions(-) --- 2.6.24-mm1.orig/include/linux/mempolicy.h 2008-02-15 00:11:10.000000000 -0800 +++ 2.6.24-mm1/include/linux/mempolicy.h 2008-02-15 15:16:16.031031424 -0800 @@ -8,6 +8,14 @@ * Copyright 2003,2004 Andi Kleen SuSE Labs */ +/* + * The 'policy' field of 'struct mempolicy' has both a mode and + * some flags packed into it. The flags (MPOL_F_* below) occupy + * the high bit positions (MPOL_MODE_FLAGS), and the mempolicy + * modes (the "Policies" below) are encoded in the remaining low + * bit positions. + */ + /* Policies */ enum { MPOL_DEFAULT, @@ -18,16 +26,12 @@ enum { }; /* - * The lower MPOL_FLAG_SHIFT bits of the policy mode represent the MPOL_* - * constants defined in the above enum. The upper bits represent optional - * set_mempolicy() or mbind() MPOL_F_* mode flags. + * Optional flags that modify nodemask numbering. */ -#define MPOL_FLAG_SHIFT (8) -#define MPOL_MODE_MASK ((1 << MPOL_FLAG_SHIFT) - 1) - -/* Flags for set_mempolicy */ -#define MPOL_F_STATIC_NODES (1 << MPOL_FLAG_SHIFT) -#define MPOL_MODE_FLAGS (MPOL_F_STATIC_NODES) /* legal set_mempolicy() MPOL_F_* flags */ +#define MPOL_F_RELATIVE_NODES (1<<14) /* remapped relative to cpuset */ +#define MPOL_F_STATIC_NODES (1<<15) /* unremapped physical masks */ +#define MPOL_MODE_FLAGS (MPOL_F_RELATIVE_NODES|MPOL_F_STATIC_NODES) + /* combined MPOL_F_* mask flags */ /* Flags for get_mempolicy */ #define MPOL_F_NODE (1<<0) /* return next IL mode instead of node mask */ @@ -128,14 +132,14 @@ static inline int mpol_equal(struct memp #define mpol_set_vma_default(vma) ((vma)->vm_policy = NULL) -static inline unsigned char mpol_mode(unsigned short mode) +static inline unsigned short mpol_mode(unsigned short mode) { - return mode & MPOL_MODE_MASK; + return mode & ~MPOL_MODE_FLAGS; } static inline unsigned short mpol_flags(unsigned short mode) { - return mode & ~MPOL_MODE_MASK; + return mode & MPOL_MODE_FLAGS; } /* @@ -201,7 +205,7 @@ static inline int mpol_equal(struct memp #define mpol_set_vma_default(vma) do {} while(0) -static inline unsigned char mpol_mode(unsigned short mode) +static inline unsigned short mpol_mode(unsigned short mode) { return 0; } --- 2.6.24-mm1.orig/mm/mempolicy.c 2008-02-15 00:18:35.000000000 -0800 +++ 2.6.24-mm1/mm/mempolicy.c 2008-02-15 08:16:52.034431591 -0800 @@ -884,8 +884,6 @@ asmlinkage long sys_mbind(unsigned long if (mpol_mode(mode) >= MPOL_MAX) return -EINVAL; - if (mpol_flags(mode) & ~MPOL_MODE_FLAGS) - return -EINVAL; err = get_nodes(&nodes, nmask, maxnode); if (err) return err; @@ -898,13 +896,9 @@ asmlinkage long sys_set_mempolicy(int mo { int err; nodemask_t nodes; - unsigned short flags; if (mpol_mode(mode) >= MPOL_MAX) return -EINVAL; - if (mpol_flags(mode) & ~MPOL_MODE_FLAGS) - return -EINVAL; - flags = mpol_flags(mode) & MPOL_MODE_FLAGS; err = get_nodes(&nodes, nmask, maxnode); if (err) return err; -- I won't rest till it's the best ... Programmer, Linux Scalability Paul Jackson <pj@sgi.com> 1.940.382.4214 --
| Andrew Morton | -mm merge plans for 2.6.23 |
| Benjamin Herrenschmidt | Re: [PATCH] Remove process freezer from suspend to RAM pathway |
| Greg KH | [GIT PATCH] driver core patches against 2.6.24 |
| Mel Gorman | [PATCH 6/8] x86_64 - Specify amount of kernel memory at boot time |
git: | |
| Jarek Poplawski | [PATCH] pkt_sched: Destroy gen estimators under rtnl_lock(). |
| David Miller | [GIT]: Networking |
| Gerrit Renker | [PATCH 15/37] dccp: Set per-connection CCIDs via socket options |
| Jarek Poplawski | Re: Soft-Lockup/Race in networking in 2.6.31-rc1+195 ( possibly?caused by netem) |
