Commit message blocked due to size, forwarding trimmed version: Author: eliot Date: 2011-07-18 17:35:51 -0700 (Mon, 18 Jul 2011) New Revision: 2463 Added: branches/Cog/macbuild/CoreMTVM.xcodeproj/ branches/Cog/macbuild/CoreMTVM.xcodeproj/project.pbxproj branches/Cog/src/vm/cointerpmt.c branches/Cog/src/vm/cointerpmt.h branches/Cog/src/vm/gcc3x-cointerpmt.c Modified: branches/Cog/macbuild/makevm branches/Cog/macbuild/mvm branches/Cog/nscogsrc/plugins/AsynchFilePlugin/AsynchFilePlugin.c branches/Cog/nscogsrc/plugins/B2DPlugin/B2DPlugin.c branches/Cog/nscogsrc/plugins/BMPReadWriterPlugin/BMPReadWriterPlugin.c branches/Cog/nscogsrc/plugins/BitBltPlugin/BitBltPlugin.c branches/Cog/nscogsrc/plugins/DSAPrims/DSAPrims.c branches/Cog/nscogsrc/plugins/DropPlugin/DropPlugin.c branches/Cog/nscogsrc/plugins/FileCopyPlugin/FileCopyPlugin.c branches/Cog/nscogsrc/plugins/FilePlugin/FilePlugin.c branches/Cog/nscogsrc/plugins/FloatArrayPlugin/FloatArrayPlugin.c branches/Cog/nscogsrc/plugins/FloatMathPlugin/FloatMathPlugin.c branches/Cog/nscogsrc/plugins/JPEGReadWriter2Plugin/JPEGReadWriter2Plugin.c branches/Cog/nscogsrc/plugins/JPEGReaderPlugin/JPEGReaderPlugin.c branches/Cog/nscogsrc/plugins/LargeIntegers/LargeIntegers.c branches/Cog/nscogsrc/plugins/Matrix2x3Plugin/Matrix2x3Plugin.c branches/Cog/nscogsrc/plugins/MiscPrimitivePlugin/MiscPrimitivePlugin.c branches/Cog/nscogsrc/plugins/RePlugin/RePlugin.c branches/Cog/nscogsrc/plugins/SecurityPlugin/SecurityPlugin.c branches/Cog/nscogsrc/plugins/SocketPlugin/SocketPlugin.c branches/Cog/nscogsrc/plugins/SoundPlugin/SoundPlugin.c branches/Cog/nscogsrc/plugins/UUIDPlugin/UUIDPlugin.c branches/Cog/nscogsrc/plugins/VMProfileMacSupportPlugin/VMProfileMacSupportPlugin.c branches/Cog/nscogsrc/plugins/Win32OSProcessPlugin/Win32OSProcessPlugin.c branches/Cog/nscogsrc/plugins/ZipPlugin/ZipPlugin.c branches/Cog/nscogsrc/vm/cointerp.c branches/Cog/nscogsrc/vm/cointerp.h branches/Cog/nscogsrc/vm/gcc3x-cointerp.c branches/Cog/nscogsrc/vm/interp.h branches/Cog/nscogsrc/vm/vmCallback.h branches/Cog/platforms/win32/vm/sqWin32Window.c branches/Cog/scripts/findUnofficialFiles branches/Cog/src/vm/cointerp.c branches/Cog/src/vm/cointerp.h branches/Cog/src/vm/gcc3x-cointerp.c branches/Cog/src/vm/interp.h branches/Cog/src/vm/vmCallback.h branches/Cog/stacksrc/vm/gcc3x-interp.c branches/Cog/stacksrc/vm/interp.c branches/Cog/stacksrc/vm/interp.h branches/Cog/stacksrc/vm/vmCallback.h Log: CogVM source as per VMMaker.oscog-eem.105. Fix signed32BitValueOf for most neg- ative value C compiler mis-optimization. Speed up primitiveFail using ! trick. Add multi-threaded sources to tree (won't build yet due to issue in ia32abicc.c) Upgrade nscogsrc/plugins to official versions. Added: branches/Cog/macbuild/CoreMTVM.xcodeproj/project.pbxproj =================================================================== --- branches/Cog/macbuild/CoreMTVM.xcodeproj/project.pbxproj (rev 0) +++ branches/Cog/macbuild/CoreMTVM.xcodeproj/project.pbxproj 2011-07-19 <snip> |
On Thu, Jul 21, 2011 at 07:08:18AM -0400, David T. Lewis wrote: > > Commit message blocked due to size, forwarding trimmed version: > > Author: eliot > Date: 2011-07-18 17:35:51 -0700 (Mon, 18 Jul 2011) > New Revision: 2463 <snip> > Log: > CogVM source as per VMMaker.oscog-eem.105. Fix signed32BitValueOf for most neg- > ative value C compiler mis-optimization. Speed up primitiveFail using ! trick. > Add multi-threaded sources to tree (won't build yet due to issue in ia32abicc.c) > Upgrade nscogsrc/plugins to official versions. Hi Eliot, Can you say what the issue was with signed32BitValueOf? I can see the changes in InterpreterPrimitives>>signed32BitValueOf: but I'm not clear on whether this is something that affects Alien, or if it is something that has been causing problems more generally but went unnoticed. Also, I'd like to document this with a unit test, so if you can suggest a code snippet that would be great. TIA, Dave |
If it would help here the change (in #signed32BitValueOf:) Before: value < 0 ifTrue: [self assert: (self sizeof: value) == 4. (negative and: [self cCode: [value - 1 > 0] inSmalltalk: [value = -16r80000000]]) ifTrue: "Don't fail for -2147483648 /-16r80000000" [^value]. After: value < 0 ifTrue: [self assert: (self sizeof: value) == 4. "Don't fail for -16r80000000/-2147483648 Alas the simple (negative and: [value - 1 > 0]) fails with contemporary gcc and icc versions with optimization and sometimes without. The shift works on all, touch wood." (negative and: [0 = (self cCode: [value << 1] inSmalltalk: [value << 1 bitAnd: (1 << 32) - 1])]) ifTrue: [^value]. On 21 July 2011 13:30, David T. Lewis <[hidden email]> wrote: > > On Thu, Jul 21, 2011 at 07:08:18AM -0400, David T. Lewis wrote: >> >> Commit message blocked due to size, forwarding trimmed version: >> >> Author: eliot >> Date: 2011-07-18 17:35:51 -0700 (Mon, 18 Jul 2011) >> New Revision: 2463 > > <snip> > >> Log: >> CogVM source as per VMMaker.oscog-eem.105. Fix signed32BitValueOf for most neg- >> ative value C compiler mis-optimization. Speed up primitiveFail using ! trick. >> Add multi-threaded sources to tree (won't build yet due to issue in ia32abicc.c) >> Upgrade nscogsrc/plugins to official versions. > > Hi Eliot, > > Can you say what the issue was with signed32BitValueOf? I can > see the changes in InterpreterPrimitives>>signed32BitValueOf: > but I'm not clear on whether this is something that affects > Alien, or if it is something that has been causing problems > more generally but went unnoticed. Also, I'd like to document > this with a unit test, so if you can suggest a code snippet > that would be great. > > TIA, > Dave > > -- Best regards, Igor Stasenko AKA sig. |
In reply to this post by David T. Lewis
On Thu, Jul 21, 2011 at 4:30 AM, David T. Lewis <[hidden email]> wrote:
The unit test is in the lien tests and is the attempt to assign max neg int (-2^31) through signedLongAt:put:. The problem is due to a pervasive C compiler bug with optimization. Recall that max neg int is exceptional in that it is the only value in a 2's complement representation that can't be negated since max neg int = -max pos int - 1. Here's the bug, and it occurred in signed64BitValueOf: and signedMachineIntegerValueOf: as well; I slipped in not fixing signed32BitValueOf:
<var: #value type: #int> ... (negative
and: [self cCode: [value - 1 > 0] inSmalltalk: [value = -16r80000000]]) ifTrue: "Don't fail for -2147483648 /-16r80000000"
Since value is int (32 bits on relevant systems) then if value = 16r80000000, value - 1 will wrap around to 16r7fffffff, bit all other negative values will remain negative. Hence value - 1 > 0 /should/ cheaply identify max neg int. But both gcc in many versions, and icc, the intel compiler, under optimization assume that value - 1 > 0 is always false, and hence cause the primitive to fail for max neg int.
Instead, the solution I've adopted is to use a different sequence using shifts that is not mis-optimized, at least by gcc and icc: (negative and: [0 = (self cCode: [value << 1]
inSmalltalk: [value << 1 bitAnd: (1 << 32) - 1])]) ifTrue: i.e. 16r8000000 is the only negative value (0 is positive) that when shifted left within a 32-bit type equals zero since the sign bit is shifted out.
And to those C compilers, grrr.... ;)
-- best, Eliot |
Thanks a lot Eliot, and Igor also! Dave On Thu, Jul 21, 2011 at 10:35:00AM -0700, Eliot Miranda wrote: > > On Thu, Jul 21, 2011 at 4:30 AM, David T. Lewis <[hidden email]> wrote: > > > > > On Thu, Jul 21, 2011 at 07:08:18AM -0400, David T. Lewis wrote: > > > > > > Commit message blocked due to size, forwarding trimmed version: > > > > > > Author: eliot > > > Date: 2011-07-18 17:35:51 -0700 (Mon, 18 Jul 2011) > > > New Revision: 2463 > > > > <snip> > > > > > Log: > > > CogVM source as per VMMaker.oscog-eem.105. Fix signed32BitValueOf for > > most neg- > > > ative value C compiler mis-optimization. Speed up primitiveFail using ! > > trick. > > > Add multi-threaded sources to tree (won't build yet due to issue in > > ia32abicc.c) > > > Upgrade nscogsrc/plugins to official versions. > > > > Hi Eliot, > > > > Can you say what the issue was with signed32BitValueOf? I can > > see the changes in InterpreterPrimitives>>signed32BitValueOf: > > but I'm not clear on whether this is something that affects > > Alien, or if it is something that has been causing problems > > more generally but went unnoticed. Also, I'd like to document > > this with a unit test, so if you can suggest a code snippet > > that would be great. > > > > The unit test is in the lien tests and is the attempt to assign max neg int > (-2^31) through signedLongAt:put:. The problem is due to a pervasive C > compiler bug with optimization. Recall that max neg int is exceptional in > that it is the only value in a 2's complement representation that can't be > negated since max neg int = -max pos int - 1. Here's the bug, and it > occurred in signed64BitValueOf: and signedMachineIntegerValueOf: as well; I > slipped in not fixing signed32BitValueOf: > > <var: #value type: #int> > ... > (negative > and: [self > cCode: [value - 1 > 0] > inSmalltalk: [value = -16r80000000]]) ifTrue: "Don't fail for -2147483648 > /-16r80000000" > > Since value is int (32 bits on relevant systems) then if value = > 16r80000000, value - 1 will wrap around to 16r7fffffff, bit all other > negative values will remain negative. Hence value - 1 > 0 /should/ cheaply > identify max neg int. But both gcc in many versions, and icc, the intel > compiler, under optimization assume that value - 1 > 0 is always false, and > hence cause the primitive to fail for max neg int. > > Instead, the solution I've adopted is to use a different sequence using > shifts that is not mis-optimized, at least by gcc and icc: > > (negative and: [0 = (self cCode: [value << 1] > inSmalltalk: [value << 1 bitAnd: (1 << 32) - 1])]) ifTrue: > > i.e. 16r8000000 is the only negative value (0 is positive) that when shifted > left within a 32-bit type equals zero since the sign bit is shifted out. > > And to those C compilers, grrr.... ;) > > > > TIA, > > Dave > > > > > > > -- > best, > Eliot |
In reply to this post by Eliot Miranda-2
> > Hi Eliot, > > Can you say what the issue was with signed32BitValueOf? I can > see the changes in InterpreterPrimitives>>signed32BitValueOf: > but I'm not clear on whether this is something that affects > Alien, or if it is something that has been causing problems > more generally but went unnoticed. Also, I'd like to document > this with a unit test, so if you can suggest a code snippet > that would be great. > > The unit test is in the lien tests and is the attempt to assign max neg int (-2^31) through signedLongAt:put:. The problem is due to a pervasive C compiler bug with optimization. Recall that max neg int is exceptional in that it is the only value in a 2's complement representation that can't be negated since max neg int = -max pos int - 1. Here's the bug, and it occurred in signed64BitValueOf: and signedMachineIntegerValueOf: as well; I slipped in not fixing signed32BitValueOf: > > <var: #value type: #int> > ... > (negative > and: [self > cCode: [value - 1 > 0] > inSmalltalk: [value = -16r80000000]]) ifTrue: "Don't fail for -2147483648 /-16r80000000" > > Since value is int (32 bits on relevant systems) then if value = 16r80000000, value - 1 will wrap around to 16r7fffffff, bit all other negative values will remain negative. Hence value - 1 > 0 /should/ cheaply identify max neg int. But both gcc in many versions, and icc, the intel compiler, under optimization assume that value - 1 > 0 is always false, and hence cause the primitive to fail for max neg int. > > Instead, the solution I've adopted is to use a different sequence using shifts that is not mis-optimized, at least by gcc and icc: > > (negative and: [0 = (self cCode: [value << 1] > inSmalltalk: [value << 1 bitAnd: (1 << 32) - 1])]) ifTrue: > > i.e. 16r8000000 is the only negative value (0 is positive) that when shifted left within a 32-bit type equals zero since the sign bit is shifted out. > > And to those C compilers, grrr.... ;) indeed! |
In reply to this post by Eliot Miranda-2
Hi Eliot: On 21/07/11 19:35, Eliot Miranda wrote: > > On Thu, Jul 21, 2011 at 4:30 AM, David T. Lewis <[hidden email] > <mailto:[hidden email]>> wrote: > > > > Hi Eliot, > > Can you say what the issue was with signed32BitValueOf? I can > see the changes in InterpreterPrimitives>>signed32BitValueOf: > but I'm not clear on whether this is something that affects > Alien, or if it is something that has been causing problems > more generally but went unnoticed. Also, I'd like to document > this with a unit test, so if you can suggest a code snippet > that would be great. > > > The unit test is in the lien tests and is the attempt to assign max > neg int (-2^31) through signedLongAt:put:. The problem is due to a > pervasive C compiler bug with optimization. http://blog.llvm.org/2011/05/what-every-c-programmer-should-know.html (*Signed integer overflow:)* but from that I would conclude that it is not a compiler bug, but undefined behavior in C instead. Thus, GCC and ICC are just doing there job in terms of what they are allowed to do by the spec. Best regards Stefan |
On Thu, Jul 21, 2011 at 3:17 PM, Stefan Marr <[hidden email]> wrote:
Thanks, Stefan. I hadn't known this (or had forgotten): for (i = 0; i <= N; ++i) { ... } In this loop, the compiler can assume that the loop will iterate exactly N+1 times if "i" is undefined on overflow, which allows a broad range of loop optimizations to kick in. On the other hand, if the variable is defined to wrap around on overflow, then the compiler must assume that the loop is possibly infinite (which happens if N is INT_MAX) - which then disables these important loop optimizations. This particularly affects 64-bit platforms since so much code uses "int" as induction variables.
Makes evil sense :)
-- best, Eliot |
On 22 July 2011 01:24, Eliot Miranda <[hidden email]> wrote: > > > > On Thu, Jul 21, 2011 at 3:17 PM, Stefan Marr <[hidden email]> wrote: >> >> Hi Eliot: >> >> On 21/07/11 19:35, Eliot Miranda wrote: >>> >>> On Thu, Jul 21, 2011 at 4:30 AM, David T. Lewis <[hidden email] <mailto:[hidden email]>> wrote: >>> >>> >>> >>> Hi Eliot, >>> >>> Can you say what the issue was with signed32BitValueOf? I can >>> see the changes in InterpreterPrimitives>>signed32BitValueOf: >>> but I'm not clear on whether this is something that affects >>> Alien, or if it is something that has been causing problems >>> more generally but went unnoticed. Also, I'd like to document >>> this with a unit test, so if you can suggest a code snippet >>> that would be great. >>> >>> >>> The unit test is in the lien tests and is the attempt to assign max neg int (-2^31) through signedLongAt:put:. The problem is due to a pervasive C compiler bug with optimization. >> >> Maybe I misread this blog post: http://blog.llvm.org/2011/05/what-every-c-programmer-should-know.html (*Signed integer overflow:)* >> but from that I would conclude that it is not a compiler bug, but undefined behavior in C instead. Thus, GCC and ICC are just doing there job in terms of what they are allowed to do by the spec. > > Thanks, Stefan. I hadn't known this (or had forgotten): > Signed integer overflow: If arithmetic on an 'int' type (for example) overflows, the result is undefined. One example is that "INT_MAX+1" is not guaranteed to be INT_MIN. This behavior enables certain classes of optimizations that are important for some code. For example, knowing that INT_MAX+1 is undefined allows optimizing "X+1 > X" to "true". Knowing the multiplication "cannot" overflow (because doing so would be undefined) allows optimizing "X*2/2" to "X". While these may seem trivial, these sorts of things are commonly exposed by inlining and macro expansion. A more important optimization that this allows is for "<=" loops like this: > > for (i = 0; i <= N; ++i) { ... } > > In this loop, the compiler can assume that the loop will iterate exactly N+1 times if "i" is undefined on overflow, which allows a broad range of loop optimizations to kick in. On the other hand, if the variable is defined to wrap around on overflow, then the compiler must assume that the loop is possibly infinite (which happens if N is INT_MAX) - which then disables these important loop optimizations. This particularly affects 64-bit platforms since so much code uses "int" as induction variables. > Makes evil sense :) > signed/unsigned/different bit size.. Yeah.. a huge field for bugs and frustration. >> >> Best regards >> Stefan >> > > > > -- > best, > Eliot > > -- Best regards, Igor Stasenko AKA sig. |
In reply to this post by Eliot Miranda-2
On Thu, Jul 21, 2011 at 03:24:08PM -0700, Eliot Miranda wrote: > > On Thu, Jul 21, 2011 at 3:17 PM, Stefan Marr <[hidden email]> wrote: > > > > > Hi Eliot: > > > > On 21/07/11 19:35, Eliot Miranda wrote: > > > > > >> On Thu, Jul 21, 2011 at 4:30 AM, David T. Lewis <[hidden email]<mailto: > >> [hidden email]>> wrote: > >> > >> > >> > >> Hi Eliot, > >> > >> Can you say what the issue was with signed32BitValueOf? I can > >> see the changes in InterpreterPrimitives>>**signed32BitValueOf: > >> but I'm not clear on whether this is something that affects > >> Alien, or if it is something that has been causing problems > >> more generally but went unnoticed. Also, I'd like to document > >> this with a unit test, so if you can suggest a code snippet > >> that would be great. > >> > >> > >> The unit test is in the lien tests and is the attempt to assign max neg > >> int (-2^31) through signedLongAt:put:. The problem is due to a pervasive C > >> compiler bug with optimization. > >> > > Maybe I misread this blog post: http://blog.llvm.org/2011/05/** > > what-every-c-programmer-**should-know.html<http://blog.llvm.org/2011/05/what-every-c-programmer-should-know.html>(*Signed integer overflow:)* > > but from that I would conclude that it is not a compiler bug, but undefined > > behavior in C instead. Thus, GCC and ICC are just doing there job in terms > > of what they are allowed to do by the spec. > > > > Thanks, Stefan. I hadn't known this (or had forgotten): > > *Signed integer overflow:* If arithmetic on an 'int' type (for example) > overflows, the result is undefined. One example is that "INT_MAX+1" is not > guaranteed to be INT_MIN. This behavior enables certain classes of > optimizations that are important for some code. For example, knowing that > INT_MAX+1 is undefined allows optimizing "X+1 > X" to "true". Knowing the > multiplication "cannot" overflow (because doing so would be undefined) > allows optimizing "X*2/2" to "X". While these may seem trivial, these sorts > of things are commonly exposed by inlining and macro expansion. A more > important optimization that this allows is for "<=" loops like this: > > for (i = 0; i <= N; ++i) { ... } > > > In this loop, the compiler can assume that the loop will iterate exactly N+1 > times if "i" is undefined on overflow, which allows a broad range of loop > optimizations to kick in. On the other hand, if the variable is defined to > wrap around on overflow, then the compiler must assume that the loop is > possibly infinite (which happens if N is INT_MAX) - which then disables > these important loop optimizations. This particularly affects 64-bit > platforms since so much code uses "int" as induction variables. > > Makes evil sense :) The issue does not exist in VMMaker trunk, which uses a simpler and presumably faster implementation. There is no need for the "value - 1" logic, hence no need for a workaround in the special case of the max negative value. The relevant methods are also tested on 32/64 host and image combinations. Here is the logic from #signed32BitValueOf: with the special case handling removed entirely: "Fail if value exceeds range of a 32-bit two's-complement signed integer." negative ifTrue:[value := 0 - value. value >= 0 ifTrue: [^ self primitiveFail]] ifFalse:[value < 0 ifTrue:[^ self primitiveFail]]. I did not run the alien tests, so maybe I'm missing something. In any case, background is at <http://bugs.squeak.org/view.php?id=6987>. HTH, Dave |
Hi David, On Mon, Jul 25, 2011 at 6:27 AM, David T. Lewis <[hidden email]> wrote:
Alas, no, it still exists. The expression 0 - value is still undefined for overflow (i.e. undefined for max neg int) and hence, depending on compiler, the following statement value >= 0 may be assumed to be always true. This is the case for the intel compiler that we used at Qwaq/Teleplace, and is the reason I made the change in the first place. So yes, one does need the shift expression to be able to rely on different compilers generating correct code in this edge case.
There is no need for the "value - 1" -- best, Eliot |
On Mon, Jul 25, 2011 at 02:01:31PM -0700, Eliot Miranda wrote: > > Hi David, > > > > > The issue does not exist in VMMaker trunk, which uses a simpler and > > presumably faster implementation. > > Alas, no, it still exists. The expression 0 - value is still undefined for > overflow (i.e. undefined for max neg int) and hence, depending on compiler, > the following statement value >= 0 may be assumed to be always true. This > is the case for the intel compiler that we used at Qwaq/Teleplace, and is > the reason I made the change in the first place. So yes, one does need the > shift expression to be able to rely on different compilers generating > correct code in this edge case. Thanks Eliot. Dave |
In reply to this post by Eliot Miranda-2
On Mon, Jul 25, 2011 at 02:01:31PM -0700, Eliot Miranda wrote: > > On Mon, Jul 25, 2011 at 6:27 AM, David T. Lewis <[hidden email]> wrote: > > > > The issue does not exist in VMMaker trunk, which uses a simpler and > > presumably faster implementation. > > Alas, no, it still exists. The expression 0 - value is still undefined for > overflow (i.e. undefined for max neg int) and hence, depending on compiler, > the following statement value >= 0 may be assumed to be always true. This > is the case for the intel compiler that we used at Qwaq/Teleplace, and is > the reason I made the change in the first place. So yes, one does need the > shift expression to be able to rely on different compilers generating > correct code in this edge case. On further thought, according to the blog that Stefan referenced: "It is worth noting that unsigned overflow is guaranteed to be defined as 2's complement (wrapping) overflow, so you can always use them." <http://blog.llvm.org/2011/05/what-every-c-programmer-should-know.html> If this is the case, then coercing to unsigned for the subtraction should prevent the problem of undefined results for value -16r80000000, thus: "Fail if value exceeds range of a 32-bit twos-complement signed integer." negative ifTrue:["perform subtraction using unsigned int to prevent undefined result for optimizing C compilers in the case of value = -16r80000000" value := 0 - (self cCoerce: value to: 'unsigned int'). value >= 0 ifTrue: [^ self primitiveFail]] ifFalse:[value < 0 ifTrue:[^ self primitiveFail]]. I do not have access to a compiler that exhibits the problem, so I cannot verify. But I would expect this to work on all compilers, while avoiding an extra check for value -16r80000000. Dave |
Free forum by Nabble | Edit this page |