SSE2: add a new Iop_Not128 primop (128-bit bitwise NOT) and generate at least tolerable SSE code for it.

git-svn-id: svn://svn.valgrind.org/vex/trunk@648