Non Uniform Memory Architecture is kind of a dry subject for most. Where a bit of code or data sits in processor cache is of little importance to the average user, unless you become interested in things like virtualization (which seeks to maximize the utilization of that cache and slams face first into the issue ) or you're interested in performance testing, in which case squeezing every extra productive cycle out of the hardware becomes a pressing concern. I happen to be interested in both, so...