« August 2005 | Main | October 2005 »

September 2005 Archives

September 4, 2005

cold rage

Aster has been providing me with information and insight ever since I first learned of the hurricane. But this has been brought to my attention and &mdash well, dammit, it just defies description. You need to see this, because as Aster says, "... but then, I also believe people need to get angry. People need to know."

Watch the interview that says it all. Consider reading Aster's take on it as well.

(Thanks, Aster. You're truly a gem!)

September 7, 2005

more technolust

iPod nano picture

Go on. Admit it. You know you want one too.

So delish.

Of course, now I really can't wait for the iPod AV….

September 8, 2005

speaking of aesthetics…

Should the PEBL be my next phone?

September 12, 2005

vector processing fun

Ever get the feeling that there are factions inside of Apple which aren't happy about the upcoming move to Intel?

Specifically, check out some of these highlights from the performance tips on migrating from AltiVec to SSE:

Don't bother synthesizing constants on the fly like you did for AltiVec. Most of the time, you wont have register space to keep those constants in register. You also don't have vec_splat_*, so synthesizing constants takes a lot longer.

Just like AltiVec, denormal stalls can be very expensive. Unlike AltiVec, you are much more likely to encounter them.

Reduce or eliminate your need for the permute unit. It is not as strong as on AltiVec. You could find yourself spending all your CPU time solving permute problems rather than doing actual work.

While translating code from AltiVec to SSE, pay attention to the expense of each translation. Some AltiVec instructions translate directly to a single SSE equivalent, while another potentially very similar instruction may take a dozen SSE instructions to do.

AltiVec is a rich ISA. This gives you a lot of freedom. There are frequently three ways to do anything, one of which is highly unintuitive but delivers a miracle in two instructions. SSE is smaller.

SSE involves destructive instructions most of the time. If you can phrase your algorithm in terms of destructive logic, you can probably save some unnecessary copies, and possibly some register spillage. (This will probably preclude software pipelining. However, software pipelining may not be necessary because the Intel processors are highly out-of-order.)

Heed cautions and tips in the Intel Processor Optimization Reference Manual.

Yeah, I'm really thrilled about this, too. Sounds to me like a well-tweaked AltiVec-enabled app on the G5 will still blow the doors off of a tweaked SSE2 app on a P5. Don't even get me started on their marginalization of SSE3:

SSE3 adds a small series of instructions mostly geared to making complex floating point arithmetic work better in some data layouts. However, since it is possible to get the same or better performance by repacking data as uniform vectors rather than non-uniform vectors ahead of time, it is not expected that most developers will need to rely on this feature.

Hmm. Perhaps in 2006-2007, it will be time to hoard dual-processor Xserve cluster nodes, especially for those of us who have cause to work in bioinformatics or other HPC environs?

September 19, 2005

well spoken on Katrina

I apologize for getting a tad political here, but Jason Lefkowitz has written a savagely inspired piece on the nature of leadership regarding hurricane Katrina. Definitely worth a read.

About September 2005

This page contains all entries posted to a blab by idle in September 2005. They are listed from oldest to newest.

August 2005 is the previous archive.

October 2005 is the next archive.

Many more can be found on the main index page or by looking through the archives.

Creative Commons License
This weblog is licensed under a Creative Commons License.
Powered by
Movable Type 3.34