
Post a Comment On: C0DE517E

"Embrace your bottleneck"

3 Comments
Anonymous said...

So true.

But nothing hurts more than the load-hit-store (LHS). That's got to be the dumbest thing about the in-order PPC cores.

March 5, 2009 at 7:01 PM

DEADC0DE said...

Yep, indeed load-hit-stores are one of the worst performance bottlenecks when working on a PPC, second only to cache misses. More often than not it's not possible to use all the computational power of the PPC, because the pipelines, especially the vector one, are very long. But the power is there, and as I wrote, once you have done all the possible optimizations around the implementation of a given algorithm, it's nice to fill the holes in the pipeline with extra features...

March 6, 2009 at 12:02 AM
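
For readers unfamiliar with the term: a load-hit-store happens when a load reads an address that a recent store has not finished writing yet, so the load has to wait for the store. Below is a minimal C++ sketch of one common source of it, accumulating through a pointer inside a loop. The names `Stats`, `sum_through_memory` and `sum_in_register` are hypothetical and not from the post, and whether a given compiler really emits a store followed by a dependent load here depends on its alias analysis and optimization settings.

```cpp
#include <cstddef>

// Hypothetical accumulator type, for illustration only.
struct Stats {
    float total;
};

// Accumulates directly through the pointer. If the compiler cannot prove
// that `values` never points at `stats->total`, it may store the running
// total to memory and load it back on every iteration, which is the
// store-then-dependent-load pattern behind a load-hit-store.
void sum_through_memory(Stats* stats, const float* values, std::size_t count) {
    stats->total = 0.0f;
    for (std::size_t i = 0; i < count; ++i) {
        stats->total += values[i];
    }
}

// Accumulates in a local variable, which can stay in a register for the
// whole loop. Memory is written once, after the loop.
void sum_in_register(Stats* stats, const float* values, std::size_t count) {
    float total = 0.0f;
    for (std::size_t i = 0; i < count; ++i) {
        total += values[i];
    }
    stats->total = total;
}
```

Both functions produce the same result; the second just makes it explicit that the running total never needs to round-trip through memory.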

Anonymous said...

I certainly agree with you there, it sure is nice to stuff extra "free" work into various loops. I have done this on a number of occasions. This works well on both CPUs and GPUs. Yay!

I mention the PPC LHS because it seems to get in the way of everything, and of course eliminating an LHS sometimes means keeping values in the same register set, and therefore sometimes in the longest pipelines.

Balancing this is tricky, and it's been bugging me a lot recently :-)

March 6, 2009 at 10:24 AM
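
As a rough illustration of the "free" work idea from the comments, here is a minimal C++ sketch of a single pass that computes three results instead of one. It assumes the extra results are independent of each other, so their instructions can fill slots that would otherwise sit idle while a long-latency operation completes; whether the extra work is actually free depends on the target and the compiler. `Summary` and `summarize` are hypothetical names, not from the post.

```cpp
#include <cstddef>

// Hypothetical result type, for illustration only.
struct Summary {
    float sum;
    float sum_sq;
    float max;
};

// One pass over the data with three independent dependency chains.
// All running values are locals, so they can stay in floating-point
// registers and nothing is written to memory until the end.
Summary summarize(const float* values, std::size_t count) {
    float sum = 0.0f;
    float sum_sq = 0.0f;
    float max_value = (count > 0) ? values[0] : 0.0f;
    for (std::size_t i = 0; i < count; ++i) {
        const float v = values[i];
        sum += v;                           // chain 1: the original work
        sum_sq += v * v;                    // chain 2: extra work
        max_value = (v > max_value) ? v : max_value;  // chain 3: extra work
    }
    return Summary{sum, sum_sq, max_value};
}
```

Everything stays in one register set here, which avoids the store-and-reload traffic but, as noted above, also keeps the work in whichever pipeline serves those registers.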
