Hi all,
Today I had the chance to play for a few hours with the quad Opteron I will use in Mainz next month (thanks, Ray!). I ran several positions (positional, tactical, and endgame) with 1, 2, 3, and 4 threads and compared the results. To my great surprise, the number of nodes needed to complete a given depth usually decreases as I increase the number of threads. This is, of course, exactly the opposite of what one would expect, since a parallel search normally wastes some effort on nodes that a sequential search would have pruned.
Unfortunately, I doubt that this result is caused by some revolutionary breakthrough in parallel search on my part. My parallel search is nothing more than an unoptimised, quick-and-dirty implementation of YBWC (Young Brothers Wait Concept), without any unusual tricks. I therefore suspect that some nasty bug or inefficiency is waiting to be discovered in my search: something that hurts a lot with a single thread, but less so with more threads.
Does anybody have any idea what the problem could be? Awful move ordering, perhaps?
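One way to check the move ordering suspicion would be to count how often a beta cutoff happens on the first move searched. The sketch below is not my engine's code, just a toy negamax alpha-beta in Python on a random tree. All names in it (OrderingStats, make_tree, exact_value, alphabeta, run) are made up for illustration, and the "best first" ordering cheats by using the exact subtree value, which a real engine obviously does not have.

```python
import random

INF = 10**9


class OrderingStats:
    """Hypothetical counters for judging move ordering quality."""

    def __init__(self):
        self.nodes = 0
        self.cutoffs = 0
        self.first_move_cutoffs = 0

    def first_move_rate(self):
        # Fraction of beta cutoffs that were produced by the first move tried.
        return self.first_move_cutoffs / self.cutoffs if self.cutoffs else 0.0


def make_tree(depth, branching, rng):
    # A leaf is an int (score for the side to move); an inner node is a list.
    if depth == 0:
        return rng.randint(-100, 100)
    return [make_tree(depth - 1, branching, rng) for _ in range(branching)]


def exact_value(node):
    # Full negamax without pruning; used only as an ordering oracle.
    if isinstance(node, int):
        return node
    return max(-exact_value(child) for child in node)


def alphabeta(node, alpha, beta, stats, order_moves):
    stats.nodes += 1
    if isinstance(node, int):
        return node
    best = -INF
    for index, child in enumerate(order_moves(node)):
        score = -alphabeta(child, -beta, -alpha, stats, order_moves)
        best = max(best, score)
        alpha = max(alpha, best)
        if alpha >= beta:
            # Beta cutoff: record whether the first move was enough.
            stats.cutoffs += 1
            if index == 0:
                stats.first_move_cutoffs += 1
            break
    return best


def run(tree, order_moves):
    stats = OrderingStats()
    value = alphabeta(tree, -INF, INF, stats, order_moves)
    return value, stats


tree = make_tree(4, 4, random.Random(1))
orderings = {
    # Lowest child value first = best move for the side to move first.
    "best first": lambda node: sorted(node, key=exact_value),
    "unordered": lambda node: node,
    "worst first": lambda node: sorted(node, key=exact_value, reverse=True),
}
results = {name: run(tree, order) for name, order in orderings.items()}
for name, (value, stats) in results.items():
    print(f"{name:12s} value={value:4d} nodes={stats.nodes:4d} "
          f"first-move cutoffs={stats.first_move_rate():.0%}")
```

In a real engine the same two counters would be incremented at the point where the search fails high. If the first-move cutoff rate with one thread turns out to be far below the roughly 90% that is often quoted as a rule of thumb, bad move ordering becomes a plausible culprit. Comparing the rate for 1 and 4 threads on the same positions might also show whether the extra threads are somehow compensating for it.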
Tord