Test with open source engines
Posted: 12 Jul 2006, 08:34
I run some testgames with open source engines. I guess I'll limit myself to these engines because
- that limit the number of engines to look at
- I can optimize the executables to my system
- all compilations are of the same (most likely bad) quality
- hard times for clones.
Also the engines should be under active developement. If the author gives them a break for more than six months or so they also will have a break in testing.
In addition the engines should have clear instructions how to compile the sources (a Makefile is best) and the build should be possible without installing dozens of additional packages.
I report the result here and not in the tournamnt forum, because this will not be a continious list. I don't claim to determine the real strength of the engines.
I play the games with ponder on on a single cpu system. I never noticed a difference in the resulting rating list if ponder on or ponder off is used (if all engines can ponder). I also didn't notice an unfair usage of the ressources by the engines. If both engines are pondering they use 49+ % of the cpu each. If an engine can't ponder then it is in bad luck. The ability to ponder is an advantage. Why handicapping the engines which can do?
To prevent my harddisk from possible damage I use only 4-piece TBs in a ramdisk. This makes also a faster access to the TBs possible. I'm thinking about to put all 5-piece TBs on two of these 4 GB USB memory sticks but at the moment they are still a bit to expensive. I also don't know if the TBs will fit on them because the amount of storage is only possible with a compression software on these sticks.
Time control of the games was 5 minutes initially and 5 seconds per move. It came out GreKo lose all games with black under these conditions and was therefore taken out (Vladimir is informed already). Here are the results:
As you know Toga is based on Fruit.
Scorpio came in 2nd ahead of the two Crafty versions.
I don't see Crafty-Cito dozens of phantasy rating points ahead of the original but I also don't see it behind it.
Glaurung's antecessor did better in the previous (unpublished) test. But as we all know these results are all like coin tossing if not hundrets or thousands of games have been played.
Arasan lost a game on time (at move 140). I don't expect any losses on time with a 5 seconds increment. It possibly have to do with pondering. I'll check that with the new version of the engine.
EXchess worked without any problem.
There is an unclear situation between Kings Out and xpdnt. Xpdnt won on time although it checkmated Kings Out. I can't read its logfile good enough if possibly the RESULT command from the engine was missing.
HoiChess and Prophet did well. Prophet didn't lose on time what happend in the previous (unpublished) test very often.
GiuChess didn't get any point and lost 8 of its game on time. I guess I'll wait for a new version before testing it again.
Comments welcome.
Regards
Volker
The games are here: http://www.volker-pittlik.name/temp/Ranking-060712.pgn.bz2
- that limit the number of engines to look at
- I can optimize the executables to my system
- all compilations are of the same (most likely bad) quality
- hard times for clones.
Also the engines should be under active developement. If the author gives them a break for more than six months or so they also will have a break in testing.
In addition the engines should have clear instructions how to compile the sources (a Makefile is best) and the build should be possible without installing dozens of additional packages.
I report the result here and not in the tournamnt forum, because this will not be a continious list. I don't claim to determine the real strength of the engines.
I play the games with ponder on on a single cpu system. I never noticed a difference in the resulting rating list if ponder on or ponder off is used (if all engines can ponder). I also didn't notice an unfair usage of the ressources by the engines. If both engines are pondering they use 49+ % of the cpu each. If an engine can't ponder then it is in bad luck. The ability to ponder is an advantage. Why handicapping the engines which can do?
To prevent my harddisk from possible damage I use only 4-piece TBs in a ramdisk. This makes also a faster access to the TBs possible. I'm thinking about to put all 5-piece TBs on two of these 4 GB USB memory sticks but at the moment they are still a bit to expensive. I also don't know if the TBs will fit on them because the amount of storage is only possible with a compression software on these sticks.
Time control of the games was 5 minutes initially and 5 seconds per move. It came out GreKo lose all games with black under these conditions and was therefore taken out (Vladimir is informed already). Here are the results:
- Code: Select all
Ranking-060712
XP2000+, PonderOn, 32 MB hash, 4 piece TBs, 5+5, VPittlik, 2006.07.12
Score Toga Scor Craf Craf Glau Aras EXch King xpdn HoiC Prop GiuC
-------------------------------------------------------------------------------------------------
1: Toga 1.2.1 36.5 / 44 XXXX ==01 1100 11=1 =0== 1=11 1111 1111 1111 1111 1111 1111
2: Scorpio 1.8 35.5 / 44 ==10 XXXX =10= 0011 111= 0101 1111 1111 1111 1111 1111 1111
3: Crafty-Cito 1.5 33.0 / 44 0011 =01= XXXX 001= 111= 1010 11== 1110 1111 1111 1111 1111
4: Crafty-20.14 33.0 / 44 00=0 1100 110= XXXX 0=1= 1011 =11= 1111 1111 1111 1111 1111
5: Glaurung 1.2 32.5 / 44 =1== 000= 000= 1=0= XXXX 1111 111= 1=11 1111 1111 1111 1111
6: Arasan 9.2 27.0 / 44 0=00 1010 0101 0100 0000 XXXX 0=11 1111 1101 1111 1111 1111
7: EXchess-5.01-beta 22.5 / 44 0000 0000 00== =00= 000= 1=00 XXXX 1111 1011 1=11 1111 1111
8: King's Out 0.2.41 15.5 / 44 0000 0000 0001 0000 0=00 0000 0000 XXXX 1011 111= 11=1 1111
9: xpdnt_060602 14.0 / 44 0000 0000 0000 0000 0000 0010 0100 0100 XXXX 1110 1111 1111
10: HoiChess 0.4.3 10.0 / 44 0000 0000 0000 0000 0000 0000 0=00 000= 0001 XXXX 1111 1111
11: Prophet v2.0 Beta 3 4.5 / 44 0000 0000 0000 0000 0000 0000 0000 00=0 0000 0000 XXXX 1111
12: GiuChess-1.0beta2 0.0 / 44 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 XXXX
-------------------------------------------------------------------------------------------------
264 games: +128 =24 -112
As you know Toga is based on Fruit.
Scorpio came in 2nd ahead of the two Crafty versions.
I don't see Crafty-Cito dozens of phantasy rating points ahead of the original but I also don't see it behind it.
Glaurung's antecessor did better in the previous (unpublished) test. But as we all know these results are all like coin tossing if not hundrets or thousands of games have been played.
Arasan lost a game on time (at move 140). I don't expect any losses on time with a 5 seconds increment. It possibly have to do with pondering. I'll check that with the new version of the engine.
EXchess worked without any problem.
There is an unclear situation between Kings Out and xpdnt. Xpdnt won on time although it checkmated Kings Out. I can't read its logfile good enough if possibly the RESULT command from the engine was missing.
HoiChess and Prophet did well. Prophet didn't lose on time what happend in the previous (unpublished) test very often.
GiuChess didn't get any point and lost 8 of its game on time. I guess I'll wait for a new version before testing it again.
Comments welcome.
Regards
Volker
The games are here: http://www.volker-pittlik.name/temp/Ranking-060712.pgn.bz2