Geschrieben von:/Posted by: Aaron at 03 February 2000 05:58:25:
Als Antwort auf:/As an answer to: Re: Crafty174 vs Phalanx22; Rapid 60 AMD K6_II 350/64 (9,0 - 1,0) geschrieben von:/posted by: Dann Corbit at 02 Februar 2000 01:46:03:
In a rapid 60 match under WinBoard 4.05 running on AMD K6_II 350/64 with 16 MB hash tables for each program and no tablebases I could hardly believe the most surprising result of 9,0 - 1,0 2 draws, 8 wins) in favour of Crafty 1.74 against the latest Phalanx 22 version. On request I can send the games in PGN-format.
Hello,
It's very hard to believe.
May be something is broken in your setings or....
Not at all difficult to imagine. If the chess programs were about equal in >strength, this result would not be unexpected. Since crafty is somewhat >stronger, it should not be surprising at all. Even a Phalanx 22 victory of 9->1 would not be incredibly unlikely.
But the more games the less likely it is right? So a 90-10 Crafty victory over Phalanx would increase the certainty that Crafty is that much stronger?
(In my 5 mins Blitz games, I've played close to 60 games for this matchup and I believe the results are very close with Crafty just ahead by 10 games or so)
Is there a formula to calculate this? I remember someone on CCC saying that a Binomial function would not be accurate since it allows only 2 outcomes Win and Loss ...
Also is there a simple way to figure out ELO differences based on results?
I've taken a basic statsics course, so I probably familar with most terms like chi-square etc..But I just need reminding
