Creating a positional testset
Posted: 09 Mar 2006, 22:25
I'm about to compose a positional testset. I think the WM-Test would be a good base. It ships with a subset of 36 positions thats considered to contain mainly positional positions. But some of them focus to much other aspects like king safty or contain tactical aspects. Sure each positional advantage will turn into a tactical one sooner or later. But to have a real positional subset this should happen far beyond the horizon of current engines.
To distinguish between positions with to much tactical impact an pure positional positions I thought about the following test:
This might look strange at first glance but if a engine finds a solution in 2) but not in 1) it's very likely that the positional advantage turns into a tactical one to fast (within the horizon of the engine) so your engine might have solved the test by tactics. If the positional evaluation of the engine is good you will find the solution very fast. If the positional evaluation of the engine is wrong it will never find the solution regardless how long it will search (as long as a first tactical bebefit is to deep).
Do you think this is good approach to build a testset ?
Klaus
To distinguish between positions with to much tactical impact an pure positional positions I thought about the following test:
- Solve the testset with your favorite engine at short timecontrols (10sec).
- Repeat the test at much longer timecontrol (10min)
- Remove all positions that are not found in 1) but are found in 2)
- Maybe repeat 1)-3) with other engines.
This might look strange at first glance but if a engine finds a solution in 2) but not in 1) it's very likely that the positional advantage turns into a tactical one to fast (within the horizon of the engine) so your engine might have solved the test by tactics. If the positional evaluation of the engine is good you will find the solution very fast. If the positional evaluation of the engine is wrong it will never find the solution regardless how long it will search (as long as a first tactical bebefit is to deep).
Do you think this is good approach to build a testset ?
Klaus