![]() |
| If this is your first visit, be sure to check out the FAQ by clicking the link above. You may have to register before you can post: click the register link above to proceed. To start viewing messages, select the forum that you want to visit from the selection below. |
|
|||||||
| Tags: positions, sanny, test |
|
|
Thread Tools | Display Modes |
|
#1
|
|||
|
|||
|
Sanny;
Here are some standard test positions for you to use with GetClub chess. The classic Bratko-Kopec test and the Win-At-Chess test. There are many other test sets, but these are classics that most people use. The Bratko-Kopec tests are particularly intersting, since they've been around so long and so many programs have done it. They aren't extensive enough, which is why people use additional test sets, but they are classics that most people use. These are done in fairly basic EPD strings. The first part is the FEN description, followed by 'bm' which stands for 'best move', followed by an identification string. (There are more complicated forms of EPD, that can have a move to avoid or multiple acceptable moves, etc. But these are fairly simple ones for you.) FEN starts at A8 and goes across to H8, then A7-H7, etc. Lowercase is black. Uppercase is white. Numbers are how many blank squares. Slashes indicate a new row. This is followed by w/b for side to move, followed by Castling (K=Kingside, Q=Queenside. Uppercase is white, lower case is black. - means no castling status. Kkq would mean White kingside, black bothsides. This is followed by the enpassant square. - means no enpassant possible. These are optionally follwed by half move clock & full move clock. You wont have that in these tests. Simply feed each position into GetClub and have it search, then compare the result, and report the results. Try searches for 5 seconds, 10 seconds and 30 seconds. A test script is easy to do. Or you can program the testing stuff into the program itself. That's what most people do because it makes testing easy that way. You just say "epdtest filename" and it grabs the test file, runs the tests, saves the results to a file and when you come back the results are ready for you. No human participation required. (Note: These tests have gotten a lot of research done on them. As a result, there are some who disagree with the results for various moves. However, the standard results are 'good enough' at this stage. No program gets all the right answers at the quicker time controls (without being specifically tuned to do so), so a few positions with different moves aren't significant right now.) Here are the classic Bratko-Kopec tests. ****Begin 1k1r4/pp1b1R2/3q2pp/4p3/2B5/4Q3/PPP2B2/2K5 b - - bm Qd1+; id "BK.01"; 3r1k2/4npp1/1ppr3p/p6P/P2PPPP1/1NR5/5K2/2R5 w - - bm d5; id "BK.02"; 2q1rr1k/3bbnnp/p2p1pp1/2pPp3/PpP1P1P1/1P2BNNP/2BQ1PRK/7R b - - bm f5; id "BK.03"; rnbqkb1r/p3pppp/1p6/2ppP3/3N4/2P5/PPP1QPPP/R1B1KB1R w KQkq - bm e6; id "BK.04"; r1b2rk1/2q1b1pp/p2ppn2/1p6/3QP3/1BN1B3/PPP3PP/R4RK1 w - - bm Nd5 a4; id "BK.05"; 2r3k1/pppR1pp1/4p3/4P1P1/5P2/1P4K1/P1P5/8 w - - bm g6; id "BK.06"; 1nk1r1r1/pp2n1pp/4p3/q2pPp1N/b1pP1P2/B1P2R2/2P1B1PP/R2Q2K1 w - - bm Nf6; id "BK.07"; 4b3/p3kp2/6p1/3pP2p/2pP1P2/4K1P1/P3N2P/8 w - - bm f5; id "BK.08"; 2kr1bnr/pbpq4/2n1pp2/3p3p/3P1P1B/2N2N1Q/PPP3PP/2KR1B1R w - - bm f5; id "BK.09"; 3rr1k1/pp3pp1/1qn2np1/8/3p4/PP1R1P2/2P1NQPP/R1B3K1 b - - bm Ne5; id "BK.10"; 2r1nrk1/p2q1ppp/bp1p4/n1pPp3/P1P1P3/2PBB1N1/4QPPP/R4RK1 w - - bm f4; id "BK.11"; r3r1k1/ppqb1ppp/8/4p1NQ/8/2P5/PP3PPP/R3R1K1 b - - bm Bf5; id "BK.12"; r2q1rk1/4bppp/p2p4/2pP4/3pP3/3Q4/PP1B1PPP/R3R1K1 w - - bm b4; id "BK.13"; rnb2r1k/pp2p2p/2pp2p1/q2P1p2/8/1Pb2NP1/PB2PPBP/R2Q1RK1 w - - bm Qd2 Qe1; id "BK.14"; 2r3k1/1p2q1pp/2b1pr2/p1pp4/6Q1/1P1PP1R1/P1PN2PP/5RK1 w - - bm Qxg7+; id "BK.15"; r1bqkb1r/4npp1/p1p4p/1p1pP1B1/8/1B6/PPPN1PPP/R2Q1RK1 w kq - bm Ne4; id "BK.16"; r2q1rk1/1ppnbppp/p2p1nb1/3Pp3/2P1P1P1/2N2N1P/PPB1QP2/R1B2RK1 b - - bm h5; id "BK.17"; r1bq1rk1/pp2ppbp/2np2p1/2n5/P3PP2/N1P2N2/1PB3PP/R1B1QRK1 b - - bm Nb3; id "BK.18"; 3rr3/2pq2pk/p2p1pnp/8/2QBPP2/1P6/P5PP/4RRK1 b - - bm Rxe4; id "BK.19"; r4k2/pb2bp1r/1p1qp2p/3pNp2/3P1P2/2N3P1/PPP1Q2P/2KRR3 w - - bm g4; id "BK.20"; 3rn2k/ppb2rpp/2ppqp2/5N2/2P1P3/1P5Q/PB3PPP/3RR1K1 w - - bm Nh6; id "BK.21"; 2r2rk1/1bqnbpp1/1p1ppn1p/pP6/N1P1P3/P2B1N1P/1B2QPP1/R2R2K1 b - - bm Bxe4; id "BK.22"; r1bqk2r/pp2bppp/2p5/3pP3/P2Q1P2/2N1B3/1PP3PP/R4RK1 b kq - bm f6; id "BK.23"; r2qnrnk/p2b2b1/1p1p2pp/2pPpp2/1PP1P3/PRNBB3/3QNPPP/5RK1 w - - bm f4; id "BK.24"; ****End Here are the Win At Chess positions **Begin 2rr3k/pp3pp1/1nnqbN1p/3pN3/2pP4/2P3Q1/PPB4P/R4RK1 w - - bm Qg6; id "WAC.001"; 8/7p/5k2/5p2/p1p2P2/Pr1pPK2/1P1R3P/8 b - - bm Rxb2; id "WAC.002"; 5rk1/1ppb3p/p1pb4/6q1/3P1p1r/2P1R2P/PP1BQ1P1/5RKN w - - bm Rg3; id "WAC.003"; r1bq2rk/pp3pbp/2p1p1pQ/7P/3P4/2PB1N2/PP3PPR/2KR4 w - - bm Qxh7+; id "WAC.004"; 5k2/6pp/p1qN4/1p1p4/3P4/2PKP2Q/PP3r2/3R4 b - - bm Qc4+; id "WAC.005"; 7k/p7/1R5K/6r1/6p1/6P1/8/8 w - - bm Rb7; id "WAC.006"; rnbqkb1r/pppp1ppp/8/4P3/6n1/7P/PPPNPPP1/R1BQKBNR b KQkq - bm Ne3; id "WAC.007"; r4q1k/p2bR1rp/2p2Q1N/5p2/5p2/2P5/PP3PPP/R5K1 w - - bm Rf7; id "WAC.008"; 3q1rk1/p4pp1/2pb3p/3p4/6Pr/1PNQ4/P1PB1PP1/4RRK1 b - - bm Bh2+; id "WAC.009"; 2br2k1/2q3rn/p2NppQ1/2p1P3/Pp5R/4P3/1P3PPP/3R2K1 w - - bm Rxh7; id "WAC.010"; r1b1kb1r/3q1ppp/pBp1pn2/8/Np3P2/5B2/PPP3PP/R2Q1RK1 w kq - bm Bxc6; id "WAC.011"; 4k1r1/2p3r1/1pR1p3/3pP2p/3P2qP/P4N2/1PQ4P/5R1K b - - bm Qxf3+; id "WAC.012"; 5rk1/pp4p1/2n1p2p/2Npq3/2p5/6P1/P3P1BP/R4Q1K w - - bm Qxf8+; id "WAC.013"; r2rb1k1/pp1q1p1p/2n1p1p1/2bp4/5P2/PP1BPR1Q/1BPN2PP/R5K1 w - - bm Qxh7+; id "WAC.014"; 1R6/1brk2p1/4p2p/p1P1Pp2/P7/6P1/1P4P1/2R3K1 w - - bm Rxb7; id "WAC.015"; r4rk1/ppp2ppp/2n5/2bqp3/8/P2PB3/1PP1NPPP/R2Q1RK1 w - - bm Nc3; id "WAC.016"; 1k5r/pppbn1pp/4q1r1/1P3p2/2NPp3/1QP5/P4PPP/R1B1R1K1 w - - bm Ne5; id "WAC.017"; R7/P4k2/8/8/8/8/r7/6K1 w - - bm Rh8; id "WAC.018"; r1b2rk1/ppbn1ppp/4p3/1QP4q/3P4/N4N2/5PPP/R1B2RK1 w - - bm c6; id "WAC.019"; r2qkb1r/1ppb1ppp/p7/4p3/P1Q1P3/2P5/5PPP/R1B2KNR b kq - bm Bb5; id "WAC.020"; 5rk1/1b3p1p/pp3p2/3n1N2/1P6/P1qB1PP1/3Q3P/4R1K1 w - - bm Qh6; id "WAC.021"; r1bqk2r/ppp1nppp/4p3/n5N1/2BPp3/P1P5/2P2PPP/R1BQK2R w KQkq - bm Ba2 Nxf7; id "WAC.022"; r3nrk1/2p2p1p/p1p1b1p1/2NpPq2/3R4/P1N1Q3/1PP2PPP/4R1K1 w - - bm g4; id "WAC.023"; 6k1/1b1nqpbp/pp4p1/5P2/1PN5/4Q3/P5PP/1B2B1K1 b - - bm Bd4; id "WAC.024"; 3R1rk1/8/5Qpp/2p5/2P1p1q1/P3P3/1P2PK2/8 b - - bm Qh4+; id "WAC.025"; 3r2k1/1p1b1pp1/pq5p/8/3NR3/2PQ3P/PP3PP1/6K1 b - - bm Bf5; id "WAC.026"; 7k/pp4np/2p3p1/3pN1q1/3P4/Q7/1r3rPP/2R2RK1 w - - bm Qf8+; id "WAC.027"; 1r1r2k1/4pp1p/2p1b1p1/p3R3/RqBP4/4P3/1PQ2PPP/6K1 b - - bm Qe1+; id "WAC.028"; r2q2k1/pp1rbppp/4pn2/2P5/1P3B2/6P1/P3QPBP/1R3RK1 w - - bm c6; id "WAC.029"; 1r3r2/4q1kp/b1pp2p1/5p2/pPn1N3/6P1/P3PPBP/2QRR1K1 w - - bm Nxd6; id "WAC.030"; rb3qk1/pQ3ppp/4p3/3P4/8/1P3N2/1P3PPP/3R2K1 w - - bm Qxa8 d6 dxe6; id "WAC.031"; 6k1/p4p1p/1p3np1/2q5/4p3/4P1N1/PP3PPP/3Q2K1 w - - bm Qd8+; id "WAC.032"; 8/p1q2pkp/2Pr2p1/8/P3Q3/6P1/5P1P/2R3K1 w - - bm Qe5+ Qf4; id "WAC.033"; 7k/1b1r2p1/p6p/1p2qN2/3bP3/3Q4/P5PP/1B1R3K b - - bm Bg1; id "WAC.034"; r3r2k/2R3pp/pp1q1p2/8/3P3R/7P/PP3PP1/3Q2K1 w - - bm Rxh7+; id "WAC.035"; 3r4/2p1rk2/1pQq1pp1/7p/1P1P4/P4P2/6PP/R1R3K1 b - - bm Re1+; id "WAC.036"; 2r5/2rk2pp/1pn1pb2/pN1p4/P2P4/1N2B3/nPR1KPPP/3R4 b - - bm Nxd4+; id "WAC.037"; 4k3/p4prp/1p6/2b5/8/2Q3P1/P2R1PKP/4q3 w - - bm Rd8+; id "WAC.038"; r1br2k1/pp2bppp/2nppn2/8/2P1PB2/2N2P2/PqN1B1PP/R2Q1R1K w - - bm Na4; id "WAC.039"; 3r1r1k/1p4pp/p4p2/8/1PQR4/6Pq/P3PP2/2R3K1 b - - bm Rc8; id "WAC.040"; 1k6/5RP1/1P6/1K6/6r1/8/8/8 w - - bm Ka5 Kc5; id "WAC.041"; r1b1r1k1/pp1n1pbp/1qp3p1/3p4/1B1P4/Q3PN2/PP2BPPP/R4RK1 w - - bm Ba5; id "WAC.042"; r2q3k/p2P3p/1p3p2/3QP1r1/8/B7/P5PP/2R3K1 w - - bm Be7 Qxa8; id "WAC.043"; 3rb1k1/pq3pbp/4n1p1/3p4/2N5/2P2QB1/PP3PPP/1B1R2K1 b - - bm dxc4; id "WAC.044"; 7k/2p1b1pp/8/1p2P3/1P3r2/2P3Q1/1P5P/R4qBK b - - bm Qxa1; id "WAC.045"; r1bqr1k1/pp1nb1p1/4p2p/3p1p2/3P4/P1N1PNP1/1PQ2PP1/3RKB1R w K - bm Nb5; id "WAC.046"; r1b2rk1/pp2bppp/2n1pn2/q5B1/2BP4/2N2N2/PP2QPPP/2R2RK1 b - - bm Nxd4; id "WAC.047"; 1rbq1rk1/p1p1bppp/2p2n2/8/Q1BP4/2N5/PP3PPP/R1B2RK1 b - - bm Rb4; id "WAC.048"; 2b3k1/4rrpp/p2p4/2pP2RQ/1pP1Pp1N/1P3P1P/1q6/6RK w - - bm Qxh7+; id "WAC.049"; k4r2/1R4pb/1pQp1n1p/3P4/5p1P/3P2P1/r1q1R2K/8 w - - bm Rxb6+; id "WAC.050"; r1bq1r2/pp4k1/4p2p/3pPp1Q/3N1R1P/2PB4/6P1/6K1 w - - bm Rg4+; id "WAC.051"; r1k5/1p3q2/1Qpb4/3N1p2/5Pp1/3P2Pp/PPPK3P/4R3 w - - bm Re7; id "WAC.052"; 6k1/6p1/p7/3Pn3/5p2/4rBqP/P4RP1/5QK1 b - - bm Re1; id "WAC.053"; r3kr2/1pp4p/1p1p4/7q/4P1n1/2PP2Q1/PP4P1/R1BB2K1 b q - bm Qh1+; id "WAC.054"; r3r1k1/pp1q1pp1/4b1p1/3p2B1/3Q1R2/8/PPP3PP/4R1K1 w - - bm Qxg7+; id "WAC.055"; r1bqk2r/pppp1ppp/5n2/2b1n3/4P3/1BP3Q1/PP3PPP/RNB1K1NR b KQkq - bm Bxf2+; id "WAC.056"; r3q1kr/ppp5/3p2pQ/8/3PP1b1/5R2/PPP3P1/5RK1 w - - bm Rf8+; id "WAC.057"; 8/8/2R5/1p2qp1k/1P2r3/2PQ2P1/5K2/8 w - - bm Qd1+; id "WAC.058"; r1b2rk1/2p1qnbp/p1pp2p1/5p2/2PQP3/1PN2N1P/PB3PP1/3R1RK1 w - - bm Nd5; id "WAC.059"; rn1qr1k1/1p2np2/2p3p1/8/1pPb4/7Q/PB1P1PP1/2KR1B1R w - - bm Qh8+; id "WAC.060"; 3qrbk1/ppp1r2n/3pP2p/3P4/2P4P/1P3Q2/PB6/R4R1K w - - bm Qf7+; id "WAC.061"; 6r1/3Pn1qk/p1p1P1rp/2Q2p2/2P5/1P4P1/P3R2P/5RK1 b - - bm Rxg3+; id "WAC.062"; r1brnbk1/ppq2pp1/4p2p/4N3/3P4/P1PB1Q2/3B1PPP/R3R1K1 w - - bm Nxf7; id "WAC.063"; 8/6pp/3q1p2/3n1k2/1P6/3NQ2P/5PP1/6K1 w - - bm g4+; id "WAC.064"; 1r1r1qk1/p2n1p1p/bp1Pn1pQ/2pNp3/2P2P1N/1P5B/P6P/3R1RK1 w - - bm Ne7+; id "WAC.065"; 1k1r2r1/ppq5/1bp4p/3pQ3/8/2P2N2/PP4P1/R4R1K b - - bm Qxe5; id "WAC.066"; 3r2k1/p2q4/1p4p1/3rRp1p/5P1P/6PK/P3R3/3Q4 w - - bm Rxd5; id "WAC.067"; 6k1/5ppp/1q6/2b5/8/2R1pPP1/1P2Q2P/7K w - - bm Qxe3; id "WAC.068"; 2k5/pppr4/4R3/4Q3/2pp2q1/8/PPP2PPP/6K1 w - - bm f3 h3; id "WAC.069"; 2kr3r/pppq1ppp/3p1n2/bQ2p3/1n1PP3/1PN1BN1P/1PP2PP1/2KR3R b - - bm Na2+; id "WAC.070"; 2kr3r/pp1q1ppp/5n2/1Nb5/2Pp1B2/7Q/P4PPP/1R3RK1 w - - bm Nxa7+; id "WAC.071"; r3r1k1/pp1n1ppp/2p5/4Pb2/2B2P2/B1P5/P5PP/R2R2K1 w - - bm e6; id "WAC.072"; r1q3rk/1ppbb1p1/4Np1p/p3pP2/P3P3/2N4R/1PP1Q1PP/3R2K1 w - - bm Qd2; id "WAC.073"; 5r1k/pp4pp/2p5/2b1P3/4Pq2/1PB1p3/P3Q1PP/3N2K1 b - - bm Qf1+; id "WAC.074"; r3r1k1/pppq1ppp/8/8/1Q4n1/7P/PPP2PP1/RNB1R1K1 b - - bm Qd6; id "WAC.075"; r1b1qrk1/2p2ppp/pb1pnn2/1p2pNB1/3PP3/1BP5/PP2QPPP/RN1R2K1 w - - bm Bxf6; id "WAC.076"; 3r2k1/ppp2ppp/6q1/b4n2/3nQB2/2p5/P4PPP/RN3RK1 b - - bm Ng3; id "WAC.077"; r2q3r/ppp2k2/4nbp1/5Q1p/2P1NB2/8/PP3P1P/3RR1K1 w - - bm Ng5+; id "WAC.078"; r3k2r/pbp2pp1/3b1n2/1p6/3P3p/1B2N1Pq/PP1PQP1P/R1B2RK1 b kq - bm Qxh2+; id "WAC.079"; r4rk1/p1B1bpp1/1p2pn1p/8/2PP4/3B1P2/qP2QP1P/3R1RK1 w - - bm Ra1; id "WAC.080"; r4rk1/1bR1bppp/4pn2/1p2N3/1P6/P3P3/4BPPP/3R2K1 b - - bm Bd6; id "WAC.081"; 3rr1k1/pp3pp1/4b3/8/2P1B2R/6QP/P3q1P1/5R1K w - - bm Bh7+; id "WAC.082"; 3rr1k1/ppqbRppp/2p5/8/3Q1n2/2P3N1/PPB2PPP/3R2K1 w - - bm Qxd7; id "WAC.083"; r2q1r1k/2p1b1pp/p1n5/1p1Q1bN1/4n3/1BP1B3/PP3PPP/R4RK1 w - - bm Qg8+; id "WAC.084"; kr2R3/p4r2/2pq4/2N2p1p/3P2p1/Q5P1/5P1P/5BK1 w - - bm Na6; id "WAC.085"; 8/p7/1ppk1n2/5ppp/P1PP4/2P1K1P1/5N1P/8 b - - bm Ng4+; id "WAC.086"; 8/p3k1p1/4r3/2ppNpp1/PP1P4/2P3KP/5P2/8 b - - bm Rxe5; id "WAC.087"; r6k/p1Q4p/2p1b1rq/4p3/B3P3/4P3/PPP3P1/4RRK1 b - - bm Rxg2+; id "WAC.088"; 1r3b1k/p4rpp/4pp2/3q4/2ppbPPQ/6RK/PP5P/2B1NR2 b - - bm g5; id "WAC.089"; 3qrrk1/1pp2pp1/1p2bn1p/5N2/2P5/P1P3B1/1P4PP/2Q1RRK1 w - - bm Nxg7; id "WAC.090"; 2qr2k1/4b1p1/2p2p1p/1pP1p3/p2nP3/PbQNB1PP/1P3PK1/4RB2 b - - bm Be6; id "WAC.091"; r4rk1/1p2ppbp/p2pbnp1/q7/3BPPP1/2N2B2/PPP4P/R2Q1RK1 b - - bm Bxg4; id "WAC.092"; r1b1k1nr/pp3pQp/4pq2/3pn3/8/P1P5/2P2PPP/R1B1KBNR w KQkq - bm Bh6; id "WAC.093"; 8/k7/p7/3Qp2P/n1P5/3KP3/1q6/8 b - - bm e4+; id "WAC.094"; 2r5/1r6/4pNpk/3pP1qp/8/2P1QP2/5PK1/R7 w - - bm Ng4+; id "WAC.095"; r1b4k/ppp2Bb1/6Pp/3pP3/1qnP1p1Q/8/PPP3P1/1K1R3R w - - bm Qd8+; id "WAC.096"; 6k1/5p2/p5np/4B3/3P4/1PP1q3/P3r1QP/6RK w - - bm Qa8+; id "WAC.097"; 1r3rk1/5pb1/p2p2p1/Q1n1q2p/1NP1P3/3p1P1B/PP1R3P/1K2R3 b - - bm Nxe4; id "WAC.098"; r1bq1r1k/1pp1Np1p/p2p2pQ/4R3/n7/8/PPPP1PPP/R1B3K1 w - - bm Rh5; id "WAC.099"; 8/k1b5/P4p2/1Pp2p1p/K1P2P1P/8/3B4/8 w - - bm b6+; id "WAC.100"; 5rk1/p5pp/8/8/2Pbp3/1P4P1/7P/4RN1K b - - bm Bc3; id "WAC.101"; 2Q2n2/2R4p/1p1qpp1k/8/3P3P/3B2P1/5PK1/r7 w - - bm Qxf8+; id "WAC.102"; 6k1/2pb1r1p/3p1PpQ/p1nPp3/1q2P3/2N2P2/PrB5/2K3RR w - - bm Qxg6+; id "WAC.103"; b4r1k/pq2rp2/1p1bpn1p/3PN2n/2P2P2/P2B3K/1B2Q2N/3R2R1 w - - bm Qxh5; id "WAC.104"; r2r2k1/pb3ppp/1p1bp3/7q/3n2nP/PP1B2P1/1B1N1P2/RQ2NRK1 b - - bm Qxh4; id "WAC.105"; 4rrk1/pppb4/7p/3P2pq/3Qn3/P5P1/1PP4P/R3RNNK b - - bm Nf2+; id "WAC.106"; 5n2/pRrk2p1/P4p1p/4p3/3N4/5P2/6PP/6K1 w - - bm Nb5; id "WAC.107"; r5k1/1q4pp/2p5/p1Q5/2P5/5R2/4RKPP/r7 w - - bm Qe5; id "WAC.108"; rn2k1nr/pbp2ppp/3q4/1p2N3/2p5/QP6/PB1PPPPP/R3KB1R b KQkq - bm c3; id "WAC.109"; 2kr4/bp3p2/p2p2b1/P7/2q5/1N4B1/1PPQ2P1/2KR4 b - - bm Be3; id "WAC.110"; 6k1/p5p1/5p2/2P2Q2/3pN2p/3PbK1P/7P/6q1 b - - bm Qf1+; id "WAC.111"; r4kr1/ppp5/4bq1b/7B/2PR1Q1p/2N3P1/PP3P1P/2K1R3 w - - bm Rxe6; id "WAC.112"; rnbqkb1r/1p3ppp/5N2/1p2p1B1/2P5/8/PP2PPPP/R2QKB1R b KQkq - bm Qxf6; id "WAC.113"; r1b1rnk1/1p4pp/p1p2p2/3pN2n/3P1PPq/2NBPR1P/PPQ5/2R3K1 w - - bm Bxh7+; id "WAC.114"; 4N2k/5rpp/1Q6/p3q3/8/P5P1/1P3P1P/5K2 w - - bm Nd6; id "WAC.115"; r2r2k1/2p2ppp/p7/1p2P1n1/P6q/5P2/1PB1QP1P/R5RK b - - bm Rd2; id "WAC.116"; 3r1rk1/q4ppp/p1Rnp3/8/1p6/1N3P2/PP3QPP/3R2K1 b - - bm Ne4; id "WAC.117"; r5k1/pb2rpp1/1p6/2p4q/5R2/2PB2Q1/P1P3PP/5R1K w - - bm Rh4; id "WAC.118"; r2qr1k1/p1p2ppp/2p5/2b5/4nPQ1/3B4/PPP3PP/R1B2R1K b - - bm Qxd3; id "WAC.119"; r4rk1/1bn2qnp/3p1B1Q/p2P1pP1/1pp5/5N1P/PPB2P2/2KR3R w - - bm g6; id "WAC.120"; 6k1/5p1p/2bP2pb/4p3/2P5/1p1pNPPP/1P1Q1BK1/1q6 b - - bm Bxf3+; id "WAC.121"; 1k6/ppp4p/1n2pq2/1N2Rb2/2P2Q2/8/P4KPP/3r1B2 b - - bm Rxf1+; id "WAC.122"; 6k1/1b2rp2/1p4p1/3P4/PQ4P1/2N2q2/5P2/3R2K1 b - - bm Bxd5 Rc7; id "WAC.123"; 6k1/3r4/2R5/P5P1/1P4p1/8/4rB2/6K1 b - - bm g3; id "WAC.124"; r1bqr1k1/pp3ppp/1bp5/3n4/3B4/2N2P1P/PPP1B1P1/R2Q1RK1 b - - bm Bxd4+; id "WAC.125"; r5r1/pQ5p/1qp2R2/2k1p3/4P3/2PP4/P1P3PP/6K1 w - - bm Rxc6+; id "WAC.126"; 2k4r/1pr1n3/p1p1q2p/5pp1/3P1P2/P1P1P3/1R2Q1PP/1RB3K1 w - - bm Rxb7; id "WAC.127"; 6rk/1pp2Qrp/3p1B2/1pb1p2R/3n1q2/3P4/PPP3PP/R6K w - - bm Qg6; id "WAC.128"; 3r1r1k/1b2b1p1/1p5p/2p1Pp2/q1B2P2/4P2P/1BR1Q2K/6R1 b - - bm Bf3; id "WAC.129"; 6k1/1pp3q1/5r2/1PPp4/3P1pP1/3Qn2P/3B4/4R1K1 b - - bm Qh6 Qh8; id "WAC.130"; 2rq1bk1/p4p1p/1p4p1/3b4/3B1Q2/8/P4PpP/3RR1K1 w - - bm Re8; id "WAC.131"; 4r1k1/5bpp/2p5/3pr3/8/1B3pPq/PPR2P2/2R2QK1 b - - bm Re1; id "WAC.132"; r1b1k2r/1pp1q2p/p1n3p1/3QPp2/8/1BP3B1/P5PP/3R1RK1 w kq - bm Bh4; id "WAC.133"; 3r2k1/p6p/2Q3p1/4q3/2P1p3/P3Pb2/1P3P1P/2K2BR1 b - - bm Rd1+; id "WAC.134"; 3r1r1k/N2qn1pp/1p2np2/2p5/2Q1P2N/3P4/PP4PP/3R1RK1 b - - bm Nd4; id "WAC.135"; 6kr/1q2r1p1/1p2N1Q1/5p2/1P1p4/6R1/7P/2R3K1 w - - bm Rc8+; id "WAC.136"; 3b1rk1/1bq3pp/5pn1/1p2rN2/2p1p3/2P1B2Q/1PB2PPP/R2R2K1 w - - bm Rd7; id "WAC.137"; r1bq3r/ppppR1p1/5n1k/3P4/6pP/3Q4/PP1N1PP1/5K1R w - - bm h5; id "WAC.138"; rnb3kr/ppp2ppp/1b6/3q4/3pN3/Q4N2/PPP2KPP/R1B1R3 w - - bm Nf6+; id "WAC.139"; r2b1rk1/pq4p1/4ppQP/3pB1p1/3P4/2R5/PP3PP1/5RK1 w - - bm Rc7; id "WAC.140"; 4r1k1/p1qr1p2/2pb1Bp1/1p5p/3P1n1R/1B3P2/PP3PK1/2Q4R w - - bm Qxf4; id "WAC.141"; r2q3n/ppp2pk1/3p4/5Pr1/2NP1Qp1/2P2pP1/PP3K2/4R2R w - - bm Re8 f6+; id "WAC.142"; 5b2/pp2r1pk/2pp1pRp/4rP1N/2P1P3/1P4QP/P3q1P1/5R1K w - - bm Rxh6+; id "WAC.143"; r2q1rk1/pp3ppp/2p2b2/8/B2pPPb1/7P/PPP1N1P1/R2Q1RK1 b - - bm d3; id "WAC.144"; r1bq4/1p4kp/3p1n2/p4pB1/2pQ4/8/1P4PP/4RRK1 w - - bm Re8; id "WAC.145"; 8/8/2Kp4/3P1B2/2P2k2/5p2/8/8 w - - bm Bc8; id "WAC.146"; r2r2k1/ppqbppbp/2n2np1/2pp4/6P1/1P1PPNNP/PBP2PB1/R2QK2R b KQ - bm Nxg4; id "WAC.147"; 2r1k3/6pr/p1nBP3/1p3p1p/2q5/2P5/P1R4P/K2Q2R1 w - - bm Rxg7; id "WAC.148"; 6k1/6p1/2p4p/4Pp2/4b1qP/2Br4/1P2RQPK/8 b - - bm Bxg2; id "WAC.149"; r3r1k1/5p2/pQ1b2pB/1p6/4p3/6P1/Pq2BP1P/2R3K1 b - - bm Ba3 Bf8 e3; id "WAC.150"; 8/3b2kp/4p1p1/pr1n4/N1N4P/1P4P1/1K3P2/3R4 w - - bm Nc3; id "WAC.151"; 1br2rk1/1pqb1ppp/p3pn2/8/1P6/P1N1PN1P/1B3PP1/1QRR2K1 w - - bm Ne4; id "WAC.152"; 2r3k1/q4ppp/p3p3/pnNp4/2rP4/2P2P2/4R1PP/2R1Q1K1 b - - bm Nxd4; id "WAC.153"; r1b2rk1/2p2ppp/p7/1p6/3P3q/1BP3bP/PP3QP1/RNB1R1K1 w - - bm Qxf7+; id "WAC.154"; 5bk1/1rQ4p/5pp1/2pP4/3n1PP1/7P/1q3BB1/4R1K1 w - - bm d6; id "WAC.155"; r1b1qN1k/1pp3p1/p2p3n/4p1B1/8/1BP4Q/PP3KPP/8 w - - bm Qxh6+; id "WAC.156"; 5rk1/p4ppp/2p1b3/3Nq3/4P1n1/1p1B2QP/1PPr2P1/1K2R2R w - - bm Ne7+; id "WAC.157"; 5rk1/n1p1R1bp/p2p4/1qpP1QB1/7P/2P3P1/PP3P2/6K1 w - - bm Rxg7+; id "WAC.158"; r1b2r2/5P1p/ppn3pk/2p1p1Nq/1bP1PQ2/3P4/PB4BP/1R3RK1 w - - bm Ne6+; id "WAC.159"; qn1kr2r/1pRbb3/pP5p/P2pP1pP/3N1pQ1/3B4/3B1PP1/R5K1 w - - bm Qxd7+; id "WAC.160"; 3r3k/3r1P1p/pp1Nn3/2pp4/7Q/6R1/Pq4PP/5RK1 w - - bm Qxd8+; id "WAC.161"; r3kbnr/p4ppp/2p1p3/8/Q1B3b1/2N1B3/PP3PqP/R3K2R w KQkq - bm Bd5; id "WAC.162"; 5rk1/2p4p/2p4r/3P4/4p1b1/1Q2NqPp/PP3P1K/R4R2 b - - bm Qg2+; id "WAC.163"; 8/6pp/4p3/1p1n4/1NbkN1P1/P4P1P/1PR3K1/r7 w - - bm Rxc4+; id "WAC.164"; 1r5k/p1p3pp/8/8/4p3/P1P1R3/1P1Q1qr1/2KR4 w - - bm Re2; id "WAC.165"; r3r1k1/5pp1/p1p4p/2Pp4/8/q1NQP1BP/5PP1/4K2R b K - bm d4; id "WAC.166"; 7Q/ppp2q2/3p2k1/P2Ppr1N/1PP5/7R/5rP1/6K1 b - - bm Rxg2+; id "WAC.167"; r3k2r/pb1q1p2/8/2p1pP2/4p1p1/B1P1Q1P1/P1P3K1/R4R2 b kq - bm Qd2+; id "WAC.168"; 5rk1/1pp3bp/3p2p1/2PPp3/1P2P3/2Q1B3/4q1PP/R5K1 b - - bm Bh6; id "WAC.169"; 5r1k/6Rp/1p2p3/p2pBp2/1qnP4/4P3/Q4PPP/6K1 w - - bm Qxc4; id "WAC.170"; 2rq4/1b2b1kp/p3p1p1/1p1nNp2/7P/1B2B1Q1/PP3PP1/3R2K1 w - - bm Bh6+; id "WAC.171"; 5r1k/p5pp/8/1P1pq3/P1p2nR1/Q7/5BPP/6K1 b - - bm Qe1+; id "WAC.172"; 2r1b3/1pp1qrk1/p1n1P1p1/7R/2B1p3/4Q1P1/PP3PP1/3R2K1 w - - bm Qh6+; id "WAC.173"; 2r2rk1/6p1/p3pq1p/1p1b1p2/3P1n2/PP3N2/3N1PPP/1Q2RR1K b - - bm Nxg2; id "WAC.174"; r5k1/pppb3p/2np1n2/8/3PqNpP/3Q2P1/PPP5/R4RK1 w - - bm Nh5; id "WAC.175"; r1bq3r/ppp2pk1/3p1pp1/8/2BbPQ2/2NP2P1/PPP4P/R4R1K b - - bm Rxh2+; id "WAC.176"; r1b3r1/4qk2/1nn1p1p1/3pPp1P/p4P2/1p3BQN/PKPBN3/3R3R b - - bm Qa3+; id "WAC.177"; 3r2k1/p1rn1p1p/1p2pp2/6q1/3PQNP1/5P2/P1P4R/R5K1 w - - bm Nxe6; id "WAC.178"; r1b2r1k/pp4pp/3p4/3B4/8/1QN3Pn/PP3q1P/R3R2K b - - bm Qg1+; id "WAC.179"; r1q2rk1/p3bppb/3p1n1p/2nPp3/1p2P1P1/6NP/PP2QPB1/R1BNK2R b KQ - bm Nxd5; id "WAC.180"; r3k2r/2p2p2/p2p1n2/1p2p3/4P2p/1PPPPp1q/1P5P/R1N2QRK b kq - bm Ng4; id "WAC.181"; r1b2rk1/ppqn1p1p/2n1p1p1/2b3N1/2N5/PP1BP3/1B3PPP/R2QK2R w KQ - bm Qh5; id "WAC.182"; 1r2k1r1/5p2/b3p3/1p2b1B1/3p3P/3B4/PP2KP2/2R3R1 w - - bm Bf6; id "WAC.183"; 4kn2/r4p1r/p3bQ2/q1nNP1Np/1p5P/8/PPP3P1/2KR3R w - - bm Qe7+; id "WAC.184"; 1r1rb1k1/2p3pp/p2q1p2/3PpP1Q/Pp1bP2N/1B5R/1P4PP/2B4K w - - bm Qxh7+; id "WAC.185"; r5r1/p1q2p1k/1p1R2pB/3pP3/6bQ/2p5/P1P1NPPP/6K1 w - - bm Bf8+; id "WAC.186"; 6k1/5p2/p3p3/1p3qp1/2p1Qn2/2P1R3/PP1r1PPP/4R1K1 b - - bm Nh3+; id "WAC.187"; 3RNbk1/pp3p2/4rQpp/8/1qr5/7P/P4P2/3R2K1 w - - bm Qg7+; id "WAC.188"; 3r1k2/1ppPR1n1/p2p1rP1/3P3p/4Rp1N/5K2/P1P2P2/8 w - - bm Re8+; id "WAC.189"; 8/p2b2kp/1q1p2p1/1P1Pp3/4P3/3B2P1/P2Q3P/2Nn3K b - - bm Bh3; id "WAC.190"; 2r1Rn1k/1p1q2pp/p7/5p2/3P4/1B4P1/P1P1QP1P/6K1 w - - bm Qc4; id "WAC.191"; r3k3/ppp2Npp/4Bn2/2b5/1n1pp3/N4P2/PPP3qP/R2QKR2 b Qq - bm Nd3+; id "WAC.192"; 5bk1/p4ppp/Qp6/4B3/1P6/Pq2P1P1/2rr1P1P/R4RK1 b - - bm Qxe3; id "WAC.193"; 5rk1/ppq2ppp/2p5/4bN2/4P3/6Q1/PPP2PPP/3R2K1 w - - bm Nh6+; id "WAC.194"; 3r1rk1/1p3p2/p3pnnp/2p3p1/2P2q2/1P5P/PB2QPPN/3RR1K1 w - - bm g3; id "WAC.195"; rr4k1/p1pq2pp/Q1n1pn2/2bpp3/4P3/2PP1NN1/PP3PPP/R1B1K2R b KQ - bm Nb4; id "WAC.196"; 7k/1p4p1/7p/3P1n2/4Q3/2P2P2/PP3qRP/7K b - - bm Qf1+; id "WAC.197"; 2br2k1/ppp2p1p/4p1p1/4P2q/2P1Bn2/2Q5/PP3P1P/4R1RK b - - bm Rd3; id "WAC.198"; r1br2k1/pp2nppp/2n5/1B1q4/Q7/4BN2/PP3PPP/2R2RK1 w - - bm Bxc6 Rfd1; id "WAC.199"; 2rqrn1k/pb4pp/1p2pp2/n2P4/2P3N1/P2B2Q1/1B3PPP/2R1R1K1 w - - bm Bxf6; id "WAC.200"; 2b2r1k/4q2p/3p2pQ/2pBp3/8/6P1/1PP2P1P/R5K1 w - - bm Ra7; id "WAC.201"; QR2rq1k/2p3p1/3p1pPp/8/4P3/8/P1r3PP/1R4K1 b - - bm Rxa2; id "WAC.202"; r4rk1/5ppp/p3q1n1/2p2NQ1/4n3/P3P3/1B3PPP/1R3RK1 w - - bm Qh6; id "WAC.203"; r1b1qrk1/1p3ppp/p1p5/3Nb3/5N2/P7/1P4PQ/K1R1R3 w - - bm Rxe5; id "WAC.204"; r3rnk1/1pq2bb1/p4p2/3p1Pp1/3B2P1/1NP4R/P1PQB3/2K4R w - - bm Qxg5; id "WAC.205"; 1Qq5/2P1p1kp/3r1pp1/8/8/7P/p4PP1/2R3K1 b - - bm Rc6; id "WAC.206"; r1bq2kr/p1pp1ppp/1pn1p3/4P3/2Pb2Q1/BR6/P4PPP/3K1BNR w - - bm Qxg7+; id "WAC.207"; 3r1bk1/ppq3pp/2p5/2P2Q1B/8/1P4P1/P6P/5RK1 w - - bm Bf7+; id "WAC.208"; 4kb1r/2q2p2/r2p4/pppBn1B1/P6P/6Q1/1PP5/2KRR3 w k - bm Rxe5+; id "WAC.209"; 3r1rk1/pp1q1ppp/3pn3/2pN4/5PP1/P5PQ/1PP1B3/1K1R4 w - - bm Rh1; id "WAC.210"; r1bqrk2/pp1n1n1p/3p1p2/P1pP1P1Q/2PpP1NP/6R1/2PB4/4RBK1 w - - bm Qxf7+; id "WAC.211"; rn1qr2Q/pbppk1p1/1p2pb2/4N3/3P4/2N5/PPP3PP/R4RK1 w - - bm Qxg7+; id "WAC.212"; 3r1r1k/1b4pp/ppn1p3/4Pp1R/Pn5P/3P4/4QP2/1qB1NKR1 w - - bm Rxh7+; id "WAC.213"; r2r2k1/1p2qpp1/1np1p1p1/p3N3/2PPN3/bP5R/4QPPP/4R1K1 w - - bm Ng5; id "WAC.214"; 3r2k1/pb1q1pp1/1p2pb1p/8/3N4/P2QB3/1P3PPP/1Br1R1K1 w - - bm Qh7+; id "WAC.215"; r2qr1k1/1b1nbppp/p3pn2/1p1pN3/3P1B2/2PB1N2/PP2QPPP/R4RK1 w - - bm Nxf7; id "WAC.216"; r3kb1r/1pp3p1/p3bp1p/5q2/3QN3/1P6/PBP3P1/3RR1K1 w kq - bm Qd7+; id "WAC.217"; 6k1/pp5p/2p3q1/6BP/2nPr1Q1/8/PP3R1K/8 w - - bm Bh6; id "WAC.218"; 7k/p4q1p/1pb5/2p5/4B2Q/2P1B3/P6P/7K b - - bm Qf1+; id "WAC.219"; 3rr1k1/ppp2ppp/8/5Q2/4n3/1B5R/PPP1qPP1/5RK1 b - - bm Qxf1+; id "WAC.220"; r3k3/P5bp/2N1bp2/4p3/2p5/6NP/1PP2PP1/3R2K1 w q - bm Rd8+; id "WAC.221"; 2r1r2k/1q3ppp/p2Rp3/2p1P3/6QB/p3P3/bP3PPP/3R2K1 w - - bm Bf6; id "WAC.222"; r1bqk2r/pp3ppp/5n2/8/1b1npB2/2N5/PP1Q2PP/1K2RBNR w kq - bm Nxe4; id "WAC.223"; 5rk1/p1q3pp/1p1r4/2p1pp1Q/1PPn1P2/3B3P/P2R2P1/3R2K1 b - - bm Rh6 e4; id "WAC.224"; 4R3/4q1kp/6p1/1Q3b2/1P1b1P2/6KP/8/8 b - - bm Qh4+; id "WAC.225"; 2b2rk1/p1p4p/2p1p1p1/br2N1Q1/1p2q3/8/PB3PPP/3R1RK1 w - - bm Nf7; id "WAC.226"; 2k1rb1r/ppp3pp/2np1q2/5b2/2B2P2/2P1BQ2/PP1N1P1P/2KR3R b - - bm d5; id "WAC.227"; r4rk1/1bq1bp1p/4p1p1/p2p4/3BnP2/1N1B3R/PPP3PP/R2Q2K1 w - - bm Bxe4; id "WAC.228"; 8/8/8/1p5r/p1p1k1pN/P2pBpP1/1P1K1P2/8 b - - bm Rxh4 b4; id "WAC.229"; 2b5/1r6/2kBp1p1/p2pP1P1/2pP4/1pP3K1/1R3P2/8 b - - bm Rb4; id "WAC.230"; r4rk1/1b1nqp1p/p5p1/1p2PQ2/2p5/5N2/PP3PPP/R1BR2K1 w - - bm Bg5; id "WAC.231"; 1R2rq1k/2p3p1/Q2p1pPp/8/4P3/8/P1r3PP/1R4K1 w - - bm Qb5 Rxe8; id "WAC.232"; 5rk1/p1p2r1p/2pp2p1/4p3/PPPnP3/3Pq1P1/1Q1R1R1P/4NK2 b - - bm Nb3; id "WAC.233"; 2kr1r2/p6p/5Pp1/2p5/1qp2Q1P/7R/PP6/1KR5 w - - bm Rb3; id "WAC.234"; 5r2/1p1RRrk1/4Qq1p/1PP3p1/8/4B3/1b3P1P/6K1 w - - bm Qxf7+ Rxf7+; id "WAC.235"; 1R6/p5pk/4p2p/4P3/8/2r3qP/P3R1b1/4Q1K1 b - - bm Rc1; id "WAC.236"; r5k1/pQp2qpp/8/4pbN1/3P4/6P1/PPr4P/1K1R3R b - - bm Rc1+; id "WAC.237"; 1k1r4/pp1r1pp1/4n1p1/2R5/2Pp1qP1/3P2QP/P4PB1/1R4K1 w - - bm Bxb7; id "WAC.238"; 8/6k1/5pp1/Q6p/5P2/6PK/P4q1P/8 b - - bm Qf1+; id "WAC.239"; 2b4k/p1b2p2/2p2q2/3p1PNp/3P2R1/3B4/P1Q2PKP/4r3 w - - bm Qxc6; id "WAC.240"; 2rq1rk1/pp3ppp/2n2b2/4NR2/3P4/PB5Q/1P4PP/3R2K1 w - - bm Qxh7+; id "WAC.241"; r1b1r1k1/pp1nqp2/2p1p1pp/8/4N3/P1Q1P3/1P3PPP/1BRR2K1 w - - bm Rxd7; id "WAC.242"; 1r3r1k/3p4/1p1Nn1R1/4Pp1q/pP3P1p/P7/5Q1P/6RK w - - bm Qe2; id "WAC.243"; r6r/pp3ppp/3k1b2/2pb4/B4Pq1/2P1Q3/P5PP/1RBR2K1 w - - bm Qxc5+; id "WAC.244"; 4rrn1/ppq3bk/3pPnpp/2p5/2PB4/2NQ1RPB/PP5P/5R1K w - - bm Qxg6+; id "WAC.245"; 6R1/4qp1p/ppr1n1pk/8/1P2P1QP/6N1/P4PP1/6K1 w - - bm Qh5+; id "WAC.246"; 2k1r3/1p2Bq2/p2Qp3/Pb1p1p1P/2pP1P2/2P5/2P2KP1/1R6 w - - bm Rxb5; id "WAC.247"; 5r1k/1p4pp/3q4/3Pp1R1/8/8/PP4PP/4Q1K1 b - - bm Qc5+; id "WAC.248"; r4rk1/pbq2pp1/1ppbpn1p/8/2PP4/1P1Q1N2/PBB2PPP/R3R1K1 w - - bm c5 d5; id "WAC.249"; 1b5k/7P/p1p2np1/2P2p2/PP3P2/4RQ1R/q2r3P/6K1 w - - bm Re8+; id "WAC.250"; k7/p4p2/P1q1b1p1/3p3p/3Q4/7P/5PP1/1R4K1 w - - bm Qe5 Qf4; id "WAC.251"; 1rb1r1k1/p1p2ppp/5n2/2pP4/5P2/2QB4/qNP3PP/2KRB2R b - - bm Re2; id "WAC.252"; k5r1/p4b2/2P5/5p2/3P1P2/4QBrq/P5P1/4R1K1 w - - bm Qe8+; id "WAC.253"; r6k/pp3p1p/2p1bp1q/b3p3/4Pnr1/2PP2NP/PP1Q1PPN/R2B2RK b - - bm Nxh3; id "WAC.254"; 3r3r/p4pk1/5Rp1/3q4/1p1P2RQ/5N2/P1P4P/2b4K w - - bm Rfxg6+; id "WAC.255"; 3r1rk1/1pb1qp1p/2p3p1/p7/P2Np2R/1P5P/1BP2PP1/3Q1BK1 w - - bm Nf5; id "WAC.256"; 4r1k1/pq3p1p/2p1r1p1/2Q1p3/3nN1P1/1P6/P1P2P1P/3RR1K1 w - - bm Rxd4; id "WAC.257"; r3brkn/1p5p/2p2Ppq/2Pp3B/3Pp2Q/4P1R1/6PP/5R1K w - - bm Bxg6; id "WAC.258"; r1bq1rk1/ppp2ppp/2np4/2bN1PN1/2B1P3/3p4/PPP2nPP/R1BQ1K1R w - - bm Qh5; id "WAC.259"; 2r2b1r/p1Nk2pp/3p1p2/N2Qn3/4P3/q6P/P4PP1/1R3K1R w - - bm Qe6+; id "WAC.260"; r5k1/1bp3pp/p2p4/1p6/5p2/1PBP1nqP/1PP3Q1/R4R1K b - - bm Nd4; id "WAC.261"; 6k1/p1B1b2p/2b3r1/2p5/4p3/1PP1N1Pq/P2R1P2/3Q2K1 b - - bm Rh6; id "WAC.262"; rnbqr2k/pppp1Qpp/8/b2NN3/2B1n3/8/PPPP1PPP/R1B1K2R w KQ - bm Qg8+; id "WAC.263"; r2r2k1/1R2qp2/p5pp/2P5/b1PN1b2/P7/1Q3PPP/1B1R2K1 b - - bm Rab8; id "WAC.264"; 2r1k2r/2pn1pp1/1p3n1p/p3PP2/4q2B/P1P5/2Q1N1PP/R4RK1 w k - bm exf6; id "WAC.265"; r3q2r/2p1k1p1/p5p1/1p2Nb2/1P2nB2/P7/2PNQbPP/R2R3K b - - bm Rxh2+; id "WAC.266"; 2r1kb1r/pp3ppp/2n1b3/1q1N2B1/1P2Q3/8/P4PPP/3RK1NR w Kk - bm Nc7+; id "WAC.267"; 2r3kr/ppp2n1p/7B/5q1N/1bp5/2Pp4/PP2RPPP/R2Q2K1 w - - bm Re8+; id "WAC.268"; 2kr2nr/pp1n1ppp/2p1p3/q7/1b1P1B2/P1N2Q1P/1PP1BPP1/R3K2R w KQ - bm axb4; id "WAC.269"; 2r1r1k1/pp1q1ppp/3p1b2/3P4/3Q4/5N2/PP2RPPP/4R1K1 w - - bm Qg4; id "WAC.270"; 2kr4/ppp3Pp/4RP1B/2r5/5P2/1P6/P2p4/3K4 w - - bm Rd6; id "WAC.271"; nrq4r/2k1p3/1p1pPnp1/pRpP1p2/P1P2P2/2P1BB2/1R2Q1P1/6K1 w - - bm Bxc5; id "WAC.272"; 2k4B/bpp1qp2/p1b5/7p/1PN1n1p1/2Pr4/P5PP/R3QR1K b - - bm Ng3+; id "WAC.273"; 8/1p6/p5R1/k7/Prpp4/K7/1NP5/8 w - - bm Rb6; id "WAC.274"; r1b2rk1/1p1n1ppp/p1p2q2/4p3/P1B1Pn2/1QN2N2/1P3PPP/3R1RK1 b - - bm Nxg2 b5; id "WAC.275"; r5k1/pp1RR1pp/1b6/6r1/2p5/B6P/P4qPK/3Q4 w - - bm Qd5+; id "WAC.276"; 1r4r1/p2kb2p/bq2p3/3p1p2/5P2/2BB3Q/PP4PP/3RKR2 b - - bm Rg3 Rxg2; id "WAC.277"; r2qkb1r/pppb2pp/2np1n2/5pN1/2BQP3/2N5/PPP2PPP/R1B1K2R w KQkq - bm Bf7+; id "WAC.278"; r7/4b3/2p1r1k1/1p1pPp1q/1P1P1P1p/PR2NRpP/2Q3K1/8 w - - bm Nxf5; id "WAC.279"; r1r2bk1/5p1p/pn4p1/N2b4/3Pp3/B3P3/2q1BPPP/RQ3RK1 b - - bm Bxa3; id "WAC.280"; 2R5/2R4p/5p1k/6n1/8/1P2QPPq/r7/6K1 w - - bm Rxh7+; id "WAC.281"; 6k1/2p3p1/1p1p1nN1/1B1P4/4PK2/8/2r3b1/7R w - - bm Rh8+; id "WAC.282"; 3q1rk1/4bp1p/1n2P2Q/3p1p2/6r1/Pp2R2N/1B4PP/7K w - - bm Ng5; id "WAC.283"; 3r3k/pp4pp/8/1P6/3N4/Pn2P1qb/1B1Q2B1/2R3K1 w - - bm Nf5; id "WAC.284"; 2rr3k/1b2bppP/p2p1n2/R7/3P4/1qB2P2/1P4Q1/1K5R w - - bm Qxg7+; id "WAC.285"; 3r1k2/1p6/p4P2/2pP2Qb/8/1P1KB3/P6r/8 b - - bm Rxd5+; id "WAC.286"; rn3k1r/pp2bBpp/2p2n2/q5N1/3P4/1P6/P1P3PP/R1BQ1RK1 w - - bm Qh5; id "WAC.287"; r1b2rk1/p4ppp/1p1Qp3/4P2N/1P6/8/P3qPPP/3R1RK1 w - - bm Nf6+; id "WAC.288"; 2r3k1/5p1p/p3q1p1/2n3P1/1p1QP2P/1P4N1/PK6/2R5 b - - bm Qe5; id "WAC.289"; 2k2r2/2p5/1pq5/p1p1n3/P1P2n1B/1R4Pp/2QR4/6K1 b - - bm Ne2+; id "WAC.290"; 5r1k/3b2p1/p6p/1pRpR3/1P1P2q1/P4pP1/5QnP/1B4K1 w - - bm h3; id "WAC.291"; 4r3/1Q1qk2p/p4pp1/3Pb3/P7/6PP/5P2/4R1K1 w - - bm d6+; id "WAC.292"; 1nbq1r1k/3rbp1p/p1p1pp1Q/1p6/P1pPN3/5NP1/1P2PPBP/R4RK1 w - - bm Nfg5; id "WAC.293"; 3r3k/1r3p1p/p1pB1p2/8/p1qNP1Q1/P6P/1P4P1/3R3K w - - bm Bf8 Nf5 Qf4; id "WAC.294"; 4r3/p4r1p/R1p2pp1/1p1bk3/4pNPP/2P1K3/2P2P2/3R4 w - - bm Rxd5+; id "WAC.295"; 3r4/1p2k2p/p1b1p1p1/4Q1Pn/2B3KP/4pP2/PP2R1N1/6q1 b - - bm Rd4+ Rf8; id "WAC.296"; 3r1rk1/p3qp1p/2bb2p1/2p5/3P4/1P6/PBQN1PPP/2R2RK1 b - - bm Bxg2 Bxh2+; id "WAC.297"; 3Q4/p3b1k1/2p2rPp/2q5/4B3/P2P4/7P/6RK w - - bm Qh8+; id "WAC.298"; 1n2rr2/1pk3pp/pNn2p2/2N1p3/8/6P1/PP2PPKP/2RR4 w - - bm Nca4; id "WAC.299"; b2b1r1k/3R1ppp/4qP2/4p1PQ/4P3/5B2/4N1K1/8 w - - bm g6; id "WAC.300"; ****End ----== Posted via Pronews.Com - Unlimited-Unrestricted-Secure Usenet News==---- http://www.pronews.com The #1 Newsgroup Service in the World! 100,000 Newsgroups ---= - Total Privacy via Encryption =--- |
| Ads |
|
#2
|
|||
|
|||
|
Guest wrote: Sanny; Here are some standard test positions for you to use with GetClub chess. The classic Bratko-Kopec test and the Win-At-Chess test. There are many other test sets, but these are classics that most people use. The Bratko-Kopec tests are particularly intersting, since they've been around so long and so many programs have done it. They aren't extensive enough, which is why people use additional test sets, but they are classics that most people use. I did a google search on "Bratko-Kopec test" and learned a little bit about this. One issue was the small number of test positions, and another was the fact that their results always came out nearly a class too high, relative to superior methods. But here is where I have a real beef with this approach: apparently, some of the positions were snatched from the latest fads in openings theory, and if you have read some ultra-narrow analysis of the lines in question and could recall everything, you will pick the "correct" answer. Otherwise, you're just out of luck, since nobody -- but nobody -- can figure all this stuff out in the allotted two minutes. That, my friends, is not true chess skill. Bring Emanuel Lasker forth from the grave and have him answer whether or not White's Queen can be trapped in some peculiar line from /New In Chess/ magazine circa 1988, and he will of course fail. Yet he would make mincemeat of all these IMs and FMs who construct their tests to closely match their own, peculiar chess knowledge of the latest openings theory or even the supposedly "proper" pawn lever in some position or other. Sanny's program requires easier questions-- far easier, I would say. And what's more, you can't score answers with this simplistic right/ wrong approach; you need to penalize losing moves and reward good-enough answers by comparison, just as in real-world play. In fact, it seems to me that a well-programmed chess engine might be a good way to figure out the real strength of Sanny's program, provided it can reasonably weaken its own play all the way down to GetClub's low level. Suppose for a moment that Fritz's "limit playing strength" function worked like magic, perfectly simulating the various playing strengths from zero to infinity. All you would have to do is have it play the GC program a large number of games at various settings, and look to see where the results evened out, where it won and lost equally. That would tell you it's actual strength, based on entire games-- not just a few, peculiar chess positions. There would be no opportunity for controversy over whether or not any particular "answer" was correct, for the games' results would settle the issue objectively, based on a very large number of chess decisions in all sorts of positions. It seems to me, from what little I saw (for free) of these Kopec positions, they are quite simply the sort of thing that Sanny's program could not solve, except perhaps by dumb luck; they are too advanced for weak programs. What is needed is a test of simple, straight-forward tactics, where one may win a piece, force checkmate, that sort of thing. Start with one-movers, then have some questions that require seeing two full moves ahead, then three and then four. -- help bot |
|
#3
|
|||
|
|||
|
"help bot" wrote in message
... I did a google search on "Bratko-Kopec test" and learned a little bit about this. One issue was the small number of test positions, and another was Definetly. the fact that their results always came out nearly a class too high, relative to superior methods. As I told Sanny a long time ago, the ratings numbers are bogus. I warned him about that months ago. I should have warned him again, but I doubted he would have actually tried it. (And to be honest... even if he had, I suspect those numbers would have still been lower than what he advertises.) Even with a couple 'recalibrations' / revistation's over the years, the ratings numbers aren't really accurate. That's not what's being tested here. What's being tested is solely whether or not GC can find the accepted move. No matter how bad your program is, with today's hardware, you should be able to get at least half right. Less than that suggests serious issues with the program. And any program that misses the first position has *major* problems. But here is where I have a real beef with this approach: apparently, some of the positions were snatched from the latest fads in openings theory, and if you have read some ultra-narrow analysis of Computers don't have to worry about reading... They calculate everything. the lines in question and could recall everything, you will pick the "correct" answer. Otherwise, you're just out of luck, since nobody -- but nobody -- can figure all this stuff out in the allotted two minutes. The tests have been widely used in the computer chess world since the report was done. (In fact, the original testing method was essentially impossible to do on computers because it required 2nd & 3rd best lines. But computer chess people long since ignored that.) Even though everybody knew the numbers were bogus and the positions not the best, it was still a standard test that everybody could share and compare. That's why I bothered to suggest it. And if you'll notice, at no time did I ever say the BK or WAC test suites were the best. I said they were classics that most people use. I think there is some confusion here... You are taking the Bratko-Kopec test as they originally intended. As Bratko still advertises it as if it's valid. That's not what computer chess people do. And I didn't suggest he do that. We treat it like any other test suite of positions. The big difference is that instead of it being a private collection of test positions, these are public. And for whatever reason, the positions themselves began to be used as a way to test chess programs and to compare chess programs. Since then, other sets have also become very popular. Such as the Win-At-Chess set, which I also suggested. (I guess everybody has a set of 'favorites', but BK & WAC are certainly among the most common test suites used.) But we don't treat the BK test like they originally intended. You either find the right move and get a +1 score, or you get it wrong and you get a zero. Nothing more. I could have left out the BK test. But the big reason I did suggest it is because it's been used for so long that there are results available for a number of classic chess programs. Sargon 2.5, Sargon 4, Sargon 5, Awit, Chaos, Belle, CrayBlitz, Deep Thought, HiTech, And others. Those are just some I happen to know of off hand. (Plus, of course, any current program can be easily tested with a mere 24 positions of the BK test suite.) It was a way to get some results that could be compared to other, older programs. Some people in here have compared GC to Sargon, Super Connie, etc. I could have also posted other test sets, but I figured two were enough. We'd be lucky if Sanny even did one single position. comparison, just as in real-world play. In fact, it seems to me that a well-programmed chess engine might be a good way to figure out the real strength of Sanny's program, provided it can reasonably weaken its own play all the way down to GetClub's low level. No. There are significant problems in computer vs. computer play. It makes it very hard to actually determine the strength of the new program. For example, the one with the deeper search is likely to win. Regardless of whether its the stronger program when playing humans. Computer vs. computer is just not the same human vs. human or computer vs. human. Hyatt has also noticed that doing too few different starting positions can lead to some very eratic testing results in computer vs. computer. Even with lots of games (thousands to millions) the results are still eratic from one batch to the next. (Has to do with timing fluctuations and limited variability of starting positions.) He's done testing for a year and a half and the eratic nature is quite repeatable. Most people just don't do enough games to notice or do their testing in a different way. Yes, computer vs. computer tournaments are often done. There are often a problem though. First, most tournaments have far too few rounds. You just can't get enough accuracy playing a few dozen games against several other opponents. (The World computer chess championship and all the older similar tournaments were perpitrators of such limited games.) Second, you need to play more than just one or two programs. Sanny playing only against Rybka and coming up with ratings is definetly wrong. Third, for a quality program to actually weaken its play down to a specific level is actually harder than you might think. It's more than just weakening the eval and making the occasional tactical blunder because at other times, the search is going to see so much that most of its moves will still be too strong. The point is, it's difficult to get realistic reduced playing strengths. It can be done, but it's not easy. Any program that gives reduced ratings levels should be looked at with suspicion unless there is testing to back it up. To do computer vs. computer testing reliably and get a reasonable rating, you'd need several programs with reliable strengths and you'd need lots of games for each time control of interest. You need to automate it. You need lots of different opening positions, so opening books aren't being judged. Etc. ----== Posted via Pronews.Com - Unlimited-Unrestricted-Secure Usenet News==---- http://www.pronews.com The #1 Newsgroup Service in the World! 100,000 Newsgroups ---= - Total Privacy via Encryption =--- |
|
#4
|
|||
|
|||
|
Guest wrote: the fact that their results always came out nearly a class too high, relative to superior methods. As I told Sanny a long time ago, the ratings numbers are bogus. I warned him about that months ago. I should have warned him again, but I doubted he would have actually tried it. (And to be honest... even if he had, I suspect those numbers would have still been lower than what he advertises.) True, but why not re-calibrate the test scores so that they are in better accord with reality? Even with a couple 'recalibrations' / revistation's over the years, the ratings numbers aren't really accurate. That's not what's being tested here. What's being tested is solely whether or not GC can find the accepted move. Ah, but the answers to those few problems are *not* accepted. Some are disputed, and when you consider the time frame, a few are probably wrong, which means the scoring system penalizes you for not agreeing with it, rather than for not finding the best moves. No matter how bad your program is, with today's hardware, you should be able to get at least half right. Less than that suggests serious issues with the program. We already know that Sanny's program has serious issues. What we don't know is just what sort of rating all those issues, combined together, will net. Will it be 1200? 1700? But here is where I have a real beef with this approach: apparently, some of the positions were snatched from the latest fads in openings theory, and if you have read some ultra-narrow analysis of Computers don't have to worry about reading... They calculate everything. My point, exactly. The latest fads may well have it wrong, and if a 1988 /New in Chess/ shows 19. ...h6!!! is best against the hyper-Borg Attack, it may well be that in 2007 Rybka 2.0c finds a mate-in-twenty refutation to Kopec's "correct" answer. The tests have been widely used in the computer chess world since the report was done. Okay. Are you trying to suggest that wide- spread adoption -- as with the Ptolemaic model of the Universe -- is somehow proof of correct- ness? Even though everybody knew the numbers were bogus and the positions not the best, it was still a standard test that everybody could share and compare. That's why I bothered to suggest it. And if you'll notice, at no time did I ever say the BK or WAC test suites were the best. I said they were classics that most people use. But Sanny's program is relatively weak. Why do you want him to use a test suite that gives his engine no real chance of finding the supposedly correct answers? A short article I read implied that the test suite was designed to see if strong chess programs of that era could find the "right" moves or ideas that masters or IMs may find. This is way, way out of Sanny's league; his program may well score a big, fat zero on every level, barring dumb luck answer-matches by sheer chance. What useful information would such a result provide-- that the program is not a grandmaster? (We already know this!) But we don't treat the BK test like they originally intended. You either find the right move and get a +1 score, or you get it wrong and you get a zero. Nothing more. That's just plain silly. If a chess engine plays a terrible move, that fact needs to be factored into the score, just as if it finds another viable move which is not an exact match to Brat-Kopec, there should be no penalty. I could have left out the BK test. But the big reason I did suggest it is because it's been used for so long that there are results available for a number of classic chess programs. Sargon 2.5, Sargon 4, Sargon 5, Awit, Chaos, Belle, CrayBlitz, Deep Thought, HiTech, And others. Those are just some I happen to know of off hand. And they also just happen to be among the strongest chess programs of their eras. Why are you intent on rating GetClub on a scale so high that it's score will be "off the chart"? I could have also posted other test sets, but I figured two were enough. We'd be lucky if Sanny even did one single position. I somehow doubt that he will even understand your explanation of the complex notation. comparison, just as in real-world play. In fact, it seems to me that a well-programmed chess engine might be a good way to figure out the real strength of Sanny's program, provided it can reasonably weaken its own play all the way down to GetClub's low level. No. There are significant problems in computer vs. computer play. It makes it very hard to actually determine the strength of the new program. For example, the one with the deeper search is likely to win. Regardless of whether its the stronger program when playing humans. This commentary seems to completely ignore the gist of my idea: playing Fritz set to perfectly emulate a 1200 player is *not* the same as a match between GetClub and Fritz; the issues you describe do not just carry over. Computer vs. computer is just not the same human vs. human or computer vs. human. Hyatt has also noticed that doing too few different starting positions can lead to some very eratic testing results in computer vs. computer. Even with lots of games (thousands to millions) the results are still eratic from one batch to the next. (Has to do with timing fluctuations and limited variability of starting positions.) That sort of testing is not practical, as nobody is going to sit down and manually play GetClub against some other 1200- rated computers. He's done testing for a year and a half and the eratic nature is quite repeatable. Most people just don't do enough games to notice or do their testing in a different way. Maybe there is something wrong with the way he is doing this; sometimes people can be blinded to even the most obvious of things. (Take that one fellow who played with a sting ray, for instance; do you think for one moment he did not know that he was swimming within striking range? Maybe he thought his quick reflexes would allow him to jump out of the way, as when a gator strikes.) Second, you need to play more than just one or two programs. Sanny playing only against Rybka and coming up with ratings is definetly wrong. I repeat: not only is it meaningless to test GetClub against Rybka, but any test suite that is designed for a set level, such as Expert/Master/SM, is just as wrongheaded. We already know that Sanny's program is well below those levels; what we don't know is where among the Class B/C/D/E sections it fits, exactly. The test questions need to tackle everything from the simplest tactics to more difficult problems-- and Mr. Kopec's seem to miss this aim by a country mile. Third, for a quality program to actually weaken its play down to a specific level is actually harder than you might think. It's more than just weakening the eval and making the occasional tactical blunder because at other times, the search is going to see so much that most of its moves will still be too strong. I know that; this is why I postulated a supposedly perfect emulation instead of suggesting that any such thing actually exists. The point is, it's difficult to get realistic reduced playing strengths. It can be done, but it's not easy. Any program that gives reduced ratings levels should be looked at with suspicion unless there is testing to back it up. Such as the testing that showed the Kopec test results to be about a full class off base? To do computer vs. computer testing reliably and get a reasonable rating, you'd need several programs with reliable strengths and you'd need lots of games for each time control of interest. You need to automate it. You need lots of different opening positions, so opening books aren't being judged. This goes right back to my criticism of a few of the Kopec test questions, which one article related were based on one's familiarity with then recently-published and dubious openings theory. I would just as soon avoid the issue of openings /opinions/ and focus instead on what is known as fact. Including dubious opinions into such tests is pretty much counter-productive, while adding a variety of questions -- to which the answers are indisputable -- would serve to improve the process immensely. -- help bot |
|
#5
|
|||
|
|||
|
I will try them next week as I need to convert them to the format
which GetClub understands. Today a big bug was found and removed. Now it will not show awkward scores. I was very worried it used to show +2 benefit in equal positions. Now it show 0.02 in equal position. So the Scores will be correct as per the position of the board. That was a great Bug which give 2 pawn losses in equal positions. Thats why GetClub used to sacrifice its pawns for no reasions. Having corrected that I feel GetClub will play stronger moves. Bye Sanny Play Chess at: http://www.GetClub.com/Chess.html |
|
#6
|
|||
|
|||
|
Here is what I think. Here is my guess work
Beginner Level will not be able to find any of the correct moves Easy Level will solve 30% of them correctly. Normal Level will solve 60% of them. Master Level will solve 80% of them Rybka will solve 90% of them. Before I test them, tell me what is the validity that the above results are correct. And they are the only moves that GetClub should play? It may happen the best move is wrong. Who has checked that these results are OK? In opening both e4 and d4 are correct So what if the GetClub chose another correct alternatives? Are the best moves the only move that has to be played in these positions? r1r2bk1/5p1p/pn4p1/N2b4/3Pp3/B3P3/2q1BPPP/RQ3RK1 b - - bm Bxa3; id "WAC.280"; here b - - means Black to play and No Chastling allowed. And best move is Bxa3. If Black plays something else how much bad will be that. What should the rating of GetClub if GetClub score 30% ? What should the rating of GetClub if GetClub score 40% ? What should the rating of GetClub if GetClub score 50% ? What should the rating of GetClub if GetClub score 60% ? What should the rating of GetClub if GetClub score 70% ? What should the rating of GetClub if GetClub score 80% ? What should the rating of GetClub if GetClub score 90% ? What should the rating of GetClub if GetClub score 100% ? Is this the way to find the correct ratings? How much time should GetClub be given to solve these positions? 10 sec or 1 min or 5 min??? Bye Sanny Play Chess at: http://www.GetClub.com/Chess.html |
|
#7
|
|||
|
|||
|
Sanny wrote: Here is what I think. Here is my guess work Beginner Level will not be able to find any of the correct moves Easy Level will solve 30% of them correctly. Normal Level will solve 60% of them. Master Level will solve 80% of them Rybka will solve 90% of them. Before I test them, tell me what is the validity that the above results are correct. I have not seen but a couple of these types of test positions being discussed by others, but I believe you may find that they are far too difficult for GetClub to handle well. Rybka, on the other hand, could be used as a fair guide to which test answers are dubious; just make sure you allow plenty of thinking time so she can zoom up to speed (~3000). And they are the only moves that GetClub should play? It may happen the best move is wrong. Who has checked that these results are OK? I read that some of the answers were, as they termed it, controversial. In other words, the "correct" answer was decided by a measly International Master, before the advent of really strong chess engines. In opening both e4 and d4 are correct So what if the GetClub chose another correct alternatives? The starting position will hopefully not be one of the test questions. What should the rating of GetClub if GetClub score 30% ? I found some information about this by simply doing a google search on "Bratco- Kopec test". Unfortunately, to actually take the test on screen, you had to be a member. What should the rating of GetClub if GetClub score 40% ? What should the rating of GetClub if GetClub score 50% ? What should the rating of GetClub if GetClub score 60% ? What should the rating of GetClub if GetClub score 70% ? I wouldn't worry about that. What should the rating of GetClub if GetClub score 100% ? That would indeed be scary; it would indicate that you had specifically fine-tuned your GC engine for this crude test, and succeeded!! Is this the way to find the correct ratings? No. How much time should GetClub be given to solve these positions? This was all explained in the original post in this thread. The test I saw discussed on the Web had a strict limit of two minutes per test position, and the problems were very complex. In fact, their scale for the various test results did not go above the level of the test's designer (i.e. IM), and as I recall, the lower limit was not useful for our purposes here. In my opinion, you need a test suite which has lots of different types of positions, and which can tell you if your program is a USCF 1200, a 1500 or an 1800. The Kopec test is simply in the wrong range of strengths. -- help bot |
|
#8
|
|||
|
|||
|
"help bot" wrote in message
... As I told Sanny a long time ago, the ratings numbers are bogus. I warned him about that months ago. I should have warned him again, but I doubted he would have actually tried it. (And to be honest... even if he had, I suspect those numbers would have still been lower than what he advertises.) True, but why not re-calibrate the test scores so that they are in better accord with reality? Because there is considerable doubt that so few positions can even give an accurate rating, even allowing for a couple hundred point downward adjustment and a couple hundred points width in ratings aproximation.. There just isn't enough variety. It is possible that more positions (several hundred) might be able to give a more accurate rating estimate, but I'm not aware of any research into developing such a test & rating scale. Don't misunderstand me, I have no objection to the *idea* of running a quick test suite and using that to calculate an aproximate playing ability. I think it'd be quite handy. It's just that to my knowledge, there has been no more research into developing such a test, and there's no proof that such test results are reliable. It is true that strong programs tend to do tolerably well on the tests. But that doesn't mean the other direction is true. That if you do well on the test the program will actually play reasonably well. The program could end up being tuned for that test and be terrible in other positions. Even with a couple 'recalibrations' / revistation's over the years, the ratings numbers aren't really accurate. That's not what's being tested here. What's being tested is solely whether or not GC can find the accepted move. Ah, but the answers to those few problems are *not* accepted. Some are disputed, and when you consider the time frame, a few are A lot of answers are disputed in all the common tests. That's one of the big problems with test suites, and one of the reasons that you can't easily make a ratings estimation from them. Depending on how you play, you might actually prefer a different move. Depending on who you are playing (and their skill and techniques) you might prefer a different move. For example, wasn't it Tal who loved to get into very risky tactical positions, and overwhelm his opponents? He did that because his opponents were uncomfortable with those positions. He certainly couldn't do that with any computer chess program. So even *who* you are playing can effect what's considered the 'best' move. With deeper investigation (or searching) it's possible that an accepted move is found to be less than expected. That's the risk you take with any sort of testing. Whether it's testing positions or testing games. In my first message in this thread, I did point out that very possibility. But I also pointed out that at least with WAC, a few missed moves aren't going to significantly effect his score. There are newer versions of the WAC test suite with 'revised' answers. Answers that are based on current research and deep computer searches. But even those could be disputed. We already know that Sanny's program has serious issues. What we don't know is just what sort of rating all those issues, combined together, will net. Will it be 1200? 1700? But with test positions, *he* can see that his program has a problem with a particular position that other programs easily get. Computers don't have to worry about reading... They calculate everything. My point, exactly. The latest fads may well have it wrong, and if a 1988 /New in Chess/ shows 19. ...h6!!! is best against the hyper-Borg Attack, it may well be that in 2007 Rybka 2.0c finds a mate-in-twenty refutation to Kopec's "correct" answer. Possibly Things do change as positions get analyzed more & more. But that's true of opening theory that people use to play games. And endgame theory that people use. But usually that doesn't happen because a particular move is 'best' for a reason. So it's not really any different from with humans. The tests have been widely used in the computer chess world since the report was done. Okay. Are you trying to suggest that wide- spread adoption -- as with the Ptolemaic model of the Universe -- is somehow proof of correct- ness? Who said anything about every move being correct? In fact, in my original message I went out of my way to point out that some positions have contested moves. But the idea of a good test suite is that there tends to be only one 'best' move, even if there are several 'good' moves. Many of them are 'best' for a particular reason, not because of any fad. The idea here is that test positions are something he can use and judge by, and something that he can accurately report. Or would you rather his next message say that now beginner level has reached 2800 elo? Even though everybody knew the numbers were bogus and the positions not the best, it was still a standard test that everybody could share and compare. That's why I bothered to suggest it. And if you'll notice, at no time did I ever say the BK or WAC test suites were the best. I said they were classics that most people use. But Sanny's program is relatively weak. Why do you want him to use a test suite that gives his engine no real chance of finding the supposedly correct answers? He does have a chance. Not to solve all of the positions, but that's kind of the point. You don't want a test where you can solve them all. The test suites I suggested are not the hardest around. They are actually fairly basic. And the time controls I suggested are actually fairly generous. Many people use 5 seconds a move for their programs in these suites. I suggested he try 5, 10 and 30 seconds. He can even go to one minute if he wants to spend that much time. Just as long as he does so consistantly. They are often used in the early stages of chess programming. Remember, computer chess searching is a lot different from the way humans play. So programs can actually find quite a few of these positions simply because of their tactical search. But hey, if *you* don't like these test suites, feel free to suggest some of your own. I have absolutely no problem with that. A short article I read implied that the test suite was designed to see if strong chess Not sure which test you are talking about. Win-At-Chess test suite is, of course, from a chess teaching book called "Win At Chess" The Bratko-Kopec test was also derived from several teaching chess books, although I can't remember their names. Incidentally, if you want to read the original paper on the BK test, http://spider.sci.brooklyn.cuny.edu/...s/P_index.html http://spider.sci.brooklyn.cuny.edu/...ons/O_42_C.pdf There are several papers about the BK test, but I don't happen to have a handy link to them. One is "The Bratko-Kopec test Revisited" by T. Marsland, and another is "The Bratko-Kopec Test Recalibrated" by Benn &^ Kopec. And a couple others that I can't find. programs of that era could find the "right" moves or ideas that masters or IMs may find. This is way, way out of Sanny's Many of the 'best moves' are actually done for tactical reasons that computers can do. For the positional ones, he'll get some right and some wrong. For the tactical ones, he'll miss some, too. And then he'll have some idea of where his program is failing and where to improve it. That's what test positions are all about. league; his program may well score a big, fat zero on every level, barring dumb Bet not. I wouldn't be at all surprised if TSCP (or the CPW clone) or micromax could solve quite a few of them. Remember, chess is tactics and because of that, many 'best moves' are done for eventual tactical reasons. Tactics are something computers are pretty good at. But no, I don't expect him to score as well as many other programs. That doesn't matter. These tests give solid answers rather than the vague kind of testing he's been doing. But we don't treat the BK test like they originally intended. You either find the right move and get a +1 score, or you get it wrong and you get a zero. Nothing more. That's just plain silly. If a chess engine plays a terrible move, that fact needs to be factored into the score, just as if it finds another viable move which is not an exact match to Brat-Kopec, there should be no penalty. There are some tests that do have 'avoid' moves. But those are harder tests. Let's keep things simple for him and just give him points for the ones he gets right. For the ones he doesn't, he needs to work on. Remember, most of these have only one best move for a reason. Implicitly if you don't choose it, you made a terrible move. (Not all are like that, but we can pretend that for now.) I could have left out the BK test. But the big reason I did suggest it is because it's been used for so long that there are results available for a number of classic chess programs. Sargon 2.5, Sargon 4, Sargon 5, Awit, Chaos, Belle, CrayBlitz, Deep Thought, HiTech, And others. Those are just some I happen to know of off hand. And they also just happen to be among the strongest chess programs of their eras. Why are you intent on rating GetClub on a scale so high that it's score will be "off the chart"? (shaking head....) You really don't get how much computer performance effects things. With modern hardware, even a *SIMPLE* program like MicroMax has a higher rating that Sargon 2.5 / 4 / 5 or Awit or Chaos did back in their prime. I think TSCP can score around 1800. Faile over 2000. Neither of those are complicated programs. Even a semi-smart program easily has a higher rating than Belle or CrayBlitz. By today's standards, the early CrayBlitz is an idiot running on slower hardware than many PDA's have. Modern hardware is *FAST* and it really helps. I once had a discussion with Sanny about the "Technology curve". About how much faster hardware can help your play and ratings. Going from the hardware from back then (such as sargon 2.5) to now is going to really help! So his scores should be comparable to what the Sargons could do. (I said should... not would...) If he improves his program, then he can compare to Belle, CrayBlitz, etc. Kind of scary, huh? I could have also posted other test sets, but I figured two were enough. We'd be lucky if Sanny even did one single position. I somehow doubt that he will even understand your explanation of the complex notation. (shrug) I tried. Feel free to offer him other, supposedly better test suites. This commentary seems to completely ignore the gist of my idea: playing Fritz set to perfectly emulate a 1200 player is *not* the same as a match between GetClub and Fritz; the issues you describe do not just carry over. Then I misunderstood what you were trying to say. That sort of testing is not practical, as nobody is going to sit down and manually play GetClub against some other 1200- rated computers. That's why it needs some way to do automatic testing. But I don't think he's going to do that. He's done testing for a year and a half and the eratic nature is quite repeatable. Most people just don't do enough games to notice or do their testing in a different way. Maybe there is something wrong with the way he is doing this; sometimes There has been a lot more discussion going on and even a mathematician joined in. To put it simply 1) the computer clock introduces a lot of randomness in the game. Doing a timed game is different from doing a fixed number of nodes or plies. Hyatt has shown that even doing a few hundred nodes more or less can lead to radically different games with different results. A few hundred nodes can be enough to determine whether the computer picks a great move or an okay move or even a blunder but didn't have time enough to notice. (Note: This has been comfirmed by others.) 2) You need an adequate number of opponents. 3) You need to have enough opening positions to test from. Hyatt had been using 40 standard positions and that isn't enough. You need more but it's not sure yet how many are enough. 4) You need lots of games, since the clock introduces some randomness. It's not yet determined how many games you need to get reliable results. Most people simply don't do enough testing to be able to notice these variations. You make a change in your program, you run a few simple tests and you decide to keep it or throw it out. Most people just assumed the method was valid. They never did the scale of testing Hyatt has to verify it. It's not. Or at least it can easily be invalid. It's still being investigated as to the best way to do reliable testing with reasonable error bounds. people can be blinded to even the most obvious of things. (Take that one fellow who played with a sting ray, for instance; Stingings aren't too uncommon because they can get steped on. Fatalities are rare though. However, stingrays are actually pretty gentle creatures unless you startle or harrass them. Leave them alone and don't startle them, and they don't care about you. do you think for one moment he did not know that he was swimming within striking range? Maybe he thought his quick reflexes would allow him to jump out of the way, as when a gator strikes.) You are talking about Irwin, and the couple of others who were stung very shortly after that. No, he was leaving it alone. Swiming near it observing it, but leaving it alone. The same thing thousands of other divers do every year. The problem was, he somehow startled it. And by sheer chance the barb went into his heart. The odds are good that if he had left the barb in, he would have survived. I agree the guy was a bit of a nut, but not because of the stingray. Second, you need to play more than just one or two programs. Sanny playing only against Rybka and coming up with ratings is definetly wrong. I repeat: not only is it meaningless to test GetClub against Rybka, but any test suite that is designed for a set level, such as Expert/Master/SM, is just as wrongheaded. Then feel free to offer your own choices to Sanny.... We already know that Sanny's program is well below those levels; what we don't know is where among the Class B/C/D/E sections it fits, exactly. The test questions need to tackle everything from the simplest tactics to more difficult problems-- and Mr. Kopec's seem to miss this aim by a country mile. Half of them are tactical positions. Most chess programs should be able to do pretty well on that part. Bu they, I'm not saying the BK or WAC test is best, either. The point is, it's difficult to get realistic reduced playing strengths. It can be done, but it's not easy. Any program that gives reduced ratings levels should be looked at with suspicion unless there is testing to back it up. Such as the testing that showed the Kopec test results to be about a full class off base? For computers, it's far more than that. ----== Posted via Pronews.Com - Unlimited-Unrestricted-Secure Usenet News==---- http://www.pronews.com The #1 Newsgroup Service in the World! 100,000 Newsgroups ---= - Total Privacy via Encryption =--- |