The BOXES algorithm was originally developed by Michie and Chambers (1968) and was one of the first examples of reinforcement learning. The version described here is a modification of the original algorithm that is simpler and requires fewer trials to learn to balance a pole-and-cart system.

You can find the source code for a Java applet and a brief description of the BOXES algorithm here.


