Getting data for a Freecell AI

I'm training a neural network to play freecell.

As mentioned in part 1 of this series, there are several existing freecell solvers using a variation of A* search. To differentiate this project, I have set a goal:

  • The goal of this project is to build a neural network to play Freecell by looking at a board state and choosing the next move, without search/undos.

In summary, I want to build a neural network NN such that next_move = NN(board_state).

Time to come up with a plan and get started.

Overall plan

In short, the plan is to build a freecell engine, collect game data, train a neural network and then deliver the network in a "usable" state.

Step 1: Build a freecell engine. The AI has to play something, after all. Including a browser UI will let me collect my own solutions, let us watch the AI play in the final product, and give me a fun game to play.

Step 2: Collect data. Can manually solve games, look for solutions online, or generate solutions from solvers. Will need a serialization format to store, read and share the data.

Step 3: Train a neural network. This is the meat of the project. The plan is to start simple (feed forward), gradually increasing in complexity as necessary. I want to take incremental steps to help build an intuitive understanding of how each piece works.

Step 4: Deliver the freecell AI. Ideally a web app that anybody can visit and watch a game of freecell get solved. Idea is to statically host it with JS/wasm (wgpu?) inference. If it's too large or slow, an easy "run locally" is acceptable. Can serve some recorded NN games in that case.

With a plan born mostly of optimism, let's get started.

Build a freecell engine

I built a freecell engine in Rust. It includes a basic CLI interface as well as a web app. I compiled the Rust code to wasm using wasm-bindgen, so it uses the exact same engine everywhere. The JS frontend is only used to render the cards and handle clicks.

The web ui is functional but barebones, without animations or even drag-and-drop. I think there are probably moves that are impossible to make in the web UI at the moment.

One neat feature is that the core game engine works without any heap allocations. I built it this way mostly out of performance curiosity, but it turned out to be extra useful because (spoiler!) I would later use the same strategy to interface with the neural networks.

Collect data

The first data I found was a solution catalog from Michael Keller, containing several hundred manual solutions to games. It introduced me to a serialization format, which took me down a brief detour.

Standard notation

The most common serialization format used online is "standard notation". The format is a game number (from Microsoft freecell) and a sequence of moves. Each move is encoded as 2 characters: the position to move from and the position to move to. Valid positions are:

  • Free cells, as abcd
  • Home, as h
  • Card columns, as 12345678

An example solution:

#527 Michael Keller
8d 83 43 d8 2a 23 2b 25 27 2c
c2 62 64 42 4c 4d 43 41 81 d8
32 17 13 82 83 b4 84 68 68 1b
6d 61 a6 d6 b6 c1 76 16 4a 4b
54 5d 57 56 b6 a6 75 7h 73 1a
1b 14 d1 b1 71 74 7b a3 34 31
35

Where the first two moves are

  1. Move from column 8 to free cell 4
  2. Move from column 8 to column 3

This format is good for humans reading the solutions while playing a game, but it is lacking for my use case of training a neural network:

  • Turning a game number into a deck requires duplicating Microsoft freecell's deck shuffle algorithm
  • Ambiguous moves are possible when moving to an empty column
  • Arbitrary shuffles are not supported, and both parties need to agree on the shuffling algorithm
  • This format does not list "automatic moves" commonly done by freecell game engines

Better serialization

I wanted a format that was unambiguous, url safe and portable.

A basic initial idea is to serialize my rust game engine structs to JSON, and then base64 encode it. serde makes this easy. But doesn't it seem silly to use base64 to serialize data consisting of 52 cards? Well, what would base52 be anyhow? It could be just the alphabet, A-Za-z.

I came up with a new format that is fully unambiguous, web safe, and easy for a computer to parse.

  • Each card assigned a letter of the alphabet, using both upper and lower case
  • A game/deck is a sequence of 52 cards, dealt row-wise from left to right
  • Moves are encoded as a card to move, and a destination
  • Destinations are encoded the same way, with abcd for free cells, h for home and 1-8 for columns

Examples:

  • A is the ace of spades, B is the two of spaces, and n is the ace of diamonds
  • A game/deck is any shuffle of the 52 char string A-Za-z
  • Zc is a move for the king of hearts to the third free cell

In hindsight, I wish I used only f for free cells, instead of abcd, since which cell is chosen doesn't matter if the moves are valid.

Compared to the standard notation, this format is the same size (for moves), unambiguous, and portable. However, it is not human readable.

More data, all the data!

While I had a few hundred high quality (manual) game solutions already, I was worried that it wouldn't be enough data to generalize to solving any of the ~10^64 possible Freecell deals.

I decided to generate solutions, using the fc-solve code. It took me a while to build the code, then to figure out the options to the various programs.

I finally stitched together some Python to parallelize execution and generate two solutions for each of 32k games in only a couple minutes. After several mildly frustrating hours, I was able to publish the data I generated online.

A few days later, I found the freecell project, which contains an open database of one million Freecell solutions. However, that repository is licensed under GPL 3.0, which is too long for me to read. So for now, I'm going to avoid using that data.

That's it for now, next time I'll talk about designing my first neural networks.