$24
This project is intended to explore how data collection and inference can inform future action and future data collection. This is a situation frequently confronted by arti cial intelligence agents operating in the world - based on current information, they must decide how to act, balancing both achieving a goal and collecting new information. Additionally, this project stresses the importance of formulation and representation. There are a number of roughly equivalent ways to express and solve this problem, it is left to you to decide which is best for your purposes.
Background: MineSweeper
In the game minesweeper, you are presented with a square grid landscape of cells. Hidden in some of the cells are `mines'. At every turn, you may select a cell to uncover. At this point, one of two things will happen: if there is a mine at that location, it explodes and you lose the game; if there is not a mine at that location, it reveals a number, indicating the number of adjacent cells where there are mines. If the cell reveals 0, all the surrounding 8 cells are empty of mines. If the cell reveals 8, all 8 adjacent cells must have mines. For any value in between, for instance 4, you know that half of the adjacent cells have mines, but you cannot be sure by this clue alone which half are dangerous and which half are safe.
The goal of the game is to identify the locations of all the mines (if possible); by collecting clues and information, you can begin to infer which cells are dangerous and which are safe, and use the safe cells to collect more information. This process is iterated until, hopefully, all cells are either uncovered, marked as clear, or marked as mined.
Figure 1: In the rst board, all cells are unknown. The rst move reveals a 3, in which case we may infer that all three surrounding cells have mines. We mark them as such, and know not to ever search those cells. The second move reveals a 0, in which case we know that all 8 surrounding cells must be clear. We mark them as clear, and know that we can search them without worry. Collecting data from the clear cells, we are able to infer the locations of more mines.
The goal of this project is to write a program to play MineSweeper - that is, a program capable of sequentially deciding what cells to check, and using the resulting information to direct future action.
Program Speci cation
Your program should initially meet the following speci cations for use (though some modi cations and suggestions are given in the questions for writeup). The user should have a board / distribution of mines in mind, and use it to answer the program's queries.
Initially take input from the user, specifying the length and width of the board, then enter the following loop: { Display a representation of the current state of knowledge about the board (unknown cells, clear cells,
mine cells, clue cells).
{ Query the user for the state of a speci ed cell based on its coordinates. If the cell has no mine, the user should enter a number between 0 and the number of adjacent cells, specifying the number of adjacent mines. If the cell has a mine, the user should enter this, at which point the game is over and the computer has lost.
{ The game nishes once either a) the computer accidentally queried a mine cell, or b) all cells have been either revealed, or marked as clear / mined.
You may either do a GUI or a text based interface - the important thing in either case is accurately displaying the current state of the program's knowledge. Very deliberately, there are no algorithmic speci cations for how to implement this program. But you should draw on the material discussed both in a) Search, b) Constraint-Satisfaction Problems, and c) Logic and Satis ability. As usual, you must create your own code for solving the main issues and algorithmic problems in this assignment, but you are welcome to use external libraries for things like visualizations, etc.
Questions and Writeup
Answer the following questions about the design choices you made in implementing this program, both from a representational and algorithmic perspective.
Representation: How did you represent the board in your program, and how did you represent the information / knowledge that clue cells reveal?
Inference: When you collect a new clue, how do you model / process / compute the information you gain from it? i.e., how do you update your current state of knowledge based on that clue? Does your program deduce everything it can from a given clue before continuing? If so, how can you be sure of this, and if not, how could you consider improving it?
Decisions: Given a current state of the board, and a state of knowledge about the board, how does your program decide which cell to search next? Are there any risks, and how do you face them?
Performance: For a reasonably-sized board and a reasonable number of mines, include a play-by-play progres-sion to completion or loss. Are there any points where your program makes a decision that you don't agree with? Are there any points where your program made a decision that surprised you? Why was your program able to make that decision?
Performance: For a xed, reasonable size of board, what is the largest number of mines that your program can still usually solve? Where does your program struggle?
E ciency: What are some of the space or time constraints you run into in implementing this program? Are these problem speci c constraints, or implementation speci c constraints? In the case of implementation constraints, what could you improve on?
Improvements: Consider augmenting your program's knowledge in the following way - when the user inputs the size of the board, they also input the total number of mines on the board. How can this information be modeled and included in your program, and used to inform action? How can you use this information to e ectively improve the performance of your program, particularly in terms of the number of mines it can e ectively solve.
Bonus: Chains of In uence
For any cell that is marked clear or mined, we can think about the cells that led to or informed that conclusion. In Fig. 1 above, Cell (0; 0) [upper left corner] containing the clue 3 is enough to determine the value of Cells (1; 0); (1; 1); (0; 1). Cell (1; 2) on the other hand (depending on your speci c implementation) depended on Cell (2; 4) which revealed the values of Cell (1; 4) and Cell (1; 3) - these, combined with the clue in Cell (2; 3), was enough to mark the value of Cell (1; 2) as clear.
In this way, for every cell we can determine the value of, we can construct a chain of in uence that led to the marking - starting at revealed cells, cells those allowed to be marked, and working through the chain of inference to the conclusion for the value of a certain cell. Every new cell will inherit the chain of in uence of any cell that in uenced it - you can imagine this as a very large directed acyclic graph branching out from the initial cell you reveal (or any unjusti ed / guessed cell you reveal).
Based on your model and implementation, how can you characterize and build this chain of in uence? Hint: What are some `intermediate' facts along the chain of in uence?
What in uences or controls the length of the longest chain of in uence when solving a certain board? How does the length of the chain of in uence in uence the e ciency of your solver?
Experiment. Can you nd a board that yields particularly long chains of in uence? How does this vary with the total number of mines?
Experiment. Spatially, how far can the in uence of a given cell travel?
Can you use this notion of minimizing the length of chains of in uence to inform the decisions you make, to try to solve the board more e ciently?
Is solving minesweeper hard?
Bonus: Dealing with Uncertainty
In real world situations, you are frequently confronted with incomplete or partial information. Three possible models of that here might be:
When a cell is selected to be uncovered, if the cell is `clear' you only reveal a clue about the surrounding cells with some probability. In this case, the information you receive is accurate, but it is uncertain when you will receive the information.
When a cell is selected to be uncovered, the revealed clue is less than or equal to the true number of surrounding mines (chosen uniformly at random). In this case, the clue has some probability of underestimating the number of surrounding mines. Clues are always optimistic.
When a cell is selected to be uncovered, the revealed clue is greater than or equal to the true number of surrounding mines (chosen uniformly at random). In this case, the clue has some probability of overestimating the number of surrounding mines. Clues are always cautious.
How could you adapt your solver to each of these three cases? Try to implement at least one, and experiment with what kind of performance hit results, in terms of ability to clear the board.
4