Excel Su Doku
Excel Su Doku
/w EPDw UKMTM5
Sign in
Search In:
All Time
Date range: Search Search this blog Common Tasks RSS for Posts RSS for Comments Atom recent posts Search all blogs
by
Diego Oppenheimer
by
Diego Oppenheimer
by
Diego Oppenheimer
by
FredK
by
FredK
Tags
Analysis Services Backwards Compatibility Charting Conditional Formatting Excel 2010 Excel Server File Format Formatting & Printing Formulas and functions Gemini How To News / Announcments Other Improvements Overview PDF PDF, Undo, Other Improvements Performance PivotTables Power Tips Programmability Rows, Columns, Performance Sparklines Tables UDFs Undo
The BI Blog
All things Business Intelligence (BI) related at Microsoft
Office Global Experience Office Math Microsoft OneNote InfoPath team blog Office Interoperability Office Natural Language team blog Office Extensibility Microsoft SharePoint Designer Microsoft SharePoint Workspace Office Sustained Engineering Performance Point team blog
Archives Archives June 2010 (9)
May 2010 (7) April 2010 (6) March 2010 (5) February 2010 (8) January 2010 (8) December 2009 (2) November 2009 (17) October 2009 (16) September 2009 (15) August 2009 (13) July 2009 (16) June 2009 (3) May 2009 (1) April 2009 (3) March 2009 (3) February 2009 (4) January 2009 (8) December 2008 (3) November 2008 (2) October 2008 (6) September 2008 (4) August 2008 (5) July 2008 (1) June 2008 (5) May 2008 (4) April 2008 (8) March 2008 (6) February 2008 (5) January 2008 (6) December 2007 (4) November 2007 (4) October 2007 (4) September 2007 (4) August 2007 (7) May 2007 (1) April 2007 (2) March 2007 (1) February 2007 (2) December 2006 (2) October 2006 (3) September 2006 (18) August 2006 (15) July 2006 (12) June 2006 (14) May 2006 (15) April 2006 (11) March 2006 (12) February 2006 (11) January 2006 (6) December 2005 (11) November 2005 (17) October 2005 (17) September 2005 (4)
Building a Basic, Understandable Sudoku Solver Using Excel Iterative Calculation - Part 1/2
Joseph Chirilov 30 Sep 2008 1:56 AM Comments 15 Today's author, Charlie Ellis, a Program Manager on the Excel team, shares a spreadsheet he built in Excel for solving Sudoku puzzles. The spreadsheet can be found in the attachments at the bottom of this post. For those of you who don't already know, Sudoku is a type of logic puzzle (that I was completely addicted to about three years ago) that requires you to place the numbers 1-9 into a grid obeying certain rules (lots more information on Sudoku is available on the web). A while back, a fellow PM on the Excel team, Dan Cory, wrote a spreadsheet for solving Sudoku puzzles using Excel formulas and made it available on Office Online (here). Dan's spreadsheet was great in that, unlike many of the Sudoku solving spreadsheets out there, it didn't use any VBA or other scripting to do the work of solving the puzzles, and relied instead on the iterative calculation feature of Excel. It's quite cool and has been a popular download, but one thing about the spreadsheet that I wanted to see if I couldn't improve upon was just how complicated it is. In fact, Dan made every single cell its own different formula, and he ended up having to use VBA to create the formulas because maintaining and debugging it without VBA to write all those different formulas in an automated way was impossible. As soon as I saw Dan's spreadsheet, I wanted to make my own version of a Sudoku solver that not only used only formulas, but also one where the formulas were relatively understandable and there were a small number of distinct formulas. It turned out to not be that tough to build, but I think I learned a fair amount trying different approaches to the problems of making an iterative model like this one perform well and at the same time be reasonably maintainable and understandable. I think it might even have turned up a reasonably useful way at looking at abstraction within formulas given the Excel formula language. I've always wanted to blog about the process of creating this spreadsheet and about how iterative formulas work to show the power of Excel's formula language, because it illustrates the usefulness of circular references and iterative calculation, and because I just think it's an incredible amount of fun so here goes. Lots of people have created more powerful solvers, many as spreadsheets, some using just formulas, but I wanted to try to explain how you can go about creating a solver and hopefully share some formula tricks that people find useful.
Pre-reqs
Creating a spreadsheet for solving a Sudoku isn't entry-level spreadsheeting. In addition to being pretty good with formulas, you'll need to understand the concept of iteration. Chris Rae did a great job of explaining the topic in his earlier post on Iteration & Conway's Game of
Life, so I'm not going to repeat that, and I'll simply assume you already understand iteration. Second, we're going to make extremely heavy use of named ranges, and for the stuff I'm doing, the new name manager is very helpful (see Formula building improvements Part 4: Defined Names for some information about this) and I'm going to assume working knowledge of it and of named ranges generally (though I'm going to show some tricks which may be new to even experienced formula users). Finally, you'll need to at least be familiar with array notation in Excel.
The input and solution boards are reasonably straightforward (the input board is the one in the top left where you'll type in a puzzle to be solved, the solution board is where the correct answer hopefully shows up). The board with possible values, which I'll call the valid values board, is a bit trickier. It is 27x27 because each box in the input and solution boards is represented by a 3x3 set of cells in the valid values board. Each of these nine cells represents whether one of the numbers 1-9 is still in the running to be the actual value for the corresponding box of the solution board and the set of possible values for a given cell in the input/solution cell is the set of all the numbers in a single 3x3 "big cell" that are not blank. If it
isn't already, the purpose/use of this board should become clear later. For now, let's fill in all the possible values from one to nine in each of these big cells.
Note that you'll want to do the filling in with either Paste Special | Formulas or CTRL-Enter because otherwise you'll mess up all the pretty formatting. Breaking this formula down, ROW and COLUMN return (duh) the row or column of the reference passed to them as a number. Passing these functions A1, as in this formula, means they'll give us a number that starts at one and goes up. The first part of the formula uses the modulus function to transform the column numbers given by COLUMN into the numbers 0-2, and then adds one to get 1-3. To this we add a 0, 3, or 6, depending on the row number by using the modulo function on the result of the ROW function. Next, because that's a bit of a gnarly formula to have sitting around, and we're going to have to use it all over the place, we're going to take this formula and move it out of the cell and into a named range. This allows us to abstract away all of the logic for this formula into a single, understandable name. For lack of a better name, I'm going to call it "onetonine" and it will have the same exact formula we just created. Because the context for the relative references (i.e. what they take as being the current cell) is determined by what cell you're in when you create
the named range, it's critical that you start off by selecting cell A1, then create the new named range, so that your formula works everywhere within the sheet.
This is also why we allow gutters of three rows and three columns around all the boards. Now we can take our new name and test it out in the board, like so:
Here CTRL+Enter is by far the easiest way to set the formula for all the cells in the valid values board. First select the whole board, then type in the formula, and instead of pressing Enter, just hit CTRL+Enter to fill the formula you just typed into all the cells (without messing up their formatting).
Again, use CTRL+Enter to fill this into the appropriate cells. Now that we have the base thing working, let's make it more re-usable and meaningful by using named ranges. As we did with the name onetonine, let's abstract the concept of referring to the correct input cell from any cell in the solution board and turn that into a name. We'll need to do something similar for all the boards at some point, so we'll start by making named ranges for each of the boards (I chose in_board, sol_board, and val_board) and then a name to go from the solution board to the input board (in_cell_from_sol) which is simply =Main!D4, then use this to change the formula to be =IF(in_cell_from_sol, in_cell_from_sol, ""). Note that this needs to be input from D16. OK, so far we just made our formula longer, but trust me, this concept becomes a life saver. Doing the same for valid value cells from solution board cells is only a bit trickier. The name sol_cell_from_val is: =INDEX(sol_board, INT((ROW(Main!A1)-1)/3)+1, INT((COLUMN(Main!A1)1)/3)+1) This must be created from cell P4. This formula uses ROW and COLUMN together with the division operator and INT to convert from the coordinates of the current cell in the 27x27 board to their coordinates in a 9x9 board, then uses INDEX to get the cell out of the sol_board corresponding to those coordinates. A neat way of testing this formula is to click into the "Refers to" box of the name manager from different cells in the valid values board. Depending on what cell you're in you'll see "dancing ants" (a moving highlight) for a different cell - hopefully the corresponding cell in the solution board.
Now that we have some basics, let's put in an actual puzzle and see about getting the inputs to propagate to the solution board and the valid value board. Here's the puzzle we'll use:
After entering it, the solution board should look like the input board. To make the valid value board work, we use this formula for all valid board cells: =IF(sol_cell_from_val<>"",IF(sol_cell_from_val=onetonine, onetonine,""), onetonine) This means that the current cell is blanked out if a value exists in the solution cell and that value isn't the current onetonine value. This should give you:
Now we're ready to do the stuff that will actually help come up with solutions based on the rules and strategies of Sudoku.
Which, while not simple, is at least understandable and gives you a valid values board that looks like this:
there exist a solution cell with my number in any of the row, column, or bigbox?" This name only ever has to return true or false (doing the test part of the condition #2 does above) and despite not being short, is actually really simple to write: =OR(COUNTIF(sol_row_from_val, onetonine)>0, COUNTIF(sol_col_from_val, onetonine)>0, COUNTIF(sol_bigbox_from_val, onetonine)>0) Taking advantage of this new name makes our new formula for valid value cells: =IF(sol_cell_from_val<>"",IF(sol_cell_from_val=onetonine, onetonine,""), IF(solution_in_rcb, "",onetonine)) Which is not only shorter than this formula had been and much more understandable, it also results in some clear places where there's only one possible solution:
So we can eyeball some solutions, but the trick now is to feed those into the solution board. This is where iteration comes in. Next time we'll use iteration and a few more formula tricks to solve some Sudokus. Edit: Updated the sol_bigbox_from_val formula to reflect what it looks like when entered from the starting cell in P4. Also clarified in a couple other places that the starting cell should be P4.
Attachment: 15 Comments
true:undefined
Comments
Building a Basic, Understandable Sudoku Solver Using Excel Iterative Calculation - Part 1/2 : EasyCoded 30 Sep 2008 2:22 AM
Andrew Wagner 30 Sep 2008 8:55 AM Can you please give the correct formula for sol_col_from_val (used in solution_in_rcb)? Thanks!
Charlie Ellis 30 Sep 2008 12:39 PM Sure thing, Andrew. The correct formula for sol_col_from_val is: =INDEX(sol_board, 0, INT((COLUMN(Main!A1)-1)/3)+1) ...when entered from the first cell in the valid values board. Basically, it's the same thing as sol_row_from_val (which was =INDEX(sol_board, INT((ROW(Main!A1)-1)/3)+1, 0)) but with a zero for the row_num argument as opposed to a zero for the col_num argument. Note that putting a zero for the row_num or col_num means that INDEX should return all rows or all columns of the reference instead of a specific one. Also, for future reference, you can see every formula used in this post (and the upcoming part two) by downloading a copy of the spreadsheet from the link Joe provided.
Ron 30 Sep 2008 1:30 PM This doesn't work as posted. The sol_bigbox_from_val formula should be =INDEX(sol_board, INT((ROW(Main!A1)-1)/9)*3+1, INT((COLUMN(Main!A1)1)/9)*3+1):INDEX(sol_board, INT((ROW(Main!A1)-1)/9)*3+3, INT((COLUMN(Main!A1)-1)/9)*3+3) with P4 selected.
Charlie Ellis 1 Oct 2008 3:02 AM Ron - Actually =INDEX(sol_board, INT((ROW(Main!S19)-1)/9)*3+1, INT((COLUMN(Main!S19)1)/9)*3+1):INDEX(sol_board, INT((ROW(Main!S19)-1)/9)*3+3, INT((COLUMN(Main!S19)1)/9)*3+3) is correct, so long as you select cell AH22 when you enter it. ;)
Yes, your fomulation is correct if you're entering that name from the first cell of the valid values board, as all the other names' formulas work. I'll work with Joe to update the post.
db 1 Oct 2008 9:01 AM I plugged in a sample problem, set iteration to 1000, and it gave an invalid answer.
Charlie Ellis 1 Oct 2008 4:27 PM Can you share the sample? I'd love to try to debug it and see what is going wrong. BTW, I can't say I'm too surprised if there's a bug in the formulas; while I tested previous versions of the spreadsheet extensively, I've only tested this version a little bit, since I (re-)created it from scratch as a part of making this blog post.
Joseph Chirilov 2 Oct 2008 2:26 AM Hi, I updated the post based on Ron's feedback. Thanks, Joe
Microsoft Excel 2 Oct 2008 2:32 AM Today's author, Charlie Ellis, continues discussing the spreadsheet he built to solve Sudoku puzzles.
TechieBird 2 Oct 2008 5:07 PM Great post - I've been using Excel since what seems like the dawn of time and I never knew you could name formulas like that. Now I'm just waiting for the walk-through that teaches me what to do with array formulas... :o)
Cooking 4 Oct 2008 8:52 AM Very less explored subject and you have provided valuable information. Thanks for the great stuff.
Paan 20 Oct 2008 2:46 PM It is really nice, I would be glad if you could attach one file, with sudoku puzzle and the formula
Barry Lynch 20 Oct 2008 7:40 PM Your Sudoku solver looks great, but having the possibility matrix and the entered candidates on different matrices slows down the solving process. I did my own Sudoku Solver, using basic formulae, which places both the possibility space and the entered candidates on the same matrix. As in your model,once a candidate is entered in a cell, all other candidates disappear from that cell and the entered candidate disappears from all cells in the same row, column and box. I find it more user friendly than yours but it is a larger size than your solution matrix as the 9 possible candidates for each cell display in the row above the candidate's cell. This candidate's cell is larger as it contains 9 merged cells.This allows for a larger font for the entered candidate. Could you adjust your programme to display solution and possibility space on the same matrix?, or is there a way I could adjust my programme so that the one cell initially displays the possibility space in 9 small sub-cells, and these 9 small sub-cells then merge into the one cell to display the entered candidate? Barry Lynch
Joseph Chirilov 21 Oct 2008 1:41 PM Paan, the file is attached at the bottom of the post.
Shahed Khan (MVP C#) 28 Oct 2008 4:48 AM 196 Microsoft Team blogs searched, 85 blogs have new articles in the past 7 days. 194 new articles found... Page 1 of 1 (15 items)
2010 Microsoft Corporation. All rights reserved. Terms of Use Trademarks Privacy Statement Report Abuse
Bottom of Form