
Tic-Tac-Toe the Hard Way Podcast
1) Lessons learned
What have we learned about machine learning and the human decisions that shape it? And is machine learning perhaps changing our minds about how the world outside of machine learning — also known as th...Show More
2) Head to Head: The Even Bigger ML Smackdown!
Yannick and David’s systems play against each other in 500 games. Who’s going to win? And what can we learn about how the ML may be working by thinking about the results?
3) Enter tic-tac-two
David’s variant of tic-tac-toe that we’re calling tic-tac-two is only slightly different but turns out to be far more complex. This requires rethinking what the ML system will need in order to learn h...Show More
4) Head to Head: the Big ML Smackdown!
David and Yannick’s tic-tac-toe ML agents face-off against each other in tic-tac-toe!
5) Give that model a treat! : Reinforcement learning explained
Switching gears, we focus on how Yannick’s been training his model using reinforcement learning. He explains the differences from David’s supervised learning approach. We find out how his system perf...Show More
6) Beating random: What it means to have trained a model
David did it! He trained a machine learning model to play tic-tac-toe! How did his model do against a player that makes random tic-tac-toe moves?
7) From tic-tac-toe moves to ML model
Once we have the data we need—thousands of sample games—how do we turn it into something the ML can train itself on? That means understanding how training works, and what a model is.
8) What does a tic-tac-toe board look like to machine learning?
David delves into questions around data and training for his model including: What does a tic-tac-toe board “look” like to ML? Plus, an intro to reinforcement learning, the approach Yannick will be ta...Show More
9) Howdy, and the myth of “pouring in data”
David and Yannick get started on their project to build competing machine learning systems that play tic-tac-toe. They discuss the human choices that will shape their systems along the way.
10) Introducing Tic-Tac-Toe the Hard Way
Introducing the podcast where a writer and a software engineer explore the human choices that shape machine learning systems by building competing tic-tac-toe agents. Brought to you by Google's People...Show More