I was scrolling and came across the Big Data Cup for 2021. In short, the organizers will provide two datasets about women’s hockey, and I’ll perform some analysis on that data. The winner gets tickets to the conference in Canada.
The first dataset deals with scouting hockey players, while the second deals with Olympic-level performance. I’m going to spend most of my time on the scouting dataset, as I think it’s a more interesting problem to solve, and the lessons are more broadly applicable (who is the best player, and what need am I trying to fill?).
I’m sure I’ll write more about how I want to approach the problem later, but I think this is a good starting point.