There will be times when you are tempted to loop through rows or columns in Pandas to achieve your results - and the lesson I keep learning is: don't do it! Every time I'm tempted to write a for loop with Pandas data I find myself clock-watching and cursing... 9 times out of 10 there... Continue Reading →
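To make the loop-versus-no-loop point concrete, here is a minimal sketch. The DataFrame, its column names (`price`, `qty`, `total`) and its values are made up purely for illustration; they are not from the original post.

```python
import pandas as pd

# Hypothetical data, invented for this illustration only.
df = pd.DataFrame({"price": [10.0, 20.0, 30.0], "qty": [1, 2, 3]})

# The tempting approach: loop over rows one at a time.
# It works, but it gets slow as the frame grows.
totals_loop = []
for _, row in df.iterrows():
    totals_loop.append(row["price"] * row["qty"])

# The vectorized approach: one operation over whole columns.
df["total"] = df["price"] * df["qty"]
```

Both approaches give the same numbers here; the vectorized version simply hands the work to Pandas instead of stepping through rows in Python.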
In September & October I was fortunate enough to attend the Data Science Intensive Program (DSI) in Cape Town. In a word: WOW! The program brought together 16 students from 7 African countries for 8 very intensive weeks with an ambitious goal: To ensure that anyone who completes the DSI is able to contribute significant... Continue Reading →
I picked the Center for Policing Equity challenge on Kaggle for three reasons: I love maps and I love the idea that data scientists can significantly improve our world, in addition to improving the bottom lines of big corporates. And this is exactly the type of messy data one would get in the real world so... Continue Reading →
I always like to visualize data and see the detail if possible so it was with great joy that I stumbled across DataFrame.style this morning. Here is an example of how it helps us to visualize some Titanic survival rates by sex and passenger class: The Pandas documentation itself is pretty comprehensive, but if you're looking... Continue Reading →
I've just discovered the awesome Brandon Rohrer and his blog while trying to find an intelligible article on Bayesian inference. What a goldmine - this guy is a born educator! Thank you for sharing your knowledge - it is well-appreciated!
I've just worked through Imtiaz Ahmad's Master SQL for Data Science on Udemy and it was a thoroughly enjoyable, morale-boosting experience! He builds on each concept so you never feel left behind or perplexed at how he arrived at a solution, and as promised there are a gazillion exercises so by the time you're done you feel like... Continue Reading →
Related yet different, here's how... A quick note on the "preliminary terrors" of notation: e is Euler's number - you'll find e on your calculator or as the EXP() function in Excel. The parameter is conventionally written as λ (pronounced lambda). Poisson: number of events that occur in an interval of time. Exponential: time taken between 2... Continue Reading →
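For anyone who prefers code to notation, here is a small sketch of how e and λ show up in both distributions. It assumes the standard textbook formulas (Poisson probability of k events is e^(-λ) λ^k / k!, exponential probability of waiting at most t is 1 - e^(-λt)); the function names and the value λ = 2 are my own choices for illustration.

```python
import math

def poisson_pmf(k, lam):
    """Probability of exactly k events in one unit interval, at rate lam."""
    return math.exp(-lam) * lam**k / math.factorial(k)

def exponential_cdf(t, lam):
    """Probability that the wait for the next event is at most t, at rate lam."""
    return 1.0 - math.exp(-lam * t)

lam = 2.0  # hypothetical rate: 2 events per unit of time

# Poisson view: no events at all during one unit interval.
p_no_events = poisson_pmf(0, lam)

# Exponential view: the wait for the next event is longer than one unit.
p_wait_longer = 1.0 - exponential_cdf(1.0, lam)
```

The two probabilities come out equal (both are e^(-λ)), which is one way to see that the two distributions describe the same process from different angles.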
I've just finished listening to The Art of Doing Twice the Work in Half the Time on Audible and I feel like a real fan already - I can't wait to test-drive it in a team situation! As a stalwart of Corporate IT, I've only ever worked according to the "waterfall" methodology and I'm unpleasantly... Continue Reading →