Avoiding for loops in Pandas

There will be times when you are tempted to loop through rows or columns in Pandas to achieve your results - and the lesson I keep learning is Don't do it! Every time I'm tempted to write a for loop with Pandas data I find myself clock watching and cursing... 9 times out of 10 there... Continue Reading →

Emerging from Data Science Intensive

In September & October I was fortunate enough to attend the Data Science Intensive Program (DSI) in Cape Town. In a word: WOW! The program brought together 16 students from 7 African countries for 8 very intensive weeks with an ambitious goal: To ensure that anyone who completes the DSI is able to contribute significant... Continue Reading →

Pandas dataframe styling – cool!

I always like to visualize data and see the detail if possible so it was with great joy that I stumbled across DataFrame.style this morning. Here is an example of how it helps us to visualize some Titanic survival rates by sex and passenger class: The Pandas documentation itself is pretty comprehensive, but if you're looking... Continue Reading →

I've just discovered the awesome Brandon Rohrer and his blog while trying to find an intelligible article on Bayesian inference. What a goldmine - this guy is a born educator! Thank you for sharing your knowledge - it is well-appreciated!

SQL CheatSheet

I've just worked through Imtiaz Ahmad's Master SQL for Data Science on Udemy and it was a thoroughly enjoyable, morale-boosting experience! He build on each concept so you never feel left behind or perplexed at how he arrived at a solution, and as promised there are a gazillion exercises so by the time you're done you feel like... Continue Reading →

Poisson vs Exponential distributions

Related yet different, here's how... A quick note on the "preliminary terrors" of notation: e is Euler's number - you'll find the e on your calculator or the EXP() function in Excel The parameter is conventionally written as λ (pronounced lambda). Poisson Exponential Number of events that occur in an interval of time Time taken between 2... Continue Reading →

A Scrum fan is born… cheatsheet

I've just finished listening to The Art of Doing Twice the Work in Half the Time on Audible and I feel like a real fan already - I can't wait to test-drive it in a team situation! As a stalwart of Corporate IT, I've only ever worked according to the "waterfall" methodology and I'm unpleasantly... Continue Reading →

Create a website or blog at WordPress.com

Up ↑