A piece I wrote on the man behind one of the most popular statistical hypothesis tests of all time.
Tag Archives: statistics
Scipy Lecture Notes
Tutorials on the scientific Python ecosystem: a quick introduction to central tools and techniques. The different chapters each correspond to a 1 to 2 hours course with increasing level of expertise, from beginner to expert.
A great intro to Scipy and the things your can do with it.
Three and a half degrees of separation
How connected is the world? Playwrights [1], poets [2], and scientists [3] have proposed that everyone on the planet is connected to everyone else by six other people. In honor of Friends Day, we’ve crunched the Facebook friend graph and determined that the number is 3.57. Each person in the world (at least among the 1.59 billion people active on Facebook) is connected to every other person by an average of three and a half other people. The average distance we observe is 4.57, corresponding to 3.57 intermediaries or “degrees of separation.” Within the US, people are connected to each other by an average of 3.46 degrees.
Well, there we are. Not six but something closer to four. Or maybe even closer to three, given that the figure has shrunk each time Facebook—which continues to grow—has done one of these studies.
Markov and You
A classic Coding Horror piece on Markov chains (text chains in particular).
Language Trends on GitHub
The rank represents languages used in public & private repositories, excluding forks, as detected by Linguist.
No surprise at JavaScript taking the top spot. What is surprising is Java having a steady upward trend.
Big data: are we making a big mistake?
“Big data” has arrived, but big insights have not. The challenge now is to solve new problems and gain new answers – without making the same old statistical mistakes on a grander scale than ever.
A great article on the pitfalls.