Blog  |   Puzzles  |   Books  |   About

Facebook and gender-bias in names

Monday, September 13th, 2010

Some weeks back, I used Facebook’s open graph API to read about 1.2 million names from Facebook users. I used it to construct this file, which shows, for each name, the number of times it occurs, the number of times it’s used by males, and the number of times it occurs in females. To keep […]

Mining juicy words

Monday, March 22nd, 2010

This weekend, I counted all the words on Project Gutenberg. This has been done before, notably, here. My script crawled most of the English language books on Project Gutenberg (about 20,000 titles), and counted how often each word appears, and how many books each word appears in. The script ran for about 20 hours. You […]