Community Data Science Course (Spring 2015)/Day 2 Coding Challenges

From CommunityData
< Community Data Science Course (Spring 2015)
Revision as of 19:00, 13 April 2015 by Benjamin Mako Hill (talk | contribs)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)

Each of the challenges this week will ask you to modify and work with code in the baby names which you should have installed and begun working with in class.

As always, it's not essential that you solve or get through all of these — I'm not grading your answers on these. That said, being able to work through at least many of them is a good sign that you have mastered the concepts for the week. It is always fine to collaborate or work together on these problem sets. The only thing I ask is that you do not broadcast answers before Sunday at midnight on Canvas.


  1. Search for your own name. Are there both boys and girls that have your name? Is it more popular for one group than for the other?
  2. Are there more boys names or girls names? What about for particular letters? What about for every letter?
  3. What is the longest name in the dataset?
  4. How many boys and girls are described in the dataset (i.e., how many boys and girls born in 2013 have names given to at least four others)?
  5. How many boys names are also girls names? How many girls names are also boys names?
  6. How many names are subsets of other names?
  7. Write a program that will take a name as input and return the number of girls and boys with that name.
  8. What is the most popular girls name that is also a boys name?
  9. Take a prefix as input and print the number of boys and girls with that prefix. (i.e., "m" would list babies whose names start with "m" and "ma" would list babies whose names start with "ma", etc).