DS4UX (Spring 2016)/Day 3 coding challenge: Difference between revisions

Revision as of 00:08, 11 April 2016

Each of this week's challenges asks you to modify and work with code for the Baby Names dataset, which you should have installed and begun working with in class.

As always, it's not essential that you solve or get through all of these; I'm not grading your answers. That said, being able to work through a good number of them is a sign that you have mastered the concepts for the week. It is always fine to collaborate or work together on these problem sets.

Challenges

  1. Are there more boy names or girl names? What about for particular first letters? What about for every first letter?
  2. How many babies are described in the dataset (assuming nobody is counted more than once)?
  3. What is the longest name in the dataset?
  4. How many boy names are also girl names? How many girls' names are also boys' names?
  5. How many names are subsets of other names?
  6. What is the most popular girl name that is also a boy name?
  7. Write a program that will take a name as input and return the number of babies with that name in the girl and boy datasets.
  8. Take a prefix as input and print the number of babies with that prefix in each dataset (e.g., "m" would count babies whose names start with "m", "ma" would count babies whose names start with "ma", etc.).
  9. Which boy and girl names are the most popular across all four years in our dataset? (hint: to solve this challenge, you will need to edit ssadata.py)
  10. Which boy and girl names have increased most in popularity between 2010 and 2013? Which ones have declined most in popularity?
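
If you are unsure how to get started, the sketch below shows the general shape that challenges like 4, 7, and 8 might take. It is only a starting point under stated assumptions, not a set of answers: it assumes each dataset can be treated as a dictionary mapping a lowercase name to a count, and it uses two tiny invented dictionaries (`boys` and `girls`, with made-up counts) in place of the real data. The helper names `count_name`, `count_prefix`, and `shared_names` are hypothetical and do not come from <tt>ssadata.py</tt>.

```python
# Toy stand-ins for the real data. The names and counts are invented for
# illustration, and the dict-of-name-to-count layout is an assumption.
boys = {"jordan": 120, "mason": 300, "max": 150, "riley": 40}
girls = {"jordan": 30, "maria": 200, "riley": 90, "mia": 250}


def count_name(name, dataset):
    """Return the count stored for name, or 0 if the name is absent."""
    return dataset.get(name.lower(), 0)


def count_prefix(prefix, dataset):
    """Return the total count of babies whose names start with prefix."""
    prefix = prefix.lower()
    return sum(count for name, count in dataset.items()
               if name.startswith(prefix))


def shared_names(first, second):
    """Return the set of names that appear in both datasets."""
    return set(first) & set(second)


if __name__ == "__main__":
    # Look up a single name in each dataset.
    print(count_name("Jordan", boys), count_name("Jordan", girls))
    # Total babies per dataset whose names start with "ma".
    print(count_prefix("ma", boys), count_prefix("ma", girls))
    # Names that appear in both datasets.
    print(sorted(shared_names(boys, girls)))
```

To try the same ideas on the real data, you would replace the two toy dictionaries with whatever the course code actually provides, and read the name or prefix with `input()` instead of hard-coding it.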