Intro to Programming and Data Science (Spring 2020)/Day 3 Coding Challenges: Difference between revisions

From CommunityData
Line 8: Line 8:


== Baby Name Challenges ==
== Baby Name Challenges ==
Download the [https://github.com/CommunityDataScienceCollective/CDSW_Babynames/archive/master.zip baby names data and code].
We will go through the babynames1.py, babynames2.py, and babynames3.py files in class. Once we understand what they are doing, you can use them as a springboard to write programs that do each of the following:


# Search for your own name. Are there both boys and girls that have your name? Is it more popular for one group than for the other?
# Search for your own name. Are there both boys and girls that have your name? Is it more popular for one group than for the other?
Line 18: Line 22:
#* (Challenge!) For every letter, tell if there are more boys names or girls names.
#* (Challenge!) For every letter, tell if there are more boys names or girls names.
# What is the longest name in the dataset?
# What is the longest name in the dataset?
# How many boys and girls are described in the dataset (i.e., how many boys and girls born in 2013 have names given to at least four others)?
# How many boys and girls are described in the dataset (i.e., how many boys and girls born in 2018 have names given to at least four others)?
# How many boys names are also girls names? How many girls names are also boys names?
# How many boys names are also girls names? How many girls names are also boys names?
# What is the most popular girls name that is also a boys name?
# What is the most popular girls name that is also a boys name?
# Discover at least one fact about the names that is not listed above.
# Discover at least one fact about the names that is not listed above.
# ''Challenge'' plot (in Excel) the number of people who share a name with n other people in the data set, where n is 4 to 19.
# ''Challenge'' plot (in Excel) the number of people who share a name with n other people in the data set, where n is 4 to 19.

Revision as of 17:18, 22 January 2020

Questions from Python For Everybody

  • Chapter 5: Exercises 1 and 2
  • Chapter 6: Exercise 5
  • Chapter 8: Exercise 4*, 5*, 6

Baby Name Challenges

Download the baby names data and code.

We will go through the babynames1.py, babynames2.py, and babynames3.py files in class. Once we understand what they are doing, you can use them as a springboard to write programs that do each of the following:

  1. Search for your own name. Are there both boys and girls that have your name? Is it more popular for one group than for the other?
    • Hint: don't use a for loop for this one.
  2. What is the most common name for each gender?
  3. What is the least common name?
  4. How often does the least common name occur? (Does that bother you?)
  5. Are there more boys' names or girls' names?
    • What about names that start with "a"?
    • (Challenge!) For every letter, report whether there are more boys' names or girls' names.
  6. What is the longest name in the dataset?
  7. How many boys and girls are described in the dataset (i.e., how many boys and girls born in 2018 have names given to at least four others)?
  8. How many boys' names are also girls' names? How many girls' names are also boys' names?
  9. What is the most popular girls' name that is also a boys' name?
  10. Discover at least one fact about the names that is not listed above.
  11. Challenge: plot (in Excel) the number of people who share a name with n other people in the data set, where n is 4 to 19.
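
As a starting point for the challenges above, here is a minimal sketch of a few patterns that tend to be useful: direct dictionary lookup, finding a maximum by value, set intersection, and counting by first letter. It assumes the data has already been loaded into two dictionaries that map lowercase names to counts. The names boys and girls, all of the helper functions, and the toy counts are made up for illustration and may not match how babynames1.py, babynames2.py, and babynames3.py actually load or name the data.

```python
# Toy stand-ins for the real data: name -> number of babies.
# ASSUMPTION: these dictionaries and their counts are invented for
# illustration; swap in whatever the class files actually provide.
boys = {"jordan": 120, "alex": 300, "liam": 900, "avery": 40}
girls = {"jordan": 80, "emma": 950, "avery": 210, "alexandria": 15}


def count_for(name, names):
    """Look a name up directly; a dictionary needs no loop for this."""
    return names.get(name.lower(), 0)


def most_common(names):
    """Return the name with the largest count (ties broken arbitrarily)."""
    return max(names, key=names.get)


def longest_name(*groups):
    """Return the longest name across any number of name dictionaries."""
    all_names = set().union(*groups)
    return max(all_names, key=len)


def shared_names(a, b):
    """Return the set of names that appear in both dictionaries."""
    return set(a) & set(b)


def most_popular_shared(primary, other):
    """Among names in both groups, return the most popular one in primary."""
    return max(shared_names(primary, other), key=primary.get)


def names_by_first_letter(names):
    """Count how many distinct names start with each letter."""
    counts = {}
    for name in names:
        letter = name[0]
        counts[letter] = counts.get(letter, 0) + 1
    return counts


if __name__ == "__main__":
    print("jordan (boys):", count_for("Jordan", boys))
    print("jordan (girls):", count_for("Jordan", girls))
    print("most common girls' name:", most_common(girls))
    print("longest name:", longest_name(boys, girls))
    print("names in both groups:", sorted(shared_names(boys, girls)))
    print("boys described:", sum(boys.values()))
```

The same helpers should work unchanged on the full dataset as long as it is shaped like the toy dictionaries. For the per-letter challenge, one option is to call names_by_first_letter on each group and compare the two results letter by letter.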