Intro to Programming and Data Science (Summer 2021)/Day 5 Coding Challenges: Difference between revisions

From CommunityData
== Baby Name Challenges ==


Download the [https://github.com/CommunityDataScienceCollective/babynames-cdsw/archive/master.zip baby names data and code].
Download the [https://github.com/CommunityDataScienceCollective/babynames-cdsw/archive/master.zip baby names data and code] and complete each of the exercises.
 
We will go through the Jupyter Notebook file in class. Once we understand what it is doing, you can use it as a springboard to write programs that do each of the following:
 
# Search for your own name. Are there both boys and girls that have your name? Is it more popular for one group than for the other?
# What is the most common name for each gender?
# What is the least common name?
# How often does the least common name occur? (Does that concern you?)
# Are there more boys' names or girls' names?
# What is the longest name in the dataset?
# How many boys' and girls' names are described in the dataset (i.e., how many unique boy names and unique girl names are there)?
# How many boys' names are also girls' names? How many girls' names are also boys' names?
# What is the most popular girl name that is also a boy name?
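
As a starting point, several of the questions above can be sketched with plain dictionaries. This is a minimal sketch, not the notebook's own code: it assumes the data has already been loaded into two dictionaries, here called <code>boys</code> and <code>girls</code> (hypothetical names), that map each name to the number of babies who received it. The sample counts are made up for illustration.

```python
# Hypothetical sample data: each dict maps a name to how many babies received it.
# The real counts come from the downloaded data; these numbers are made up.
boys = {"james": 120, "jordan": 40, "alexander": 75, "riley": 5}
girls = {"mary": 150, "jordan": 25, "riley": 30, "emma": 5}


def most_common(names):
    """Return the name with the highest count."""
    return max(names, key=names.get)


def least_common(names):
    """Return every name tied for the lowest count, sorted alphabetically."""
    lowest = min(names.values())
    return sorted(name for name, count in names.items() if count == lowest)


def longest_names(*groups):
    """Return the longest name(s) found across all of the given dicts."""
    all_names = set().union(*groups)
    longest = max(len(name) for name in all_names)
    return sorted(name for name in all_names if len(name) == longest)


def shared_names(a, b):
    """Return the set of names that appear in both dicts."""
    return set(a) & set(b)


def most_popular_shared(girls, boys):
    """Return the girl name with the highest girl count that is also a boy name."""
    return max(shared_names(girls, boys), key=girls.get, default=None)


print("Most common boy name:", most_common(boys))
print("Most common girl name:", most_common(girls))
print("Unique boy names:", len(boys), "Unique girl names:", len(girls))
print("Names used for both:", len(shared_names(boys, girls)))
```

Returning values rather than printing inside each function makes it easier to reuse the pieces, for example to compare <code>len(boys)</code> with <code>len(girls)</code> or to check how often a least common name occurs.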
 


=== Above and beyond ===

Revision as of 16:47, 20 May 2021

Python for Everybody

Chapter 9:

  • Exercises 2, 3, 4, and 5*

Chapter 10:

  • Exercises 1, 2, 3*


Baby Name Challenges

Download the baby names data and code and complete each of the exercises.

Above and beyond

  1. Write a function that takes a prefix as input and prints the number of boys and girls whose names start with that prefix (e.g., get_names("m") would list all names that start with "m" and get_names("ma") would only list those that start with "ma").
  2. Plot (in Excel) the number of people who share a name with n other people in the data set, where n is 4 to 19.
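
One possible way to approach both tasks is sketched below. It rests on several assumptions: the dictionaries <code>boys</code> and <code>girls</code> and their counts are made-up sample data, <code>get_names</code> is given the dictionary as a second argument (the exercise's example passes only the prefix), and <code>sharing_tally</code> is a hypothetical helper that reads "shares a name with n other people" as "has a name given to n + 1 babies in total". The last lines print comma-separated rows that can be pasted into Excel for plotting.

```python
from collections import Counter

# Hypothetical sample data: name -> number of babies given that name.
boys = {"mason": 20, "max": 5, "matthew": 12, "noah": 5}
girls = {"maya": 6, "mia": 5, "nora": 5, "madison": 12}


def get_names(prefix, names):
    """Return the sorted names in `names` that start with `prefix`."""
    prefix = prefix.lower()
    return sorted(name for name in names if name.lower().startswith(prefix))


def sharing_tally(names, low=4, high=19):
    """Map n -> number of people who share their name with exactly n others.

    A name given to c babies places each of those c people in row n = c - 1.
    """
    tally = {n: 0 for n in range(low, high + 1)}
    for count in names.values():
        n = count - 1
        if n in tally:
            tally[n] += count
    return tally


for prefix in ("m", "ma"):
    matching_boys = get_names(prefix, boys)
    matching_girls = get_names(prefix, girls)
    print(prefix, "boys:", len(matching_boys), matching_boys)
    print(prefix, "girls:", len(matching_girls), matching_girls)

# Add the two groups together so a name used for both is counted once overall.
combined = Counter(boys) + Counter(girls)

# Comma-separated rows: paste into Excel and chart the second column against n.
print("n,people")
for n, people in sharing_tally(combined).items():
    print(f"{n},{people}")
```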