Intro to Programming and Data Science (Summer 2020)/Day 4 Coding Challenges
From CommunityData
Revision as of 17:13, 15 May 2020
Python for Everybody
Chapter 7: Exercises 1, 2, 3
Baby Names
Using the baby names data from The day 3 challenges:
- Get the ratio of names that start with each letter.
- Do this for boys and girls.
- Hint: the first line of output should look something like:
a: boys: 0.1002914920750592 girls: 0.17587602795796703
- Are girls or boys more likely to have a name that is used by both genders?
- Figure out how to change the ssadata.py file so that it loads births from 2017 instead of 2018.
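One possible way to approach the first two challenges is sketched below. It assumes the data is available as two dictionaries mapping each name to a number of births; the `boys` and `girls` dictionaries here are small made-up stand-ins (not real SSA figures), and the helper names `letter_shares` and `shared_name_share` are hypothetical. The sketch counts distinct names per letter; weighting by births instead would be an equally reasonable reading of the challenge.

```python
from collections import defaultdict

# Made-up stand-in data (name -> number of births); NOT real SSA figures.
# In the course these would presumably come from ssadata.py instead.
boys = {"aiden": 300, "alex": 100, "ben": 400, "casey": 200}
girls = {"ava": 500, "alex": 50, "bella": 250, "casey": 200}


def letter_shares(counts):
    """Return {first letter: share of distinct names starting with it}."""
    tally = defaultdict(int)
    for name in counts:
        tally[name[0].lower()] += 1
    total = len(counts)
    return {letter: n / total for letter, n in tally.items()}


def shared_name_share(counts, other):
    """Return the share of births in `counts` whose name also appears in `other`."""
    shared_births = sum(n for name, n in counts.items() if name in other)
    return shared_births / sum(counts.values())


boy_shares = letter_shares(boys)
girl_shares = letter_shares(girls)

# One line per letter, in the same shape as the hint above.
for letter in sorted(set(boy_shares) | set(girl_shares)):
    b = boy_shares.get(letter, 0.0)
    g = girl_shares.get(letter, 0.0)
    print(f"{letter}: boys: {b} girls: {g}")

boys_shared = shared_name_share(boys, girls)
girls_shared = shared_name_share(girls, boys)
print("boys with a shared name:", boys_shared)
print("girls with a shared name:", girls_shared)
```

Swapping the stand-in dictionaries for the real data should leave the rest of the sketch unchanged, provided the real data has the same name-to-count shape.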
Above and beyond
- Figure out how to load two years of birth data simultaneously and compare them (e.g., identify the top 20 names from 2017 and figure out how many more/fewer babies were given those names in 2018).
- Visualize some of the differences (probably in Excel).
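The year-over-year comparison could be sketched roughly as follows, once both years are loaded. This is only an illustration: `births_2017` and `births_2018` are made-up stand-in dictionaries (not real SSA figures), and `top_name_changes` is a hypothetical helper, not something provided by ssadata.py.

```python
# Made-up stand-in data (name -> number of births); NOT real SSA figures.
births_2017 = {"emma": 19800, "liam": 18800, "olivia": 18700, "noah": 18400}
births_2018 = {"emma": 18700, "liam": 19900, "olivia": 18000}


def top_name_changes(earlier, later, n=20):
    """For the n most common names in `earlier`, return (name, change) pairs.

    A name missing from `later` is treated as having zero births that year.
    """
    top = sorted(earlier, key=earlier.get, reverse=True)[:n]
    return [(name, later.get(name, 0) - earlier[name]) for name in top]


# n=3 only because the stand-in data is tiny; the challenge asks for 20.
changes = top_name_changes(births_2017, births_2018, n=3)
for name, delta in changes:
    print(f"{name}: {delta:+d}")
```

The resulting list of (name, change) pairs can be written out as CSV and opened in Excel for the visualization step.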