Community Data Science Workshops (Core)/Workshop/Baby Names/Setup

From CommunityData
Revision as of 03:24, 15 January 2020 by Jtmorgan (talk | contribs) (→‎Test the Baby Names code: some more specifics)

In this section, we'll download everything we need for tomorrow's projects.

Baby Names

Being a twin means you always have a pillow or blanket handy.jpg

Download the Baby Names project

You'll be playing with data from the list of all baby names in the US (used more than five times in a year) from the last several years:

Windows
  1. Right click the following file, click "Save Target as..." or "Save link as...", and save it to your Desktop directory: https://github.com/CommunityDataScienceCollective/CDSW_Babynames/archive/master.zip
  2. The ".zip" extension on the above file indicates that it is a compressed Zip archive. We need to "extract" its contents. To do this, click on "Start", then "Computer", and navigate to your Desktop directory. Find CDSW_Babynames-master.zip on your Desktop and double-click on it to "unzip" it. That will create a folder called babynames containing several files.
MacOS
  1. Click the following link, select "Save file" and click "Okay" to start the download: https://github.com/CommunityDataScienceCollective/CDSW_Babynames/archive/master.zip
  2. The file CDSW_Babynames-master.zip will be saved to your default download location (usually "Downloads"). We suggest that you move it to somewhere like your Desktop or Documents folder so you can find it easier.
  3. The ".zip" extension on the above file indicates that it is a compressed Zip archive. We need to "extract" its contents. Double click CDSW_Babynames-master.zip to do this. It will create a folder called "CDSW_Babynames-master" that contains several files.

Test the Baby Names code

Launch Jupyter Notebook through the Anaconda Navigator and navigate to the "CDSW_Babynames-master" directory where the Baby Names code lives. Open BabyNames.ipynb.

Running cells

To test that everything is working, we will run the first two cells in this notebook. "Run" means executing the code in the cell. You can run a cell two ways: 1) by clicking the "run" button, or 2) by clicking the grey part of the cell and then typing SHIFT + ENTER (or "RETURN") at the same time.

Step 1: Run the first cells that reads:

 import ssadata

If you are running this cell for the first time, it will look like nothing has happened! That's expected. But if you are curious about what just you just did, ask a mentor!

Step 2: Run the second cell that reads:

 for name in ssadata.boys.keys():
     if name == "mako":
         print("There were " + str(ssadata.boys[name]) + " boys named " + name)

This time, you should get the output

There were 10 boys named mako

If this doesn't work, let a mentor know!

Success!

You've completed setup for the Baby Names project.

State Capitals

We'll look at an example Python script that quizzes you on state capitals during the lecture on Saturday.

  1. Right click the following file, click "Save Target as..." or "Save link as...", and save it to your Desktop directory: https://communitydata.science/~mako/State_Capitals.ipynb

Success!

You are done downloading the Saturday projects.

Champagne.pngParty.png