Editing Community Data Science Workshops (Fall 2015)/Day 3 Projects/Civic data

From CommunityData

Warning: You are not logged in. Your IP address will be publicly visible if you make any edits. If you log in or create an account, your edits will be attributed to your username, along with other benefits.

The edit can be undone. Please check the comparison below to verify that this is what you want to do, and then publish the changes below to finish undoing the edit.

Latest revision Your text
Line 1: Line 1:
[[File:Seattle_building_cost_heatmap.png|thumb|right|250px|What neighborhoods have changed the most since the beginning of the Seattle [http://www.seattletimes.com/pacific-nw-magazine/seattles-building-boom-is-good-news-for-a-new-generation-of-workers/ building boom]?]]
[[File:Seattle_building_cost_heatmap.png|right|250px]]
[[File:SeattleGovLogoHome.png|right|150px]]


__NOTOC__
__NOTOC__
Line 17: Line 18:
* Graphing and mapping data
* Graphing and mapping data


<!-- We will also explore an important non-technical part of data science: thinking critically about data. Thinking critically about the data you have, and what conclusions you can draw from it, is especially important when you are visualizing data, because its easy to mislead people with visualizations. We'll discuss how visualizations based on incorrect data can lead people to make false conclusions, using an example from a recent visualization of building demolitions published in the Northwest design magazine ''Arcade''. -->
We will also explore an important non-technical part of data science: thinking critically about data. Thinking critically about the data you have, and what conclusions you can draw from it, is especially important when you are visualizing data, because its easy to mislead people with visualizations. We'll discuss how visualizations based on incorrect data can lead people to make false conclusions, using an example from a recent visualization of building demolitions published in the Northwest design magazine ''Arcade''.


== Preparation ==


== Part 1: Downloads ==
;Step 1: Download today's scripts and sample datasets: [http://jtmorgan.net/cdsw/may9cdsw2.zip FIXME]
''If you are confused by anything today, go back and refresh your memory with the [[Community Data Science Workshops (Fall 2015)/Day 0 setup and tutorial|Day 0 setup and tutorial]] and [[Community Data Science Workshops (Fall 2015)/Day 0 tutorial|Day 0 tutorial]]''
;Today's dataset: [https://data.seattle.gov/Permitting/Building-Permits-Current/mags-97de Click here to view the Seattle Building Permits database]
 
;Step 1: Download today's scripts: [http://jtmorgan.net/cdsw/nov7cdsw.zip Python scripts]
 
;Step 2: Get the dataset: Unzip the folder, navigate to it in your terminal, and run this script <pre>download_building_permit_data.py</pre>
 
Now open up the CSV output file to verify that you got all the data you asked for.


You can view the full set of permit applications at: https://data.seattle.gov/Permitting/Building-Permits-Current/mags-97de
== Visualizing new home construction in Seattle over time ==
;Question: Has the rate of apartment construction increased since 2010?


;Challenge question: how could we change our API query to download the applicant names for all COMMERCIAL building permits issued within the past year?


== Part 2: charting new construction by month ==
;Question: Has the rate of residential construction increased since 2010?


Now we want to learn whether there is, in fact, a housing boom in Seattle. We'll do this by counting how many new permits are issued each month, and then plot these on a graph.
== Visualizing new home construction in Seattle by neighborhood ==
 
Run the script <pre>residential_permits_by_month.py</pre>
 
Now open up the CSV output file and check your data. Does this look right to you?
 
;Challenge question: how could we separate out single family homes from apartments, and count/plot them separately?
 
== Part 3: charting new construction by neighborhood ==
;Question: which Seattle neighborhoods have had the most multifamily residential construction (apartments and townhomes) since 2010?
;Question: which Seattle neighborhoods have had the most multifamily residential construction (apartments and townhomes) since 2010?


Now we want to learn where all this new construction is happening. We'll do this by sending the address for each MULTIFAMILY permit to the Google Geolocation API, which will return the neighborhood where that address is located.
Run the script <pre>multifamily_permits_by_neighborhood.py</pre>
Now open up the CSV output file and check your data. Does this look right to you?
;Challenge question: In which Seattle neighborhood is the cost of new apartment construction projects highest, on average?


== Part 4: Mapping new home construction in Seattle ==
== Mapping new home construction in Seattle ==
;Question: what locations have experienced the highest density of new construction since 2010?
;Question: what locations have experienced the highest density of new construction since 2010?


Now we will try to get an even more detailed picture of where this construction is occuring, using Google Fusion tables, a powerful visualization application that makes it easy to plot data on a map.
== Challenge questions ==
 
Go [https://support.google.com/fusiontables/answer/2571232?hl=en here] and follow the steps to upload your '''new_all_2010-2015.csv''' file.
 
*Create a point map and heatmap.
*Experiment with filtering and weighting the points on your map.
 
;Example map: https://www.google.com/fusiontables/data?docid=1gm0wVqnK7zQ7hgp5bsKoBIetMZ75jo-VMs5noDoJ#map:id=3
 
== Going further ==
''Try to answer these additional questions that draw on the data and methods we're learning today. Ask a mentor if you get stuck!''
''Try to answer these additional questions that draw on the data and methods we're learning today. Ask a mentor if you get stuck!''
#Which developer has spent the most on new construction in Seattle since 2010?
#In which Seattle neighborhood is the cost of new construction projects highest, on average?
#How many townhouses have been constructed in Seattle since 2010?
#Where in Seattle are the most commercial buildings being constructed in 2015?
#Where in Seattle are the most ''commercial'' buildings being constructed in 2015?
#How does the rate of residential construction in Seattle from 2010-2015 compare to the previous 5 years?
#How does the rate of residential construction in Seattle from 2010-2015 compare to [https://data.seattle.gov/Permitting/Building-Permits-Older-than-5-years/47eb-r92t the previous 5 years]?
#How many townhouses have been constructed in Seattle since 2011?






== Resources ==


;Press about the construction boom
=== Resources ===
*http://www.theurbanist.org/2015/10/20/fact-check-no-explosion-in-demolitions/
''If you are confused by anything today, go back and refresh your memory with the [[Community Data Science Workshops (Fall 2015)/Day 0 setup and tutorial|Day 0 setup and tutorial]] and [[Community Data Science Workshops (Fall 2015)/Day 0 tutorial|Day 0 tutorial]]''
*http://arcadenw.org/article/changing-seattle
*http://www.seattlemag.com/article/demolitions-seattle-no-neighborhood-unaffected
*http://www.seattletimes.com/pacific-nw-magazine/seattles-building-boom-is-good-news-for-a-new-generation-of-workers/
 
=== Datasets ===
* Permits, last 5 years (today's dataset): https://data.seattle.gov/Permitting/Building-Permits-Current/mags-97de
* Permits older than 5 years: https://data.seattle.gov/Permitting/Building-Permits-Older-than-5-years/47eb-r92t
 
;Custom Socrata datasets
multifamily 2010-2015: https://data.seattle.gov/Permitting/Building-permits-new-multifamily-residential-const/ma3y-m69a
single and multifamily 2010-2015: https://data.seattle.gov/Permitting/Building-permits-new-residential-construction/kdfe-reh3
 
=== Sample visualizations ===
*Building permit charts: https://docs.google.com/spreadsheets/d/15DBcWnCroga4B1_ss66YjW9hlJxEk3g1UK7khFLpbQM/edit#gid=0
*Building permit Fusion table: https://www.google.com/fusiontables/DataSource?docid=1gm0wVqnK7zQ7hgp5bsKoBIetMZ75jo-VMs5noDoJ#rows:id=1
 
=== APIs ===
* Hurl.it API testing tool: https://www.hurl.it


;Sample API queries
;Sample API queries
Line 104: Line 55:
* Google maps location (using lat/long): http://maps.googleapis.com/maps/api/geocode/json?latlng=47.66979666%2C-122.38570052
* Google maps location (using lat/long): http://maps.googleapis.com/maps/api/geocode/json?latlng=47.66979666%2C-122.38570052


;Sample charts/maps
* Google Fusion table map of permits 2010-2015 [https://www.google.com/fusiontables/DataSource?docid=1aOSWYuXXqh7U2bnUsBH7mM2w3JfsYHzjk29ihbi8]


 
;Help resources and inspiration
=== Google tools ===
* About Google maps API: https://developers.google.com/maps/documentation/geocoding/#ReverseGeocoding
*Fusion tables: https://support.google.com/fusiontables/answer/2571232
* About Google Fusion tables: https://support.google.com/fusiontables/answer/2571232
*About Fusion table heatmaps: https://support.google.com/fusiontables/answer/1152262
* Google Fusion Tables mapmaking tutorial: https://support.google.com/fusiontables/answer/2527132?hl=en&topic=2573107&ctx=topic
*About Google maps geocoding API: https://developers.google.com/maps/documentation/geocoding/
* API sandbox tool: https://www.hurl.it
 
* API-powered app: http://web6.seattle.gov/mnm/
=== Socrata open data portal===  
*More about the Socrata open data portal and API: http://www.socrata.com/products/open-data-portal/
* Socrata API help resources: http://dev.socrata.com/consumers/getting-started.html
* Socrata API help resources: http://dev.socrata.com/consumers/getting-started.html
:* filtering results: http://dev.socrata.com/docs/filtering.html
:* filtering results: http://dev.socrata.com/docs/filtering.html
Line 118: Line 69:
:* writing API queries: http://dev.socrata.com/docs/queries.html
:* writing API queries: http://dev.socrata.com/docs/queries.html


;Data portal-powered apps
;Other data.seattle.gov datasets with neighborhood, timeseries, and/or location data
*https://www.seattleinprogress.com/
*[https://data.seattle.gov/Community/Seattle-Cultural-Space-Inventory/vsxr-aydq Seattle Cultural Space Inventory]
*http://web6.seattle.gov/mnm/
*[https://data.seattle.gov/Transportation/MTS-Trail-west-of-I-90-Bridge/u38e-ybnc MTS trail bike/ped traffic]
*[https://data.seattle.gov/Transportation/Burke-Gilman-Trail-north-of-NE-70th-St-Bike-and-Pe/2z5v-ecg8 Burke Gilman trail bike/ped traffic]
*[https://data.seattle.gov/Transportation/Road-Weather-Information-Stations/egc4-d24i Road temps in Seattle]
*[https://data.seattle.gov/Public-Safety/Seattle-Police-Department-911-Incident-Response/3k2p-39jp SPD 911 incident respose]


;Socrata API instructional videos
;Instructional videos
* https://data.seattle.gov/videos
* https://data.seattle.gov/videos
* https://www.youtube.com/watch?v=YlKzXTrTLOQ
* https://www.youtube.com/watch?v=YlKzXTrTLOQ
Line 128: Line 82:
* https://www.youtube.com/watch?v=Vd6bwz3ivVA
* https://www.youtube.com/watch?v=Vd6bwz3ivVA


;Some other government website that use the Socrata API
;Other Socrata sites that use this API
* https://data.austintexas.gov/
* https://data.austintexas.gov/
* https://data.cityofchicago.org/
* https://data.cityofchicago.org/
Please note that all contributions to CommunityData are considered to be released under the Attribution-Share Alike 3.0 Unported (see CommunityData:Copyrights for details). If you do not want your writing to be edited mercilessly and redistributed at will, then do not submit it here.
You are also promising us that you wrote this yourself, or copied it from a public domain or similar free resource. Do not submit copyrighted work without permission!

To protect the wiki against automated edit spam, we kindly ask you to solve the following CAPTCHA:

Cancel Editing help (opens in new window)