Community Data Science Course (Spring 2023)/Week 2 lecture notes
From CommunityData
Resources[edit]
Lecture outline[edit]
Review Week 1 python material[edit]
- math: using python as a calculator
- addition, subtraction, multiplication, division
- division shows something different:
8/2
versus2*2
- type()
- there are different types of things in python (called objects)
- variables that "know about the decimal place" (int) and variables that don't (floats)
- variables that are strings.
- variables
- assignment of variables
- e.g., math with variables: scale up a recipe, into an assignment
- you can assign to a variable and it will replace the old value
- strings
- things within quotation marks
- adding strings with "concatenation" (smushing things together)
- e.g.,
print("Hello" + name)
- concatenating strings and integers doesn't work (e.g.,
print(1 + "tommy")
) - 1 is different than "1"; name is different than "name"
- convert "1" to 1 with the
int()
function - single quotes versus double quotes (python doesn't care)
- you can also multiply strings! (although it's not clear why you want to)
- The type of a variable tells you what you can do with it
- Booleans
- comparisons (e.g.,
1 == 1
or1 == 0
)- you can compare strings (case sensitive!)
- also >, <, and !=
- type() shows that the output of True or False is
bool
- e.g.,
"i" in "team"
- e.g.,
"i" not in "team"
- comparisons (e.g.,
- if/elif/else (move to external file)
- if, something that evaluates to a Boolean, and then colon
- e.g.,
if "tom" in "tommyguy"
- e.g., adding else example:
if brother_age > sister_age
- e.g., temperature range
- e.g., adding elif: fix the bug in the previous program if they were the same age
- indent with spaces (we use 4 spaces!)
Lists[edit]
- purpose
- Stores things in order
- initialization
- making a list called my list:
my_list = ["a", "b", "c"]
- comma separated elements. in python they can be a mix of any kind of types
type(my_list)
- making a list called my list:
- len() review
- accessing elements
- indexing like my_list[0]
- indexing starts from the front and we start counting at 0
- we go from the end with negative numbers
- what happens if we try to move outside of the range? ('error!)
- adding elements
- using the the
my_list.append()
function - the
.append()
function is a special kind of function that lists know about - compare to
print
which is a global function.
- using the the
- changing elements
- replacing elements like
my_list[0] = "foo"
- replacing elements like
- finding elements in list
- e.g.,
"z" in my_list
- e.g.,
- slicing lists
- the colon inside the [] is the slicing syntax
- e.g.,
my_list[0:2]
is 0th up to, but not including, the 2nd - e.g.,
my_list[2:]
- e.g.,
my_list[:2]
- e.g.,
my_list[:]
- strings are like lists
- we can slice lists
- len()
len("")
length of the empty string
- many other interesting functions for lists
- e.g.,
min()
andmax()
- e.g., create a list of names and sort it
names.sort()
- e.g., reverse a list with
names.reverse()
- search "python lists" in any search engine to see many more functions that list knows about.
- e.g.,
Optional Exercise[edit]
Figure out what the list function pop
does. Confirm with your neighbor. Write an example to be sure.
loops and more flow control[edit]
- for loops
- e.g.,
for name in names: print name
- e.g.,
for name in names: print 'hello ' + name
- Super powerful because it can do something many many times. Data science is about doing tedious things very quickly.
for
is the workhorse that makes this possible. - Look and see what name is after we're done looping.
- Move to editor.
- e.g.,
- if statements inside for loops
- e.g.,
if name[0] in "AEIOU"
then print "starts with a vowel" - show we can test things outside the loop to show how the comparisons are working
- add an else statement to capture words that start with a consonant
- create a counter within a for loop (keep track)
- e.g.,