Editing Statistics and Statistical Programming (Winter 2021)/Problem set 12
From CommunityData
== OpenIntro Questions ==
Complete the following exercises from OpenIntro §7: 7.12, 7.24, 7.26, 7.42, 7.44, 7.46
== Programming challenges (and statistical questions) ==
* Download the dataset by clicking through on the "Red Dye Number 40" link on [http://college.cengage.com/mathematics/brase/understandable_statistics/7e/students/datasets/owan/frames/frame.html this webpage]. You'll find that it's not in an ideal setup: it's an Excel file (XLS) with a series of columns labeled X1 through X4. Yikes! If you look at the website with the data and/or Table 1 in the paper, you should be able to figure out what each column stands for.
* Import the data into R and get to work on reshaping the dataset. I think a good format would be a data frame with two columns: <code>group</code> and <code>weeks_alive</code>.
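One possible way to do the reshaping is sketched below in base R. It is only an illustration: the data frame <code>wide</code> is a toy stand-in with invented numbers (not the real data), the group labels are hypothetical placeholders that you should replace once you have worked out what X1 through X4 mean, and it assumes that shorter columns are padded with <code>NA</code> after import. Reading the XLS file itself is left as a comment because the file path is up to you.

```r
# Importing is left to you, e.g. with the readxl package:
#   wide <- as.data.frame(readxl::read_excel("path/to/your_file.xls"))
# The path above is a placeholder, not a real file name.

# Toy stand-in for the imported spreadsheet. These numbers are invented
# for illustration and are NOT the real data.
wide <- data.frame(
  X1 = c(70, 77, 83, 87),
  X2 = c(49, 60, 63, NA),
  X3 = c(30, 37, 56, NA),
  X4 = c(34, 36, 48, 48)
)

# stack() collapses the columns into one long data frame with two columns:
# `values` (the numbers) and `ind` (a factor naming the source column).
long <- stack(wide)
names(long) <- c("weeks_alive", "group")

# Drop the NA padding rows and put the columns in the suggested order.
long <- long[!is.na(long$weeks_alive), c("group", "weeks_alive")]
rownames(long) <- NULL

# Placeholder labels: swap in the real group names for X1..X4.
levels(long$group) <- c("label_for_X1", "label_for_X2",
                        "label_for_X3", "label_for_X4")

str(long)
```

The result has one row per observation, which is the shape that functions such as <code>tapply()</code> and <code>aov()</code> expect.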
=== PC2. Summarize the data ===