Graphical Analysis of Biological Data

Assignment Schedule

Here is the schedule of assignments. Links will appear after each assignment is released. Completed assignments are due by 23:55 on the scheduled due date.

Follow links in the Notes column for notes with background information. The Reading web pages will have a link to a PDF file with the same content.

Assignments are available in HTML and PDF formats. Both have the same content, so you only need to complete one. The Reading and Resources column links to sites you need to complete an assignment or to supporting information. The readings listed on the Readings and Assignments pages are the officially assigned readings that you must complete.

Note: Click on blue links in PDF files to visit those web sites.

Warning: I am still changing this schedule! Order and content are highly susceptible to change.

| Notes | Assignment | Due (23:55) | Reading and Resources |
| --- | --- | --- | --- |
| 01 Install software; Markdown | HTML / PDF | 2 Jul (R) | R4ds Chapter 1; R4ds Chapter 2; Markdown tutorial |
| 02 Learn R!; Assignment Intro (Pt. 0) | HTML / PDF | 5 Jul (Su) | Read this first! |
| 02 Basics (Pt. 1) | HTML / PDF | 5 Jul (Su) | Do first three parts! |
| 02 Vectors (Pt. 2) | HTML / PDF | 5 Jul (Su) | R4ds Chapter 4; R4ds Chapter 6; R4ds Chapter 8 |
| 02 Matrices (Pt. 3) | HTML / PDF | 5 Jul (Su) | Extra day for holiday weekend. |
| 02 Factors (Pt. 4) | HTML / PDF | 7 Jul (T) | Homework 2 cont’d |
| 02 Data Frames (Pt. 5) | HTML / PDF | 7 Jul (T) | |
| 02 Lists (Pt. 6) | HTML / PDF | 7 Jul (T) | |
| 03 RStudio; R Markdown; Git and GitHub | HTML / PDF | 9 Jul (R) | R4ds Chapter 4 (review); R4ds Chapter 26; R4ds Chapter 27; R4ds Chapter 29, sections 1-3; R Markdown tutorial; GitHub Tutorial |
| 04 Data Visualization | HTML / PDF | 12 Jul (Su) | R4ds Chapter 3 |
| 05 Data Visualization cont’d | HTML / PDF | 14 Jul (T) | |
| 06 Importing and Wrangling I | HTML / PDF | 16 Jul (R) | R4ds Chapter 10, sections 1-3; R4ds Chapter 11, sections 1-2, 5; R4ds Chapter 18, sections 1-3; R4ds Chapter 12, sections 1-3, 7; R4ds Chapter 5, sections 1-4; Data Import cheatsheet; Data Transformation cheatsheet |
| 07 Importing and Wrangling I cont’d | HTML / PDF | 19 Jul (Su) | |
| 08 Wrangling II | HTML / PDF | 21 Jul (T) | R4ds Chapter 5; Tidyverse Style Guide |
| 09 Wrangling II cont’d | HTML / PDF | 23 Jul (R) | |
| 10 Data Visualization II | HTML / PDF | 26 Jul (Su) | Zuur et al. (required!); Look at Data; Visualizing distributions; Cleveland dot plots; Scatter plots |
| 11 Data Visualization II cont’d | HTML / PDF | 28 Jul (T) | Zuur et al. |
| 12 Dates | HTML / PDF | 30 Jul (R) | R4ds Chapter 16; Lubridate cheatsheet |
| 13 Strings and things | HTML / PDF | 02 Aug (Su) | See notes and assignment. |
| 14 Maps | HTML / PDF | 04 Aug (T) | See notes and assignment. |
| 15 Final | HTML / PDF | 07 Aug (F) | Due date is the last day of the session. This will be a project that requires you to apply select skills learned throughout the course. You will receive minimal guidance and support from me. |

Logistics

Here is the general flow that will happen for the assignments.

  1. When an assignment is released, I will post a GitHub Issue in the Internal Discussion repo, tagging the students. You should also get an email notification unless you have disabled notifications, are not Watching the Internal Discussion repo, or did not give me your GitHub username.
  2. Create a new homework folder called hw## inside the folder on your computer that you created for this class. Replace the ## with the assignment number, e.g., hw01, hw02, etc. Use a separate folder for each assignment. By the end of the course, you should have folders labeled hw01 through hw16 (or whatever the final homework number ends up being).
  3. Download any .Rmd or other files associated with the assignment and place them in your new homework folder.
  4. Complete the assignment. Push your edits to your GitHub repository often while you work.
  5. After you have pushed the final version you want graded, go to the GABD Students GitHub page and open an issue in your repository. Add the @instructor tag and the Git SHA number for the version you want graded.
  6. I will grade the version of your completed assignment that matches the SHA. I will add any comments about your assignment as an issue in your GitHub repository, and post the grade in Moodle.
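
The folder naming in step 2 and the SHA lookup in step 5 can be sketched in code. This is only an illustration, written in Python rather than the R used in the course. The function names `homework_dir`, `create_homework_dir`, and `current_commit_sha` are hypothetical and not part of any course tooling. The one external command used is `git rev-parse HEAD`, which prints the SHA of the currently checked-out commit.

```python
import subprocess
from pathlib import Path


def homework_dir(course_dir, number):
    """Return the path of the folder for one assignment, e.g. hw01 or hw12."""
    if number < 1:
        raise ValueError("assignment numbers start at 1")
    # :02d zero-pads to two digits so hw02 sorts before hw10.
    return Path(course_dir) / f"hw{number:02d}"


def create_homework_dir(course_dir, number):
    """Create the assignment folder if it does not exist and return its path."""
    folder = homework_dir(course_dir, number)
    folder.mkdir(parents=True, exist_ok=True)
    return folder


def current_commit_sha(repo_dir):
    """Return the SHA of the current commit in repo_dir (requires git)."""
    result = subprocess.run(
        ["git", "rev-parse", "HEAD"],
        cwd=repo_dir,
        capture_output=True,
        text=True,
        check=True,
    )
    return result.stdout.strip()
```

For example, `create_homework_dir("gabd", 3)` would create `gabd/hw03`, where `gabd` stands in for whatever you named your class folder. Run `current_commit_sha` only after your final push, so the SHA you paste into the issue matches the version on GitHub.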

Evaluation

Components of each assignment will be graded on a 3-point scale. Here is the general rubric (also consult any specific guidance given in the relevant assignment itself).

A few assignments (exact number still to be determined) will include extra work that graduate and honors contract students must complete. Other students may attempt the extra work, but it will not count toward their grade.

| Topic | Excellent: 3 | Satisfactory: 2 | Needs work: 1 |
| --- | --- | --- | --- |
| Coding style | Student has gone beyond what was expected and required, coding manual is followed, code is well commented | Coding style lacks refinement and has some errors, but code is readable and has some comments | Many errors in coding style, little attention paid to making the code human readable |
| Coding strategy | Complicated problem broken down into sub-problems that are individually much simpler. Code is efficient, correct, and minimal. Code uses appropriate data structure (list, data frame, vector/matrix/array). Code checks for common errors | Code is correct, but could be edited down to leaner code. Some “hacking” instead of using suitable data structure. Some checks for errors. | Code tackles complicated problem in one big chunk. Code is repetitive and could easily be functionalized. No anticipation of errors. |
| Presentation: graphs | Graph(s) carefully tuned for desired purpose. One graph illustrates one point | Graph(s) well chosen, but with a few minor problems: inappropriate aspect ratios, poor labels. | Graph(s) poorly chosen to support questions. |
| Presentation: tables | Table(s) carefully constructed to make it easy to perform important comparisons. Careful styling highlights important features. | Table(s) generally appropriate but possibly some minor formatting deficiencies. | Table(s) with too many, or inconsistent, decimal places. Table(s) not appropriate for questions and findings. Major display problems. |
| Achievement, mastery, cleverness, creativity | Student has gone beyond what was expected and required, e.g., extraordinary effort, additional tools not addressed by this course, unusually sophisticated application of tools from course. | Tools and techniques from the course are applied very competently and, perhaps, somewhat creatively. Chosen task was acceptable, but fairly conservative in ambition. | Student does not display the expected level of mastery of the tools and techniques in this course. Chosen task was too limited in scope. |
| Ease of access for instructor, compliance with course conventions for submitted work | Access as easy as possible, code runs! | Satisfactory | Not an earnest effort to reduce friction and comply with conventions and/or code does not run |

Note: Scores of zero are possible if one or more parts of an assignment are not completed.

Warning: I will run your code on the original data. You may be tempted to modify the data file in Excel or another editor to “ease” importing and wrangling. That defeats the purpose of this course and a strength of R. If your code does not work on the original data, I will treat it as non-working.

Grades

Grades are mapped as follows:

| Grade | Percentage |
| --- | --- |
| 3.0 | 100% |
| 2.5 | 90% |
| 2.0 | 80% |
| 1.5 | 70% |
| 1.0 | 60% |
| 0.0 | 0% |

Intermediate grades are allowed and are interpolated linearly between adjacent points in the table. The full mapping is depicted in the following chart:

Undergraduates must have an average grade of 60% or better to pass this course. Graduate students must have an average grade of 70% or better to pass.
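
The grade mapping and pass thresholds above can be sketched as a piecewise-linear interpolation. This is a minimal illustration written in Python rather than the R used in the course, and it is not the instructor's actual grading code. The names `ANCHORS`, `score_to_percent`, and `passes` are hypothetical; the anchor points and thresholds are taken from the Grades section.

```python
# Anchor points from the Grades table: (rubric grade, percentage).
ANCHORS = [
    (0.0, 0.0),
    (1.0, 60.0),
    (1.5, 70.0),
    (2.0, 80.0),
    (2.5, 90.0),
    (3.0, 100.0),
]


def score_to_percent(score):
    """Map a grade on the 0-3 scale to a percentage by linear interpolation."""
    if not 0.0 <= score <= 3.0:
        raise ValueError("score must be between 0.0 and 3.0")
    # Find the pair of adjacent anchors that brackets the score.
    for (x0, y0), (x1, y1) in zip(ANCHORS, ANCHORS[1:]):
        if x0 <= score <= x1:
            return y0 + (score - x0) * (y1 - y0) / (x1 - x0)
    raise AssertionError("unreachable: anchors cover the whole range")


def passes(average_percent, graduate=False):
    """Apply the pass threshold: 60% for undergraduates, 70% for graduates."""
    threshold = 70.0 if graduate else 60.0
    return average_percent >= threshold
```

Under this sketch a grade of 2.25 falls halfway between 2.0 (80%) and 2.5 (90%), and a grade of 0.5 falls halfway between 0.0 (0%) and 1.0 (60%). Note how steep the segment below 1.0 is: each tenth of a point there is worth 6 percentage points, compared with 2 above it.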