This page is not intended to make your life harder and more "tasky". Rather, the objective here is to help you remember key things that you need to know for your assignments. In the first one or two assignments, I will typically remind you of specific things you need to do. However, beyond that point, I will expect you to remember to adhere to this way of doing things. Some of these requirements are not simply good practice for this class, but rather, are good habits for doing data analysis in general.
You are not, of course, required to meet objectives until we discuss them in lecture. For example, if we have not yet discussed how to create a scatterplot, then I certainly don't expect you to worry about the specific requirements of scatterplots just yet!
I strongly recommend that you refer to this document before submitting any assignment. It's a great way to make sure you don't miss any obvious / easy points.
Submit as Google or Word Document: Your submissions should always be in DOCX (Word) or Google Doc format. Do not submit as PDF.
Always show your work: As mentioned in the past, I don't expect you to go into painful detail of every step, but the grader must have a decent idea of how you arrived at your answer.
Describing a distribution: When asked to describe a distribution, this should include at least one chart (typically a histogram or a boxplot -- and including both is okay), and descriptive statistics (center, spread, 5-number summary).
All graphs should be included in your document.
Include title and labels on graphs: When drawing a graph, you should always include a title, and at the label on at least one axis (histograms and boxplots), and for some graphs, such as scatterplots, you should have labels on both the x and y axes.
Create vectors in R: When you are provided with a series of values and expected to do things with them using R, you will almost certainly want to place those values inside an R vector.
Always include your R code in your assignments: In addition to providing numeric solutions from your R output, you should, of course, also include any R code you wrote to generate that output.
Be prepared for questions that can not be answered: I will be throwing these at you from time to time. I promise that my goal here is not to trick you! However, you must be aware that in the real world, people will often come up with statistical and mathematical conclusions where they are not appropriate. You will come to see that I am far more interested in pushing all of you to ensure that you grasp concepts, than I am in making you spend hours crunching numbers. Types of questions that can not be answered:
Real world application: As an educated citizen of the world, it is important to be able to interpret statistical scenarios without necessarily having expertise in the particular area. At some points in this and other assignments, you may be asked to consider some real-world ramifications of the calculations you are being asked to determine. Do not take these questions lightly as they may have right and wrong answers. The key is to spend at least a little bit of time thinking about them.