Category | Excellent | Good | Needs work |
---|---|---|---|
Organization & Formatting | 5 All formatting guidelines are followed; YAML is correct with all team members listed. |
4 Most formatting guidelines are followed; YAML is correct with all team members listed. |
3 Several or all formatting guidelines not followed; YAML contains elements that aren't updated from the template. |
Research Question | 10 / 9 Research question is clear, focused, concise, complex, and arguable. |
8 / 7 / 6 Research question is reasonably clear and focused, but may be too simple, too complex, or too verbose. |
5 / 4 / 3 Research question is unclear and lacks focus; question is far too simple or overly complex. |
Data Sources | 10 / 9 Data sources or plausible data sources are clearly described; validity of and concerns about data are discussed. |
8 / 7 / 6 Identified data sources or plausible data sources are not clearly described; validity of and concerns about data are minimally discussed. |
5 / 4 / 3 Data sources are poorly described or missing; description of validity of and concerns about data are poor or missing. |
Anticipated Results | 10 / 9 Detailed description of: 1) two variables and their expected distribution; 2) two expected relationships; 3) two charts that help visualize expected relationships; and 4) how chosen variables will inform research questions. |
8 / 7 / 6 Detailed description of only one expected variable, relationship, or chart; minimal description of two expected variables, relationships, or charts; minimal description of links between variables and research question. |
5 / 4 / 3 Poor or missing descriptions of variables, relationships, or charts; poor or missing description between variables and research question. |
Technical things | 5 All code runs without errors; all files included in the submitted .zip file. |
4 Code has only one or two error, otherwise runs; all files included in the submitted .zip file. |
3 Code has multiple errors; submitted .zip file is missing components necessary to reproduce analysis. |
Project Proposal
Due: Sep 22 by 11:59pm
Weight: This assignment is worth 6% of your final grade.
Purpose: The purpose of the proposal is to clarify the specific research topic of interest for your team and to ensure that you are headed in a promising direction before spending hours analyzing data for your project.
Assessment: Your submission will be assessed using the rubric at the bottom of this page.
Write a proposal of an exploratory data analysis that you plan to conduct for your final project. The instructors will review and grade your proposal and provide feedback upon returning from spring break. If your proposal is approved, you are done and can move on towards the next project task. In some cases, the instructors may ask you to submit a revised proposal, most likely by focusing / adjusting the proposal scope and / or research question. Below is a list of specific items your proposal should include.
1. Get organized
Download and unzip this template for your proposal report. Open the project.Rproj
file and write your proposal in the report.qmd
file. The template comes with some text and code explaining how to use it - should should delete this code as it is only for explanatory purposes. Be sure to adjust the content in the YAML:
- Write your project title in the
title
field (and provide a subtitle if you wish, or delete thesubtitle
field). - In the
author
field, list the names of all teammates, e.g.author: Luke Skywalker, Leia Organa, and Han Solo
.
2. Identify a topic and research question(s)
Once you have identified a topic for your project, write a clear research question that you plan to answer with your analysis. You may choose to have more than one question, or one main question with multiple sub-questions. Follow these guidelines. Your question(s) should be:
- Clear: it provides enough specifics that your audience can easily understand its purpose without needing additional explanation.
- Focused: it is narrow enough that it can be addressed thoroughly with the data available and within the limits of the final project report.
- Concise: it is expressed in the fewest possible words.
- Complex: it is not answerable with a simple “yes” or “no,” but rather requires synthesis and analysis of data.
- Arguable: its potential answers are open to debate rather than accepted facts.
Be sure to avoid making causal statments in your research question, such as “how does XX affect YY?” This is a causal question, and causal questions are not appropriate for this course. Instead, we are exploring a topic and looking for patterns in the data. We may find that some things are related, but we cannot say that one thing causes another.
Alternative project ideas: While most teams choose to conduct a analysis and write a report on their findings, there are two other options for your project:
- Replication study: While most teams choose a topic with questions that we don’t yet have answers to, it is also very valuable to conduct replication studies of existing research. If you choose to do this, discuss the original research that you aim to replicate, and note that you still have to look up your own data sources and write your own code.
- Dashboard project: A written analysis is the typical final project output, but in some contexts a shiny dashboard can be just as valuable. If you choose to do this, discuss the purpose of the dashboard you have in mind and the data that you plan to include in it. You will still be required to write a report, but the report will motivate the dashboard and key takeaways that can be seen in the dashboard.
3. Write an introduction
Introduce your topic and clearly state your research question(s). Motivate why this topic is interesting, important, etc., and motivate the significance of your research question(s). Feel free to use examples, charts, images, etc. to help motivate your points.
4. Discuss your data sources
Discuss the data source(s) you plan to use for your analysis. Follow these guidelines:
- If you have already identified the source(s), describe them and include urls and / or references to the sources.
- If you have not identified a source yet, describe the data you hope to use, and give at least one plausible source that may have the data (regardless of whether the source makes it available or not).
- Discuss the validity of the source(s). For each data source, is the data the original data, or has it been pre-processed by someone else? How was the original data collected and by whom? If you do not yet have a source, discuss what concerns you have about a plausible source that might have the data.
5. Describe anticipated results
- Choose two variables that you expect to find in your data that are relevant to your research question.
- Regardless of whether those variables actually exist, describe how you would expect each to be distributed (e.g. unimodal, multimodel, tightly-group, widely-spread, etc.). For example, you might expect the price of gasoline over a particular period to be rather tightly-grouped around a mean, whereas the stock price of a particular company might vary much more widely over the same period.
- Describe two relationships you expect to find among variables in your data (again, regardless of whether they actually exist in your data). For example, you might expect sales of hybrid vehicles to increase when gas prices increase; in this case, I am expecting that hybrid vehicle sales and gasoline prices are positively correlated.
- Describe at least two charts that you expect will help you visualize the relationships that you expect to find. For example, a scatterplot of gasoline prices and hybrid vehicle sales over a particular time period might be useful for visualizing the level of correlation between these two variables.
- Discuss how your expectations about the variables you chose will help inform you about your research question.
6. Attribution
Include a short description of the specific contributions of each team member in your team. Here is an example:
- Student A wrote the initial proposal draft.
- Student B identified data sources.
- Student C helped organize project files and edited the final draft of the proposal.
If all members contributed equally, you can just put the single sentence “All members contributed equally”.
Team members who do not make meaningful contributions to their projects will not receive the same grade as that of their team mates. If you are having any issues or disputes among team members, please contact Professor Helveston ASAP so we can find a resolution.
7. Render and submit
Click the “Render” button to compile your .qmd
file into a html web page. Then open the report.html
file in a web browser and proofread your report.
Does all of the formatting look correct? Make sure there are no errors in the rendered file before submitting it.
Once you’ve proofread your report, create a zip file of all the files in your R project folder for this assignment and submit it on the corresponding assignment submission on Blackboard.
Only one person from your team should submit.