Project Introduction In this project, you’ll have the opportunity to combine a lot of what you’ve learned throughout this entire program.
Project Introduction
In this project, you’ll have the opportunity to combine a lot of what you’ve learned throughout this entire program. This project may seem challenging, and there isn’t a lot of guidance provided, as we want you to have the freedom to be creative. That dataset that you are working with is real, and it is pretty big. When you are an analyst for a company, there often isn’t guidance on how to correctly answer a particular question. And there might not really be one right answer. Leverage your skills. Make sure you look through the directions carefully.
Project Description
Your goal in this project is to create visualizations to reveal insights from a data set. You will create data visualizations that tell a story or highlight patterns in the data set. Your work should be a reflection of the theory and practice of data visualization, such as visual encodings, design principles, and effective communication.
There are 3 different data sets you can choose from. (CHOOSE ONE ONLY)
- Flight Delays and Cancellations
- US Census Demographic Data
- Youtube Data from the US
1) Flight Delays and Cancellations
This data comes from a Kaggle dataset, it tracks the on-time performance of US domestic flights operated by large air carriers in 2015. You can find the dataset in supporting materials at the bottom of this page.
The file you must use in creating your data visualizations is the flights.csv
file. The other two provided files may be used in conjunction with the flights.csv file
, but should not be used alone.
You are required to create three visualizations. Some questions you may attempt to answer include those pertaining to the following areas:
- Which airlines or airports have the worst delays? Determine which destinations and arrival destinations have the most delays? Doing this using maps is actually pretty difficult, but you may choose an alternative visual to provide this information. Think about what kind of aggregates might work best to determine which airlines and airports are the best and worst in terms of delays.
- What causes delays? Think about if you work at an airline and you want to decrease delays. What part of the flight causes the most delays? Do these causes vary by airport or time of year?
- You can also come up with your own question! As you work with the data, come up with a question you’re curious about and can be answered from the data. Build a dashboard or story to answer your question and lead viewers to that answer.
2) US Census Demographic Data
This data comes from a Kaggle dataset, it includes the census data for all counties in 2015. You can find the dataset in supporting materials at the bottom of this page. Required dashboards
You are required to create three visualizations. Some questions you may attempt to answer include those pertaining to the following areas:
- Which states have the best transportation? This is a fairly subjective question, so your first job is to define what the best transportation is. Is it highest percentage of transit use? Is it lowest mean commute times. Then you need to determine how to aggregate the data from the county level to the state. Are there outlier counties affecting the data? How should you aggregate all the data from the counties to represent the state effectively? Please provide your reasoning in your report.
- How does income and poverty look across America? Think about how best to contrast this data to show an interesting finding. You can look across many of the different fields to show interesting findings. Do counties with more construction experience more or less poverty? Do counties near the coast experiene more or less income? Remember this is all correlation and not causation so we cannot say any one thing causes it but we can report descriptive statistics.
- You can also come up with your own question! As you work with the data, come up with a question you’re curious about and can be answered from the data. Build a dashboard or story to answer your question and lead viewers to that answer.
3) Youtube Data US
This data comes from a Kaggle dataset, it includes a bunch of information for videos that were trending for at least one day. This data set will require some cleaning in excel prior to Tableau. You can find the dataset in supporting materials at the bottom of this page.
You are required to create three visualizations. Some questions you may attempt to answer include those pertaining to the following areas:
- What tags have grow in popularity over time? (The tag field is very hard to work with, consider asking a different question. Reminder all of the questions here are suggestions, they are not required. There is an additional file with the tags transposed into a a single column that you can make use of as well. To join it in you will want to join it on channel title and title of the video) You will need to first figure out how to clean the data from having bunch of tag words all in one cell to a format where you can count the number of times the individual tag word is used each day, month, or year depending on how you are answering this question
- What categories are the most liked and disliked? To answer this question you will need to replace the numeric category IDs with the actual names which are in a separate file. Then you can looks at this question deeper. Perhaps there is a time of year where one category is preferred? Are there certain channels with an outlier amount of likes or dislikes?
- You can also come up with your own question! As you work with the data, come up with a question you’re curious about and can be answered from the data. Build a dashboard or story to answer your question and lead viewers to that answer.
REQUIREMENTS
Use the Project Rubric to review your project. If you are happy with your submission, then you are ready to submit! If you see room for improvement in any category in which you do not meet specifications, keep working!
Your project will be evaluated by a Udacity reviewer according to the same Project Rubric. Your project must “Meet Specifications” in each category in order for your submission to pass.
REMINDERS
Your visualization work should use Tableau: other visualization tools may not be evaluated by reviewers. Host your dashboards/stories on Tableau Public, you’ll be submitting them to reviewers and this makes them easily accessible.
Remember that your visualization should be explanatory in nature and communicate specific results that you want to show.
SUBMISSION
Ready to submit your project? Click on the “Submit Project” button or go back to your Udacity Home and click on the project, then follow the instructions to submit! Make sure you include the following files and information:
- A PDF or Markdown report that includes the following sections:
- Links to your dashboards or story
- You must submit url links for each of your visuals from Tableau Public. If you need a reminder on how to save to Tableau Public, please see the next concept.
- Summary: brief description of the visualization and the main story or findings conveyed
- Design: explain any design choices you made including changes to the visualization after collecting feedback
- Resources: list of Web sites, books, forums, blog posts, GitHub repositories etc that you referred to or used in this submission (Add N/A if you did not use such resources). Project rubric as follows :
Project: Telling Stories with Data
Visualization is Explanatory
Success Criteria Specifications Does the visualization have a clear finding? The visualization centers on a specific, clear finding in the data. Does the visualization focus on its finding? The selected finding is clearly communicated. Design choices foster communication between the reader and the visualization.Visualization does not add additional colors, shapes, or other design elements in an unnecessary way. Rather, each additional element should add to the insight being made. Design
Success Criteria Specifications Does the written summary reflect what a reader would interpret from the graphic? The written summary should include a brief description of the visualization and state at least one finding.A reader’s summary of the graphic would closely match the written summary in the writeup, and a reader is able to identify at least one main point or relationship that the graphic attempts to convey.To reiterate your report should include at least 3 sets of - Link(s) to your dashboards or story
- Summary: brief description of the visualization and the main story or findings conveyed (please include an insight you are able to make from the visualization)
- Design: explain any design choices you made including changes to the visualization after collecting feedback
- Resources: list of Web sites, books, forums, blog posts, GitHub repositories etc that you referred to or used in this submission (Add N/A if you did not use such resources).
Does the data visualization incorporate interaction or animation? The visualization includes interaction or animation. The inclusion of filters and additional variables shown in tool tip as appropriate within the visualization interaction are present.At minimum you are required to include a filter in one visualization and you are required to include a tool tip in at least one visualization. You should strive to include these anywhere where they would benefit your visualization. Are colors used appropriately? Color choices must accurately reflect the data and be chosen with accessibility in mind. For example, values that span from negative to positive numbers should be encoded with a diverging palette. Also, the color palettes should work for colorblindness. Are the plots appropriate for the data types? Line plots for sequences, bar charts for categorical variables, etc. Completeness
Success Criteria Specifications The three visualizations are provided. The three visualizations are included. These visualizations may be a single worksheet, but at least one must be a dashboard involving more than one worksheet. A dashboard counts as a single visualization. All visualizations must be clearly connected to a finding, and foster the interaction pieces (filters, colors, etc.) that allow for the finding to be found easily by a user.One Dashboard is required. A Dashboard is an option in Tableau that allows you to combine multiple charts into one page. This counts as 1 visualization.Two other unique visualizations are also required, These can be two single worksheets, two more dashboards, two more stories, or any combination of worksheet, dashboard, or story. Are links to each of the dashboards hosted on Tableau Public provided? The visuals need to be saved to Tableau Public and the links to those visuals must be provided in the report along with the finding for each visual.If you are unable to save to Tableau Public please include screenshots in your pdf report of each visualization. If you choose to use screenshots you should include at least one screenshot of your filters being used (a before and after picture of the visualization). For each visual, there is a written response as to the insight(s) gained from the visual. The insight(s) should be accurate and easily available from the filters and interactivity available in the visual. Across your three visuals, you should incorporate at least 3 different types of visuals. Each visual must be appropriate for the particular data type. However, you cannot submit three bar charts, or three line charts. You should have a minimum of at least three different types of visuals across all of your turned in items.3 Different types of charts required, here are some types you can choose - Bar Chart
- Line Chart
- Scatter Chart
- Histogram
- Bubble Chart
- Map
- Area Chart
- Pie Chart etc
Collepals.com Plagiarism Free Papers
Are you looking for custom essay writing service or even dissertation writing services? Just request for our write my paper service, and we'll match you with the best essay writer in your subject! With an exceptional team of professional academic experts in a wide range of subjects, we can guarantee you an unrivaled quality of custom-written papers.
Get ZERO PLAGIARISM, HUMAN WRITTEN ESSAYS
Why Hire Collepals.com writers to do your paper?
Quality- We are experienced and have access to ample research materials.
We write plagiarism Free Content
Confidential- We never share or sell your personal information to third parties.
Support-Chat with us today! We are always waiting to answer all your questions.