Bikeshare Dataset Analysis Project
You will be tasked to answer analytical questions relevant to the topics covered in class
(i.e. Descriptive Statistics, inferential statistics such as confidence intervals, 1- and 2-
sample hypothesis testing, etc.) In this project, you will demonstrate your ability to
understand such topics, use technology to manipulate data and calculate statistics, and
ultimately communicate your findings in a professional manner. This will be facilitated
through a set of professionally formatted sections. (all in 1 excel document) Project Background
Bikeshare, also known as bike rental or bike sharing, is a system in which bicycles are
made available for shared use to individuals on a short-term basis. Typically, bikeshare
systems are designed to provide users with a convenient, affordable, and healthy mode
of transportation in urban areas.
In a typical bikeshare system, users can locate and rent bikes from self-service stations
located throughout the service area. The bikes are equipped with locks and can be
unlocked using a mobile app or a membership card. Users can ride the bikes for a
specified amount of time and then return them to any available station within the
system.
Bikeshare systems have become increasingly popular in recent years, particularly in
cities around the world. They offer a sustainable and efficient way to get around
congested urban areas, reduce traffic congestion, and promote physical activity.
This data set contains data and information for specific rides in a prior month in 2024 for
the city of Washington D.C. (this was the latest version of the data available)
All personally identifying information was removed from the data. A unique data
identifier called ride_id is used to differentiate specific records.
The overall goal will be to better understand the bikeshare market in Washington, D.C.
and for the student to accurately manipulate data, calculate statistics, run hypothesis
testing using excel, and communicate findings in an understandable way.
The dataset Capital Bikeshare Data.xlsx is comprised of 4 tabs. The first is Data
Dictionary which lists out the variables and gives their definitions. The second is data
which is the raw data that will be used for your analysis. The other two tabs are used
specifically for Section D and E respectively.
Variable Description
ride_id Unique numerical identifier for specific ride/rental
rideable_type
The type of bike used in the ride (“docked_bike”
“classic_bike” or “electric_bike”)
started_at The date and time that the bike rental started
day_of_week_start
The day of the week that the bike rental was initiated (1 =
Sunday, 2 = Monday, etc.)
ended_at The date and time that the bike rental ended
time_spent
The amount of time that passed between the start and
end of the rental (minutes)
start_station_name The name of the docking station where the ride started
start_station_id
The id number of the docking station where the ride
started.
end_station_name The name of the docking station where the ride ended
end_station_id
The id number of the docking station where the ride
ended.
start_lat The latitude where the ride started.
start_lng The longitude where the ride started.
end_lat The latitude where the ride ended.
end_lng The longitude where the ride ended.
distance The distance between the starting point and ending point.
Member_casual
The type of customer that is using the bike. (“member” or
“casual”)
Guidelines and Requirements
The following section is to be done in accordance with the Brightspace quiz called
“Bikeshare Data Project.” Please go to our Brightspace class, then to Assessments –
Quizzes and Exams – Bikeshare Data Project and Click Start Quiz.
Section A – Data Dictionary & Variable Identification
Bikeshare Dataset Analysis Project Business Statistics
Provide a data dictionary with the correct variable types identified. You will categorize
the variables in the data as (1) Qualitative or Quantitative, and (2) Nominal, Ordinal,
Interval, or Ratio level of measurement.
Complete this is in the Excel file and in the BrightSpace Quiz.
Section B – Descriptive & Visual Statistics
Give Descriptive Statistics in table format for the raw data set (“data” tab). The 5
variables that you will evaluate are rideable_type, day_of_week_start, time_spent,
distance, and member_casual. Make sure to include appropriate statistics given the
correct variable type (Quantitative or Qualitative)
a. Quantitative – Mean, Median, Mode, Standard Deviation, Variance, Range,
Count (“n”), 99% confidence interval (either Z-based or t-based)
b. Qualitative – Frequencies, Relative Frequencies
For each variable, give a relevant visualization. Describe notable characteristics of the
distributions such as shape of the quantitative distribution (skew) and other things of
note.
a. Quantitative – Histogram
b. Qualitative – Bar Chart or Pie Chart
Section C – One-sample hypothesis testing
Perform/Document 7-step hypothesis test to test if there is sufficient evidence that time
spent on a ride is less than 15 minutes. Assume ? = .01.
Section D – Two-sample hypothesis testing
Perform/Document 7-step hypothesis test to test if there is statistically significant
difference in the amount of time spent on a ride between members and casual riders.
Assume ? = .05.
Section E – ANOVA
Perform/Document 7-step hypothesis test to test if there is significant statistical
evidence that the time spent on a ride is dependent on the day of the week. Assume ? =
.01.
Section F – Correlation
Generate a scatterplot of the variables time_spent and distance. Also, generate the
correlation of these variables. (either by using the data analysis toolpak or an excel
function). Qualitatively describe the correlation of these variables. What might this
mean?
(*** OPTIONAL***) Section G – Dashboard
Bikeshare Dataset Analysis Project Business Statistics
Come up with 3 data visualizations or KPIs that you think would be applicable for a
manager at Capital Bikeshare to have on either a daily or weekly dashboard. Note that
one or more of these may be visualizations or statistics that you already have
calculated. Name the visualizations, explain why they should be on a dashboard for a
manager, and show them in an excel table called “Dashboard.”
Submit your .xlsx file to the assignment box by the due date. Note that the file should be
complete with all of the answers and excel functions live.
I strongly suggest you watch the walkthrough video that I have provided, and I strongly
encourage you to come to me with any questions you have. Do not delay in reaching
out with questions so that you are on the right track and give yourself the best chance
for success.
Bikeshare Dataset Analysis Project – Project Checklist Points
Available
Section A – Data Dictionary & Variable Identification
? Correctly answered statistical questions in Brightspace
30
Section B – Descriptive & Visual Statistics
? Included table(s) of all descriptive statistics correctly calculated
? Included appropriate visualizations and interpretation of variables
distributive shape/skewness
? Correctly answered statistical questions in Brightspace
45
Section C – One-Sample Hypothesis Testing
? Professionally and correctly displayed statistical output in excel
? Correctly answered statistical questions in Brightspace
35
Section D – Two-Sample Hypothesis Testing
? Professionally and correctly displayed statistical output in excel
? Correctly answered statistical questions in Brightspace
35
Section E – ANOVA
? Professionally and correctly displayed statistical output in excel
? Correctly answered statistical questions in Brightspace
35
Section F – Correlation
? Professionally and correctly displayed statistical output in excel
? Correctly answered statistical questions in Brightspace
20
Section G – Dashboard
? Professionally and correctly displayed dashboard in excel
? Correctly answered statistical questions in Brightspace
Collepals.com Plagiarism Free Papers
Are you looking for custom essay writing service or even dissertation writing services? Just request for our write my paper service, and we'll match you with the best essay writer in your subject! With an exceptional team of professional academic experts in a wide range of subjects, we can guarantee you an unrivaled quality of custom-written papers.
Get ZERO PLAGIARISM, HUMAN WRITTEN ESSAYS
Why Hire Collepals.com writers to do your paper?
Quality- We are experienced and have access to ample research materials.
We write plagiarism Free Content
Confidential- We never share or sell your personal information to third parties.
Support-Chat with us today! We are always waiting to answer all your questions.