[Type text] [Type text] [Type text] Course Project Part One: Finding a

[Type text] [Type text] [Type text]
Course Project Part One: Finding a Data Set Relevant to Your Interests
Stat Resource & Subject: https://www.causeweb.org/tshs/surgery-timing/ ( Surgery Timing)
For this part of the course project, you are simply finding a data set. You want to find a data set you can use for the remainder of the course as you will be applying various statistical analyses to this data set.
It is strongly recommended that you find a data set that is of interest to you and that aligns with the type of data you would like to analyze.
Your data set must:
Have at least 30 observations/records.
Have at least 5 variables with at least 2 of the variables being quantitative and 2 being qualitative.
Be in Microsoft Excel format.
You can either:
Select a free, publicly available data set from the internet. There are many free, publicly available data sets. Possible resources include government data (such as demographic or energy data from various government agencies), globally recognized organizations (such as the World Bank), and publicly traded companies (most which post their financial data on their websites).
Select a data set from your own organization. For example, if you work in sales, finance, or supply chain and your company has a data set relevant to your job that you can work with, you can use that. Please note, if you select this option, you must get written permission from your manager to use the data set in this course, as potentially sensitive information could be revealed to other students and the instructor. It is your responsibility to scrub your data set of any sensitive information.
Your objective for this part of the project is to get the data set into your possession and save it on a computer you can easily access and that has Microsoft Excel installed.
Once you’ve found your data set, answer the following questions:
Your Response
Where did you find your data set?
What about this data set is of interest to you?
How many observations/records does your data set have?
Which variables in your data set are quantitative?
Which variables in your data set are qualitative?
Course Project Part Two: Writing Research Questions and Creating Graphs
Using your data set (see Course Project Part One, above):
Write three research questions, looking into variable comparisons that can be visually answered by using a pivot table with a side-by-side bar chart, a scatter diagram, and a side-by-side box plot.
Using Microsoft Excel, create the pivot table, side-by-side bar chart, scatter diagram, and side-by-side box plot.**
Use your graphs to communicate the results of your research questions.
You will now want to save this document and your Excel file in the same place on your computer. From this point forward, you’ll be working with both for this course project. In addition, you might find it useful to have both files open at the same time because you’ll often be working in them simultaneously.
Complete the following table:
Research Questions
Research Question 1:
Research Question 2:
Research Question 3:
Your Response
How do your pivot table, side-by-side bar chart, scatter diagram, and side-by-side box plot help you communicate the results of your research questions?
**Most versions of Microsoft Excel make it easy to create pivot tables, side-by-side bar charts, and scatter diagrams. However, only the most recent version of Excel has a built-in tool for creating box plots (also known as box and whisker plots). You can create box and whisker plots in earlier versions of Excel; it just takes a few extra steps. It is recommended you search YouTube for “Excel box and whisker plot” tutorials. Once you’ve created one or two by hand, you’ll find it really doesn’t take too long. In fact, YouTube is a great resource for virtually any Excel question you might have.
Course Project Part Three: Computing the Correlation between Two Quantitative Variables
Using your data set (see Course Project Part Three, above):
Choose two quantitative variables and compute the correlation between them.
Is it appropriate to find a line of best fit? If so, include that as well. If not, in the space below, explain why not.
Complete the following table if you did not include a line of best fit:
Your Response
Why was a line of best fit not appropriate for your data?
Course Project Part Four: Determining Confidence Intervals
Using your data set (see Course Project Part One, above):
Give appropriate confidence intervals for your variables and interpret the results.
Complete the following table:
Your Response
What confidence intervals did you select? Why?
How would you interpret the results?