Introduction
Your instructor will provide you with a data file that includes data on five variables:
SALES represents the number of sales made this week.
CALLS represents the number of sales calls made this week.
TIME represents the average time per call this week.
YEARS represents years of experience in the call center.
TYPE represents the type of training the employee received.
Part A:
Exploratory Data Analysis
Preparation
• Open the files for the course project and the data set.
• For each of the five variables, process, organize, present and summarize the data. Analyze each variable by itself using graphical and numerical techniques of summarization. Use Excel as much as possible, explaining what the results reveal. Some of the following graphs may be helpful:
stem-leaf diagram, frequency/relative frequency table, histogram, boxplot, dotplot, pie chart, bar graph. Caution: not all of these are appropriate for each of these variables, nor are they all necessary.
More is not necessarily better. In addition be sure to find the appropriate measures of central tendency, the measures of dispersion, and the shapes of the distributions (for the quantitative variables) for the above data. Where appropriate, use the five number summary (the Min, Q1, Median, Q3, Max). Once again, use Excel as appropriate, and explain what the results mean.
• Analyze the connections or relationships between the variables. There are ten possible pairings of two variables. Use graphical as well as numerical summary measures. Explain the results of the analysis. Be sure to consider all 10 pairings. Some variables show clear relationships, while others do not.
Report Requirements
• From the variable analysis above, provide the analysis and interpretation for three individual variables. This would include no more than 1 graph for each, one or two measures of central tendency and variability (as appropriate), the shapes of the distributions for quantitative variables, and two or three sentences of interpretation.
• For the 10 pairings, identify and report only on three of the pairings, again using graphical and numerical summary (as appropriate), with interpretations. note that at least one pairing must include a qualitative variable and at least one pairing must not include a qualitative variable.
• Prepare the report in Microsoft Word, integrating graphs and tables with text explanations and interpretations. Be sure to include graphical and numerical back up for the explanations and interpretations. Be selective in what is included in the report to meet the requirements of the report without extraneous information.
• All DeVry University policies are in effect, including the plagiarism policy.
• Project Part A report is due by the end of Week 2.
• Project Part A is worth 100 total points. See grading rubric below.
Submission:
The report, including all relevant graphs and numerical analysis along with interpretations
Format for report:
A. Brief Introduction
B. Discuss 1st individual variable, using graphical, numerical summary and interpretation
C. Discuss 2nd individual variable, using graphical, numerical summary and interpretation
D. Discuss 3rd individual variable, using graphical, numerical summary and interpretation
E. Discuss 1st pairing of variables, using graphical, numerical summary and interpretation
F. Discuss 2nd pairing of variables, using graphical, numerical summary and interpretation
G. Discuss 3rd pairing of variables, using graphical, numerical summary and interpretation
H. Conclusion
Part B: Hypothesis Testing and Confidence Intervals
Complete the following four hypotheses, using α = 0.05 for each. The week 5 spreadsheet can be used in these analyses.
1. Mean sales per week exceed 42.5 per salesperson
2. Proportion receiving online training is less than 55%
3 Mean calls made among those with no training is at least 145
4. Mean time per call is 14.7 minutes
• Using the same data set from part A, perform the hypothesis test for each speculation in order to see if there is evidence to support the manager’s belief. Use the Eight Steps of a Test of Hypothesis from Section 9.1 of your text book as a guide. You can use either the p-value or the critical values to draw conclusions. Be sure to explain your conclusion and interpret that to the claim in simple terms
• Compute 99% confidence intervals for the variables used in each hypothesis test, and interpret these intervals.
• Write a report about the results, distilling down the results in a way that would be understandable to someone who does not know statistics. Clear explanations and interpretations are critical.
• All DeVry University policies are in effect, including the plagiarism policy.
• Project Part B report is due by the end of Week 6.
• Project Part B is worth 100 total points. See grading rubric below.
Format for report:
A. Summary Report (about one paragraph on each of the four speculations)
B. Appendix with the calculations of the Eight Elements of a Test of Hypothesis, the p-values, and the confidence intervals. Include the Excel formulas or spreadsheet screen shots used in the calculations.
Final Project: Regression and Correlation Analysis
Use the dependent variable (labeled Y) and one of the independent variables (labeled X1, X2, and X3) in the data file. Select and use one independent variable throughout this analysis. Use Excel to perform the regression and correlation analysis to answer the following. The week
6 spreadsheet can be helpful in this work.
1. Generate a scatterplot for the specified dependent variable (Y) and the selected independent variable (X), including the graph of the “best fit” line. Interpret.
2. Determine the equation of the “best fit” line, which describes the relationship between the dependent variable and the selected independent variable.
3. Determine the correlation coefficient. Interpret.
4. Determine the coefficient of determination. Interpret.
5. Test the utility of this regression model by completing a hypothesis test of b=0 using α=0.10. Interpret results, including the p-value.
6. Based on the findings in steps 1-5, analyze the ability of the independent variable to predict the dependent variable.
7. Compute the confidence interval for b, using a 95% confidence level. Interpret this interval.
8. Compute the 99% confidence interval for the dependent variable, for a selected value of the independent variable. Each student can choose a value to use for the independent variable (use same value in the next step). Interpret this interval.
9. Using the same chosen value for part (8), estimate the 99% prediction interval for the dependent variable. Interpret this interval.
10. What can be said about the value of the dependent variable for values of the independent variable that are outside the range of the sample values? Explain.
11. Describe a business decision that could be made based on the results of this analysis. In other words, how might the business operations change based on these statistical results.
12. All DeVry University policies are in effect, including the plagiarism policy.
13. Final Project report is due by the end of Week 7.
14. Final Project is worth 130 total points. See grading rubric below.
Summarize your results from Steps 1-11 in a 3-page report. The report should explain and interpret the results in ways that are understandable to someone who does not know statistics.
Submission:
The Word document, summary report should be submitted for questions 1-11. The Excel output can be included as an appendix, if needed.
Format for report:
A. Summary Report
B. Steps 1-11 addressed with appropriate output, graphs and interpretations. Be sure to number each step 1-11.