Assignment: Calculating Analysis of Variance and Post Hoc Analyses Following ANOVA
Assignment: Calculating Analysis of Variance and Post Hoc Analyses Following ANOVA Assignment: Calculating Analysis of Variance and Post Hoc Analyses Following ANOVA Analysis of variance (ANOVA) is a statistical procedure that compares data between two or more groups or conditions to investigate the presence of differences between those groups on some continuous dependent variable (see Exercise 18). In this exercise, we will focus on the one-way ANOVA, which involves testing one independent variable and one dependent variable (as opposed to other types of ANOVAs, such as factorial ANOVAs that incorporate multiple independent variables). Why ANOVA and not a t-test? Remember that a t-test is formulated to compare two sets of data or two groups at one time (see Exercise 23 for guidance on selecting appropriate statistics). Thus, data generated from a clinical trial that involves four experimental groups, Treatment 1, Treatment 2, Treatments 1 and 2 combined, and a Control, would require 6 t-tests. Consequently, the chance of making a Type I error (alpha error) increases substantially (or is inflated) because so many computations are being performed. Specifically, the chance of making a Type I error is the number of comparisons multiplied by the alpha level. Thus, ANOVA is the recommended statistical technique for examining differences between more than two groups (Zar, 2010). ANOVA is a procedure that culminates in a statistic called the F statistic. It is this value that is compared against an F distribution (see Appendix C) in order to determine whether the groups significantly differ from one another on the dependent variable. The formulas for ANOVA actually compute two estimates of variance: One estimate represents differences between the groups/conditions, and the other estimate represents differences among (within) the data. ORDER YOUR PROFESSIONAL PAPER HERE ? Assignment: Calculating Analysis of Variance and Post Hoc Analyses Following ANOVA? Research Designs Appropriate for the One-Way ANOVA Research designs that may utilize the one-way ANOVA include the randomized experimental, quasi-experimental, and comparative designs (Gliner, Morgan, & Leech, 2009). The independent variable (the ?grouping? variable for the ANOVA) may be active or attributional. An active independent variable refers to an intervention, treatment, or program. An attributional independent variable refers to a characteristic of the participant, such as gender, diagnosis, or ethnicity. The ANOVA can compare two groups or more. In the case of a two-group design, the researcher can either select an independent samples t-test or a one-way ANOVA to answer the research question. The results will always yield the same conclusion, regardless of which test is computed; however, when examining differences between more than two groups, the one-way ANOVA is the preferred statistical test. Example 1: A researcher conducts a randomized experimental study wherein she randomizes participants to receive a high-dosage weight loss pill, a low-dosage weight loss pill, or a placebo. She assesses the number of pounds lost from baseline to post-treatment 378for the three groups. Her research question is: ?Is there a difference between the three groups in weight lost?? The independent variables are the treatment conditions (high-dose weight loss pill, low-dose weight loss pill, and placebo) and the dependent variable is number of pounds lost over the treatment span. Null hypothesis: There is no difference in weight lost among the high-dose weight loss pill, low-dose weight loss pill, and placebo groups in a population of overweight adults. Example 2: A nurse researcher working in dermatology conducts a retrospective comparative study wherein she conducts a chart review of patients and divides them into three groups: psoriasis, psoriatric symptoms, or control. The dependent variable is health status and the independent variable is disease group (psoriasis, psoriatic symptoms, and control). Her research question is: ?Is there a difference between the three groups in levels of health status?? Null hypothesis: There is no difference between the three groups in health status. Statistical Formula and Assumptions Use of the ANOVA involves the following assumptions (Zar, 2010): Sample means from the population are normally distributed. The groups are mutually exclusive. The dependent variable is measured at the interval/ratio level. The groups should have equal variance, termed ?homogeneity of variance.? All observations within each sample are independent. The dependent variable in an ANOVA must be scaled as interval or ratio. If the dependent variable is measured with a Likert scale and the frequency distribution is approximately normally distributed, these data are usually considered interval-level measurements and are appropriate for an ANOVA (de Winter & Dodou, 2010; Rasmussen, 1989). The basic formula for the F without numerical symbols is: F=Mean Square Between GroupsMean Square Within Groups The term ?mean square? (MS) is used interchangeably with the word ?variance.? The formulas for ANOVA compute two estimates of variance: the between groups variance and the within groups variance. The between groups variance represents differences between the groups/conditions being compared, and the within groups variance represents differences among (within) each groups data. Therefore, the formula is F = MS between/MS within. Hand Calculations Using an example from a study of students enrolled in an RN to BSN program, a subset of graduates from the program were examined (Mancini, Ashwill, & Cipher, 2014). The data are presented in Table 33-1. A simulated subset was selected for this example so that 379the computations would be small and manageable. In actuality, studies involving one-way ANOVAs need to be adequately powered (Aberson, 2010; Cohen, 1988). See Exercises 24 and 25 for more information regarding statistical power. TABLE 33-1 MONTHS FOR COMPLETION OF RN TO BSN PROGRAM BY HIGHEST DEGREE STATUS Participant #AssociatesParticipant #BachelorsParticipant #MastersDegreeDegreeDegree117101619172191115202132412162120418131222215241416231262415122416716161625208161712261892018102712 The independent variable in this example is highest degree obtained prior to enrollment (Associates, Bachelors, or Masters degree), and the dependent variable was number of months it took for the student to complete the RN to BSN program. The null hypothesis is ?There is no difference between the groups (highest degree of Associates, Bachelors, or Masters) in the months these nursing students require to complete an RN to BSN program.? The computations for the ANOVA are as follows: Step 1: Compute correction term, C. Square the grand sum (G), and divide by total N: C=460 2 27 =7,837.04 Step 2: Compute Total Sum of Squares. Square every value in dataset, sum, and subtract C: (17 2 +19 2 +24 2 +18 2 +24 2 +16 2 +16 2 +?+12 2 )??7,837.04=8,234??7,837.04=396.96 Step 3: Compute Between Groups Sum of Squares. Square the sum of each column and divide by N. Add each, and then subtract C: 178 2 9 +125 2 9 +157 2 9 ??7,837.04(3,520.44+1,736.11+2,738.78)??7,837.04=158.29 Step 4: Compute Within Groups Sum of Squares. Subtract the Between Groups Sum of Squares (Step 3) from Total Sum of Squares (Step 2): 396.96??158.29=238.67 380 Step 5: Create ANOVA Summary Table (see Table 33-2). a. Insert the sum of squares values in the first column. b. The degrees of freedom are in the second column. Because the F is a ratio of two separate statistics (mean square between groups and mean square within groups) both have different df formulas?one for the ?numerator? and one for the denominator: Mean square between groupsdf=number of groups??1 Mean square within groups df=N-number of groups For this example, thedffor the numerator is 3??1=2. Thedffor the denominator is 27??3=24. c. The mean square between groups and mean square within groups are in the third column. These values are computed by dividing the SS by the df. Therefore, the MS between = 158.29 ÷ 2 = 79.15. The MS within = 238.67 ÷ 24 = 9.94. d. The F is the final column and is computed by dividing the MS between by the MS within. Therefore, F = 79.15 ÷ 9.94 = 7.96. TABLE 33-2 ANOVA SUMMARY TABLE Source of Variation SSdfMSF Between Groups158.29279.157.96Within Groups238.67249.94 Total396.9626 Step 6: Locate the critical F value on the F distribution table (see Appendix C) and compare it to our obtained F = 7.96 value. The critical F value for 2 and 24 df at α = 0.05 is 3.40, which indicates the F value in this example is statistically significant. Researchers report ANOVA results in a study report using the following format: F(2,24) = 7.96, p < 0.05. Researchers report the exact p value instead of ?p < 0.05,? but this usually requires the use of computer software due to the tedious nature of p value computations. Our obtained F = 7.96 exceeds the critical value in the table, which indicates that the F is statistically significant and that the population means are not equal. Therefore, we can reject our null hypothesis that the three groups spent the same amount of time completing the RN to BSN program. However, the F does not indicate which groups differ from one another, and this F value does not identify which groups are significantly different from one another. Further testing, termed multiple comparison tests or post hoc tests, is required to complete the ANOVA process and determine all the significant differences among the study groups. Post Hoc Tests Post hoc tests have been developed specifically to determine the location of group differences after ANOVA is performed on data from more than two groups. These tests were developed to reduce the incidence of a Type I error. Frequently used post hoc tests are the Newman-Keuls test, the Tukey Honestly Significant Difference (HSD) test, the Scheffé test, and the Dunnett test (Zar, 2010; see Exercise 18 for examples). When these tests are 381calculated, the alpha level is reduced in proportion to the number of additional tests required to locate statistically significant differences. For example, for several of the aforementioned post hoc tests, if many groups mean values are being compared, the magnitude of the difference is set higher than if only two groups are being compared. Thus, post hoc tests are tedious to perform by hand and are best handled with statistical computer software programs. Accordingly, the rest of this example will be presented with the assistance of SPSS. SPSS Computations The following screenshot is a replica of what your SPSS window will look like. The data for ID numbers 24 through 27 are viewable by scrolling down in the SPSS screen. 382 Step 1: From the ?Analyze? menu, choose ?Compare Means? and ?One-Way ANOVA.? Move the dependent variable, Number of Months to Complete Program, over to the right, as in the window below. Step 2: Move the independent variable, Highest Degree at Enrollment, to the right in the space labeled ?Factor.? Step 3: Click ?Options.? Check the boxes next to ?Descriptive? and ?Homogeneity of variance test.? Click ?Continue? and ?OK.? 383 ORDER YOUR PROFESSIONAL PAPER HERE Interpretation of SPSS Output The following tables are generated from SPSS. The first table contains descriptive statistics for months to completion, separated by the three groups. The second table contains the Levenes test of homogeneity of variances. The third table contains the ANOVA summary table, along with the F and p values. The first table displays descriptive statistics that allow us to observe the means for the three groups. This table is important because it indicates that the students with an Associates degree took an average of 19.78 months to complete the program, compared to 13.89 months for students with a Bachelors and 17.44 months for students with a Masters degree. One Way The second table contains the Levenes test for equality of variances. The Levenes test is a statistical test of the equal variances assumption. The p value is 0.488, indicating there was no significant difference among the three groups variances; thus, the data have met the equal variances assumption for ANOVA. The last table contains the contents of the ANOVA summary table, which looks much like Table 33-2. This table contains an additional value that we did not compute by hand?the exact p value, which is 0.002. Because the SPSS output indicates that we have a significant ANOVA, post hoc testing must be performed. 384 Return to the ANOVA window and click ?Post Hoc.? You will see a window similar to the one below. Select the ?LSD? and ?Tukey? options. Click ?Continue? and ?OK.? The following output is added to the original output. This table contains post hoc test results for two different tests: the LSD (Least Significant Difference) test and the Tukey HSD (Honestly Significant Difference) test. The LSD test, the original post hoc test, explores all possible pairwise comparisons of means using the equivalent of multiple t-tests. However, the LSD test, in performing a set of multiple t-tests, reports inaccurate p values that have not been adjusted for multiple computations (Zar, 2010). Consequently, researchers should exercise caution when choosing the LSD post hoc test following an ANOVA. The Tukey HSD comparison test, on the other hand, is a more ?conservative? test, meaning that it requires a larger difference between two groups to indicate a significant difference than some of the other post hoc tests available. By requiring a larger difference between the groups, the Tukey HSD procedure yields more accurate p values of 0.062 to reflect the multiple comparisons (Zar, 2010). 385 Post Hoc Tests Observe the ?Mean Difference? column. Any difference noted with an asterisk (*) is significant at p < 0.05. The p values of each comparison are listed in the ?Sig.? column, and values below 0.05 indicate a significant difference between the pair of groups. Observe the p values for the comparison of the Bachelors degree group versus the Masters degree group. The Tukey HSD test indicates no significant difference between the groups, with a p of 0.062; however, the LSD test indicates that the groups significantly differed, with a p of 0.025. This example enables you see the difference in results obtained when calculating a conservative versus a lenient post hoc test. However, it should be noted that because an a priori power analysis was not conducted, there is a possibility that these analyses are underpowered. See Exercises 24 and 25 for more information regarding the consequences of low statistical power. Assignment: Calculating Analysis of Variance and Post Hoc Analyses Following ANOVA Final Interpretation in American Psychological Association (Apa) Format The following interpretation is written as it might appear in a research article, formatted according to APA guidelines (APA, 2010). A one-way ANOVA performed on months to program completion revealed significant differences among the three groups, F(2,24) = 7.96, p = 0.002. Post hoc comparisons using the Tukey HSD comparison test indicated that the students in the Associates degree group took significantly longer to complete the program than the students in the Bachelors degree group (19.8 versus 13.9 months, respectively) (APA, 2010). However, there were no significant differences in program completion time between the Associates degree group and the Masters degree group or between the Bachelors degree group and the Masters degree group. Assignment: Calculating Analysis of Variance and Post Hoc Analyses Following ANOVA 386 Study Questions Is the dependent variable in the Mancini et al. (2014) example normally distributed? Provide a rationale for your answer. What are the two instances that must occur to warrant post hoc testing following an ANOVA? Do the data in this example meet criteria for homogeneity of variance? Provide a rationale for your answer. What is the null hypothesis in the example? What was the exact likelihood of obtaining an F value at least as extreme as or as close to the one that was actually observed, assuming that the null hypothesis is true? Do the data meet criteria for ?mutual exclusivity?? Provide a rationale for your answer. 387 What does the numerator of the F ratio represent? What does the denominator of the F ratio represent? How would our final interpretation of the results have changed if we had chosen to report the LSD post hoc test instead of the Tukey HSD test? Was the sample size adequate to detect differences among the three groups in this example? Provide a rationale for your answer. Assignment: Calculating Analysis of Variance and Post Hoc Analyses Following ANOVA 388 Answers to Study Questions Yes, the data are approximately normally distributed as noted by the frequency distribution generated from SPSS, below. The Shapiro-Wilk (covered in Exercise 26) p value for months to completion was 0.151, indicating that the frequency distribution did not significantly deviate from normality. The two instances that must occur to warrant post hoc testing following an ANOVA are (1) the ANOVA was performed on data comparing more than two groups, and (2) the F value is statistically significant. Yes, the data met criteria for homogeneity of variance because the Levenes test for equality of variances yielded a p of 0.488, indicating no significant differences in variance between the groups. The null hypothesis is: ?There is no difference between groups (Associates, Bachelors, and Masters degree groups) in months until completion of an RN to BSN program.? The exact likelihood of obtaining an F value at least as extreme as or as close to the one that was actually observed, assuming that the null hypothesis is true, was 0.2%. Yes, the data met criteria for mutual exclusivity because a student could only belong to one of the three groups of the highest degree obtained prior to enrollment (Associate, Bachelors, and Masters degree). The numerator represents the between groups variance or the differences between the groups/conditions being compared. The denominator represents within groups variance or the extent to which there is dispersion among the dependent variables. The final interpretation of the results would have changed if we had chosen to report the LSD post hoc test instead of the Tukey HSD test. The results of the LSD test indicated that the 389students in the Masters degree group took significantly longer to complete the program than the students in the Bachelors degree group (p = 0.025). The sample size was most likely adequate to detect differences among the three groups overall because a significant difference was found, p = 0.002. However, there was a discrepancy between the results of the LSD post hoc test and the Tukey HSD test. The difference between the Masters degree group and the Bachelors degree group was significant according to the results of the LSD test but not the Tukey HSD test. Therefore, it is possible that with only 27 total students in this example, the data were underpowered for the multiple comparisons following the ANOVA. Assignment: Calculating Analysis of Variance and Post Hoc Analyses Following ANOVA 390 Data for Additional Computational Practice for Questions to be Graded Using the example from Ottomanelli and colleagues (2012) study, participants were randomized to receive Supported Employment or treatment as usual. A third group, also a treatment as usual group, consisted of a nonrandomized observational group of participants. A simulated subset was selected for this example so that the computations would be small and manageable. The independent variable in this example is treatment group (Supported Employment, Treatment as Usual?Randomized, and Treatment as Usual?Observational/Not Randomized), and the dependent variable was the number of hours worked post-treatment. Supported employment refers to a type of specialized interdisciplinary vocational rehabilitation designed to help people with disabilities obtain and maintain community-based competitive employment in their chosen occupation (Bond, 2004). The null hypothesis is: ?There is no difference between the treatment groups in post-treatment number of hours worked among veterans with spinal cord injuries.? Assignment: Calculating Analysis of Variance and Post Hoc Analyses Following ANOVA Compute the ANOVA on the data in Table 33-3 below. ORDER Assignment: Calculating Analysis of Variance and Post Hoc Analyses Following ANOVA? TABLE 33-3 POST-TREATMENT HOURS WORKED BY TREATMENT GROUP Participant #Supported EmploymentParticipant #TAU ObservationalParticipant #TAU Randomized186151125297181228315891335417918143052410161515 ?TAU? = Treatment as Usual. 391 EXERCISE 33 Questions to Be Graded Name: ___________________________________________ Class: _________ Date: _______________________________________________________________________ Follow your instructors directions to submit your answers to the following questions for grading. Your instructor may ask you to write your answers below and submit them as a hard copy for grading. Alternatively, your instructor may ask you to use the space below for notes and submit your answers online at http://evolve.elsevier.com/Grove/statistics/ under ?Questions to Be Graded.? Do the data meet criteria for homogeneity of variance? Provide a rationale for your answer. If calculating by hand, draw the frequency distribution of the dependent variable, hours worked at a job. What is the shape of the distribution? If using SPSS, what is the result of the Shapiro-Wilk test of normality for the dependent variable? What are the means for three groups hours worked on a job? What are the F value and the group and error df for this set of data? Assignment: Calculating Analysis of Variance and Post Hoc Analyses Following ANOVA 392 Is the F significant at α = 0.05? Specify how you arrived at your answer. If using SPSS, what is the exact likelihood of obtaining an F value at least as extreme as or as close to the one that was actually observed, assuming that the null hypothesis is true? Which group worked the most weekly job hours post-treatment? Provide a rationale for your answer. Write your interpretation of the results as you would in an APA-formatted journal. Is there a difference in your final interpretation when comparing the results of the LSD post hoc test versus Tukey HSD test? Provide a rationale for your answer. If the researcher decided to combine the two Treatment as Usual groups to represent an overall ?Control? group, then there would be two groups to compare: Supported Employment versus Control. What would be the appropriate statistic to address the difference in hours worked between the two groups? Provide a rationale for your answer. I just need help in question 5th and 6th of questions to be graded, exercise 33. Assignment: Calculating Analysis of Variance and Post Hoc Analyses Following ANOVA Order Now
ADDITIONAL INFORMATION
Calculating Analysis of Variance and Post Hoc Analyses
Introduction
Let’s say we’ve got a bunch of data. We’re trying to find out whether there is any relationship between two variables (like age and salary). How do we go about doing this? Well, it helps if we have a way of assessing whether there really is an effect at all—that is, if people in different groups are statistically different from each other on some variable. The term “analysis of variance” (ANOVA) is a clumsy name for an approach that analyzes different types of data by comparing them with one another under various conditions. So let’s take a look at how ANOVAs work by first explaining what they are and then exploring the three main situations in which one might use them: across individuals; within individuals; or both simultaneously within individuals or across groups/individuals together
The term “analysis of variance” (ANOVA) is a clumsy name for an approach that analyzes different types of data
The term “analysis of variance” (ANOVA) is a clumsy name for an approach that analyzes different types of data. This can be confusing, especially if you’re new to statistics, but it’s important to understand how this technique works so that you can use it effectively in your own research projects.
The general idea behind ANOVA is simple: you want to see whether there are significant differences between your groups—that is, whether they’re different from each other in some way or another. You might ask yourself whether one group has higher scores than another or if they have similar scores across all tests (why would we care?). In short: why do we care?
There are several different types of ANOVAs depending on the type of variable being analyzed and how many variables need testing at once (one variable per group). Here’s a quick overview:
An ANOVA uses an F statistic.
An ANOVA uses an F statistic. This is a ratio of two variances, and it’s calculated by dividing the variance between groups by the variance within groups. If you’re familiar with statistical jargon, this might look like this:
-
An F-test is used to test whether one or more treatments differ from each other in their effect on response (i.e., if they differ from each other at least statistically).
A key requirement for ANOVAs is that the groups being compared are independent of each other.
The first thing to know about ANOVAs is that they are not the same as post hoc analyses. Post hoc means “after” or “afterwards,” and it refers to the way you analyze data after an experiment has been done. In contrast, ANOVA refers specifically to analysis of variance (ANOVA), which compares two or more groups by performing statistical tests on their results and then drawing conclusions about what happened in each group based on those results.
When conducting an experiment using multiple groups, there are three main requirements for conducting an effective ANOVA:
-
Groups must be independent of each other; if they’re not independent but related instead (i.e., all your subjects come from one school), then you can’t use this method because it won’t work properly with them being related instead of independent.*
There are three main situations in which one might do an ANOVA.
There are three main situations in which one might do an ANOVA. The first is to compare three or more groups, as in this example:
-
Treatment Group B: 20 people who receive experimental treatment A (a drug) and 30 people who receive placebo treatment C (a pill that looks like the drug but has no active ingredients). The control group is made up of 30 people who receive another pill with no active ingredients.
-
Treatment Group A: 10 people who are given experimental treatment C and 10 people who are given placebo treatment C. The control group consists of 10 members of each group.
The F statistic tells us whether to reject the null hypothesis.
The F statistic is the ratio of the variance of the groups to the variance within the groups. It’s calculated by dividing one group’s sum of squares by another group’s sum of squares, then squaring it. If you’re familiar with standard deviations, this means that your F statistic will have a value between 0 and 1 (or if we take our example above, between 0 and 4). In most cases, you’ll want to compare your result against a critical value in order to determine whether or not it exceeds 1; if so, we can reject our null hypothesis and conclude that there is evidence against it being true.
Once we’ve run our analysis, we may have set ourselves up for a post hoc analysis.
If you’ve done your analysis, and it’s time to see what’s different between the groups. Post hoc tests can help us do that.
Post hoc analysis is a set of tests that are done after the ANOVA (one-way or two-way). It allows us to find out which groups are significantly different from each other. The purpose of post hoc tests is to find out what’s different between the groups; they’re used when we have more than one sample size and want to compare them by looking at their mean values and standard deviations instead of using variances alone (which isn’t as informative).
Researchers use ANOVAs to assess whether groups are different from each other using an F statistic, but they don’t tell us exactly what’s different.
The F statistic tells us whether to reject the null hypothesis. If your F statistic is small, then you can say that there is not enough evidence to reject H0 (the null hypothesis). You may think that groups are different from each other and want to see if this difference is due to chance or real differences in groups.
If you don’t have enough power (a large sample size), however, then your post hoc analysis will not be accurate.
Conclusion
When it comes to ANOVAs, there are two main points of confusion. The first is that this approach can be used to investigate whether groups are different from each other using an F statistic, but it doesn’t tell us exactly what’s different. This is why we need post hoc analyses: they allow us to analyze our data more deeply and more accurately than simply looking at the F statistic alone.
The second point of confusion is that researchers use ANOVAs for many different types of data. In this blog post we looked at one particular type of experiment—a simple repeated measures design with two levels (e.g., three treatments) and three or four subjects per group—but there are many other types as well! For example, if we want to compare how two treatments affect people in a particular population (such as diabetes patients), then we might run an ANOVA across all individuals within that population instead of just those who take both pills together (i.e., mixed treatment arms) or only one pill (i..e., single-treatment arms).
Collepals.com Plagiarism Free Papers
Are you looking for custom essay writing service or even dissertation writing services? Just request for our write my paper service, and we'll match you with the best essay writer in your subject! With an exceptional team of professional academic experts in a wide range of subjects, we can guarantee you an unrivaled quality of custom-written papers.
Get ZERO PLAGIARISM, HUMAN WRITTEN ESSAYS
Why Hire Collepals.com writers to do your paper?
Quality- We are experienced and have access to ample research materials.
We write plagiarism Free Content
Confidential- We never share or sell your personal information to third parties.
Support-Chat with us today! We are always waiting to answer all your questions.