Help Support Pure and Simple! Sign up for the Pure and Simple News
There are plenty of reasons to wait to have sex. Read More >
Abstinence Programs are effective. More >
See the Programs we have for you. View >
Talk to your kids about your opinions and values. Read More >
 

Pure & Simple Professionals

 

A Minority Report: Fundamental Concerns about the CDC Meta-analysis of
Group-based Interventions to Prevent Adolescent Pregnancy, HIV, and Other STIs

by Irene Ericksen, M.S. & Danielle Ruedt, M.P.H.,
Members of the CDC Community Guide
Adolescent Sex Behavior Coordination Team External Partners consultant panel

November 7, 2009*


I. Summary Statement

We are grateful to have been involved in the Centers for Disease Control and Prevention’s research study, “Group-based Interventions to Prevent Adolescent Pregnancy, HIV, and Other Sexually Transmitted Infections: A Systematic Review and Meta-analysis.” This was an important effort to summarize the outcome research on sex education in America. We acknowledge the careful work by the Community Guide research staff and their willingness to consider all viewpoints when making decisions about the research. What follows represents our minority opinion as members of this study’s External Partners consultant panel and does not represent the views of the CDC or the Adolescent Sex Behavior Coordination Team.


There are serious limitations to this meta-analysis study that cause us to take exception to the Recommendation Statements (http://www.thecommunityguide.org/hiv/index.html) issued by the CDC’s Task Force on Community Preventive Services about the study results. These limitations lead us to conclude that the statements about the general effectiveness of the comprehensive risk
reduction (CRR) strategy are not warranted by the data. They overstate the likelihood that any single CRR program will be effective at protecting the sexual health of adolescents, especially the school-based programs, which are the focus of the public policy debate about sex education and impact the future health of millions of adolescents across the country. The CDC recommendations also fail to acknowledge the evidence for the effectiveness of abstinence education (AE) programs at reducing teen sexual activity, and invite conclusions that CRR is a superior approach to AE, which is not supported by the evidence.


The Task Force has made public its Recommendation Statements without also making available to the public the full set of study findings upon which the recommendations are based—both supporting and otherwise. The reason given for this decision is that the data from the study has not yet been scientifically cleared by the CDC for release to the public. However, this policy prevents the public from scrutinizing the body of evidence underlying the CDC Task Force Recommendations in the same time frame in which the CDC recommendations will influence the decisions of policymakers and public health professionals. (Having the opportunity to examine this evidence is particularly important in the current climate of controversy and politicization that surrounds the public policy debate about sex education in America.)


Additionally, this policy prevents the inclusion in this minority report of most of the scientific data that supports and illustrates the concerns described herein. While the meta-analysis data has been made available to us as members of the Adolescent Sex Behavior Coordination Team, at this point the Recommendation Statements can only be addressed in a qualitative manner here,
without reporting any quantitative data that has not been released to the public.

Given the public’s inability to fully examine the scientific evidence for the Task Force Recommendations, this minority report offers evidence that there are alternative interpretations of the meta-analysis results which have scientific merit. These observations and assertions are offered in good faith, with the intent of promoting an open dialogue about the best interpretation of the results of this meta-analysis study.


II. Fundamental Concerns and Supporting Rationale


1. The conclusion of general CRR effectiveness is not supported by the totality of data.
The Recommendation Statement asserts that the comprehensive risk reduction (CRR)1 strategy is generally effective across  setting, population, and program type on most of the stated outcomes, which include sexual activity, frequency of sex, number of partners, condom use, and STIs. Yet high measures of inconsistency of results across studies on key outcomes indicate that a sizable percentage of CRR programs did not work on these outcomes, especially for school-based programs, which are the focus of the national policy debate about sex education.


a. The study suffers from a problem that is fundamental in meta-analysis by combining “apples and oranges.”2 The CRR category was created by collecting together everything that was not a study of “abstinence-only” sex education, resulting in a very heterogeneous category that exceeds the limits of good meta-analysis design. This category combines data from school classroom-based programs (a common delivery system for sex education that is fairly homogeneous with respect to setting, population, and pedagogy) with programs representing a wide variety of settings, populations, and pedagogies (e.g., STD clinics and youth shelters, youth in detention centers and parents in housing projects, individual service learning and multi-component youth development programs), illustrated in Table 1 (see Appendix). This problem undermines the validity of the entire study.


b. When these studies of diverse programs were combined in the same analysis, the resulting statistical inconsistency in the data was not adequately resolved. The Recommendation Statement does not report the important I2 measure of heterogeneity for the meta-analysis results. (When a meta-analysis combines the effects from many studies into one measure to derive an average impact, the I2 indicates how much inconsistency or variation there was across all of the effects that produced the average.) Yet heterogeneity for the majority of the significant CRR outcomes was so high that it would be considered “severe” by some experts,3 indicating that results reported at that level of aggregation are not interpretable. With such a level of internal inconsistency, the study recommendations should have indicated which of the diverse types of CRR programs in the study was effective and which was not, or reported their inability to do so, instead of concluding that there was sufficient evidence of across-the-board effectiveness (“across a range of populations and settings”). This misleads policy makers seeking to choose evidence-based programs.


c. The small size of the reported CRR effects on key outcomes was not discussed in the Task Force Recommendations, and raises the question: What degree of behavioral improvement should a typical sex education program produce in order for it to be considered an “effective” program?
The relative risk estimates reported in the Recommendation Statement4 indicate that the CRR programs increased frequency of teen condom use by an average of 12% and reduced sexual activity by an average of 12%. They averaged a 16% improvement across the key outcomes of sexual activity, number of partners, condom use, pregnancy, and STIs (excluding conflated outcome measures, see below). This small magnitude of improvement, combined with the high inconsistency of results across studies, further diminishes the likelihood that any single CRR program will be effective at protecting adolescents. (It should be noted that the AE effects were in a similar range.)


d. The Recommendation Statements do not report the lack of effectiveness for school-based CRR programs on key outcomes (condom use, pregnancy, sexually transmitted infections/STIs).


1. This is important since the school classroom is where most youth receive sex education and many of the programs that produced the reported general effects were not the school classroom type programs that most people think of as “sex education.”


2. Nonetheless, the Recommendation Statement that “The evidence supports a conclusion that CRR interventions are applicable across a range of populations and settings…[including] school and community settings” gives policy-makers the impression that the study found CRR programs implemented in schools were effective at achieving these important outcomes, which was not the case.

2. Key measures of CRR program effectiveness are inadequate or confusing.


a. Instead of reporting on consistent condom use (CCU), the study reports on a less protective measure, frequency of teen condom use (which combines studies that measured frequency of use, or use at first or last intercourse, with a few studies that measured consistent use). According to the CDC’s Condom Fact Sheet, “consistent and correct use of the male latex condom reduces the risk of sexually transmitted disease (STD) and human immunodeficiency virus (HIV) transmission. However, condom use cannot provide absolute protection against any STD. The most reliable ways to avoid transmission of STDs are to abstain from sexual activity, or to be in a long-term mutually monogamous relationship with an uninfected partner”   (http://www.cdc.gov/condomeffectiveness/brief.html). STD (synonymous with STI) transmission can occur in one sexual contact and some studies have found that nonconsistent use provided inadequate STD protection or resulted in higher rates of STDs.5 Rates of CCU by teens in the U.S. are low (28% of sexually active teen girls and 47% of boys)6 and this meta-analysis does not provide evidence that CRR programs have increased CCU and thereby increased actual protection.7

b. The study uses a conflated outcome called Use of Protection against Pregnancy and STIs, which is a combined measure of condom use and oral contraceptive use. It then reports a significant improvement on this outcome for CRR programs, implying that the programs were successful at affecting use of both condoms and oral contraceptives and that this was protective against pregnancy and STIs. This is misleading because:


1. It implies an effect on both condom use and oral contraceptive use, yet the study did not find significant effects for oral contraceptive use (when tested separately).4


2. Because oral contraceptives provide no protection at all against STIs, they should not be included in a measure of “protection against pregnancy and STIs”.


3. It is not known whether the condom/contraceptive use measured in this outcome actually provided “protection against pregnancy or STIs,” so to call it that is not accurate. (The study did not find that CRR programs generally reduced teen pregnancy.4)


For these reasons, Use of Protection against Pregnancy and STIs, is a misleading outcome; at the very least it should be renamed “Use of Condoms or Contraception;” to be most accurate the behaviors should not be combined in this measure.

c. The study reports effects on another conflated outcome called Unprotected Sex, defined as having sex without using a condom. A reduction in Unprotected Sex occurs when teens either abstain from sex or use a condom, thus this is a measure of whether sexual abstinence or condom use is occurring, without identifying the actual behavior. This can be misleading and confusing to policy-makers: combining both behaviors “pads” the numbers such that statistical significance can be obtained where it might not be obtained for both behaviors separately (in fact, this appears to have occurred for school-based CRR programs); it also gives CRR programs credit for effectiveness without identifying how that effect is being achieved. Finally, because studies of abstinence education have not typically measured Unprotected Sex as such, the CDC study does not give AE credit for reducing “unprotected sex” by increasing abstinence, although this is the effect of abstaining from sex.


d. The study allowed minimum follow-up times for condom/contraceptive/STI outcomes (one to three months) that were too short to demonstrate a lasting program effect, but required longer follow-up times (six months minimum) for abstinence outcomes. While not done with this intent, it had the effect of requiring most abstinence programs to meet a higher standard of effectiveness.

3. The meta-analysis evidence for the reported CRR effect on STIs is not of adequate quality to inform national policy about sex education. The effect appears to be dependent upon only two CRR programs (DiClemente, 2004 and Jemmott, 2005).8 These programs were both clinic-based, i.e., not occurring in school classrooms with school-based populations. In addition, evidence from several of the other studies in the analysis was of low quality.


4. The reduction in sexual activity showed by AE programs has been discounted because of a misplaced deference to randomized control trial (RCT) studies, some of which had important design problems.


The meta-analysis found a significant reduction in sexual activity by the AE studies.4 However, the Task Force chose to designate it as “insufficient evidence” due to“inconsistent results across studies,” in the RCT (randomized control trials—the preferred research design) vs. non-RCT (or quasi-experimental, an accepted research design) results for sexual activity. This decision is questionable for the following reasons:


a. Analyzing the results separately for the RCT and non-RCT studies assumes there were no other differences in research quality between these two sets of studies that might influence their outcomes. Yet there were other differences, which resulted in some of the RCT studies having weaker research designs (see b. below).


b. The RCT results were weighted heavily toward two studies by the same authors that have important design problems9 which likely caused them to inaccurately estimate a lack of abstinence effect. This raises questions about the deference given to their results by the Task Force.


c. Of the 6 RCT studies in the AE analysis, 3 had sizable positive effects.


d. The measure of inconsistent results across studies was higher for CRR programs on sexual activity than for AE programs.


e. The quasi-experimental results are based on seven well-designed studies:
they met the criteria for inclusion in the CDC meta-analysis, the majority of them were peer-reviewed and published, and most used statistical methods to refine the comparison group match and control for pretest differences.

Thus, the meta-analysis provides credible evidence that AE programs have delayed sexual activity, evidence that is stronger than the evidence for CCR effects on STIs (see #3, above) or for the unreported evidence (i.e., lack of evidence) for school-based CRR programs on condom use (see #1d, above). This suggests an inconsistent standard has been applied. A case can be made that the difference in the RCT vs. non-RCT results for AE is due to problems in several RCT studies that prevented the accurate measure of positive results, not the opposite. This would mean that their influence on the AE results should be discounted rather than allowed to negate the body of research.


5. The CDC meta-analysis evidence does not support an assumption that the CRR strategy is superior to the AE strategy.

a. The study invites an inappropriate comparison between CRR and AE. Given that school-based programs represented approximately 80% of the AE studies and only 40% of the CRR studies (see Table 1 in Appendix, including footnote), the appropriate comparison in the meta-analysis for relative effectiveness would be between school-based CRR and AE programs. As it is, end users of the meta-analysis Recommendation Statements will inevitably make comparisons between the published results for the two strategies. When they do, the comparison is between AE, which was mostly school-based, and CRR, which was mostly community-based.


b. A common rationale for preferring CRR to AE is the claim that CRR is effective at both preventing sexual activity and reducing the risks of sexual activity through promoting the use of condoms, while AE would only be expected to reduce sexual activity. However, the meta-analysis methodology was not able to test empirically whether the CRR strategy has been effective at achieving both of those outcomes within individual CRR programs. And if there is not evidence that the typical CRR program is effective at both outcomes then there is not evidence that it offers any real advantage over effective AE programs. While it should be acknowledged that this kind of evidence is hard to come by, it should also be recognized that lack of it constitutes a lack of evidence, and assertions that CRR achieves these dual outcomes should not be characterized as “evidence-based” where such evidence is lacking.


c. When the meta-analysis did a direct test of differences in effectiveness between the AE and CRR strategies, it failed to demonstrate a difference for any of the outcomes except one.


In conclusion, we offer the above concerns and supporting rationale to explain our reasons for not supporting the Task Force Recommendations. Another version of this report, containing detailed statistical data from the meta-analysis as evidence for the above concerns, will be issued as soon as the CDC releases the meta-analysis data to the public. We would respectfully suggest that it is in the best interest of science and public policy for the CDC to release the meta-analysis data at the same time the Recommendation Statements are issued, so that the public can examine the full body of evidence upon which they are based—both supporting and otherwise. Lacking this, the above analysis has been presented to offer an alternative point of view on the interpretation of the study results.

* This is an amended version of the November 6, 2009 Minority Report. Some wording has been altered slightly to correlate better with the final wording of the CDC Task Force Recommendation Statements posted on November 6, 2009 at http://www.thecommunityguide.org/hiv/index.html.


About the authors:


Irene Ericksen, M.S., is a research analyst with The Institute for Research & Evaluation in Salt Lake City, a nonprofit research organization that has gained national recognition for its work evaluating sex education programs, particularly abstinence education. IRE has provided expert testimony to state legislative bodies, the U.S. Senate, the U.S. House of Representatives (April, 2008), and the White House (June, 2009). Ms. Ericksen is an author of peer-reviewed published research articles reporting the results of abstinence education evaluation studies.


Danielle Ruedt, MPH, is the Public Health Programs Coordinator for the Governor’s Office of Children and Families, State of Georgia.


Appendix


Table 1.

Type of Setting/Population(a)
CRR studies
AE studies
A. School/Classroom setting, general school population
24 total (40%)
15 total (80%)
     
B. Community-based setting, self-selected population
36 total (60%)
4 total (20%)
           Health or STD clinic setting & population
7
           Community center & population
6
3
           Youth shelter & population
5
           Multi-component youth development program
4
           Parent training program (community-based)
4
1
           Low-income housing project setting & population
3
           Youth in juvenile detention/jail
3
           Youth in residential drug treatment program
2
           Service learning by individuals at community sites
2

(a). Note: This table was developed by the authors through examination of the list of studies included in the meta-analysis; it was not produced by the Community Guide research staff.


End Notes


1. The term “comprehensive,” when applied to sex education, is typically meant to indicate that interventions promote both sexual abstinence and sexual risk reduction (including the use of condoms and oral contraceptives). However, in this review of sex education studies, the comprehensive risk reduction (CRR) category includes studies of sex education programs that “solely promote sexual risk reduction,” i.e., that do not promote abstinence. This may be a source of confusion to policy-makers, who will miss this deviation from the accepted definition and assume that all of the programs in this category promoted teen abstinence to some degree.


2. See: Sharpe D. (1997). Of apples and oranges, file drawers and garbage: Why validity issues in metaanalysis will not go away. Clinical Psychology Review, 17, (8): 881–901, and Higgins JPT & Thompson SG (2002). Quantifying heterogeneity in a meta-analysis. Statistics in Medicine, 21: 1539–1558.


3. Higgins and Thompson, 2002, give a magnititude for I2 that indicates “severe heterogeneity” and conclude that I2 values in this range should “induce considerable caution” regarding the results of a meta-analysis, p.1553. Thompson, 1994, and Moayyedi, 2004, argue that this level of heterogeneity in the effect renders the result meaningless at that level of aggregation; that the source of heterogeneity should be identified by examining subgroups and the results should only be interpreted at the subgroup level. (See Moayyedi P. (2004). Meta-analysis: Can we mix apples and oranges? American Journal of Gastroenterology, 2004: 2297–2301; Thompson SG. (1994). Why sources of heterogeneity in meta-analysis should be investigated. British Medical Journal, 309: 1351–1355.)


4. These facts are presented in the CDC Task Force Recommendation Statement for Comprehensive Risk Reduction, available at: http://www.thecommunityguide.org/hiv/index.html.


5. See Crosby RA, DiClemente RJ, Wingood GM, Lang D, Harrington KF. (2003). Value of consistent condom use: A study of sexually transmitted disease prevention among African American adolescent females. American Journal of Public Health; 93: 901–2.; Shlay JC, McCung MW, Patnaik JL et al. (2004). Comparison of sexually transmitted disease prevalence by reported level of condom use among patients attending an urban sexually transmitted disease clinic. Sex Transm Dis; 31(3):154–60.; Ahmed S, Lutalo T, Wawer M et al. (2001). HIV incidence and sexually transmitted disease prevalence associated with condom use: a population study in Rakai, Uganda. AIDS; 15(16):2171–9.; Grinsztejn B, Veloso V, Levi J, Velasque L, Luz P et al. (2009). Factors associated with increased prevalence of human papillomavirus infection in a cohort of HIV-infected Brazilian women. International Journal of Infectious Diseases, 13, 72–80.; Martin ET, Krantz E, Gottlieb SL, Magaret AS, Langenberg A, et al. (2009). A Pooled Analysis of the Effect of Condoms in Preventing HSV-2 Acquisition, ARCH INTERN MED/Vol 169 (13):1233–1240; Centers for
Disease Control and Prevention. (2003). Fact Sheet for Public Health Personnel—Male Latex Condoms and Sexually Transmitted Diseases. National Center for HIV, STD, and TB Prevention. Atlanta, GA: U.S. Department of Health and Human Services (paragraph 4). Retrieved October 31, 2003 from www.cdc.gov/nchstp/od/latex.htm. According to the CDC, “inconsistent use, e.g., failure to use condoms with every act of intercourse, can lead to STD transmission because transmission can occur with a single
act of intercourse” (CDC, 2003). A study in the journal AIDS (Ahmed et al., 2001) found, “Irregular condom use was not protective against HIV or STD and was associated with increased gonorrhea/chlamydia risk.” A Denver study (Shlay et al., 2004) reported that “when all condom users were compared with non-users (N=126,220), there was limited evidence of protection against specific STD.” But when consistent vs. inconsistent users were compared, the consistent users had significantly lower infection
rates.


6. Franzetta, Kerry. et al. 2006. “Trends and Recent Estimates: Contraceptive Use Among U.S. Teens.” Child Trends Research Brief #2006-04.


7. Some will argue that these lesser measures may be indicators of behavior that might reduce the risk of STDs. But given that some studies have found non-consistent condom use was not protective (see note #5 above), we assert that to be recommended by such a major force in public health information dissemination as the CDC, programs should not be called effective when they have not produced evidence of success at achieving this yet partially protective outcome of consistent condom use.


8. DiClemente RJ, Wingood GM, Harrington KF et al. Efficacy of an HIV prevention intervention for African American adolescent girls: a randomized controlled trial. JAMA 2004;292(2):171–9; Jemmott JB, III, Jemmott LS, Braverman PK, Fong GT. HIV/STD risk reduction interventions for African American and Latino adolescent girls at an adolescent medicine clinic: a randomized controlled trial. Arch Pediatr Adolesc Med 2005;159(5):440–9.


9. Design Problems of RCT Abstinence Studies


a. The first of these studies was a long-term evaluation of 4 different abstinence programs, in which no positive effects were found. However, when reviewed critically, important design weaknesses become evident. Each of the 4 sub-studies randomized the assignment to treatment and control groups at the individual level within schools. The programs were then implemented within these schools for substantial periods of time, ranging from 1 to 4 years. This design sets up a classic scenario in which contamination or cross-fertilization of the treatment effect between the treatment and control groups can occur within the school (the “lunch-room-to-locker-room effect”). The result is a muting of the treatment effect over time, in that the two groups influence each others’ attitudes and behaviors in the close quarters of the school environment. This is especially likely when the treatment occurs over an extended time period. This, combined with the unusually young age of the program participants (8 to 13 years old) and the unusually long follow-up period used in the study (4 to 6 years after baseline) constitutes good reason to question this study as a valid test of abstinence education. (See Trenholm C, Devaney B, Fortson K, Quay L, Wheeler J, Clark M. (2007). Impacts of Four Title V, Section 510 Abstinence Education Programs. Princeton, NJ: Mathematica Policy Research, Inc. April 2007.)


b. The problem with the second study, Clark, Trenholm, 2007, has to do with its inclusion in the CDC meta-analysis at all. The study was included ostensibly because it was considered an evaluation of an abstinence curriculum. However, the purpose of this study was to detect whether the Heritage Keepers“Life Skills” curriculum had any incremental impact on teen sexual behavior over and above the Heritage Keepers Abstinence Curriculum. The Life Skills curriculum, which contained minimal abstinence content, was a voluntary after-school program intended as a supplement to the main Abstinence Curriculum, which was a mandatory class taught during the school day. In this study, the treatment group received the Abstinence Curriculum plus the Life Skills Component, and the control group received only the Abstinence Curriculum. Thus, this study did not test the effect of abstinence
education versus the absence of abstinence education, but rather it tested the effect of a voluntary life skills program (i.e., a light abstinence dose) over and above the effect of a mandatory abstinence program (i.e., a heavy dose of the abstinence message). In short, it was not an evaluation of the effectiveness of an abstinence education program, and as such, should not have been included in the group of abstinence studies in the meta-analysis. It should be noted that the Heritage Keepers Abstinence Curriculum was evaluated against a true counterfactual (the absence of abstinence education) and demonstrated a significant and sizable reduction in teen sexual initiation (OR=.54, p<.001) one year after the program. (See Clark MA, Trenholm, C,Devaney, B, Wheeler, J, Quay, L. (2007). Impacts of the Heritage Keepers Life Skills Education Component. Final Report. Princeton, NJ: Mathematica Policy Research, Inc. and Weed SE, Ericksen IH, Birch PJ. (2005). An evaluation of the Heritage Keepers Abstinence Education program. In Golden A (Ed.) Evaluating Abstinence Education Programs: Improving Implementation and Assessing Impact. Washington DC: Office of Population Affairs and the Administration for Children and Families, Department of Health & Human Services 2005:88–103.).


c. The above concerns constitute good reason to question these studies as a valid test of abstinence education and, given their heavy weight in the analysis, are good reason to question the RCT versus non-RCT results for the abstinence effects in the meta-analysis.

 

® 2010 Pure & Simple Health Education, Inc. - Abstinence Education - 3301 W. 13th Street - Wichita, Kansas 67203 - Toll free: (866) 688-0840 - (316) 688-0840 - Contact Us - Site by: The Multimedia Room

This site is supported through Discretionary Grant No. 90AE0305, Community Based Abstinence Education Program, CAT No. 93010, Department of Health and Human Services, Administration for Children & Families, Administration on Children, Youth & Families

Pure and Simple Home Page Pure an Simple Events Abstienence Education News Contact Pure and Simple Send this page to a Friend!Follow us on Twitter!Join our Facebook Fan PageSign Up for our Newsletter!