Résumés
Résumé
Cet article vise à expliciter la logique des tests d’hypothèses en recherche. Plusieurs études ont démontré une confusion quant à la signification des résultats de tests d’hypothèses. Cette confusion proviendrait en partie des points divergents entre les approches de Fisher, de Neyman et Pearson ainsi que de Bayes. L’article précise l’information fournie par le coefficient de signification, la taille de l’effet et la puissance et propose différents logiciels permettant l’analyse de la puissance et de la taille de l’effet.
Mots clés:
- tests d’hypothèses,
- signification statistique,
- taille de l’effet,
- puissance,
- inférence
Summary
This paper aims to explain the rationale of hypothesis testing in research. Many studies have shown some confusion as to the significance of hypothesis testing results. This confusion could partially come from discrepancies between the approaches of Fisher, Neyman and Pearson as well as Bayes. The article clarifies the information provided by the significance coefficient, effect size and power, and proposes various software to perform power and effect size analysis.
Key words:
- hypothesis testing,
- statistical significance,
- effect size,
- power,
- inference
Resumen
Este artículo tiene por objetivo explicitar la lógica de las pruebas de hipótesis en investigación. Varios estudios han demostrado una confusión en cuanto al significado de los resultados de las pruebas de hipótesis. Esta confusión tendría en parte por origen los puntos divergentes entre los enfoques de Fisher, de Neyman et Pearson así como de Bayes. El artículo precisa la información proporcionada por el coeficiente de significado, el tamaño del efecto y la potencia y propone distintos programas que permiten el análisis de la potencia y del tamaño del efecto.
Palabras claves:
- pruebas de hipótesis,
- significado estadístico,
- tamaño del efecto,
- potencia,
- inferencia
Parties annexes
Références
- American Psychological Association (2001). Publication manual of the American Psychological Association (5e édition). Washington, District of Columbia : American Psychological Association.
- Baguley, T. (2004). Understanding statistical power in the context of applied research. Applied ergonomics, 35(2), 73-80.
- Behavioral and Brain Sciences, 21(2), 1998.
- Bezeau, S. et Graves, R. (2001). Statistical power and effect sizes of clinical neuropsychology research. Journal of clinical and experimental neuropsychology, 23(3), 399-406.
- Biskin, B. H. (1998). Comment on significance testing. Measurement and evaluation in counselling and development, 31(1), 58-62.
- Blais, J.-G. (1991). Statistique, méthodes quantitatives et analyse des données. Repères, essais en éducation, (13), 63-90.
- Carver, R. P. (1978). The case against statistical significance testing. Harvard educational review, 48(3), 378-399.
- Cashen, L. H. et Geiger, S. W. (2004). Statistical power and the Testing of null hypotheses : a review of contemporary management research and recommendations for future studies. Organizational research methods, 7(2), 151-167.
- Chow, S. L. (1996). Statistical significance : rationale, validity and utility. London, United Kingdom : Sage.
- Clark-Carter, D. (1997). The account taken of statistical power in research. British journal of psychology, 88(1), 71-83.
- Cohen, J. (1994). The Earth is round (p < .05). American psychologist, 49(12), 997-1003.
- Cohen, J. (1988). Statistical power analysis for the Behavioral sciences (2e édition). Hillsdale, New Jersey : Lawrence Erlbaum Associates.
- Cohen, J. (1962). The statistical power of abnormal-social psychological research : a review. Journal of abnormal and social psychology, 65(3), 145-153.
- Conseil des ministres de l’Éducation du Canada (2007). Le Programme pancanadien d’évaluation (PPCE) et le Programme d’indicateurs du rendement scolaire (PIRS) [En ligne]. Disponible le 25 juin 2007 : http://www.cmec.ca/pcap/indexf.stm
- Faul, F., Erdfelder, E., Lang, A.-G. et Buchner, A. (2007). G*Power 3 : a flexible statistical power analysis program for the social, behavioural, and biomedical sciences. Behavior research methods, 39(2), 175-191.
- Fern, E. F. et Monroe, K. B. (1996). Effect-size estimates : issues and problems in interpretation. Journal of consumer research, 23(2), 89-105.
- Foucart, T. (2001). L’interprétation des résultats statistiques. Mathématiques et sciences humaines, 39(153), 21-28.
- Gigerenzer, G. (1993). The Superego, the Ego, and the Id in statistical reasoning. Dans G. Keren et C. Lewis (Dir.) : A handbook for data analysis in the Behavioral sciences : methodological issues. Hillsdale, New Jersey : Erlbaum.
- Giguère, G., Hélie, S. et Cousineau, D. (2004). Manifeste pour le retour des sciences en psychologie. Revue québécoise de psychologie, 25(3), 117-130.
- Gorman, B. S., Primavera, L. H. et Allison, D. B. (1995). POWPAL : a program for estimating effect sizes, statistical power, and sample sizes. Educational and psychological measurement, 55(5), 773-776.
- Hallahan, M. et Rosenthal, R. (1996). Statistical power : concepts, procedures, and applications. Behaviour research and therapy, 34(5/6), 489-499.
- Jennions, M. D. et Møller, A. P. (2003). A survey of the statistical power of research in behavioral ecology and animal behavior. Behavioral ecology, 14(3), 438-445.
- Jones, L. V. et Tukey, J. W. (2000). A sensible formulation of the significance test. Psychological Methods, 5(4), 411-414.
- Kline, R. B. (2004). Beyond Significance Testing : reforming data analysis methods in behavioral research. Washington, District of Columbia : American Psychological Association.
- Kosciulek, J. F. et Szymanski, E. M. (1993). Statistical power analysis of rehabilitation counselling research. Rehabilitation counselling bulletin, 36(4), 212-219.
- Lecoutre, B., Poitevineau, J. et Lecoutre, M.-P. (2005). Une raison pour ne pas abandonner les tests de signification de l’hypothèse nulle. Modulab, 33, 243-248.
- Lecoutre, M.-P. (1982). Comportements des chercheurs dans des situations conflictuelles d’analyse de données expérimentales. Psychologie française, 27(1), 1-8.
- Lecoutre, M.-P. et Poitevineau, J. (2000). Aller au-delà des tests de signification usuels : vers de nouvelles normes de publication. L’année psychologique, 100(4), 683-713.
- Loftus, G. R. (1996). Psychology will be a much better science when we change the way we analyse data. Current Directions in Psychological Science, 5(6), 161-171.
- Maddock, J. E. et Rossi, J. S. (2001). Statistical power of articles published in three health psychology-related journals. Health Psychology, 20(1), 76-78.
- Maxwell, S. E. (2004). The persistence of underpowered studies in psychological research : causes, consequences, and remedies. Psychological methods, 9(2), 147-163.
- Meehl, P. E. (1978). Theoretical risk and tabular asterisks : Sir Karl, Sir Ronald and the slow process of soft psychology. Journal of consulting and clinical psychology, 46(4), 806-834.
- Michel, R., Ollivier-Gay, L., Spiegel, A. et Boutin, J.-P. (2002). Les tests statistiques : Intérêt, principe et interprétations. Médecine tropicale, 62(5), 561-563.
- Millis, S. R. (2003). Statistical practices : the seven deadly sins. Child Neuropsychology, 9(3), 221-233.
- Mone, M. A., Mueller, G. C. et Mauland, W. (1996). The perceptions and usage of Statistical Power in applied psychology and management research. Personnel psychology, 49(1), 103-120.
- Morrison, D. E. et Henkel, R. E. (1970). The significance test controversy. Chicago, Illinois : Aldine.
- Nakagawa, S. (2004). A Farewell to Bonferroni : the problems of Low Statistical Power and Publication Bias. Behavioral ecology, 15(6), 1044-1045.
- Paul, K. M. et Plucker, J. A. (2004). Two steps forward, one step back : effect size reporting in gifted education research from 1995-2000. Roeper review, 26(2), 68-72.
- Pittenger, D. J. (2001). Power calculator : a collection of interactive programs. Educational and psychological measurement, 61(2), 889-894.
- Poitevineau, J. (2004). L’usage des tests statistiques par les chercheurs en psychologie : aspects normatif, descriptif et prescriptif. Mathématiques et sciences humaines, 42(3), 5-25.
- Rosenthal, R., Rosnow, R. L. et Rubin, D. B. (2000). Contrasts and effect sizes in behavioral research. A correlational approach. Cambridge, United Kingdom : Cambridge University Press.
- Rouanet, H. (1991). Les pratiques statisticiennes en question. Dans H. Rouanet, M.-P. Lecoutre, M.-C. Bert, B. Lecoutre, J.-M. Bernard et B. Leroux (Dir.) : L’inférence statistique dans la démarche du chercheur. Berne, Suisse : Peter Lang.
- Sawilowsky, S. S. (2003). Deconstructing arguments from the case against Hypothesis Testing. Journal of modern applied statistical methods, 2(2), 467-474.
- Shrout, P. E. (1997). Should significance tests be banned ? Introduction to a special section exploring the Pros and Cons. Psychological science, 8(1), 1-2.
- Thomas, L. et Juanes, F. (1996). The importance of statistical power analysis : an example from animal behaviour. Animal behaviour, 52(4), 856-859.
- Thomas, L. et Krebs, C. J. (1997). A review of statistical power analysis software. Bulletin of the ecological society of America, 78(2), 126-139.
- Thompson, B. (1989). Statistical significance, result importance, and result generalizability : three noteworthy but somewhat different issues. Measurement and evaluation in counselling and development, 22(1), 2-6.
- Vacha-Haase, T. et Nilsson, J. E. (1998). Statistical significance reporting : current trends and uses in MECD. Measurement and evaluation in counselling and development, 31(1), 46-57.
- Vacha-Haase, T. et Thompson, B. (1998). Further comments on statistical significance tests. Measurement and evaluation in counselling and development, 31(1), 63-67.
- Wilkinson, L. and the Task Force on Statistical Inference (1999). Statistical methods in psychology journals : guidelines and explanations. American psychologist, 54(8), 594-604.
- Williams, R. H., Zimmerman, D. W. et Zumbo, B. D. (1995). Impact of measurement error on statistical power : review of an old paradox. Journal of experimental education, 63(4), 363-370.