EPSY 8224: Performance Assessment Design & Analysis


AERA, APA, NCME (2014). Standards for educational and psychological testing. Washington DC: American Educational Research Association.

Bandalos, D.L. (2018). Measurement theory and applications in the social sciences. New York, NY:Guilford.

January 28

Shavelson, R.J., Baxter, G.P., & Pine, J. (1992). Performance assessments: Political rhetoric and measurement reality. Educational Researcher, 21(4), 22-27.

Kane, M. B., & Mitchell, R. (1996). Implementing performance assessment: Promises, problems, and challenges. Mahwah, NJ: Lawrence Erlbaum Associates. [Chapters 1-2]

February 4

Darling-Hammond, L., & Adamson, F. (2010). Beyond basic skills: The role of performance assessment in achieving 21st century standards of learning. Stanford, CA: Stanford University, Stanford Center for Opportunity Policy in Education.

Arter, J. (1999). Teaching about performance assessment. Educational Measurement: Issues and Practice, 18(2), 30-44.

Herman, J.L., Aschbacher, P.R., & Winters, L. (1992). A practical guide to alternative assessment. Alexandria, VA: ASCD. [Chapter 3]

February 11

Kane & Mitchell (1996). [Chapter 4]

Stiggens, R.J. (1987). Designs and development of performance assessments. Educational Measurement: Issues and Practice, 6(3), 33-42.

Herman, Aschbacher, & Winters (1992). [Chapter 4]

February 18

Lane, S. (2010). Performance assessment: The state of the art. Stanford, CA: Stanford University, Stanford Center for Opportunity Policy in Education. [pp. 3-24]

Phillips, G.W. (1996). Technical issues in large-scale performance assessment. Washington, DC: National Center for Education Statistics. [Chapter 1]

Brookhart, S. (1993). Assessing student achievement with term papers and written reports. Educational Measurement: Issues & Practice, 12(1), 40-47.

February 25

Lane (2010). [pp. 25-41]

Herman, Aschbacher, & Winters (1992). [Chapters 5-6]

March 4

Wills, K.V., & Rice, R. (Eds.). (2013). ePortfolio performance upport systems: Constructing, presenting, and constructing portfolios. Anderson, SC: Parlor Press.

Arter, J.A., & Spandel, V (1992). Using portfolios of student work in instruction and assessment. Educational Measurement: Issues & Practice, 11(1), 36-44.

Explore EdTPA and other portfolio readings online.

March 11

Kane & Mitchell (1996). [Chapters 6-8]

Phillips (1996). [Chapter 5]

March 25

American College Testing Program (1997). Reliability issues with performance assessment: A collection of papers. Iowa City, IA: Author.

Kane, M.B., Crooks, T., & Cohen, A. (1999). Validating measures of performance. Educational Measurement: Issues and Practice, 18(2), 5-17.

Lane (2010). [pp. 42-58]

Herman, Aschbacher, & Winters (1992). [Chapter 6]

Phillips (1996). [Chapter 2]

April 15

Kane & Mitchell (1996). [Chapter 10]

Recommended Readings

Standards for teacher competence in educational assessment Resource and state perspectives on PA
Implementing PA in the classroom Early childhood PA - Work Sampling
PA in Chicago Schools Validity of PA for English Language Learners
PA models and tools for complex tasks Fairness in performance appraisals
Career development portfolios Portfolios as Opportunities for Teacher Learning
Promises of PA PA and the issue of bias
edTPA Meaningful PA
Development of PA in Science (Solano-Flores, Shavelson) Toward a Science of PA technology (Shavelson et al.)
  Performance management cycle
Brennan, R.L. (2011). Generalizability theory and classical test theory. Applied Measurement in Education, 24, 1-21. Chen, E. et al. (2007). Examining the generalizability of direct writing assessment tasks. CSE Technical Report 718.
Briesch, A.M., Chafouleas, S.M., & Johnson, A.J. (2016). Use of generalizability theory within K-12 school-based assessment: A critical review and analysis of the empirical literature. Applied Measurement in Education, 29(2), 83-107. Webb, N.M., & Shavelson, R.J. (2005). Generalizability theory: Overview. In B.S. Everitt & D.C. Howell (Eds.), Encyclopedia of statistics in behavioral science (pp. 717-719). Chichester, United Kingdom: Wiley & Sons.
Rater Considerations  
Harik, P. et al. (2009). An examination of rater drift within a generalizability theory framework. Journal of Educational Measurement, 46(1), 43-58. Zhang, M. (2013). Contrasting automated and human scoring of essays. R & D Connections, 23.
NBPTS Bias Reduction Training