Here are two studies on Criterion, though caution is warranted because both were supported by the publisher. Nevertheless, they found good correlations between human holistic scoring and Criterion scoring, but not for individual mechanical errors.

Attali, Y. (2004, April). Exploring the feedback and revision features of Criterion. Paper presented at the annual meeting of the National Council on Measurement in Education, San Diego, CA.  Retrieved July 7, 2007, from

Attali, Y., & Burstein, J. (2006). Automated essay scoring with e-rater® v.2. Journal of Technology, Learning, and Assessment, 4(3). Retrieved July 7, 2007, from

Other studies found equally high correlations for Intelligent Essay Assessor(tm) and Intellimetric(tm):

Foltz, P., Laham, D., & Landauer, T. K. (1999). The Intelligent Essay Assessor: Applications to educational technology. Interactive Multimedia Electronic Journal of Computer-Enhanced Learning, 1(2). Retrieved July 11, 2007, from

Rudner, L. M., Garcia, V., & Welch, C. (2006). An evaluation of Intellimetric(tm) essay scoring system. Journal of Technology, Learning, and Assessment, 4(4). Retrieved July 47, 2007, from

You can read more about this in my "Technology Integration" chapter in the second edition of the Handbook of College Reading and Study Strategy Research:

Flippo, R. F., & Caverly, D. C. (2009). Handbook of college reading and study strategy research (2nd ed.). New York: Routledge. Retrieved October 20, 2008, from

Dave Caverly

