Tutoring Corpus

Download tutoring corpus database


The database contains 54 transcribed one-on-one tutoring sessions on computer science introductory data structures (transcripts table). The data structures in question are linked lists, stacks and binary search trees. The transcripts are annotated for the following tutoring moves / strategies (annotations table):

The first 6 tags (+FB, -FB, DPI, DDI, SI, PT) are turn level annotations; the last two (WOE and Analogy) are multiturn annotations, that are marked in the transcripts with Begin and End markers on the turns that begin and end the strategy, respectively.

Because of human protection requirements, we are unable to release audio and video (the video was limited to the hands of the student and the tutor, and the papers they had in front of them)

The interested reader may refer to the following papers for more details:

1. Di Eugenio, B., Fossati, D., Ohlsson, S., & Cosejo, D. (2009). Towards explaining effective tutorial dialogues. In Annual Meeting of the Cognitive Science Society (pp. 1430-1435).

2. Di Eugenio, B., Chen, L., Green, N., Fossati, D., & AlZoubi, O. (2013, July). Worked out examples in computer science tutoring. In International Conference on Artificial Intelligence in Education (pp. 852-855). Springer, Berlin, Heidelberg.

3. Alizadeh, M., Di Eugenio, B., Harsley, R., Green, N., Fossati, D., & AlZoubi, O. (2015, May). A Study of Analogy in Computer Science Tutorial Dialogues. In Proceedings of the 7th International Conference on Computer Supported Education-Volume 2 (pp. 232-237). SCITEPRESS-Science and Technology Publications, Lda.