A Process-oriented Dataset of Revisions during Writing

Rianne Conijn, Emily Dux Speltz, Menno Van Zaanen, Luuk Van Waes, Evgeny Chukharev-Hudilainen

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › Academic › peer-review

2 Citations (Scopus)


Revision plays a major role in writing and in the analysis of writing processes. Revisions can be analyzed using a product-oriented approach (focusing on the finished product, i.e., the text that has been produced) or a process-oriented approach (focusing on the process that the writer followed to generate this product). Although several language resources exist for the product-oriented approach to revisions, hardly any resources are available yet for an in-depth analysis of the revision process. Therefore, we provide an extensive dataset on revisions made during writing (accessible via https://hdl.handle.net/10411/VBDYGX). This dataset is based on keystroke data and eye-tracking data of 65 students from a variety of backgrounds (undergraduate and graduate students with English as a first or second language) performing a variety of tasks (argumentative text and academic abstract). In total, 7,120 revisions were identified in the dataset. For each revision, 18 features have been manually annotated and 31 features have been automatically extracted. As a case study, we show two potential use cases of the dataset. In addition, future uses of the dataset are described.
Original language: English
Title of host publication: LREC 2020 - 12th International Conference on Language Resources and Evaluation, Conference Proceedings
Editors: Nicoletta Calzolari, Frederic Bechet, Philippe Blache, Khalid Choukri, Christopher Cieri, Thierry Declerck, Sara Goggi, Hitoshi Isahara, Bente Maegaard, Joseph Mariani, Helene Mazo, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Place of publication: Marseille, France
Publisher: European Language Resources Association (ELRA)
Number of pages: 6
ISBN (electronic): 9791095546344
Publication status: Published - 2020
Externally published: Yes
