Abstract
Understanding the temporal organization of writing is key to studying writing processes. Existing methods to segment writing into phases often rely on arbitrary rules, extensive manual annotation, or focus on numerous transitions. This study aimed to develop an automated segmentation method to detect distinctive transition in the dominant writing process, particularly the transition from first draft to revision. For this, keystroke data (source-based L1 writing (N = 80) and text simplification in L2 (N = 88)) were manually annotated. The BEAST algorithm was applied for Bayesian change point detection, based on five characteristics derived from the annotation criteria: (1) percentage of the final text written so far, (2) distance between typed and remaining characters, (3) relative cursor position, (4) source use, and (5) pause timings. The first three features proved most effective in identifying change points. A rule-based approach was further applied to select one final change point, which resulted in mediocre accuracy ranging from 31% exact agreement to 49% agreement within 60 seconds. To conclude, the BEAST algorithm is useful in detecting a variety of change points in writing processes, yet connecting them to meaningful phases is still quite complex.
| Original language | English |
|---|---|
| Pages (from-to) | 339-369 |
| Number of pages | 31 |
| Journal | Journal of Writing Research |
| Volume | 17 |
| Issue number | 2 |
| DOIs | |
| Publication status | Published - 7 Oct 2025 |
Bibliographical note
Publisher Copyright:© 2025 University of Antwerp. All rights reserved.
Funding
The data collection for this study has received funding from the European Union’s Horizon 2020 research and innovation programme (Marie Sklodowska-Curie grant agreement No 888918), and from a NWO National Grant (Dutch Research Council, Grant 405–14-301). 2020 research and innovation programme (Marie Sklodowska-Curie grant agreement No 888918), and from a NWO National Grant (Dutch Research Council, Grant 405–14-301).
Keywords
- change point detection
- drafting
- Keystroke logging
- revision phase
- writing process