A Data-driven Approach for the Identification of Features for Automated Feedback on Academic Essays

Mohsin Abbas; Peter van Rosmalen; Marco Kalz

doi:10.1109/TLT.2023.3320877

Mohsin Abbas^*, Peter van Rosmalen^*, Marco Kalz^*

^*Corresponding author for this work

Research output: Contribution to journal › Article › Academic › peer-review

Abstract

For predicting and improving the quality of essays, text analytic metrics (surface, syntactic, morphological and semantic features) can be used to provide formative feedback to the students in higher education. In this study, the goal was to identify a sufficient number of features that exhibit a fair proxy of the scores given by the human raters via a data-driven approach. Using an existing corpus and a text analysis tool for the Dutch language, a large number of features were extracted. Artificial neural networks, Levenberg Marquardt algorithm and backward elimination were used to reduce the number of features automatically. Irrelevant features were eliminated based on the inter-rater agreement between predicted and human scores calculated using Cohen's Kappa ($\kappa$). The number of features in this study was reduced from 457 to 28 and grouped into different categories. The results reported in this paper are an improvement over a similar previous study. Firstly, the inter-rater reliability between the predicted scores and human raters was increased by tweaking the corpus for overfitting for average scores. The resulting maximum value of $\kappa$ showed substantial agreement compared to moderate inter-rater reliability in the prior study. Secondly, instead of using a dedicated training and test set, the training and testing phases in the new experiments were performed using k-fold cross validation on the corpus of texts. The approach presented in this research paper is the first step towards our ultimate goal of providing meaningful formative feedback to the students for enhancing their writing skills and capabilities.
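The selection criterion described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes unweighted Cohen's kappa (the abstract does not say whether a weighted variant was used), it labels agreement with the commonly cited Landis and Koch bands, which match the "moderate" and "substantial" wording above, and it uses a hypothetical `score_fn` callback to stand in for "train the neural network under k-fold cross validation on these features and return kappa against the human scores". The function names are illustrative only.

```python
from collections import Counter
from typing import Callable, List, Sequence, Tuple


def cohen_kappa(rater_a: Sequence[int], rater_b: Sequence[int]) -> float:
    """Unweighted Cohen's kappa for two equal-length label sequences."""
    n = len(rater_a)
    if n == 0 or n != len(rater_b):
        raise ValueError("need two non-empty sequences of equal length")
    # Observed agreement: share of items with identical labels.
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    # Chance agreement: product of the raters' marginal label frequencies.
    freq_a, freq_b = Counter(rater_a), Counter(rater_b)
    expected = sum(freq_a[c] * freq_b[c] for c in freq_a) / (n * n)
    if expected == 1.0:
        return 1.0  # both raters used one identical label throughout
    return (observed - expected) / (1.0 - expected)


def agreement_label(kappa: float) -> str:
    """Landis and Koch style label (a convention, assumed here)."""
    if kappa < 0:
        return "poor"
    bands = [(0.20, "slight"), (0.40, "fair"),
             (0.60, "moderate"), (0.80, "substantial")]
    for upper, label in bands:
        if kappa <= upper:
            return label
    return "almost perfect"


def backward_elimination(
    features: Sequence[str],
    score_fn: Callable[[List[str]], float],  # hypothetical: returns kappa
    min_features: int = 1,
) -> Tuple[List[str], float]:
    """Greedily drop one feature at a time while the score does not fall."""
    selected = list(features)
    best = score_fn(selected)
    while len(selected) > min_features:
        # Score every subset obtained by removing exactly one feature.
        trials = [(score_fn([f for f in selected if f != drop]), drop)
                  for drop in selected]
        trial_score, drop = max(trials, key=lambda t: t[0])
        if trial_score < best:
            break  # every removal hurts agreement; stop here
        selected.remove(drop)
        best = trial_score
    return selected, best
```

In practice `score_fn` would be the expensive step, since each call retrains and evaluates a model on every fold; the greedy loop above only shows how kappa can drive the decision to discard a feature.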

Original language	English
Pages (from-to)	914-925
Number of pages	12
Journal	IEEE Transactions on Learning Technologies
Volume	16
Issue number	6
DOIs	https://doi.org/10.1109/TLT.2023.3320877
Publication status	Published - 29 Sept 2023

Keywords

Artificial Neural Networks
Backward Elimination
Dimensionality reduction
Feature extraction
Feature reduction
Feature selection
k-fold Cross Validation
Levenberg Marquardt
Measurement
Natural Language Processing
Semantics
Surface morphology
Syntactics
Training
Writing

Access to Document

10.1109/TLT.2023.3320877

Full Text: Final published version, 418 KB. Licence: Taverne

Cite this

@article{26e2362349424e2888cbde9d86c1594e,

title = "A Data-driven Approach for the Identification of Features for Automated Feedback on Academic Essays",

abstract = "For predicting and improving the quality of essays, text analytic metrics (surface, syntactic, morphological and semantic features) can be used to provide formative feedback to the students in higher education. In this study, the goal was to identify a sufficient number of features that exhibit a fair proxy of the scores given by the human raters via a data-driven approach. Using an existing corpus and a text analysis tool for the Dutch language, a large number of features were extracted. Artificial neural networks, Levenberg Marquardt algorithm and backward elimination were used to reduce the number of features automatically. Irrelevant features were eliminated based on the inter-rater agreement between predicted and human scores calculated using Cohen's Kappa ($\kappa$). The number of features in this study was reduced from 457 to 28 and grouped into different categories. The results reported in this paper are an improvement over a similar previous study. Firstly, the inter-rater reliability between the predicted scores and human raters was increased by tweaking the corpus for overfitting for average scores. The resulting maximum value of $\kappa$ showed substantial agreement compared to moderate inter-rater reliability in the prior study. Secondly, instead of using a dedicated training and test set, the training and testing phases in the new experiments were performed using k-fold cross validation on the corpus of texts. The approach presented in this research paper is the first step towards our ultimate goal of providing meaningful formative feedback to the students for enhancing their writing skills and capabilities.",

keywords = "Artificial Neural Networks, Backward Elimination, Dimensionality reduction, Feature extraction, Feature reduction, Feature selection, k-fold Cross Validation, Levenberg Marquardt, Measurement, Natural Language Processing, Semantics, Surface morphology, Syntactics, Training, Writing",

author = "Mohsin Abbas and Peter van Rosmalen and Marco Kalz",

note = "Publisher Copyright: IEEE",

year = "2023",

month = sep,

day = "29",

doi = "10.1109/TLT.2023.3320877",

language = "English",

volume = "16",

pages = "914--925",

journal = "IEEE Transactions on Learning Technologies",

issn = "1939-1382",

publisher = "IEEE",

number = "6",

}

TY - JOUR

T1 - A Data-driven Approach for the Identification of Features for Automated Feedback on Academic Essays

AU - Abbas, Mohsin

AU - van Rosmalen, Peter

AU - Kalz, Marco

N1 - Publisher Copyright: IEEE

PY - 2023/9/29

Y1 - 2023/9/29

AB - For predicting and improving the quality of essays, text analytic metrics (surface, syntactic, morphological and semantic features) can be used to provide formative feedback to the students in higher education. In this study, the goal was to identify a sufficient number of features that exhibit a fair proxy of the scores given by the human raters via a data-driven approach. Using an existing corpus and a text analysis tool for the Dutch language, a large number of features were extracted. Artificial neural networks, Levenberg Marquardt algorithm and backward elimination were used to reduce the number of features automatically. Irrelevant features were eliminated based on the inter-rater agreement between predicted and human scores calculated using Cohen's Kappa ($\kappa$). The number of features in this study was reduced from 457 to 28 and grouped into different categories. The results reported in this paper are an improvement over a similar previous study. Firstly, the inter-rater reliability between the predicted scores and human raters was increased by tweaking the corpus for overfitting for average scores. The resulting maximum value of $\kappa$ showed substantial agreement compared to moderate inter-rater reliability in the prior study. Secondly, instead of using a dedicated training and test set, the training and testing phases in the new experiments were performed using k-fold cross validation on the corpus of texts. The approach presented in this research paper is the first step towards our ultimate goal of providing meaningful formative feedback to the students for enhancing their writing skills and capabilities.

KW - Artificial Neural Networks

KW - Backward Elimination

KW - Dimensionality reduction

KW - Feature extraction

KW - Feature reduction

KW - Feature selection

KW - k-fold Cross Validation

KW - Levenberg Marquardt

KW - Measurement

KW - Natural Language Processing

KW - Semantics

KW - Surface morphology

KW - Syntactics

KW - Training

KW - Writing

U2 - 10.1109/TLT.2023.3320877

DO - 10.1109/TLT.2023.3320877

M3 - Article

SN - 1939-1382

VL - 16

SP - 914

EP - 925

JO - IEEE Transactions on Learning Technologies

JF - IEEE Transactions on Learning Technologies

IS - 6

ER -