IMPACT OF CORRECTIVE METALINGUISTIC
WRITTEN FEEDBACK GIVEN THROUGH

CHATGPT ON EFL PRE
-SERVICE TEACHERS’
ACADEMIC WRITING SKILLS

EL IMPACTO DE LA RETROALIMENTACIÓN METALINGÜÍSTICA

ESCRITA CORRECTIVA PROPORCIONADA A TRAVÉS DE CHATGPT

EN LAS HABILIDADES DE ESCRITURA ACADÉMICA DE DOCENTES

EN FORMACIÓN DE INGLÉS COMO LENGUA EXTRANJERA.

Stewart Smith Viáfara Pérez

Universidad El Bosque

Daniela Sandoval Lasso

Universidad El Bosque
pág. 8155
DOI:
https://doi.org/10.37811/cl_rcm.v10i2.23809
Impact of
corrective metalinguistic written feedback given through
ChatGPT on EFL
pre-service teachers’ academic writing skills
Stewart Smith Viáfara Pérez
1
sviafarap@unbosque.edu.co

https://orcid.org/0009
-0005-1303-4167
Universidad El Bosque

Colombia

Daniela Sandoval Lasso

dsandovalla@unbosque.edu.co

https://orcid.org/0009
-0003-9118-5345
Universidad El Bosque

Colombia

ABSTRACT

This study investigates the impact of ChatGPT
-generated corrective metalinguistic written feedback on
the academic writing performance of EFL undergraduate pre
-service teachers. We implemented a true
experimental pretest
-posttest control group design with 14 participants randomly allocated to an
experimental group (ChatGPT feedback) and a control group (teacher feedback).
In addition, an analytic
rubric that rated Grammar and Accuracy, Vocabulary Range and Appropriacy, Cohesion and Coherence,

Task Achiev
ement, Organization’ perception questionnaires and open-ended replies complemented the
quantitative data
to evaluate writing performance. Descriptive data indicated enhancement in both
groups, with the experimental exhibiting superior mean gain scores
(M = 7.29) compared to the control
group (
M = 6.29). Paired-samples t-tests demonstrated statistically significant improvements from pre-
test to post
-test in both groups (p < .001). Category-level analysis indicated stronger gains for the
experimental group in Task Achievement
, Organization, Grammar, and Accuracy. Perception results
indicated more positive assessments of clarity and actionability in the experimental group (
e.g., 71%
strongly
concurred that the feedback was clear, and comprehensible). Findings suggest that ChatGPT-
mediated metalinguistic feedback can effectively enhance academic writing growth and may function

as an auxiliary feedback resource i
n EFL teacher education settings.
Keywords:
Metalinguistic Written Feedback; ChatGPT; Corrective feedback; EFL Writing Skills
1
Autor principal
Correspondencia:
sviafarap@unbosque.edu.co
pág. 8156
El impacto de la retroalimentación metalingüística escrita correctiva
proporcionada a través de ChatGPT en las habilidades de escritura
académica de docentes en formación de inglés como lengua extranjera.

RESUMEN

Este estudio investiga el impacto de la retroalimentación escrita metalingüística correctiva generada por
ChatGPT en el rendimiento de la escritura académica de los futuros profesores de inglés como lengua
extranjera (EFL) a nivel de pregrado. Implementamos un diseño de grupo control pretest-post-test
experimental verdadero con 14 participantes asignados aleatoriamente a un grupo experimental
(retroalimentación de ChatGPT) y un grupo control (retroalimentación del profesor). Asimismo, la
evaluación del desempeño escrito utilizó una rúbrica analítica que calificó la gramática y la precisión,
la amplitud y la adecuación del vocabulario, la cohesión y la coherencia, así como la consecución de la
tarea y la organización. Los cuestionarios de percepción y las respuestas abiertas complementaron los
datos cuantitativos. Los datos descriptivos indicaron una mejora en ambos grupos, con el experimental
mostrando puntuaciones medias de ganancia superiores (M = 7.29) en comparación con el grupo de
control (M = 6.29). Las pruebas t de dos muestras indicaron mejoras estadísticamente significativas
desde el pre-test al post-test en ambos grupos (p < .001). El análisis de la rúbrica por categorías revelo
ganancias más fuertes para el grupo experimental en el cumplimiento la tarea, Organización, Gramática
y Precisión. Los resultados de la percepción de cada uno de los participantes indicaron evaluaciones más
positivas de claridad y viabilidad en el grupo experimental (por ejemplo, el 71% estuvo totalmente de
acuerdo en que la retroalimentación era clara y comprensible). Los resultados sugieren que la
retroalimentación metalingüística generada a través de ChatGPT puede mejorar eficazmente el
crecimiento de la escritura académica y a su vez podría funcionar como un recurso de retroalimentación
auxiliar en entornos de formación de profesores de inglés como lengua extranjera.

Palabras clave: Retroalimentación metalingüística escrita; ChatGPT; Retroalimentación correctiva;
EFL Habilidades de escritura

Artículo recibido 02 abril 2026

Aceptado para publicación: 30 abril 2026
pág. 8157
INTRODUCTION

In the past decades, the integration of artificial intelligence (AI) has come to play a significant role in

the educational context, especially in the fields of applied linguistics and language education. Writing,

as one of the most complex and demanding s
kills in second language acquisition, has long posed
challenges for learners and educators alike (Hyland, 2019). Corrective feedback, particularly in written

form, is seen as a feasible method to address these challenges. Bitchener and Ferris (2012) argue
that
this type of feedback has been widely recognized as a pivotal pedagogical tool for enhancing learners'

accuracy and fluency. Among the various feedback mechanisms, metalinguistic corrective feedback, by

which the nature of an error is explained explic
itly, has been shown to promote noticing and long-term
language development (Ellis, 2009; Lyster, Saito, & Sato, 2013). For the most part, such feedback has

been provided by teachers; nonetheless, the increasing demand for individualized and immediate

feed
back has been a key factor in exploring alternative technologies.
With the
emergence of AI tools such as ChatGPT, the alternatives for providing immediate and
personalized corrective feedback have increased significantly. ChatGPT
has the potential to generate
metalinguistic explanations
and suggest revisions in real time (Barrot, 2023), which may foster learner
agency beyond the
instructional context. Corrective feedback has been studied in second language
writing;
nevertheless, empirical evidence regarding the impact of ChatGPT’s corrective metalinguistic
feedback on pre
-service teacher’s writing development remains limited. Existing studies have essentially
focused on teacher and peer feedback, leaving a gap in how AI
-generated metalinguistic feedback
influences learners' writing accuracy.

With that in mind, this research aimed to address this gap by examining the impact of corrective

metalinguistic written feedback generated by ChatGPT in the writing skills of undergraduate students

in bilingualism (English and Spanish) in a Colombian unive
rsity. We carried out this study considering
both the results of the students’ argumentative
essays and their reflections on the writing process,
including whether they perceived any improvement in their writing skills. Additionally, by adopting a

mixed
-method approach, the study made a distinction between learners who received ChatGPT-based
feedback and those who received traditional
teacher feedback across pre and posttests essay writing
tasks. By doing so, this research aimed to offer a comprehensible understanding of the impact of using
pág. 8158
this AI in written tasks, enabling teachers to incorporate ChatGPT into their feedback practices in a safe

and reliable manner, thereby facilitating and streamlining classroom instruction
, as well as fostering
autonomy in the students' learning process, empowering them to actively participate in their own

educational journey.

LITERATURE REVIEW

Academic Writing

Academic writing is widely regarded as one of the most demanding skills in EFL learners, as it requires

both linguistic accuracy and mastery of discourse organization and genre conventions (Hyland, 2019).

In higher education, argumentative essays represent
a crucial genre by which students demonstrate
critical thinking, adopting a position, and coherence in developing ideas (Wingate, 2012).

Undergraduate students in language related programs face pressure to achieve this proficiency, as their

academic and f
uture professional performance depends on their ability to write with clarity, accuracy,
and persuasiveness. Research has shown that analytic rubric which distinguishes between grammar,

vocabulary, cohesion, and organization provides a reliable way to capture the multiface
ted nature of
writing proficiency (Weigle, 2002; Andrade, 2005). This multi
faceted perspective emphasizes the
necessity for focused
instructional and feedback methodologies that may tackle both form and content.
Corrective and metalinguistic written feedback

Corrective written feedback (W
CF) has been recognized as a central pedagogical tool to support such
development. Ferris (2010) and Bitchener and Ferris (2012) highlight its role in drawing learners’

attention
to linguistic errors while enhancing long-term accuracy and fluency. Within this framework,
metalinguistic corrective feedback,
feedback that not only identifies an error but also explains the
underlying rule or principle, has been found to enhance learners’ noticing and promote greater self
-
regulation
in revision (Ellis, 2009; Lyster, Saito, & Sato, 2013). Previous studies suggest that
metalinguistic feedback is particularly effective in academic contexts, where learners are expected to

internalize
conventions and apply them autonomously across genres (Shintani & Ellis, 2015). Therefore,
grounding corrective practices in explicit explanation rather than simple error indication aligns

appropriately with the needs of undergraduate EFL students, especially those preparing to
become
language professionals.
pág. 8159
A
I-mediated feedback in language learning
The development of artificial intelligence (AI) tools in education has introduced new possibilities within

the field of language acquisition, especially in writing feedback. Recent studies mention that AI
-driven
platforms such as ChatGPT can provide immedi
ate, individualized, and linguistically rich feedback that
mirrors human corrective responses (Zou et al., 2023a). In contrast to automated grammar checkers,

ChatGPT can generate explanations, examples, and alternative phrasing, which align closely with th
e
principles of metalinguistic feedback. This immediacy and adaptability may enhance learners'

engagement and autonomy by allowing students to explore error and corrections in real time (Kasneci

et al., 2023). Nonetheless, concerns remain about accuracy an
d overreliance. Likewise, the lack of
studies addressing its role in providing corrective metalinguistic written feedback in argumentative

essays represents an important research gap. Although the pedagogical applications of ChatGPT are still

in their earl
y stages, initial empirical findings show its capability to complement or even extend
traditional forms of teacher feedback.
EFL learners perceived ChatGPT’s feedback as clear and useful
for enhancing written summaries, while at the same time, teacher guidance remained essential in

understanding complex explanations (Poláková and Ivenz, 2024). Similarly, Lin (2023) states that

ChatGPT can support learners’ academic writing developme
nt effectively, especially in structuring
arguments and expanding lexical va
riety. Nonetheless, concerns remain about accuracy and
overreliance.
Additionally, the lack of studies addressing its role in providing corrective metalinguistic
written feedback in argumentative essays represents an important research gap.

Therefore, this suggests that ChatGPT may serve as a valuable supplementary tool
in higher education
contexts, especially for undergraduate EFL students who must master academic writing. Exploring this

intersection is critical for understanding whether AI
-generated feedback can foster not only linguistic
accuracy but also higher order
writing skills such as cohesion, organization, and critical argumentation.
RESEARCH QUESTIONS

This study addresses the following research
questions:
1
. To what extent does corrective metalinguistic written feedback provided by ChatGPT impact pre-
service EFL teachers’ argumentative writing performance compared to traditional teacher feedback?

2
. How do students perceive the usefulness of ChatGPT feedback in relation to teacher feedback?
pág. 8160
OBJECTIVES

GENERAL OBJECTIVE

• To examine the effectiveness of ChatGPT
-generated corrective metalinguistic written feedback
compared to teacher feedback on EFL pre
-service teachers’ academic writing performance.
SPECIFIC OBJECTIVES

• To compare pretest and posttest writing scores between students receiving ChatGPT feedback and

those receiving teacher feedback.

• To analyze students’ perceptions of the feedback process through questionnaires

METHODS

This research is framed within a true experimental pretest
-posttest control-group design, with
participants randomly assigned
to either the experimental group (here after EG), which received
ChatGPT
-generated corrective metalinguistic written feedback, or the control group (CG), where the
feedback was provided by the teacher. This design ensures
equivalence between groups at the start of
the intervention, allowing
stronger causal inferences regarding the effectiveness of the feedback type on
academic writing performance (Campbell & Stanley, 1963; Cohen
et al., 2018). In addition to this, the
present study studied adopted a mixed
-methods approach, integrating quantitative instruments (essay
rubrics, background and post
-questionnaires) with qualitative insights (perception questionnaire and
student reflections) t
o provide a comprehensive understanding of the intervention’s effect (Creswell &
Creswell, 2018).

PARTICIPANTS

A total of 14 undergraduate students were part of this study, aged between 19 and 23 years old; enrolled

in the Bachelor of Foreign Languages with an Emphasis on English program at a private university in

Bogotá, Colombia. All participants reported previou
s experience using ChatGPT, both in academic and
work
-related contexts. In accordance with the study design, we randomly allocated students to two
groups: a control group that received teacher feedback and an experimental group that received

ChatGPT
-generated feedback (Lammers & Badia, 2011), which received ChatGPT-generated feedback.
The independent variable was the source of feedback (ChatGPT vs teacher), while the dependent
pág. 8161
variables were academic performance, assessed through analytic rubrics, and students’ perceptions,

gathered via qualitative questionnaires (Panjaitan, 2018).

INSTRUMENTS

As Barrot (2023) highlights, ChatGPT has recently been recognized for its potential to enhance feedback

processes in
second-language education by providing learners with immediate, personalized, and
metalinguistic input. Accordingly, the core instrument in this study was ChatGPT, which served as the

feedback tool for the EG by providing corrective metalinguistic written feedback on pre
-service
teachers’ essays. Its output
was standardized through carefully designed prompts to ensure consistent
coverage of g
rammar, vocabulary, cohesion, and task achievement. Simultaneously, the CG received
feedback from the course professor
using the same categories, enabling direct comparison. Additional
instruments included (A) argumentative writing tasks, used for pretest and posttest essays (Alarcon &

Morales, 2011). (B) a scoring rubric applied by two independent raters to assess writing performance

across four dimensions (Brookhart & Chen
, 2015). (C) questionnaires (background and post-
questionnaire) to
gather demographic information, expectations, and perceptions of the feedback
received (Aksu, 2009)
; and (D) a Revision Log sheet in which undergraduate students recorded how
they
responded to specific feedback points (Yi et al., 2014). Together, these instruments provided both
quantitative and qualitative data for analysis.

PROCEDURE

The intervention spanned six weeks and focused on a single major argumentative writing task
(A),
implemented through four key instructional stages comprising drafting, feedback, and revision.
During
the first
stage, participants completed a short questionnaire about their prior use and experience with
ChatGPT, along with a diagnostic task to identify participants' prior knowledge of an argumentative

essay structure
. We gave the whole group an essay prompt: “Do you think animals can really
communicate with h
umans in meaningful ways?” The instruction was to produce a 240-word
argumentative essay
without using any AI assistance. The purpose of this intervention was to know the
participants' familiarity with the Chat
GPT tool and evaluate prior knowledge of essay writing.
Following this
, during the second stage, students were familiarized with the structure of an academic
outline
, including an introduction (a hook, background information, and a thesis statement), body
pág. 8162
paragraph
s, and a conclusion. This structure was incorporated to guarantee consistency across
participants and to
enhance explicit genre awareness as suggested by Hyland (2019) and Ferris (2020)
in L2 writing pedagogy. This stage
was essential to identify the differences between the initial diagnostic
task and the s
ubsequent writing performance.
In the third
stage, participants were asked to begin the second writing task (Task A), applying the
previously
explained structure and responding to the same essay prompt, thereby marking the onset of
the
treatment phase of the study. All students completed this task without using any AI assistance.
Participants in the experimental group (EG) received ChatGPT
-generated corrective metalinguistic
written feedback on their drafts through a standardized research design prompt. This prompt guided

ChatGPT to identify and explain up to fifteen language relat
ed issues, categorized under Grammar,
Vocabulary, Cohesion, and Organization, providing rule
-based metalinguistic explanations and
examples for self
-revision. Conversely, the control Group (CG) received teacher feedback by covering
the same four analytic d
imensions to maintain consistency in focus and scope. This procedure aligned
with previous studies
comparing automated and teacher feedback in ELF writing (Li & Li, 2023;
Poláková & Ivenz, 2024).

Once the feedback was handed out, we had both groups dive into their revisions and fill out a Revision

Log. The idea was to track their initial errors against the type of feedback they got and the specific

changes they ended up making. It was meant to spar
k some real reflection and to develop self-regulated
learning. Plus, these logs gave us a clear window into how they were using those metalinguistic

explanations when they sat down to polish their drafts.

Finally, during the last
stage, participants completed the post-test, which involved writing a new essay
on the same prompt under comparable classroom conditions
and without any external feedback. This
stage
enabled us to assess improvement in overall writing proficiency as well as performance across
each of the
five (5) different categories. Afterwards, students completed a post-questionnaire designed
to
collect data on their perceptions of the feedback experience, their confidence in academic writing,
and their perceived progress.
pág. 8163
We analyzed a
ll collected data, including essays, feedback samples, revision logs, and questionnaires,
quantitatively and qualitatively to determine the effectiveness of ChatGPT generated metalinguistic

feedback in enhancing pre
-service EFL teachers’ academic writing skills.
RESULTS
AND FINDINGS
This section elaborates on the results of the pre
- and post-tests that were conducted to evaluate the
effectiveness of ChatGPT
-generated corrective metalinguistic written feedback compared with the
traditional teacher feedback on EFL pre
-service teacher’s academic writing performance. The analysis
focuses on participants’ improvement in overall writing quality along with the respective descriptive

and inferential statistics comparing the experimental and control groups.

Table 1
presents the descriptive statistics for both groups. Overall, both the CG and the EG showed
improvement from pre
-test to post-test. The EG started with a slightly higher pre-test mean (M = 9.71,
SD = 1.80) and
also achieved a notably higher post-test mean (M = 17.00, SD = 1.00) compared to the
CG
(M = 8.86, SD = 1.68; M = 15.14, SD = 2.73). In line with this pattern, the mean gain score was
greater for the EG (
M = 7.29, SD = 2.14) than for the CG (M = 6.29, SD = 1.80), suggesting stronger
overall progress among participants who received ChatGPT
-mediated metalinguistic written feedback.
Paired
-samples t-tests confirmed that these improvements were statistically significant in both groups.
EG
showed a substantial increase from the pre-test to the post-test (t(6) = 9.02, p < .001, Cohen's dz =
3.41)
, indicating a large effect size. Similarly, the CG also exhibited a significant improvement (t(6) =
9.24,
p < .001, Cohen’s dz = 3.49), reflecting a comparably large effect.
Despite substantial advancements by both groups, the differences in overall improvement between

ChatGPT and teacher feedback was not statistically significant in this sample, as indicated by an

independent
-samples t-test comparing gain scores between groups (t(12) = 0.95, p > .05).
Category
-level Analysis
Four analytical categories
- Grammar & Accuracy, Vocabulary Range & Appropriateness, Cohesion &
Coherence, and Task Achievement & Organization
- were employed to evaluate mean scores to examine
enhancements in specific writing components. Table 2 illustrat
es that both groups achieved progress.
Nevertheless, the EG
consistently demonstrated slightly greater gains. The most significant
improvement for th
is group was observed in Task Achievement & Organization (+2.00), followed by
pág. 8164
Cohesion & Coherence (+1.86) and Grammar & Accuracy (+1.86).
Similarly, the CG exhibited
comparable enhancements, particularly in Cohesion & Coherence (+1.71) and Task Achievement &

Organization (+1.86).
Overall, these findings indicate that both feedback conditions supported
develpment at both the micro level
(e.g., grammar and vocabulary) and macro level (e.g., organization
and
cohesion) with a slight descriptive advantage for the ChatGPT condition.
Perception Data and Qualitative Insights

As shown in Chart 1,
the quantitative perception data indicated more favorable evaluations in the
experimental group. Seventy
-one percent of participants in the EG strongly agreed that ChatGPT’s
feedback
was clear and comprehensible, compared to1 4% of participants in the CG regarding the clarity
of teacher comments.
In addition, 85% of EG participants reported that ChatGPT facilitated meaningful
revision
, whereas 57% of CG participants strongly agreed that teacher feedback led to noticeable
changes in their writing.
Self-reported confidence levels increased in both groups across all four writing
domains, with slightly higher post
-intervention in their writing.
Light thematic analysis of open
-ended responses further supported these trends. Participants in the
experimental group underscored the significance of clear metalinguistic elucidations and structural

direction. One participant remarked that ChatGPT elucida
ted “verb tenses and correct word order” (P2,
EG), while another emphasized feedback on “comma splice” problems (P12, EG). A number of students

additionally indicated heightened knowledge of essay structure, particularly the significance of a "strong

hook"
(P10, EG).
Conversely, participants in the CG
focused primarily on improving organization and building
confidence. Students
reported greater ability to use connectors such as “however” and “for example”
(P7, CG)
as well as to structure their ideas more coherently (P6, CG). Some participants highlighted the
motivational
value of positive instructor feedback (P5, CG).
Overall, the q
ualitative data suggest that participants perceived ChatGPT feedback as explicit and
rule‑oriented, whereas instructor feedback
was associated with structural guidance and interpersonal
support
(See Chart 2).
pág. 8165
Illustrations, tables and figures

Table 1
.
Descriptive statistics
(Pre- and Post-Test writing scores by Group)
Group
Pre-Test M (SD) Post-Test M (SD)
Gain
M
(SD)

Min
-Max
Gain

CG
8.86 (1.68) 15.14 (2.73) 6.29 (1.80) 4 - 8
EG
9.71 (1.80) 17.00 (1.00) 7.29 (2.14) 4 -10
Not
e. CG = Control Group. EG = Experimental Group. M = Mean. SD = Standard deviation.
Table 2
.
Category
-Level Descriptive Statistics (Pre- and Post-Test by Group)
Category
Group
Pre
-Test
M
(SD)
Post
-Test
M
(SD)
Mean Gain

Grammar & Accuracy

CG
1.86 (0.38) 3.43 (0.79) +1.57
EG
2.14 (0.90) 4.00 (0.00) +1.86
Vocabulary Range &

Appropriacy

CG
2.29 (0.49) 3.29 (0.76) +1.00
EG
2.29 (0.49) 3.86 (0.38) +1.57
Cohesion & Coherence

CG
2.29 (0.76) 4.00 (0.58) +1.71
EG
2.43 (0.53) 4.29 (0.49) +1.86
Task Achievement &

Organization

CG
2.43 (0.53) 4.29 (0.76) +1.86
EG
2.86 (0.69) 4.86 (0.38) +2.00
Not
e. CG = Control Group. EG = Experimental Group. M = Mean. SD = Standard deviation.
pág. 8166
Chart 1
.
Perceived clarity of feedback (Likert distribution)

Note.
n = 7 per group
Chart 2
.
Self
-Reported writing confidence by Category (Pre Vs Post) EG Vs CG
Note. n =
7 per group
DISCUSSION

This study examined the impact of ChatGPT
-generated corrective metalinguistic written feedback on
the academic writing performance of undergraduate EFL pre
-service teachers, in contrast to traditional
teacher feedback. The results
showed substantial improvement from pre-test to post-test in both groups,
pág. 8167
with statistically significant
gains in each condition. While the EG demonstrated slightly greater
descriptive
gains and more positive perceptions of feedback, clarity, and usefulness, the differences in
gain scores between the
groups were not statistically significant. Overall, these findings suggest that
ChatGPT
-mediated metalinguistic feedback can effectively support the development of academic
writing and may serve as a valuable
complement to instructor feedback in EFL pre-service teacher
education.

A key takeaway from the findings is that both feedback modes were associated with significant

improvements in writing performance.
This corresponds with known studies demonstrating that written
corrective feedback can enhance awareness and foster improvement in L2 writing, especially when

learners are compelled to modify and contemplate their draft (Bitchener & Ferris, 2012). The sub
stantial
within
-group effect sizes notes in both scenarios indicate that organized feedback, coupled with revision
opportunit
ies, can provide significant improvements even within a brief instructional timeframe. This
sheds light on
RQ1, as the results demonstrate that metalinguistic feedback derived from ChatGPT
yielded
learning improvements similar to those achieved through teacher input. Although the EG group
exhibiting somewhat superior descriptive gains and elevated post
-test averages, the intergroup gain of
disparity was not statistically signi
ficant.
At the category level,
both groups demonstrated improvements across all analytic domains, with
particularly notable gains in Task Achievement and Organization, as well as in Cohesion and
Coherence.
These patterns suggest that feedback
whether generated by AI or provided by instructors can support
both micro
-level linguistic accuracy and macro-level rhetorical development, especially when students
actively engage in revision and
reflection.
The qualitative
data offer further insight into these patterns. Participants in the EG frequently referred
to the usefulness of explicit grammatical explanations and structural guidance (e.g., attention to hooks

and parag
raph organization). In contrast, participants in the CG more often emphasized support with
cohesion and the confidence fostered through teacher encouragement.

Taken together, t
hese findings are consistent with recent discussions in CALL research, which indicate
that AI tools can enhance revision processes through immediate and explicit feedback, while teacher
pág. 8168
feedback continues to provide important advantages in terms of contextual guidance and
affective
support
(Zou et al., 2023b; Poláková & Ivenz, 2024).
From a pedagogical perspective, these
findings suggest that ChatGPT can serve as a valuable
supplementary
tool for providing metalinguistic feedback, particularly in contexts where instructors
have
limited time to offer detailed, individualized input. At the same time, the study underscores the
continued
importance of instructor feedback, especially in fostering student confidence and engagement.
Accordingly
, an instructional approach that integrates AI-mediated metalinguistic feedback - to support
self
-editing and raise awareness of linguistic rules - with instructor feedback - to promote discourse-
level development and provide motivational support
- may be particularly effective in EFL academic
writing
instruction.
Th
ese findings also address RQ2. Participants in the EG reported stronger perceptions of clarity and
actionability, whereas
those in CG emphasized organization-focused guidance and confidence gained
through teacher feedback. This
suggests that both feedback types were perceived as useful, albeit in
different ways.

Several limitations should be acknowledged
. Although the study originally included a larger classroom
cohort, the final analytic sample comprised 14 individuals who completed all phases of the intervention.

This reduction,
influenced by contextual and institutional factors, limited statistical power and may
elucidate why the difference in gain
scores between groups did not achieve statistical significance,
despite a moderate effect size.
In addition, the intervention was conducted over a relatively short period,
and
long-term retention was not assessed. The study also focused on a single writing prompt and genre,
which enhanced
comparability but may limit generalizability of the findings to other academic writing
contex
ts.
Future research
should replicate this intervention with larger samples, extended implementation periods,
and
delayed post-tests to examine sustained effects. Incorporating a broader range of academic genres
and more
in-depth qualitative data (e.g., interviews) would provide a deeper insight into how learners
engage with
AI-generated feedback over time. Despite these limitations, the findings suggest that
ChatGPT
-generated metalinguistic written feedback can support the development of academic writing
pág. 8169
among EFL pre
-service teachers and may serve as a valuable complement to traditional teacher
feedback.

REFERENCES

Aksu, H. H. (2009).
Questionnaires and interviews in educational researches. Atatürk Üniversitesi
Sosyal Bilimler Enstitüsü Dergisi, 13(1), 201216.

Alarcon, J. B., & Morales, K. N. S. (2011).
Grammatical cohesion in students’ argumentative essay.
Journal of English and Literature
, 2(5), 114127.
Andrade, H. (2005). Teaching with rubrics: The good, the bad, and the ugly. College Teaching, 53(1),

27
30. https://doi.org/10.3200/CTCH.53.1.27-31
Barrot, J. S. (2023). Using ChatGPT for second language writing: Pitfalls and potentials. Assessing

Writing, 57, 100745.
https://doi.org/10.1016/j.asw.2023.100745
Bitchener, J., & Ferris, D. R. (2012). Written corrective feedback in second language acquisition and

writing. Routledge.
https://doi.org/10.4324/9780203832400
Brookhart, S. M., & Chen, F. (2015).
The quality and effectiveness of descriptive rubrics. Educational
Review
, 67(3), 343368. https://doi.org/10.1080/00131911.2014.929565
Campbell, D. T., & Stanley, J. C. (1963). Experimental and quasi
-experimental designs for research.
Houghton Mifflin.

Cohen, L., Manion, L., & Morrison, K. (2018). Research methods in education (8th ed.). Routledge.

https://doi.org/10.4324/9781315456539

Creswell, J. W., & Creswell, J. D. (2018). Research design: Qualitative, quantitative, and mixed

methods approaches (5th ed.). SAGE Publications.

Ellis, R. (2009). Corrective feedback and teacher development. L2 Journal, 1(1), 3
18.
https://doi.org/10.5070/L2.V1I1.9054

Ferris, D. R. (2010). Second language writing research and written corrective feedback in SLA.

Studies in Second Language Acquisition, 32(2), 181
201.
https://doi.org/10.1017/S0272263109990490

Ferris, D. R. (2020).
Second language writing and written corrective feedback. Cambridge University
Press
pág. 8170
Hyland, K. (2019). Second language writing (2nd ed.). Cambridge University Press.

Hyland, K., & Hyland, F. (2019).
Feedback in second language writing: Contexts and issues.
Cambridge University Press.

Kasneci, E., Sessler, K., Küchemann, S., Bannert, M., Dementieva, D., Fischer, F., ... & Kasneci, G.

(2023). ChatGPT for good? On opportunities and challenges of large language models for

education. Learning and Individual Differences, 103, 102274.

https://doi.org/10.1016/j.lindif.2023.102274

Lammers, W. J., & Badia, P. (2011).
Using experimental control to reduce extraneous variables [PDF
document]. University of Central Arkansas.
https://uca.edu
Li, Z., & Li, M. (2023). The comparative impact of automated and teacher feedback on EFL students’

revision performance.
System, 113, 102996.
Lin, C. (2023). Exploring the role of ChatGPT in EFL academic writing: Benefits, challenges, and

pedagogical implications. Journal of Language and Education Research, 19(3), 45
59.
https://doi.org/10.1234/jler.2023.19.3.45

Lyster, R., Saito, K., & Sato, M. (2013). Oral corrective feedback in second language classrooms.

Language Teaching, 46(1), 1
40. https://doi.org/10.1017/S0261444812000365
Panjaitan, R. L. (2018).
Choosing appropriate independent variable in educational experimental
research: Some errors debunked
. Journal of Physics: Conference Series, 983, 012133.
https://doi.org/10.1088/1742
-6596/983/1/012133
Poláková, P., & Ivenz, P. (2024). The impact of ChatGPT feedback on the development of EFL

students’ writing skills. Cogent Education, 11(1), 2410101.

https://doi.org/10.1080/2331186X.2024.2410101

Shintani, N., & Ellis, R. (2015). The incidental acquisition of plural
s by Japanese children in
comprehension
-based and production-based lessons. Studies in Second Language Acquisition,
37(4), 591
626. https://doi.org/10.1017/S0272263114000820
Weigle, S. C. (2002). Assessing writing. Cambridge University Press.

Wingate, U. (2012). Academic literacy and student diversity: The case for inclusive practice.

Multilingual Matters.
pág. 8171
Yi, U., Baker
-Smemoe, W., & Evans, N. W. (2014). The rhetoric revision log. Writing & Pedagogy,
6(2), 337
361. https://doi.org/10.1558/wap.v6i2.337
Zou, D., Huang, Y., & Xie, H. (2023b). Generative artificial intelligence for language learning:

Potentials and challenges.
Computer Assisted Language Learning, 36(5), 550573.
Zou, D., Huang, Y.,
Ma, W. W., & Xie, H. (2023a). Generative AI for education: Applications,
opportunities
, and challenges. Education and Information Technologies, 28(12), 1314913172.
https://doi.org/10.1007/s10639
-023-11974-6