0% found this document useful (0 votes)
6 views13 pages

Development of Contextually-Based Animated Video Media For Improving Speaking Abilities in Children With Speech Delay

This study developed contextually-based animated video media aimed at improving speaking abilities in children with speech delay, utilizing the ADDIE development model. The intervention demonstrated high feasibility and effectiveness, with a 16% improvement in speaking abilities compared to traditional media, validated by expert evaluations and teacher feedback. The findings suggest that integrating multimedia learning with contextual approaches can enhance speech interventions, although further research is needed for broader applicability.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
6 views13 pages

Development of Contextually-Based Animated Video Media For Improving Speaking Abilities in Children With Speech Delay

This study developed contextually-based animated video media aimed at improving speaking abilities in children with speech delay, utilizing the ADDIE development model. The intervention demonstrated high feasibility and effectiveness, with a 16% improvement in speaking abilities compared to traditional media, validated by expert evaluations and teacher feedback. The findings suggest that integrating multimedia learning with contextual approaches can enhance speech interventions, although further research is needed for broader applicability.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd

JOURNAL OF INNOVATION AND RESEARCH IN PRIMARY EDUCATION | ISSN 2829-775X

Vol. 4, No. 4, 2025, 2689-2701


[Link]

Development of Contextually-Based Animated Video Media for Improving


Speaking Abilities in Children with Speech Delay

Trimartini*
Universitas Negeri Yogyakarta, Indonesia

Sukinah
Universitas Negeri Yogyakarta, Indonesia

*Corresponding Author: trimartini.2024@[Link]

Abstract
Keywords Speech delay affects 3-15% of school-age children, requiring innovative interventions that
animated video combine engaging multimedia elements with evidence-based pedagogical approaches.
speech delay Traditional therapeutic methods often lack the multimodal stimulation necessary to
contextual learning maintain attention and motivation in young children with speech delay. This study aimed
multimedia intervention to develop and evaluate the effectiveness of contextually-based animated video media for
early childhood improving speaking abilities in children with speech delay using the ADDIE development
model. A research and development methodology employed the ADDIE framework with
Article History single-subject experimental design. One 4-year-old child with diagnosed speech delay
Received 2025-07-20 participated as the primary subject. Data collection utilized expert validation scales for
Accepted 2025-09-27 media and content assessment, systematic observation protocols for speaking ability
measurement, and teacher response questionnaires. Expert validation involved content
Copyright © 2025 by Author(s). and media specialists evaluating technical quality and educational appropriateness.
This is an open access article Speaking ability was assessed across five indicators using 5-point rating scales, comparing
under the CC BY-SA license. performance between traditional image media and animated video media conditions.
Expert validation confirmed high feasibility with media quality scoring 4.3/5.0 and content
appropriateness 4.2/5.0. Teacher evaluation demonstrated exceptional reception at
95.8%. Comparative assessment revealed 16% improvement in overall speaking ability
when using animated video media versus traditional image media, with enhancements
observed in articulation clarity, word production, and verbal responsiveness across four of
five assessed indicators. Contextually-based animated video media proved feasible and
effective for improving speaking abilities in children with speech delay. The integration of
multimedia learning theory with contextual learning principles offers promising directions
for technology-enhanced speech interventions, though larger-scale randomized controlled
trials are needed to establish robust evidence for clinical practice.

INTRODUCTION
Language development represents a fundamental aspect of early childhood growth, serving as
the primary vehicle for social interaction, cognitive development, and academic success. During the
critical early years, children typically progress through predictable stages of language acquisition,
developing receptive and expressive language skills that enable effective communication with their
environment. However, a significant proportion of children experience speech delays, with prevalence
rates ranging from 3% to 15% among school-age children, presenting substantial challenges for their
social interaction, learning outcomes, and overall developmental trajectory. Recent evidence suggests
that pediatric speech delays have more than doubled for children aged 12 and younger since the
COVID-19 pandemic, highlighting the urgent need for innovative intervention approaches.
Speech delay, characterized by significantly below-average speech development compared to
age-matched peers, manifests through difficulties in phonological production, limited vocabulary
acquisition, and challenges in constructing coherent verbal expressions. The development of spoken
and written language relies on an intact phonological system, with phonological deficits manifesting as
Journal of Innovation and Research in Primary Education | 4(4), 2025 | 2689-2701

difficulties in accurately producing speech sounds and mapping phonemes onto letters during reading
processes (Anthony et al., 2011; Rvachew, 2007). These deficits often result in compromised social
skills, reduced academic performance, and potential long-term impacts on educational and social
functioning. Early language skills serve as some of the best predictors of academic, social, and
vocational outcomes, making timely intervention critical for optimizing developmental trajectories.
Traditional interventional approaches for speech delay have predominantly relied on
conventional therapeutic methods, often lacking the engaging, multimodal elements necessary to
maintain children's attention and motivation. Early intervention research demonstrates that caregiver-
implemented communication interventions can effectively improve caregiver use of language
facilitation strategies and subsequently enhance children's language skills, though effects vary across
different outcome measures (Roberts & Kaiser, 2011). Contemporary research emphasizes the critical
importance of utilizing innovative educational technologies that combine auditory and visual
stimulation to enhance language learning outcomes. Alfarida and Dafit (2024) demonstrated the
effectiveness of animated video media in developing speaking abilities among elementary students,
while Alya et al. (2025) confirmed that educational animated videos significantly stimulated language
abilities in early childhood, providing empirical support for technology-enhanced language
interventions.
Animation-based educational media offers unique advantages for language development
through its capacity to present dynamic visual representations combined with synchronized auditory
input. Animation serves as a presentation of ideas in visual form, utilizing dynamic expression that
displays movements from different sources to assist the generation of mental models and provide
external models for mental representation (Castro-Alonso et al., 2014; Höffler & Leutner, 2007). Meta-
analytic evidence indicates that animation is beneficial for learning compared to static graphics, with
its feature of changes in time having an overall more positive effect than static pictures (Berney &
Bétrancourt, 2016). The multimedia learning theoretical framework supports the integration of visual
and auditory elements, advocating for collaborative multimedia creation that provides clear,
comprehensible explanations through simple language accessible to diverse learners (Adetunji &
Levine, 2016).
The theoretical foundation for multimedia learning interventions is grounded in Cognitive Load
Theory (CLT), which emphasizes optimizing instructional design to manage the cognitive burden
placed on learners' working memory systems. Sweller (1988) established that working memory has
limited capacity for processing information, and instructional methods should avoid overloading it to
maximize learning outcomes. The cognitive theory of multimedia learning specifies that the human
information processing system includes dual channels for visual/pictorial and auditory/verbal
processing (Mayer, 2001). The multimedia principle indicates that working memory capacity increases
when material is presented using both visual and auditory methods (Chandler & Sweller, 1992).
Recent empirical investigations have consistently demonstrated the effectiveness of animated
video interventions for speech development. Amah et al. (2025) reported significant improvements in
speaking abilities among 4-5 year-old children following animated video interventions, while Aminah
(2023) confirmed enhanced language capabilities through animated video media implementation.
Astini et al. (2020) further validated the utility of animated films for improving speaking abilities in 5-6
year-old children, while Habibah and Nafiqoh (2022) demonstrated that animated videos effectively
enhanced both listening and speaking skills in early childhood populations.
Contextual learning approaches provide complementary pedagogical frameworks by connecting
academic content with real-life experiences. Contextual learning operates through connecting brain
actions to meaningful pattern creation via academic content linkage with real-life contexts (Davtyan,
2014; Hull, 1995). Constructivist philosophy underlying contextual approaches emphasizes that
learning extends beyond memorization to meaningful knowledge construction (Suryawati & Osman,
2017; Vygotsky, 1978). Hernawati et al. (2024) demonstrated the effectiveness of story-telling

2690
Journal of Innovation and Research in Primary Education | 4(4), 2025 | 2689-2701

methods using animated video media for enhancing speaking skills, while Maulina (2025) confirmed
the efficacy of digital storytelling through animated videos for improving children's speaking abilities.
Despite growing evidence supporting animated video interventions for language development,
significant gaps remain in understanding optimal design characteristics and implementation strategies
for children with speech delay. Existing research has primarily focused on typically developing children
or has employed limited sample sizes and methodological approaches. Meta-analyses of parent-
implemented language interventions demonstrate modest improvements in expressive vocabulary and
receptive language, but effects are not consistently significant across all linguistic domains (Roberts &
Kaiser, 2011). Furthermore, few studies have systematically examined the integration of contextual
learning principles with animated video technology specifically for speech delay populations.
The present study addresses these knowledge gaps by developing and evaluating a
contextually-based animated video media intervention designed specifically for children with speech
delay. The research aims to create an innovative educational tool that combines the engaging
characteristics of animation with meaningful real-life contexts, thereby facilitating more effective
speech development outcomes. This research contributes to the expanding knowledge base on
technology-enhanced interventions for speech delay by providing empirical evidence regarding the
effectiveness of contextually-grounded animated video media.

METHODS
This study employed a research and development methodology using the ADDIE model
(Analysis, Design, Development, Implementation, Evaluation) to create and evaluate contextually-
based animated video media for improving speaking abilities in children with speech delay. The
research design incorporated both developmental and experimental phases, utilizing a single-subject
experimental design with pre-post intervention comparisons to assess media effectiveness.
The study population consisted of children aged 4-5 years attending TKIT Bina Insan who
exhibited speech delay characteristics. Using purposive sampling, one 4-year-old child with diagnosed
speech delay served as the primary subject for product testing. The limited sample size reflected the
specialized nature of the intervention and the availability of children meeting specific inclusion criteria:
confirmed speech delay diagnosis, age between 4-5 years, absence of other developmental disorders,
and parental consent for participation.
Data collection employed multiple instruments designed to capture different aspects of media
development and effectiveness. Expert validation utilized structured assessment scales for both media
and content specialists, evaluating technical quality, age-appropriateness, and educational value using
5-point Likert scales. Child speaking ability was assessed through systematic observation protocols
measuring five key indicators: consonant and vowel articulation clarity, simple word production,
phrase construction, verbal response to questions, and overall articulation quality. Each indicator was
rated on a 5-point scale ranging from "not emerged" to "excellent." Child response to media was
evaluated using observation scales measuring attention focus, imitation behavior, interaction with
content, emotional expression, and overall engagement. Teacher response questionnaires assessed
media usability, educational value, contextual relevance, and age-appropriateness using 4-point rating
scales.
Expert validation involved content and media specialists who evaluated the animated video
using predetermined criteria. Content validity was established through expert review of educational
objectives, developmental appropriateness, and contextual alignment. Media quality assessment
examined technical aspects including animation quality, audio clarity, duration suitability, and
interactivity features. Reliability was ensured through consistent application of assessment protocols
and inter-rater agreement procedures.
Data collection procedures followed the ADDIE framework phases. The analysis phase involved
needs assessment through stakeholder interviews and baseline child assessment. Design phase
included storyboard development and content planning. Development phase encompassed media

2691
Journal of Innovation and Research in Primary Education | 4(4), 2025 | 2689-2701

production and expert validation. Implementation involved structured intervention sessions with
systematic observation and documentation. Evaluation phase included post-intervention assessment
and comparative analysis.
Data analysis utilized descriptive statistics and percentage-based comparisons to evaluate
media effectiveness. Expert validation scores were calculated using mean ratings and converted to
qualitative categories: very feasible (>4.2), feasible (3.4-4.2), less feasible (2.6-3.4), not feasible (1.8-
2.6), and very unfeasible (≤1.8). Child performance improvements were analyzed through percentage
change calculations comparing pre-intervention, image media, and animated video media conditions.
Teacher response data were analyzed using percentage calculations, with scores above 70%
indicating positive reception. Effectiveness was determined through multi-source triangulation
incorporating expert validation, child performance improvements, and teacher feedback ratings.

RESULTS AND DISCUSSION


Results
Analysis Phase Results
The initial analysis phase of the ADDIE model focused on identifying the specific needs and
characteristics of children with speech delay to inform media development. Comprehensive needs
assessment was conducted through structured interviews with teachers and parents, revealing critical
requirements for effective intervention media. The analysis identified that children with speech delay
require clear articulation models, repetitive practice opportunities, and engaging visual content that
maintains attention spans. Baseline assessment of the target child revealed speaking abilities scoring
40% across measured indicators, confirming the need for specialized intervention approaches.
Stakeholder interviews highlighted three primary media requirements: clear pronunciation of
vocabulary and narration, simplified visual presentations without excessive movement that could
distract focus, and interactive elements enabling adult facilitation during learning sessions. These
findings informed the subsequent design specifications and established clear parameters for media
development.
Design Phase Results
The design phase translated analysis findings into concrete media specifications and educational
frameworks. The development team established specific objectives targeting improvement in
consonant and vowel articulation, simple word production, phrase construction, verbal responsiveness,
and overall articulation clarity. Content selection focused on contextual learning principles,
incorporating daily life activities familiar to young children.
Detailed storyboard development encompassed 17 scenes featuring everyday activities
including eating, drinking, tooth brushing, playing, learning, and sleeping. Each scene was designed to
present vocabulary within meaningful contexts while providing multiple repetition opportunities and
syllable segmentation to facilitate imitation and practice. The design phase established a target
duration of 5-7 minutes to accommodate typical attention spans of children with speech delay,
ensuring sustained engagement without cognitive overload.
Collaborative planning sessions involving content experts, media specialists, and educational
practitioners resulted in comprehensive design specifications addressing both technical and
pedagogical requirements. The design framework integrated multimedia learning principles with
contextual learning approaches, establishing clear guidelines for animation style, narration pace, and
interactive elements.

Development Phase Results


The development phase transformed design specifications into functional animated video media
through systematic production processes. Professional animation specialists created 2-dimensional
animated sequences depicting contextual scenarios with vibrant, child-appropriate visual elements.

2692
Journal of Innovation and Research in Primary Education | 4(4), 2025 | 2689-2701

Audio production involved clear narration with deliberate pacing, syllable segmentation, and multiple
repetitions designed to facilitate speech practice and imitation.
Technical development incorporated 17 distinct scenes with coordinated visual and auditory
elements supporting language learning objectives. The media featured consistent character
presentation, clear facial expressions for articulation modeling, and synchronized text display following
narration timing. Production quality met professional standards for educational media while
maintaining accessibility for diverse learning needs.
The final product characteristics demonstrate integration of contextual learning principles with
multimedia design elements. Figure 1 illustrates the syllable segmentation scene where complex
words are broken down into manageable components to facilitate pronunciation practice.

Figure 1. Syllable Segmentation Scene Example

This scene exemplifies the intervention's approach to supporting phonological development


through visual and auditory segmentation. The animated character provides clear articulation
modeling while text segmentation offers visual support for syllable identification. The deliberate
pacing allows children with speech delay adequate processing time for imitation attempts.
Figure 2 demonstrates the complete sentence practice scene, where children are encouraged to
produce full utterances after syllable-by-syllable practice.

Figure 2. Complete Sentence Practice Scene

The progression from segmented to complete sentence presentation supports gradual skill
building while maintaining contextual relevance. The visual consistency between segmentation and
complete sentence scenes facilitates learning transfer and promotes confidence in verbal production
attempts.
The contextual learning integration is evident throughout the media, with each scene featuring
familiar daily activities that children encounter in their regular routines. This scene demonstrates how
the intervention connects language learning with familiar experiences, reducing cognitive load while
enhancing relevance and motivation. The visual richness provides multiple opportunities for
vocabulary exposure while maintaining focus on primary speech targets.
Expert validation during development confirmed media quality and educational appropriateness.
Media expert evaluation assessed six key criteria including animation quality, color appropriateness,

2693
Journal of Innovation and Research in Primary Education | 4(4), 2025 | 2689-2701

audio clarity, interactivity potential, usability, and duration suitability. Table 1 presents the
comprehensive media expert validation scores.
Table 1. Media Expert Validation Results
No. Indicator Score
1 Animation quality attractive and appropriate for children 4
2 Colors used appropriate for child age (not too contrasting/dark) 5
3 Audio (narration, sound effects) clear and easily understood by children 4
4 Media enables child interaction, such as sound imitation 4
5 Media easy to use by children and facilitators (teachers/parents) 5
6 Video duration appropriate for child attention span 4
Total Score 26
Mean Score 4.3

The media expert validation yielded a mean score of 4.3, indicating that the animated video
media falls within the "very feasible" category according to established criteria (>4.2). This high rating
demonstrates that the technical and design aspects of the media meet professional standards for early
childhood educational materials.
Content expert evaluation focused on nine educational and developmental criteria to assess the
appropriateness of learning materials for children with speech delay. Table 2 summarizes the content
validation outcomes.

Table 2. Content Expert Validation Results


No. Indicator Score
1 Material relevant to improving speaking ability of speech delay children 4
2 Material appropriate for children's needs to improve speaking ability 5
3 Material appropriate for child developmental level 4
4 Material concepts in animated video theoretically accurate 4
5 Material in animated video contains no misinformation causing misconceptions 5
6 Material related to children's daily life 5
7 Material helps children practice speaking ability 4
8 Material presented attractively to motivate children 4
9 Material beneficial for children with other disabilities 3
Total Score 38
Mean Score 4.2

Content expert validation achieved a mean score of 4.2, placing the educational content within
the "very feasible" category. The evaluation confirmed that the material design appropriately
addresses the specific needs of children with speech delay while maintaining developmental
appropriateness and theoretical accuracy.
Product revision was conducted based on expert feedback, addressing recommendations for
syllable segmentation consistency, vocabulary standardization, visual-text coordination, color
enhancement, and synchronized text presentation. These modifications ensured optimal educational
effectiveness and technical quality before implementation phase initiation.
Implementation Phase Results
The implementation phase involved systematic deployment of the completed animated video
media with the target child through structured intervention sessions. Pre-implementation baseline
assessment established initial speaking ability levels using standardized observation protocols.
Implementation proceeded through carefully monitored sessions allowing systematic observation of
child responses and engagement patterns.
Teacher response evaluation provided crucial insights into the practical application and
perceived effectiveness of the animated video media. Two experienced early childhood educators
evaluated the media across four dimensions: visual presentation, educational benefits, contextual

2694
Journal of Innovation and Research in Primary Education | 4(4), 2025 | 2689-2701

learning alignment, and age appropriateness. Table 3 presents the teacher response evaluation
results.
Table 3. Teacher Response Evaluation Results
Subject Total Score Percentage
Teacher A 22 91.6%
Teacher H 24 100%
Average 95.8%

The teacher response evaluation demonstrated exceptionally positive reception, with an


average score of 95.8%. This result significantly exceeds the 70% threshold for positive
categorization, indicating strong professional endorsement of the media's educational value and
practical applicability in classroom settings.
Child engagement monitoring during implementation revealed sustained attention throughout
media presentation, active attempts at verbal imitation, and positive emotional responses to animated
content. The contextual elements featuring daily life activities facilitated recognition and engagement,
supporting the theoretical foundation underlying the intervention design.

Evaluation Phase Results


The evaluation phase assessed intervention effectiveness through comparative analysis of
speaking ability across different media conditions. The primary outcome measure examined changes
in speaking ability across three conditions: baseline assessment, traditional image media intervention,
and contextually-based animated video media intervention. Five specific indicators were systematically
observed and rated on a 5-point scale. Table 4 presents the comparative speaking ability assessment
results.
Table 4. Comparative Speaking Ability Assessment Results
Indicator Image Animated
Media Score Video Score
Consonant and vowel articulation clarity 2 3
Simple word production (1-2 syllables) 2 3
Word arrangement into phrases 2 2
Verbal response to questions 2 3
Clear word articulation 2 3
Total Score 10 14
Percentage 40% 56%
Improvement 16%

The comparative assessment revealed a 16% improvement in overall speaking ability when
using animated video media compared to traditional image media. While the improvement was
modest, it represents meaningful enhancement across four of the five assessed indicators, with
phrase construction remaining unchanged between conditions.
Comprehensive evaluation incorporated multiple data sources including expert validation scores,
teacher response ratings, and direct child outcome measurements. The triangulated assessment
confirmed media effectiveness while identifying areas for potential enhancement in future iterations.
The evaluation phase validated the ADDIE model approach for developing specialized educational
media for children with speech delay, demonstrating systematic methodology effectiveness for
addressing specific learning needs.

Discussion
The findings of this study provide evidence supporting the effectiveness of contextually-based
animated video media for improving speaking abilities in children with speech delay. The
comprehensive evaluation approach, incorporating expert validation, teacher assessment, and direct

2695
Journal of Innovation and Research in Primary Education | 4(4), 2025 | 2689-2701

child outcome measurement, offers multiple perspectives on the intervention's value and potential
applications.
The expert validation results align with established principles of multimedia learning theory,
confirming that the animated video media meets professional standards for educational technology
design. The high ratings for technical quality (4.3) and content appropriateness (4.2) support Mayer's
(2001) cognitive theory of multimedia learning, which emphasizes the importance of coordinated
visual and auditory processing channels. The positive evaluation of interactivity features and usability
reflects adherence to Sweller's (1988) cognitive load theory principles, ensuring that the media design
minimizes extraneous cognitive burden while maximizing learning potential.
The animation characteristics observed in this intervention align with theoretical predictions about
effective multimedia design. Animation serves as a dynamic presentation of ideas in visual form,
utilizing symbols, characters, graphics, and sounds to provide audiovisual experiences that differ from
conventional text-based approaches (Castro-Alonso et al., 2014). The fast-changing presentation
characteristic of animation likely assisted in generating mental models for the speech delay child, as
Höffler and Leutner (2007) demonstrated that animation offers external models for mental
representation. The intervention's success in maintaining child attention supports Dalacosta et al.'s
(2009) findings that animation functions effectively as an attention-gaining tool, drawing learners'
focus to educational content.
The 16% improvement in speaking ability observed in this study, while modest, is consistent with
previous research on technology-enhanced language interventions. Habibah and Nafiqoh (2022)
reported similar positive outcomes when implementing animated videos for listening and speaking skill
development in early childhood populations. The improvement pattern across multiple speaking
indicators suggests that animated video media may provide more comprehensive stimulation than
traditional static image approaches. This finding supports Berney and Bétrancourt's (2016) meta-
analysis demonstrating that animation is beneficial for learning compared to static graphics, with
changes over time having more positive effects than static pictures. The ability of animation to help
students understand dynamic changes over time (Hari Narayanan & Hegarty, 2002; Hegarty et al.,
2003; Schnotz & Lowe, 2008) may be particularly relevant for speech development, where temporal
sequencing and phonemic transitions are crucial.
The intervention design reflects principles from the multimedia learning theoretical framework
developed by Adetunji and Levine (2016), which advocates for co-creation of multimedia considering
both technical and educational expertise. The collaborative approach used in this study, involving
content specialists, media experts, and educational practitioners, mirrors the MLTF emphasis on
integrating diverse professional perspectives to create comprehensible educational content. The
production process followed typical animation development steps including storyboard creation, voice-
over development, and motion graphics (Glebas, 2008), ensuring technical quality while maintaining
educational focus.
The contextual learning component of the intervention appears to have contributed to its
effectiveness, as evidenced by the child's positive engagement with daily life scenarios presented in
the animated content. This finding supports Davtyan's (2014) assertion that contextual learning
facilitates meaningful pattern creation through real-life content connections. The integration of familiar
activities and vocabulary likely reduced cognitive load while enhancing relevance and motivation,
consistent with Hull's (1995) emphasis on learning within experiential reference frames. The use of
everyday scenarios in the animation content may have facilitated the visualization of concrete
concepts, supporting Ayres et al.'s (2009) and Schwan and Riempp's (2004) research on animation's
effectiveness in concept visualization.
The underlying speech processing mechanisms targeted by this intervention reflect current
understanding of phonological development in children with speech delay. The development of spoken
language relies on intact phonological systems, with phonological deficits manifesting as difficulties in
accurately producing speech sounds (Anthony et al., 2011; Rvachew, 2007). The animated video's

2696
Journal of Innovation and Research in Primary Education | 4(4), 2025 | 2689-2701

focus on clear articulation models and syllable segmentation may have addressed poorly specified
phonological representations that characterize children with speech delay (Edwards et al., 1999;
Rvachew et al., 2003; Sutherland & Gillon, 2005). The intervention's audiovisual design potentially
provided multiple redundant cues for phoneme identification, addressing the acoustic redundancy
principles that play important roles in successful speech perception (Ziegler et al., 2009).
The speech perception challenges addressed by this intervention are particularly relevant given
research indicating that children with speech delay exhibit difficulties perceiving speech sounds they
produce incorrectly (Broen et al., 1983; Hoffman et al., 1985; Rvachew & Jamieson, 1989). The
animated video's emphasis on clear acoustic-phonetic features may have supported the establishment
of well-specified phonological representations in the child's mental lexicon, addressing Fowler's (1991)
concerns about deficits in encoding and processing acoustic-phonetic features. The intervention's
success suggests that providing enhanced acoustic clarity through animation may help children with
underspecified phonological representations "fill in" gaps for successful speech perception (Lewis &
Carrell, 2007).
The exceptionally positive teacher response (95.8%) suggests strong professional acceptance
and perceived practical value of the intervention. This finding is particularly significant given Roberts
and Kaiser's (2011) meta-analysis indicating that intervention effectiveness often depends on
implementation quality and stakeholder buy-in. The high teacher endorsement suggests potential for
successful scaling and adoption in educational settings, addressing Law et al.'s (2003) concerns about
intervention sustainability and real-world applicability. The intervention addresses a critical educational
need, as Wilson (1997) and EDK (2010) noted that speaking skills promotion is often neglected
compared to reading and writing skills development.
The intervention's focus on speaking competence development aligns with growing recognition
that public speaking represents a core educational requirement (van Ginkel et al., 2015). While this
study targeted younger children with speech delay, the foundational skills addressed parallel those
needed for later academic success, where speaking competence serves as a tool for learning and
knowledge assessment across curriculum areas (Lee et al., 2013). The early intervention approach
responds to Hunt et al.'s (2014) calls for developing speaking interventions for elementary-age
children, though the current study's single-subject design addresses De Grez and Valcke's (2010)
concerns about methodological limitations in younger student intervention research.
However, several limitations must be acknowledged when interpreting these results. The single-
subject design limits generalizability and prevents statistical significance testing. The modest
improvement magnitude (16%) raises questions about clinical significance and long-term impact.
Cirrin et al. (2008) emphasized that language intervention research requires larger sample sizes and
longer follow-up periods to establish robust evidence for practice recommendations. Additionally, the
absence of a control group prevents definitive attribution of improvements to the specific intervention
components rather than general exposure or developmental maturation.
The study's findings contribute to the growing evidence base supporting technology-enhanced
interventions for speech and language disorders. The integration of animation principles with
contextual learning represents an innovative approach that warrants further investigation with larger,
more diverse populations. The intervention's success in stimulating active student involvement
supports Perales-Palacios and Vílchez-González's (2002, 2005) findings about animation's capacity to
engage learners actively, while the multimedia approach addresses So et al.'s (2019) observations
that students enjoy learning with graphics, animations, and interactive elements.
Future research should address the current study's limitations through randomized controlled
trials with adequate sample sizes, longer intervention periods, and extended follow-up assessments.
Investigation of optimal dosage, individual response predictors, and cost-effectiveness would enhance
the evidence base for clinical and educational decision-making. The development of standardized
outcome measures specifically designed for technology-enhanced speech interventions would facilitate
cross-study comparisons and meta-analytic synthesis. Research should also examine whether the

2697
Journal of Innovation and Research in Primary Education | 4(4), 2025 | 2689-2701

intervention's benefits extend to children with different types of speech delay and investigate the
optimal balance between animation complexity and cognitive load management for various
developmental levels.

CONCLUSION
This study successfully developed and validated a contextually-based animated video media
intervention for children with speech delay using the ADDIE development model. The research
contributes to the field by demonstrating the integration of multimedia learning theory and contextual
learning principles in creating specialized educational technology for speech-language intervention.
Expert validation confirmed high technical quality (4.3/5.0) and content appropriateness (4.2/5.0),
while teacher evaluation yielded exceptionally positive reception (95.8%), indicating strong
professional endorsement for practical implementation.
The intervention achieved a modest but meaningful 16% improvement in speaking ability
compared to traditional image-based approaches, with enhancements observed across four of five
assessed indicators including articulation clarity, word production, and verbal responsiveness. These
findings support the theoretical foundation that animation's dynamic visual presentation, combined
with contextual daily-life scenarios, can effectively engage children with speech delay while providing
necessary acoustic and visual cues for phonological development.
The study's primary contribution lies in establishing a systematic methodology for developing
theory-driven multimedia interventions specifically targeting speech delay populations. The integration
of cognitive load theory principles with animation design demonstrates practical application of
educational technology research to address specialized learning needs. Additionally, the positive
stakeholder reception suggests strong potential for real-world implementation and scaling in
educational and therapeutic settings.
However, significant limitations constrain the generalizability of these findings. The single-subject
design prevents statistical analysis and limits external validity. The modest improvement magnitude
raises questions about clinical significance, while the absence of control groups complicates causal
attribution. The short intervention duration and lack of follow-up assessment preclude conclusions
about long-term effectiveness and retention of learned skills.
Future research should prioritize randomized controlled trials with larger sample sizes, extended
intervention periods, and longitudinal follow-up assessments to establish robust evidence for clinical
practice. Investigation of optimal dosage parameters, individual response predictors, and cost-
effectiveness analysis would inform implementation decisions. Development of standardized outcome
measures specifically designed for technology-enhanced speech interventions would facilitate cross-
study comparisons and meta-analytic synthesis. Research should also examine intervention
effectiveness across different types of speech delay and investigate optimal balance between
animation complexity and cognitive load management for various developmental levels, ultimately
advancing evidence-based practice in speech-language pathology and early childhood education.

REFERENCES
Adetunji, O., & Levine, R. (2016). Toward a new model of science learning, teaching, and
communication. MRS Advances, 1(56), 3709–3714. [Link]
Alfarida, D., & Dafit, F. (2024). Pengembangan media pembelajaran video animasi untuk kemampuan
berbicara siswa pada materi dongeng kelas I SDN 141 Pekanbaru. Didaktik: Jurnal Ilmiah
PGSD STKIP Subang, 10(1), 2315-2322. [Link]
Alya, N., Azia, A. N., Hidayah, U. N., Alisia, N., Kartika, N. A., & Latief, F. (2025). Pemanfaatan video
animasi edukatif dalam stimulasi kemampuan bahasa anak usia dini. Jambura Early Childhood
Education Journal, 7(1), 39-52. [Link]

2698
Journal of Innovation and Research in Primary Education | 4(4), 2025 | 2689-2701

Amah, N. R., Kartika, W. I., & Heriansyah, M. (2025). Peningkatan kemampuan berbicara melalui video
animasi pada anak usia 4-5 tahun di TK Tunas Rimba 1 Samarinda. Early Childhood Journal,
6(1), 48-53. [Link]
Aminah, M. (2023). Pengembangan media video animasi pembelajaran untuk meningkatkan
kemampuan berbahasa anak usia dini. Indonesian Journal of Elementary School, 3(2), 236-
242. [Link]
Anthony, J. L., Aghara, R. G., Dunkelberger, M. J., Anthony, T. I., Williams, J. M., & Zhang, Z. (2011).
What factors place children with speech sound disorders at risk for reading problems?
American Journal of Speech-Language Pathology, 20, 146-160. [Link]
0360(2011/10-0053)
Astini, B. N., Fahruddin, F., & Nurhasanah, N. (2020). Pemanfaatan film animasi dalam meningkatkan
kemampuan berbicara anak usia 5-6 tahun. Indonesian Journal of Elementary and Childhood
Education, 1(4), 146-154. . [Link]
[Link]/[Link]/ijece/article/view/270
Ayres, P., Marcus, N., Chan, C., & Qian, N. (2009). Learning hand manipulative tasks: When
instructional animations are superior to equivalent static representations. Computers &
Education, 52(2), 348-356. [Link]
Berney, S., & Bétrancourt, M. (2016). Does animation enhance learning? A meta-analysis. Computers &
Education, 101, 150–167. [Link]
Broen, P. A., Strange, W., Doyle, S. S., & Heller, J. H. (1983). Perception and production of
approximant consonants by normal and articulation-delayed preschool children. Journal of
Speech and Hearing Research, 26, 601-608. [Link]
Castro-Alonso, J. C., Ayres, P., & Paas, F. (2014). Dynamic visualisations and motor skills. In W. Huang
(Ed.), Handbook of human centric visualization (pp. 551–580). Springer.
[Link]
Chandler, P., & Sweller, J. (1992). The split-attention effect as a factor in the design of instruction.
British Journal of Educational Psychology, 62(2), 233-246. [Link]
8279.1992.tb01017.x
Cirrin, F. M., Schooling, T. L., Nelson, N. W., Diehl, S. F., Flynn, P. F., Staskowski, M., ... & Adamczyk,
D. F. (2008). Evidence-based systematic review: Effects of different service delivery models
on communication outcomes for elementary school–age children. Language, Speech, and
Hearing Services in Schools, 38(4), 329-341. [Link]
0128)
Dalacosta, K., Kamariotaki-Paparrigopoulou, M., Palyvos, J., & Spyrellis, N. (2009). Multimedia
application with animated cartoons for teaching science in elementary education. Computers
& Education, 52(4), 741–748. [Link]
Davtyan, R. (2014). Contextual learning. In 2014 ASEE Zone 1 Conference.
De Grez, L., & Valcke, M. (2010). Learning and instruction of oral presentation skills. In L. E. Kattington
(Ed.), Education in a competitive and globalizing world. Handbook of curriculum development
(pp. 177–208). Nova Science.
Edwards, J., Fourakis, M., Beckman, M. E., & Fox, R. A. (1999). Characterizing knowledge deficits in
phonological disorders. Journal of Speech, Language, and Hearing Research, 42, 169-186.
[Link]
EDK (2010). SCHULSPRACHE: Wissenschaftlicher Kurzbericht und Kompetenzmodell: Provisorische
Fassung (vor Verabschiedung der Standards). [School language: Scientific short report and
competence model].
Fowler, A. (1991). How early phonological development might set the stage for phoneme awareness.
Haskins Laboratories Status Report on Speech Research, 105/106, 53-64.
[Link]
development-might-set-stage-phoneme-awareness-anne-fowler

2699
Journal of Innovation and Research in Primary Education | 4(4), 2025 | 2689-2701

Glebas, F. (2008). Directing the story: Professional storytelling and storyboarding techniques for live
action and animation. Focal Press.
Habibah, I., & Nafiqoh, H. (2022). Pemanfaatan video animasi dalam meningkatkan kemampuan
menyimak dan berbicara pada anak usia dini. CERIA (Cerdas Energik Responsif Inovatif
Adaptif), 5(2), 159. [Link]
Hari Narayanan, N., & Hegarty, M. (2002). Multimedia design for communication of dynamic
information. International Journal of Human-Computer Studies, 57(4), 279–315.
[Link]
Hegarty, M., Kriz, S., & Cate, C. (2003). The roles of mental animations and external animations in
understanding mechanical systems. Cognition and Instruction, 21(4), 209–249.
[Link]
Hernawati, E., Prihatin, Y., & Sudibyo, H. (2024). Efektivitas metode story telling bermedia video
dongeng animasi dalam meningkatkan keterampilan berbicara siswa sekolah dasar. Journal of
Education Research, 5(4), 6519-6525. [Link]
Hoffman, P. R., Stager, S., & Daniloff, R. G. (1983). Perception and production of misarticulated /r/.
Journal of Speech and Hearing Disorders, 48, 210-215.
[Link]
Hoffman, P. R., Daniloff, R. G., Bengoa, D., & Schuckers, G. (1985). Misarticulating and normally
articulating children's identification and discrimination of synthetic [r] and [w]. Journal of
Speech and Hearing Disorders, 50, 46-53. [Link]
Höffler, T. N., & Leutner, D. (2007). Instructional animation versus static pictures: A meta-analysis.
Learning and Instruction, 17(6), 722–738. [Link]
Hull, D. (1995). Who are you calling stupid? The revolution that's changing education. CORD
Communications.
Hunt, S., Wright, A., & Simonds, C. (2014). Securing the future of communication education: Advancing
an advocacy and research agenda for the 21st century. Communication Education, 63(4),
449–461. [Link]
Law, J., Garrett, Z., & Nye, C. (2003). Speech and language therapy interventions for children with
primary speech and language delay or disorder. Cochrane Database of Systematic Reviews,
3. [Link]
Lee, O., Quinn, H., & Valdes, G. (2013). Science and language for English language learners in relation
to next generation science standards and with implications for Common Core State Standards
for English language arts and mathematics. Educational Researcher, 42(4), 223–233.
[Link]
Lewis, D. E., & Carrell, T. D. (2007). The effect of amplitude modulation on intelligibility of time-varying
sinusoidal speech in children and adults. Perception and Psychophysics, 69(7), 1140-1151.
[Link]
Maulina, I. (2025). Efektivitas digital story telling melalui video animasi untuk meningkatkan
kemampuan berbicara anak. Jurnal Dunia Pendidikan, 5(5), 1752-1761.
[Link]
Mayer, R. E. (2001). Multimedia learning. Cambridge University Press.
Perales-Palacios, F. J., & Vílchez-González, J. M. (2002). Teaching physics by means of cartoons: A
qualitative study in secondary education. Physics Education, 37(5), 400–406.
[Link]
Perales-Palacios, F. J., & Vílchez-González, J. M. (2005). The teaching of physics and cartoons: Can
they be interrelated in secondary education? International Journal of Science Education,
27(14), 1647–1670. [Link]

2700
Journal of Innovation and Research in Primary Education | 4(4), 2025 | 2689-2701

Roberts, M. Y., & Kaiser, A. P. (2011). The effectiveness of parent-implemented language interventions:
A meta-analysis. American Journal of Speech-Language Pathology, 20(3), 180-199.
[Link]
Rvachew, S. (2007). Phonological processing and reading in children with speech sound disorders.
American Journal of Speech-Language Pathology, 16, 260-270. [Link]
0360(2007/030)
Rvachew, S., & Jamieson, D. G. (1989). Perception of voiceless fricatives by children with a functional
articulation disorder. Journal of Speech and Hearing Disorders, 54, 193-208.
[Link]
Rvachew, S., Ohberg, A., Grawburg, M., & Heyding, J. (2003). Phonological awareness and phonemic
perception in 4-year-old children with delayed expressive phonology skills. American Journal
of Speech-Language Pathology, 12(4), 463. [Link]
Schnotz, W., & Lowe, R. K. (2008). A unified view of learning from animated and static graphics. In R.
Lowe & W. Schnotz (Eds.), Learning with animation: Research implications for design (pp.
304–356). Cambridge University Press.
Schwan, S., & Riempp, R. (2004). The cognitive benefits of interactive videos: Learning to tie nautical
knots. Learning and Instruction, 14(3), 293–305.
[Link]
So, W. W. M., Chen, Y., & Wan, Z. H. (2019). Multimedia e-learning and self-regulated science
learning: A study of primary school learners' experiences and perceptions. Journal of Science
Education and Technology, 28(5), 508–522. [Link]
Sutherland, D., & Gillon, G. T. (2005). Assessment of phonological representations in children with
speech impairment. Language, Speech, and Hearing Services in the Schools, 36, 294-307.
[Link]
Suryawati, E., & Osman, K. (2017). Contextual learning: Innovative approach towards the development
of students' scientific attitude and natural science performance. Eurasia Journal of
Mathematics, Science and Technology Education, 14(1), 61-76.
[Link]
Sweller, J. (1988). Cognitive load during problem solving: Effects on learning. Cognitive Science, 12(2),
257-285. [Link]
van Ginkel, S., Gulikers, J., Biemans, H., & Mulder, M. (2015). Towards a set of design principles for
developing oral presentation competence: A synthesis of research in higher education.
Educational Research Review, 14, 62–80. [Link]
Vygotsky, L. S. (1978). Mind in society: The development of higher psychological processes. Harvard
University Press.
Wilson, J. A. (1997). A program to develop the listening and speaking skills of children in a first grade
classroom [Unpublished doctoral dissertation]. University of Virginia.
Ziegler, J. C., Pech-Georgel, C., George, F., & Lorenzi, C. (2009). Speech-perception-in-noise deficits in
dyslexia. Developmental Science, 12(5), 732-745. [Link]
7687.2009.00817.x

2701

You might also like