This study aims to analyze the quality of machine-translated subtitles and to propose post-editing guidelines for improving automatically generated subtitles. The research focuses on the Korean machine-translated subtitles of the Chinese drama <!--...
This study aims to analyze the quality of machine-translated subtitles and to propose post-editing guidelines for improving automatically generated subtitles. The research focuses on the Korean machine-translated subtitles of the Chinese drama <!-- Not Allowed Tag Filtered --><Go Ahead> provided on YouTube, which were created not by human translators but through the platform’s neural machine translation system. A total of 462 Chinese subtitles and their corresponding Korean machine-translated subtitles were collected and compiled as the dataset.
Based on Pedersen’s FAR model, the subtitles were evaluated across three dimensions, which are equivalence, acceptability, and readability. To ensure the reliability of the evaluation, two evaluators independently conducted assessments according to the same criteria, and any discrepancies were resolved through discussion before determining the final error categories and penalty scores.
The results show that stylistic errors and semantic errors accounted for the highest proportion among all error types. Stylistic errors frequently involved inappropriate honorific usage, unsuitable terms of address, and excessive use of written or formal expressions in contexts requiring colloquial forms. Semantic errors commonly included missing contextual logic, distorted pragmatic meaning, or incomplete representation of speaker intention. In contrast, formal errors such as grammatical errors, punctuation errors, and segmentation or spotting issues appeared less frequently, indicating that contemporary neural machine translation systems have achieved a certain level of linguistic accuracy. Nevertheless, significant limitations remain in capturing contextual nuances and reproducing natural spoken language features.
Drawing on these findings, the study proposes post-editing guidelines across five dimensions: semantic accuracy, stylistic appropriateness, lexical and idiomatic usage, subtitle formatting, and syntactic or grammatical correctness. These guidelines are expected to enhance the quality of automatically generated subtitles and to support the development of hybrid human–machine translation workflows.
By examining subtitles produced in real viewing environments, this study offers practical insights into the strengths and limitations of machine-translated subtitles. The results may serve as foundational data for future research on improving Chinese–Korean machine subtitle translation and for advancing post-editing practices in audiovisual translation.