Introducing Transvisio: A customizable AI-powered subtitling tool
Main Article Content
Abstract
The growing demand for multilingual audiovisual (AV) content boosts expectations about efficient and diverse subtitling solutions accordingly. Traditional subtitling methods lack much-needed flexibility for catering to different audiences and various types of content. Therefore, this paper introduces TransVisio, an AI-driven tool that provides customized subtitling options with the use of Large Language Models (LLMs). It differs from other subtitle generators in that TransVisio allows users to customize subtitles to the content genre, target audience, language register, and even cultural nuances, to mention a few. Targeting professional translators, educators, and students, the tool combines artificial intelligence (AI) functions of the highest level with a user-centered interface in support of the subtitling process. Our paper describes the development and features of TransVisio and outlines some practical implications, ethical considerations, and future prospects to offer an overview of its contributions to audiovisual translation (AVT), AI language processing, and translation education.
Downloads
Article Details
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
This open-access article is distributed under a Creative Commons Attribution (CC-BY) 4.0 license.
You are free to: Share — copy and redistribute the material in any medium or format. Adapt — remix, transform, and build upon the material for any purpose, even commercially. The licensor cannot revoke these freedoms as long as you follow the license terms.
Under the following terms: Attribution — You must give appropriate credit, provide a link to the license, and indicate if changes were made. You may do so in any reasonable manner, but not in any way that suggests the licensor endorses you or your use.
No additional restrictions You may not apply legal terms or technological measures that legally restrict others from doing anything the license permits.
How to Cite
Share
References
Abu-Rayyash, H., & Al-Ramadan, A. (2024). SubGPT: An Automated Tool for Subtitle-Template Translation. International Journal of Linguistics, Literature and Translation, 7(6), 126–134. https://doi.org/10.32996/ijllt.2024.7.6.14
Akasheh, W. M., Haider, A. S., Al-Saideen, B., & Sahari, Y. (2024). Artificial intelligence-generated Arabic subtitles: insights from Veed. io’s automatic speech recognition system of Jordanian Arabic. Texto Livre, 17, e46952-e46952. doi:10.1590/1983-3652.2024.46952.
Akinrinola, O., Okoye, C. C., Ofodile, O. C., & Ugochukwu, C. E. (2024). Navigating and reviewing ethical dilemmas in AI development: Strategies for transparency, fairness, and accountability. GSC Advanced Research and Reviews, 18(3), 050–058.
Al-Abbas, L. S., & Haider, A. S. (2021). Using Modern Standard Arabic in subtitling Egyptian Comedy Movies for the Deaf/Hard of Hearing. Cogent Arts & Humanities, 8(1), 1-16. doi:10.1080/23311983.2021.1993597
Al-Salman, S., & Haider, A. S. (2024). Assessing the accuracy of MT and AI tools in translating humanities or social sciences Arabic research titles into English: Evidence from Google Translate, Gemini, and ChatGPT International Journal of Data and Network Science, 8(4), 2483-2498. doi:https://doi.org/10.5267/j.ijdns.2024.5.009
AlBkowr, N., & Haider, A. S. (2023). Subtitling for people with hearing impairments in the Arab world context: The case of the Blue Elephant 2 movie. Online Journal of Communication and Media Technologies, 13(4), e202347. doi:https://doi.org/10.30935/ojcmt/13602
Ali, D. H. A. (2024). Artificial Intelligence In Translation Studies: Benefits, Challenges, And Future Directions. مجلة القراءة والمعرفة, 24(275). https://mrk.journals.ekb.eg/article_381236.html
Álvarez, A., Arzelus, H., & Etchegoyhen, T. (2014). Towards customized automatic segmentation of subtitles. Paper presented at the Advances in Speech and Language Technologies for Iberian Languages: Second International Conference, IberSPEECH 2014, Las Palmas de Gran Canaria, Spain, November 19-21, 2014. Proceedings.
Amin, A. A., Glasser, A., Kushalnagar, R., Vogler, C., & Huenerfauth, M. (2021). Preferences of Deaf or Hard of Hearing Users for Live-TV Caption Appearance. In M. Antona & C. Stephanidis (Eds.), Universal Access in Human-Computer Interaction. Access to Media, Learning and Assistive Environments (Vol. 12769, pp. 189–201). Springer International Publishing. https://doi.org/10.1007/978-3-030-78095-1_15
Amini, M., Ravindran, L., & Lee, K.-F. (2024). Implications of using AI in Translation Studies: Trends, Challenges, and Future Direction. Asian Journal of Research in Education and Social Sciences, 6(1), 740–754.
Arroyo Chavez, M., Thompson, B., Feanny, M., Alabi, K., Kim, M., Ming, L., Glasser, A., Kushalnagar, R., & Vogler, C. (2024). Customization of Closed Captions via Large Language Models. In K. Miesenberger, P. Peňáz, & M. Kobayashi (Eds.), Computers Helping People with Special Needs (Vol. 14751, pp. 50–58). Springer Nature Switzerland. https://doi.org/10.1007/978-3-031-62849-8_7
Brey, P. A. (2012). Anticipatory ethics for emerging technologies. NanoEthics, 6(1), 1–13.
Briva-Iglesias, V., O’Brien, S., & Cowan, B. R. (2023). The impact of traditional and interactive post-editing on machine translation user experience, quality, and productivity. Translation, Cognition & Behavior, 6(1), 60-86.
Çetiner, C., & İşisağ, K. (2019). Undergraduate level translation students’ attitudes towards machine translation post-editing training. International Journal of Languages’ Education and Teaching, 7(1), 110-120.
Dankers, V., Lucas, C. G., & Titov, I. (2022). Can Transformer be Too Compositional? Analysing Idiom Processing in Neural Machine Translation (arXiv:2205.15301). arXiv. http://arxiv.org/abs/2205.15301
Du, J., & Lu, J. (2024). A Comparative Study on the Translation Quality between Human and Machine-Generated Subtitles. Paper presented at the 2024 6th International Conference on Natural Language Processing (ICNLP).
Farghal, M., & Haider, A. S. (2024). Translating classical Arabic verse: human translation vs. AI large language models (Gemini and ChatGPT). Cogent Social Sciences, 10(1), 2410998. doi:https://doi.org/10.1080/23311886.2024.2410998
Floridi, L. (2024). The Ethics of Artificial Intelligence: Exacerbated problems, renewed problems, unprecedented problems-Introduction to the Special Issue of the American Philosophical Quarterly dedicated to The Ethics of AI. Available at SSRN. https://papers.ssrn.com/sol3/papers.cfm?abstract_id=4801799
Gorman, B. M., Crabb, M., & Armstrong, M. (2021). Adaptive Subtitles: Preferences and Trade-Offs in Real-Time Media Adaption. Proceedings of the 2021 CHI Conference on Human Factors in Computing Systems, 1–11. https://doi.org/10.1145/3411764.3445509
Huriye, A. Z. (2023). The ethics of artificial intelligence: Examining the ethical considerations surrounding the development and use of AI. American Journal of Technology, 2(1), 37–44.
Jia, Y., & Sun, S. (2023). Man or machine? Comparing the difficulty of human translation versus neural machine translation post-editing. Perspectives, 31(5), 950-968.
Malakul, S., & Park, I. (2023). The effects of using an auto-subtitle system in educational videos to facilitate learning for secondary school students: Learning comprehension, cognitive load, and satisfaction. Smart Learning Environments, 10(1), 4. https://doi.org/10.1186/s40561-023-00224-2
Mensah, G. B. (2023). Artificial intelligence and ethics: A comprehensive review of bias mitigation, transparency, and accountability in AI Systems. Preprint, November, 10. https://www.researchgate.net/profile/George-Benneh-Mensah/publication/375744287_Artificial_Intelligence_and_Ethics_A_Comprehensive_Review_of_Bias_Mitigation_Transparency_and_Accountability_in_AI_Systems/links/656c8e46b86a1d521b2e2a16/Artificial-Intelligence-and-Ethics-A-Comprehensive-Review-of-Bias-Mitigation-Transparency-and-Accountability-in-AI-Systems.pdf
Mihalache, I. (2024). Ethical Analysis of AI-Enhanced Audiovisual Translation Tools. 2024 IEEE 48th Annual Computers, Software, and Applications Conference (COMPSAC), 1558–1559. https://ieeexplore.ieee.org/abstract/document/10633381/
Perone, S. (2023). Matesub: The Translated Subtitling Tool at the IWSLT2023 Subtitling task. Proceedings of the 20th International Conference on Spoken Language Translation (IWSLT 2023), 461–464. https://aclanthology.org/2023.iwslt-1.45/
Saed, H., Haider, A. S., & Tair, S. A. (2023). Gender, sexual, ethnic, color and Disability-related Epithets and labels across languages: Evidence from Arabic subtitling of English Movies and series. Eurasian Journal of Applied Linguistics, 9(1), 213-225. doi:http://dx.doi.org/10.32601/ejal.901018
Samha, F., Haider, A. S., & Hussein, R. F. (2023). Address forms in Egyptian vernacular and their English equivalence: A translation-oriented study. Ampersand, 10, 1-12. doi:https://doi.org/10.1016/j.amper.2023.100117
Seyidov, R. (2024). Importance and Role of Artificial Intelligence for Arabic Translators. International Journal for Arabic Linguistics & Literature Studies (JALLS), 6(2). https://www.refaad.com/Files/JALLS/JALS-6-2-4.pdf
Shuhaiber, R., & Haider, A. S. (2023). Strategies of Subtitling Egyptian Idiomatic Expressions and Proverbs into English. Language Value, 16(2), 60-99. doi:https://doi.org/10.58256/sptxps94
Soe, T. H., Guribye, F., & Slavkovik, M. (2021). Evaluating AI assisted subtitling. ACM International Conference on Interactive Media Experiences, 96–107. https://doi.org/10.1145/3452918.3458792
Soudi, A., Hannani, M., Van Laerhoven, K., & Avramidis, E. (2024). Exploring the Potential of Large Language Models in Adaptive Machine Translation for Generic Text and Subtitles. Proceedings of the 17th Workshop on Building and Using Comparable Corpora (BUCC) @ LREC-COLING 2024, 51–58.
Tardel, A. (2023). A proposed workflow model for researching production processes in subtitling. Trans-Kom, 16(1), 140–173.
Yamada, M. (2023). Optimizing Machine Translation through Prompt Engineering: An Investigation into ChatGPT’s Customizability. arXiv preprint arXiv:2308.01391.