THE EMPEROR, HITCHCOCK AND PATRIARCHY: AI-ASSISTED VS. HUMAN GENERATED KEY WORDS EXTRACTED FROM UNPROCESSED ETHNOLOGICAL MATERIAL
Main Article Content
Abstract
Archives of audio, video, and written ethnographic materials gathered through individual researchers' field studies often involve data obscured by improper keyword classification that does not accurately reflect their content. This article investigates how a computer information system may assist a professional in this task. We apply a selection of computational intensive methods for the search and clustering of textual materials from the Digital Archive of Ethnological and Anthropological Resources (DAEAR) at the Institute of Ethnology and Anthropology at the University St. Cyril and Methodius in Skopje and comment on their practical usefulness. The results of the machine extraction are compared with the ones obtained from 10 experts (ethnologists and anthropologists) and ten non-experts
Downloads
Download data is not yet available.
Article Details
How to Cite
Jakimovski, Dragan, and Ilina Jakimovska. 2024. “THE EMPEROR, HITCHCOCK AND PATRIARCHY: AI-ASSISTED VS. HUMAN GENERATED KEY WORDS EXTRACTED FROM UNPROCESSED ETHNOLOGICAL MATERIAL”. EthnoAnthropoZoom/ЕтноАнтропоЗум 24 (24), 179-219. https://doi.org/10.37620/EAZ242424179j.
Issue
Section
Articles / Статии
Copyright (c) 2024 EthnoAnthropoZoom/ЕтноАнтропоЗум
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this journal.
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
---------------------------------------------------------------------------------------------------
Известување за авторски права
Авторите ги задржуваат авторските права и го даваат правото на првото објавување на списанието кое истовремено е лиценцирано Creative Commons Attribution License што на другите им овозможува да го споделат делото со потврда за авторството на делото и првичното објавување во ова списание.
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
References
Bengfort, B., Bilbro, R., and Ojeda, T. (2018). Applied Text Analysis with Python, Enabling Language-Aware Data Products with Machine Learning.
Ehrmanntraut, A., Hagen T., Jannidis F., Konle L., Kroncke M., Winko S. (2022). Modeling and Measuring Short Text Similarities: On the Multi-Dimensional Differences between German Poetry of Realism and Modernism. In: Journal of Computational Literary Studies 1 (1).
O'Reilly, Boston. Hovy, D. (2020). Text Analysis in Python for Social Scientists, Discovery and Exploration. Cambridge University Press, Cambridge.
Ramyadharshni, S. S. and Pabitha, P. (2018). Topic categorization on social network using latent dirichlet allocation. Bonfring International Journal of Software Engineering and Soft Computing, 8(2):16–20.
Srinivasa-Desikan, B. (2018). Natural Language Processing and Computational Linguistics. Packt Publishing, Birmingham. Xu, D. and Tian, Y. (2015). A comprehensive survey of clustering algorithms. Ann. Data. Sci., 2(2):165–193
Ehrmanntraut, A., Hagen T., Jannidis F., Konle L., Kroncke M., Winko S. (2022). Modeling and Measuring Short Text Similarities: On the Multi-Dimensional Differences between German Poetry of Realism and Modernism. In: Journal of Computational Literary Studies 1 (1).
O'Reilly, Boston. Hovy, D. (2020). Text Analysis in Python for Social Scientists, Discovery and Exploration. Cambridge University Press, Cambridge.
Ramyadharshni, S. S. and Pabitha, P. (2018). Topic categorization on social network using latent dirichlet allocation. Bonfring International Journal of Software Engineering and Soft Computing, 8(2):16–20.
Srinivasa-Desikan, B. (2018). Natural Language Processing and Computational Linguistics. Packt Publishing, Birmingham. Xu, D. and Tian, Y. (2015). A comprehensive survey of clustering algorithms. Ann. Data. Sci., 2(2):165–193