THE EMPEROR, HITCHCOCK AND PATRIARCHY: AI-ASSISTED VS. HUMAN GENERATED KEY WORDS EXTRACTED FROM UNPROCESSED ETHNOLOGICAL MATERIAL

Main Article Content

Dragan Jakimovski
https://orcid.org/0000-0003-0786-2941
Ilina Jakimovska
https://orcid.org/0000-0003-1209-1555

Abstract

Archives of audio, video, and written ethnographic materials gathered through individual researchers' field studies often involve data obscured by improper keyword classification that does not accurately reflect their content. This article investigates how a computer information system may assist a professional in this task. We apply a selection of computational intensive methods for the search and clustering of textual materials from the Digital Archive of Ethnological and Anthropological Resources (DAEAR) at the Institute of Ethnology and Anthropology at the University St. Cyril and Methodius in Skopje and comment on their practical usefulness. The results of the machine extraction are compared with the ones obtained from 10 experts (ethnologists and anthropologists) and ten non-experts

Downloads

Download data is not yet available.

Article Details

How to Cite
Jakimovski, Dragan, and Ilina Jakimovska. 2024. “THE EMPEROR, HITCHCOCK AND PATRIARCHY: AI-ASSISTED VS. HUMAN GENERATED KEY WORDS EXTRACTED FROM UNPROCESSED ETHNOLOGICAL MATERIAL”. EthnoAnthropoZoom/ЕтноАнтропоЗум 24 (24), 179-219. https://doi.org/10.37620/EAZ242424179j.
Section
Articles / Статии

References

Bengfort, B., Bilbro, R., and Ojeda, T. (2018). Applied Text Analysis with Python, Enabling Language-Aware Data Products with Machine Learning.

Ehrmanntraut, A., Hagen T., Jannidis F., Konle L., Kroncke M., Winko S. (2022). Modeling and Measuring Short Text Similarities: On the Multi-Dimensional Differences between German Poetry of Realism and Modernism. In: Journal of Computational Literary Studies 1 (1).

O'Reilly, Boston. Hovy, D. (2020). Text Analysis in Python for Social Scientists, Discovery and Exploration. Cambridge University Press, Cambridge.

Ramyadharshni, S. S. and Pabitha, P. (2018). Topic categorization on social network using latent dirichlet allocation. Bonfring International Journal of Software Engineering and Soft Computing, 8(2):16–20.

Srinivasa-Desikan, B. (2018). Natural Language Processing and Computational Linguistics. Packt Publishing, Birmingham. Xu, D. and Tian, Y. (2015). A comprehensive survey of clustering algorithms. Ann. Data. Sci., 2(2):165–193