Projekt SWOW-SL

Gradnja zbirke prostih asociacij za slovenščino

Avtorji

  • Špela Vintar Univerza v Ljubljani, Filozofska fakulteta
  • Mojca Brglez Univerza v Ljubljani, Filozofska fakulteta
  • Ela Novak
  • Simon De Deyne Univerza v Melbournu, Avstralija

DOI:

https://doi.org/10.4312/slo2.0.2025.1.104-119

Ključne besede:

proste asociacije, SWOW, Mali svet besed, kampanja množičenja

Povzetek

V prispevku opisujemo projekt gradnje prve obsežnejše zbirke prostih asociacij za slovenščino. Podatkovna zbirka SWOW-SL je nastala v sklopu krovnega projekta Small World of Words, pri katerem se asociacije zbirajo za številne jezike in ki predstavlja metodološki okvir za našo raziskavo. Najprej opišemo postopek izbire iztočnic in prilagajanja spletnega eksperimenta za slovenščino, nato pa podrobneje predstavimo kampanjo zbiranja podatkov, pri kateri smo udeležence nagovarjali prek družabnih omrežij in s tiskanimi oglasi v obliki plakatov in nalepk. Uspešnost posameznih oglasnih strategij smo skrbno spremljali, zato lahko izkušnje iz projekta služijo tudi kot dragoceno izhodišče za morebitne sorodne množičenjske kampanje. V projektu smo uspeli zbrati asociacije za 1.000 slovenskih iztočnic v skupnem obsegu 20.000 odzivov. Zbirka podatkov je objavljena na repozitoriju Clarin.si in ponuja številne možnosti za nadaljnje raziskave s področij kognitivnega jezikoslovja, računalniške obdelave naravnega jezika in sorodnih disciplin.

Prenosi

Podatki o prenosih še niso na voljo.

Literatura

Brglez, M., Vintar, Š., & Žagar, A. (2024). How Human-Like are Word Associations in Generative Models? An Experiment in Slovene. In M. Zock, E. Chersoni, Y.-Y. Hsu & S. de Deyne (Eds.), Proceedings of the Workshop on Cognitive Aspects of the Lexicon, LREC-COLING 2024 (pp. 42–48). Retrieved from https://aclanthology.org/2024.cogalex-1.5.pdf

Cabana, Á., Zugarramurdi, C., Valle-Lisboa, J. C., & De Deyne, S. (2024). The „small world of words“ free association norms for Rioplatense Spanish. Behavior Research Methods, 56(2), 968–985.

Clark, H. H. (1970). Word associations and linguistic theory. New horizons in linguistics, 1, 271–286.

De Deyne, S., Navarro, D. J., Perfors, A., Brysbaert, M., & Storms, G. (2019). The “Small World of Words” English word association norms for over 12,000 cue words. Behavior research methods, 51, 987–1006. doi: 10.3758/s13428-018-1115-7

De Deyne, S., Navarro, D. J., & Storms, G. (2013). Better explanations of lexical and semantic cognition using networks derived from continued rather than single-word associations. Behavior Research, 45, 480–498. doi: 10.3758/s13428-012-0260-7

Deese, J. (1959). Influence of inter-item associative strength upon immediate free recall. Psychological Reports, 5(3), 305–312.

Freud, S. (1913). On the beginning of treatment. Standard Edition of Complete Works, Vol. XII. London: Hogarth Press.

Galton, F. (1879). Psychometric experiments. Brain, 2(2), 149–162.

Gordon, J., & Van Durme, B. (2013). Reporting bias and knowledge acquisition. In F. M. Suchanek, S. Riedel, S. Singh, & P. Pratim Talukdar (Eds.), Proceedings of the 2013 workshop on Automated knowledge base construction (pp. 25–30). doi: 10.1145/2509558

Günther, F., Rinaldi, L., & Marelli, M. (2019). Vector-space models of semantic representation from a cognitive perspective: A discussion of common misconceptions. Perspectives on Psychological Science, 14, 1006–1033.

Kiss, G., Armstrong, C., Milroy, R., & Piper, J. (1973). An associative thesaurus of English and its computer Analysis. In A. Aitken, R. Bailey & N. Hamilton-Smith (Eds.), The Computer and Literacy Studies (pp. 153–165). Edinburgh: University Press.

Kosem, I., & Dobrovoljc, K. (2020). Gigafida 2.0: The Reference Corpus of Written Standard Slovene. In N. Calzolari et al. (Eds.), Proceedings of the Twelfth Language Resources and Evaluation Conference, Marseille, France (pp. 3340–3345). European Language Resources Association.

Krek, S., Arhar Holdt, Š, Erjavec, T., Čibej, J., Repar, A., Gantar, P., Ljubešić, N.,

Li, B., Ding, Z., De Deyne, S., & Cai, Q. (2024). A large-scale database of Mandarin Chinese word associations from the Small World of Words project. Under review.

Ljubešić, N., Terčon, L., & Dobrovoljc, K. (2024). CLASSLA-Stanza: The Next Step for Linguistic Processing of South Slavic Languages. Proceedings of the Conference on Language Technologies & Digital Humanities (JTDH 2024), 251–274. doi: 10.5281/zenodo.13936406

Mandera, P., Keuleers, B., & Brysbaert, M. (2017). Explaining human performance in psycholinguistic tasks with models of semantic similarity based on prediction and counting: A review and empirical validation. Journal of Memory and Language, 92, 57–78.

Nelson, D. L., McEvoy, C. L., & Schreiber, T. A. (2004). The University of South Florida free association, rhyme, and word fragment norms. Behavior Research Methods, Instruments & Computers, 36, 402–407. doi: 10.3758/BF03195588

Nelson, D., McEvoy, C. L., & Dennis, S. (2012). What is free association and what does it measure? Memory and Cognition, 28, 887–899. doi: 10.3758/bf03209337

Nematzadeh, A., Meylan, S. C., & Griffiths, T. L. (2017). Evaluating vector-space models of word representation, or the unreasonable effectiveness of counting words near other words. In CogSci. Retrieved from https://cocosci.princeton.edu/papers/nematzadeh_etal_17_cogsci_reps.pdf

Objavljeno

30. 05. 2025

Številka

Rubrika

Razprave

Kako citirati

Vintar, Špela, Brglez, M., Novak, E., & De Deyne, S. (2025). Projekt SWOW-SL: Gradnja zbirke prostih asociacij za slovenščino. Slovenščina 2.0: Empirične, Aplikativne in Interdisciplinarne Raziskave, 13(1), 104-119. https://doi.org/10.4312/slo2.0.2025.1.104-119