Publikationen Korpus- und Computerlinguistik
2021
- Jansen, Silke, et al. Demystifying Bilingualism. How Metaphor Guides Research towards Mythification. London: Palgrave Macmillan, Cham, 2021.
2019
- Proisl, Thomas. The cooccurrence of linguistic structures. Erlangen: FAU University Press, 2019.
URL: https://nbn-resolving.org/urn:nbn:de:bvb:29-opus4-111251
2015
- Kabashi, Besim. Automatische Verarbeitung der Morphologie des Albanischen. Erlangen: FAU University Press, 2015.
URL: https://opus4.kobv.de/opus4-fau/files/6859/Dissertation_Besim_Kabashi_OPUS.pdf
2020
- Griebel, Tim, Stephanie Evert, and Philipp Heinrich, eds. Multimodal Approaches to Media Discourses: Reconstructing the Age of Austerity in the United Kingdom. London: Routledge, 2020.
2024
- Adrian, Axel, et al. "AUSLEGUNG DES KI-VO-E ZUR EVALUATION VON VERFAHREN DER KÜNSTLICHEN INTELLIGENZ AM BEISPIEL DER AUTOMATISCHEN ANONYMISIERUNG VON GERICHTSENTSCHEIDUNGEN." Jusletter IT (2024): 215-226.
- Wilkens, Rodrigo, Leonardo Zilio, and Aline Villavicencio. "Assessing linguistic generalisation in language models: a dataset for Brazilian Portuguese." Language Resources and Evaluation 58.1 (2024): 175-201.
- Adrian, Axel, et al. "Auslegung des KI-VO-E zur Evaluation von Symbolischen Deduktionsverfahren der Künstlichen Intelligenz für juristische Anwendungen." Jusletter IT (2024): 85-94.
2023
- Adrian, Axel, et al. "AUTOMATISCHE ANONYMISIERUNG VON GERICHTSURTEILEN – EINE VISION SCHEINT REALISIERBAR." Jusletter IT March (2023): 211-220.
- Patel, Malin, et al. "A reference constructicon as a database." Yearbook of the German Cognitive Linguistics Association 11 (2023): 175-202.
- Malapally, Annette, et al. "Unequal Tweets: Black Disadvantage is (Re)tweeted More but Discussed Less Than White Privilege." Political Communication (2023).
2022
- Adrian, Axel, et al. "Entwicklung und Evaluation automatischer Verfahren zur Anonymisierung von Gerichtsentscheidungen." LegalTech 4 (2022): 233-238.
URL: https://beck-online.beck.de/Bcid/Y-300-Z-LTZ-B-2022-S-233-N-1 - Nesset, Tore, Aleksandr Piperski, and Svetlana Sokolova. "Russian feminitives: what can corpus data tell us?" Russian Linguistics 46.2 (2022): 95-113.
- Peters, Joachim, et al. "Präsentation von Palliativstationen und SAPV-Teams im Internet - eine korpusbasierte Metaanalyse von Webseiten." Zeitschrift für Palliativmedizin 23 (2022): 46-53.
2021
- Adrian, Axel, et al. "Anonymization of court decisions - An essential prerequisite of e-Justice Anonymisierung von gerichtsurteilen – eine wesentliche voraussetzung für E-justice." Jusletter IT May (2021): 137-147.
- Dykes, Nathan, et al. "Argument parsing via corpus queries." it - Information Technology 63 (2021): 31-44.
2020
- Dykes, Nathan, et al. "Reconstructing Arguments from Noisy Text." Datenbank-Spektrum 20 (2020): 123-129.
URL: https://link.springer.com/article/10.1007/s13222-020-00342-y - Peters, Joachim, et al. "Kompetenzdarstellung, Patientennähe und Argumentationsstrategien von Internetangeboten deutscher Hospize, Palliativstationen und SAPV-Teams-eine korpusbasierte Meta-Analyse." Zeitschrift für Palliativmedizin 21.5 (2020): e34.
- Dykes, Nathan, and Joachim Peters. "Reconstructing argumentation patterns in German newspaper articles on multidrug-resistant pathogens: a multi-measure keyword approach." Journal of Corpora and Discourse Studies 3 (2020): 51-74.
URL: https://jcads.cardiffuniversitypress.org/articles/abstract/35/
2019
- Peters, Joachim, et al. "A Linguistic Model of Communication Types in Palliative Medicine: Effects of Multidrug-Resistant Organisms (MDRO) Colonization or Infection and Isolation Measures in End of Life on Family Caregivers’ Knowledge, Attitude and Practices." Journal of Palliative Medicine 22.8 (2019).
URL: https://www.liebertpub.com/doi/pdf/10.1089/jpm.2019.0027 - Evert, Stephanie, et al. "Combining Machine Learning and Semantic Features in the Classification of Corporate Disclosures." Journal of Logic, Language and Information (2019): 309-330.
- Peters, Joachim, et al. "Metaphors for multidrug-resistant bacteria in German newspaper articles, 1995-2015. A computer-assisted qualitative study." Metaphor and the Social World 9.2 (2019): 221-241.
2017
- Büttner, Andreas, et al. "»Delta« in der stilometrischen Autorschaftsattribution." Zeitschrift für digitale Geisteswissenschaften (2017).
URL: http://www.zfdg.de/2017_006 - Evert, Stephanie, et al. "Understanding and explaining Delta measures for authorship attribution." Digital Scholarship in the Humanities 32.suppl_2 (2017): ii4–ii16.
- Schäfer, Fabian, Stephanie Evert, and Philipp Heinrich. "Japan's 2014 General Election: Political Bots, Right-Wing Internet Activism and PM Abe Shinzō’s Hidden Nationalist Agenda." Big Data 5.4 (2017): 1 - 16.
2016
- Evert, Stephanie, et al. "A Distributional Approach to Open Questions in Market Research." Computers in Industry 78 (2016): 16-28.
2024
- Blombach, Andreas, and Bettina Lindner-Bornemann. "Der possessive Dativ in Raum und Zeit." Regionale Sprachgeschichte(n). Hrg. Dagobert Höllein, Günter Koch, Alexander Werth, Berlin/Boston: De Gruyter, 2024. 29-46.
- Chiarcos, Christian, et al. "Multiword expressions, collocations and the OntoLex vocabulary." Multiword expressions in lexical resources: Linguistic, lexicographic, and computational perspectives. Ed. Voula Giouli (ed), Verginica Barbu Mititelu (ed), Berlin: Language Science Press, 2024. 187–227.
URL: https://langsci-press.org/catalog/book/440 - Adrian, Axel, et al. "Auslegung des KI-VO-E zur Evaluation von Verfahren der Künstlichen Intelligenz am Beispiel der automatischen Anonymisierung von Gerichtsentscheidungen." Sprachmodelle: Juristische Papageien oder mehr? – Tagungsband des 27. Internationalen Rechtsinformatik Symposions IRIS 2024. Hrg. Erich Schweighofer / Stefan Eder / Federico Costantini / Felix Schmautzer / Jonas Pfister, 2024. 205 - 215.
2023
- Lindner-Bornemann, Bettina, and Andreas Blombach. "„Ach [...] was wars so dunkel in dem Wolf seinem Leib!“ Zur diachronen Entwicklung des possessiven Dativs." Historische (Morpho-)Syntax des Deutschen. Hrg. Alexander Lasch, Kerstin Roth, Dominik Hetjens, Berlin/Boston: De Gruyter, 2023. 298-316.
URL: https://www.degruyter.com/document/doi/10.1515/jbgsg-2023-0019/html - Adrian, Axel, et al. "Automatische Anonymisierung von Gerichtsurteilen – Eine Vision scheint realisierbar." Rechtsinformatik als Methodenwissenschaft des Rechts – Tagungsband des 26. Internationalen Rechtsinformatik Symposions IRIS 2023. Hrg. Erich Schweighofer / Jakob Zanol / Stefan Eder, Editions Weblaw, 2023. 211 - 220.
2022
- Peters, Joachim, and Nathan Dykes. "Die Palliativmedizinische Fachkultur in Geschichte und Gegenwart – sprachwissenschaftliche Perspektiven." Linguistik und Medizin. Ed. Ilg, Yvonne, Schnedermann, Theresa, Iakushevich, Marina, Berlin, New York: De Gruyter, 2022. 194-214.
- Adrian, Axel, et al. "Manuelle und automatische Anonymisierung von Urteilen." Digitalisierung von Zivilprozess und Rechtsdurchsetzung. Hrg. Adrian, Axel/Kohlhase, Michael/Evert, Stephanie/Zwickel, Martin, 2022. 173-197.
- Dykes, Nathan, Philipp Heinrich, and Stephanie Evert. "Retrieving Twitter argumentation with corpus queries and discourse analysis." Broadening the Spectrum of Corpus Linguistics: New approaches to variability and change. Ed. Susanne Flach, Martin Hilpert, John Benjamins Publishing Company, 2022. 229-256.
2021
- Pfaffenberger, Fabian, and Philipp Heinrich. "Die überschätzte Gefahr? Twitter-Bots im Europawahlkampf 2019." Europawahlkampf 2019: Zur Rolle der Medien. Ed. Holtz-Bacha C, Wiesbaden: Springer, 2021. 115 - 148.
- Keuchen, Michael, et al. "Anonymisierung von Gerichtsurteilen – Eine wesentliche Voraussetzung für E-Justice –." Cybergovernance - Tagungsband des 24. Internationalen Rechtsinformatik Symposions IRIS 2021. Hrg. Schweighofer E, Eder S, Hanke P, Kummer F, Saarenpää A, Editions Weblaw, 2021. 137 - 149.
2020
- Griebel, Tim, Stephanie Evert, and Philipp Heinrich. "Possibilities and Challenges of Corpus-Assisted Discourse Analyses of Austerity in the United Kingdom." Multimodal Approaches to Media Discourses: Reconstructing the Age of Austerity in the United Kingdom. Ed. Griebel T, Evert S, Heinrich P, London: Routledge, 2020. 1 - 10.
- Griebel, Tim, and Philipp Heinrich. "The Cultural Political Economy of Brexit in the Age of Austerity." Multimodal Approaches to Media Discourses: Reconstructing the Age of Austerity in the United Kingdom. Ed. Griebel T, Evert S, Heinrich P, London: Routledge, 2020. 163 - 188.
- Adrian, Christoph, et al. "Will the real populism (please) stand out? Eine interdisziplinäre Aufarbeitung populistischer Tendenzen in Brexit-Tweets im Kontext der Europawahl 2019." Europawahlkampf 2019. Ed. Christina Holtz-Bacha, Wiesbaden: Springer VS, 2020. 245-274.
2019
- Pfaffenberger, Fabian, Christoph Adrian, and Philipp Heinrich. "Was bin ich – und wenn ja, wie viele? Identifikation und Analyse von Political Bots während des Bundestagswahlkampfs 2017 auf Twitter." Die (Massen-)Medien im Wahlkampf: Die Bundestagswahl 2017. Ed. Holtz-Bacha, Christina, Wiesbaden: Springer, 2019. 97 - 124.
- Dimpel, Friedrich Michael, and Thomas Proisl. "Gute Wörter für Delta: Verbesserung der Autorschaftsattribution durch autorspezifische distinktive Wörter." DHd 2019. Digital Humanities: multimedial & multimodal. Konferenzabstracts. Ed. Patrick Sahle, 2019. 296–299.
URL: https://zenodo.org/record/2596095
2018
- Uhrig, Peter, Stephanie Evert, and Thomas Proisl. "Collocation Candidate Extraction from Dependency-Annotated Corpora: Exploring Differences across Parsers and Dependency Annotation Schemes." Lexical Collocation Analysis: Advances and Applications. Ed. Cantos-Gómez P, Almela-Sánchez M, Cham: Springer International Publishing, 2018. 111–140.
2017
- Evert, Stephanie, and Stella Neumann. "The impact of translation direction on characteristics of translated texts. A multivariate analysis for English and German." Empirical Translation Studies. New Theoretical and Methodological Traditions. Ed. De Sutter G, Lefer M, Delaere I, Berlin: Mouton de Gruyter, 2017. 47-80.
URL: http://www.stefan-evert.de/PUB/EvertNeumann2017/
2024
- Adrian, Axel, et al. "DIREGA – Building Decision Support for German Register Law." Presented at JURIX 2024, Brno Ed. Jaromir Savelka, Jakub Harasta, Tereza Novotna, Jakub Misek, IOS Press, 2024.
URL: https://ebooks.iospress.nl/volumearticle/71034 - Evert, Stephanie, Christine Ganslmayer, and Christian Rink. "Multi-level analysis as a systematic Approach to evaluating the quality of AI-generated dictionary entries." Proceedings of the EURALEX 2024, Cavtat/Dubrovnik Ed. Kristina Š. Despot, Ana Ostroški Anić, Ivana Brač, 2024. 298–315.
URL: https://euralex.jezik.hr/wp-content/uploads/2021/09/Euralex-XXI-proceedings_1st.pdf - Rink, Christian, Christine Ganslmayer, and Stephanie Evert. "Towards a comprehensive method for evaluating and utilizing AI-generated bilingual lexicographic data in language learning using the example of Chinese as a foreign language." Proceedings of the AsiaLex 2024, Toyo University, Tokyo Ed. Ai Inoue, Naho Kawamoto, Makoto Sumiyoshi, Tokyo: 東洋大学 (Toyo University), 2024. 133–142.
URL: https://drive.google.com/file/d/11eiezuzidbVhlr-0Cr_k2NxgTg5CEUcP/view - Heinrich, Philipp, et al. "Automatic Identification of COVID-19-Related Conspiracy Narratives in German Telegram Channels and Chats." Proceedings of the The 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), Turin Ed. Nicoletta Calzolari, Min-Yen Kan, Veronique Hoste, Alessandro Lenci, Sakriani Sakti, Nianwen Xue, 2024. 1932-1943.
URL: https://aclanthology.org/2024.lrec-main.173 - Dykes, Nathan, et al. "Leveraging High-Precision Corpus Queries for Text Classification via Large Language Models." Proceedings of the First Workshop on Language-driven Deliberation Technology (DELITE) @ LREC-COLING 2024, Torino, Italy Ed. Hautli-Janisz A, Lapesa G, Anastasiou L, Gold V, Liddo AD, Reed C, Torino, Italy: ELRA and ICCL, 2024. 52--57.
URL: https://aclanthology.org/2024.delite-1.7 - Khan, Anas Fahad, et al. "On Modelling Corpus Citations in Computational Lexical Resources." Proceedings of the Joint 30th International Conference on Computational Linguistics and 14th International Conference on Language Resources and Evaluation, LREC-COLING 2024, Torino, ITA Ed. Nicoletta Calzolari, Min-Yen Kan, Veronique Hoste, Alessandro Lenci, Sakriani Sakti, Nianwen Xue, Paris: European Language Resources Association (ELRA), 2024. 12385-12394.
URL: https://aclanthology.org/2024.lrec-main.1084/ - Adrian, Axel, et al. "Auslegung des KI-VO-E zur Evaluation von Verfahren der Künstlichen Intelligenz am Beispiel der automatischen Anonymisierung von Gerichtsentscheidungen." Proceedings of the 27. Internationalen Rechtsinformatik Symposions IRIS 2024, Salzburg, Österreich Ed. Erich Schweighofer, Stefan Eder, Federico Costantini, Felix Schmautzer, Jonas Pfister, Salzburg, Austria, 2024. 205 -- 215.
- Heinrich, Philipp, et al. "Automatic Identification of COVID-19-related Narratives in German Telegram Channels and Chats." Proceedings of the Joint 30th International Conference on Computational Linguistics and 14th International Conference on Language Resources and Evaluation, LREC-COLING 2024, Torino Ed. Nicoletta Calzolari, Min-Yen Kan, Veronique Hoste, Alessandro Lenci, Sakriani Sakti, Nianwen Xue, European Language Resources Association (ELRA), 2024. 1932-1943.
- Dykes, Nathan, et al. "Finding Argument Fragments on Social Media with Corpus Queries and LLMs." Proceedings of the 1st International Conference on Robust Argumentation Machines, RATIO 2024, Bielefeld, DEU Ed. Philipp Cimiano, Anette Frank, Michael Kohlhase, Benno Stein, Springer Science and Business Media Deutschland GmbH, 2024. 163-181.
- Zilio, Leonardo, et al. "Using character-level models for efficient abbreviation and long-form detection." Proceedings of the Joint 30th International Conference on Computational Linguistics and 14th International Conference on Language Resources and Evaluation, LREC-COLING 2024, Torino, Hybrid Ed. Nicoletta Calzolari, Min-Yen Kan, Veronique Hoste, Alessandro Lenci, Sakriani Sakti, Nianwen Xue, European Language Resources Association (ELRA), 2024. 3028-3037.
URL: https://aclanthology.org/2024.lrec-main.270
2023
- Dykes, Nathan, Anna Wilson, and Peter Uhrig. "A Pipeline for the Creation of Multimodal Corpora from YouTube Videos." Proceedings of the LIMO 2023. The 1st Workshop on Linguistic Insights from and for Multimodal Language Processing, Ingolstadt Ed. Piush Aggarwal, Özge Alaçam, Carina Silberer, Sina Zarrieß, Torsten Zesch, Ingolstadt: Association for Computational Linguistics, 2023. 1-5.
URL: https://aclanthology.org/2023.limo-1.1 - Uhrig, Peter, et al. "Studying Time Conceptualisation via Speech, Prosody, and Hand Gesture: Interweaving Manual and Computational Methods of Analysis." Proceedings of the Gesture and Speech in Interaction (GeSpIn) Conference, Nijmegen Ed. Wim Pouw, James Trujillo, Hans Rutger Bosker, Linda Drijvers, Marieke Hoetjes, Judith Holler, Sarka Kadava, Lieke Van Maastricht, Ezgi Mamus, Asli Ozyurek, 2023.
URL: https://hdl.handle.net/21.11116/0000-000D-A250-1
2022
- Blombach, Andreas, et al. "Exploring Lexical Diversities." Proceedings of the Digital Humanities 2022, Tokyo 2022. 130-134.
URL: https://dh2022.dhii.asia/dh2022bookofabsts.pdf - Diwersy, Sascha, et al. "Eine korpuslinguistische Analyse der Corona-Berichterstattung in der deutschen und französischen Presse." Tagungsband Mots et Discours de la Pandémie, Heidelberg 2022.
- Chiarcos, Christian, et al. "Modelling Collocations in OntoLex-FrAC." Proceedings of the Proceedings of Globalex Workshop on Linked Lexicography within the 13th Language Resources and Evaluation Conference Marseille, France: European Language Resources Association, 2022. 10--18.
URL: https://aclanthology.org/2022.gwll-1.3 - Gracia, Jorge, Besim Kabashi, and Ilan Kernerman. "TIAD 2022: The Fifth Translation Inference Across Dictionaries Shared Task." Proceedings of the Proceedings of Globalex Workshop on Linked Lexicography within the 13th Language Resources and Evaluation Conference Marseille, France: European Language Resources Association, 2022. 19--25.
URL: https://aclanthology.org/2022.gwll-1.4
2021
- Tayebi Arasteh, Soroosh, et al. "How Will Your Tweet Be Received? Predicting the Sentiment Polarity of Tweet Replies." Proceedings of the IEEE 15th International Conference on Semantic Computing (ICSC), Laguna Hills, CA Ed. IEEE, 2021. 370-373.
URL: https://ieeexplore.ieee.org/document/9364527 - Gracia, Jorge, Besim Kabashi, and Ilan Kernerman. "Results of the Translation Inference Across Dictionaries 2021 Shared Task." Proceedings of the The Translation Inference Across Dictionaries 2021 Shared Task Ed. Carvalho S, Souza RR, Zaragoza, Spain: CEUR-WS.org,, 2021. 208--220.
URL: http://ceur-ws.org/Vol-3064/tiad4.pdf
2020
- Proisl, Thomas, and Gabriella Lapesa. "KLUMSy@KIPoS: Experiments on Part-of-Speech Tagging of Spoken Italian." Proceedings of the 7th Evaluation Campaign of Natural Language Processing and Speech Tools for Italian (EVALITA 2020), Online Ed. Basile V, Croce D, Di Maro M, Passaro L, CEUR-WS.org, 2020.
URL: http://ceur-ws.org/Vol-2765/paper140.pdf - Blombach, Andreas, et al. "A Corpus of German Reddit Exchanges (GeRedE)." Proceedings of the 12th International Conference on Language Resources and Evaluation, LREC 2020, Marseille Ed. Nicoletta Calzolari, Frederic Bechet, Philippe Blache, Khalid Choukri, Christopher Cieri, Thierry Declerck, Sara Goggi, Hitoshi Isahara, Bente Maegaard, Joseph Mariani, Helene Mazo, Asuncion Moreno, Jan Odijk, Stelios Piperidis, European Language Resources Association (ELRA), 2020. 6310-6316.
URL: https://www.aclweb.org/anthology/2020.lrec-1.774 - Dykes, Nathan, Philipp Heinrich, and Andreas Blombach. "Independent argumentation schemes? Transferring argument queries from Brexit to environment tweets." Presented at ICAME41, Heidelberg 2020.
- Blombach, Andreas, et al. "A new German Reddit corpus." Proceedings of the 15th Conference on Natural Language Processing, KONVENS 2019, Erlangen-Nurnberg German Society for Computational Linguistics and Language Technology, 2020. 278-279.
- Evert, Stephanie, et al. "Corpus query lingua franca part II: Ontology." Proceedings of the 12th International Conference on Language Resources and Evaluation, LREC 2020, Marseille Ed. Nicoletta Calzolari, Frederic Bechet, Philippe Blache, Khalid Choukri, Christopher Cieri, Thierry Declerck, Sara Goggi, Hitoshi Isahara, Bente Maegaard, Joseph Mariani, Helene Mazo, Asuncion Moreno, Jan Odijk, Stelios Piperidis, European Language Resources Association (ELRA), 2020. 3346-3352.
- Proisl, Thomas, et al. "EmpiriST Corpus 2.0: Adding Manual Normalization, Lemmatization and Semantic Tagging to a German Web and CMC Corpus." Proceedings of the 12th International Conference on Language Resources and Evaluation, LREC 2020, Marseille Ed. Nicoletta Calzolari, Frederic Bechet, Philippe Blache, Khalid Choukri, Christopher Cieri, Thierry Declerck, Sara Goggi, Hitoshi Isahara, Bente Maegaard, Joseph Mariani, Helene Mazo, Asuncion Moreno, Jan Odijk, Stelios Piperidis, European Language Resources Association (ELRA), 2020. 6142-6148.
URL: https://www.aclweb.org/anthology/2020.lrec-1.754
2019
- Kabashi, Besim. "Collecting collocations for the Albanian language." Proceedings of the 6th Biennial Conference on Electronic Lexicography in the 21st Century: Smart Lexicography, eLex 2019, Sintra Ed. Iztok Kosem, Tanara Zingano Kuhn, Margarita Correia, Jose Pedro Ferreira, Maarten Jansen, Isabel Pereira, Jelena Kallas, Milos Jakubicek, Simon Krek, Carole Tiberius, Lexical Computing CZ s.r.o., 2019. 478-489.
- Dykes, Nathan, Philipp Heinrich, and Stephanie Evert. "Arguing Brexit on Twitter. A corpus linguistic study." Presented at European Conference on Argumentation 2019, Groningen 2019.
- Dykes, Nathan, Philipp Heinrich, and Stephanie Evert. "Reconstructing Twitter arguments with corpus linguistics." Presented at ICAME40: Language in Time, Time in Language, Neuchâtel 2019.
- Gracia, Jorge, et al. "Results of the translation inference across dictionaries 2019 shared task." Proceedings of the 2nd TIAD Shared Task - Translation Inference Across Dictionaries, TIAD 2019, Leipzig Ed. Jorge Gracia, Besim Kabashi, Besim Kabashi, Ilan Kernerman, CEUR-WS, 2019. 1-12.
- Proisl, Thomas, et al. "The_Illiterati: Part-of-Speech Tagging for Magahi and Bhojpuri Without Even Knowing the Alphabet." Proceedings of the First International Workshop on NLP Solutions for Under Resourced Languages (NSURL 2019), Trento Association for Computational Linguistics, 2019. 73-79.
URL: https://www.aclweb.org/anthology/2019.nsurl-1.11
2018
- Proisl, Thomas, et al. "EmotiKLUE at IEST 2018: Topic-Informed Classification of Implicit Emotions." Proceedings of the 9th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis, Brüssel Ed. Balahur A, Mohammad SM, Hoste V, Klinger R, Brussels: Association for Computational Linguistics, 2018. 235–242.
URL: http://aclweb.org/anthology/W18-6234 - Heinrich, Philipp, and Fabian Schäfer. "Extending Corpus-Based Discourse Analysis for Exploring Japanese Social Media." Proceedings of the 4th Asia Pacific Corpus Linguistics Conference (APCLC2018), Takamatsu Ed. Yukio Tono & Hitoshi Isahara, 2018. 135 - 140.
- Heinrich, Philipp. "Stylistic Features in Corporate Disclosures and their Predictive Power." Proceedings of the 4th Asia Pacific Corpus Linguistics Conference (APCLC2018), Takamatsu Ed. Yukio Tono & Hitoshi Isahara, 2018. 129 - 134.
- Kabashi, Besim, and Thomas Proisl. "Albanian Part-of-Speech Tagging: Gold Standard and Evaluation." Proceedings of the 11th Language Resources and Evaluation Conference, Miyazaki Ed. Calzolari N, Choukri K, Cieri C, Declerck T, Goggi S, Hasida K, Isahara H, Maegaard B, Mariani J, Mazo H, Moreno A, Odijk J, Piperidis S, Tokunaga T, Miyazaki: European Language Resources Association, 2018. 2593–2599.
URL: http://www.lrec-conf.org/proceedings/lrec2018/pdf/89.pdf - Heinrich, Philipp, et al. "A Transnational Analysis of News and Tweets about Nuclear Phase-Out in the Aftermath of the Fukushima Incident." Proceedings of the Workshop on Computational Impact Detection from Text Data, Miyazaki Ed. Andreas Witt, Jana Diesner, Georg Rehm, Paris: ELRA, 2018. 8 - 16.
- Proisl, Thomas, et al. "Delta vs. N-Gram Tracing: Evaluating the Robustness of Authorship Attribution Methods." Proceedings of the 11th Language Resources and Evaluation Conference, Miyazaki Ed. Calzolari N, Choukri K, Cieri C, Declerck T, Goggi S, Hasida K, Isahara H, Maegaard B, Mariani J, Mazo H, Moreno A, Odijk J, Piperidis S, Tokunaga T, Miyazaki: European Language Resources Association, 2018. 3309–3314.
URL: http://www.lrec-conf.org/proceedings/lrec2018/pdf/835.pdf - Proisl, Thomas. "SoMeWeTa: A Part-of-Speech Tagger for German Social Media and Web Texts." Proceedings of the 11th Language Resources and Evaluation Conference, Miyazaki Ed. Calzolari N, Choukri K, Cieri C, Declerck T, Goggi S, Hasida K, Isahara H, Maegaard B, Mariani J, Mazo H, Moreno A, Odijk J, Piperidis S, Tokunaga T, Miyazaki: European Language Resources Association, 2018. 665–670.
URL: http://www.lrec-conf.org/proceedings/lrec2018/pdf/49.pdf - Evert, Stephanie, Nathan Dykes, and Joachim Peters. "A quantitative evaluation of keyword measures for corpus-based discourse analysis." 2018.
URL: http://www.stefan-evert.de/PUB/EvertEtc2018_CAD_slides.pdf - Peters, Joachim, and Nathan Dykes. "From keywords to discourse - towards a keyword operationalisation model in discourse linguistics." Proceedings of the Corpora and Discourse International Conference Lancaster, 2018.
- Pfaffenberger, Fabian, Christoph Adrian, and Philipp Heinrich. "Political bots during the German federal election campaign 2017 on Twitter." Proceedings of the 7. European Communication Conference (ECC) der European Communication Research and Education Association (ECREA), Lugano 2018.
2017
- Evert, Stephanie, et al. "Combining Machine Learning and Semantic Features in the Classification of Corporate Disclosures." Proceedings of the Logic and Algorithms in Computational Linguistics 2017 (LACompLing2017), Stockholm Ed. Loukanova R, Liefke K, Stockholm: Stockholm University, 2017. 47 - 62.
URL: http://su.diva-portal.org/smash/get/diva2:1140018/FULLTEXT03.pdf - Proisl, Thomas, et al. "Translation Inference across Dictionaries via a Combination of Graph-based Methods and Co-occurrence Statistics." Proceedings of the Shared Task on Translation Inference Across Dictionaries, Galway Ed. McCrae J, Bond F, Buitelaar P, Cimiano P, Declerck T, Gracia J, Kernerman I, Ponsoda E, Ordan N, Piasecki M, CEUR, 2017. 94–102.
URL: http://ceur-ws.org/Vol-1899/TIAD17_paper_1.pdf - Evert, Stephanie, et al. "E-VIEW-Alation – a Large-Scale Evaluation Study of Association Measures for Collocation Identification." Proceedings of the eLex 2017, Leiden Ed. Iztok K, Carole T, Miloš J, Jelena K, Simon K, and Vít B, Brno: Lexical Computing, 2017. 531–549.
URL: https://elex.link/elex2017/wp-content/uploads/2017/09/paper32.pdf - Lapesa, Gabriella, and Stephanie Evert. "Large-scale evaluation of dependency-based DSMs: Are they worth the effort?" Proceedings of the Proceedings of the 15th Annual Meeting of the European Association for Computational Linguistics (EACL 2017): Volume 2, Short Papers Valencia, Spain, 2017. 394-400.
URL: http://www.linguistik.fau.de/dsmeval/ - Evert, Stephanie, Sebastian Wankerl, and Elmar Nöth. "Reliable measures of syntactic and lexical complexity: The case of Iris Murdoch." Presented at Proceedings of the Corpus Linguistics 2017 Conference, Birmingham Birmingham, UK, 2017.
URL: http://purl.org/stefan.evert/PUB/EvertWankerlNoeth2017.pdf
2016
- Wankerl, Sebastian, Elmar Nöth, and Stephanie Evert. "An Analysis of Perplexity to Reveal the Effects of Alzheimer's Disease on Language." Proceedings of the ITG-Fachbericht 267: Speech Communication Paderborn, Germany, 2016. 254-259.
- Kabashi, Besim, and Thomas Proisl. "A Proposal for a Part-of-Speech Tagset for the Albanian Language." Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016), Portorož Ed. Calzolari Nicoletta, Choukri Khalid, Declerck Thierry, Grobelnik Marko, Maegaard Bente, Mariani Joseph, Moreno Asuncion, Odijk Jan, Piperidis Stelios, Paris: European Language Resources Association (ELRA), 2016. 4305–4310.
URL: http://www.lrec-conf.org/proceedings/lrec2016/pdf/1066_Paper.pdf - Evert, Stephanie. "CogALex-V Shared Task: Mach5 – A traditional DSM approach to semantic relatedness." Proceedings of the Proceedings of the 5th Workshop on Cognitive Aspects of the Lexicon (CogALex-V) Osaka, Japan, 2016. 92-97.
URL: http://www.collocations.de/data/#mach5 - Evert, Stephanie, et al. "„Delta“ in der stilometrischen Autorschaftsattribution." Präsentiert bei DHd 2016, Leipzig Leipzig: Nisaba, 2016.
URL: http://www.dhd2016.de/abstracts/sektionen-002.html - Evert, Stephanie, et al. "EmpiriST 2015: A Shared Task on the Automatic Linguistic Annotation of Computer-Mediated Communication and Web Corpora." Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (ACL 2016), Berlin Berlin, Germany, 2016. 44-56.
URL: https://sites.google.com/site/empirist2015/ - Piperski, Aleksandr, and Anton Kukhto. "Intra-speaker stress variation in Russian: A corpus-driven study of Russian poetry." Proceedings of the 2016 International Conference on Computational Linguistics and Intellectual Technologies, Dialogue 2016 Rossiiskii Gosudarstvennyi Gumanitarnyi Universitet, 2016. 540-550.
URL: https://www.scopus.com/record/display.uri?eid=2-s2.0-85020440068&origin=inward - Proisl, Thomas, and Peter Uhrig. "SoMaJo: State-of-the-art tokenization for German web and social media texts." Proceedings of the 10th Web as Corpus Workshop (WAC-X), Berlin Ed. Cook P, Evert S, Schäfer R, Stemle E, Berlin: Association for Computational Linguistics (ACL), 2016. 57-62.
URL: http://aclweb.org/anthology/W16-26 - Santus, Enrico, et al. "The CogALex-V Shared Task on the Corpus-Based Identification of Semantic Relations." Proceedings of the Proceedings of the 5th Workshop on Cognitive Aspects of the Lexicon (CogALex-V) Osaka, Japan, 2016. 69-79.
URL: https://sites.google.com/site/cogalex2016/home/shared-task
2015
- Plotnikova, Nataliia, et al. "KLUEless: Polarity Classification and Association." Proceedings of the Proceedings of the 9th International Workshop on Semantic Evaluation (SemEval 2015) Denver, Colorado, 2015. 619--625.
URL: http://www.aclweb.org/anthology/S15-2103 - Plotnikova, Nataliia, et al. "SemantiKLUE: Semantic Textual Similarity with Maximum Weight Matching." Proceedings of the Proceedings of the 9th International Workshop on Semantic Evaluation (SemEval 2015) Denver, Colorado, 2015. 111--116.
URL: http://www.aclweb.org/anthology/S15-2020 - Evert, Stephanie, and Antti Arppe. "Some theoretical and experimental observations on naïve discriminative learning." Proceedings of the Proceedings of the 6th Conference on Quantitative Investigations in Theoretical Linguistics (QITL-6) Tübingen, Germany, 2015.
- Evert, Stephanie, et al. "Towards a better understanding of Burrows's Delta in literary authorship attribution." Proceedings of the Proceedings of the Fourth Workshop on Computational Linguistics for Literature Denver, CO, 2015. 79--88.
URL: http://www.aclweb.org/anthology/W15-0709 - Evert, Stephanie, and Andrew Hardie. "Ziggurat: A new data model and indexing format for large annotated text corpora." Proceedings of the Proceedings of the 3rd Workshop on the Challenges in the Management of Large Corpora (CMLC-3) Lancaster, UK, 2015. 21--27.
weitere siehe CRIS