Eurach Research

LCI

Learner Corpus Infrastructure

    Learner corpora build a fundamental basis for a noticeable part of the research activities of the Institute for Applied Linguistics. The Learner Corpus Infrastructure project (LCI)  aims at enhancing the research potential of the Institute by creating an always more efficient infrastructure for the collection, processing and maintenance of learner corpora.

    Publications
    Categorising speakers’ language background: Theoretical assumptions and methodological challenges for learner corpus research
    Lopopolo O, Bienati A, Frey JC, Glaznieks A, Spina S (2025)
    Elsevier BV
    Journal article
    Research Methods in Applied Linguistics

    More information: http://dx.doi.org/10.1016/j.rmal.2024.100170

    https://doi.org/10.1016/j.rmal.2024.100170

    Intensification in written L2 Italian: Insights from the multilingual region of South Tyrol

    Spina S, Glaznieks A, Abel A (2025)
    Journal article
    International Journal of Learner Corpus Research

    https://doi.org/10.1075/ijlcr.23041.spi

    Categorizing Speakers’ Language Background: Theoretical Assumptions and Methodological Challenges for Learner Corpus Research
    Lopopolo O, Bienati A, Frey J-C, Glaznieks A, Spina S (2024)
    Presentation/Speech

    Conference: Exploring the Use of Corpus Linguistics Research Methods, BAAL Corpus Linguistics SIG Symposium | University of Edinburgh | 27.3.2024 - 27.3.2024

    Corpora di apprendenti di italiano L2 (anche ma non solo) in Alto Adige
    Bienati A (2024)
    Presentation/Speech

    Conference: Tecnologie per la didattica delle lingue | Bologna | 16.4.2024 - 17.4.2024

    La scuola e le altre lingue? Una riflessione epistemologica sugli usi linguistici in contesti scolastici
    Platzgummer V, Bienati A, Lopopolo O, Leone-Pizzighella AR (2024)
    Presentation/Speech

    Conference: AItLA 2024| XXIV Congresso Internazionale dell'Associazione Italiana di Linguistica Applicata | Pavia : 21.2.2024 - 23.2.2024

    Discourse markers in the curricularization of ‘academic language’: a mixed methods analysis of tipo and praticamente in Italian secondary schools
    Leone-Pizzighella AR, Bienati A, Frey JC (2024)
    Contribution in book
    Contesti, pratiche e risorse della comunicazione multimodale

    More information: http://www.aitla.it/images/pdf/StudiAItLA18/009_Leone-Pizzig ...

    KoKo German L1 Learner Corpus 4
    Abel A, Glaznieks A, Culy C, Nicolas L, Stemle E (2024)
    Database

    More information: http://hdl.handle.net/20.500.12124/77

    MultiGED-2023 shared task at NLP4CALL: Multilingual Grammatical Error Detection
    Volodina E, Bryant C, Caines A, De Clerq O, Frey JC, Ershova E, Rosen A, Vinogradova O (2023)
    Conference proceedings article

    Conference: 12th Workshop on Natural Language Processing for Computer Assisted Language Learning (NLP4CALL 2023) | Tórshavn, Faroe Islands | 22.5.2023 - 22.5.2023

    More information: https://aclanthology.org/2023.nlp4call-1.1.pdf

    What kind of speakers are these? Discussing the ‘native speaker’ categorization in corpus design
    Lopopolo O, Glaznieks A (2023)
    Presentation/Speech

    Conference: Morella Young Researchers Symposium on Multilingualism (MYRSM) | Morella | 12.6.2023 - 13.6.2023

    MultiGED-2023 shared task at NLP4CALL: Multilingual Grammatical Error Detection
    Volodina E, Bryant C, Caines A, De Clercq O, Frey JC, Ershova E, Rosen A, Vinogradova O (2023)
    Presentation/Speech

    Conference: 12th Workshop on Natural Language Processing for Computer Assisted Language Learning (NLP4CALL 2023) | Tórshavn, Faroe Islands | 22.5.2023 - 22.5.2023

    More information: https://docs.google.com/presentation/d/1HxJOihIIpShf--Dbui4F ...

    Discussing the ‘native speaker’ categorization in corpus design: theories, procedures and methodological decisions
    Lopopolo O (2023)
    Presentation/Speech

    Conference: Forschungskolloquium - Institut für deutsche Sprache und Linguistik - Humboldt Universität Berlin | Berlin | 28.6.2023 - 28.6.2023

    The Kolipsi Corpus Family: Resources for Learner Corpus Research in Italian and German
    Glaznieks A, Frey JC, Abel A, Nicolas L, Vettori C (2023)
    Journal article
    Italian Journal of Computational Linguistics

    More information: https://journals.openedition.org/ijcol/1210

    https://doi.org/10.4000/ijcol.1210

    A core metadata schema for L2 data
    Paquot M, König A, Frey JC, Stemle EW (2023)
    Presentation/Speech

    Conference: EuroSLA 32, Conference of the European Second Language Association | Brimingham | 30.8.2023 - 2.9.2023

    Leonide: A longitudinal trilingual corpus of young learners of Italian, German and English
    Glaznieks A, Frey JC, Stopfner M, Zanasi L, Nicolas L (2022)
    John Benjamins Publishing Company
    Journal article
    International Journal of Learner Corpus Research

    More information: https://doi.org/10.1075/ijlcr.21004.gla

    https://doi.org/10.1075/ijlcr.21004.gla

    https://hdl.handle.net/10863/22888

    Syntactic variation in German “weil”-clauses: Comparison between immersed and non-immersed learners of German
    Glaznieks A, Frey JC (2022)
    Presentation/Speech

    Conference: 6th International Conference for Learner Corpus Research (LCR 2022) | Padova | 22.9.2022 - 24.9.2022

    Introducing a Gold Standard Corpus from Young Multilinguals for the Evaluation of Automatic UD-PoS Taggers for Italian
    Schmalz VJ, Frey JC, Stemle EW (2022)
    Conference proceedings article

    Conference: Eighth Italian Conference on Computational Linguistics (CliC-it 2021) | Milan | 29.6.2022 - 1.7.2022

    More information: http://ceur-ws.org/Vol-3033/paper13.pdf

    Towards standardizing LCR metadata
    König A, Frey JC, Stemle EW, Glaznieks A, Paquot M (2022)
    Presentation/Speech

    Conference: 6th International Conference for Learner Corpus Research (LCR 2022) | Padova | 22.9.2022 - 24.9.2022

    Italian middle school students' task motivation in narrative essay writing
    Lopopolo O, Frey JC, Okinina N (2022)
    Presentation/Speech

    Conference: VII Congresso DILLE “Didattica delle lingue e valutazione| società, scuola, università” | Pisa : 12.5.2022 - 13.5.2022

    LEONIDE: A longitudinal trilingual corpus of young learners of Italian, German and English
    Glaznieks A, Frey JC (2022)
    Presentation/Speech

    Conference: 6th International Conference for Learner Corpus Research (LCR 2022) | Padova | 22.9.2022 - 24.9.2022

    Kolipsi-1 Corpus v1.0
    Glaznieks A, Frey JC, Abel A, Vettori C, Nicolas L (2021)
    Database

    More information: http://hdl.handle.net/20.500.12124/26

    https://hdl.handle.net/10863/20577

    Kolipsi-2 Corpus v1.0
    Glaznieks A, Frey JC, Abel A, Vettori C, Nicolas L (2021)
    Database

    More information: http://hdl.handle.net/20.500.12124/30

    https://hdl.handle.net/10863/20578

    Exploring Reusability and Reproducibility for a Research Infrastructure for L1 and L2 Learner Corpora
    König A, Frey JC, Stemle EW (2021)
    MDPI AG
    Journal article
    Information

    More information: http://dx.doi.org/10.3390/info12050199

    https://doi.org/10.3390/info12050199

    https://hdl.handle.net/10863/17415

    Creating a learner corpus infrastructure: Experiences from making language learner data available
    Frey J, König A, Fišer D (2020)
    Conference proceedings article

    Conference: International Conference on ICT Enhanced Social Sciences and Humanities (ICTeSSH 2020) | Virtual | 29.6.2020 - 1.7.2020

    More information: https://www.itm-conferences.org/articles/itmconf/pdf/2020/03 ...

    https://doi.org/10.1051/itmconf/20203303006

    https://hdl.handle.net/10863/19257

    PORTA: The Learner Corpus Portal of Eurac Research
    Glaznieks A (2020)
    Website

    More information: https://www.porta.eurac.edu/

    https://hdl.handle.net/10863/14747

    Creating a learner corpus infrastructure: Experiences from making language learner data available
    König A, Frey JC (2020)
    Presentation/Speech

    Conference: International Conference on ICT Enhanced Social Sciences and Humanities (ICTeSSH 2020) | Virtual | 29.6.2020 - 1.7.2020

    https://hdl.handle.net/10863/14866

    Lexikalische Komplexität im Kontext holistischer Textbewertungen
    Frey JC (2020)
    Presentation/Speech

    Conference: Mehrsprachigkeit und Lernerkorpora | Bolzano | 13.2.2020 - 13.2.2020

    https://hdl.handle.net/10863/14953

    CTAP for Italian: Integrating Components for the Analysis of Italian into a Multilingual Linguistic Complexity Analysis Tool
    Okinina N, Frey JC, Weiß Z (2020)
    Conference proceedings article

    Conference: 12th International Conference on Language Resources and Evaluation (LREC 2020) | Marseille | 11.5.2020 - 16.5.2020

    https://hdl.handle.net/10863/16771

    LEONIDE – Longitudinal Learner Corpus in Italiano, Deutsch and English 1.0
    Glaznieks A, Frey JC, Stopfner M, Zanasi L, Nicolas L (2020)
    Database

    More information: http://hdl.handle.net/20.500.12124/25

    LEONIDE: A longitudinal trilingual corpus of young learners of Italian, German and English
    Glaznieks A, Frey JC, Stopfner M, Zanasi L (2020)
    Presentation/Speech

    Conference: 14th Teaching and Language Corpora conference (TaLC 2020) | Perpignan | 15.7.2020 - 17.7.2020

    https://hdl.handle.net/10863/19248

    Transc&Anno: A Graphical Tool for the Transcription and On-the-Fly Annotation of Handwritten Documents
    Okinina N, Nicolas L, Lyding V (2018)
    Presentation/Speech

    Conference: Games4NLP workshop at Eleventh International Conference on Language Resources and Evaluation (LREC 2018) | Miyazaki | 7.5.2018 - 12.5.2018

    https://hdl.handle.net/10863/19291

    Transc&Anno: A Graphical Tool for the Transcription and On-the-Fly Annotation of Handwritten Documents
    Okinina N, Nicolas L, Lyding V (2018)
    Conference proceedings article

    Conference: Games4NLP workshop at Eleventh International Conference on Language Resources and Evaluation (LREC 2018) | Miyazaki | 7.5.2018 - 12.5.2018

    More information: www.lrec-conf.org/proceedings/lrec2018/pdf/107.pdf

    https://hdl.handle.net/10863/7940

    KoKo German L1 Learner Corpus v3
    Abel A, Glaznieks A, Culy C (2014)
    Database

    More information: http://hdl.handle.net/20.500.12124/12

    Related News
    1 - 2

    Science Shots Eurac Research Newsletter

    Get your monthly dose of our best science stories and upcoming events.

    Choose language
    Eurac Research logo

    Eurac Research is a private research center based in Bolzano (South Tyrol) with researchers from a wide variety of scientific fields who come from all over the globe. Together, through scientific knowledge and research, they share the goal of shaping the future.

    No Woman No Panel

    What we do

    Our research addresses the greatest challenges facing us in the future: people need health, energy, well-functioning political and social systems and an intact environment. These are complex questions, and we are seeking the answers in the interaction between many different disciplines. [About us](/en/about-us-eurac-research)

    WORK WITH US

    Except where otherwise noted, content on this site is licensed under a Creative Commons Attribution 4.0 International license.