Note: this lsf-course 4379 takes place together with lsf-course 3108 as one course. FB4 students (e.g. Master DA) register for lsf-course 3108, FB3 students (e.g. Master IIM/SWIKK) register for lsf-course 4379. Both use the learnweb-course 3108.
The course will take place on campus. While we will provide some material online, participating entirely remotely will not be possible.
Students will learn the most important phenomena in natural languages on different levels of granularity, starting with the combination of sounds to the meaning of words, sentences and texts.You will get an introduction to main symbolic and statistical approaches to model these phenomena. All theoretical topics will be accompanied by exercises dealing with these phenomena and demonstrating their use in practical applications, like spelling correction, auto completion, keyword extraction, topic detection, named entity recognition, relation extraction, synonym detection, etc.
The objective of this course is to introduce research questions from computational linguistics that can (and need to) be solved with large amounts of language data. Alongside, we introduce relevant basic linguistic phenomena and standard ways of describing them. We then discuss a number of tools and approaches that are based on large amounts of language data. Participants are familiarized with the notions, issues and approaches in the presentation part of the course and then can themselves experiment with data and tools, in order to contribute to a common descriptive and analytic task. Examples and data are taken from English (or paraphrased in EN).
This year we will focus on a specific topic: methods for dealing with words. You may participate in this course if you already attended the NLP course of summer 2022 to extend your knowledge on NLP. However, we do not require the other course as a prerequesite.