Workshop: Introduction to topic modelling and Natural Language Processing (NLP)

  • Date: –15:00
  • Location: Engelska parken 4-2007
  • Lecturer: Jonas Frankemölle, Research Engineer, CDHU
  • Organiser: Centre for Digital Humanities and Social Sciences
  • Last day of registration: 2/27/2024 at 11:59 PM
  • Contact person: Victoria Yantseva
  • Number of seats: 22 (of which 20 are booked)
  • Registration is closed for this event
  • Workshop

In this workshop, we will learn the basics of topic modelling and Natural Language Processing (NLP), techniques for uncovering patterns and structures across large collections of text, known as corpora. Using popular Python libraries such as NLTK and Gensim, we will go through the main steps of a standard NLP pipeline: text cleaning and pre-processing, tokenization and vectorization, applying topic modelling algorithms, and, finally, evaluating the results. Basic familiarity with Python is recommended but not required.
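
As a rough illustration of the first steps in such a pipeline (cleaning, tokenization, and vectorization), here is a minimal sketch using only the Python standard library. It is not the workshop material: the function names `preprocess`, `build_vocabulary`, and `to_bag_of_words`, the tiny stop-word list, and the two example sentences are all assumptions made for this example.

```python
import re
from collections import Counter

# Tiny illustrative stop-word list (an assumption; real pipelines use a fuller one).
STOP_WORDS = {"the", "a", "an", "of", "and", "in", "to", "is", "are", "on"}


def preprocess(text):
    """Lowercase the text, replace non-letters with spaces, split, drop stop words."""
    cleaned = re.sub(r"[^a-z\s]", " ", text.lower())
    return [tok for tok in cleaned.split() if tok not in STOP_WORDS]


def build_vocabulary(tokenized_docs):
    """Map each distinct token to an integer id, assigned in alphabetical order."""
    vocab = sorted({tok for doc in tokenized_docs for tok in doc})
    return {tok: idx for idx, tok in enumerate(vocab)}


def to_bag_of_words(tokens, vocabulary):
    """Return sorted (token_id, count) pairs for a single document."""
    counts = Counter(vocabulary[tok] for tok in tokens if tok in vocabulary)
    return sorted(counts.items())


# Hypothetical two-document corpus, for illustration only.
docs = [
    "The cat sat on the mat.",
    "Dogs and cats are pets.",
]

tokenized = [preprocess(d) for d in docs]
vocabulary = build_vocabulary(tokenized)
bow_corpus = [to_bag_of_words(toks, vocabulary) for toks in tokenized]

print(tokenized)
print(bow_corpus)
```

In Gensim, comparable steps are typically handled by `corpora.Dictionary` and its `doc2bow` method, and the resulting bag-of-words corpus can then be passed to a topic model such as `LdaModel`.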

Last modified: 2021-12-01