¹û¶³Ó°Ôº

XClose

¹û¶³Ó°Ôº Module Catalogue

Home
Menu

Natural Language Processing and Text Analysis (INST0073)

Key information

Faculty
Faculty of Arts and Humanities
Teaching department
Information Studies
Credit value
15
Restrictions
This module is restricted to students from the Department of Information Studies.
Timetable

Alternative credit options

There are no alternative credit options available for this module.

Description

Description

This module is intended to provide a fundamental understanding of the contemporary concepts and techniques used for processing linguistic data in textual form (Natural Language Processing). It emphasizes the use of existing frameworks and NLP algorithms, provides students with the skills to analyse textual data, and familiarises them with tools and applications. The first part of the course introduces core topics in computational modelling (morphology) of natural language including; Regular Expressions, Finite State Transducers, Part of Speech Tagging, Stemming, and Lemmatisation. The second part of the course introduces contemporary text processing and analysis tasks such as Information Extraction, Named Entity Recognition and Sentiment Analysis, and their application to automated processing of different text sources (academic publications, social media, news articles, and humanities corpora). The module follows a combination of teaching and learning methods, including lectures and computing laboratory work by putting more emphasis on learning through practical work (exercises and mini-projects) to ground the more theoretical aspects of the module syllabus.

Learning outcomes

Upon completion of the module, students will be able to:

  • Demonstrate knowledge and understanding of the fundamental concepts of Natural Language Processing and text analysis.
  • Apply methods and tools in the area of computational modelling (morphology) of natural language for automatic processing of textual input.
  • Explore solutions to text analysis problems in relation to automated processing tasks such as Information Extraction, Named Entity Recognition, Sentiment Analysis and other.
  • Develop small scale NLP processing pipelines using existing tools and frameworks.
  • Evaluate the outcome of the automated text processing pipelines according to standard metrics.

Optional for: MSc Knowledge, Information and Data Science, MA/MSc Digital Humanities, MA Archives and Records Management, MA Library and Information Studies, MA Publishing, 3rd year BSc Information Management or Business

Prerequisites: INST0019 Introduction to Programming and Scripting or equivalent experience with programming at a foundational level.

Please note: this module will be capped at 30 students.

Module deliveries for 2024/25 academic year

Intended teaching term: Term 2 ÌýÌýÌý Postgraduate (FHEQ Level 7)

Teaching and assessment

Mode of study
In person
Methods of assessment
100% Coursework
Mark scheme
Numeric Marks

Other information

Number of students on module in previous year
41
Module leader
Dr Andreas Vlachidis
Who to contact for more information
s.davenport@ucl.ac.uk

Last updated

This module description was last updated on 8th April 2024.

Ìý