Course Description
This course will introduce the latest development of information retrieval and web mining technologies. In the first part of the course, we will overview the fundamental concepts of information retrieval, such as crawling, parsing, indexing, searching, scoring, and compression. These techniques enable students to handle web scale datasets. In the second part, we will discuss how to extract knowledge from web scale datasets by link analysis, clustering, and recommendation techniques. Moreover, some latest implementation techniques (such as Apache Hadoop, Pig, and Lucene) will be studied thoroughly by the course project. The course is aimed at helping students to explore the latest techniques in information retrieve and web mining. Some research oriented projects will be given according to students’ background knowledge. The contents of the course will mix with lectures, tutorials, and group discussions.
Intended Learning Outcomes
CILO-1: An ability to apply knowledge of computing and mathematics appropriate to the programme outcomes and to the discipline.
CILO-2: An ability to analyse a problem, and identify and define the computing requirements appropriate to its solution.
CILO-3: An ability to analyse the local and global impact of computing on individuals, organisations, and society.