Course Description
Traditional relational databases systems and data warehouse platforms are no more enough to manage big data. This course discusses concepts, management, analytics and impact of big data on nowadays environment. It introduces the core technologies for big data management, and architectures and ecosystems, and helps students gain knowledge in designing, managing and evaluating big data solutions. By working on lab exercises and group project, students will develop skills in solving big data management problems in real-world business contexts.
Intended Learning Outcomes
CILO-1: Explain the core concepts of big data and storage techniques.
CILO-2: Explain the core concepts of Hadoop, a well-known big data storage & processing technique.
CILO-3: Explain the core concepts of Spark, the most popular big data storage & processing technique.
CILO-4: Demonstrate how to manage Hadoop Distributed File System (HDFS) in Linux.
CILO-5: Demonstrate how to operate Linux to support big data storage & processing tools.
CILO-6: Create Spark jobs to analyze big data.