Semester 2: Big Data Technologies
- Hadoop Ecosystem
  - Introduction to Hadoop
  - Hadoop Distributed File System (HDFS)
  - MapReduce programming model
- Apache Spark
  - Introduction to Spark
  - Spark architecture and components
  - Spark programming in Scala or Python
- NoSQL Databases
  - Types of NoSQL databases (e.g., MongoDB, Cassandra)
  - Data modeling and querying in NoSQL databases
Reference Books:
- “Hadoop: The Definitive Guide” by Tom White
- “Spark: The Definitive Guide” by Bill Chambers and Matei Zaharia
- “Big Data: Principles and best practices of scalable realtime data systems” by Nathan Marz and James Warren
- “Kafka: The Definitive Guide” by Neha Narkhede, Gwen Shapira, and Todd Palino
- “HBase: The Definitive Guide” by Lars George
- “Big Data Analytics with R and Hadoop” by Vignesh Prajapati
- “Big Data Technologies and Applications” edited by Borko Furht and Flavio Villanustre