COM SCI X 450.3
Big Data Management
This course introduces tools for distributed storage and data processing in an open-source framework.
It covers NoSQL, the core components of Hadoop, and an overview of Hive.
The volume of data being produced and stored by organizations is increasing.
In fact, IDC has projected that it will reach 165 zettabytes by 2025.
Organizations understand that extracting value and actionable insights from this big data can give them a tremendous competitive advantage.
In this course, students learn tools for distributed storage and data processing in an open-source framework.
This course addresses distributed storage and large data set processing, focusing on architectures and technologies.