By the end of this course, students will be able to:
- Define big data and its features, and distinguish big data problems from problems that are not.
- Describe the data storage and retrieval requirements for big data, and use common big data platforms (Apache Hadoop and Spark) to store and retrieve data in large-scale databases.
- Identify and apply appropriate machine learning algorithms for big data analytics tasks.
- Apply real-time analytics to big data streams.
- Model a problem as a graph database and perform analytics on it in a scalable manner.