BIG DATA COURSE
This Big Data course provides students with the solution for Big Data analytics under different software platforms such as Cloudera and Hortonwork. Specifically, the course will introduce the Hadoop and Map Reduce and some important infrastructures built on the top of Hadoop system including Hive, Pig, HBase and Scoop. This big data course also covers Apache Storm for processing streaming data in real time and Spark SQL, Spark Streaming, GraphX and Spark framing data analysis using Python.
TOPICS OF THE BIG DATA COURSE
- Introducing Apache Hadoop and Spark
- Map Reduce
- Apache Hive
- Hive User-Defined Functions (UDFs)
- Introducing Apache Pig
- Introducing Data Piping
- Hbase Non-SQL Programming
- Processing Streaming Data in Hadoop with Apache Storm
- Spark Programming
WHO SHOULD TAKE THIS COURSE?
Data analysts, data scientists, statisticians, mathematicians, computer programmers
WHO IS THE LEAD INSTRUCTOR OF THIS COURSE?
Ms Gitimoni Saikia is also the instructor of this course. She has a master in computer science and 7+ years of successful work experience in the education field. She has taught many programming languages, data structures and machine learning and has gained in-depth knowledge of machine learning algorithms including deep-learning and applied them to many projects using tools in Python, C++ and Java. Her areas of research interests include computer vision, natural language processing and financial analytics. Currently, Ms. Saikia is working as a data science consultant to many corporations in Toronto and also teaching data science courses at Metro College of Technology, Toronto, Ontario, Canada.
Credential: College certificate.
Hours: 50
Location of course delivery: Toronto, Ontario, Canada