Data Science and Big Data Analytics using Open Source (18 Nov 2019 to 29 Nov 2019)

Data Science and Big Data Analytics using Open Source (Oplan CSE-20)
(18 Nov 2019 to 29 Nov 2019)  at NITTTR Chandigarh

Contents

Introduction to Data Science and Big Data

Setup and Installation of Jupyter Notebook for Python and Libraries for Data Science and Big Data

Essentials of Python

Data Structure, Loops and Control Statements in Python

Functions, Arrays

Introduction to NumPy and Practice Session

Data and File Handling using Pandas

Introduction to WEKA

Data Preprocessing and ETL Tools for Data Science and Big Data Analytics

Statistical Analysis (Binomial, Poisson, Chi-square Test, Anova)

Implementation of Scikit-Learn Algorithms For Data Science

Essentials of Mathematics for Data Science and Big Data Analytics

Apache Hadoop Cluster Implementation for Big Data

OpenCV for Big Data in Computer Vision Applications

Apache Pig Implementation to Improve Hadoop Performance

Setup and Installation of Apache Spark

Big data Analytics using Apache Spark

Data Mining Algorithms for Data Analytics using WEKA

HDFS and HIVE for Data Analytics

Handling of Big Datasets using Dask

Data Analytics using PySpark

Social Networks Data Analysis using NetworkX

NoSQL Databases for Big Data

Real Time Weather Reporting and Analytics

Speech Recognition and Transformation

Geospatial Data Analysis and Real Time Streaming Apps

Advance Data Visualization and Plotting