All the below modules have been written into python only, it does not contains any other technology like Spark PySpark etc....
-
pandas: This folder contains all the operation of pandas like creating dataframes and various operations on pandas dataFrame.
-
NumPy for Data Analysis: This folder contains all the operations related to NumPy Array and various operations on NumPy array.
-
Basics_Python: This folder contains basics operation on python.
-
Data Capstone Project: This folder contains basics operation on python -- a samall project.
-
Pandas BuiltIn DataVisualization: This folder contains data visualization with pandas dataFrame.
-
Data Visualization with Seaborn: This folder contains data visualization using Seaborn.
-
Linear Regression: This folder contains machine learning model linear regression
-
Logistic Regression: This folder contains code for simple logistic regression project.
-
K-Means Clustering: This folder contains simple K-means clustering project (with exploratory data analysis)
-
K Nearest Neighbors: This folder contains a simple KNN (k nearest neighbors) project.
-
PCA with python: This folder contains PCA (principal component analysis) project usigsklearnwith cancer data set.
-
Recommender System: This folder contains recommender system project with NumPy, pandas dataframe and seaborn for exploratory data analysis.