Data Science Projects
# Data Science and Machine Learning Notebooks
- Visualization for Exploratory Data Analysis
- Matplotlib notebook (scatter plot, histogram/kde, box plot)
- Lightning - http://lightning-viz.org/
- Bokeh - https://bokeh.pydata.org/
- Data cleaning
- Notebook
- Dimension reduction
- Notebook for PCA, LogisticRegression, DecisionTree, RandomForest (from Scikit-learn)
- Data Set: Wine Quality
- Problem: Predict quality of wine.
- Approach: Used classification methods (Logistic Regression, Decision Tree, Random Forest) after applying PCA.
- Notebook for PCA, LDA, Autoencoder (from Scikit-learn)
- Data Set: Human Activity Recognition
- Problem: Predict activity from observations.
- Approach:
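The Wine Quality approach in this section (classification after applying PCA) can be sketched roughly as follows. This is a minimal illustration and not the notebook's actual code: the data is a synthetic stand-in generated with `make_classification`, and the number of components, the train/test split, and all hyperparameters are assumptions chosen for the example.

```python
from sklearn.datasets import make_classification
from sklearn.decomposition import PCA
from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.tree import DecisionTreeClassifier

# Assumption: synthetic stand-in for the Wine Quality table
# (11 numeric features, binary quality label).
X, y = make_classification(
    n_samples=600, n_features=11, n_informative=6, random_state=0
)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, stratify=y, random_state=0
)

# The three classifiers named in the notebook title (hyperparameters assumed).
classifiers = {
    "LogisticRegression": LogisticRegression(max_iter=1000),
    "DecisionTree": DecisionTreeClassifier(random_state=0),
    "RandomForest": RandomForestClassifier(n_estimators=100, random_state=0),
}

scores = {}
for name, clf in classifiers.items():
    # Standardize, project onto 5 principal components, then classify.
    model = make_pipeline(StandardScaler(), PCA(n_components=5), clf)
    model.fit(X_train, y_train)
    scores[name] = model.score(X_test, y_test)

for name, acc in scores.items():
    print(f"{name}: test accuracy = {acc:.3f}")
```

Wrapping the scaler and PCA in a pipeline keeps both fitted on the training split only, so the test score is not influenced by test-set statistics.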
Clustering
- Notebook for K-Means (from Scikit-learn)
- Data Set: Wine Quality
- Problem: Cluster the data into two quality categories.
- Approach: Used the K-Means algorithm to cluster the data.
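A two-cluster K-Means run of the kind listed above might look like the sketch below. It is illustrative only: the data comes from `make_blobs` as a synthetic stand-in for the Wine Quality features, and the scaling step, `n_init`, and the adjusted Rand index check are assumptions rather than details taken from the notebook.

```python
from sklearn.cluster import KMeans
from sklearn.datasets import make_blobs
from sklearn.metrics import adjusted_rand_score
from sklearn.preprocessing import StandardScaler

# Assumption: synthetic stand-in with 11 features and two underlying groups.
X, true_groups = make_blobs(
    n_samples=400, n_features=11, centers=2, random_state=0
)

# K-Means is distance based, so features are standardized first.
X_scaled = StandardScaler().fit_transform(X)

# Two clusters, matching the two quality categories in the problem statement.
kmeans = KMeans(n_clusters=2, n_init=10, random_state=0)
labels = kmeans.fit_predict(X_scaled)

# Compare cluster assignments with the known groups (label order is ignored).
ari = adjusted_rand_score(true_groups, labels)
print(f"cluster sizes: {[int((labels == k).sum()) for k in range(2)]}")
print(f"adjusted Rand index: {ari:.3f}")
```

The adjusted Rand index is used here because cluster ids are arbitrary, so plain accuracy against the quality labels would depend on which cluster happens to be numbered 0.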
Regression
- Notebook for LinearRegression, Ridge, Lasso, ElasticNet, ElasticNet CrossValidation, HuberRegressor (from Scikit-learn)
- Data Set: House Sales in King County, USA
- Problem: Predict price of a house.
- Approach: Used regression methods (Linear, Ridge, Lasso, ElasticNet, Huber).
- TensorFlow.LinearRegressor
- TensorFlow.DNNRegressor
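The Scikit-learn part of the regression approach above could be compared along these lines. This is a hedged sketch, not the notebook itself: `make_regression` provides synthetic data in place of the King County house sales table, and the regularization strengths, `l1_ratio` grid, and split are assumed values.

```python
from sklearn.datasets import make_regression
from sklearn.linear_model import (
    ElasticNet,
    ElasticNetCV,
    HuberRegressor,
    Lasso,
    LinearRegression,
    Ridge,
)
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

# Assumption: synthetic stand-in for the house sales features and prices.
X, y = make_regression(
    n_samples=500, n_features=10, n_informative=6, noise=15.0, random_state=0
)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0
)

# The regressors named in the notebook title (hyperparameters assumed).
regressors = {
    "LinearRegression": LinearRegression(),
    "Ridge": Ridge(alpha=1.0),
    "Lasso": Lasso(alpha=0.1),
    "ElasticNet": ElasticNet(alpha=0.1, l1_ratio=0.5),
    "ElasticNetCV": ElasticNetCV(l1_ratio=[0.1, 0.5, 0.9], cv=5),
    "HuberRegressor": HuberRegressor(max_iter=1000),
}

r2 = {}
for name, reg in regressors.items():
    # Standardize features so the penalty terms treat them comparably.
    model = make_pipeline(StandardScaler(), reg)
    model.fit(X_train, y_train)
    r2[name] = model.score(X_test, y_test)  # coefficient of determination

for name, score in r2.items():
    print(f"{name}: test R^2 = {score:.3f}")
```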
Classification
- Notebook for DNNClassifier, LinearClassifier (from TensorFlow)
- Data Set: Wine Quality
- Problem: Predict quality of wine.
- Approach: Used classification methods (DNNClassifier, LinearClassifier).
- SVM
- Text Analysis and NLP
- Word2Vec
- Reinforcement Learning