Introduction to Unsupervised Learning¶
Things to Learn
Definitions
Unsupervised Learning
Kmeans. What is k in K-means?
Hierarchial Clustering
Aggolomeration Clustering
DBScan
Dimensionality Reduction( or Algorithms ) + Benefits
Problems
What is Sentiment Analysis?
Can Decision trees be used for clustering?
Specifics about algorithm
Data Cleaning and Missing Value Treatment
Most appropriate strategy for Data Cleaning before applying K-means
Number of Features (to use)
Initialization before Kmeans
Termination Condition
Optimizing Number of Clusters
Elbow Method
Manhattand Distance
GAP Analysis
Silhouette Analysis
Convergence
Local vs Global Minima (Kmeans)
Sensitivity to outliers
When will the algorithm fail?
Metrics to be used
Why feature scaling is necessary for K-means
Visualization
Dendogram
Aggolomeration
Soft Assignment
Fuzzy Kmeans
Gausian Mixture Models
Multinomial Mixture Models + Expectation Maximization
Distances
Statistical Distances
Problem Domains and Examples
Google News
Gene : How much a gene is present?
Organize Computer Cluster
Social Network Analysis
Market Segmentation
Astronomical Data Analysis
Cocktail Party problem
Useful Links¶
Unsupervised Learning
YouTubeVideo("AA2g9ItoAc", width=800, height=600)
Kmeans Clustering
YouTubeVideo("4b5d3muPQmA", width=800, height=600)
Hierarchical Clustering
YouTubeVideo("7xHsRkOdVwo", width=800, height=600)
Isolation Forest for Anomaly Detection
Customer Segmentation
YouTubeVideo("4NDORb4HBkw", width=800, height=600)
Cohort Analysis and RFM
https://www.kaggle.com/tklimonova/online-retail-cohort-and-rfm-analysis
https://clevertap.com/blog/cohort-analysis/
https://www.kaggle.com/mahmoudelfahl/cohort-analysis-customer-segmentation-with-rfm#Cohort-Analysis-(Retention-over-User-&-Product-Lifetime)
https://towardsdatascience.com/who-is-your-golden-goose-cohort-analysis-50c9de5dbd31
A/B Testing
- https://medium.com/@robbiegeoghegan/implementing-a-b-tests-in-python-514e9eb5b3a1#:~:text=The%20minimum%20sample%20size%20of,to%20conduct%20the%20hypothesis%20testing.
File "<ipython-input-3-67e8396cc2ed>", line 1
- https://medium.com/@robbiegeoghegan/implementing-a-b-tests-in-python-514e9eb5b3a1#:~:text=The%20minimum%20sample%20size%20of,to%20conduct%20the%20hypothesis%20testing.
^
SyntaxError: invalid syntax