Clustering Algorithms In ML

Agglomerative Clustering

Oct 8, 2024

—

by

Agglomerative clustering is a hierarchical clustering algorithm that starts with each data point as its own cluster and iteratively merges the closest clusters until a stopping criterion is reached. It is a bottom-up approach that produces a dendrogram, which is a tree-like diagram that shows the hierarchical relationship between the clusters. The algorithm can be…

Distribution-Based Clustering

Oct 8, 2024

—

by

Tayyaba Syed

in Clustering Algorithms In ML

Distribution-based clustering algorithms, also known as probabilistic clustering algorithms, are a class of machine learning algorithms that assume that the data points are generated from a mixture of probability distributions. These algorithms aim to identify the underlying probability distributions that generate the data, and use this information to cluster the data into groups with similar…

Affinity Propagation

Oct 8, 2024

—

by

Tayyaba Syed

in Clustering Algorithms In ML

Affinity Propagation is a clustering algorithm that identifies “exemplars” in a dataset and assigns each data point to one of these exemplars. It is a type of clustering algorithm that does not require a pre-specified number of clusters, making it a useful tool for exploratory data analysis. Affinity Propagation was introduced by Frey and Dueck…

BIRCH Clustering

Oct 8, 2024

—

by

Tayyaba Syed

in Clustering Algorithms In ML

BIRCH (Balanced Iterative Reducing and Clustering hierarchies) is a hierarchical clustering algorithm that is designed to handle large datasets efficiently. The algorithm builds a treelike structure of clusters by recursively partitioning the data into subclusters until a stopping criterion is met. BIRCH uses two main data structures to represent the clusters: Clustering Feature (CF) and…

HDBSCAN Clustering

Oct 8, 2024

—

by

Tayyaba Syed

in Clustering Algorithms In ML

Working of HDBSCAN Clustering HDBSCAN builds a hierarchy of clusters using a mutual-reachability graph, which is a graph where each data point is a node and the edges between them are weighted by a measure of similarity or distance. The graph is built by connecting two points with an edge if their mutual reachability distance…

OPTICS Clustering

Oct 8, 2024

—

by

Tayyaba Syed

in Clustering Algorithms In ML

OPTICS is like DBSCAN (Density-Based Spatial Clustering of Applications with Noise), another popular density-based clustering algorithm. However, OPTICS has several advantages over DBSCAN, including the ability to identify clusters of varying densities, the ability to handle noise, and the ability to produce a hierarchical clustering structure. Implementation of OPTICS in Python To implement OPTICS clustering…

DBSCAN Clustering

Oct 8, 2024

—

by

Tayyaba Syed

in Clustering Algorithms In ML

The DBSCAN Clustering algorithm works as follows − Implementation in Python We can implement the DBSCAN algorithm in Python using the scikit-learn library. Here are the steps to do so − Load the dataset The first step is to load the dataset. We will use the make_moons function from the scikitlearn library to generate a toy dataset…

Density-Based Clustering

Oct 8, 2024

—

by

Tayyaba Syed

in Clustering Algorithms In ML

Density-based clustering is based on the idea that clusters are regions of high density separated by regions of low density. Popular Density-based Clustering Algorithms Here are the most common density-based clustering algorithms − DBSCAN Clustering The DBSCAN (Density-Based Spatial Clustering of Applications with Noise) algorithm is one of the most common density-based clustering algorithms. The DBSCAN algorithm…

Hierarchical Clustering

Oct 8, 2024

—

by

Tayyaba Syed

in Clustering Algorithms In ML

Hierarchical clustering is another unsupervised learning algorithm that is used to group together the unlabeled data points having similar characteristics. Hierarchical clustering algorithms falls into following two categories − Steps to Perform Agglomerative Hierarchical Clustering We are going to explain the most used and important Hierarchical clustering i.e. agglomerative. The steps to perform the same…

Mean-Shift Clustering

Oct 8, 2024

—

by

Tayyaba Syed

in Clustering Algorithms In ML

The Mean-Shift clustering algorithm is a non-parametric clustering algorithm that works by iteratively shifting the mean of a data point towards the densest area of the data. The densest area of the data is determined by the kernel function, which is a function that assigns weights to the data points based on their distance from…

Category: Clustering Algorithms In ML