clustering
Functions
|
Calculates the variance explained for a PCA of the permuted data. |
|
Calculates the optimal number of principal components to keep in a dimension reduction situation. |
|
Uses HDBSCAN to calculate clusters from the reduced data. |
|
This function calculates clusters based on the reduced vectors. |
|
Reduces the number of dimensions using PCA. |
|
Uses UMAP to reduce the dimensionality of the embeddings. |
|
Shuffles the data by each column or row for a pandas dataframe. |
|
Run a simple t test on a sample to see if it is significantly different from the population mean. |