Dimensionality reduction techniques are used to extract informative features that could be used for later learning, etc. Tested PCA, MDS and t-SNE for the breast cancer dataset.
Read MoreAWS Aurora global databases support both high availability and scalability in a cross region fashion. Here’s the rough guide how to deploy clusters in CloudFromation.
Read MoreWhat are differences between linear regression and polynomial regression? We must know these techniques well but it is still vague somewhat.
Read MoreStandarization and normalization are essential techniques to apply your data set before using classifiers in some cases. I tested both with iris data set.
Read MoreThis is the rough guide how to do test of normality for dataset by leveraging Scipy library. I used probplot, the Shapiro-Wilk test and skewness, kurtosis.
Read MoreThis guide shows how to obtain crypto data via CoinGecko API and calculate the coefficient of variation that is a statistical measure of the dispersion of data points.
Read Morecloud-init is an industry standard tool to initialize instance with cross-platform and multi-distribution characteristics. Explains how it works in AWS.
Read More