Month: November 2020

Artificial Intelligence Image
DataScience
Python

3 ways to do dimensionality reduction techniques in Scikit-learn

Dimensionality reduction techniques are used to extract informative features that could be used for later learning, etc. Tested PCA, MDS and t-SNE for the breast cancer dataset.

Read More
Cloud Service Image
AWS

AWS Aurora Global database deploy with CloudFormation

AWS Aurora global databases support both high availability and scalability in a cross region fashion. Here’s the rough guide how to deploy clusters in CloudFromation.

Read More
Chart Analysis Image
DataScience
Python
Statistics

Scikit-learn LinearRegression vs Numpy Polyfit

What are differences between linear regression and polynomial regression? We must know these techniques well but it is still vague somewhat.

Read More
Social Media Image
DataScience
Python
Statistics

What are standarization and normalization? Test with iris data set in Scikit-learn

Standarization and normalization are essential techniques to apply your data set before using classifiers in some cases. I tested both with iris data set.

Read More
Analytics Image
Python
Statistics

3 ways to do test of normality with Scipy library in Python

This is the rough guide how to do test of normality for dataset by leveraging Scipy library. I used probplot, the Shapiro-Wilk test and skewness, kurtosis.

Read More
Computer Analysis Image
Crypto
DataScience
Python

Coefficient of variation to compare your crypto assets

This guide shows how to obtain crypto data via CoinGecko API and calculate the coefficient of variation that is a statistical measure of the dispersion of data points.

Read More
Cloud Service Image
AWS

Beginner Tutorial: cloud-init in AWS

cloud-init is an industry standard tool to initialize instance with cross-platform and multi-distribution characteristics. Explains how it works in AWS.

Read More