Category: Python
How to Use Pandas Melt() Function
This short tutorial shows you how you can use melt() funtion in Pandas. It is often used when we need to change the format of dataframe to fit into a certain statistical functions. Example 1 of Using melt() City1 City2...
Read Full Article →
What is One-way ANOVA? Formula and Example
One-Way ANOVA is to compare the means of different groups, to see whether the mean difference is statistically significant. For instance, you would like to compare the average household size of three cities. You can collect 3 samples from these...
Read Full Article →
LaTex Formula Cheatsheet
This page includes statistics formulas in raw LaTex code. It is painful sometimes to write a complex formula and thus I hope this page is useful for those need to write them. In case you need to find symbols in...
Read Full Article →
When to Use t-test versus Correlation in Data Analysis
Since both correlation and t-test are about relationships between X and Y, what is the difference between them and when do you use t-test (or correlation)? This tutorial aims to answer these two questions. The following figure presents the difference...
Read Full Article →
How to Do Scatter Plots in Python
This tutorial shows how to use Pandas, Matplotlib, and Seaborn for scatter plots in Python with examples, codes, and charts. There are two methods of doing scatter plots in Python. The following shows the core syntax. Pandas: df.plot (kind=”scatter”, x=”column_x”,...
Read Full Article →
Plot Histogram in Python
Introduction We can use hist() in Matplotlib, pandas, and seaborn to plot histograms in Python. The following is the basic syntax of plotting histogram using these 3 different modules. Method 1: Using matplotlib plt.hist(data,bins=’auto’) Method 2: Using pandas pd.hist() Method...
Read Full Article →
How to Calculate Standard Deviation in Python (NumPy)
This short tutorial shows how you can calculate standard deviation in Python using NumPy.
Read Full Article →
How to Calculate Mean in Python (NumPy)
This short tutorial shows how you can calculate mean in Python using NumPy. First, we generate the random data with mean of 5 and standard deviation (SD) of 1. Then, you can use the numpy is mean() function. As you...
Read Full Article →
Generate Sample of Normal Distribution in Python NumPy
This tutorial shows how to generate a sample of normal distrubution using NumPy in Python. The following shows syntax of two methods. Method 1: It can change the default values (Default: mu=0 and sd=1). np.random.normal(mu=0, sigma=1, size) Method 2: It...
Read Full Article →
When to Use Bar Charts versus Line Charts in Data Visualization (Python Examples)
This tutorial explains when to use bar charts versus line charts in data visualization. I will use examples, including data, Python code, and actual charts to illustrate the difference. When Bar Charts are Better than Line Charts You can use...
Read Full Article →