wilsonzhang746

Working with normal distributions in Python

Normal distribution is describing random variables with bell-shaped probability density functions. Normal distribution is widely used in data science because…

2 years ago

Calculate point-biserial and biserial correlations using R

When a correlation, usually Person type correlation, is calculated, two variables have to be continuous. But this requirement does not…

2 years ago

Use spread() of dplyr in R to convert long-form dataset into wide-form

dplyr is a package that belongs to tidyverse framework. dplyr allows usage of pipeline structure (%>%), which can chain multiple…

2 years ago

Creating data frames using read.table() and read.csv() in R

R language provides several useful functions for importing delimited files and creating data frames. These delimited files are often stored…

2 years ago

Creating and indexing lists in Python

List is the simplest type of data structure in Python programming. A list is used to store a collection of…

2 years ago

How to install Anaconda and start programming with Python?

Python is among the most popular programming language for data science nowadays, and getting started with Python is quite easy.…

2 years ago

How to create factor variables in R programming

Categorical variables, including nominal and ordinal variables in R programming language are called factor variables. For example, gender(male/female) is nominal,…

2 years ago

Getting started working with R programming language

When R software and RStudio program have been installed on your computer, R working session can be simply started by…

2 years ago

Creating and indexing a list in R

list is a type of data structure in R programming language. Unlike other data structures, especially matrix and vector, in…

2 years ago

Violin plots with ggplot2 in R

Violin plot is quite similar as boxplot, in the sense that it shows the range of the data. And at…

2 years ago