wilsonzhang746, Author at We provide R, Python, Statistics Online-Learning Course

Estimating simple linear regression model using lm() in R

Linear regression is widely used to model the relationship between a response (dependent) variable and one or more explanatory (independent) variables. The model is linear in its parameters. When the model includes only one explanatory variable, it is called simple linear regression.

By wilsonzhang746, March 25, 2024 (2 years ago)

Python

Using exponential distributions in Python

The exponential distribution models the random waiting time until the next event occurs in a Poisson process. In a Poisson process, events occur at a constant rate over time. The exponential distribution is a special case of the gamma distribution in which the shape parameter (alpha) equals 1.

By wilsonzhang746, March 18, 2024 (2 years ago)
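
The relationship described in the excerpt above can be checked numerically. The following is a minimal sketch using only the Python standard library, not code from the post; the function names and the rate value are assumptions chosen for illustration.

```python
import math

def expon_pdf(x, rate):
    """Density of the exponential distribution with event rate `rate`."""
    return rate * math.exp(-rate * x) if x >= 0 else 0.0

def expon_cdf(x, rate):
    """P(T <= x): probability that the next event occurs within time x."""
    return 1.0 - math.exp(-rate * x) if x >= 0 else 0.0

def gamma_pdf(x, shape, rate):
    """Density of the gamma distribution (shape/rate parameterization)."""
    if x < 0:
        return 0.0
    return rate ** shape * x ** (shape - 1) * math.exp(-rate * x) / math.gamma(shape)

rate = 2.0  # hypothetical rate: 2 events per unit time

for x in (0.1, 0.5, 1.0):
    # With shape = 1 the gamma density reduces to the exponential density.
    assert math.isclose(expon_pdf(x, rate), gamma_pdf(x, 1.0, rate))

# Probability that the next event arrives within 0.5 time units: 1 - exp(-1)
p_within_half = expon_cdf(0.5, rate)
print(round(p_within_half, 4))  # 0.6321
```

A library such as SciPy offers the same quantities ready-made; the hand-written functions here only keep the sketch dependency-free.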

Python

Append, insert and remove elements of lists in Python

The elements of a list can be added or removed after it has been created.
To add new elements at the end of a list, Py

By wilsonzhang746, March 11, 2024 (2 years ago)
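
The operations named in the post title above (append, insert, remove) can be sketched with Python's built-in list methods. The list `scores` and its values are hypothetical and do not come from the post, whose excerpt is cut off on this page.

```python
scores = [70, 85]        # hypothetical starting list

scores.append(90)        # add at the end          -> [70, 85, 90]
scores.insert(1, 78)     # insert before index 1   -> [70, 78, 85, 90]
scores.remove(85)        # remove first value 85   -> [70, 78, 90]
last = scores.pop()      # remove and return last  -> 90, list is [70, 78]
del scores[0]            # delete by index         -> [78]

print(scores, last)      # [78] 90
```

Note that `remove()` deletes by value and raises `ValueError` if the value is absent, while `pop()` and `del` work by position.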

Python

Working with Python classes and instances

Python uses classes for object-oriented programming. A class represents the general behavior or information that the programmer or data analyst is interested in. Once a class is defined, particular objects belonging to it can be created; this process is called instantiation. A class contains attributes and methods (functions defined in the class) for general use. Instance attributes can be modified by assigning new values directly or by calling methods defined in the class.

By wilsonzhang746, March 4, 2024 (2 years ago)
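
The ideas in the excerpt above (defining a class, instantiation, and the two ways of modifying an attribute) can be illustrated with a short sketch. The class `Student`, its attributes, and its methods are hypothetical names invented for this example, not taken from the post.

```python
class Student:
    """A hypothetical class; the name and attributes are illustrative only."""

    def __init__(self, name, score):
        # Attributes set for each instance at creation time.
        self.name = name
        self.score = score

    def update_score(self, new_score):
        """Modify an attribute through a method defined in the class."""
        self.score = new_score

    def describe(self):
        """Return a short text summary of the instance."""
        return f"{self.name}: {self.score}"


s = Student("Ann", 80)   # instantiation: create an object of the class
s.score = 85             # modify an attribute by direct assignment
s.update_score(90)       # modify the same attribute through a method
print(s.describe())      # Ann: 90
```

Routing changes through a method such as `update_score()` leaves room to add validation later without changing the calling code.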

R Programming

Calculating partial correlations with R

In data analysis, the variables in a dataset are usually mutually correlated. Sometimes we want to measure the relationship between two variables alone, with the influence of the other variables controlled for. A partial correlation serves this purpose.

By wilsonzhang746, February 29, 2024 (2 years ago)

Python

Working with normal distributions in Python

The normal distribution describes random variables with bell-shaped probability density functions. It is widely used in data science because the mean of a large sample of independent draws from almost any distribution with finite variance is approximately normally distributed. The probability density function of the normal distribution is determined by two parameters: the mean (mu) and the standard deviation (sigma).

By wilsonzhang746, February 25, 2024 (2 years ago)
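
Both points in the excerpt above, the two-parameter density and the approximate normality of sample means, can be sketched with the standard library alone. This is an illustrative example rather than the post's code; the parameter values, sample sizes, and random seed are assumptions.

```python
import random
from statistics import NormalDist, mean, stdev

# A normal distribution is fully specified by mu and sigma.
mu, sigma = 0.0, 1.0
dist = NormalDist(mu, sigma)

peak = dist.pdf(mu)  # density at the mean: 1 / (sigma * sqrt(2 * pi))
within_one_sd = dist.cdf(mu + sigma) - dist.cdf(mu - sigma)
print(round(peak, 4), round(within_one_sd, 4))  # 0.3989 0.6827

# Sample means of non-normal data: 2000 samples of 50 uniform(0, 1) draws each.
random.seed(0)  # arbitrary seed, fixed only for reproducibility
sample_means = [mean(random.random() for _ in range(50)) for _ in range(2000)]

# Theory predicts a centre near 0.5 and a spread near sqrt(1/12 / 50), about 0.041.
print(mean(sample_means), stdev(sample_means))
```

A histogram of `sample_means` would show the bell shape even though each individual draw is uniform.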

R Programming

Calculate point-biserial and biserial correlations using R

A correlation, usually the Pearson correlation, is normally calculated between two continuous variables. This requirement does not exclude the case in which one of the two variables is dichotomous (binary). For example, if we want to measure the correlation between height and gender for a group of people, the variable gender takes clearly dichotomous values. This kind of Pearson correlation is called a point-biserial correlation, because the gender variable is coded strictly as 0 or 1.

By wilsonzhang746, February 21, 2024 (2 years ago)

Dplyr

Use spread() of dplyr in R to convert long-form dataset into wide-form

dplyr is a package in the tidyverse framework. It supports the pipe operator (%>%), which chains multiple functions into one statement and makes data management more efficient. spread(), which is provided by the companion tidyverse package tidyr rather than by dplyr itself, takes the values of a key column in the current data frame and turns them into column names in the resulting data frame.

By wilsonzhang746, February 15, 2024 (2 years ago)

R Data Structure Dataset

Creating data frames using read.table() and read.csv() in R

The R language provides several useful functions for importing delimited files and creating data frames. These files are often stored in a local directory, with extensions such as ‘txt’, ‘csv’, or ‘dat’. The most widely used functions for importing them in R are read.table() and read.csv().

By wilsonzhang746, January 26, 2024 (2 years ago)

Python

Creating and indexing lists in Python

A list is one of the simplest data structures in Python. It stores an ordered collection of elements, typically of the same type (numeric, string, etc.), although mixed types are allowed. In Python, a pair of square brackets [] indicates that the object is a list. For example, the following two statements create two lists, one numeric and the other of string type.

By wilsonzhang746, January 23, 2024 (2 years ago)
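
The two statements mentioned in the excerpt above are not reproduced on this page. As a hypothetical stand-in, the sketch below creates one numeric list and one string list and shows basic indexing; the variable names and values are assumptions.

```python
numbers = [3, 1, 4, 1, 5]          # a numeric list
names = ["Ann", "Bob", "Cy"]       # a list of strings

first = numbers[0]                 # indexing starts at 0
last_name = names[-1]              # negative indexes count from the end
middle = numbers[1:3]              # slice: indexes 1 and 2, end excluded

print(first)        # 3
print(last_name)    # Cy
print(middle)       # [1, 4]
print(len(names))   # 3
```

Slicing returns a new list, so changing `middle` leaves `numbers` untouched.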