wilsonzhang746, Author at We provide R, Python, Statistics Online-Learning Course

Python Basic Course

For online Python training registration, click here ! Python has been regarded as one the most used programming languages in the last decade. With the development of AI, Python’s popularity remains even stronger. Python’s practicability lies not only in general data analysis, but also in web building, machine learning, deep Read more…

By wilsonzhang746, 2 yearsApril 30, 2024 ago

R Basic Course

Click here for online course registration ! R is a popular programming language, It is free and open source. R has been widely used in the field of statistical data analysis, econometrics, machine learning , just name a few in the last almost three decades. R is regarded as the Read more…

By wilsonzhang746, 2 yearsApril 30, 2024 ago

R Advanced Data Management

Aggregating data using aggregate() in R

In R programming, function aggregate() provides an easy way to calculate summary statistics of variables by specific groups in a data frame.

By wilsonzhang746, 2 yearsApril 19, 2024 ago

Linear Regression with R

Assessing Normality Assumption for Linear Regression in R

The normality assumption in linear regression is necessary to ensure the estimates of parameters are unbiased and the hypothesis testing is correct. It states that for the fixed or given values of explanatory variables, the dependent variables are normally distributed around the mean 0. It is equivalent to say that the residuals after model estimation follow a normal distribution with the mean 0.

By wilsonzhang746, 2 yearsApril 13, 2024 ago

Python

Returning a dictionary with functions in Python

Dictionary is a data structure type in Python. One reason for why Python is so popular among programmers is that dictionary provides an useful and effective way to store key-value pairs information. When functions in Python carry out some tasks, it can return required information to a dictionary.

By wilsonzhang746, 2 yearsApril 9, 2024 ago

R Basic Data Management

Dealing with missing values in R

In R programming, the value ‘NA’ is used to represent a missing value. Say we try to read a csv file from working directory and generate a data frame. There several places in the csv file have value ‘999’, which means missing value due to various circumstances during data survey and collection.

By wilsonzhang746, 2 yearsApril 6, 2024 ago

R Basic Data Management

Recode variables in R

In data analysis it is often needed to set new values to a variable based on one or several conditions, and these kinds of operations are called recode variables. The most frequently applied recoding variables in R may be the setting some values to missing values (NA), and recoding a continuous to the values of a categorical variable.

By wilsonzhang746, 2 yearsApril 3, 2024 ago

R Basic Data Management

Working with date values in R

R has rich resources of functions dealing with date values in data analysis. In this post, we introduce and show how to use as.Date() function for working with data values in R. as.Date() accepts a string input with specific format and transform the value to a date object in R.

By wilsonzhang746, 2 yearsMarch 31, 2024 ago

R Data Structure Dataset

How to generate random numbers from various statistical distributions using R

n this post, we show how to generate random numbers into vector and matrix in R programming, from various statistical distributions. Specifically, we focus several basic and widely used statistical distributions here, namely, normal distribution, continuous uniform distribution, binomial distribution and Poisson distribution.

By wilsonzhang746, 2 yearsMarch 29, 2024 ago

R Data Structure Dataset

Creating a random sequence using sample() in R

It is not uncommon to generate random sequences in R programming. sample() function provides the feasibility of generating such random objects from given vectors, either with or without replacement. The following code shows an example that 32 numbers with replacement drawn from 50 integers from 1 to 50.

By wilsonzhang746, 2 yearsMarch 27, 2024 ago