How to create factor variables in R programming

Categorical variables, including nominal and ordinal variables in R programming language are called factor variables. For example, gender(male/female) is nominal, and survey results (excellent, good, normal, bad) have ordinal values. Categorical variables are useful because many data analysis operations are related to values in different categories, such as contingency tables between two categorical variables for independence analysis, hypothesis testing of homogeneity of variances, just name a few.

Kernel density plots with ggplot2 in R

Kernel density function is a nonparametric method to find the drawing density curve of random samples, and it is often used to draw a smoothed curve in data visualization. In R programming with ggplot2 package, a chaining of functions ggplot() and geom_density() is often used to draw different smoothed curves showing the distribution of continuous variables.

Using a function with a while loop in Python

A function in Python is a group of code statements wrapped together to perform specific tasks. After a function is defined, then it can be called by passing real values to its arguments and get the returning results. A while loop in Python is a group of conditional statements bundled in a statement beginning with keyword ‘while’, and the codes will run forever until the condition returns false. By including a user-defined function inside a while loop in Python, many iterative tasks can be fulfilled.

Calculating Type I Error and Type II Error in Hypothesis Testing using Python

In hypothesis testing, the possibility of the other side than the conclusion usually exists, and the analysis commits so-called Type I and Type II errors, with respect to the truth and the decision made upon the random sample and hypotheses. In particular, a Type I error measures the probability that a true Null hypothesis (H0) is incorrectly rejected, and a Type II error says the probability that a false H0 not being rejected, respectively.