Handling NaN (not a number) values in Pandas Series using Python

We provide affordable online training courses (via Zoom meetings) for Python and R programming at a fundamental level; click here for more details. In Python programming, NaN (not a number) values denote missing values and values that are undefined in a calculation, such as zero divided by zero or the logarithm of a negative number in NumPy arithmetic. Pandas allows NaN values to be assigned to Series and Data Frames. Two useful functions, isnull() and notnull(), return a boolean object such that Read more…
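
As a minimal sketch of the two functions named in this excerpt (the sample values and index labels are made up for illustration), the boolean result can also serve as a mask:

```python
import numpy as np
import pandas as pd

# Hypothetical sample data; np.nan marks the missing entry.
s = pd.Series([1.0, np.nan, 3.0], index=["a", "b", "c"])

missing = s.isnull()    # True where the value is NaN
present = s.notnull()   # True where the value is not NaN

# A boolean Series can be used as a mask to keep only the non-missing values.
clean = s[present]
```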

Working with duplicate values in Pandas Series with Python

When a Pandas Series is created in Python, its values can be checked for duplicates. Pandas provides several handy functions for dealing with duplicate values in a Series: unique() returns the unique values of the object, value_counts() lists the frequency of each unique value, and isin() returns a boolean Series in Read more…
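
The three functions mentioned in this excerpt can be tried on a small Series. This is only a sketch with made-up values, not the post's own example:

```python
import pandas as pd

# Hypothetical sample data containing repeated values.
s = pd.Series(["a", "b", "a", "c", "b", "a"])

uniques = s.unique()        # distinct values, in order of first appearance
counts = s.value_counts()   # frequency of each distinct value
mask = s.isin(["a", "c"])   # True where the value is one of the listed values
```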

How to filter a Pandas Series in Python

A Series is a data structure that stores one-dimensional labeled data in the Pandas module in Python. Once a Series is created, its values can be filtered using the loc[] indexer, and the result can be stored in a new Series. The filtering condition can be any combination of conditional statements. The isin() function can also Read more…
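
A short sketch of this kind of filtering, assuming made-up values and labels. Note that `.loc` is used with square brackets, since it is an indexer rather than a function call:

```python
import pandas as pd

# Hypothetical sample data.
s = pd.Series([5, 12, 8, 20, 3], index=["a", "b", "c", "d", "e"])

# Conditions are combined with & (and) or | (or); each one needs its own parentheses.
mid = s.loc[(s > 4) & (s < 15)]

# isin() builds the boolean mask from a list of accepted values.
picked = s.loc[s.isin([3, 20])]
```

Both results are new Series that keep the original labels of the selected values.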

How to filter values of Numpy array in Python

Once a NumPy array is created, its values can be filtered, mainly via conditional tests. We provide several examples of filtering an array below. NumPy also provides the in1d() function, which uses membership in a list of values as the filtering condition. You can also watch videos on our YouTube channel for more understanding of Read more…
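
A minimal sketch of both filtering styles, with made-up values. It uses `np.isin()` for the membership test rather than the `in1d()` function named in the excerpt, because recent NumPy releases recommend `isin()` instead; that substitution is a choice made here, not the post's:

```python
import numpy as np

# Hypothetical sample data.
arr = np.array([5, 12, 8, 20, 3])

# A conditional test produces a boolean array that selects matching elements.
big = arr[arr > 6]

# Membership test against a list of values.
in_list = arr[np.isin(arr, [3, 20])]
```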

How to create Pandas Series from Numpy array in Python

The Pandas module in Python provides two data structures, namely Series and Data Frame. While a Data Frame is a tabular dataset that works much like a spreadsheet, a Series is just a one-dimensional data object with labels. A Series can be created from manually entered values with the pd.Series() function, or by inputting a NumPy Read more…
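
As an illustration of passing a NumPy array to `pd.Series()` (the values and labels below are made up), a sketch might look like this:

```python
import numpy as np
import pandas as pd

# Hypothetical sample data.
values = np.array([1.5, 2.5, 3.5])

# Without an index argument, pandas assigns integer labels 0, 1, 2, ...
default_idx = pd.Series(values)

# Custom labels can be supplied through the index argument.
labeled = pd.Series(values, index=["x", "y", "z"])
```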

How to create Pandas data structure Series in Python

Pandas is a Python library designed for working with labeled data. It provides two data structure types, Series and Data Frame. A Series holds one-dimensional labeled data, and a Data Frame is used for tabular data storage. A Series stores data values and their corresponding labels in two arrays: Read more…
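
The excerpt is cut off before it names the two arrays. Assuming it refers to the `values` and `index` attributes of a Series, a small sketch with made-up data shows how they relate:

```python
import pandas as pd

# Hypothetical sample data.
s = pd.Series([10, 20, 30], index=["a", "b", "c"])

vals = s.values    # the data values, as a NumPy array
labels = s.index   # the corresponding labels, as a pandas Index

# A label looks up the value stored at the same position.
b_value = s["b"]
```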

Evaluate relative importance of variables in regression using standardized regression coefficients approach in R

We provide effective and economically affordable online training courses for R and Python; click here for more details and course registration! In linear regression analysis, one is often interested in the relative importance of the independent variables, that is, which variable contributes most to explaining the variation of the response variable. There are several approaches for this purpose. One of them is standardized regression coefficients, which measure how much the response variable changes, in standard deviation Read more…
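
The post itself works in R. Purely as an illustration of the idea, here is a Python sketch on simulated data (the variables, seed, and coefficients are all made up). It assumes the usual definition: fit the regression after converting every variable to z-scores, which is equivalent to rescaling each raw slope by sd(x)/sd(y).

```python
import numpy as np

rng = np.random.default_rng(0)   # fixed seed so the simulated data are reproducible
n = 200
x1 = rng.normal(0.0, 1.0, n)     # predictor with a small spread
x2 = rng.normal(0.0, 10.0, n)    # predictor with a large spread
y = 3.0 * x1 + 0.5 * x2 + rng.normal(0.0, 1.0, n)

def zscore(v):
    """Center to mean 0 and scale to sample standard deviation 1."""
    return (v - v.mean()) / v.std(ddof=1)

# Standardized coefficients: least squares on z-scored variables.
# No intercept column is needed because every z-scored variable has mean 0.
Z = np.column_stack([zscore(x1), zscore(x2)])
beta_std, *_ = np.linalg.lstsq(Z, zscore(y), rcond=None)

# Same quantities obtained by rescaling the raw slopes by sd(x) / sd(y).
X = np.column_stack([np.ones(n), x1, x2])
beta_raw, *_ = np.linalg.lstsq(X, y, rcond=None)
sd_x = np.array([x1.std(ddof=1), x2.std(ddof=1)])
rescaled = beta_raw[1:] * sd_x / y.std(ddof=1)
```

In this simulated setup x1 has the larger raw slope, but x2 ends up with the larger standardized coefficient because it varies far more, which is the kind of comparison the raw slopes alone cannot show.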

How to calculate probabilities for Exponential distributions in R

The exponential distribution is a continuous distribution that models the random time between events (the time until the next event occurs) in a Poisson process, which is a process with a constant occurrence rate and independence among individual events. The exponential distribution has the following probability density function: $f(x) = \frac{1}{\beta} e^{-x/\beta}$ for $x \ge 0$, where the positive parameter β denotes the mean time between events, and Read more…
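
The post's R code is not reproduced here. As a cross-check in plain Python, the sketch below assumes the standard exponential density with mean β and the matching cumulative probability 1 − e^(−x/β); the function names and the numbers used are made up for illustration.

```python
import math

def exp_pdf(x, beta):
    """Density of an exponential distribution whose mean is beta."""
    return math.exp(-x / beta) / beta if x >= 0 else 0.0

def exp_cdf(x, beta):
    """P(X <= x) for an exponential distribution whose mean is beta."""
    return 1.0 - math.exp(-x / beta) if x >= 0 else 0.0

beta = 2.0                            # hypothetical mean time between events
p_within_3 = exp_cdf(3.0, beta)       # probability the next event occurs within 3 time units
p_after_3 = 1.0 - p_within_3          # probability of waiting longer than 3 time units
```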

How to calculate normal distributions in R

The normal distribution is a continuous distribution with a bell-shaped probability density curve. It is widely used in statistical data analysis and is the basis for many other distributions as well. The probability density function can be expressed as $f(x) = \frac{1}{\sigma\sqrt{2\pi}} e^{-\frac{(x-\mu)^2}{2\sigma^2}}$, in which the two parameters μ and σ are its mean and standard deviation, respectively. Read more…
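
The post covers these calculations in R. For a quick check in Python, the standard library's `statistics.NormalDist` (Python 3.8 or later) offers density, cumulative probability, and quantile methods; the mean and standard deviation below are made up for illustration.

```python
from statistics import NormalDist

mu, sigma = 100.0, 15.0                # hypothetical mean and standard deviation
dist = NormalDist(mu, sigma)

density_at_mean = dist.pdf(mu)         # height of the bell curve at its peak
p_below = dist.cdf(mu)                 # P(X <= mu)
# Probability of falling within one standard deviation of the mean.
p_within_1sd = dist.cdf(mu + sigma) - dist.cdf(mu - sigma)
median = dist.inv_cdf(0.5)             # quantile function (inverse of the CDF)
```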

How to calculate Continuous Uniform distribution probabilities in R

The continuous uniform distribution is used to model a flat probability density over a range (A,B). Its probability density can be written as $f(x) = \frac{1}{B-A}$ for $A \le x \le B$, and 0 otherwise. The following figure shows the probability density curve for a continuous uniform distribution defined on the range (1,3). In R programming, we can use the following functions in computations with the continuous uniform distribution. dunif() Read more…
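
The R functions from the post (such as dunif()) are not reproduced here. As a plain Python sketch of the same calculations, assuming the usual flat density 1/(B−A) on the range and using the (1,3) range from the excerpt, with made-up function names:

```python
def unif_pdf(x, a, b):
    """Density of a continuous uniform distribution on [a, b]."""
    return 1.0 / (b - a) if a <= x <= b else 0.0

def unif_cdf(x, a, b):
    """P(X <= x) for a continuous uniform distribution on [a, b]."""
    if x <= a:
        return 0.0
    if x >= b:
        return 1.0
    return (x - a) / (b - a)

A, B = 1.0, 3.0
height = unif_pdf(2.0, A, B)                            # constant density inside the range
p_below = unif_cdf(2.5, A, B)                           # P(X <= 2.5)
p_between = unif_cdf(2.5, A, B) - unif_cdf(1.5, A, B)   # P(1.5 < X <= 2.5)
```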