Found 2038 Articles for R Programming

How to create a sample or samples using probability distribution in R?

Nizamuddin Siddiqui
Updated on 13-Aug-2021 11:38:15

3K+ Views

A probability distribution is the type of distribution that gives a specific probability to each value in the data set. For example, if we have a variable say X that contains three values say 1, 2, and 3 and each of them occurs with the probability defined as 0.25, 0.50, and 0.25 respectively then the function that gives the probability of occurrence of each value in X is called the probability distribution. In R, we can create the sample or samples using probability distribution if we have a predefined probabilities for each value or by using known distributions such as ... Read More

How to subset rows based on criterion of multiple numerical columns in R data frame?

Nizamuddin Siddiqui
Updated on 13-Aug-2021 11:33:31

244 Views

If we want to create a subset of a data frame based on multiple numerical columns then we can follow the below steps −Creating a data frame.Subsetting the data frame with the help of filter function of dplyr package.Create the data frameLet's create a data frame as shown below − Live Demox1

How to subset an R data frame if numerical column is greater than a certain value for a particular category in grouping column?

Nizamuddin Siddiqui
Updated on 13-Aug-2021 11:32:22

260 Views

Subsetting is one of the commonly used technique which serves many different purposes depending on the objective of analysis. To subset a data frame if numerical column is greater than a certain value for a particular category in grouping column then we need to follow the below steps −Creating a data frame.Subsetting the data frame with the help of filter function of dplyr package.Create the data frameLet's create a data frame as shown below − Live Demox

How to subset an R data frame if one of the supplied grouping values is found and numerical column value is greater than a certain value?

Nizamuddin Siddiqui
Updated on 13-Aug-2021 11:30:47

102 Views

Subsetting is one of the commonly used technique which serves many different purposes depending on the objective of analysis. To subset a data frame if one of the supplied grouping values is found means that we want to subset if any of the categorical variable values is present in the categorical column and if we want to include a numerical column for a greater than value then we need to follow the below steps −Creating a data frame.Subsetting the data frame if any of the supplied value of categorical variable exist and a numerical column value is greater than a ... Read More

How to subset an R data frame if one of the supplied grouping values is found?

Nizamuddin Siddiqui
Updated on 13-Aug-2021 11:29:11

30 Views

Subsetting is one of the commonly used technique which serves many different purposes depending on the objective of analysis. To subset a data frame if one of the supplied grouping values is found means that we want to subset if any of the categorical variable values is present in the categorical column, for this purpose we can follow the below steps −Creating a data frame.Subsetting the data frame if any of the supplied value of categorical variable exist.Create the data frameLet's create a data frame as shown below − Live Demox

How to create an ordinal variable in R?

Nizamuddin Siddiqui
Updated on 13-Aug-2021 11:28:02

3K+ Views

An ordinal variable is a type of categorical variable which has natural ordering. For example, an ordinal variable could be level of salary something defined with Low, Medium, and High categories here we have three categories but there exists a natural order in these categories as low salary is always less than the medium, medium is always less than high. To create an ordinal variable in R, we can use the order argument along with factor function while creating the variable. Follow the below steps to create an ordinal variable in R −Create a categorical column with factor function where ... Read More

How to subset an R data frame based on numerical and categorical column?

Nizamuddin Siddiqui
Updated on 13-Aug-2021 11:27:01

2K+ Views

Subsetting is one of the commonly used technique which serves many different purposes depending on the objective of analysis. To subset a data frame by excluding a column with the help of dplyr package, we can follow the below steps −Creating a data frame.Subsetting the data frame based on numerical as well as categorical column at the same time with the help of filter function of dplyr package.Create the data frameLet's create a data frame as shown below − Live DemoLevel

How to create a scatterplot with two legends using ggplot2 in R?

Nizamuddin Siddiqui
Updated on 13-Aug-2021 11:25:37

1K+ Views

If we want to create a scatterplot with two legends then we must be having two categorical or factor columns. This can be done by using the below steps −Creating a data frame with two numerical and two categorical columnsCreating the scatterplot with shape and color argument of geom_point function of ggplot2 packageCreate the data frameLet's create a data frame as shown below − Live Demox

How to subset a data frame by excluding a column using dplyr in R?

Nizamuddin Siddiqui
Updated on 13-Aug-2021 11:24:27

443 Views

Subsetting is one of the commonly used technique which serves many different purposes depending on the objective of analysis. To subset a data frame by excluding a column with the help of dplyr package, we can follow the below steps −Creating a data frame.Subsetting the data frame by excluding a column with select function of dplyr package.Create the data frameLet's create a data frame as shown below − Live Demox1

How to subset rows of an R data frame if any columns have values greater than a certain value?

Nizamuddin Siddiqui
Updated on 13-Aug-2021 11:23:13

720 Views

To subset rows of an R data frame if any columns have values greater than a certain value, we can follow the below steps −First of all, create a data frame.Then, use filter_all function of dplyr package with any_vars function to subset the rows of the data frame for any columns having values greater than a certain value.Create the data frameLet's create a data frame as shown below − Live Demox1

Advertisements