Found 2038 Articles for R Programming

Combine two columns by ignoring missing values, if any exist in one column, in an R data frame.

Nizamuddin Siddiqui
Updated on 08-Nov-2021 10:17:34

3K+ Views

To combine two columns in an R data frame while ignoring missing values that exist in one of them, we can use the paste function together with the is.na function.

For example, if we have a data frame called df that contains two columns, say C1 and C2, where C2 contains some missing values, then we can use the command below to combine C1 and C2 while ignoring the missing values in C2 −

    cbind(df,Combined=paste(df[,1],replace(df[,2],is.na(df[,2]),"")))

Example 1

Following snippet creates a sample data frame −

x1
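A minimal sketch of the paste and is.na approach described above. The data frame contents are hypothetical, and the trimws call is an addition not found in the excerpt: paste separates values with a space by default, so rows where C2 is missing would otherwise end with a trailing space.

```r
# Hypothetical sample data; C2 contains a missing value.
df <- data.frame(C1 = c("a", "b", "c"),
                 C2 = c("x", NA, "z"),
                 stringsAsFactors = FALSE)

# Replace NA in the second column with an empty string before pasting.
filled <- replace(df[, 2], is.na(df[, 2]), "")

# trimws (an addition to the excerpt's command) drops the trailing space
# that paste leaves behind when the second value is empty.
combined <- cbind(df, Combined = trimws(paste(df[, 1], filled)))
print(combined)
```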

Find the common elements between two columns of an R dataframe.

Nizamuddin Siddiqui
Updated on 08-Nov-2021 10:10:25

5K+ Views

To find the common elements between two columns of an R data frame, we can use the intersect function.

For example, if we have a data frame called df that contains two columns, say X and Y, then we can find the common elements between X and Y by using the command below −

    intersect(df$X,df$Y)

Example 1

Following snippet creates a sample data frame −

x1
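As a small illustration of the intersect call above, the following sketch uses a hypothetical data frame in which two values appear in both columns.

```r
# Hypothetical sample data; 4 and 5 occur in both columns.
df <- data.frame(X = c(1, 2, 3, 4, 5),
                 Y = c(4, 5, 6, 7, 8))

# intersect returns the distinct values present in both vectors.
common <- intersect(df$X, df$Y)
print(common)
```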

How to subset an R data frame by specifying columns that contain NA?

Nizamuddin Siddiqui
Updated on 08-Nov-2021 10:14:09

297 Views

To subset an R data frame by specifying columns that contain NA, we can follow the steps below −

First of all, create a data frame with some columns containing NAs.

Then, use is.na along with the subset function to subset the data frame by specifying the columns that contain NA.

Example

Create the data frame

Let’s create a data frame as shown below −

x
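The two steps above can be sketched as follows. This is one reading of the excerpt, assuming that "specifying columns" means keeping the rows in which the named column is NA; the column names and values are hypothetical.

```r
# Step 1: hypothetical data frame with NAs in both columns.
df <- data.frame(x = c(1, NA, 3, NA),
                 y = c(NA, 2, 3, 4))

# Step 2: keep the rows where column x is NA.
rows_x_na <- subset(df, is.na(x))

# Several columns can be combined with | (NA in either column).
rows_any_na <- subset(df, is.na(x) | is.na(y))

print(rows_x_na)
print(rows_any_na)
```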

Find the number of non-missing values in each group of an R data frame.

Nizamuddin Siddiqui
Updated on 08-Nov-2021 10:04:49

178 Views

To find the number of non-missing values in each group of an R data frame, we can convert the data frame to a data.table object and then use the sum function with the negation of is.na.

For example, if we have a data frame called df that contains a grouping column, say Group, and a numerical column with a few NAs, say Num, then we can find the number of non-missing values of Num in each Group by using the command below −

    setDT(df)[,sum(!is.na(Num)),by=.(Group)]

Example 1

Following snippet creates a sample data frame −

Grp
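A minimal sketch of the data.table approach, assuming the count is taken over the Num column within each group. The sample values and the result column name NonMissing are hypothetical.

```r
library(data.table)

# Hypothetical sample data; Num has one NA in each group.
df <- data.frame(Group = c("A", "A", "B", "B", "B"),
                 Num = c(1, NA, 3, 4, NA))

# setDT converts df to a data.table by reference; the j expression
# counts the non-missing values of Num within each Group.
counts <- setDT(df)[, .(NonMissing = sum(!is.na(Num))), by = .(Group)]
print(counts)
```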

Create stacked bar chart with percentages on Y-axis using ggplot2 in R.

Nizamuddin Siddiqui
Updated on 08-Nov-2021 09:59:33

941 Views

To create a stacked bar chart with percentages on the Y-axis using ggplot2 in R, we can map the second categorical variable to the fill aesthetic inside geom_bar and set position to fill.

For example, if we have a data frame called df with two categorical columns, say C1 and C2, then we can create a stacked bar chart with percentages on the Y-axis using the command below −

    ggplot(df,aes(C1))+geom_bar(aes(fill=C2),position="fill")

Example

Following snippet creates a sample data frame −

f1
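A short sketch of the geom_bar call above on hypothetical data. With position = "fill" alone, the axis shows proportions from 0 to 1; the scale_y_continuous line with scales::percent is an addition, not part of the excerpt, that relabels those proportions as percentages.

```r
library(ggplot2)

# Hypothetical sample data with two categorical columns.
df <- data.frame(C1 = c("a", "a", "b", "b", "b", "c"),
                 C2 = c("x", "y", "x", "x", "y", "y"))

# position = "fill" stacks the bars and rescales each to a height of 1.
p <- ggplot(df, aes(C1)) +
  geom_bar(aes(fill = C2), position = "fill") +
  # Addition to the excerpt's command: show the 0-1 axis as percentages.
  scale_y_continuous(labels = scales::percent)

print(p)
```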

How to find the column variance if some columns are categorical in R data frame?

Nizamuddin Siddiqui
Updated on 08-Nov-2021 10:02:08

364 Views

To find the column variance if some columns are categorical in an R data frame, we can follow the steps below −

First of all, create a data frame.

Then, use the numcolwise function from the plyr package to find the variance of the numerical columns only.

Example

Create the data frame

Let’s create a data frame as shown below −

Group
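The steps above can be sketched as follows. The data frame is hypothetical; numcolwise(var) builds a function that applies var to the numeric columns and skips the categorical one.

```r
library(plyr)

# Step 1: hypothetical data frame with one categorical column.
df <- data.frame(Group = c("A", "B", "A", "B"),
                 x = c(1, 2, 3, 4),
                 y = c(2, 4, 6, 8))

# Step 2: numcolwise(var) returns a function that computes the
# variance of every numeric column and ignores the others.
col_var <- numcolwise(var)(df)
print(col_var)
```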

How to find the sum of rows of a column based on multiple columns in R data frame?

Nizamuddin Siddiqui
Updated on 08-Nov-2021 09:55:05

849 Views

To find the sum of rows of a column based on multiple columns in an R data frame, we can follow the steps below −

First of all, create a data frame.

Then, use the aggregate function to find the sum of rows of a column based on multiple columns.

Example

Create the data frame

Let’s create a data frame as shown below −

Grp1
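A minimal sketch of the aggregate step, using hypothetical column names Grp1, Grp2, and Value. The formula interface sums Value for every combination of the two grouping columns that occurs in the data.

```r
# Step 1: hypothetical data frame with two grouping columns.
df <- data.frame(Grp1 = c("A", "A", "B", "B", "A"),
                 Grp2 = c("u", "v", "u", "u", "u"),
                 Value = c(1, 2, 3, 4, 5))

# Step 2: sum Value for each observed (Grp1, Grp2) combination.
sums <- aggregate(Value ~ Grp1 + Grp2, data = df, FUN = sum)
print(sums)
```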

Subset groups that occur greater than or equal to n times in R dataframe.

Nizamuddin Siddiqui
Updated on 08-Nov-2021 09:55:25

413 Views

To subset groups that occur at least n times in an R data frame, we can use the filter function of the dplyr package.

For example, if we have a data frame called df that contains a grouping column, say Group, then we can subset the groups that occur at least 4 times by using the command below −

    df%>%group_by(Group)%>%filter(n()>=4)

Example 1

Following snippet creates a sample data frame −

Grp
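A minimal sketch of the dplyr approach, assuming the goal stated in the article title: keep only the groups that occur at least n times, here with n = 4. The sample data is hypothetical.

```r
library(dplyr)

# Hypothetical sample data: A occurs 4 times, B twice, C five times.
df <- data.frame(Group = c(rep("A", 4), rep("B", 2), rep("C", 5)),
                 Value = 1:11)

# n() is the size of the current group, so only groups with at
# least 4 rows survive the filter.
kept <- df %>%
  group_by(Group) %>%
  filter(n() >= 4)

print(kept)
```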

Create bar plot of one column in an R data frame using ggplot2.

Nizamuddin Siddiqui
Updated on 08-Nov-2021 08:21:41

3K+ Views

To create a bar plot of one column in an R data frame using ggplot2, we can use the row names of the data frame as the x variable in aes.

For example, if we have a data frame called df that contains two columns, say X and Y, and we want to create the bar plot of the values in Y, then we can use the command below −

    ggplot(df,aes(rownames(df),Y))+geom_bar(stat="identity")

Example

Following snippet creates a sample data frame −

x1
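The command above can be tried on a hypothetical data frame as in the sketch below. The labs call is an addition to the excerpt's command; without it the x-axis title would read rownames(df).

```r
library(ggplot2)

# Hypothetical sample data; the default row names are "1", "2", "3".
df <- data.frame(X = c(5, 3, 8),
                 Y = c(10, 4, 7))

# stat = "identity" draws each bar at the height given in Y rather
# than counting rows.
p <- ggplot(df, aes(rownames(df), Y)) +
  geom_bar(stat = "identity") +
  # Addition to the excerpt's command: a readable axis title.
  labs(x = "Row name")

print(p)
```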

Display infinity and minus infinity both symbols in one base R plot.

Nizamuddin Siddiqui
Updated on 08-Nov-2021 08:18:49

196 Views

To display both the infinity and minus infinity symbols in one base R plot, we can use the text function and the expression function. Inside the expression function, we write the word infinity to display the infinity symbol and the word infinity preceded by a minus sign to display minus infinity.

Check out the examples below to understand how it can be done.

Example 1

To display the infinity and minus infinity symbols in one base R plot, use the following code −

    plot(1:10, type="n")
    text(c(1, 5), expression(-infinity, infinity))

Output

If you execute the above snippet, it generates the following output −

Example 2

To display infinity and minus infinity symbol in one base ...
