Found 2038 Articles for R Programming

How to find the index of an element in a matrix column based on some condition in R?

Nizamuddin Siddiqui
Updated on 17-Oct-2020 06:19:59

250 Views

We might want to find the position of values in a matrix column that are less than a certain value. This helps us identify the position of critical or threshold values in each column. For example, if we have a matrix M that contains 5 rows and 5 columns with values in the range of 1 to 100, then we might want to find the index of the values in each column that are less than 50, so that we can see how many columns contain such values. In R, we can easily do this by ...
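The excerpt above is cut off before it names a function, so the following is only a minimal sketch of one common base R approach, using which. The matrix contents and the seed are made up for illustration.

```r
# Hypothetical 5 x 5 matrix with values between 1 and 100 (seed chosen arbitrarily)
set.seed(1)
M <- matrix(sample(1:100, 25), nrow = 5, ncol = 5)

# Row positions in the first column whose values are below 50
idx_col1 <- which(M[, 1] < 50)

# The same check for every column, returned as a list with one entry per column
idx_all <- lapply(seq_len(ncol(M)), function(j) which(M[, j] < 50))

# Row and column positions of all matching cells in a single two-column matrix
pos <- which(M < 50, arr.ind = TRUE)
```

The list form keeps one result per column even when a column has no match, while arr.ind = TRUE is convenient when we want every matching cell at once.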

How to subset factor columns in an R data frame?

Nizamuddin Siddiqui
Updated on 17-Oct-2020 06:14:42

643 Views

Factor columns can be subset by first creating an object that flags them, using sapply with is.factor across all columns, and then passing that object to the single square bracket subsetting operator. For example, if we have a data frame df that contains three columns x, y, z, and two of them, say x and y, are factor columns, then we can use Factors ...
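A minimal sketch of the approach described above, assuming a small made-up data frame df. The object name Factors follows the excerpt; the column values are hypothetical.

```r
# Hypothetical data frame: x and y are factors, z is numeric
df <- data.frame(
  x = factor(c("a", "b", "a")),
  y = factor(c("low", "high", "low")),
  z = c(1.5, 2.5, 3.5)
)

# Named logical vector: TRUE for each factor column
Factors <- sapply(df, is.factor)

# Keep only the factor columns; drop = FALSE keeps a data frame
# even if only one column matches
df_factors <- df[, Factors, drop = FALSE]
```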

How to create a scatterplot using ggplot2 with different shape and color of points based on a variable in R?

Nizamuddin Siddiqui
Updated on 17-Oct-2020 06:08:32

447 Views

In general, the default shape of points in a scatterplot is circular, but it can be changed to other shapes using an integer, a sequence of integers, or a variable. We just need to use the shape argument of the geom_point function and pass the variable name. For example, if we want to create the scatterplot with shapes that vary by a variable x, then we can use geom_point(shape=x). And if we want to change the size then integer values can be used. Example: consider the below data frame: set.seed(151) x ...
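The original example is truncated, so here is a hedged sketch rather than the author's code. It assumes the ggplot2 package is installed and uses a made-up grouping column named group; mapping the variable inside aes() is one common way to let it drive both shape and colour.

```r
library(ggplot2)

# Hypothetical data: two numeric columns and a grouping factor
set.seed(151)
df <- data.frame(
  x = rnorm(20),
  y = rnorm(20),
  group = factor(sample(c("A", "B", "C"), 20, replace = TRUE))
)

# Shape and colour both follow the grouping variable; size is a fixed number
p <- ggplot(df, aes(x = x, y = y)) +
  geom_point(aes(shape = group, colour = group), size = 3)
```

Printing p draws the plot. A single fixed shape can instead be set outside aes(), for example geom_point(shape = 17).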

How to remove the row names or column names from a matrix in R?

Nizamuddin Siddiqui
Updated on 16-Oct-2020 15:50:58

20K+ Views

To remove the row names or column names from a matrix, we just need to set them to NULL; this removes all of the names. For example, if we have a matrix M that contains row names and column names, then we can remove those names by using the command colnames(M) ...
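As a small illustration of setting names to NULL, the sketch below uses a made-up 2 x 3 matrix; the row and column labels are hypothetical.

```r
# Hypothetical matrix with row and column names
M <- matrix(1:6, nrow = 2,
            dimnames = list(c("r1", "r2"), c("a", "b", "c")))

# Remove the row names and the column names separately
rownames(M) <- NULL
colnames(M) <- NULL
```

If both sets of names should go at once, dimnames(M) <- NULL is an alternative.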

How to replicate a matrix by rows in R?

Nizamuddin Siddiqui
Updated on 16-Oct-2020 15:35:03

2K+ Views

Replicating a matrix by rows means repeating the matrix one or more times, row-wise. For example, if we have a matrix that contains only one row and three columns, then replicating that matrix three times will repeat that one row three times. This can be done by using the rep function along with the matrix function. Example: M ...
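The original example is cut off, so the following is only a sketch of how rep and matrix can be combined for the one-row case described above. The second part, which repeats row indices, is an assumption of ours for matrices with more than one row, and the names R1, M2 and R2 are hypothetical.

```r
# One-row matrix with three columns
M <- matrix(1:3, nrow = 1)

# Repeat the values three times and refill row by row
R1 <- matrix(rep(M, times = 3), nrow = 3, byrow = TRUE)

# For a matrix with several rows, repeating the row indices
# stacks whole copies of the matrix on top of each other
M2 <- matrix(1:6, nrow = 2)
R2 <- M2[rep(seq_len(nrow(M2)), times = 2), , drop = FALSE]
```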

How to extract website name from their links in R?

Nizamuddin Siddiqui
Updated on 16-Oct-2020 15:25:38

185 Views

If we have a list of website links and want to extract the website name from each of them, doing it by hand is time-consuming because we would need to copy each name one by one. It is better to extract them using a function in R and save time. To extract the website name from a website link, we can use the suffix_extract function of the urltools package. This will extract the host, subdomain, domain and suffix; the domain values are the website names. Loading urltools package: library(urltools). Website links stored in a vector: Web_Links ...
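A hedged sketch of the idea, assuming the urltools package is installed. The two links are made up, and the extra call to domain(), which strips the scheme and path before suffix_extract is applied, is our assumption rather than something the excerpt shows.

```r
library(urltools)

# Hypothetical website links stored in a vector
Web_Links <- c("https://www.google.com/search?q=r",
               "https://en.wikipedia.org/wiki/R")

# Reduce each link to its host name, then split the host into
# subdomain, domain and suffix
hosts <- domain(Web_Links)
parts <- suffix_extract(hosts)

# The domain column holds the website names
site_names <- as.character(parts$domain)
```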

How to create horizontal line for a range of values in a plot created by using ggplot2 in R?

Nizamuddin Siddiqui
Updated on 16-Oct-2020 15:22:49

515 Views

To highlight a particular part of the independent variable's range in a plot, we might want to use a horizontal line. This draws the viewer's attention to that range. To create a horizontal line in a plot, we can use the geom_line function, but we need to pass the values for which we want to create the horizontal line as a data frame. Example: consider the below data frame: x ...
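Since the original example is truncated, the sketch below is an assumption about how a second data frame can be passed to geom_line to draw a line over part of the x range. It assumes ggplot2 is installed; the data, the range 3 to 7 and the height 0 are made up.

```r
library(ggplot2)

# Hypothetical data for the scatterplot
set.seed(1)
df <- data.frame(x = 1:10, y = rnorm(10))

# Two points at the same height define a horizontal line from x = 3 to x = 7
seg <- data.frame(x = c(3, 7), y = c(0, 0))

# The layer inherits the x and y mapping but uses its own data frame
p <- ggplot(df, aes(x = x, y = y)) +
  geom_point() +
  geom_line(data = seg, colour = "red")
```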

How to remove underscore from column names of an R data frame?

Nizamuddin Siddiqui
Updated on 16-Oct-2020 15:20:06

2K+ Views

When we import data from outside sources, the headers or column names might be imported with underscore-separated values; the same happens if the original data uses that format. To make the headers shorter and cleaner, we may prefer to remove the underscores, and this can be easily done with the help of the gsub function. Example: consider the below data frame: x_1 ...
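A minimal sketch of the gsub approach on a made-up data frame; only the column name x_1 appears in the excerpt, the other names are hypothetical.

```r
# Hypothetical data frame with underscores in the column names
df <- data.frame(x_1 = 1:3, x_2 = 4:6, y_total = 7:9)

# Replace every underscore in the names with an empty string;
# fixed = TRUE treats "_" as a literal character, not a regular expression
names(df) <- gsub("_", "", names(df), fixed = TRUE)
```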

How to find the row-wise frequency of zeros in an R data frame?

Nizamuddin Siddiqui
Updated on 16-Oct-2020 15:13:03

866 Views

In data analysis, we need to be cautious about repeated values because they might have been entered deliberately, which biases the data, and the repeated value could be zero. This happens when we have missing data and the data collector replaces missing values with zeros, which is a wrong practice. To find the row-wise frequency of zeros in an R data frame, we can apply the rowSums function to a comparison with zero, using the syntax rowSums(data_frame_name == 0). Example: consider the below data frame: set.seed(189) x1 ...
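The syntax above can be tried on a small made-up data frame; the values below are hypothetical and chosen so that the counts are easy to check by eye.

```r
# Hypothetical data frame with some zeros
df <- data.frame(
  x1 = c(0, 1, 0),
  x2 = c(0, 2, 3),
  x3 = c(0, 0, 5)
)

# df == 0 gives a logical matrix; rowSums counts the TRUE values in each row
zero_counts <- rowSums(df == 0)
```

If the data frame may contain NA values, rowSums(df == 0, na.rm = TRUE) avoids NA results for those rows.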

How to represent all values of X-axis or Y-axis on the graph in R using ggplot2 package?

Nizamuddin Siddiqui
Updated on 16-Oct-2020 15:03:52

6K+ Views

If a column of an R data frame has many unique or repeated values and we create a graph using that column, on either the X-axis or the Y-axis, then R automatically chooses the axis labels, which might not display all the unique values of the column in the plot. Therefore, we can use the scale_x_continuous function or the scale_y_continuous function with labels, depending on our requirement, to display the column values. Example: consider the below data frame: x ...
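The original example is cut off, so this is a hedged sketch only. It assumes ggplot2 is installed and uses made-up data; supplying breaks alongside labels is our assumption about how to place one tick at every unique value.

```r
library(ggplot2)

# Hypothetical data with unevenly spaced x values
df <- data.frame(x = c(1, 2, 3, 5, 8, 13), y = c(2, 4, 3, 6, 5, 7))

# One tick per unique x value, labelled with the value itself
x_vals <- sort(unique(df$x))
p <- ggplot(df, aes(x = x, y = y)) +
  geom_point() +
  scale_x_continuous(breaks = x_vals, labels = as.character(x_vals))
```

The same pattern applies to the Y-axis with scale_y_continuous.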
