Factors are used to handle categorical data in R. They are stored as vectors of integers with labels. Factors are useful for representing variables like gender, status, category, grade, etc., which have a fixed number of possible values (called levels).
# Create a factor for gender
gender <- factor(c("Male", "Female", "Female", "Male", "Male"))
# Display factor
print(gender)
Output:
[1] Male   Female Female Male   Male
Levels: Female Male
You can view the levels of a factor using:
levels(gender)
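To make the "integers with labels" description concrete, here is a minimal sketch that inspects the same gender factor using only base R. The variable name codes is introduced here purely for illustration.

```r
gender <- factor(c("Male", "Female", "Female", "Male", "Male"))

# Levels are sorted alphabetically by default
levels(gender)        # "Female" "Male"

# The underlying integer codes index into the levels
codes <- as.integer(gender)
codes                 # 2 1 1 2 2

# Looking the codes up in the levels recovers the original labels
levels(gender)[codes]
```

Because "Female" sorts before "Male", it becomes level 1, which is why the first element ("Male") is stored as 2.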
Some categories are ordered (e.g., grades or sizes). You can declare the order by listing the levels from lowest to highest and setting ordered = TRUE:
grades <- factor(c("B", "A", "C", "A", "B"), levels = c("C", "B", "A"), ordered = TRUE)
print(grades)
You can compare ordered factors:
grades[2] > grades[1] # TRUE because A > B in defined order
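Comparison also works across the whole vector, which is where ordered factors tend to be most useful. The sketch below assumes the same grades factor as above and uses only base R functions.

```r
grades <- factor(c("B", "A", "C", "A", "B"),
                 levels = c("C", "B", "A"),
                 ordered = TRUE)

# Keep only grades of B or better; the string "B" is matched against the levels
grades[grades >= "B"]

# sort() and max() follow the declared level order, not alphabetical order
sort(grades)
max(grades)
```

With the order C < B < A, the filter drops the single "C", and max(grades) is A even though "A" comes first alphabetically.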
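To convert a character vector to a factor, use as.factor():
x <- c("yes", "no", "yes", "yes", "no")
f <- as.factor(x)
A factor can be converted back to text or to its integer level codes:
as.character(f) # Convert to text
as.numeric(f) # Convert to level index (e.g., 2 1 2 2 1)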
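One case worth sketching: when the labels look like numbers, as.numeric() still returns level indices rather than the values themselves. The sizes vector below is a hypothetical example, not part of the original tutorial.

```r
# Hypothetical factor whose labels happen to be digits
sizes <- factor(c("10", "5", "10", "20"))

levels(sizes)                      # "10" "20" "5" (sorted as text)
as.numeric(sizes)                  # 1 3 1 2, the level indices
as.numeric(as.character(sizes))    # 10 5 10 20, the actual values
```

Going through as.character() first is the usual way to recover the numeric values.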
To count the number of levels in a factor, use nlevels(); for example, nlevels(f) returns 2.