Base R supports only one type of NA (‘not available’) to represent missing values. However, your data may include several types of missing data or incomplete category responses.
To address this issue of a singular “NA”, we use the
haven
package tagged_na()
tagged_na()
adds an additional character to any NA
values, enabling users to define additional missing data types.
tagged_na()
applies only for numeric values; character
based values can use any string to represent NA or missing data.
recodeflow
recodes missing data categories values into 3
NA values that are commonly used for most studies:
Summary of tagged_na
values and their
corresponding category values.
recodeflow tagged_na |
category value |
---|---|
NA(a) | 6 |
NA(b) | 7 |
NA(b) | 8 |
NA(b) | 9 |
NA(c) | question not asked in the survey cycle |
haven::tagged_na()
##
## Attaching package: 'haven'
## The following objects are masked from 'package:sjlabelled':
##
## as_factor, read_sas, read_spss, read_stata, write_sas, zap_labels
x <- c(1:5, tagged_na("a"), tagged_na("b"))
# Is used to read the tagged NA in most other functions they are still viewed as NA
na_tag(x)
## [1] NA NA NA NA NA "a" "b"
## [1] 1 2 3 4 5 NA(a) NA(b)
## [1] 1 2 3 4 5 NA NA
## [1] FALSE FALSE FALSE FALSE FALSE TRUE TRUE