Data Activity 3/Unit 3
Task
Using the Crime Survey for England and Wales, 2013-2014 (see Unit 1), perform the following activities:
- Create a subset of individuals who belong to the ‘75+’ age group and who were a ‘victim of crime’ that occurred in the previous 12 months. Save this dataset under a new name ‘crime_75victim’.
Process and Findings
library(haven)
crime <- read_sav("filelocation.sav")
crime$bcsvictim <- as_factor(crime$bcsvictim) # converts the labelled numeric codes to a factor of their value labels, so the relevant group can be identified by name
crime$agegrp7 <- as_factor(crime$agegrp7)
table(crime$agegrp7) # to check the output and the age classifications
crime_75victim <- subset(crime, agegrp7 == "75+" & bcsvictim == "Victim of crime")
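As a quick sanity check, the number of rows in the subset should equal the matching cell of a cross-tabulation of the two factors. The sketch below uses a small made-up data frame (`toy` and its values are hypothetical, not survey records) so that it runs without the .sav file; with the real data the same calls would be applied to `crime`.

```r
# Toy stand-in for the survey data (hypothetical values, not real survey records)
toy <- data.frame(
  agegrp7   = factor(c("65-74", "75+", "75+", "75+", "16-24"),
                     levels = c("16-24", "65-74", "75+")),
  bcsvictim = factor(c("Victim of crime", "Victim of crime",
                       "Not a victim of crime", "Victim of crime",
                       "Victim of crime"),
                     levels = c("Not a victim of crime", "Victim of crime"))
)

# Same filter as used for crime_75victim
toy_75victim <- subset(toy, agegrp7 == "75+" & bcsvictim == "Victim of crime")

# Cross-tabulate the two factors; the "75+" / "Victim of crime" cell
# should match the number of rows in the subset
xtab <- table(toy$agegrp7, toy$bcsvictim)
n_expected <- xtab["75+", "Victim of crime"]
stopifnot(nrow(toy_75victim) == n_expected)
```

To keep a copy of the real subset on disk, one option is `saveRDS(crime_75victim, "crime_75victim.rds")`; the file name here is an assumption.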
Learnings
On the first attempt the resulting crime_75victim table had no observations. This was (eventually) traced to a missing capital ‘V’ at the beginning of ‘Victim’: factor labels are case-sensitive, so the comparison matched nothing. Once this was corrected, a table with 67 observations (out of the original 8843) was produced. If the numeric codes behind the labels are understood, I would consider it easier to produce the table using those codes, which helps to minimise such errors. The codes can be checked on the columns as imported (i.e. before as_factor() is applied) using e.g.:
attributes(crime$agegrp7)
attributes(crime$bcsvictim)
#or alternatively
install.packages("labelled")
library(labelled)
val_labels(crime$agegrp7)
16-24 25-34 35-44 45-54 55-64 65-74   75+
    1     2     3     4     5     6     7
val_labels(crime$bcsvictim)
Not a victim of crime       Victim of crime
                    0                     1
# can now create the subset using the numeric codes; this needs the columns as imported (labelled numeric), not converted with as_factor()
crime_75victim <- subset(crime, agegrp7 == 7 & bcsvictim == 1)
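When subsetting by label rather than by code, one way to catch a mistyped label early is to check it against the factor's levels before filtering. The following is a minimal sketch on made-up data; `check_label` is a hypothetical helper written for illustration, not a function from haven or labelled.

```r
# Hypothetical helper: stop with a clear message if a label is not a level
check_label <- function(x, label) {
  if (!label %in% levels(x)) {
    stop("Label '", label, "' not found. Available levels: ",
         paste(levels(x), collapse = ", "))
  }
  invisible(TRUE)
}

# Toy factor standing in for crime$bcsvictim after as_factor()
victim <- factor(c("Victim of crime", "Not a victim of crime", "Victim of crime"))

# Correctly capitalised label: passes silently
check_label(victim, "Victim of crime")

# A lower-case 'v' matches nothing in a plain comparison and gives no warning
n_wrong <- sum(victim == "victim of crime")

# The helper raises an error instead; tryCatch captures its message here
msg <- tryCatch(check_label(victim, "victim of crime"),
                error = function(e) conditionMessage(e))
```

With the real data, the check would be called on `crime$bcsvictim` and `crime$agegrp7` just before the `subset()` call.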