Wednesday, November 25, 2015
vehicle DTC text mining in the R tm package
text mining in R
how to process ngrams:
http://tm.r-forge.r-project.org/faq.html#Bigrams
comparing frequency of DTM across factor levels (like DTC against platform):
create DTM
create dataframe from DTM: df <- as.data.frame(inspect(dtm))
df$platform <- substr(data$vin_last_8,2,2)
nice compact summary table:
dtc_by_platform <- with(melt(df),tapply(value,list(variable,platform),sum))
and this table can be used as input to a barplot or chisquare test
Subscribe to:
Post Comments (Atom)
No comments:
Post a Comment