-
Notifications
You must be signed in to change notification settings - Fork 0
/
Diebold Descriptive Analytics.Rmd
105 lines (87 loc) · 2.25 KB
/
Diebold Descriptive Analytics.Rmd
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
---
title: "**Diebold Descriptive Analytics**"
date: "February 23, 2019"
output: html_document
---
##**Importing Data**
```{r}
diebold <- read.csv("December Non-bill calls.csv", stringsAsFactors = FALSE)
View(diebold)
```
##**Necessary Packages**
```{r message=FALSE, warning=FALSE}
library(qdap)
library(dplyr)
library(ggplot2)
```
##**Attribute Analysis**
**Call Text**
```{r}
call_text <- diebold$Call.Text
clean_call_text_reduced <- call_text %>%
tolower() %>%
removePunctuation() %>%
removeNumbers() %>%
stripWhitespace() %>%
removeWords(stopwords("en"))
freq_terms_reduced <- freq_terms(clean_call_text_reduced)
plot(freq_terms_reduced)
freq_terms_reduced
```
**Billing Notes**
```{r}
billing_notes <- diebold$Billing.Notes
clean_billing_notes_reduced <- billing_notes %>%
tolower() %>%
removePunctuation() %>%
removeNumbers() %>%
stripWhitespace() %>%
removeWords(stopwords("en"))
freq_terms_reduced <- freq_terms(clean_billing_notes_reduced)
plot(freq_terms_reduced)
freq_terms_reduced
```
**Branch Description**
```{r}
br_branch_desc <- diebold$Br.Branch.Desc
clean_br_branch_desc_reduced <- br_branch_desc %>%
tolower() %>%
removePunctuation() %>%
removeNumbers() %>%
stripWhitespace() %>%
removeWords(stopwords("en"))
freq_terms_reduced <- freq_terms(clean_br_branch_desc_reduced)
plot(freq_terms_reduced)
freq_terms_reduced
```
**Item Description**
```{r}
item_desc <- diebold$Item.Desc
clean_item_desc_reduced <- item_desc %>%
tolower() %>%
removePunctuation() %>%
removeNumbers() %>%
stripWhitespace() %>%
removeWords(stopwords("en"))
freq_terms_reduced <- freq_terms(clean_item_desc_reduced)
plot(freq_terms_reduced)
freq_terms_reduced
```
**SR Duration**
This was an attribute we created from the difference of SR Contact Date and Activity Completed Date to show total time that it took to complete each instance.
```{r}
median(diebold$Difference)
mean(diebold$Difference)
range(diebold$Difference)
Diff.Count <- diebold %>%
group_by(Difference) %>%
summarize(Count = n())
head(Diff.Count)
```
**Activity Type**
```{r}
diebold %>%
group_by(Activity.Type) %>%
summarize(Count = n()) %>%
arrange(desc(Count))
```