-
Notifications
You must be signed in to change notification settings - Fork 0
/
Homework01.Rmd
128 lines (98 loc) · 2.39 KB
/
Homework01.Rmd
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
---
title: "Hw01"
author: "Dre"
date: "September 24, 2016"
output: html_document
---
```{r setup, include=FALSE}
knitr::opts_chunk$set(echo = TRUE)
```
```{r}
library(gapminder)
library(tidyverse)
```
# as_tibble() to view data as tibble if it wasn't made a tibble
```{r}
gapminder
str(gapminder)
glimpse(gapminder)
head(gapminder)
tail(gapminder)
```
# ctrl + alt + i to insert chunk
```{r}
(canada <- filter(gapminder, country == "canada"))
filter(gapminder, year>2000)
filter(gapminder, continent == "Europe", year == 2007)
filter(gapminder, country == "Bulgaria" | year == 2007)
filter(gapminder, country == "Bulgaria" | country == "Albania")
filter(gapminder, country %in% c("Bulgaria", "Albania"))
select(gapminder, year, lifeExp)
select(
filter(gapminder, year, lifeExp, country),
country =="Canada"
)
```
```{r}
# cmd shift m gives %>%
# pipe always goes on the right of the command (end of the line)
gapminder %>%
filter(country == "Canada") %>%
select(year, lifeExp)
gapminder %>%
select(year, lifeExp, country) %>%
filter(country == "Canada") %>%
select(-country)
select(
filter(gapminder, year, lifeExp, country),
country =="Canada")
y <- gapminder %>%
select(starts_with("co"))
```
Let's look at some functions to get to know a data frame.
```{r}
names(gapminder)
colnames(gapminder)
ncol(gapminder)
length(gapminder)
dim(gapminder)
nrow(gapminder)
```
Shift from describing the whole object to looking at the variables inside.
```{r}
summary(gapminder)
# main way to get one variable
gapminder$lifeExp
# Another way to get one variable
gapminder[["gdpPercap"]]
str(gapminder$lifeExp)
str(gapminder["lifeExp"])
```
Now we know how to get 1 variable. Let's explore single variables.
```{r}
summary(gapminder$continent)
table(gapminder$continent)
class(gapminder$continent)
levels(gapminder$continent)
nlevels(gapminder$continent)
barplot(table(gapminder$continent))
summary(gapminder$lifeExp)
quantile(gapminder$lifeExp)
mean(gapminder$lifeExp)
median(gapminder$lifeExp)
hist(gapminder$lifeExp)
```
Let's make a few figures.
```{r}
## library(ggplot2)
ggplot(gapminder, aes(x = gdpPercap, y = lifeExp)) +
geom_point()
## can also pipe data in:
gapminder %>%
#filter(country == "Canada") %>%
ggplot(aes(x = gdpPercap, y = lifeExp)) +
geom_point()
p <- ggplot(gapminder, aes(x = gdpPercap, y = lifeExp))
## p + geom_point()
p + geom_point(aes(color = continent))
```