dplyr is an R package for data manipulation. Using a series of examples on a dataset you can download, this tutorial covers the new functionality in dplyr versions 0.3 and 0.4.
Watch the previous dplyr tutorial: • Hands-on dplyr tutoria...
View the R Markdown document: rpubs.com/justmarkham/dplyr-tu...
Download the source document: github.com/justmarkham/dplyr-...
Tutorial contents:
1. Introduction (starts at 0:00)
2. Loading dplyr and the nycflights13 dataset (starts at 1:12)
3. Choosing columns: `select`, `rename` (starts at 2:28)
4. Choosing rows: `filter`, `between`, `slice`, `sample_n`, `top_n`, `distinct` (starts at 5:40)
5. Adding new variables: `mutate`, `transmute`, `add_rownames` (starts at 12:38)
6. Grouping and counting: `summarise`, `tally`, `count`, `group_size`, `n_groups`, `ungroup` (starts at 15:20)
7. Creating data frames: `data_frame` (starts at 23:01)
8. Joining (merging) tables: `left_join`, `right_join`, `inner_join`, `full_join`, `semi_join`, `anti_join` (starts at 25:28)
9. Viewing more output: `print`, `View` (starts at 31:29)
10. Resources (starts at 34:41)
== RESOURCES ==
Release announcement (0.3): blog.rstudio.org/2014/10/13/dp...
Release announcement (0.4): blog.rstudio.org/2015/01/09/dp...
Reference manual and vignettes: cran.r-project.org/web/package...
Two-table vignette: cran.r-project.org/web/package...
Data Wrangling Cheat Sheet: www.rstudio.com/wp-content/upl...
GitHub repo: github.com/hadley/dplyr
List of releases: github.com/hadley/dplyr/releases
== LET'S CONNECT! ==
Newsletter: www.dataschool.io/subscribe/
Twitter: / justmarkham
Facebook: / datascienceschool
LinkedIn: / justmarkham
Негізгі бет Going deeper with dplyr: New features in 0.3 and 0.4 (tutorial)
Пікірлер: 93