April 19th, 2026
A brief introduction to modern development tools. Integrated development environments. Language server protocol. Generative code assistant tools.
Overview of data transformation activities. Data frames. Distinct values. Selecting, renaming, and mutating variables. Filtering and arranging data. Data pivoting.
A top-down introduction to the basic concepts of data visualization. Layers, aesthetics and geometric objects. Coloring. Scatter plots. Bars, histograms, and density plots. Box plots.
The comma-separated values (CSV) format. Importing data from disk or the web. Relative and absolute paths. An overview of other data file formats.
A follow-up look at data transformation topics. Boolean columns. Discretizing continuous variables. Counting observations. Groupwise operations. Summarizing. Slicing operations. Accessing columns. Combining data with joins.
More details on data visualization. The how and why of aesthetics. Basic themes and extensions. Guides: axes, labels, scales, breaks, and legends. Statistical transformations. Text and annotations.
A concise step-by-step guide on how to set up the tools and folders needed for the course.
An introduction to managing and analyzing large datasets. Foundations of SQL and NoSQL databases. File-based databases. Mapping dplyr verbs to SQL queries.