Training Series
Introduction to Open Data Science
These two-day workshops provided a hands-on introduction to open data science so you can work with data in an open, reproducible, and collaborative way. Participants learned data wrangling and visualization in R and RMarkdown while practicing reproducible workflows with RStudio and GitHub. These trainings were led through The Carpentries using training materials from the Ocean Health Index.
Access materials by workshop date:
- 2018 | October 22 at Woods Hole Oceanographic Institution
- 2017 | November 30 at Monterey Bay Aquarium Research Institute
- 2017 | August 17 at UC Merced Yosemite Field Station
- 2016 | July 12 at Oxford University
- 2016 | April 15 at UC Santa Barbara
R and GitHub for ocean science
Online tutorials and trainings in open data science, specifically for using R, RStudio, git, and GitHub and developed by our Ocean Health Index team.
2018
Reproducible Analyses with R
This training covered techniques for building reproducible analysis workflows using the R programming language through a series of hands-on coding sessions. We will cover topics in data science, including reproducible analysis, version control, data modeling, cleaning, and integration, and data visualization both for publications and the web. The training took place at the University of Alaska, Fairbanks' International Arctic Research Center.
Workshop Dates: October 16-17
Open Science for Arctic Research and Data Management
These trainings provided Arctic researchers with critical skills for the stewardship of data, software, and many other research products that are preserved at the Arctic Data Center. The trainings included an overview of best data management practices, data science tools, and concrete steps and methods for more easily documenting and uploading their data to the Arctic Data Center.
Workshop Dates: August 13-17
Reproducible Analysis in R for Salmon Data Management
These trainings covered techniques for building reproducible analysis workflows using the R programming language, using Alaskan salmon brood data to demonstrate how heterogeneous data can be cleaned, integrated, and documented through RMarkdown. They also touched on other data science topics, including version control, data modeling, cleaning, integration, and data visualization.
Workshop Dates: May 17-18
Data Rescue and Integration for Coral Reef Science
This training provided data science training in RStudio and GitHub for coral reef researchers, focusing on data rescue, data integration, and team science, and integrating datasets in an exploratory way. Organized on behalf of the Coral Reef Science and Cyberinfrastructure Network (CRESCYNT), the workshops addressed issues consistently identified as challenges in the field of coral reef research and to broadly share lessons learned.
Workshop Dates: March 7-15
2017
Reproducible Analysis in R for Salmon Data Management
These trainings covered techniques for building reproducible analysis workflows using the R programming language, using Alaskan salmon brood data to demonstrate how heterogeneous data can be cleaned, integrated, and documented through RMarkdown. They also touched on other data science topics, including version control, data modeling, cleaning, integration, and data visualization.
Workshop Dates: November 29-30
Open Science for Arctic Research and Data Management
These trainings provided Arctic researchers with critical skills for the stewardship of data, software, and many other research products that are preserved at the Arctic Data Center. The trainings included an overview of best data management practices, data science tools, and concrete steps and methods for more easily documenting and uploading their data to the Arctic Data Center.
Workshop Dates: July 31 - August 1, March 28
Open Science for Synthesis: Reproducible Science in the Gulf of Mexico
This training helped participants build a foundation of data science skills for synthesizing data, weaving together several core concepts, including virtual collaboration; data management; data integration and exploration; scientific workflows and reproducible research; programming using agile and sustainable software practices; data analysis and modeling; and communicating results to broad communities. Programming languages learned include R and Python. The training was organized on behalf of The National Academy of Science’s Gulf Research Program.
Training materials
Workshop Dates: July 10-28
2016
Computational & Collaboration Skills for Integrative Conservation Science
This training brought together Science for Nature and People Partnership (SNAPP) postdoctoral associates and technical liaisons to foster community and collaboration, as well as promote scientific computing and open science best practices. Topics included data modeling, manipulation and visualization; collaborative and open science principles and techniques; geospatial analysis; and coding best practices.
Workshop Dates: June 27-30
2014
Open Science Codefest
Open Science Codefest was a participant-driven event that brought together computer programmers and environmental scientists, who typically work separately, to collaborate, problem solve, code, and share skills. The unconventional, unstructured “unconference” was inspired by “hack-a-thons,” and the format of the event allowed flexibility for organic work flows and synergies between attendees, many of whom had not worked together prior to the event. A multitude of topics were covered across a wide range of interests chosen by the participants.
Workshop Dates: September 2-4
Open Science for Synthesis 2014
In partnership with University of North Carolina’s Renaissance Computing Institute (RENCI), this unique training for early career scientists taught software and technology skills needed for open, collaborative, and reproducible synthesis research. Participants received hands-on, guided experience using best practices in the technical aspects that underlie successful open science and synthesis – from data discovery and integration to analysis and visualization, and special techniques for collaborative research.
Workshop Dates: July 21- August 8
Community Dynamics Toolbox
In collaboration with the National Science Foundation’s Advances in Biological Informatics (ABI) program, this workshop series aimed to improve infrastructure and capacity for scientists to create and share ecological community dynamics analyses and models through workflow systems. Participants created a toolbox that provides ecologists with tools for quantifying how communities change over time, helping to minimize data preparation and foster collaboration. Their approach built upon many informatics developments including Kepler, DataONE, and Ecological Metadata Language to advance ecological research.
Workshop Dates: February 5-7
2013
Summer Institute (a.k.a. Open Science for Synthesis)
This three-week intensive workshop in ecological analysis and synthesis gave participants hands-on experience using best practices in the technical aspects that underlie successful synthesis, including data discovery and integration, analysis and visualization, and special techniques for collaborative scientific research. The institute was supported generously by the Packard Foundation.
Workshop Dates: June 19 - July 10