Workshopping data curation for digital humanities projects

Linking cultures

Increasingly DH researchers have greater access to funding to support large-scale multi-partner projects with diverse digital assets. A lack of formal training opportunities for data curation in multi-site DH teams means that the data produced in these teams is in danger of being lost! This workshop is a good preparation for researchers who must create a data management plan to comply with funding agency requirements.

Making research data public

May 21 and May 28, 2021
Part of the
Digital Humanities Summer Institute : Technologies East 2021
(known as DHSITE)

Linking cultures Digital humanities banner

This workshop will cover all areas of data management including: IP permissions and informed consent, data collection, metadata standards, file sharing, preservation (data deposit), and data sharing through the open data spectrum of access.

Participants will work on their own data curation challenges in break-out sessions and with reference to case study examples presented by a panel of DH scholars and digital asset management specialists:

  • Constance Crompton (University of Ottawa),
  • Karis Shearer (University of British Columbia Okanagan Campus),
  • Matthew Lincoln (Carnegie Mellon University),
  • Mikhel Proulx (Concordia University and Indigenous Digital Art Archive)

This workshop was originally developed and delivered at the 2019 Spoken Web Symposium: Resonant Practices In Communities of Sound. It draws significantly upon cases and RDM processes developed and in continued development, across the SpokenWeb research network.

Questions? Email Chantal Ripp, Research Data Management Librarian (interim) & Felicity Tayler, Interim Head, Research Support (Arts and Special Collections): [email protected]

Speakers: Speaker biographies (PDF, 238KB)

Facilitators: René Duplain, Roxanne Lafleur, Pascale Dangoisse, Fatoumata Bah, Sarah Simpkin, Chantal Ripp, Felicity Tayler, and Marjorie Mitchell.


Day 1: In addition to the plenary sessions for the whole group, 2 breakout sessions will be held where participants will work on their own data curation challenges. The same cohort of attendees will attend the breakouts together to ensure a continuity of the conversation.

The plenary session will be offered in English with simultaneous interpretation in French. The workshops and breakout sessions will be offered in English only.

Day 2: The participants along with the panel of DH scholars and digital asset management specialists will reconvene to co-develop a primer for DH researchers.

The workshops and breakout sessions will be offered in English. No simultaneous translation will be offered

May 21 2021 12:00 pm – 16:00 pm (Eastern Standard Time)

Hour Program
12:00 – 12:15 

Welcoming remarks, with a view toward a collaboratively authored DH data primer

Speaker: C. Ripp

12:15 – 13:15

Roundtable panel: “Learning from examples” DH data curation successes (and failures)

  • Constance Crompton: LINCS: Linked Infrastructure for Networked Cultural Scholarship
  • Karis Shearer: Press Play: Making Spoken Web Research Data Public
  • Matthew Lincoln: The Labor Behind DH Data Complexity
  • Mikhel Proulx: Indigenous Data Sovereignty and Ethical Allyship in the Archive

Moderator: F. Tayler




What is your data? Introduction to data curation key concepts [Presentation materials]

Participants will learn about how to recognize their digital humanities data, and why it is important to map their data work flow so they can make their data public!

Introduction to data management plans in international contexts (DMPOnline (UK), DMPTool (USA); DMP Assistant (Canada)


What is Your Data Flow and Discovery Model? [Data Flow and Discovery Model handout]

In a breakout session, participants work individually or in a group.
Participants map their research data at different phases of the project.




How to (ethically) make your data public [Presentation materials]

Participants will learn how to map their data into 5 categories of access from secure & protected to open license

Intro to Data Papers & Data Journals


What is Your Spectrum of Data Access? [Data Flow and Discovery Model handout]

In a breakout session, participants map their data flow models onto 5 categories of access from secure & protected to open license so that outputs such as linked open data, podcast, tutorials, exhibitions and other forms of knowledge mobilization can happen!


Sharing & Feedback session

Break out group facilitators report on themes, sticking points or revelations in their breakout groups.
Discussion on what would be useful in a data primer (define data primer).


Closing Remarks

May 28 2021 12:00 pm – 15:00 pm (Eastern Standard Time)

Time Program


Welcoming Remarks


Highlights from workshop day 1




Data primer overview: How collaborative editing will work

Data Primer Full Manuscript [View only]

  1. What is Data Curation for Digital Humanists?
  2. Consent
  3. Data Collection
  4. Data Processing
  5. Critical Analysis
  6. Sharing & Preservation


Breakout groups:
review/comment/edit assigned sections




Reconvene for Q&A and feed-back


Breakout groups: review/comment/edit assigned sections


Wrap-up: Next steps

Making research data public: workshopping data curation for digital humanities projects is supported in part by funding from the Social Sciences and Humanities Research Council.

chat loading...