Building the Research Data Community of
Practice
Panelists
Andrew Johnson - DataQ Sherry Lake - Virginia Collaborative Data Management BootcampBrianna Marshall - Midwest Data Librarian SymposiumRegina Raboin - NECDMC/eSci portalBrian Westra - DataCure
Moderator: Cynthia Hudson-Vitale, Washington University in St. Louis
This project was made possible in part by the Institute of Museum and Library Services Sparks! Ignition Grant for Libraries SP-02-14-0020-14
Andrew JohnsonResearch Data Librarian
University of Colorado Boulder@PrezSeventeen
Academic Libraries and Research Data Services(Tenopir, Birch, and Allard, 2012):
● “Reassigning existing library staff is the most common tactic for offering RDS. This approach also needs to be supported with professional development for staff so they can gain the required expertise to provide the full range of RDS”
● “When libraries provide research data services related to reference, consultation, or instruction, those services are most likely to be offered by individual librarians or library staff members who are subject discipline specialists.”
How do we provide support for individuals in these situations?
The Need
● “Research data is intimidating!”
● “How can I take on research data support with so much else already on my plate?!”
● “I need practical tools to use to help researchers with their data”
But Also:
● “Helping faculty and students with their data is an increasingly important part of my liaison duties”
Librarian Feedback:
DataQ is a web-based collaborative tool to support librarians who are engaging in research data services by providing:
● An online space for assistance and collaboration
● A community of research data experts and subject librarians
● A knowledge base of practical data information for librarians
● Andrew Johnson, University of Colorado Boulder, PI
● Megan Bresnahan, Tufts University, Co-PI
● Joni Blake, GWLA
● Greg Monaco, GPN
● Ann Riley, ACRL
DataQ Advisory Board
● Kathleen Fear, University of Rochester● Elena Feinstein, Duke University● Cynthia Hudson-Vitale, Washington University in St. Louis● Erica Johns, Cornell University● Sherry Lake, University of Virginia● Stanislav Pejša, Purdue University● Sarah Pickle, The Claremont Colleges● Amanda Rinehart, The Ohio State University● Yasmeen Shorish, James Madison University● Cecilia Smith, Texas A&M University● Shea Swauger, University of Colorado Denver● Steve Van Tuyl, Oregon State University● Mary Whelan, Arizona State University● Christie Wiley, University of Illinois at Urbana-Champaign● Jackie Wirz, Oregon Health and Science University
DataQ Editorial Team
● Thea Atwood, University of Massachusetts Amherst● Carolyn Bishoff, University of Minnesota● Rebekah Cummings, University of Utah● Khue Dong, California State University, Long Beach● Christopher Eaker, University of Tennessee● Abigail Goben, University of Illinois at Chicago● Renaine Julian, Florida State University● Dessi Kirilova, Qualitative Data Repository● Chris Kollen, University of Arizona● Robert Olendorf, Los Alamos National Laboratory● Laura Palumbo, Rutgers University● Robert Sippel, Florida Institute of Technology● Dan Valen, Figshare
DataQ Project Volunteers
● Ask all of your questions related to research data and libraries athttp://ResearchDataQ.org!
● Many ways to participate:○ Ask questions completely anonymously○ Ask questions and provide your email address to just our team for
follow-up and clarification purposes○ Register for a free DataQ account so that questions you ask will be
attributed to your username○ Comment on all questions and answers (requires free account)○ Browse or search answers to existing questions○ Follow us on Twitter: @ResearchDataQ
DataQ Website
● ~18,000 site visits from ~4,000 users● 82 questions answered from these categories:
(Meta)DataQ
● About research data and libraries?: http://ResearchDataQ.org/ask
● About the DataQ website?: http://ResearchDataQ.org/contact
● About the DataQ project?: [email protected]
Questions?
Virginia Data Management Bootcamp
Sherry LakeScholarly Repository LibrarianUniversity of VirginiaMay 4, 2016
Building the Research Data Community of Practice
RDAP 2016
Virginia Data Management Bootcamp
2013 2014 2015 2016
Technology
Technology
Delivery:● 4VA Telepresence rooms ● ODU, VCU, W&M direct dial to “trunk” or back end● Webex (2015 & 2016)● Recorded Sessions
Content:● UVA Collab (Course management system)● Google Docs
○ Planning (notes, agenda)○ Content Delivery ○ Question/Answer notes
● Libguides○ Links to google docs, software, agenda, schedule
Content
2013
http://guides.lib.odu.edu/VADMBC/
Margaret E. Henderson, "Virginia Data Management Bootcamp: A Collaborative Initiative in Data Education" (April 9, 2015). University of Massachusetts and New England Area Librarian e-Science Symposium. Paper 5. http://escholarship.umassmed.edu/escience_symposium/2015/posters/5
PlanningVA DM Bootcamp Team
• Meetings starting the summer before • Debriefings after bootcamps• Assessments – data after 2014 & 2015
Local (UVA Example)• Marketing• Registration• Room scheduling/ Food (sponsors)• Volunteers• Video Release Forms• Preparing copies of Handouts
Assessment
MOST USEFUL SESSION
Intellectual property, copyright: The speaker was very thorough
and used plain English rather than "lawyer-speak".
MOST USEFUL SESSIONMethods to scrub messy data with OpenRefine. I had no idea
such things were possible and they will be so useful!
LEAST USEFULScholarly communication - nebulous subject
LEAST USEFULmetadata - i didn't really follow what they were talking about
ADDITIONAL COMMENTS SUGGESTIONS
I would say, keep doing this as it provides a great service to
researchers and librarians alike.
ADDITIONAL COMMENTS SUGGESTIONS
I felt that many of the topics did not go into depth
enough. I would have liked fewer topics, more in depth.
2014
MOST USEFUL SESSION any sessions where there were hands-on activities were good
I personally learned something new from each session and found them all to be very helpful.
Data Documentation and Metadata. My first time to learn the Meatdata concept.
LEAST USEFULmetadata - i didn't really follow what they were talking about
ADDITIONAL SUGGESTIONSBetter to have shorter more intense days. Maybe more days.Real data sets to work with.
Overall, how likely are you to recommend this training to your colleagues?78% likely/very likely
Length of bootcamp (2 full days, optional .5)- 59% too long
2015
Met expectations:92% agree/strongly agree
MOST USEFUL SESSIONIntellectual property, copyright: The speaker was very thorough and used plain English rather than "lawyer-speak".
OpenRefine/Data Wrangling
Hearing multiple perspectives from other universities
Methods to scrub messy data with OpenRefine. I had no idea such things were possible and they will be so useful!
LEAST USEFULScholarly communication - nebulous subject
ADDITIONAL COMMENTS SUGGESTIONSI felt that many of the topics did not go into depth enough. I would have liked fewer topics, more in depth.
I would say, keep doing this as it provides a great service to researchers and librarians alike.
The technology needs to be more stable so that all sites see and hear everything at the same time.
73% said length was just right
Overall, how likely are you to recommend this training to your colleagues?77% likely
Assessment2014
MOST USEFUL SESSION any sessions where there were hands-on activities were good
I personally learned something new from each session and found them all to be very helpful.
Data Documentation and Metadata. My first time to learn the Meatdata concept.
LEAST USEFULmetadata - i didn't really follow what they were talking about
ADDITIONAL SUGGESTIONSBetter to have shorter more intense days. Maybe more days.Real data sets to work with.
Overall, how likely are you to recommend this training to your colleagues?78% likely/very likely
Length of bootcamp (2 full days, optional .5)- 59% too long
2015
Met expectations:92% agree/strongly agree
MOST USEFUL SESSIONIntellectual property, copyright: The speaker was very thorough and used plain English rather than "lawyer-speak".
OpenRefine/Data Wrangling
Hearing multiple perspectives from other universities
Methods to scrub messy data with OpenRefine. I had no idea such things were possible and they will be so useful!
LEAST USEFULScholarly communication - nebulous subject
ADDITIONAL COMMENTS SUGGESTIONSI felt that many of the topics did not go into depth enough. I would have liked fewer topics, more in depth.
I would say, keep doing this as it provides a great service to researchers and librarians alike.
The technology needs to be more stable so that all sites see and hear everything at the same time.
73% said length was just right
Overall, how likely are you to recommend this training to your colleagues?77% likely
What’s Next?
• Involving more Virginia institutions
• Offer smaller workshops/ sessions throughout the year
• Larger collaboration (like DM Bootcamp) every-other year
• Do more non-workshop collaborations
http://guides.lib.odu.edu/VADMBC/
State-wide Community of Practice
Building the Research Data Community of Practice
University of Massachusetts Medical School
Regina Fisher Raboin
Associate Director, Library Education & Research
Lamar Soutter Library, University of Massachusetts Medical School
RDAP Summit 2016
E-Science Portal for New England LibrariansA Librarian’s Link to e-Science Resources
Background
Goals
• Library roles in e-Science/Data Science
• Fundamentals of domain sciences
• Emerging trends in supporting networked scientific research
Project
• Collaborative initiative• Education, Application, Professional Development
• Outreach, Connections, Dissemination
• In-depth resource for data management and literacy; research environment
Results
• e-Science Symposium 2009 – 2016
• Science Boot Camp 2009 – 2016• Portal 2010 - 2016• Journal of eScience Librarianship
2012 - 2016
E-Science Portal
New England e-Science Program
Community of Practice
180+ Librarians
Contributors
NN/LM NER
IMLS
BLC
NAHSL
Portal and Advisory Boards
DisseminationJournal of eScience
Librarianship
e-Science Community Blog
Twitter feeds
Tools/Resources
e-Science Portal
Content for RDM class
e-Science Thesaurus
Professional Development/EducationProfessional Development Day
e-Science Symposium
Science Boot Camp
Simmons GSLIS class
Course: How You Teach RDM
New England Collaborative Data Management Curriculum
(NECDMC)
New England Collaborative Data Management Curriculum (NECDMC)
Background
Goals
• Built on Frameworks for Data Management Curriculum (IMLS/NNLM NER)
• Present and future researchers, students, and librarians data management learning needs
• Address federal compliance/ open data requirements
Project
• Phase 1: Collaboration with other libraries
• Modules, lesson plans, instructional materials, research cases; module evaluations
• Workshop resources and community feedback created
Results
• Phase 2: Local piloting• Phase 3: Piloting beyond
UMMS• Evaluation, assessment,
feedback
NECDMC(CC-BY)
Assessment
NECDMC Pilot Sites
(22 sites)300+
Phase 1 - 20 questions, evaluations on
NECDMC site
Phase 2 – 12 questions, paper-based
Train-the-Trainers
(2 workshops, 2013 & 2014) 8011-15 questions per module, paper-based
handout
Flexible Clinical Experience (FCE) 1
Two electronic evaluation instruments
provided by School of Medicine (SOM):
one for instructors (8 quest.), one for
student
(22 questions, including open-ended)
Type of Education Attendees Assessment
Lessons Learned
NECDMCModular format; cases; website
with ready-to-go content
Too much content; updating
Creating MOOC
Train-the-TrainersDemonstrations of teaching cases
and teaching methods for different
audiences;
customizability/flexibility
Time concernsPre-class surveys and encouraging
customization
Flexible Clinical ExperienceIncorporation of a research
project; use of research data
management tools (e.g.
LabArchives, DMPTool);
collaboration with UMMS IT
and IRB
Time investment and prep for
a 1 week course; branding;
needed more sophisticated
content; one student
Incorporating local examples of data
management content and activities;
increase marketing for course
Future Vision
NECDMCMOOC (BD2K)
E-Science Program
Expand Nationally
Create Identity or Discipline
Potential Researcher
Portal
Roundtable Discussions
and Tools
Workshops
Selected ResourcesE-Science Portal for New England Librarians; See especially ‘Data Management’, ‘Data Literacy’, and ‘Professional Education’, http://esciencelibrary.umassmed.edu/
Integrating Data Management Tools into Research Data Management Instruction, Rebecca Reznik-Zellen, Lisa A. Palmer and Julie Goldman, Poster Session, University of Massachusetts and New England Area e-Science Symposium, April 6, 2016. http://escholarship.umassmed.edu/escience_symposium/2016/posters/8/
Ishida, Mayu. 2014. "The New England Collaborative Data Management Curriculum Pilot at the University of Manitoba: A Canadian Experience." Journal of eScience Librarianship 3(1): e1061. http://dx.doi.org/10.7191/jeslib.2014.1061
Kafel, Donna, Myrna E. Morales, Robert J. Vander Hart, Sally A. Gore, Andrew Creamer, Javier Crespo, and Elaine R. Martin. 2012. "Building an e-Science Portal for Librarians: A Model of Collaboration." Journal of eScience Librarianship 1(1): e1005. http://dx.doi.org/10.7191/jeslib.2012.1005
New England Collaborative Data Management Curriculum (NECDMC); See especially ‘Resources from Workshops’, ‘Community Bulletin Board’, and ‘Join the Collaboration’, http://library.umassmed.edu/necdmc/index
New England Region eScience Program (Data Management Curriculum and Data Tools Forum)https://www.youtube.com/channel/UCmwetQc2pcwAwJYPDPucJIg
Piorun, Mary E., Donna Kafel, Tracey Leger-Hornby, Siamak Najafi, Elaine R. Martin, Paul Colombo, and Nancy R. LaPelle. 2012. "Teaching Research Data Management: An Undergraduate/Graduate Curriculum." Journal of eScience Librarianship 1(1): e1003. http://dx.doi.org/10.7191/jeslib.2012.1003
Peters, Christie, and Porcia Vaughn. 2014. "Initiating Data Management Instruction to Graduate Students at the University of Houston Using the New England Collaborative Data Management Curriculum." Journal of eScience Librarianship 3(1): e1064. http://dx.doi.org/10.7191/jeslib.2014.1064
Read, Kevin, Andrew T. Creamer, Donna Kafel, Robert J. Vander Hart, and Elaine R. Martin. 2013. "Building an eScience Thesaurus for Librarians: A Collaboration Between the National Network of Libraries of Medicine, New England Region and an Associate Fellow at the National Library of Medicine." Journal of eScience Librarianship 2(2): e1049. http://dx.doi.org/10.7191/jeslib.2013.1049
Datacure
OriginsA ‘chance’ meeting: DigCCurr Professional Institute (2009/2010)
● Dianne Dietrich - Cornell● Susan Parham - Georgia Tech● Jake Carlson - Michigan● Brian Westra - Oregon● Barrie Hayes - UNC Chapel Hill
Foundations
● Trust● Confidentiality● Small group● Unaffiliated
Group dynamics
Outcomes Confluence wiki → Wordpress web site
Email list → Google Group
Informal practice → Code of Conduct
Events list Wiki page → Drupal events calendar
Outcomes Monthly conference calls
Conference proposals, articles
Data repository working group
Instruction working group
Letter to PLoS
Opportunities and Challenges80/20
Growing membership, and growing the membership
Volunteer leadership
Unaffiliated
Expansion in lists, interest and working groups, meetings
What’s next?
Discussion● How others can do the same or be involved, new initiatives, future
training
● Thoughts on how we can create, or whether or not we need, a more cohesive research data community of practice
● Where is there interest in a data services related topic - but little opportunity for support/training? (such as: reaching out to faculty, starting the conversations w/the research office, curating sensitive data, de-identifying/anonymizing data)
Thank you!
Andrew [email protected]
Sherry [email protected]
Regina Fisher [email protected]
Brianna [email protected]
Brian [email protected]
Cynthia [email protected]