RDAP 15: Treating data like data: Unifying data processing workflows for datasets in the IR

Date post: 16-Jul-2015
Upload: asist
Transcript
Page 1: RDAP 15: Treating data like data: Unifying data processing workflows for datasets in the IR

treating data like data: unifying data processing workflows for datasets in the IR

Steve Van Tuyl - @badgerbouse

Data and Digital Repository Librarian,

Oregon State University

#WorstTalkEver

Page 2

• introduction

• the setup

• phase 1: new definitions

• phase 2: what to expect

• lessons learned

Page 3
Page 4
Page 5
Page 6
Page 7
Page 8

1.9 gb

Page 9
Page 10

data = data

phase 1: new definitions

Page 11
Page 12

“I was a little daunted by the documenting mentioned in the last e-mail, as I am starting a new PhD program, and have lots of responsibilities there.”

- The Perpetrator

Page 13

iterate

Page 14

At least for the next couple of years, until the RDM community has made such an impact that incoming graduate students know how to manage data from the start

Page 15

LOL, JK

Page 16

iterate

encourage

tattle

Page 17

phase 2: what to expect

Page 18
Page 19

93 theses & dissertations

Page 20

45% excel

Page 21

22% images

Page 22

25% documents

Page 23

25% other “data”

text

database

statistical

Page 24

23% code

15% executables

Page 25

12% “metadata”

Page 26

33% of excel have:

linked info

charts

macros

Page 27

30% unknown

unopenable

obsolete

Page 28

3% missing data
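
The percentages on the preceding slides add up to more than 100, which would be expected if each figure is the share of deposits containing at least one file of that kind. The slides do not say how the figures were computed, so the following is only a minimal sketch under that assumption. The extension-to-category map, the function names, and the sample deposits are all hypothetical and are not taken from the talk.

```python
from collections import Counter
from pathlib import PurePosixPath

# Hypothetical extension-to-category map; the talk does not give its own mapping.
CATEGORIES = {
    ".xls": "excel", ".xlsx": "excel",
    ".jpg": "images", ".png": "images", ".tif": "images",
    ".doc": "documents", ".docx": "documents", ".pdf": "documents",
    ".py": "code", ".r": "code", ".m": "code",
    ".exe": "executables",
}


def categorize(filename):
    """Return the category for a filename, or "unknown" if the extension is unmapped."""
    ext = PurePosixPath(filename).suffix.lower()
    return CATEGORIES.get(ext, "unknown")


def share_by_category(deposits):
    """Percent of deposits that contain at least one file of each category.

    deposits maps a deposit identifier to a list of filenames. A deposit with
    several kinds of files counts once toward each kind, so the shares can
    sum to more than 100.
    """
    counts = Counter()
    for files in deposits.values():
        # Use a set so each category is counted once per deposit.
        counts.update({categorize(name) for name in files})
    total = len(deposits)
    if total == 0:
        return {}
    return {category: round(100 * n / total) for category, n in counts.items()}


if __name__ == "__main__":
    # Made-up sample deposits, for illustration only.
    sample = {
        "thesis-a": ["results.xlsx", "figure1.png"],
        "thesis-b": ["tables.xls"],
        "thesis-c": ["readings.dat"],
        "thesis-d": ["model.exe", "summary.XLSX"],
    }
    for category, percent in sorted(share_by_category(sample).items()):
        print(f"{percent}% {category}")
```

Counting per deposit rather than per file keeps one very large deposit from dominating the totals. A per-file tally would be an equally reasonable reading of the slides.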

Page 29

?

Page 30

definitions

Page 31

“ScholarsArchive@OSU promises to ensure that the following common file formats (among many others) are useable in the future, using whatever combination of techniques (such as migration, emulation, etc.) is appropriate given the context of need”

- someone 10 years ago

promises, promises

Page 32

nothing ever changes

Page 33

new baseline

Page 34
