Trends in Research, Research Data and Responses...Research has to be reproducible, so data is needed...

Post on 18-Jul-2020

0 views 0 download

transcript

PRESENTED BY

TrendsinResearch,ResearchDataandResponses

RossWilkinsonAustralianResearchDataCommonsCanberra,2019

2

Outline:

§ Someresearchtrends§ TheAustralianresearchdataexperience§ Somereflections

3

SomeResearchTrends:§ ScaleofProblem§ Complexity§ Translationofresearchforsocietybenefit§ Reproducibleresearch§ Highreliabilityresearch§ Openresearch§ LeaveittoAI!

4

5

From G. Boulton

§ Clinicaltrialwith100K+participation§ Dataisnot“controlled”

6

EquitableAccesstoSustainableDevelopment§ secureandresilientfoodsystems

supportedbysustainablemarineresourcesandagriculture

§ sustainablehealthandwellbeing§ inclusiveandequitablequality

education§ cleanair,waterandsanitation§ affordable,reliable,sustainable

energySustainableEconomiesandSocieties§ Sustainablelivelihoodssupported

bystrongfoundationsforinclusiveeconomicgrowthandinnovation

§ Resilienceandactiononshort-termenvironmentalshocksandlong-termenvironmentalchange

§ Sustainablecitiesandcommunities

§ Sustainableproductionandconsumptionofmaterialsandotherresources

HumanRights,GoodGovernanceandSocialJustice§ Understandandrespond

effectivelytoforceddisplacementandmultiplerefugeecrises

§ Reduceconflictandpromotepeace,justiceandhumanitarianaction

§ Reducepovertyandinequality,includinggenderinequalities.

7

8

Productivity & Irreproducibility

Paul et al. (Nature Rev. Drug Discov. 9, 203–214; 2010Calcoen D, Elias L, Yu X. (Nature Rev. Drug Discov. 14. 161-2; 2015

AIwillsolveit§ RecentstudiesofUSsentencingshowssystemicincreasedsentencingofAfricanAmericans

§ PoorpeoplepaymoreforphotocopypaperinNewYork

§ BestfacialrecognitioncompanyisChinese§ Googletranslateishighlyusable

§ Thedatamatters9

DataTrends:§ FAIRData§ OpenData§ DataCitation§ Publisherdatarequirements

§ DataJournals§ FundedFairData

§ DataforTranslation§ Trusteddata§ DataRepositories§ DataQuality§ TrustedDataRepositories

§ TheDataMarketplace

10

FAIRdataforOpenScience§ Findable– datashouldbepublishedwithapersistentidentifier

§ Accessible– itshouldhaveanopenlicense– asopenaspossible

§ Interoperable– usecommunityagreedformats,languageand vocabularies

§ Reusable– richprovenance,enablingusebeyondtheoriginalpurpose

11

DataConnectivity(International)§ Researchdataidentification- DataCite§ ResearcherIdentification– ORCID§ Researchpublicationidentification§ Researchprojectidentiifcation- RAID§ ResearchInstitutionidentification§ ResearchFunderIdentification§ Vendorsupport

12

SomeInternationalResponses:§ DataCite,Orcid,etc§ DataJournals§ Harmonisationofdatapoliciesbyfunders§ http://www.nature.com/sdata/policies/repositories

§ Internationaldomaininitiatives§ OpenDataInitiative§ ResearchDataAlliance§ EuropeanOpenScienceCloud 13

DataComplexity– integratingmanydatasources,frommanypartners

14

DataReliabilityforResearch§ Goodnews!§ IncreasingQAprocessesfromlabsgeneratingdataforresearch

§ IncreasingQAexpectationsforhealthresearch

§ Plentyofprotocols§ Industrialprocessesfordatageneration

15

The ACRF International Centre for the Proteome of Human Cancer

Agood(reliable)homeforsocialsciencesdata:

16

EuropeanOpenScienceCloud

https://rd-alliance.org/-https://twitter.com/resdatall

• The European Open Science Cloud (EOSC) aims to accelerate and support the current transition to more effective Open Science and Open Innovation in the Digital Single Market.

• It should enable trusted access to services, systems and the re-use of shared scientific data across disciplinary, social and geographical borders.

VisionResearchers and innovatorsopenly share data across technologies, disciplines, and countries to address the grand challenges of society.

MissionRDA builds the social and technical bridges that enable open sharing of data.

www.rd-alliance.org@resdatall

CC BY-SA 4.0

Theimportanceofdatafortranslation§ Researchhastobereproducible,sodataisneededasevidence

§ Researchdataneedssolidfoundationssodataforresearchmustbequalityassured

§ Researchdataisacommodityfortrust buildingincollaborations

§ Researchdataisanoutputofresearchsoisbestoptimizedformanyforms oftranslation

19

DataisTransformative§ Governmentsarenotinvestinginresearchdatatomakelifeeasierforresearchers

§ Investmentsinresearchdatatoenablesocietalproblemstobeaddressed

§ Thisrequiresdatatobeinaformthatallowsawidevarietyofuse

§ Australiahasbeeninvestinginthistransformationfor10yearsnow

20

FAIRdataopportunity/threat§ Publishersincreasinglyrequiringpublisheddata§ Fundersincreasinglyrecognisingresearchdataoutputs

§ FAIRdataprotectsresearchers§ FAIRdatabuildsreputation,partnership,legacy§ TheopportunityofFAIRdataistocompeteonideas,notonamonopolyofknowledge

21

DataValue§ Strongerresearch– newanswersinarichdataenvironment

§ Moreefficientresearch– seenext§ Moretrustworthyresearch- reproducibility§ Strongerpartnerships§ Moreindustryengagement– dataasatrustbuilder§ Strongerinternationalengagementonnationallysignificantproblems

22

TheValueofOpenData§ DataismorevaluableifFAIR§ Publicationsareareliablemeansofmakinginformationavailable

§ Datahastobereliable§ Ithastobeprovidedreliably– throughreliabledatarepositories

23

TheValueofOpenDataReportTheanalysisinthereportsuggeststhatthevalueofdatainAustralia’spublicresearchisatleast$1.9billionperannumandpossiblyupto$6billionperannum– at2012-13levelsofexpenditureandactivity.Itismorevaluableifitisavailablethroughappropriateresearchdatainfrastructuree.g.usersoftheBritishAtmosphericDataCentrereportanaverageof56%oftheirtimeworkingwithdata– thatdataisopenandwithappropriatetools.

24

Achievingvalue:thereisagap§ Researchdataasatrustbuildervs.commercialadvantage

§ Researchdatainfrastructureisnationallyandinstitutionallybased

§ Systemhastorecognise valuetomaintainvalue–culturechange

…tobridgethegap….

25

DataInfrastructureRequirements§ Technology– tosupportscale,complexity,veracity,uncertainty

§ Processes– tosupporttheinteractionofresearcherswithresearchsystems,inc.publication,licencing,attribution

§ Policy– todetermineappropriateoutcomesfortheexpenditureofpublicmoney

§ People– toensurethatresearchershaverelevantskills,anddataprofessionalsareabletoengage 26

ResearchDataParticipants§ Researchers:needbestdata,andsimplemeanstopublishandberecognised

§ Researchinstitutions:needstrongdataholdings,andmeansofpreservingdatavalue

§ Researchdatageneratingfacilities:needquality§ Nationalinvestors:needmeansofmaximisingnationalinvestment,innovationandjobs

§ International:needmeansofcooperationwithallrelevantpartners

Soneeds/valueareverydifferentinresearchsystem27

TheAustralianResponse§ Improveddatapolicyframework§ NCRIS§ Domaininitiatives§ InstitutionalInitiatives§ ARDC§ Emergingskillsintegratedapproach

28

NCRIS:AustralianInfrastructureApproach§ Stable for10years§ $AU150M/annum§ Investsincollaborative infrastructure§ Bothphysicalanddata§ Dataisinfrastructure§ Separatefromresearchfunding§ Substantialnationaldataassetscreated§ $20M/annumondataandcollaborationservices

29

AustralianResearchDataInfrastructureActivity

Nationally:§ CapturingdatavaluableoverlongperiodsinMarine,Astronomy,EarthSciences,Ecosystems…forawiderangeofresearchpurposes

§ Supportingthestorageofdata§ Supportingthemanagementofdata§ SupportingtheenhancementofdataPluslotsatresearchinstitutions

30

AustralianResearchDataCommons:§ Atransformational,sector-wideinitiative,workingwithsector,government,andindustrypartnerstobuildacoherentnationalandcollaborativeresearchdatacommons

§ Todeliveraworld-leadingdataadvantage,facilitateinnovation,fostercollaborationandenhanceresearchtranslation.

31

ARDCactivitiesinclude:§ Datastorageandcloudcompute

§ Virtuallaboratories§ Researchdatamanagementcapacity

§ Policysupport§ Communitydevelopmentsupport

32

§ Skillssupport§ ResearchInstitutionalEngagement

§ InternationalEngagement

§ Researchdatapublicationservices

…asystemsapproach

33

Reliability– Coherence– Integration– Policy– Processes– Skills

AnalysisFind/createdata&tools+

Assembledataandtools+Collaborate+Adopt/Adapt/Build+Computeoverdata+

=Produceresearchresults

Trusteddataandcollaboration DisseminationPublicationTranslation

Referencecollections

Findable,accessible,interoperable,reusable

UseandReuse

Store/Cloud /Compute

MetadataProvenanceCuration

Preservation

AuthenticationAccesscontrolVLprovisionCollaboration

tools

Idealresearchsystem

Virtuallaboratoriesandcollaborationplatforms

Discard

The research system through a data lens:

34

Reliability– Coherence– Integration– Policy– Processes– Skills

AnalysisFind/createdata&tools+

Assembledataandtools+Collaborate+Adopt/Adapt/Build+Computeoverdata+

=Produceresearchresults

Trusteddataandcollaboration DisseminationPublicationTranslation

Referencecollections

Findable,accessible,interoperable,reusable

UseandReuse

Store/Cloud /Compute

MetadataProvenanceCuration

Preservation

AuthenticationAccesscontrolVLprovisionCollaboration

tools

Idealresearchsystem

Virtuallaboratoriesandcollaborationplatforms

Discard

DataFor

Research

DataIn

Research

DataFrom

Research

2013-18changesMovedfromindividualdatamanagement/storage/computeinitiativesto§ Focusedonincreasingthevalueofdatatofunders,institutions,andtoresearchers

§ Focusedonthewholeoftheresearchsystem– anintegratedapproachtodata,datainfrastructure,skills,policy,andprocess

§ Strengthenedinternationalengagement– mainlythroughtheResearchDataAlliance 35

Reflections§ Ratherthanfocusingonthedata“problem”–focusonthedataassetscreatedthroughresearch

§ Thevalueofdataisnotstatic,itiseithermoreorlessvaluable

36

Reflections§ DataismorevaluableifitisFAIR,OpenandQualityAssured§ Ifitisopen– itisusedmore§ Ifitreproducibleitcanbeusedasevidence§ Ifitiswelldescribed– provenance– itcanbeusedformorepurposes

§ Ifithasexplicitlicences – UseOpenlicences asmuchaspossible

§ Ifitiseasytouse,becausecommonmetadataschemaareused

§ Makedataasopenaspossible!37

ReflectionsDataisnotatechnologyissueonly,ormainly:§ Ensurethateffortisputintoaprofessional dataworkforce

§ Appropriatepolicysettingsrequiringandrewardingbestdatapractice

§ Goodprocessesareputinplacetosupportpolicy§ Ensuretechnologysupportseasyresearchdatapartnerships– nationallyandinternationally

38

ReflectionsResearchpublicationsaretrusted.TrustedDataiscrucialforaworldofopenscience§ Ensurethattrusteddatarepositoryservicesareavailable

§ Payparticularattentiontoprovenance§ Buildagreementsonwhatconstitutestrusteddata§ Recognise thevalueoftrusteddataintheresearchsystem

39

Inconclusion:§ ThevalueofFAIRresearchdatathatisopenandqualityassuredisveryhigh

§ Thevalueisdifferentforresearchers,disciplines,institutions,nations,andthepublic

§ Usepolicy,skills,technologytogether– takea“wholeofsystem”approach

§ Itisveryimportanttoestablishinternationalpartnerships

40

41ThisworkislicensedunderaCreativeCommonsAttribution3.0AustraliaLicense

ARDCissupportedbytheAustralianGovernmentthroughtheNationalCollaborativeResearchInfrastructureStrategy(NCRIS).

Thank you