+ All Categories
Home > Documents > Justin Hayes UK Data Service

Justin Hayes UK Data Service

Date post: 04-Jan-2016
Category:
Upload: tanisha-dickerson
View: 16 times
Download: 2 times
Share this document with a friend
Description:
Structural analysis of the aggregate outputs from the 2011 Census to develop alternative integrated multidimensional conceptual models of data and geographies for easier management and dissemination. Justin Hayes UK Data Service. What the census tells us workshop Manchester 23 July 2014. - PowerPoint PPT Presentation
Popular Tags:
31
Structural analysis of the aggregate outputs from the 2011 Census to develop alternative integrated multidimensional conceptual models of data and geographies for easier management and dissemination Justin Hayes UK Data Service What the census tells us workshop Manchester 23 July 2014
Transcript
Page 1: Justin Hayes UK Data Service

Structural analysis of the aggregate outputs from the 2011 Census to develop alternative integrated multidimensional conceptual models of data and geographies for easier management and dissemination

Justin Hayes

UK Data Service

What the census tells us workshop

Manchester

23 July 2014

Page 2: Justin Hayes UK Data Service

Making it easier for everyone to find, understand and use the bits of the census they’re interested in

Justin Hayes

UK Data Service

What the census tells us workshop

Manchester

23 July 2014

Page 3: Justin Hayes UK Data Service

Overview

• Traditional and integrated approaches• Work with 2011 outputs

• Integrated descriptive model• Integrated model of geographies

• Ongoing work with data producers

Page 4: Justin Hayes UK Data Service

Our job

• Find• Understand• Use• Automated systems with online interfaces• Online and interactive support• Main services now freely available to everyone

Page 5: Justin Hayes UK Data Service

Traditional tabular aggregate outputs

• Outputs conceived and specified as tables• Details of individual tables defined through consultation with

different user groups• Per-table categorisations and descriptions• Complex table universes and footnotes• Visual layout an important consideration• Extended metadata unattached

• Complex process!• Number of tables limited by resource available

• Numerous inconsistencies between tables• Effectively separate datasets

Page 6: Justin Hayes UK Data Service

Traditional tabular dissemination

Page 7: Justin Hayes UK Data Service

Traditional tabular dissemination

Page 8: Justin Hayes UK Data Service

Integrated aggregate outputs

• Deconstruct tables• Assemble and rationalise all variables and categories in tables• Variable-ise table universes and footnotes• Create a standardised library of variables to describe all data

• Define integrated models of characteristics (What?)and geographies (Where?)• Enables global operations/queries• Framework for Attachment of extended metadata• Facilitates description and transfer using standards

• Provide access via Web service API• Data becomes self-describing

Page 9: Justin Hayes UK Data Service

Integrated dissemination

Page 10: Justin Hayes UK Data Service

Variable combination selection

Page 11: Justin Hayes UK Data Service

Variable combination selection

Page 12: Justin Hayes UK Data Service

Category combination selection

Page 13: Justin Hayes UK Data Service

Area selection

Page 14: Justin Hayes UK Data Service

Data download

Page 15: Justin Hayes UK Data Service

InFuse

Page 16: Justin Hayes UK Data Service

Under the bonnet

• Integrated multidimensional descriptive model• Integrated model of geographies• The really important bits!

Page 17: Justin Hayes UK Data Service

InFuse 2011 release 2: Raw data

• England and Wales Local and Detailed Characteristics to output area level

• UK harmonised data to local authority level• 422 tables, mainly multivariate• 31 geography types• 241,334 areas• 11,311 files• 15Gb volume

Page 18: Justin Hayes UK Data Service

Integrated descriptive model

• Processing of raw metadata• Deconstruction, rationalisation and re-integration• Library of variables and categories• Re-insertion of data values• Attachment of associated metadata

• Global description using standards• Global operations via Web service API

• Data is self-describing• Enables lightweight, generic applications

Page 19: Justin Hayes UK Data Service

Benefits of this work

• Data producers• Efficient data management• Flexible output production• Best value

• Application developers• Easy access to self describing web services• Light weight generic applications

• End users• Quick and easy global search• Context along with data

Page 20: Justin Hayes UK Data Service

InFuse 2011 release 2: Processed data

• 97 variables• 2,501 categories• 281 variable combinations• 140 thousand category combinations• 4.6 billion values

• A 460Km high stack of sticky notes!• Anticipating approximately 10 billion values in all

Page 21: Justin Hayes UK Data Service

Integrated model of UK census geographies

• Assembly of raw information on geographies• 31 geography types• 241,334 areas (anticipating ~ 2 million including postcodes)• Direct and indirect hierarchies

• Simplified presentational model• 11 composite geography layers• Simplification of merged geographies in England and Wales

• Calculation of ‘missing’ data• Linkage between descriptive and geography models

• Partial availability of data for geographies and extents

Page 22: Justin Hayes UK Data Service

Raw admin and statistical geographies

Page 23: Justin Hayes UK Data Service

Admin and statistical geography layers

infuse.mimas.ac.uk/help/definitions/2011geographies

Page 24: Justin Hayes UK Data Service

What’s next for InFuse

• Interface improvements• Geography first option• Fine tune interface features• Select categories from more than one category combination• ‘Select all’ categories• Back button• Geography tree improvements (multiple hierarchies)

• User testing

Page 25: Justin Hayes UK Data Service

What’s next?

• More data• More comparable data

• Different data• Boundary and flow data

• More functionality• Personalisation, analysis and visualisation

• Public InFuse API• Work with statistical agencies?

• Machine-friendly data from source• Flexible generation with automated disclosure control?• Information on usage and contact with users

Page 26: Justin Hayes UK Data Service

What is the UK Data Service?

• a comprehensive resource funded

by the ESRC

• a single point of access to a wide range of secondary social science data

• support, training and guidance

Page 27: Justin Hayes UK Data Service

UK Data Service Census Support

• Specialist function of UK Data Service

• Access and support services for outputs from recent UK censuses

• Add value by making census outputs easy to find, understand and use

• Engagement with UK census agencies

• Long history of technological innovation in service development

• census.ukdataservice.ac.uk

Page 28: Justin Hayes UK Data Service

census.ukdataservice.ac.uk

Page 29: Justin Hayes UK Data Service

• Aggregate component of census outputs

Census Support at Manchester

Justin Hayes

Rob Dymond-Green

Richard Wiseman

Jamey Hart

Page 30: Justin Hayes UK Data Service

• Aggregate component of census outputs

Census Support at Manchester

Justin Hayes

Rob Dymond-Green

Richard Wiseman

Jamey Hart

Page 31: Justin Hayes UK Data Service

Give InFuse a go!

infuse.mimas.ac.uk

•Comments, questions and ideas welcome•[email protected]


Recommended