+ All Categories
Home > Documents > Paycheck Income Data

Paycheck Income Data

Date post: 23-Feb-2016
Category:
Upload: sue
View: 35 times
Download: 0 times
Share this document with a friend
Description:
Paycheck Income Data. John Rae, Partner - Data and Product Development Simon Power, Principal Consultant HNDA Training for Practitioners, 6 th May 2014. Agenda. What is Paycheck. Introduction. S ources Using the geographic hierarchy Multi-method models. Method. Latest data ideas - PowerPoint PPT Presentation
Popular Tags:
36
John Rae, Partner - Data and Product Development Simon Power, Principal Consultant HNDA Training for Practitioners, 6 th May 2014 Paycheck Income Data
Transcript
Page 1: Paycheck  Income Data

John Rae, Partner - Data and Product Development

Simon Power, Principal Consultant

HNDA Training for Practitioners, 6th May 2014

Paycheck Income Data

Page 2: Paycheck  Income Data

2

Agenda

• What is Paycheck

• Sources• Using the geographic hierarchy• Multi-method models

• Latest data ideas• Keeping up to date

• Licence use• Opportunities

Introduction

Method

Innovations

Limitations

Page 3: Paycheck  Income Data

3

What is Paycheck?

Page 4: Paycheck  Income Data

4

Paycheck provides estimates of household income

UK wide estimates Mean Median Mode Distribution by income band

Full geographic detail Modelled down to full postcodes Potential to aggregate to any geographic area

Option to sub-divide incomes by lifestage

Page 5: Paycheck  Income Data

5

Data and modelling concepts

Page 6: Paycheck  Income Data

6

Modelling objective

To fit an appropriate statistical distribution to data on household incomes

To predict this distribution for all relevant geographic areas by means of the mean and standard deviation

Page 7: Paycheck  Income Data

7

Geographical hierarchy of the modelling

Page 8: Paycheck  Income Data

8

Outline of data inputs

1. Survey data Structured to be representative Sample will rarely include data points in a given local area

2. Large lifestyle database Unrepresentative collection method Large sample size will include data for most local areas

Surveys are good best nationally, lifestyle is better locally.

Page 9: Paycheck  Income Data

9

Survey Data

Survey Data

Step 1 – Bring the data up to date

Lifestyle data Living Costs and Food survey Sample 6,500 across the UK Available survey period typically

two years ago

Data Locator Group Covers 1.2 million individuals in

Scotland (UK 15 million) After cleaning we get data for

856,000 households in Scotland (UK 4.3 million)

We apply weighting to match the survey data at national level

We inflate to the present using Average Earnings time series

We bring the incomes up to current year using Average Earning change figures published by ONS

Page 10: Paycheck  Income Data

10

Step 2 - Establish the current UK earning profile

Take the household incomes measured by the survey Inflate these to current year figures Represent the distribution as bands of £5,000 Model incomes above £100,000 as an exponentially decaying distribution Transform the (percentile points of) the distribution to fit a standard

normal distribution

All subsequent modelling is conducted on normally distributed variables and a reverse transformation converts the model results back to real income values

Page 11: Paycheck  Income Data

11

Demographic modelling

Step 3 - Bayesian modelling approach

’Direct’ calculation Take a sample of lifestyle

data (representative of national socio-demographics)

Build linear regression models to estimate (transformed) income from the demographics

Apply to local areas based on local socio-demographics

Calculate incomes directly from the Lifestyle data

Create (local) correction factors for the initial model estimates in light of the actual scores

Repeat for the next (smaller) geographic level

Undo the transformation £

Page 12: Paycheck  Income Data

12

Points of Discussion

Why LCF as opposed to other surveys?

Why a UK model?

How is it kept up to date?

Page 13: Paycheck  Income Data

13

Data innovations

Page 14: Paycheck  Income Data

14

Places change….. and we can impute data about them

Page 15: Paycheck  Income Data

15

What is Acorn and why is it relevant to the income model?

Page 16: Paycheck  Income Data

16

Which is Ethel ? Which is Kayleigh?

Page 17: Paycheck  Income Data

17

The purpose of geodemographics

To analyse data and facilitate educated guesses.

Which channels fit which people? Where might it be more likely to find people with unhealthy lifestyles? Which people are using which of my services and in what manner?

Page 18: Paycheck  Income Data

18

49 Young families in low cost private flats50 Struggling younger people in mixed tenure51 Young people in small, low cost terraces52 Poorer families, many children, terraced housing53 Low income terraces54 Multi-ethnic, purpose-built estates55 Deprived and ethnically diverse in flats56 Low income large families in social rented semis57 Social rented flats, families and single parents58 Singles and young families, some receiving benefits59 Deprived areas and high-rise flats

34 Student flats and halls of residence35 Term-time terraces36 Educated young people in flats and tenements37 Low cost flats in suburban areas38 Semi-skilled workers in traditional neighbourhoods39 Fading owner occupied terraces40 High occupancy terraces, many Asian families41 Labouring semi-rural estates42 Struggling young families in post-war terraces43 Families in right-to-buy estates44 Post-war estates, limited means45 Pensioners in social housing, semis and terraces46 Elderly people in social rented flats47 Low income older people in smaller semis48 Pensioners and singles in social rented flats

21 Farms and cottages22 Larger families in rural areas23 Owner occupiers in small towns and villages24 Comfortably-off families in modern housing25 Larger family homes, multi-ethnic areas26 Semi-professional families, owner occupied neighbourhoods27 Suburban semis, conventional attitudes28 Owner occupied terraces, average income29 Established suburbs, older families30 Older people, neat and tidy neighbourhoods31 Elderly singles in purpose-built accommodation32 Educated families in terraces, young children33 Smaller houses and starter homes

1 Exclusive enclaves2 Metropolitan money3 Large house luxury4 Asset rich families5 Wealthy countryside commuters6 Financially comfortable families7 Affluent professionals8 Prosperous suburban families9 Well-off edge of towners10 Better-off villagers11 Settled suburbia, older people12 Retired and empty nesters13 Upmarket downsizers

It looks like this

Affluent Achievers1

Comfortable Communities3

Financially Stretched4

Urban Adversity5

A. Lavish Lifestyles

B. Executive Wealth

C. Mature Money

D. City Sophisticates

E. Career Climbers

F. Countryside CommunitiesG. Successful SuburbsH. Steady NeighbourhoodsI. Comfortable SeniorsJ. Starting Out

K. Student Life

L. Modest Means

M. Striving Families

N. Poorer Pensioners

O. Young Hardship

P. Struggling Estates

Q. Difficult Circumstances

Category

14 Townhouse cosmopolitans15 Younger professionals in smaller flats16 Metropolitan professionals17 Socialising young renters18 Career driven young families19 First time buyers in small, modern homes20 Mixed metropolitan areas Rising Prosperity2

Group Type

Page 19: Paycheck  Income Data

19

The new Acorn has revolutionised geodemographics

Peter SleightChair, The Association of Census Distributors'Tracking a decade of changing Britain‘, Market Research Society seminar, November 2013

All thit is relevant because…..

Page 20: Paycheck  Income Data

20

Registers of ScotlandLand RegistryNational Register of Social HousingFoI requests to LAD’sPublic register of HMO’s

Zoopla property portalsCACI lifestyle databasesHousing for the elderlyCACI High rise dwellings database

Data is derived by combining multiple sources

e.g. Local level housing type and tenure

Page 21: Paycheck  Income Data

21

Adding to the census

Variables no longer on the census

Identify likely locations of high rise buildings

WALK THE STREETS Create a database of addresses;

Social high rise (10+ storey) Social mid-rise (5-9 storey)

Page 22: Paycheck  Income Data

22

Data is derived by combining multiple sources

e.g. Local level family structure, occupation and affluence

CACI names and addressesCredit application age dataElderly-only accommodationEmma’s diary children databaseDWP claimant data

CACI lifestyle database UCL ethnicity imputationCompany directorsShareholdersStudents

Page 23: Paycheck  Income Data

23

Adding to the census

Replace the census - housing for specific categories of people

Improve the census

Page 24: Paycheck  Income Data

24

This approach realises a lot of address level data…

22mhouseholds where we

have detailed age data

21.5mhouseholds

where we have housing /

tenure data 10mhouseholds

where we have more detailed

socio-demographics

3mpeople in

HMO’s

600,000age-limited addresses

Page 25: Paycheck  Income Data

25

And produces a remarkable outcome…

Prior to the release of the census

Following the release of the census

• We built the segmentation without census inputs

• We linked to research surveys to form an insight test-bed

• We optimised across over 2,500 topics

• We added in the census and checked for change

• We found including the census made no difference to the structure of the segmentation

The approach appears to achieve the equivalent (for these geodemographic models) of having a census every year

Page 26: Paycheck  Income Data

26

As an illustration…

South Ayrshire’s first affordable housing in 30 years, the Somerset Road Development includes:

West of Scotland Housing Association’s development of 32 flats as part of a bigger development of 76 units.

Dawn Homes development of 44 homes for outright sale. Segmentation types..

49 Young families in low cost private flats 50 Struggling younger people in mixed tenure

Page 27: Paycheck  Income Data

27

Licencing – limitations and opportunities

Page 28: Paycheck  Income Data

28

Limitations and opportunities

End User Licence

Contractual restrictions on the use of the data

Council use

Third parties

Page 29: Paycheck  Income Data

29

Limitations and opportunities

Opportunities

Other CACI Datasets

Page 30: Paycheck  Income Data

30

Limitations and opportunities

Opportunities

Other CACI Datasets

Page 31: Paycheck  Income Data

Consumers.Locations.

Communities.

Individual

Postcode

AffluenceAffluence

DigitalDigital

Current Demographics

WorkforceACORN

Retail, Leisure & Financial Catchments

Public Transport Access Levels (PTAL)

Retail Spend Estimates

Online Spend Estimates

2011 Census

Out of Work Benefits

Retail, Leisure & Financial Outlets

Job Seekers Allowance

British Crime Survey

FRS: GFKNoP’s Financial Research Survey

Understanding Society

IrishACORN

TGI

Worker Spend Estimates

Rail Passengers

Tourist Spend Estimates

Hospitals/GPs/Schools/Libraries

Page 32: Paycheck  Income Data

KEY NAME DATA-SET KEY THEMES TYPE LEVEL DEMOGRAPHICSAFFLUENCE &

WEALTH LIFESTAGE FINANCE ATTITUDES BRANDS CHANNELDIGITAL

BEHAVIOUR PUBLIC SERVICES RETAIL

Acorn Lifestyle CHousehold Acorn Lifestage Cmy.Acorn Bespoke Segmentation CWellbeing Acorn Health & Wellbeing CFinancial ACORN Finance, geodemographics CEuroACORN Europe geodemographics CSocialScene ACORN Eating-out & Drinks Market CPayCheck Household Income MStreetValue House Value MeTypes Online Behaviour CPeople UK Affl uence, Lifestage CFresco Affl uence, Lifestage, Finance COcean Lifestyle, Affluence, Digital MRetail Dimensions Retail Market Analysis MTGI Media, Brands, Attitude SUnderstanding Society Longitudinal Research GSGfK FRS Financial Research SBritish Crime Survey Attitudes on Crime GSPublic Transport Access and Passenger Counts GService Locations Public Service Outlets GBenefits Claimants Details on Claimants G2011 Census Detailed, smal l area data GCurrent Demographics Detailed, smal l area data MSpend Estimates Consumer Expenditure MOutlets 503k Outlets MCatchments Gravity Models MWorkforce ACORN Lifestyle of Working Population CIrish ACORN Lifestyle (Eire) C

C = ClassificationKEY M = Modelled S = Survey G = Government = Postcode or > = Individual = Location = Area/Catchment = Anonymous= Household

Page 33: Paycheck  Income Data

33

Summary

• Seeking the best between national surveys and very local data

• Data techniques experts consider to be revolutionary

• Use within the HNDA• Options for other uses

Paycheck

Up to date

Usability

Page 34: Paycheck  Income Data

34

Questions

Page 35: Paycheck  Income Data

35

CACI Contact Details

Simon PowerT. 07977 522792

E. [email protected]

Page 36: Paycheck  Income Data

Recommended