+ All Categories
Home > Documents > Towards a Better Integration of Survey and Tax Data in the Unified Enterprise Survey Claude Turmelle...

Towards a Better Integration of Survey and Tax Data in the Unified Enterprise Survey Claude Turmelle...

Date post: 14-Dec-2015
Category:
Upload: ella-huddle
View: 213 times
Download: 0 times
Share this document with a friend
Popular Tags:
24
Towards a Better Towards a Better Integration of Survey and Integration of Survey and Tax Data in the Unified Tax Data in the Unified Enterprise Survey Enterprise Survey Claude Turmelle Statistics Canada ICES-III Montréal, Québec, Canada June 18-21, 2007
Transcript
Page 1: Towards a Better Integration of Survey and Tax Data in the Unified Enterprise Survey Claude Turmelle Statistics Canada ICES-III Montréal, Québec, Canada.

Towards a Better Integration of Towards a Better Integration of Survey and Tax Data in the Unified Survey and Tax Data in the Unified

Enterprise SurveyEnterprise Survey

Claude Turmelle

Statistics Canada

ICES-III Montréal, Québec, Canada

June 18-21, 2007

Page 2: Towards a Better Integration of Survey and Tax Data in the Unified Enterprise Survey Claude Turmelle Statistics Canada ICES-III Montréal, Québec, Canada.

2

Outline

Overview of the UESCharacteristics of the target populationCurrent use of tax data

At samplingAt imputationAt estimation

Issues and ChallengesTowards a better use of tax dataConclusion

Page 3: Towards a Better Integration of Survey and Tax Data in the Unified Enterprise Survey Claude Turmelle Statistics Canada ICES-III Montréal, Québec, Canada.

3

Overview of the UES

Unified Enterprise Survey (UES) started in 1997Objectives

Integrate all annual business surveys into one unified survey frameworkTo produce quality financial and commodity estimates

National and sub-national levelsIndustrial levels

Page 4: Towards a Better Integration of Survey and Tax Data in the Unified Enterprise Survey Claude Turmelle Statistics Canada ICES-III Montréal, Québec, Canada.

4

Overview of the UES

Target populationAll Canadian businesses within the covered industriesThe UES is an Establishment based survey

Coverage over time1997: Seven Industries1998: Sixteen more (including Wholesale)1999: Four more (including Retail)2000: Four more (including Manufacture)….2007: Now covers over 60 major industries

Page 5: Towards a Better Integration of Survey and Tax Data in the Unified Enterprise Survey Claude Turmelle Statistics Canada ICES-III Montréal, Québec, Canada.

5

Characteristics of the Target Population

Divided into two main types of businesses: unincorporated (T1) and incorporated (T2)

General Index of Financial Information (GIFI) data are available electronically for the entire T2 population

T1 data are only available electronically for about half the T1s (e-filers)

Page 6: Towards a Better Integration of Survey and Tax Data in the Unified Enterprise Survey Claude Turmelle Statistics Canada ICES-III Montréal, Québec, Canada.

6

Characteristics of the Target Population

An enterprise is Complex: Multi-provincial and/or Multi-industry and/or Multi-legal Simple: The opposite

An enterprise is alsoSingle: Only one establishment Multi: More than one establishment

Simple-Single enterprises represent about 95% of the population, although only about 40% of the economy

Page 7: Towards a Better Integration of Survey and Tax Data in the Unified Enterprise Survey Claude Turmelle Statistics Canada ICES-III Montréal, Québec, Canada.

7

Current Use of Tax Data

Why would someone use tax data?Improve efficiency of the sample design

Reduce the response burden

Reduce the collection cost

Improve quality of the estimates

Page 8: Towards a Better Integration of Survey and Tax Data in the Unified Enterprise Survey Claude Turmelle Statistics Canada ICES-III Montréal, Québec, Canada.

8

Current Use of Tax Data

At samplingSome key variables taken from different tax files are put on the sampling frame

Total Revenue, Total Expenses from GIFI

Total Sales from Goods & Services Tax (GST)

Salaries & Wages, # Employees from Payroll Deductions (PD7)

Used to define a size measure (Total Revenue) for each establishment on the frame

Used to stratify the population by size and to define the Take-None (T-N) portion

Page 9: Towards a Better Integration of Survey and Tax Data in the Unified Enterprise Survey Claude Turmelle Statistics Canada ICES-III Montréal, Québec, Canada.

9

Current Use of Tax Data

At imputationUsed to replace survey data (financial variables) for a predetermined sub-sample of selected Simple-Single units

Also used to replace survey data for some non-respondents

Used as auxiliary data during imputation

Page 10: Towards a Better Integration of Survey and Tax Data in the Unified Enterprise Survey Claude Turmelle Statistics Canada ICES-III Montréal, Québec, Canada.

10

Current Use of Tax Data

At estimationGIFI data are used to produce estimates for all T2 units falling in the T-N portion

T1 e-filer data are used to produce estimates for all T1 units falling in the T-N portion

Page 11: Towards a Better Integration of Survey and Tax Data in the Unified Enterprise Survey Claude Turmelle Statistics Canada ICES-III Montréal, Québec, Canada.

11

UES Survey Design at a GlanceT2

T2 Take-None:

Census of GIFI

EXCLUSION THRESHOLD

Main sample to be surveyed

For variables available from tax:

Total estimate = Survey estimate (T1,T2) + T2 Take-None + T1 Take-none e-filer estimate

For variables not available from tax (Characteristics):

Total estimate= Survey estimate (T1, T2)

Not eligible for tax : full questionnaire

Tax replaced

Characteristic quest. (services surveys) or full questionnaire (other surveys)

T1

Main sample to be surveyed

T1 Take-None:

Sample of e-filers

Page 12: Towards a Better Integration of Survey and Tax Data in the Unified Enterprise Survey Claude Turmelle Statistics Canada ICES-III Montréal, Québec, Canada.

12

Issues and Challenges

At samplingSometimes we get inconsistent tax data

Ex: GIFI Total Revenue=$2M

GST Total Sales=$25M

What do we do?We use a conservative approach, i.e. we take the maximum

We manually verify and adjust the extreme cases (we’ll make use of survey data if available)

Page 13: Towards a Better Integration of Survey and Tax Data in the Unified Enterprise Survey Claude Turmelle Statistics Canada ICES-III Montréal, Québec, Canada.

13

Issues and Challenges

At sampling (cont’d)Sometimes all we get is # Employees or Salaries & Wages (Revenues = . or $0)

What do we do?We model Total Revenue using what’s available

Page 14: Towards a Better Integration of Survey and Tax Data in the Unified Enterprise Survey Claude Turmelle Statistics Canada ICES-III Montréal, Québec, Canada.

14

Issues and Challenges

At imputationSometimes we can’t find the link to tax data (ex.: not-for-profit organizations)

Sometimes we link to 2 or more tax files

We currently use direct tax replacement (i.e. Ysurvey = Xtax). Should we instead use a modelling approach (i.e. Ysurvey = f(Xtax)?

Studies have shown that in some cases it might be more appropriate to use f(X)

Page 15: Towards a Better Integration of Survey and Tax Data in the Unified Enterprise Survey Claude Turmelle Statistics Canada ICES-III Montréal, Québec, Canada.

15

Page 16: Towards a Better Integration of Survey and Tax Data in the Unified Enterprise Survey Claude Turmelle Statistics Canada ICES-III Montréal, Québec, Canada.

16

Issues and Challenges

At estimationCurrently, we use the one-phase Horvitz-Thompson estimator

It’s a very simple, and fairly efficient estimatorUnfortunately, it could be severely biased if the model y = x doesn’t hold

unitsreplacedTaxforx

irequestionnathroughcollectedunitsforyywhere

ywY

i

ii

si

ii

*

*1

ˆ

Page 17: Towards a Better Integration of Survey and Tax Data in the Unified Enterprise Survey Claude Turmelle Statistics Canada ICES-III Montréal, Québec, Canada.

17

Issues and Challenges

At estimation (cont’d)Estimates for variables not available from tax file (characteristics/commodity) do not cover the T-N portion

For some characteristics the T-N portion can count for a lot more than 10%

Page 18: Towards a Better Integration of Survey and Tax Data in the Unified Enterprise Survey Claude Turmelle Statistics Canada ICES-III Montréal, Québec, Canada.

18

Issues and Challenges

Data qualityResponse rates (What is a respondent?)

Respond to tax but not to the characteristic questionnaire

Reported tax data vs imputed tax data

Planned tax replacement vs tax replacement for non-response

Variance & CVA lot of imputation occurs in the current strategy (incl. tax replacement)

Shouldn’t we include the variance due to imputation?

Page 19: Towards a Better Integration of Survey and Tax Data in the Unified Enterprise Survey Claude Turmelle Statistics Canada ICES-III Montréal, Québec, Canada.

19

Towards a Better Use of Tax Data

Understand the particularities of the different tax data sources (ex.: GST vs T2 is currently under investigation)

Explore different administrative files to help with particular sub-populations (ex.: not-for-profit organizations)

Page 20: Towards a Better Integration of Survey and Tax Data in the Unified Enterprise Survey Claude Turmelle Statistics Canada ICES-III Montréal, Québec, Canada.

20

Towards a Better Use of Tax Data

Keep investigating why Ysurvey ≠ Xtax even when they should conceptually be equal

Explore the idea of using Ysurvey = f(Xtax)

Fine-tune our definition of who is eligible for tax replacement and who is not

Currently studying the possibility of using a more robust estimator to protect against the potential bias

Developing a strategy to cover the entire population for all variables of interest

Page 21: Towards a Better Integration of Survey and Tax Data in the Unified Enterprise Survey Claude Turmelle Statistics Canada ICES-III Montréal, Québec, Canada.

21

Start taking into account the variability introduced by imputation when computing variances and CVsA framework is under development to define response rates when both tax data and survey data are used for the same unitsExplore the possibility of making use of all the GIFI data, not only for the T-N and the sample

Towards a Better Use of Tax Data

Page 22: Towards a Better Integration of Survey and Tax Data in the Unified Enterprise Survey Claude Turmelle Statistics Canada ICES-III Montréal, Québec, Canada.

22

Towards a Better Use of Tax DataT2

T2 Take-None:

Census of GIFI

EXCLUSION THRESHOLD

Main sample to be surveyed

For variables available from tax:

Total estimate = Survey estimate (T1,T2) + T2 Take-None + T1 Take-none e-filer estimate

For variables not available from tax (Characteristics):

Total estimate= Survey estimate (T1, T2)

Not eligible for tax : full questionnaire

Tax replaced

Characteristic quest. (services surveys) or full questionnaire (other surveys)

T1

T1 Take-None:

Sample of e-filers

Eligible Ineligible

Page 23: Towards a Better Integration of Survey and Tax Data in the Unified Enterprise Survey Claude Turmelle Statistics Canada ICES-III Montréal, Québec, Canada.

23

Conclusion

Since the introduction of the UES, the use of tax data has increased consistently

It has significantly reduced response burden and the cost of the survey

Unfortunately, sometimes at the expense of a reduced data interpretability

Fortunately, it was recently decided that we would take a few steps back to evaluate how we currently do things, and to determine how we could improve our strategy

Page 24: Towards a Better Integration of Survey and Tax Data in the Unified Enterprise Survey Claude Turmelle Statistics Canada ICES-III Montréal, Québec, Canada.

Pour plus d’information, veuillez contacter

For more information please contact

Visit our web site atwww.statcan.ca

Claude Turmelle(613) 951-3327

[email protected]


Recommended