» 1
Geocoding of the Statistics Portugal Business Register and its integration with the
INSPIRE Annex III Buildings theme
Barcelona, 26th September 2016
INE/DMSI-GEO
» 2
• Business Register (BR) – Enterprise and Sampling frame
• Spatial Data Infrastructure (SDI)
  o Buildings Geographic Database (BGE)
  o National Dwellings Register (FNA)
• Building the BR Geographic Database (BRGD)
  o Geo processing methodology
• Joining BRGD and BGE/FNA
  o INSPIRE data specifications
The summary
» 3
• Enterprises are identified by a unique national number
• Activities from all NACE Rev. 2 sections are included
• All institutional sectors according to the European System of Accounts are included
• Micro, small and medium-sized enterprises
BR – Coverage
» 4
[Diagram: units covered by the BR]
Legal Units: Companies, Individuals, Public Administration, Non-Profit Institutions
Enterprise
Local units
Enterprise group
BR – Coverage
» 5
[Diagram: BR architecture]
BR: the management system of populations and samples
Components: Updating process, Quality Control, BR Access online, Data Warehouse, Data System Collection
BR – Architecture
» 6
• Each year, a new structural population frame is built
• All active enterprises are included
• All enterprises that ceased activity in the reference period are included
• All institutional sectors are included
• The sampling frame is obtained from this structural population frame
BR – Population Frame of surveys
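The inclusion rules above can be sketched as a simple filter. This is a minimal illustration only: the `Enterprise` record, its field names and the `build_structural_frame` helper are hypothetical and do not describe the actual BR systems.

```python
from dataclasses import dataclass
from datetime import date
from typing import Optional


@dataclass
class Enterprise:
    # Hypothetical record layout, for illustration only.
    enterprise_id: str
    institutional_sector: str
    cessation_date: Optional[date] = None  # None means the enterprise is still active


def build_structural_frame(register, period_start):
    """Keep active enterprises and those that ceased during (or after the start of)
    the reference period; enterprises that ceased before the period are excluded.
    No filter is applied on institutional sector, so all sectors are kept."""
    return [
        ent
        for ent in register
        if ent.cessation_date is None or ent.cessation_date >= period_start
    ]
```

A sampling frame for a given survey would then be drawn from the list returned by `build_structural_frame`.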
» 7
• Ministry of Justice: National Registry of Legal Persons
• Ministry of Finance: Corporate Tax, Income Tax
• SICAE system (a partnership information system on the NACE codes of legal units; Statistics Portugal is one of the three partners, alongside the Tax Authority and the Institute of Registration and Notary Affairs)
BR – Updating Process
Administrative Sources
» 8
• Each variable of the BR is linked to different information sources, ranked by degree of importance
• For each variable a cross-check is made by comparing the information provided by the source with the one contained in the BR
• For the economic variables, a growth rate is calculated and an acceptable range is established. Values that fall outside this range are subject to further analysis
BR – Updating Process
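The growth-rate check on economic variables can be sketched as follows. This is a hedged illustration: the function names and the default bounds are assumptions, not the thresholds or code used for the BR.

```python
def growth_rate(previous, current):
    """Relative change between the value held in the BR and the newly reported value."""
    if previous == 0:
        return None  # growth rate is undefined; treat as a case for manual review
    return (current - previous) / previous


def needs_review(previous, current, lower=-0.5, upper=1.0):
    """Flag a value whose growth rate falls outside the acceptable range.

    The bounds are illustrative assumptions, not the ranges used in practice.
    """
    rate = growth_rate(previous, current)
    return rate is None or not (lower <= rate <= upper)
```

For example, with these assumed bounds a turnover moving from 100 to 350 would be flagged for further analysis, while a move from 100 to 120 would be accepted.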
» 9
[Diagram: SDI components]
Data and Metadata
Human Resources
Standards
Institutional Partnership and Data sharing
Technology (Hardware and Software)
SDI
Spatial Data Infrastructure
» 11
Spatial Data
[Diagram: Census 2011 spatial data model linking BGRI census blocks, BGE Buildings (Residential; Other: Schools, Hospitals, Business, …), Roads (Road Segment, Road Segment Code), Building Code, CAOP (Portugal's Official Administrative Map), grid cells (GRID, Grid_ETRS89_LAEA_1K) and the FNA (National Dwellings Register). Counts shown in the diagram: 3 547 318; 1 600 150; 94 265.]
Geographical Features
• BGRI Census Blocks (polygons)
• BSA Road network (lines)
• BGE Buildings (points)
Spatial Data Infrastructure
» 12
• In 2011, Statistics Portugal constructed a national geographical database of all the georeferenced buildings from the 2011 Census
• This geographical dataset has been used to reference census data at point level and to support the creation of a National Dwellings Register (FNA)
• FNA is updated with data available from different sources: (1) surveys conducted by Statistics Portugal; (2) administrative sources
[Diagram: FNA timeline (t) for Buildings/Dwellings. 1st phase: construction from Census 2011. 2nd phase: update from administrative sources and INE surveys (Building license survey).]
National Dwellings Register
» 13
SIGINQ-IAP (BR): Company; Local Unit
SIGINQ-IE (FNA): Building (ED); Dwelling (UA)
SIGINQ-AGR (BAA): “Parcel”
Statistical Units
Information System – Type of statistical unit
» 14
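[Diagram: statistical units and their links (cardinality N) across the information systems, referenced to the GRID]
FNA (SIGINQ-IE): Building, Fraction, Dwelling (Household dwelling, Collective dwelling)
BR (SIGINQ-IAP): Legal Unit, Local Unit
BAA (SIGINQ-AGR): Farm, Farmer
Statistical Units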
» 15
The current action is integrated into:
• Statistics Portugal's strategy to improve the efficiency of the statistical process
• Grant: “Merging statistics and geospatial information in member states”
GOAL
• Implement a spatially enabled and quality-controlled point-based infrastructure for the production and delivery of BR statistics at all relevant geographic breakdown levels by means of data integration
Action
» 16
The address is the key element used to match the BR records, directly or indirectly, with the existing BGE, following a step-by-step approach based on locators capable of sequentially pinpointing the BR records.
A different mix of these locators was used depending on the cases processed.
BR Geo processing
MORADA_CP7: Complete address composed of type of road, name, number and 7-digit postal code
BSA_CP7_DTA, BSA_CP7_ESQ: Used over the BSA in order to overcome discrepancies in the address
MORADA_LOC_CP4: Uses the locality name and the 4-digit postal code
CP7: Based on the 7-digit postal code, a linear structure used to code each block façade, composed of the CP4 and 3 additional digits
CP4: Based on the 4-digit postal code, a polygonal structure used to code each postal distribution area
Methodology
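The step-by-step use of locators can be sketched as a cascade that tries the most precise key first and falls back to coarser ones. This is a simplified illustration under stated assumptions: the record fields, the key functions and the helper names are hypothetical, the BSA-based locators are omitted, and taking the first matching building as the CP7/CP4 position is a crude stand-in for whatever geometry the real process uses.

```python
# Locator names follow the slide; key functions and record fields are hypothetical.
# Postal codes are assumed to be strings such as "1100-053" (CP4 + 3 digits).
LOCATORS = [
    ("MORADA_CP7", lambda r: (r["road_type"], r["road_name"], r["number"], r["cp7"])),
    ("MORADA_LOC_CP4", lambda r: (r["locality"], r["cp7"][:4])),
    ("CP7", lambda r: r["cp7"]),
    ("CP4", lambda r: r["cp7"][:4]),
]


def build_indexes(reference):
    """For each locator, map its key to the first reference building carrying it."""
    indexes = {}
    for name, key_fn in LOCATORS:
        index = {}
        for ref in reference:
            index.setdefault(key_fn(ref), ref)
        indexes[name] = index
    return indexes


def geocode(record, indexes):
    """Try locators from most to least precise.

    Returns (locator_name, (x, y)) for the first hit, or (None, None) if no
    locator matches. Keeping the locator name records the precision achieved.
    """
    for name, key_fn in LOCATORS:
        hit = indexes[name].get(key_fn(record))
        if hit is not None:
            return name, (hit["x"], hit["y"])
    return None, None
```

Returning the name of the locator that matched makes it possible to report, for each BR record, how precisely it was positioned.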
» 18
Statistics Portugal responsibility: 5 themes
• I.3 Geographical Names
• I.5 Addresses
• III.1 Statistical Units
• III.2 Buildings
• III.10 Population Distribution – demography
Participation in 5 Thematic Working Groups (WG)
Implementation of the INSPIRE Framework
» 19
According to HALE: “The alignment is the mapping between source and target schemas. It defines relations between source and target entities (types or properties). Based on the defined relations a transformation is derived.”
Download: version 2.9.4 (2015-11-01), 32-bit and 64-bit versions for Windows, Mac OS and Linux
Harmonization
Implementation of the INSPIRE Framework
» 20
HALE Workflow
Transformation according to the target schema
1. Import source/target schemas
2. Import data
3. Define mapping rules
4. Export transformed data
5. Data validation
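HALE is operated through its graphical interface, so the following is not HALE code. It is a plain Python sketch of the underlying idea: an alignment is a set of relations between source and target properties, and a transformation is derived from it. All property names are assumptions for illustration (the target names are only loosely modelled on INSPIRE-style attribute names).

```python
# Each rule relates one source property to one target property via a function.
# Hypothetical names throughout; this is not HALE's API or alignment file format.
ALIGNMENT = [
    ("building_code", "inspireId", str),
    ("construction_year", "dateOfConstruction", lambda y: f"{int(y):04d}-01-01"),
    ("n_dwellings", "numberOfDwellings", int),
]


def transform(source_feature, alignment=ALIGNMENT):
    """Derive a target feature by applying every rule whose source property is present.

    Source properties without a rule are dropped, mirroring the fact that only
    mapped elements reach the target schema.
    """
    target = {}
    for source_prop, target_prop, fn in alignment:
        if source_prop in source_feature:
            target[target_prop] = fn(source_feature[source_prop])
    return target
```

Steps 3 and 4 of the workflow correspond to defining `ALIGNMENT` and running `transform` over every imported feature; validation against the target schema remains a separate step.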
» 21
Schema Explorer: allows you to view the structure of the source (left) and the target (right) schema in various ways and to define mappings between the elements of the schemas.
Properties View: displays information on the current selection.
Functions View: shows the available transformation functions, which can be used to define relations. Further information on a selected function will be displayed in the Properties view.
Alignment View: displays the current alignment per type relation and allows editing or removing mapping cells.
Error Log: gives you insight into the application's log messages.
Report List: provides an overview of the last completed processes.
HALE Interface
» 22
The BR Geographic Database is joined to the point-based components, BGE Buildings and FNA, with the aim of creating a unique geocoded national framework of Statistical Units to be used in the National Statistical System.
Joining BR and BGE / FNA
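One way to picture the join is as a key-based lookup from geocoded BR records to building points. The sketch below assumes, purely for illustration, that each geocoded local unit carries the code of the BGE building it was matched to; the field names and the helper are hypothetical.

```python
def join_local_units_to_buildings(local_units, buildings):
    """Attach building coordinates to each local unit via a shared building code.

    Returns (joined, unmatched) so that records without a building can be
    analysed or updated separately.
    """
    by_code = {b["building_code"]: b for b in buildings}
    joined, unmatched = [], []
    for unit in local_units:
        building = by_code.get(unit.get("building_code"))
        if building is None:
            unmatched.append(unit)
        else:
            joined.append({**unit, "x": building["x"], "y": building["y"]})
    return joined, unmatched
```

Keeping the unmatched records in a separate list makes the remaining workload explicit rather than silently dropping units.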
» 24
Data to be analysed / updated
THANK YOU
Department of Methodology and Information Systems
GeoInformation Unit