Post on 14-Mar-2020
transcript
Building Enterprise Reporting that Stands the Test of Time
Data Architect
Email: wcrayger@hitachisolutions.com
William Crayger
#HSCCATLANTA19
Breakout Track 3: Analytics and AI
Introduction
Data Discovery
Data Structuring
Data Cleansing
Data Enrichment
01
02
03
04
05
C O N T E N T S
Data Validation
Data Model Reusability
Enterprise Branding
Data Visualization
Wrapping up
06
07
08
09
10
/ 5Think Digital Customer Conference 2019
Introduction
• 5 steps to wrangling data together• Discovery• Structuring• Cleaning• Enriching• Validating
• The power of data visualization
/ 6Think Digital Customer Conference 2019
Data Discovery
• Have a deep understanding of the data• What are the attributes to be used for filtering and grouping?• Are there common attributes between datasets?• What are the fields that will be aggregated?
• Wrangling needs to be done in a specific manner based on some criteria which could demarcate and divide the data accordingly
/ 7Think Digital Customer Conference 2019
Data Structuring
• Data will often come to you in an unstructured, haphazard manner
• Based on criteria identified in data discovery, restructure and prepare the data to a format conducive to your reporting needs
• One column may become multiple, multiple columns may be concatenated together to form key fields
/ 8Think Digital Customer Conference 2019
Data Cleansing
• All datasets are sure to have some outliers, which can skew the results of the analysis
• Have business rules in place to dictate how outliers and null values are dealt with
• Formatting should be standardized to ensure higher quality analysis
/ 9Think Digital Customer Conference 2019
Data Enrichment
• Develop a strategy regarding augmentation of your data• Are there additional data sources that need to be brought in to your
model, or perhaps creation of tables via one of the native languages such as date dimensions
• Determine if new data can be derived using existing datasets• Creation of derived tables using existing data to enable filtering, or
reporting across multiple subject areas / grains
/ 10Think Digital Customer Conference 2019
Data Validation
• The last step is to verify the consistency, quality, and security of the data
• Usually be done programmatically
• Validate fields are accurate across new derived content, and aggregations are correct across grains
/ 11Think Digital Customer Conference 2019
Model Reusability
• Power BI models can be used as a source for new reports, acting as an interim step towards Analysis Services
• Enables both enterprise and self-service BI
• Ensures standardization across downstream reports
/ 12Think Digital Customer Conference 2019
Data Visualization
• Primary objective of report building is to create accurate, actionable reports that are legible and visually appealing
• Understanding your data is vital to deciding how to best tell the story
• Comparison – Which regions bring the most revenue?• Composition – How is our revenue divided by line of business?• Relationship – Is there correlation between sales and sales rep tenure?• Distribution – How are exam scores distributed?
/ 13Think Digital Customer Conference 2019
Which story would you like
to tell?
Composition
Comparison
DistributionRelationship
Two variables
Three variables
Between items Over time
Scatter Plot
Line Histogram
Bar Histogram
Box-and-WhiskerPareto ChartPolar/Radar PlotLine ChartGeo MapBullet ChartBar Chart
Scatter Plot
Bubble Chart
Waterfall Tree/Heat Map Funnel Chart 100% Stacked Bar Stacked Bar 100% Stacked Area Stacked Area Chart
Word Cloud
Two variables
Single variable
Many data points
Few data points
Known outliers
StaticChanging over time
Few periods Many periods
Relative differences
Relative & absolute differences
Relative differences
Relative & absolute differences
Onevariable
Qualitative ranges
Geographical locations
Share of total
Accumulation to total
Share & differences
Progression through phases
One or several categories
Cyclical data
With distribution
Network Chart Between matrix items
With running total
Pie/Donut Chart
Adapted from: ©A. Abela, 2010. www.ExtremePresentation.com
/ 14Think Digital Customer Conference 2019
UI / UX Design and Enterprise Report Branding
• Design with your C-suite in mind• Flow should move from high level KPI’s with ability to navigate to lowest
level of detail• Adding layers via shapes or other objects increases readability and appeal
• Branding can be accomplished in several ways• Custom theme using marketing color palette• Company logos
/ 15Think Digital Customer Conference 2019
Wrapping Up
• Successful engagements are two prong• Strong data model as the foundation• Accurate, actionable, reports to drive business
• Know and understand your data, then decide how best to tell the story
/ 16Think Digital Customer Conference 2019
Questions?
• Linkedin: https://www.linkedin.com/in/wcrayger/
• Power BI Blogs• Power BI UX Design Tips• Power BI Best Practices and Design Considerations• Power BI Tips and Tricks