Crossing the Longest Yard: Eight Strategies for Creating Knowledge from a Glut of Data

Post on 02-Jan-2016

32 views 5 download

Tags:

description

Crossing the Longest Yard: Eight Strategies for Creating Knowledge from a Glut of Data. Dr. David L. Hall School of Information Sciences and Technology The Pennsylvania State University Sonya A. H. McMullen Teach Reach Inc. The Evolving Problem. - PowerPoint PPT Presentation

transcript

Crossing the Longest Yard: Crossing the Longest Yard: Eight Strategies for Creating Knowledge Eight Strategies for Creating Knowledge

from a Glut of Datafrom a Glut of Data

Dr. David L. HallDr. David L. HallSchool of Information Sciences and TechnologySchool of Information Sciences and Technology

The Pennsylvania State UniversityThe Pennsylvania State University

Sonya A. H. McMullenSonya A. H. McMullenTeach Reach Inc.Teach Reach Inc.

The Evolving ProblemThe Evolving Problem

Assessing and digesting enormous Assessing and digesting enormous quantities of dataquantities of data

Developing an accurate threat or Developing an accurate threat or situation assessmentsituation assessment

Decision makingDecision making

Assessing the resultAssessing the result

Multitude of Data SourcesMultitude of Data SourcesNational SIGINT AssetsNational SIGINT Assets

Tactical SIGINT Assets:Tactical SIGINT Assets:– AircraftAircraft– Unmanned Aerial Vehicles (UAVs)Unmanned Aerial Vehicles (UAVs)– Ground-based SensorsGround-based Sensors

Nano and Micro-scale sensorsNano and Micro-scale sensors– Smart DustSmart Dust

HUMINTHUMINT

Open Source Open Source

Traditional View of Processing Traditional View of Processing Intelligence DataIntelligence Data

Energy Signals Data State vectors Labels Knowledge

The utility of a data fusion system must be measured by the extent to which it supports effective decision making

Energy Signals Data State vectors Labels Knowledge

The utility of a data fusion system must be measured by the extent to which it supports effective decision making

Glut of Data “Problem”Glut of Data “Problem”Lots of data in multiple forms:Lots of data in multiple forms:– COMINTCOMINT– SIGINTSIGINT– IMINTIMINT– HUMINTHUMINT

Data is not housed in a single Data is not housed in a single repository or data baserepository or data baseNot enough analystsNot enough analystsCollaboration limitationsCollaboration limitations

Proposed SolutionsProposed Solutions

More:More:– Pattern RecognitionPattern Recognition– Machine Learning MethodsMachine Learning Methods– Advanced FilteringAdvanced Filtering– Data MiningData Mining

Proposed Solution FallaciesProposed Solution Fallacies

It is just a matter of “finding the It is just a matter of “finding the needle in the haystack”needle in the haystack”

Knowledge creation involves human Knowledge creation involves human collaborationcollaboration

Knowledge is inherently a human Knowledge is inherently a human productproduct

The approach starts at the “wrong The approach starts at the “wrong end”end”

Strategies for Resolving the Strategies for Resolving the Knowledge Creation ProblemKnowledge Creation Problem

Increase the bandwidth:Increase the bandwidth: Make Make computer displays largercomputer displays larger

Fly-by-wire:Fly-by-wire: Use gaming techniques Use gaming techniques to “physically” access datato “physically” access data

Use multiple senses:Use multiple senses: Vision + Vision + hearing + haptic hearing + haptic

Gaming Concept for DataGaming Concept for Data

Strategies for Resolving the Strategies for Resolving the Knowledge Creation ProblemKnowledge Creation Problem

Conserve analyst attention:Conserve analyst attention: – Deliberate synesthesia – Transforming Deliberate synesthesia – Transforming

data from one sensory domain to data from one sensory domain to another another

– Use other techniques to focus the Use other techniques to focus the attention of the analystattention of the analyst

Leverage the power of languageLeverage the power of language

Focus on negative spaceFocus on negative space

Negative Space ExampleNegative Space Example

Strategies for Resolving the Strategies for Resolving the Knowledge Creation ProblemKnowledge Creation Problem

Create cyber cognitive multipliers:Create cyber cognitive multipliers:– ““The Matrix Approach” – Agents in The Matrix Approach” – Agents in

cyber-space can replicate themselvescyber-space can replicate themselves– Mixed teams of analysts, intelligent Mixed teams of analysts, intelligent

team-based agents, and decision-team-based agents, and decision-makers makers

Adapt to individual users:Adapt to individual users: Interfaces Interfaces that learn the preferences of and that learn the preferences of and “fit” individual users“fit” individual users

““The Matrix Approach”The Matrix Approach”

Summary of Improvement StrategiesSummary of Improvement StrategiesStrategyStrategy Anticipated Anticipated

Improvement Improvement FactorFactor

Increase the bandwidthIncrease the bandwidth 50x50x

Fly-by-wireFly-by-wire 10x10x

Use multiple sensesUse multiple senses 10x10x

Conserve analyst attentionConserve analyst attention 50x50x

Leverage the power of languageLeverage the power of language 100x - 1000x100x - 1000x

Negative spaceNegative space 10x - 100x10x - 100x

Cognitive multipliersCognitive multipliers 20x - 100x20x - 100x

Adapt to individualsAdapt to individuals 10x10x

Total Potential ImprovementTotal Potential Improvement 260x-1430x260x-1430x

RecommendationsRecommendations

Develop two realistic scenarios and Develop two realistic scenarios and associated data sets to test associated data sets to test strategiesstrategies

Conduct some knowledge elicitation Conduct some knowledge elicitation from “real” analystsfrom “real” analysts

RecommendationsRecommendations

Evaluate the effectiveness of Evaluate the effectiveness of implemented tools using the Living implemented tools using the Living Laboratory approachLaboratory approach

Transition selected tools to real Transition selected tools to real environmentsenvironments

Living Laboratory ConceptLiving Laboratory Concept

Ethnography Studies

Design & Development of Support Tools

TransactiveMemory

Visualization Tools Collaboration Aids Cognitive AidsNGA

9/11 Crisis Centers

. . .

KnowledgeEngineering

Concept & Procedural Maps

Living Laboratory Environment

Scenarios to Drive Team Experiments

DTRA

Applications and Knowledge Domains

NGA KRSOC 9/11A unique living laboratoryenvironment provides the

ability to support the design of effective analyst

support tools and to quantify their utility

Ethnography Studies

Design & Development of Support Tools

TransactiveMemory

Visualization Tools Collaboration Aids Cognitive AidsNGA

9/11 Crisis Centers

. . .

KnowledgeEngineering

Concept & Procedural Maps

Living Laboratory Environment

Scenarios to Drive Team Experiments

DTRA

Applications and Knowledge Domains

NGA KRSOC 9/11A unique living laboratoryenvironment provides the

ability to support the design of effective analyst

support tools and to quantify their utility

Questions?Questions?

Crossing the Longest Crossing the Longest Yard.docYard.doc