+ All Categories
Home > Documents > Cultural Domain Analysis - Analytic Tech · Topics • Overview of CDA – Theory – Data...

Cultural Domain Analysis - Analytic Tech · Topics • Overview of CDA – Theory – Data...

Date post: 25-Sep-2018
Category:
Upload: trancong
View: 215 times
Download: 0 times
Share this document with a friend
73
Cultural Domain Analysis (CDA) Steve Borgatti Boston College ABT Associates 8 January, 2002
Transcript
Page 1: Cultural Domain Analysis - Analytic Tech · Topics • Overview of CDA – Theory – Data collection – Analysis – Applications • Software Demonstration – Anthropac – UCINET/NetDraw

Cultural Domain Analysis(CDA)

Steve BorgattiBoston College

ABT Associates8 January, 2002

Page 2: Cultural Domain Analysis - Analytic Tech · Topics • Overview of CDA – Theory – Data collection – Analysis – Applications • Software Demonstration – Anthropac – UCINET/NetDraw

Topics• Overview of CDA

– Theory– Data collection– Analysis– Applications

• Software Demonstration– Anthropac– UCINET/NetDraw

Page 3: Cultural Domain Analysis - Analytic Tech · Topics • Overview of CDA – Theory – Data collection – Analysis – Applications • Software Demonstration – Anthropac – UCINET/NetDraw

History• Became popular in the 60s

– In part because of availability of Bell Labs Fortran programs

• Linguistic anthropology à cognitive anthropology à marketing research

• Scientific, yet emic– From distinction between phonemic and phonetic– Describing & modeling the native’s point of view

• Models themselves remain in researcher’s world• It is the objective that makes it emic, not the result

– Informant ethnographies is yet another class of work

Page 4: Cultural Domain Analysis - Analytic Tech · Topics • Overview of CDA – Theory – Data collection – Analysis – Applications • Software Demonstration – Anthropac – UCINET/NetDraw

Underlying Notions• Cognition organized around categories

(domains)– Typically named, shared– Examples: illnesses, vegetables, countries

• Categories contain items– Some may be categories themselves

• tree structure

• Items in semantic relations w/ each other– Part/whole, similar to, causes

• Items distinguished by attributes or features– What are the differences that make difference?

Page 5: Cultural Domain Analysis - Analytic Tech · Topics • Overview of CDA – Theory – Data collection – Analysis – Applications • Software Demonstration – Anthropac – UCINET/NetDraw

Componential analysisof horse terms

• Features– Stallion ß horse+male+adult– Mare ß horse+female+adult– Gelding ß horse+neuter+adult|adolescent– Filly ß horse+female+adolescent– Colt ß horse+male|female+child– Foal ß horse+male|female+baby

• Paradigm

foalbabycoltchild

fillyadolescent geldingmarestallionadultneuterfemalemaleHORSE

pigletbaby

shoatchild

giltadolescentbarrow

sowboaradult

neuterfemalemalePIG

Age

Sex

Page 6: Cultural Domain Analysis - Analytic Tech · Topics • Overview of CDA – Theory – Data collection – Analysis – Applications • Software Demonstration – Anthropac – UCINET/NetDraw

Typical CDA Study• Eliciting domain• Eliciting items within a domain• Analyzing structure of the domain

– Semantic relations– Uncovering the meaningful attributes

• Analyzing structure of agreement among respondents

• Prediction– [People react similarly to similar things]

Page 7: Cultural Domain Analysis - Analytic Tech · Topics • Overview of CDA – Theory – Data collection – Analysis – Applications • Software Demonstration – Anthropac – UCINET/NetDraw

Elicitation & Measurement• Domain membership

– Free listing• Measuring Similarities

– Pile sorts, Triads, Direct rating, Map drawing• Attributes

– Eliciting:• Pile sort labeling• Interpreting MDS maps of similarities

– Measurement:• Paired comparisons• Direct rating

Page 8: Cultural Domain Analysis - Analytic Tech · Topics • Overview of CDA – Theory – Data collection – Analysis – Applications • Software Demonstration – Anthropac – UCINET/NetDraw

Analysis Techniques• Multidimensional scaling (MDS)

– Of aggregate similarity data• Cluster analysis

– Of aggregate similarity data• Property Fitting

– Relating attributes to similarity data• Consensus Analysis

– Understanding variations in beliefs

Page 9: Cultural Domain Analysis - Analytic Tech · Topics • Overview of CDA – Theory – Data collection – Analysis – Applications • Software Demonstration – Anthropac – UCINET/NetDraw

Free Listing• Basic idea:

– Tell me all the <category name> you can think of– Typically loosely timed, no questions allowed– An example of Spradley’s “grand tour” question

• Contrasts with survey open-ended question– Open-end is typically about the respondent:

• what do you like about this product? what ice-cream flavors do you like? what illnesses have you had?

– Free list is about the domain: • what ice-cream flavors are there? what illnesses exist?

Page 10: Cultural Domain Analysis - Analytic Tech · Topics • Overview of CDA – Theory – Data collection – Analysis – Applications • Software Demonstration – Anthropac – UCINET/NetDraw

Domain of Fruits

Weller & Romney. 1988. Systematic Data Collection. Sage.

Page 11: Cultural Domain Analysis - Analytic Tech · Topics • Overview of CDA – Theory – Data collection – Analysis – Applications • Software Demonstration – Anthropac – UCINET/NetDraw

Domain of Vegetables

Weller & Romney. 1988. Systematic Data Collection. Sage.

Page 12: Cultural Domain Analysis - Analytic Tech · Topics • Overview of CDA – Theory – Data collection – Analysis – Applications • Software Demonstration – Anthropac – UCINET/NetDraw

The “Bad Words” DomainWARNING:

4-Letter words follow!The squeamish and the moral should go back to work now!

Page 13: Cultural Domain Analysis - Analytic Tech · Topics • Overview of CDA – Theory – Data collection – Analysis – Applications • Software Demonstration – Anthropac – UCINET/NetDraw

Frequencies• Sort in descending order• Tally average position in lists• Combine frequency and position to create

salience measure• May need editing to standardize spelling• In some cases, want to collapse synonyms

– Not in linguistics projects, though

Page 14: Cultural Domain Analysis - Analytic Tech · Topics • Overview of CDA – Theory – Data collection – Analysis – Applications • Software Demonstration – Anthropac – UCINET/NetDraw

Domain borders are fuzzy

0

10

20

30

40

50

60

70

80

90

1 10 19 28 37 46 55 64 73 82 91 100 109 118 127 136 145 154 163 172 181 190 199 208 217 226 235 244 253 262 271 280 289 298 307

Frequencies of each bad word

Page 15: Cultural Domain Analysis - Analytic Tech · Topics • Overview of CDA – Theory – Data collection – Analysis – Applications • Software Demonstration – Anthropac – UCINET/NetDraw

Domains have core/periphery structure

• MDS of item-item co-occurrences

• Each dot is a bad word

• Core items are in the center – in everybody’s list – and co-occur with each other

-2

-1.5

-1

-0.5

0

0.5

1

1.5

2

-2 -1.5 -1 -0.5 0 0.5 1 1.5 2

Page 16: Cultural Domain Analysis - Analytic Tech · Topics • Overview of CDA – Theory – Data collection – Analysis – Applications • Software Demonstration – Anthropac – UCINET/NetDraw

Core items typically mentioned first

Frequency vs Rank

y = -0.0767x + 12.142

R2 = 0.2393

0

5

10

15

20

25

0 10 20 30 40 50 60 70 80 90 100

Frequency of Mention

Ave

rage

Pos

itio

n in

Lis

t

Characteristic negative correlation between avg rank and frequency

Page 17: Cultural Domain Analysis - Analytic Tech · Topics • Overview of CDA – Theory – Data collection – Analysis – Applications • Software Demonstration – Anthropac – UCINET/NetDraw

Use scree plot to select coreFREQUENCY

0

10

20

30

40

50

60

70

80

90

SHITFUCK

ASSHOLE BITCHDAMN

DICK

PUSSY HELL

BASTARD

MOTHERFUCKERASS

CUNT

WHORE

SON OF A BITC

HSLU

T

SHITHEAD

FAGGOT

BULLSHIT COCK

COCKSUCKER PISS

NIGGERPRICK

GODDAMN

DICKHEAD

GOD DAMN CUM CLIT

Page 18: Cultural Domain Analysis - Analytic Tech · Topics • Overview of CDA – Theory – Data collection – Analysis – Applications • Software Demonstration – Anthropac – UCINET/NetDraw

Can analyze respondents as well• Length of lists• Conventionality of their lists (do they tend to

list more popular items)• Correlation between rank (position on list) and

sample frequency• Similarities (overlaps) in people’s lists

Page 19: Cultural Domain Analysis - Analytic Tech · Topics • Overview of CDA – Theory – Data collection – Analysis – Applications • Software Demonstration – Anthropac – UCINET/NetDraw

Things to notice …• Boundaries of a domain are fuzzy

– Not just artifact of aggregation– For additional data collection, need inclusion rules

• Simple, established cultural domains have – Core/periphery structure– Core items recalled first– Consensus among respondents:

• Each list has core items + idiosyncratic• We don’t see clusters

• Quantitative analysis of qualitative data

Page 20: Cultural Domain Analysis - Analytic Tech · Topics • Overview of CDA – Theory – Data collection – Analysis – Applications • Software Demonstration – Anthropac – UCINET/NetDraw

Animals Domain• Please grab a piece of paper and something to

write with• When I say ‘go’, please write down all the

animals you can think of. You will have two minutes

Page 21: Cultural Domain Analysis - Analytic Tech · Topics • Overview of CDA – Theory – Data collection – Analysis – Applications • Software Demonstration – Anthropac – UCINET/NetDraw

Things to notice …• Ordering of items encodes …

– sub-category membership– Semantic relations such as similarity (lions &

tigers) complementarity (forks & knives)• Can reproduce map of domain from free lists

Page 22: Cultural Domain Analysis - Analytic Tech · Topics • Overview of CDA – Theory – Data collection – Analysis – Applications • Software Demonstration – Anthropac – UCINET/NetDraw

Causes of Breast Cancer

Page 23: Cultural Domain Analysis - Analytic Tech · Topics • Overview of CDA – Theory – Data collection – Analysis – Applications • Software Demonstration – Anthropac – UCINET/NetDraw

-1.67

-1.32

-0.97

-0.61

-0.26

0.10

0.45

0.81

1.16

1.51

1.87

-1.67 -0.97 -0.26 0.45 1.16 1.87

BLOWS

PROBPRODMILK

IMPLANTSWILDLIFE

FONDLING

SMOKING

NEVERBREASTFEED

LACKHYGIENE

FAMILYHISTORY

ABORTIONS

ILLEGALDRUGS

DIRTYWORK

CHEMICALSINFOOD

BIRTHCONTROL

BREAST-FEEDING

LACKMEDICALATTN

ALCOHOL

NOCHILDREN

POLLUTION

FATDIET

LARGEBREASTSCAFFEINE

RADIATIONDIETJUSTHAPPENS

FIBROCYSTIC

OBESITYHORMONESUPPS

LATECHILDREN

CANCERHISTORYAGE

ETHNICITYEARLYMENSES

SALVADOR

MEXICAN

CHICANAS ANGLO

PHYSICIANS

Correspondence analysis of factor-by-group crosstab

Page 24: Cultural Domain Analysis - Analytic Tech · Topics • Overview of CDA – Theory – Data collection – Analysis – Applications • Software Demonstration – Anthropac – UCINET/NetDraw

Things to notice …• Comparative analysis is particularly powerful• Correspondence analysis

– is clearly quantitative• Singular value decomposition of frequency matrix

adjusted for row and column marginals– So we have quantitative analysis of qualitative data– On the other hand, the result is a picture – what

can be more qualitative than that?

Page 25: Cultural Domain Analysis - Analytic Tech · Topics • Overview of CDA – Theory – Data collection – Analysis – Applications • Software Demonstration – Anthropac – UCINET/NetDraw

Uses of Free List• First step in mapping the domain

– i.e., getting a list of items to work with• Analysis of the list itself

– What makes something a fruit? A bad word?– Comparing salience of items for different groups– Examining similarities among respondents

• Who lists the same items– Examining similarities among items

• Which items tend to mentioned by the same respondents?

• Obtaining native terminology

Page 26: Cultural Domain Analysis - Analytic Tech · Topics • Overview of CDA – Theory – Data collection – Analysis – Applications • Software Demonstration – Anthropac – UCINET/NetDraw

Pile Sort Technique• Basic idea:

– On each of these cards is written the name of a thing. Please sort the cards into piles according to how similar they are. You can use as many or as few piles as you like.

• Outcome is quantitative measure of similarity among all pairs of items– For each pair of items, count the proportion of

respondents who put them in the same pile• Respondents only asked for non-quantitative

judgments

Page 27: Cultural Domain Analysis - Analytic Tech · Topics • Overview of CDA – Theory – Data collection – Analysis – Applications • Software Demonstration – Anthropac – UCINET/NetDraw

Aggregate Proximity Matrix• Item by item matrix gives the percent of

respondents placing the two items in the same pile

• Typically visualize with MDS and cluster analysis

Page 28: Cultural Domain Analysis - Analytic Tech · Topics • Overview of CDA – Theory – Data collection – Analysis – Applications • Software Demonstration – Anthropac – UCINET/NetDraw

Triads• Basic idea:

– Present items to respondent 3 at a time, and ask which is most different

• To elicit attributes– ask why they chose as they did, then try other triples

• To measure similarity– Systematically present all possible triples*– Each time an item is chosen most different it is a vote for

the similarity of the other two– Arrange as an aggregate similarity matrix

dogsealshark

* Or use clever balanced incomplete block design

Page 29: Cultural Domain Analysis - Analytic Tech · Topics • Overview of CDA – Theory – Data collection – Analysis – Applications • Software Demonstration – Anthropac – UCINET/NetDraw

BIBDs• Number of triples rises fast as items increase

– n(n-1)(n-2)/6– For 30 items, have 4,060 triads to fill out …

• Each pair of items occurs n-2 times. – Let lambda stand for number of occurrences

• Balanced incomplete block design has each pair occurring same number of times, but lambda < n-2– Lambda-1 design: each pair occurs just once

Page 30: Cultural Domain Analysis - Analytic Tech · Topics • Overview of CDA – Theory – Data collection – Analysis – Applications • Software Demonstration – Anthropac – UCINET/NetDraw

Representing Proximities• Multidimensional scaling (MDS)

– Maps items to points in Euclidean space such that points corresponding to more similar items are placed nearer to each other in the space

• Cluster analysis• Network analysis techniques

Page 31: Cultural Domain Analysis - Analytic Tech · Topics • Overview of CDA – Theory – Data collection – Analysis – Applications • Software Demonstration – Anthropac – UCINET/NetDraw

MDS of animals domain

FROGSALAMANDER

FLAMINGO

WOODTHRUSH

TURKEYROBIN

BEAVER

RACCOONRABBIT

MOUSE

DOLPHIN

COYOTEDEER

MOOSEELK

BEAR

WHALE

LION

SNAKESTARFISH

HYENALEOPARDGORILLA

FOX

BABOONELEPHANT

KANGAROOANTELOPE

SQUIRRELGROUNDHOG

-Strong clustering indicates subdomains

Stress = 0.12

Page 32: Cultural Domain Analysis - Analytic Tech · Topics • Overview of CDA – Theory – Data collection – Analysis – Applications • Software Demonstration – Anthropac – UCINET/NetDraw

MDS of land animals only

Page 33: Cultural Domain Analysis - Analytic Tech · Topics • Overview of CDA – Theory – Data collection – Analysis – Applications • Software Demonstration – Anthropac – UCINET/NetDraw

Fruits & Vegetables

Page 34: Cultural Domain Analysis - Analytic Tech · Topics • Overview of CDA – Theory – Data collection – Analysis – Applications • Software Demonstration – Anthropac – UCINET/NetDraw

Things people are scared of

BEING_ALONE

BUGS

CHANGE

COMMITMENT

DEATH

DENTISTS

DISEASE

DOCTORS

DOGS

DROWNING

ENCLOSED_SPACE

FAILURE

FINANCIAL_TROUBLE

FIRE

FLYING

GHOSTS

GROWING_OLDGUNS

HEIGHTS

LIGHTNING

LOSING_A_LOVED_ONE

PUBLIC_SPEAKING

RAPERATS

SCARY_MOVIES

SHARKS SICKNESSSNAKESSPIDERS

TESTS

THE_DARK

THUNDER

WATER

Page 35: Cultural Domain Analysis - Analytic Tech · Topics • Overview of CDA – Theory – Data collection – Analysis – Applications • Software Demonstration – Anthropac – UCINET/NetDraw

Things to notice …• Can use MDS with any proximity matrix

– Aggregate similarities, Direct ratings, Confusion matrices, Correlation matrices, etc.

• Typically use 1-3 dimensions (mostly 2)• Measure of fit (stress)• Simplifies complex data• Interpretation centers on

– Looking for dimensions (quantitative item attributes)

– Looking for clusters (qualitative item attributes)

Page 36: Cultural Domain Analysis - Analytic Tech · Topics • Overview of CDA – Theory – Data collection – Analysis – Applications • Software Demonstration – Anthropac – UCINET/NetDraw

Holidays• Demo of Visual Anthropac pre-release version

Page 37: Cultural Domain Analysis - Analytic Tech · Topics • Overview of CDA – Theory – Data collection – Analysis – Applications • Software Demonstration – Anthropac – UCINET/NetDraw

Network analysis• Crimes dataset• Animals• Holidays

Page 38: Cultural Domain Analysis - Analytic Tech · Topics • Overview of CDA – Theory – Data collection – Analysis – Applications • Software Demonstration – Anthropac – UCINET/NetDraw

Things people are scared of

BEING_ALONE

BUGS

CHANGECOMMITMENT

DEATH

DENTISTS

DISEASE

DOCTORSDOGS

DROWNING

ENCLOSED_SPACE

FAILUREFINANCIAL_TROUBLE

FIRE

FLYING

GHOSTS

GROWING_OLD

GUNS

HEIGHTS

LIGHTNING

LOSING_A_LOVED_ONE

PUBLIC_SPEAKING

RAPE

RATS

SCARY_MOVIES

SHARKS

SICKNESS

SNAKES

SPIDERS

TESTSTHE_DARK

THUNDER

WATER

Female respondents

Page 39: Cultural Domain Analysis - Analytic Tech · Topics • Overview of CDA – Theory – Data collection – Analysis – Applications • Software Demonstration – Anthropac – UCINET/NetDraw

Things people are scared of

BEING_ALONE

BUGS

CHANGE

COMMITMENT

DEATH

DENTISTS

DISEASE

DOCTORS

DOGS

DROWNING

ENCLOSED_SPACE

FAILURE

FINANCIAL_TROUBLE

FIRE

FLYING

GHOSTS

GROWING_OLDGUNS

HEIGHTS

LIGHTNING

LOSING_A_LOVED_ONE

PUBLIC_SPEAKING

RAPERATS

SCARY_MOVIES

SHARKS SICKNESSSNAKESSPIDERS

TESTS

THE_DARK

THUNDER

WATER

Male respondents

Page 40: Cultural Domain Analysis - Analytic Tech · Topics • Overview of CDA – Theory – Data collection – Analysis – Applications • Software Demonstration – Anthropac – UCINET/NetDraw

Discrepancy Analysis

Romney, Moore, Batchelder and Hsia. 2002. Statistical methods … PNAS 97(1): 518-523

* English• Japanese

Page 41: Cultural Domain Analysis - Analytic Tech · Topics • Overview of CDA – Theory – Data collection – Analysis – Applications • Software Demonstration – Anthropac – UCINET/NetDraw

MDS of similarities in respondents’ sorts

* English• Japanese

Page 42: Cultural Domain Analysis - Analytic Tech · Topics • Overview of CDA – Theory – Data collection – Analysis – Applications • Software Demonstration – Anthropac – UCINET/NetDraw

Emotion Terms

Notintense

IntenseBad

Good

Page 43: Cultural Domain Analysis - Analytic Tech · Topics • Overview of CDA – Theory – Data collection – Analysis – Applications • Software Demonstration – Anthropac – UCINET/NetDraw

Crimes

-1.16

-0.87

-0.58

-0.29

0.00

0.29

0.58

0.87

1.17

1.46

1.75

-1.16 -0.58 0.00 0.58 1.17 1.75

VANDALISM

ATTMURDER

PUBENDANGER

AUTOTHEFT

CRIMES-WEAPON

ROBBERY

BATTERY

DUI

PROSTITUTIONTORTURE

WHITECOLLAR

THEFT

NECROPHILIA

DOMESTICAB

SPEEDINGRAPEMURDER

CHILDABUSE

DRUGOFFENSES

ASSAULT

BURGLARY

ARSON

ENVIRONMENTAL

AGAINSTPERSONVIOLENT

KIDNAPPING

LARCENY

HOMEINVASION

MANSLAUGHTER

TERRORISM

people

thingsVictims,serious

Victimlessnon-serious

Page 44: Cultural Domain Analysis - Analytic Tech · Topics • Overview of CDA – Theory – Data collection – Analysis – Applications • Software Demonstration – Anthropac – UCINET/NetDraw

Occupations

Page 45: Cultural Domain Analysis - Analytic Tech · Topics • Overview of CDA – Theory – Data collection – Analysis – Applications • Software Demonstration – Anthropac – UCINET/NetDraw

Property Fitting (PROFIT)• Testing hypotheses about dimensions in mds

maps– Were respondents influenced by this dimension

when they did the pile sorts or triads?• Ask sample of respondents to rate each item

on this dimension• Aggregate across all respondents• Regress average score on map coordinates

– Prestige = b1*X_coordinate + b2*Y_coordinate• Calculate vector angles from regression coefs

Page 46: Cultural Domain Analysis - Analytic Tech · Topics • Overview of CDA – Theory – Data collection – Analysis – Applications • Software Demonstration – Anthropac – UCINET/NetDraw

PersonalityTraits

Page 47: Cultural Domain Analysis - Analytic Tech · Topics • Overview of CDA – Theory – Data collection – Analysis – Applications • Software Demonstration – Anthropac – UCINET/NetDraw

PROFIT• The cases in the regression are items• The dependent variable is the average rating of each

item on the hypothesized attribute• Look for significant r-square > 0.80• If r-square is low, then we can discredit an attribute

as being a factor in people’s judgments• If r-square is high, then they may have been using

this attribute (or a highly correlated one) in their thinking

• Can also use un-averaged ratings: a different rating vector for each respondent

Page 48: Cultural Domain Analysis - Analytic Tech · Topics • Overview of CDA – Theory – Data collection – Analysis – Applications • Software Demonstration – Anthropac – UCINET/NetDraw

Contagiousness (US)

Page 49: Cultural Domain Analysis - Analytic Tech · Topics • Overview of CDA – Theory – Data collection – Analysis – Applications • Software Demonstration – Anthropac – UCINET/NetDraw

Severity (US)

Page 50: Cultural Domain Analysis - Analytic Tech · Topics • Overview of CDA – Theory – Data collection – Analysis – Applications • Software Demonstration – Anthropac – UCINET/NetDraw

Contagion (Guatemala)

Page 51: Cultural Domain Analysis - Analytic Tech · Topics • Overview of CDA – Theory – Data collection – Analysis – Applications • Software Demonstration – Anthropac – UCINET/NetDraw

Severity (Guatemala)

Page 52: Cultural Domain Analysis - Analytic Tech · Topics • Overview of CDA – Theory – Data collection – Analysis – Applications • Software Demonstration – Anthropac – UCINET/NetDraw

Age of the Infirm (Guatemala)

Page 53: Cultural Domain Analysis - Analytic Tech · Topics • Overview of CDA – Theory – Data collection – Analysis – Applications • Software Demonstration – Anthropac – UCINET/NetDraw

Hot-Cold (Guatemala)

Page 54: Cultural Domain Analysis - Analytic Tech · Topics • Overview of CDA – Theory – Data collection – Analysis – Applications • Software Demonstration – Anthropac – UCINET/NetDraw

Consensus Analysis• Is it ok to aggregate across respondents?

– Only if they belong to same culture – averaging systematically different sets of answers just gets mush

– Similar to interpreting average of a bi-modal univariate distribution

• Can we tell which respondents know what they are talking about (or have conventional views) and which don’t (are out in left field)?

• Consensus theory of Romney, Weller & Batchelder can help

Page 55: Cultural Domain Analysis - Analytic Tech · Topics • Overview of CDA – Theory – Data collection – Analysis – Applications • Software Demonstration – Anthropac – UCINET/NetDraw

Response model

Qk KnowAnswer?

Yes:writeit down

No:guess

Rightanswer

Rightanswer

Wronganswer

di

1-di

1/L

1-1/L

Ld

dmcorrectProb iii

)1()(

−+==

Knowledge:Proportion of Domain that Person I knows

L = # of choicesIn multiple choicequestion.

Page 56: Cultural Domain Analysis - Analytic Tech · Topics • Overview of CDA – Theory – Data collection – Analysis – Applications • Software Demonstration – Anthropac – UCINET/NetDraw

Prob of agreement, mij

Case Probability

1. Both know answer didj

2. I knows and J guesses right di(1-dj)/L

3. J knows and I guesses right dj(1-di)/L

4. Neither knows, both guess the same

(1-di)(1-dj)/L

(between respondents I and J)

Page 57: Cultural Domain Analysis - Analytic Tech · Topics • Overview of CDA – Theory – Data collection – Analysis – Applications • Software Demonstration – Anthropac – UCINET/NetDraw

Neither Knows, Guess Same

1 2 … L

1 ( 1 / L ) 2 1 / L

2 ( 1/L) 2 1 / L

… ( 1/L) 2 1 / L

L ( 1 / L ) 2 1 / L

1 / L 1 / L 1 / L 1 / L 1

(1/L)2 + (1/L)2 + ... = L(1/L)2 = 1/L

Person J

Pers

on I

Page 58: Cultural Domain Analysis - Analytic Tech · Topics • Overview of CDA – Theory – Data collection – Analysis – Applications • Software Demonstration – Anthropac – UCINET/NetDraw

Pairwise agreement mij

• Agreement mij is sum of four cases:

mij = didj + di(1-dj)/L + dj(1-di)/L + (1-di)(1-dj)/Lmij = didj + (1-didj)/L

• Or rearrange terms:

(Lmij-1)/(L-1) = didj

• Agreement between respondents is a multiplicative function of knowledge level of each

Page 59: Cultural Domain Analysis - Analytic Tech · Topics • Overview of CDA – Theory – Data collection – Analysis – Applications • Software Demonstration – Anthropac – UCINET/NetDraw

Factor Analysis

• Left side of (Lmij-1)/(L-1) = didj is just obsagreement adjusted by constants. If we let m*ij= (Lmij-1)/(L-1) then we can write more simply:m*ij = didj

• We solve for d’s by factor analyzing M*– Spearman’s fundamental equation of factor analysis

rij = fifj• Corr between two variables is a function of the extent each

is correlated with the latent factor

observed unknown

Page 60: Cultural Domain Analysis - Analytic Tech · Topics • Overview of CDA – Theory – Data collection – Analysis – Applications • Software Demonstration – Anthropac – UCINET/NetDraw

We can figure out how much people know without

having an answer key !!!!!!!!!!!!

Page 61: Cultural Domain Analysis - Analytic Tech · Topics • Overview of CDA – Theory – Data collection – Analysis – Applications • Software Demonstration – Anthropac – UCINET/NetDraw

Inferring knowledge

• Factoring the observed agreement matrix M* solves for the unknown values di– The d values given by the factor loadings

• The d values are the amount of knowledge each person has– Literally, the correlation of the person’s responses with the

unknown answer key• So factoring the agreement matrix gets us exact

estimates of the amount of knowledge each person has– And no answer key is needed!!! – Exactly what we were looking for

Page 62: Cultural Domain Analysis - Analytic Tech · Topics • Overview of CDA – Theory – Data collection – Analysis – Applications • Software Demonstration – Anthropac – UCINET/NetDraw

What’s the catch??• The response model must be right

• Can characterize this model as follows

QjKnowAnswer?

Yes:writeit down

No:guess

Rightanswer

Rightanswer

Wronganswer

di

1-di

1/L

1-1/L

Page 63: Cultural Domain Analysis - Analytic Tech · Topics • Overview of CDA – Theory – Data collection – Analysis – Applications • Software Demonstration – Anthropac – UCINET/NetDraw

Three conditions• Common Truth

– each question has exactly one right answer, applicable to entire sample of respondents

• Sample drawn from one pop w/ same answer key

• Local Independence– resp-item response variables xij are independent,

conditional on the truth• One Domain

– All questions drawn from same domain, i.e.:• can model knowledge w/ one parameter, di

Page 64: Cultural Domain Analysis - Analytic Tech · Topics • Overview of CDA – Theory – Data collection – Analysis – Applications • Software Demonstration – Anthropac – UCINET/NetDraw

Bullseye Model• Two people agree to the extent that each is

correlated with the truth– Truth is culturally correct answer key

• Each member of culture is aiming at same answer key– but missing to varying degrees in idiosyncratic ways

• Different org cultures havedifferent targets

Answer keyfor culture 1

Answer keyfor culture 2

Page 65: Cultural Domain Analysis - Analytic Tech · Topics • Overview of CDA – Theory – Data collection – Analysis – Applications • Software Demonstration – Anthropac – UCINET/NetDraw

Expected Agreement Pattern

-1.94

-1.55

-1.16

-0.78

-0.39

0.00

0.38

0.77

1.16

1.54

1.93

-1.94 -1.16 -0.39 0.38 1.16 1.93

79%58%42%

32%26%

26%

21%

21%

16%

16%

16%

16%11%

11%

11%

11%

5%

5%

5%

5%

Page 66: Cultural Domain Analysis - Analytic Tech · Topics • Overview of CDA – Theory – Data collection – Analysis – Applications • Software Demonstration – Anthropac – UCINET/NetDraw

Partitioning variability• Model identifies two sources of variability in

responses (beliefs)– Cultural: multiple answer keys– Individual: variation in knowledge

• Within each culture, we still expect (and can measure), variability due to differential access to information, ability, etc.

Page 67: Cultural Domain Analysis - Analytic Tech · Topics • Overview of CDA – Theory – Data collection – Analysis – Applications • Software Demonstration – Anthropac – UCINET/NetDraw

Test of consensus model• Undergraduate class with 92 students• Multiple choice final exam with 50 questions• Instructor’s answer key provides gold

standard to compare against • Each student asked to guess test score of all

acquaintances, including self

Page 68: Cultural Domain Analysis - Analytic Tech · Topics • Overview of CDA – Theory – Data collection – Analysis – Applications • Software Demonstration – Anthropac – UCINET/NetDraw

Measures• Self-report model

– Each person’s estimate of their own score• Network model

– for each person, use average estimate of their scores (persons with fewer than 5 acquaintances were excluded)

• All acquaintances• Only friends

• Consensus model– Factor loadings of minimum residual factor analysis of student-by-

student agreement matrix• Gold standard

– % correct based on instructor’s answer key

Page 69: Cultural Domain Analysis - Analytic Tech · Topics • Overview of CDA – Theory – Data collection – Analysis – Applications • Software Demonstration – Anthropac – UCINET/NetDraw

Factor Analysis of Agreements

• Results consistent w/ single answer key– therefore we can use loadings to estimate

knowledge

1003.11.7023

1.06596.93.31.8132

28.30893.693.651.3231

RatioCum %PercentEigenvalFactor

Page 70: Cultural Domain Analysis - Analytic Tech · Topics • Overview of CDA – Theory – Data collection – Analysis – Applications • Software Demonstration – Anthropac – UCINET/NetDraw

48%

65%

68%71%

56%

71%

45%

77%

65%

68%

68%74%

71%

71%

71%

77%

53%

77%

71%

83%

68%

88%

88%

51%

80%

77%

71%

80%

85%

71%

68%

62%

68%

85%

62%77%65%

59%

65%

51%

48%

80%77%

59%

68%

56%

83%

62%

62%

77%

74%

71%

91%83%83%

80%

62%

94%

56%

71%

59%

83%

77%

80%

74%

74%68%

68%

48%

56%

62%

85%

74%

59%

71%

48%

68%

88%

74%

77%

62%

74%

77%

62%

59%

62%

74%

74%71%

77%

68%

MDS of Respondent Agreement

Page 71: Cultural Domain Analysis - Analytic Tech · Topics • Overview of CDA – Theory – Data collection – Analysis – Applications • Software Demonstration – Anthropac – UCINET/NetDraw

Correlations

1.0000.4000.3420.4710.947Consen

1.0000.8910.5560.398Friends

1.0000.5640.334Acquaint

1.0000.479Self

1.000Gold

ConsenFriendsAcquaintSelfGold

• Consensus estimates virtually identical to gold standard (r = 0.947)

• Self-report better than network model

Page 72: Cultural Domain Analysis - Analytic Tech · Topics • Overview of CDA – Theory – Data collection – Analysis – Applications • Software Demonstration – Anthropac – UCINET/NetDraw

Running Consensus

Page 73: Cultural Domain Analysis - Analytic Tech · Topics • Overview of CDA – Theory – Data collection – Analysis – Applications • Software Demonstration – Anthropac – UCINET/NetDraw

Summary• CDA is about mapping structure of emic

domains• Data collection relies on text statements or

simple categorical judgments– Listing terms– Piling, choosing most different, choosing greater

of two items• Analysis uses sophisticated computational

techniques but mostly delivers pictures


Recommended