+ All Categories
Home > Documents > Characterizing Global Web Censorship: Why is it so hard?phillipa/AIMS_Slides.pdf · 2013. 2. 7. ·...

Characterizing Global Web Censorship: Why is it so hard?phillipa/AIMS_Slides.pdf · 2013. 2. 7. ·...

Date post: 05-Oct-2020
Category:
Upload: others
View: 1 times
Download: 0 times
Share this document with a friend
39
Characterizing Global Web Censorship: Why is it so hard? Phillipa Gill The Citizen Lab/Stony Brook University Work done in collaboration with: Masashi Crete Nishihata, Jakub Dalek, Sharon Goldberg, Adam Senft and Greg Wiseman Workshop on Active Internet Measurements CAIDA Feb. 8, 2012
Transcript
Page 1: Characterizing Global Web Censorship: Why is it so hard?phillipa/AIMS_Slides.pdf · 2013. 2. 7. · Challenges for censorship research: 1. Variations between technology used by countries

Characterizing Global Web Censorship: Why is it so hard?

Phillipa Gill The Citizen Lab/Stony Brook University

Work done in collaboration with:

Masashi Crete Nishihata, Jakub Dalek, Sharon Goldberg, Adam Senft and Greg Wiseman

Workshop on Active Internet Measurements CAIDA

Feb. 8, 2012

Page 2: Characterizing Global Web Censorship: Why is it so hard?phillipa/AIMS_Slides.pdf · 2013. 2. 7. · Challenges for censorship research: 1. Variations between technology used by countries

Overview Large-scale politically driven Internet outages are well known… • …but what happens within countries is less well understood

We leverage data gathered by an interdisciplinary group (Open Net Initiative) to bootstrap analysis • 77 countries, 286 distinct ISPs, measured from 2007-2012 • Advantages: context about what, when, and where to measure • Disadvantages: dearth of technical data/raw measurements

Our results highlight important challenges for censorship research!

2

Page 3: Characterizing Global Web Censorship: Why is it so hard?phillipa/AIMS_Slides.pdf · 2013. 2. 7. · Challenges for censorship research: 1. Variations between technology used by countries

Background • Where censorship can happen:

3

Page 4: Characterizing Global Web Censorship: Why is it so hard?phillipa/AIMS_Slides.pdf · 2013. 2. 7. · Challenges for censorship research: 1. Variations between technology used by countries

Background • Where censorship can happen:

4

Start

DNS reply?

Page 5: Characterizing Global Web Censorship: Why is it so hard?phillipa/AIMS_Slides.pdf · 2013. 2. 7. · Challenges for censorship research: 1. Variations between technology used by countries

Background • Where censorship can happen:

5

Start

DNS blocking

DNS reply? No

Page 6: Characterizing Global Web Censorship: Why is it so hard?phillipa/AIMS_Slides.pdf · 2013. 2. 7. · Challenges for censorship research: 1. Variations between technology used by countries

Background • Where censorship can happen:

6

Start

DNS blocking

DNS reply? Yes No

DNS redirect?

Page 7: Characterizing Global Web Censorship: Why is it so hard?phillipa/AIMS_Slides.pdf · 2013. 2. 7. · Challenges for censorship research: 1. Variations between technology used by countries

Background • Where censorship can happen:

7

Start

DNS blocking

DNS reply? Yes

Yes

No

No DNS redirect?

Response to SYN?

Page 8: Characterizing Global Web Censorship: Why is it so hard?phillipa/AIMS_Slides.pdf · 2013. 2. 7. · Challenges for censorship research: 1. Variations between technology used by countries

Background • Where censorship can happen:

8

Start

DNS blocking

DNS reply? Yes

Yes

No

No DNS redirect?

IP blocking

No Response to SYN?

Page 9: Characterizing Global Web Censorship: Why is it so hard?phillipa/AIMS_Slides.pdf · 2013. 2. 7. · Challenges for censorship research: 1. Variations between technology used by countries

Background • Where censorship can happen:

9

Start

DNS blocking

DNS reply? Yes

Yes

No

No

Response to HTTP request?

DNS redirect?

IP blocking

No Response to SYN?

Yes

Page 10: Characterizing Global Web Censorship: Why is it so hard?phillipa/AIMS_Slides.pdf · 2013. 2. 7. · Challenges for censorship research: 1. Variations between technology used by countries

Background • Where censorship can happen:

10

Start

DNS blocking

DNS reply?

No HTTP Reply

Yes

Yes

No

No

No Response to

HTTP request?

DNS redirect?

IP blocking

No Response to SYN?

Yes

Page 11: Characterizing Global Web Censorship: Why is it so hard?phillipa/AIMS_Slides.pdf · 2013. 2. 7. · Challenges for censorship research: 1. Variations between technology used by countries

Background • Where censorship can happen:

11

Start

DNS blocking

DNS reply?

No HTTP Reply What was it?

Yes

Yes

Yes

No

No

No Response to

HTTP request?

DNS redirect?

IP blocking

No Response to SYN?

Yes

Page 12: Characterizing Global Web Censorship: Why is it so hard?phillipa/AIMS_Slides.pdf · 2013. 2. 7. · Challenges for censorship research: 1. Variations between technology used by countries

Background • Where censorship can happen:

12

Start

DNS blocking

DNS reply?

No HTTP Reply

RST

Block page

What was it?

Yes

Yes

Yes

No

No

No Response to

HTTP request?

DNS redirect?

IP blocking

No Response to SYN?

Yes

Infinite HTTP Redirect

Page 13: Characterizing Global Web Censorship: Why is it so hard?phillipa/AIMS_Slides.pdf · 2013. 2. 7. · Challenges for censorship research: 1. Variations between technology used by countries

Background • Where censorship can happen:

13

Start

DNS blocking

DNS reply?

No HTTP Reply

RST

Block page

What was it?

Yes

Yes

Yes

No

No

No Response to

HTTP request?

DNS redirect?

IP blocking

No Response to SYN?

Yes

Infinite HTTP Redirect

Page 14: Characterizing Global Web Censorship: Why is it so hard?phillipa/AIMS_Slides.pdf · 2013. 2. 7. · Challenges for censorship research: 1. Variations between technology used by countries

Methodology • Basic idea: Issue requests for a consistent set of sites in the

field and a control location (lab) • Software synchronizes the requests between lab and field • Once both lab and field have completed, results sent back to

the lab for more analysis • What is tested:

– Sites that are likely to trigger censorship – Determined in collaboration with regional groups

• Where are tests run: – Combination of targeted/opportunistic testing – Performed by regional collaborators after informed consent

meeting

14

Page 15: Characterizing Global Web Censorship: Why is it so hard?phillipa/AIMS_Slides.pdf · 2013. 2. 7. · Challenges for censorship research: 1. Variations between technology used by countries

Challenges for censorship research

15

Page 16: Characterizing Global Web Censorship: Why is it so hard?phillipa/AIMS_Slides.pdf · 2013. 2. 7. · Challenges for censorship research: 1. Variations between technology used by countries

0

0.2

0.4

0.6

0.8

1

China Iran UAE Yemen Burma Vietnam

Frac

tion

of b

lock

ing

resu

lts

Country

No DNS Reply DNS Redirection No HTTP ReplyRST Blockpage

1. Variation between countries

16

Page 17: Characterizing Global Web Censorship: Why is it so hard?phillipa/AIMS_Slides.pdf · 2013. 2. 7. · Challenges for censorship research: 1. Variations between technology used by countries

0

0.2

0.4

0.6

0.8

1

China Iran UAE Yemen Burma Vietnam

Frac

tion

of b

lock

ing

resu

lts

Country

No DNS Reply DNS Redirection No HTTP ReplyRST Blockpage

1. Variation between countries

17

Page 18: Characterizing Global Web Censorship: Why is it so hard?phillipa/AIMS_Slides.pdf · 2013. 2. 7. · Challenges for censorship research: 1. Variations between technology used by countries

0

0.2

0.4

0.6

0.8

1

China Iran UAE Yemen Burma Vietnam

Frac

tion

of b

lock

ing

resu

lts

Country

No DNS Reply DNS Redirection No HTTP ReplyRST Blockpage

1. Variation between countries

18

Page 19: Characterizing Global Web Censorship: Why is it so hard?phillipa/AIMS_Slides.pdf · 2013. 2. 7. · Challenges for censorship research: 1. Variations between technology used by countries

0

0.2

0.4

0.6

0.8

1

China Iran UAE Yemen Burma Vietnam

Frac

tion

of b

lock

ing

resu

lts

Country

No DNS Reply DNS Redirection No HTTP ReplyRST Blockpage

1. Variation between countries

19

There is no such thing as a “representative” country

Page 20: Characterizing Global Web Censorship: Why is it so hard?phillipa/AIMS_Slides.pdf · 2013. 2. 7. · Challenges for censorship research: 1. Variations between technology used by countries

2. Variation between ISPs

20

Decentralized blocking in UAE

0

0.05

0.1

0.15

0.2

0.25

2007 2008 2009 2010 2011 2012

Frac

tion

of c

onte

nt b

lock

ed

Year AS 5384 AS 15802

Page 21: Characterizing Global Web Censorship: Why is it so hard?phillipa/AIMS_Slides.pdf · 2013. 2. 7. · Challenges for censorship research: 1. Variations between technology used by countries

2. Variation between ISPs

21

Decentralized blocking in UAE

0

0.05

0.1

0.15

0.2

0.25

2007 2008 2009 2010 2011 2012

Frac

tion

of c

onte

nt b

lock

ed

Year AS 5384 AS 15802

“Du” ISP does not censor prior to April 2008

Page 22: Characterizing Global Web Censorship: Why is it so hard?phillipa/AIMS_Slides.pdf · 2013. 2. 7. · Challenges for censorship research: 1. Variations between technology used by countries

2. Variation between ISPs

22

Decentralized blocking in UAE

0

0.05

0.1

0.15

0.2

0.25

2007 2008 2009 2010 2011 2012

Frac

tion

of c

onte

nt b

lock

ed

Year AS 5384 AS 15802Censorship is a per-ISP property

(when censorship is decentralized)

Page 23: Characterizing Global Web Censorship: Why is it so hard?phillipa/AIMS_Slides.pdf · 2013. 2. 7. · Challenges for censorship research: 1. Variations between technology used by countries

2. Variation between types of networks

23

Page 24: Characterizing Global Web Censorship: Why is it so hard?phillipa/AIMS_Slides.pdf · 2013. 2. 7. · Challenges for censorship research: 1. Variations between technology used by countries

2. Variation between types of networks

24

00.10.20.30.40.50.60.70.80.9

1Ja

ccar

d si

mila

rity

coe

ff.

Country

Academic networks block an average of 40% less!

Page 25: Characterizing Global Web Censorship: Why is it so hard?phillipa/AIMS_Slides.pdf · 2013. 2. 7. · Challenges for censorship research: 1. Variations between technology used by countries

2. Variation between types of networks

25

00.10.20.30.40.50.60.70.80.9

1Ja

ccar

d si

mila

rity

coe

ff.

Country

Academic networks block an average of 40% less!

Academic networks are not representative!

Page 26: Characterizing Global Web Censorship: Why is it so hard?phillipa/AIMS_Slides.pdf · 2013. 2. 7. · Challenges for censorship research: 1. Variations between technology used by countries

3. Sudden temporal shifts in blocking

26

00.05

0.10.15

0.20.25

0.30.35

2009 2010 2011 2012

Frac

tion

of te

sts b

lock

ed

Year

Political Social Internet Conflict

Censorship in Burma over time

Page 27: Characterizing Global Web Censorship: Why is it so hard?phillipa/AIMS_Slides.pdf · 2013. 2. 7. · Challenges for censorship research: 1. Variations between technology used by countries

3. Sudden temporal shifts in blocking

27

00.05

0.10.15

0.20.25

0.30.35

2009 2010 2011 2012

Frac

tion

of te

sts b

lock

ed

Year

Political Social Internet Conflict

Censorship in Burma over time

End of military rule in 2011 brought political reforms.

Page 28: Characterizing Global Web Censorship: Why is it so hard?phillipa/AIMS_Slides.pdf · 2013. 2. 7. · Challenges for censorship research: 1. Variations between technology used by countries

3. Sudden temporal shifts in blocking

28

00.05

0.10.15

0.20.25

0.30.35

2009 2010 2011 2012

Frac

tion

of te

sts b

lock

ed

Year

Political Social Internet Conflict

Censorship in Burma over time

End of military rule in 2011 brought political reforms.

Need to measure over time and correlate with political changes

Page 29: Characterizing Global Web Censorship: Why is it so hard?phillipa/AIMS_Slides.pdf · 2013. 2. 7. · Challenges for censorship research: 1. Variations between technology used by countries

4. Stealthy blocking of certain content

29

00.20.40.60.8

1

Political Social Internet Conflict

Frac

tion

of b

lock

res

ults

Theme No DNS Reply No HTTP Reply RST Blockpage

Censorship of content in Yemen

Page 30: Characterizing Global Web Censorship: Why is it so hard?phillipa/AIMS_Slides.pdf · 2013. 2. 7. · Challenges for censorship research: 1. Variations between technology used by countries

4. Stealthy blocking of certain content

30

00.20.40.60.8

1

Political Social Internet Conflict

Frac

tion

of b

lock

res

ults

Theme No DNS Reply No HTTP Reply RST Blockpage

Censorship of content in Yemen

Transparent blocking of social and Internet content

Page 31: Characterizing Global Web Censorship: Why is it so hard?phillipa/AIMS_Slides.pdf · 2013. 2. 7. · Challenges for censorship research: 1. Variations between technology used by countries

4. Stealthy blocking of certain content

31

00.20.40.60.8

1

Political Social Internet Conflict

Frac

tion

of b

lock

res

ults

Theme No DNS Reply No HTTP Reply RST Blockpage

Censorship of content in Yemen

Transparent blocking of social and Internet content

“Stealthy” blocking of political and conflict related content

Page 32: Characterizing Global Web Censorship: Why is it so hard?phillipa/AIMS_Slides.pdf · 2013. 2. 7. · Challenges for censorship research: 1. Variations between technology used by countries

4. Stealthy blocking of certain content

32

00.20.40.60.8

1

Political Social Internet Conflict

Frac

tion

of b

lock

res

ults

Theme No DNS Reply No HTTP Reply RST Blockpage

Censorship of content in Yemen

Transparent blocking of social and Internet content

“Stealthy” blocking of political and conflict related content

Measurement needs to be robust to distinguish failure from censorship

Page 33: Characterizing Global Web Censorship: Why is it so hard?phillipa/AIMS_Slides.pdf · 2013. 2. 7. · Challenges for censorship research: 1. Variations between technology used by countries

5. The type of content tested matters

33

00.10.20.30.40.5

Frac

tion

bloc

ked

Country

LocalGlobal

Page 34: Characterizing Global Web Censorship: Why is it so hard?phillipa/AIMS_Slides.pdf · 2013. 2. 7. · Challenges for censorship research: 1. Variations between technology used by countries

5. The type of content tested matters

34

00.10.20.30.40.5

Frac

tion

bloc

ked

Country

LocalGlobal

3-5X more blocking of local content in China/Yemen * most blocked content is political

Page 35: Characterizing Global Web Censorship: Why is it so hard?phillipa/AIMS_Slides.pdf · 2013. 2. 7. · Challenges for censorship research: 1. Variations between technology used by countries

5. The type of content tested matters

35

00.10.20.30.40.5

Frac

tion

bloc

ked

Country

LocalGlobal

Less discrepancy in UAE * most blocked content is social

Page 36: Characterizing Global Web Censorship: Why is it so hard?phillipa/AIMS_Slides.pdf · 2013. 2. 7. · Challenges for censorship research: 1. Variations between technology used by countries

5. The type of content tested matters

36

00.10.20.30.40.5

Frac

tion

bloc

ked

Country

LocalGlobal

Need to take an interdisciplinary approach to determine what content to test

Page 37: Characterizing Global Web Censorship: Why is it so hard?phillipa/AIMS_Slides.pdf · 2013. 2. 7. · Challenges for censorship research: 1. Variations between technology used by countries

Challenges for censorship research:

1. Variations between technology used by countries 2. Variations between ISPs and between ISPs and

institutions 3. Sudden temporal shifts in blocking 4. Stealthy blocking of certain content 5. Locally relevant content is more likely to be blocked And more! … maintaining infrastructure across funding cycles/staff turn over … informed consent/preserving user privacy when testing can pose a physical risk!

37

Page 38: Characterizing Global Web Censorship: Why is it so hard?phillipa/AIMS_Slides.pdf · 2013. 2. 7. · Challenges for censorship research: 1. Variations between technology used by countries

What’s next?

More measurements, taking an interdisciplinary approach to tackle the problem: • Rigorous measurements + political context

Data sharing? • Short answer: we’re working on it. • Longer answer: this project has laid the

foundation in terms of unifying the data and removing PII. – Anticipate releasing data in the next ~4 months

38

Page 39: Characterizing Global Web Censorship: Why is it so hard?phillipa/AIMS_Slides.pdf · 2013. 2. 7. · Challenges for censorship research: 1. Variations between technology used by countries

What I hope to get out of this workshop

• Discuss how existing platforms may be used for censorship research

Particularly interested in: – Platforms with visibility into the network edge – DNS/BGP measurements

• Discuss how a large scale, long-term censorship measurement

platform may be built

• Discuss how we might distinguish transient failures/TCP bugs from actual censorship

39


Recommended