+ All Categories
Home > Documents > CS 245Notes 081 CS 245: Database System Principles Notes 08: Failure Recovery Hector Garcia-Molina.

CS 245Notes 081 CS 245: Database System Principles Notes 08: Failure Recovery Hector Garcia-Molina.

Date post: 21-Dec-2015
Category:
View: 217 times
Download: 0 times
Share this document with a friend
85
CS 245 Notes 08 1 CS 245: Database System Principles Notes 08: Failure Recovery Hector Garcia-Molina
Transcript
Page 1: CS 245Notes 081 CS 245: Database System Principles Notes 08: Failure Recovery Hector Garcia-Molina.

CS 245 Notes 08 1

CS 245: Database System Principles

Notes 08: Failure Recovery

Hector Garcia-Molina

Page 2: CS 245Notes 081 CS 245: Database System Principles Notes 08: Failure Recovery Hector Garcia-Molina.

CS 245 Notes 08 2

PART II

• Crash recovery (2 lectures) Ch.17[17]• Concurrency control (3 lectures)

Ch.18[18]• Transaction processing (2 lects)

Ch.19[19]• Information integration (1 lect)

Ch.20[21,22]

Page 3: CS 245Notes 081 CS 245: Database System Principles Notes 08: Failure Recovery Hector Garcia-Molina.

CS 245 Notes 08 3

Integrity or correctness of data

• Would like data to be “accurate” or“correct” at all times

EMP Name

WhiteGreenGray

Age

523421

1

Page 4: CS 245Notes 081 CS 245: Database System Principles Notes 08: Failure Recovery Hector Garcia-Molina.

CS 245 Notes 08 4

Integrity or consistency constraints• Predicates data must satisfy• Examples:

- x is key of relation R- x y holds in R- Domain(x) = {Red, Blue, Green}is valid index for attribute x of R- no employee should make more than

twice the average salary

Page 5: CS 245Notes 081 CS 245: Database System Principles Notes 08: Failure Recovery Hector Garcia-Molina.

CS 245 Notes 08 5

Definition:

• Consistent state: satisfies all constraints

• Consistent DB: DB in consistent state

Page 6: CS 245Notes 081 CS 245: Database System Principles Notes 08: Failure Recovery Hector Garcia-Molina.

CS 245 Notes 08 6

Constraints (as we use here) may not capture “full

correctness”Example 1 Transaction constraints• When salary is updated,

new salary > old salary• When account record is deleted,

balance = 0

Page 7: CS 245Notes 081 CS 245: Database System Principles Notes 08: Failure Recovery Hector Garcia-Molina.

CS 245 Notes 08 7

Note: could be “emulated” by simpleconstraints, e.g.,

accountAcct # …. balancedeleted?

Page 8: CS 245Notes 081 CS 245: Database System Principles Notes 08: Failure Recovery Hector Garcia-Molina.

CS 245 Notes 08 8

Example 2 Database should reflect real world

DBReality

Constraints (as we use here) may not capture “full

correctness”

Page 9: CS 245Notes 081 CS 245: Database System Principles Notes 08: Failure Recovery Hector Garcia-Molina.

CS 245 Notes 08 9

in any case, continue with constraints...Observation: DB cannot be consistent

always!Example: a1 + a2 +…. an = TOT (constraint)

Deposit $100 in a2: a2 a2 + 100TOT TOT + 100

Page 10: CS 245Notes 081 CS 245: Database System Principles Notes 08: Failure Recovery Hector Garcia-Molina.

CS 245 Notes 08 10

a2

TOT

.

.50

.

.1000

.

.150

.

.1000

.

.150

.

.1100

Example: a1 + a2 +…. an = TOT (constraint)Deposit $100 in a2: a2 a2 + 100

TOT TOT + 100

Page 11: CS 245Notes 081 CS 245: Database System Principles Notes 08: Failure Recovery Hector Garcia-Molina.

CS 245 Notes 08 11

Transaction: collection of actions

that preserve consistency

Consistent DB Consistent DB’T

Page 12: CS 245Notes 081 CS 245: Database System Principles Notes 08: Failure Recovery Hector Garcia-Molina.

CS 245 Notes 08 12

Big assumption:

If T starts with consistent state + T executes in isolation

T leaves consistent state

Page 13: CS 245Notes 081 CS 245: Database System Principles Notes 08: Failure Recovery Hector Garcia-Molina.

CS 245 Notes 08 13

Correctness (informally)

• If we stop running transactions,DB left consistent

• Each transaction sees a consistent DB

Page 14: CS 245Notes 081 CS 245: Database System Principles Notes 08: Failure Recovery Hector Garcia-Molina.

CS 245 Notes 08 14

How can constraints be violated?

• Transaction bug• DBMS bug• Hardware failure

e.g., disk crash alters balance of account

• Data sharinge.g.: T1: give 10% raise to programmers

T2: change programmers systems analysts

Page 15: CS 245Notes 081 CS 245: Database System Principles Notes 08: Failure Recovery Hector Garcia-Molina.

CS 245 Notes 08 15

How can we prevent/fix violations?

• Chapter 8[17]: due to failures only• Chapter 9[18]: due to data sharing

only• Chapter 10[19]: due to failures and

sharing

Page 16: CS 245Notes 081 CS 245: Database System Principles Notes 08: Failure Recovery Hector Garcia-Molina.

CS 245 Notes 08 16

Will not consider:

• How to write correct transactions• How to write correct DBMS• Constraint checking & repair

That is, solutions studied here do not need

to know constraints

Page 17: CS 245Notes 081 CS 245: Database System Principles Notes 08: Failure Recovery Hector Garcia-Molina.

CS 245 Notes 08 17

Chapter 8[17]: Recovery

• First order of business:Failure Model

Page 18: CS 245Notes 081 CS 245: Database System Principles Notes 08: Failure Recovery Hector Garcia-Molina.

CS 245 Notes 08 18

Events Desired Undesired Expected

Unexpected

Page 19: CS 245Notes 081 CS 245: Database System Principles Notes 08: Failure Recovery Hector Garcia-Molina.

CS 245 Notes 08 19

Our failure model

processor

memory disk

CPU

M D

Page 20: CS 245Notes 081 CS 245: Database System Principles Notes 08: Failure Recovery Hector Garcia-Molina.

CS 245 Notes 08 20

Desired events: see product manuals….

Undesired expected events:System crash

- memory lost- cpu halts, resets

Page 21: CS 245Notes 081 CS 245: Database System Principles Notes 08: Failure Recovery Hector Garcia-Molina.

CS 245 Notes 08 21

Desired events: see product manuals….

Undesired expected events:System crash

- memory lost- cpu halts, resets

Undesired Unexpected: Everything else!

that’s it!!

Page 22: CS 245Notes 081 CS 245: Database System Principles Notes 08: Failure Recovery Hector Garcia-Molina.

CS 245 Notes 08 22

Examples:• Disk data is lost• Memory lost without CPU halt• CPU implodes wiping out

universe….

Undesired Unexpected: Everything else!

Page 23: CS 245Notes 081 CS 245: Database System Principles Notes 08: Failure Recovery Hector Garcia-Molina.

CS 245 Notes 08 23

Is this model reasonable?

Approach: Add low level checks + redundancy to increase

probability model holds

E.g., Replicate disk storage (stable store)

Memory parity CPU checks

Page 24: CS 245Notes 081 CS 245: Database System Principles Notes 08: Failure Recovery Hector Garcia-Molina.

CS 245 Notes 08 24

Second order of business:

Storage hierarchy

Memory Disk

x x

Page 25: CS 245Notes 081 CS 245: Database System Principles Notes 08: Failure Recovery Hector Garcia-Molina.

CS 245 Notes 08 25

Operations:

• Input (x): block containing x memory• Output (x): block containing x disk

Page 26: CS 245Notes 081 CS 245: Database System Principles Notes 08: Failure Recovery Hector Garcia-Molina.

CS 245 Notes 08 26

Operations:

• Input (x): block containing x memory• Output (x): block containing x disk

• Read (x,t): do input(x) if necessary t value of x in

block• Write (x,t): do input(x) if necessary

value of x in block t

Page 27: CS 245Notes 081 CS 245: Database System Principles Notes 08: Failure Recovery Hector Garcia-Molina.

CS 245 Notes 08 27

Key problem Unfinished transaction

Example Constraint: A=B T1: A A 2 B B 2

Page 28: CS 245Notes 081 CS 245: Database System Principles Notes 08: Failure Recovery Hector Garcia-Molina.

CS 245 Notes 08 28

T1: Read (A,t); t t2Write (A,t);Read (B,t); t t2Write (B,t);Output (A);Output (B);

A: 8B: 8

A: 8B: 8

memory disk

Page 29: CS 245Notes 081 CS 245: Database System Principles Notes 08: Failure Recovery Hector Garcia-Molina.

CS 245 Notes 08 29

T1: Read (A,t); t t2Write (A,t);Read (B,t); t t2Write (B,t);Output (A);Output (B);

A: 8B: 8

A: 8B: 8

memory disk

1616

Page 30: CS 245Notes 081 CS 245: Database System Principles Notes 08: Failure Recovery Hector Garcia-Molina.

CS 245 Notes 08 30

T1: Read (A,t); t t2Write (A,t);Read (B,t); t t2Write (B,t);Output (A);Output (B);

A: 8B: 8

A: 8B: 8

memory disk

1616

16

failure!

Page 31: CS 245Notes 081 CS 245: Database System Principles Notes 08: Failure Recovery Hector Garcia-Molina.

CS 245 Notes 08 31

• Need atomicity: execute all actions of a transaction or none at all

Page 32: CS 245Notes 081 CS 245: Database System Principles Notes 08: Failure Recovery Hector Garcia-Molina.

CS 245 Notes 08 32

One solution: undo logging (immediate

modification)

due to: Hansel and Gretel, 1812 AD

Page 33: CS 245Notes 081 CS 245: Database System Principles Notes 08: Failure Recovery Hector Garcia-Molina.

CS 245 Notes 08 33

One solution: undo logging (immediate

modification)

due to: Hansel and Gretel, 1812 AD

• Improved in 1813 AD to durable undo logging

Page 34: CS 245Notes 081 CS 245: Database System Principles Notes 08: Failure Recovery Hector Garcia-Molina.

CS 245 Notes 08 34

T1: Read (A,t); t t2 A=B

Write (A,t);Read (B,t); t t2Write (B,t);Output (A);Output (B);

A:8B:8

A:8B:8

memory disk log

Undo logging (Immediate modification)

Page 35: CS 245Notes 081 CS 245: Database System Principles Notes 08: Failure Recovery Hector Garcia-Molina.

CS 245 Notes 08 35

T1: Read (A,t); t t2 A=B

Write (A,t);Read (B,t); t t2Write (B,t);Output (A);Output (B);

A:8B:8

A:8B:8

memory disk log

Undo logging (Immediate modification)

1616

<T1, start><T1, A, 8>

Page 36: CS 245Notes 081 CS 245: Database System Principles Notes 08: Failure Recovery Hector Garcia-Molina.

CS 245 Notes 08 36

T1: Read (A,t); t t2 A=B

Write (A,t);Read (B,t); t t2Write (B,t);Output (A);Output (B);

A:8B:8

A:8B:8

memory disk log

Undo logging (Immediate modification)

1616

<T1, start><T1, A, 8>

16 <T1, B, 8>

Page 37: CS 245Notes 081 CS 245: Database System Principles Notes 08: Failure Recovery Hector Garcia-Molina.

CS 245 Notes 08 37

T1: Read (A,t); t t2 A=B

Write (A,t);Read (B,t); t t2Write (B,t);Output (A);Output (B);

A:8B:8

A:8B:8

memory disk log

Undo logging (Immediate modification)

1616

<T1, start><T1, A, 8>

16 <T1, B, 8>

16

Page 38: CS 245Notes 081 CS 245: Database System Principles Notes 08: Failure Recovery Hector Garcia-Molina.

CS 245 Notes 08 38

T1: Read (A,t); t t2 A=B

Write (A,t);Read (B,t); t t2Write (B,t);Output (A);Output (B);

A:8B:8

A:8B:8

memory disk log

Undo logging (Immediate modification)

1616

<T1, start><T1, A, 8>

<T1, commit>16 <T1, B, 8>

16

Page 39: CS 245Notes 081 CS 245: Database System Principles Notes 08: Failure Recovery Hector Garcia-Molina.

CS 245 Notes 08 39

One “complication”

• Log is first written in memory• Not written to disk on every action

memory

DB

Log

A: 8 16B: 8 16Log:<T1,start><T1, A, 8><T1, B, 8>

A: 8B: 8

Page 40: CS 245Notes 081 CS 245: Database System Principles Notes 08: Failure Recovery Hector Garcia-Molina.

CS 245 Notes 08 40

One “complication”

• Log is first written in memory• Not written to disk on every action

memory

DB

Log

A: 8 16B: 8 16Log:<T1,start><T1, A, 8><T1, B, 8>

A: 8B: 8

16BAD STATE

# 1

Page 41: CS 245Notes 081 CS 245: Database System Principles Notes 08: Failure Recovery Hector Garcia-Molina.

CS 245 Notes 08 41

One “complication”

• Log is first written in memory• Not written to disk on every action

memory

DB

Log

A: 8 16B: 8 16Log:<T1,start><T1, A, 8><T1, B, 8><T1, commit>

A: 8B: 8

16BAD STATE

# 2

<T1, B, 8><T1, commit>

...

Page 42: CS 245Notes 081 CS 245: Database System Principles Notes 08: Failure Recovery Hector Garcia-Molina.

CS 245 Notes 08 42

Undo logging rules

(1) For every action generate undo logrecord (containing old value)

(2) Before x is modified on disk, logrecords pertaining to x must beon disk (write ahead logging: WAL)

(3) Before commit is flushed to log, allwrites of transaction must bereflected on disk

Page 43: CS 245Notes 081 CS 245: Database System Principles Notes 08: Failure Recovery Hector Garcia-Molina.

CS 245 Notes 08 43

Recovery rules: Undo logging• For every Ti with <Ti, start> in log:

- If <Ti,commit> or <Ti,abort> in log, do nothing

- Else For all <Ti, X, v> in log:write (X, v)output (X )

Write <Ti, abort> to log

Page 44: CS 245Notes 081 CS 245: Database System Principles Notes 08: Failure Recovery Hector Garcia-Molina.

CS 245 Notes 08 44

Recovery rules: Undo logging• For every Ti with <Ti, start> in log:

- If <Ti,commit> or <Ti,abort> in log, do nothing

- Else For all <Ti, X, v> in log:write (X, v)output (X )

Write <Ti, abort> to log

IS THIS CORRECT??

Page 45: CS 245Notes 081 CS 245: Database System Principles Notes 08: Failure Recovery Hector Garcia-Molina.

CS 245 Notes 08 45

Recovery rules: Undo logging

(1) Let S = set of transactions with<Ti, start> in log, but no<Ti, commit> (or <Ti, abort>) record

in log(2) For each <Ti, X, v> in log,

in reverse order (latest earliest) do:

- if Ti S then - write (X, v)

- output (X)

(3) For each Ti S do

- write <Ti, abort> to log

Page 46: CS 245Notes 081 CS 245: Database System Principles Notes 08: Failure Recovery Hector Garcia-Molina.

CS 245 Notes 08 46

Question• Can writes of <Ti, abort> records

be done in any order (in Step 3)?– Example: T1 and T2 both write A– T1 executed before T2– T1 and T2 both rolled-back– <T1, abort> written but NOT <T2,

abort>?– <T2, abort> written but NOT <T1,

abort>?

T1 write A T2 write Atime/log

Page 47: CS 245Notes 081 CS 245: Database System Principles Notes 08: Failure Recovery Hector Garcia-Molina.

CS 245 Notes 08 47

What if failure during recovery?No problem! Undo idempotent

Page 48: CS 245Notes 081 CS 245: Database System Principles Notes 08: Failure Recovery Hector Garcia-Molina.

CS 245 Notes 08 48

To discuss:

• Redo logging• Undo/redo logging, why both?• Real world actions• Checkpoints• Media failures

Page 49: CS 245Notes 081 CS 245: Database System Principles Notes 08: Failure Recovery Hector Garcia-Molina.

Redo Logging

CS 245 Notes 08 49

First send Gretel up with no rope,then Hansel goes up safely with rope!

Page 50: CS 245Notes 081 CS 245: Database System Principles Notes 08: Failure Recovery Hector Garcia-Molina.

CS 245 Notes 08 50

Redo logging (deferred modification)

T1: Read(A,t); t t2; write (A,t); Read(B,t); t t2; write (B,t);

Output(A); Output(B)

A: 8B: 8

A: 8B: 8

memory DB

LOG

Page 51: CS 245Notes 081 CS 245: Database System Principles Notes 08: Failure Recovery Hector Garcia-Molina.

CS 245 Notes 08 51

Redo logging (deferred modification)

T1: Read(A,t); t t2; write (A,t); Read(B,t); t t2; write (B,t);

Output(A); Output(B)

A: 8B: 8

A: 8B: 8

memory DB

LOG

1616

<T1, start><T1, A, 16><T1, B, 16>

<T1, commit>

Page 52: CS 245Notes 081 CS 245: Database System Principles Notes 08: Failure Recovery Hector Garcia-Molina.

CS 245 Notes 08 52

Redo logging (deferred modification)

T1: Read(A,t); t t2; write (A,t); Read(B,t); t t2; write (B,t);

Output(A); Output(B)

A: 8B: 8

A: 8B: 8

memory DB

LOG

1616

<T1, start><T1, A, 16><T1, B, 16>

<T1, commit>

output1616

Page 53: CS 245Notes 081 CS 245: Database System Principles Notes 08: Failure Recovery Hector Garcia-Molina.

CS 245 Notes 08 53

Redo logging (deferred modification)

T1: Read(A,t); t t2; write (A,t); Read(B,t); t t2; write (B,t);

Output(A); Output(B)

A: 8B: 8

A: 8B: 8

memory DB

LOG

1616

<T1, start><T1, A, 16><T1, B, 16>

<T1, commit><T1, end>

output1616

Page 54: CS 245Notes 081 CS 245: Database System Principles Notes 08: Failure Recovery Hector Garcia-Molina.

CS 245 Notes 08 54

Redo logging rules(1) For every action, generate redo log

record (containing new value)(2) Before X is modified on disk (DB),

all log records for transaction thatmodified X (including

commit) must be on disk

(3) Flush log at commit(4) Write END record after DB updates

flushed to disk

Page 55: CS 245Notes 081 CS 245: Database System Principles Notes 08: Failure Recovery Hector Garcia-Molina.

CS 245 Notes 08 55

• For every Ti with <Ti, commit> in log:– For all <Ti, X, v> in log:

Write(X, v)Output(X)

Recovery rules: Redo logging

Page 56: CS 245Notes 081 CS 245: Database System Principles Notes 08: Failure Recovery Hector Garcia-Molina.

CS 245 Notes 08 56

• For every Ti with <Ti, commit> in log:– For all <Ti, X, v> in log:

Write(X, v)Output(X)

Recovery rules: Redo logging

IS THIS CORRECT??

Page 57: CS 245Notes 081 CS 245: Database System Principles Notes 08: Failure Recovery Hector Garcia-Molina.

CS 245 Notes 08 57

(1) Let S = set of transactions with<Ti, commit> (and no <Ti, end>) in log

(2) For each <Ti, X, v> in log, in forward order (earliest latest) do:

- if Ti S then Write(X, v) Output(X)

(3) For each Ti S, write <Ti, end>

Recovery rules: Redo logging

Page 58: CS 245Notes 081 CS 245: Database System Principles Notes 08: Failure Recovery Hector Garcia-Molina.

CS 245 Notes 08 58

Combining <Ti, end> Records

• Want to delay DB flushes for hot objects

Say X is branch balance:T1: ... update X...T2: ... update X...T3: ... update X...T4: ... update X...

Actions:write Xoutput Xwrite Xoutput Xwrite Xoutput Xwrite Xoutput X

Page 59: CS 245Notes 081 CS 245: Database System Principles Notes 08: Failure Recovery Hector Garcia-Molina.

CS 245 Notes 08 59

Combining <Ti, end> Records

• Want to delay DB flushes for hot objects

Say X is branch balance:T1: ... update X...T2: ... update X...T3: ... update X...T4: ... update X...

Actions:write Xoutput Xwrite Xoutput Xwrite Xoutput Xwrite Xoutput X

combined <end> (checkpoint)

Page 60: CS 245Notes 081 CS 245: Database System Principles Notes 08: Failure Recovery Hector Garcia-Molina.

CS 245 Notes 08 60

Solution: Checkpoint

Periodically:(1) Do not accept new transactions(2) Wait until all transactions finish(3) Flush all log records to disk (log)(4) Flush all buffers to disk (DB) (do not discard buffers)

(5) Write “checkpoint” record on disk (log)

(6) Resume transaction processing

• no <ti, end> actions>•simple checkpoint

Page 61: CS 245Notes 081 CS 245: Database System Principles Notes 08: Failure Recovery Hector Garcia-Molina.

CS 245 Notes 08 61

Example: what to do at recovery?

Redo log (disk):

<T1

,A,1

6>

<T1

,com

mit

>

Ch

eck

poin

t

<T2

,B,1

7>

<T2

,com

mit

>

<T3

,C,2

1>

Crash... ... ... ...

...

...

Page 62: CS 245Notes 081 CS 245: Database System Principles Notes 08: Failure Recovery Hector Garcia-Molina.

CS 245 Notes 08 62

Key drawbacks:

• Undo logging: cannot bring backup DBcopies up to date

• Redo logging: need to keep all modified blocks inmemory

until commit

Page 63: CS 245Notes 081 CS 245: Database System Principles Notes 08: Failure Recovery Hector Garcia-Molina.

CS 245 Notes 08 63

Solution: undo/redo logging!

Update <Ti, Xid, New X val, Old X val>page X

Page 64: CS 245Notes 081 CS 245: Database System Principles Notes 08: Failure Recovery Hector Garcia-Molina.

CS 245 Notes 08 64

Rules

• Page X can be flushed before orafter Ti commit

• Log record flushed before corresponding updated page (WAL)

• Flush at commit (log only)

Page 65: CS 245Notes 081 CS 245: Database System Principles Notes 08: Failure Recovery Hector Garcia-Molina.

CS 245 Notes 08 65

Example: Undo/Redo logging what to do at recovery?

log (disk):

<ch

eck

poin

t>

<T1

, A

, 1

0,

15

>

<T1

, B

, 2

0,

23

>

<T1

, co

mm

it>

<T2

, C

, 3

0,

38

>

<T2

, D

, 4

0,

41

>

Crash... ... ... ...

...

...

Page 66: CS 245Notes 081 CS 245: Database System Principles Notes 08: Failure Recovery Hector Garcia-Molina.

CS 245 Notes 08 66

Non-quiesce checkpoint

LOG

for undo dirty buffer

pool pagesflushed

Start-ckptactive TR:

Ti,T2,...

endckpt

.........

...

Page 67: CS 245Notes 081 CS 245: Database System Principles Notes 08: Failure Recovery Hector Garcia-Molina.

Non-quiesce checkpoint

CS 245 Notes 08 67

memory

checkpoint process:for i := 1 to M do output(buffer i)

[transactions run concurrently]

Page 68: CS 245Notes 081 CS 245: Database System Principles Notes 08: Failure Recovery Hector Garcia-Molina.

CS 245 Notes 08 68

Examples what to do at recovery time?

no T1 commit

LOG

T1,-a

...CkptT1

...Ckptend

...T1-b

...

Page 69: CS 245Notes 081 CS 245: Database System Principles Notes 08: Failure Recovery Hector Garcia-Molina.

CS 245 Notes 08 69

Examples what to do at recovery time?

no T1 commit

LOG

T1,-a

...CkptT1

...Ckptend

...T1-b

...

Undo T1 (undo a,b)

Page 70: CS 245Notes 081 CS 245: Database System Principles Notes 08: Failure Recovery Hector Garcia-Molina.

CS 245 Notes 08 70

Example

LOG

...T1

a... ...

T1

b... ...

T1

c...

T1

cmt...

ckpt-end

ckpt-sT1

Page 71: CS 245Notes 081 CS 245: Database System Principles Notes 08: Failure Recovery Hector Garcia-Molina.

CS 245 Notes 08 71

Example

LOG

...T1

a... ...

T1

b... ...

T1

c...

T1

cmt...

ckpt-end

ckpt-sT1

Redo T1: (redo b,c)

Page 72: CS 245Notes 081 CS 245: Database System Principles Notes 08: Failure Recovery Hector Garcia-Molina.

CS 245 Notes 08 72

Recover From Valid Checkpoint:

... ckptstart ... ...

T1

b... ...

T1

c...ckpt-

startckptend

LOG

startof latestvalidcheckpoint

Page 73: CS 245Notes 081 CS 245: Database System Principles Notes 08: Failure Recovery Hector Garcia-Molina.

CS 245 Notes 08 73

Recovery process:• Backwards pass (end of log latest valid checkpoint

start)

– construct set S of committed transactions– undo actions of transactions not in S

• Undo pending transactions– follow undo chains for transactions in

(checkpoint active list) - S

• Forward pass (latest checkpoint start end of log)

– redo actions of S transactions

backward pass

forward passstart

check-point

Page 74: CS 245Notes 081 CS 245: Database System Principles Notes 08: Failure Recovery Hector Garcia-Molina.

CS 245 Notes 08 74

Real world actions

E.g., dispense cash at ATMTi = a1 a2 …... aj …... an

$

Page 75: CS 245Notes 081 CS 245: Database System Principles Notes 08: Failure Recovery Hector Garcia-Molina.

CS 245 Notes 08 75

Solution

(1) execute real-world actions after commit

(2) try to make idempotent

Page 76: CS 245Notes 081 CS 245: Database System Principles Notes 08: Failure Recovery Hector Garcia-Molina.

CS 245 Notes 08 76

ATMGive$$(amt, Tid, time)

$

give(amt)

lastTid:

time:

Page 77: CS 245Notes 081 CS 245: Database System Principles Notes 08: Failure Recovery Hector Garcia-Molina.

CS 245 Notes 08 77

Media failure (loss of non-volatile storage)

A: 16

Page 78: CS 245Notes 081 CS 245: Database System Principles Notes 08: Failure Recovery Hector Garcia-Molina.

CS 245 Notes 08 78

Media failure (loss of non-volatile storage)

A: 16

Solution: Make copies of data!

Page 79: CS 245Notes 081 CS 245: Database System Principles Notes 08: Failure Recovery Hector Garcia-Molina.

CS 245 Notes 08 79

Example 1 Triple modular redundancy

• Keep 3 copies on separate disks• Output(X) --> three outputs• Input(X) --> three inputs + vote

X1 X2 X3

Page 80: CS 245Notes 081 CS 245: Database System Principles Notes 08: Failure Recovery Hector Garcia-Molina.

CS 245 Notes 08 80

Example #2 Redundant writes, Single reads

• Keep N copies on separate disks• Output(X) --> N outputs• Input(X) --> Input one copy

- if ok, done- else try

another one Assumes bad data can be detected

Page 81: CS 245Notes 081 CS 245: Database System Principles Notes 08: Failure Recovery Hector Garcia-Molina.

CS 245 Notes 08 81

Example #3: DB Dump + Log

backupdatabase

activedatabase

log

• If active database is lost,– restore active database from backup– bring up-to-date using redo entries in log

Page 82: CS 245Notes 081 CS 245: Database System Principles Notes 08: Failure Recovery Hector Garcia-Molina.

Backup Database• Just like checkpoint,

except that we write full database

CS 245 Notes 08 82

database

create backup database:for i := 1 to DB_Size do [read DB block i; write to backup]

[transactions run concurrently]

Page 83: CS 245Notes 081 CS 245: Database System Principles Notes 08: Failure Recovery Hector Garcia-Molina.

Backup Database• Just like checkpoint,

except that we write full database

CS 245 Notes 08 83

database

create backup database:for i := 1 to DB_Size do [read DB block i; write to backup]

[transactions run concurrently]

• Restore from backup DB and log:Similar to recovery from checkpoint and log

Page 84: CS 245Notes 081 CS 245: Database System Principles Notes 08: Failure Recovery Hector Garcia-Molina.

CS 245 Notes 08 84

When can log be discarded?

check-point

dbdump

lastneededundo

not needed formedia recovery redo

not needed for undoafter system failure

not needed forredo after system failure

log

time

lastneededundo

not needed formedia recovery

Page 85: CS 245Notes 081 CS 245: Database System Principles Notes 08: Failure Recovery Hector Garcia-Molina.

CS 245 Notes 08 85

Summary

• Consistency of data• One source of problems: failures

- Logging- Redundancy

• Another source of problems: Data Sharing..... next


Recommended