Introduction
Pseudorandomness
LFSR
Design
Refer to “Handbook of Applied Cryptogra-phy” [Ch 5 & 6]
1
Stream Cipher
Introduction • Originate from one-time pad• bit-by-bit Exor with pt and key stream ◦ (ci = mi zi)• Encryption = Decryption --> Symmetric• Use LFSR (Linear Feedback Shift Register) • (external) Synchronous or self-synchronous
Properties• Faster and Low Complexity in H/W -> Lightweight !• Security measure : Period of key stream, LC(Linear Complexity), Statistical properties• Vast amounts of theoretical knowledge• Proprietary and Confidential for Military
Stream Cipher
2
Def) ◦s=s0,s1,… : infinite seq., ◦sn=s0,s1,…,sn-1: n term of s ◦if si = si+n for all i >=0, s is periodic
seq. having period n.◦run : subsequence of consecutive
‘0’(gap) or consecutive ‘1’(block)
3
Sequence
4
Pseudorandomness
Golomb’s postulates(I)
5
sN : periodic seq. of period N (1) For a cycle of sN, 0~1 balanceness, i.e,
| #{si=1} - #{sj=0} | =<1(2) For a cycle of sN, half the runs have length 1,
1/4 have the length 2, …, etc. (3) Autocorrelation* function is two-valued
11if,
0if,1)- 2()12()(
1
0 NtKtN
sstCN ti
N
ii
* Measuring similarity between original and t-shifted sequences** A sequence satisfying them is called Pseudo-Noise(PN) sequence.
(Ex) s15 = 0,1,1,0,0,1,0,0,0,1,1,1,1,0,1(1) #{0} = 7, #{1}=8 (why ?)(2) 8 runs, 4 runs with length 1 (2 gaps, 2
blocks), 2 runs with length 2 (1 gap, 1 block), 1 run with length 3 (1 gap), 1 run with length 4 (1 block)
(3) Autocorrelation function, C(0)=1, C(t)= -1/15
Thus, PN-seq.
Golomb’s postulates(II)
6
Five Basic Tests◦ Frequency Test (monobit)◦ Serial Test (twobit; Overlapping is allowed)◦ Poker Test (Frequency of m-bit subsequences)◦ Runs Test ◦ Autocorrelation Test
Others◦ Spectral Test◦ Linear Complexity Profile◦ Quadratic Complexity◦ Universal Test
7
Statistical Randomness
Statistical Test by FIPS 140-1
8
For a given 20,000bit sample seq.(I) monobit test : The number of ‘1’=n1, 9,654 < n1 <
10,346(2) poker test : m=4, 1.03 < X3 < 57.4 (3) runs test : for length 1 i 6(4) long run test : no run greater than 34
mnkkn
kX
m
ii
m
,,2 2
1
23 단
9
LFSR
Notation of LFSR
10
Notation: < L, C[D]> where connection polynomial C[D] = 1 + c1D + c2D2 + …+cLDL Z2[D] If cL=1, {i.e., deg{C[D]}=L}, C[D] is called a nonsingular poly-
nomial
If initial vector 0 is [sL-1, … , s1,s0], si ={0,1}, output sequence s= s0,s1, … is uniquely determined by the recursion
sj = (c1s j-1 + c 2 s j-2 + … + c Ls j-L) mod 2 , j L (Ex) <4, 1 + D + D4> , 0 = [0,1,1,0] c1 =1, c4 =1, s4=s3+s0 t D3 D2 D1 D0 t D3 D2 D1 D0 0 0 1 1 0 (6) 8 1 1 1 0 (14) 1 0 0 1 1 (3) 9 1 1 1 1 (15) 2 1 0 0 1 (9) 10 0 1 1 1 (7) 3 0 1 0 0 (4) 11 1 0 1 1 (11) 4 0 0 1 0 (2) 12 0 1 0 1 (5) 5 0 0 0 1 (1) 13 1 0 1 0 (10) 6 1 0 0 0 (8) 14 1 1 0 1 (13) 7 1 1 0 0 (12) 15 0 1 1 0 (6) Output seq. = 0,1,1,0,0,1,0,0,0,1,1,1,1,0,1
10
Stage 1
Stage 0
Output
Stage 3
Stage 2
D3D2 D1 D0
15Clock
The period of the sequence from LFSR divides 2L-1
A irreducible polynomial f(x) in Zp[x] of degree m is called a primitive polynomial if and only if f(x) divides xk-1 for k=2m-1 and for no smaller positive integer k• # of monic primitive poly. of degree m over Zp =(pm-1)/m
where is Euler-phi ft.
If the connection polynomial is primitive, the period is 2L-1
Such sequence is called Maximum-length Shift Regis-ter Seq., M –seq. and LFSR is called m-LFSR.
11
Properties of m-LFSR(I)
Primitive Polynomials
12
Primitive polynomial over Z2: - xm+xk+1(trinomial) for smallest k - xm + xk1+xk2+xk3+1(pentanomial)
m k(k1,k2,k3) m k(k1,k2,k3) m k(k1,k2,k3) m k(k1,k2,k3)
234567891011
111211
6,5,1432
12131415161718192021
7,4,34,3,1
12,11,11
5,3,237
6,5,132
22232425262728293031
15
4,3,13
8,7,18,7,1
32
16,15,13
32333435363738394041
28,27,113
15,14,12
1112,10,2
6,5,14
21,19,23
Well suited for H/W implementation Produce seq. of large period Good statistical properties Readily analyzed by algebraic structure
Breakable by consecutive 2 * L subse-quence is known to attacker ◦ Using Berlekamp-Massey algorithm, from any
(short) subsequences having length at least 2*L, we can find the LFSR with length L
Properties of LFSR
13
(Def) Given an infinite sequence s, the shortest length of LFSR’s that generate s is called Linear Complexity
Using Berlekamp-Massey algorithm, LC is com-puted
(Properties of LC) s,t : binary seq.◦ For any n 1, 0 L(sn) n ◦ LC(sn) =0 iff sn is ‘0’ seq. of length n.◦ LC(sn) =n iff sn=0,0,…,0,1.◦ If s is periodic with period N, LC(sn) N.◦ LC(st) LC(s) + LC(t)
Linear Complexity(I)
14
sn : random seq. from all seq. of length n Expectation value of LC
where B(n)=0 if even n, otherwise 0
For large n, E(L(sn)) n/2 + 2/9 and Var(L(sn)) 86/81
(Def) LCP (Linear Complexity Profile) Denote LN is LC of sN=s0,s1,…sN-1, L1, L2, … LN is LCP
15
Linear Complexity(II)
92
321
18)(4))(( 2
nnBsLE nn n
16
Nonlinear FSR
StageL-1
Stage 1
Stage 0
Sj-1 sj-L+1Sj-L+2 S j-L Sj
Output
f ( s j-1, s j-2, …, s j-L)
f() : nonlinear ft
17
Design
Synchronous Stream Ci-pher(I)
18
f : next state ft, i+1 = f(i , k), 0 : initial value g : keystream generating ft, zi = g (i , k), k : key h : output ft, ci = h (zi, mi) , mi : pt, zi : key stream, ci:ct
f
ii+1
g
h-1
k
mici
zi
Encryption
f
ii+1
g
h
k
mi ci
zi
Decryption
Keystream is independent of pt and ct Properties
◦ Synchronization requirement ◦ No error propagation◦ Active attack
Insertion, deletion or replay will lose synchroniza-tion
Change selected ciphertext digits Need to have integrity check mechanisms
Synchronous Stream Ci-pher(II)
19
i = (ci-t , ci-t+1, …, ci-1), 0 = (c-t, c-t+1, …, c-1) : initial value g : keystream generating ft, zi = g (i , k), k : key h : output ft, ci = h (zi, mi) , mi : pt, zi : keystream, ci : ct
20
Self-Sync. Stream Cipher(I)
g
h
k
mi ci
zi
g
h-1
k
mici
zi
Encryption Decryption
Keystream is independent of pt and ct Properties
◦ Self-Synchronization◦ Limited error propagation◦ Active attack Difficult to detect insertion, deletion, or replay Easy to find passive modification
◦ More diffusion more resistant against attacks based on plaintext redundancy
21
Self-Sync. Stream Cipher(II)
Nonlinear Combiner(I)
22
LFSR 1
LFSR 2
LFSR n
f Keystream, z
Algebraic Normal Form (ANF) : mod. 2 sum of distinct m-th orderproduct of its variable, 0 <= m <= n Ex) f(x1,x2,x3,x4,x5)=1 + x2+ x3 + x4 + x4x5 + x1x2x3x4, deg(f) =4
Nonlinear Combiner(II)
23
Geffe generatorLFSR 1
LFSR 2
LFSR 3
Keystream, z
x1
x2
x3
• f(x1,x2,x3) = x1x2 (1+x2)x3 = x1x2 x2x3 x3
• p(z) : (2L1-1) (2L2-1)(2L3-1) where L1,L2 and L3 are relatively prime• L(z) = L1L2 + L1L3 + L3
• Prob(z(t)=x1(t)) =3/4 Correlation attack is possible !
Summation generator
24
Nonlinear Combiner(III)
z, keystream
LFSR 1
LFSR 2
LFSR n
Carry
x1
x2
xn
If Li and Lj are pairwise relatively prime, thenp(z) = i=1 n (2Li -1) LC p(z) But vulnerable to the correlation attack of carry and 2-adic span
Alternating step generator
25
Clock-controlled generator(I)
LFSR R1
LFSR R2
LFSR R3
Clock z, keystream
R1 : de Brujin seq. of period 2L1
R2,R3 : m-seq s.t., gcd(L2, L3)=1p(z) = 2L1 (2L2-1)(2L3-1)L(z) : (L2 + L3) 2L1-1 < L(z) <= (L2+L3) 2L1
Best known attack is a divide-and-conquer attack on the control register R1 in 2L
L should be about 128 (de Brujin = maximal period)
Shrinking generator
26
Clock-controlled genera-tor(II)
LFSR R1
LFSR R2
Clock
ai
bi ai=1
ai=0
output bi
discard bi
• If gcd(L1, L2) =1, p(z) = (2L2-1) 2L1-1
• L2 2 L1-2 < L(z) < L2 2 L1-1 • Best known attack takes O(2L1L2
3). Li is about 64
Cascade Generator CSPRBG(Cryptographically Secure Pseudo
Random Bit Generator)◦ RSA LSB Generator◦ BBS Generator (p.336)
Pseudo-noise Generator◦ Noise Diode or Noise Transistor
Feedback with Carry Shift Register (FCSR)◦ 2-adic span
A5/1, A5/2, HC-256, RC4, PKZIP, Py, Rab-bit, FISH, SEAL, Salsa20, SOBER, etc.
27
Other generators
28
Correlation Attack
Siegenthaler, 1984◦ The complexity of a Combining Generator depends on the
correlation of the combining function F.◦ Divide-and-Conquer Attack - If the output of F has a correlation with the output of
KSG1, we can find the initial vector of the KSG1
29
Correlation Attack (I)
KSG 1
KSG 2
KSG n
F z
x1
xn
x2
Assume Prob(z=0|xi=0)=1/2-e, e>0 Identify the initial vector of the KSGi by Di-
vide and Conquer
Known ciphertext attack◦ Assume an initial vector of KSGi◦ Generate xi’ from KSGi◦ Compute e’=1/2- Prob(z=0|xi’=0)◦ If the initial vector is correct, we must have e’=e.
If not, we have e0 since x’ has no correlation with z
◦ This attack is very effective. So e must be zero.30
Correlation Attack (II)
KSG 1
KSG 2
KSG n
F z
x1
xn
x2
A balanced function {0,1}n {0,1}m
- every possible output m-tuple is equally likely to occur A k-resilient function f : {0,1}n {0,1}m
- every possible output m-tuple is equally likely to occur when the values of k arbitrary inputs are fixed and the remaining n-k input bits are chosen independently at random.
A 0-resilient function is just a balanced function. A k-resilient function is (k-1)-resilient. E.g.) f(x1,x2)=x1+x2 is 1-resilient.
31
Resilient Functions
To design a multi-output stream cipher based on a combining generator, we need a resilient function which◦ is nonlinear◦ has algebraic degree as large as possible (for large LC)◦ has nonlinearity as large as possible◦ has resiliency as large as possible
32
Multi-output Stream Ci-phers
KSG 1
KSG 2
KSG n
F
Period : Depends on req’d level of security
Linear Complexity◦ shortest LFSR that generates a given seq.
Measure against Correlation Attack◦ Correlation Immune function ◦ Nonlinear function
33
Summary of a Stream Ci-pher