Date post: | 21-Dec-2015 |
Category: |
Documents |
View: | 214 times |
Download: | 0 times |
ECE 15B Computer OrganizationSpring 2010
Dmitri Strukov
Lecture 6: Logic/Shift Instructions
Partially adapted from Computer Organization and Design, 4th edition, Patterson and Hennessy, and classes taught by Ryan Kastner at UCSB and Marry Jane Irwin from PSU
ECE 15B Spring 2010
Bitwise operations
• Up until now, instructions have been:– Arithmetic (e.g. add, sub, addi)– Memory access (e.g. lw, sw)– Branches and jumps (e.g. j, beq, bne)
• All these instructions view contents of registers as a single quantity (such as signed or unsigned integer)
• New Perspective – View contents of register as 32 individual bits rather than as a
single 32 bit number – Introduce two new classes of instructions:
• Logical operations• Shift Instructions
Logical Operations• Instructions for bitwise manipulation
Operation C Java MIPS
Shift left << << sll
Shift right >> >>> srl
Bitwise AND & & and, andi
Bitwise OR | | or, ori
Bitwise NOT ~ ~ nor
Useful for extracting and inserting groups of bits in a word
ECE 15B Spring 2010
AND Operations• Useful to mask bits in a word
– Select some bits, clear others to 0
and $t0, $t1, $t2
0000 0000 0000 0000 0000 1101 1100 0000
0000 0000 0000 0000 0011 1100 0000 0000
$t2
$t1
0000 0000 0000 0000 0000 1100 0000 0000$t0
ECE 15B Spring 2010
OR Operations• Useful to include bits in a word
– Set some bits to 1, leave others unchanged
or $t0, $t1, $t2
0000 0000 0000 0000 0000 1101 1100 0000
0000 0000 0000 0000 0011 1100 0000 0000
$t2
$t1
0000 0000 0000 0000 0011 1101 1100 0000$t0
ECE 15B Spring 2010
NOT Operations• Useful to invert bits in a word
– Change 0 to 1, and 1 to 0• MIPS has NOR 3-operand instruction
– a NOR b == NOT ( a OR b )
nor $t0, $t1, $zero
0000 0000 0000 0000 0011 1100 0000 0000$t1
1111 1111 1111 1111 1100 0011 1111 1111$t0
Register 0: always read as zero
ECE 15B Spring 2010
Shift Operations
• shamt: how many positions to shift • Shift left logical (SLL)
– Shift left and fill with 0 bits– sll by i bits multiplies by 2i
• Shift right logical (SRL)– Shift right and fill with 0 bits– srl by i bits divides by 2i (unsigned only)
• Shift right arithmetic (SRA)– Shift right and fill emptied bits by sign extending
• Note that shamt (immediate value) is only 5 bits
op rs rt rd shamt funct6 bits 6 bits5 bits 5 bits 5 bits 5 bits
ECE 15B Spring 2010
ECE 15B Spring 2010
Uses for Shift Instructions
• Very convenient operation to extract group of bits (e.g. one byte) within a word (e.g. think of operations on 8-bit pixels or 8-bit characters)
• For example, suppose we want to get bits 8 to 15 from $t0. The code below will do the job
sll $t0, $t0, 16srl $t0, $t0, 24
ECE 15B Spring 2010
Uses for Shift Instructions
• Since shifting is faster than multiplication, a good compiler (or a good programmer for that matter) usually notices when C code multiplies by a power of 2 and compiles it to a shift instruction
For example:a = a*8; (in C)
would compile to: sll $s0, $s0, 3 (in MIPS)
Integer Addition• Example: 7 + 6
Overflow if result out of range Adding +ve and –ve operands, no overflow Adding two +ve operands
Overflow if result sign is 1 Adding two –ve operands
Overflow if result sign is 0
ECE 15B Spring 2010
Integer Subtraction
• Add negation of second operand• Example: 7 – 6 = 7 + (–6)
+7: 0000 0000 … 0000 0111–6: 1111 1111 … 1111 1010+1: 0000 0000 … 0000 0001
• Overflow if result out of range– Subtracting two +ve or two –ve operands, no overflow– Subtracting +ve from –ve operand
• Overflow if result sign is 0
– Subtracting –ve from +ve operand• Overflow if result sign is 1
ECE 15B Spring 2010
Dealing with Overflow
• Some languages (e.g., C) ignore overflow• MIPS solution is 2 kinds of arithmetic instructions to
recognize 2 choices– add (add), add immediate (addi) and subtract (sub) cause
overflow to be detected assuming operation on two’s compliment
– add unsigned (addu), add immediate unsigned (addiu) and subtract unsigned (subu) do not cause overflow detection
• MIPS signals overflow with an exception (aka interrupt) – an unscheduled procedure call where the EPC contains the address of the instruction that caused the exception
ECE 15B Spring 2010
Multiply• Binary multiplication is just a bunch of right shifts and adds
multiplicandmultiplier
partialproductarray
double precision product
n
2n
ncan be formed in parallel and added in parallel for faster multiplication
ECE 15B Spring 2010
Multiplication• Start with long-multiplication approach
1000× 1001 1000 0000 0000 1000 1001000
Length of product is the sum of operand lengths
multiplicand
multiplier
product
ECE 15B Spring 2010
Optimized Multiplier Hardware: Example
multiplicand
32-bit ALU
multiplier Control
addshiftright
product
0 1 1 0 = 6
0 0 0 0 0 1 0 1 = 5add 0 1 1 0 0 1 0 1
0 0 1 1 0 0 1 0add 0 0 1 1 0 0 1 0
0 0 0 1 1 0 0 1 add 0 1 1 1 1 0 0 1
0 0 0 1 1 1 1 0add 0 0 1 1 1 1 0 0
0 0 1 1 1 1 0 0
= 30ECE 15B Spring 2010
Faster Multiplier• Uses multiple adders
– Cost/performance tradeoff
Can be pipelined Several multiplication performed in parallel
ECE 15B Spring 2010
MIPS Multiplication
• Two 32-bit registers for product– HI: most-significant 32 bits– LO: least-significant 32-bits
• Instructions– mult rs, rt / multu rs, rt
• 64-bit product in HI/LO
– mfhi rd / mflo rd• Move from HI/LO to rd• Can test HI value to see if product overflows 32 bits
– mul rd, rs, rt• Least-significant 32 bits of product –> rd
ECE 15B Spring 2010
Division• Division is just a bunch of quotient digit guesses and left
shifts and subtractsdividend = quotient x divisor + remainder
dividenddivisor
partialremainderarray
quotientnn
remaindern
0 0 0
0
0
0
ECE 15B Spring 2010
Division• Check for 0 divisor• Long division approach
– If divisor ≤ dividend bits• 1 bit in quotient, subtract
– Otherwise• 0 bit in quotient, bring down next
dividend bit
• Restoring division– Do the subtract, and if remainder goes <
0, add divisor back• Signed division
– Divide using absolute values– Adjust sign of quotient and remainder as
required
10011000 1001010 -1000 10 101 1010 -1000 10
n-bit operands yield n-bitquotient and remainder
quotient
dividend
remainder
divisor
ECE 15B Spring 2010
Optimized Division Hardware divisor
32-bit ALU
quotient Control
subtractshiftleft
dividend
remainder
0 0 1 0 = 2
0 0 0 0 0 1 1 0 = 60 0 0 0 1 1 0 0
sub 1 1 1 0 1 1 0 0 rem neg, so ‘ient bit = 00 0 0 0 1 1 0 0 restore remainder0 0 0 1 1 0 0 0
sub 1 1 1 1 1 1 0 0 rem neg, so ‘ient bit = 00 0 0 1 1 0 0 0 restore remainder0 0 1 1 0 0 0 0
sub 0 0 0 1 0 0 0 1 rem pos, so ‘ient bit = 10 0 1 0 0 0 1 0
sub 0 0 0 0 0 0 1 1 rem pos, so ‘ient bit = 1= 3 with 0 remainder
ECE 15B Spring 2010
Faster Division
• Can’t use parallel hardware as in multiplier– Subtraction is conditional on sign of remainder
• Faster dividers (e.g. SRT devision) generate multiple quotient bits per step– Still require multiple steps
ECE 15B Spring 2010
MIPS Division
• Use HI/LO registers for result– HI: 32-bit remainder– LO: 32-bit quotient
• Instructions– div rs, rt / divu rs, rt– No overflow or divide-by-0 checking
• Software must perform checks if required
– Use mfhi, mflo to access result
ECE 15B Spring 2010
ECE 15B Spring 2010
Review
• Instructions so far:add, addi, sub, addu, addiu, subu mult, div, mfhi, mflo, multu, divulw, sw, lb, lbu, lh, lhu, sb, shwbeq, bne, j, slt, slti, sltu, sltuiand, andi, or, ori, xor, xori, sll, srl, sra
• Registers so farC variables: $s0 - $s7Temporary variables: $t0 - $t9Zero: $zero