Assembly Language for Intel-Based Assembly Language for Intel-Based ComputersComputers
Chapter 4: Data Transfers, Addressing, and Arithmetic
Kip Irvine
Irvine, Kip R. Assembly Language for Intel-Based Computers 5/e, 2007. Web site Examples 2
Direct Memory OperandsDirect Memory Operands
• A direct memory operand is a named reference to storage in memory
.datavar1 BYTE 10h.codemov al,var1 ; AL = 10hmov al,[var1] ; AL = 10h
alternate format
Irvine, Kip R. Assembly Language for Intel-Based Computers 5/e, 2007. Web site Examples 3
MOV InstructionMOV Instruction
.datacount db 100wVal dw 2.code
mov bl,countmov ax,wValmov count,al
mov al,wVal ; errormov ax,count ; errormov eax,count ; error
• Move from source to destination. Syntax:MOV destination,source
• No more than one memory operand permitted• CS, EIP, and IP cannot be the destination• No immediate to segment moves
Irvine, Kip R. Assembly Language for Intel-Based Computers 5/e, 2007. Web site Examples 4
Your turn . . .Your turn . . .
.databVal db 100bVal2 db ?wVal dw 2dVal dd 5.code
mov ds,45mov esi,wValmov eip,dValmov 25,bValmov bVal2,bVal
Explain why each of the following MOV statements are invalid:
immediate move to DS not permittedsize mismatchEIP cannot be the destinationimmediate value cannot be destinationmemory-to-memory move not permitted
Irvine, Kip R. Assembly Language for Intel-Based Computers 5/e, 2007. Web site Examples 5
Zero ExtensionZero Extension
mov bl,10001111bmovzx ax,bl ; zero-extension
When you copy a smaller value into a larger destination, the MOVZX instruction fills (extends) the upper half of the destination with zeros.
The destination must be a register.
Irvine, Kip R. Assembly Language for Intel-Based Computers 5/e, 2007. Web site Examples 6
Sign ExtensionSign Extension
mov bl,10001111bmovsx ax,bl ; sign extension
The MOVSX instruction fills the upper half of the destination with a copy of the source operand's sign bit.
The destination must be a register.
Irvine, Kip R. Assembly Language for Intel-Based Computers 5/e, 2007. Web site Examples 7
XCHG InstructionXCHG Instruction
.datavar1 dw 1000hvar2 dw 2000h.codexchg ax,bx ; exchange 16-bit regsxchg ah,al ; exchange 8-bit regsxchg var1,bx ; exchange mem, regxchg eax,ebx ; exchange 32-bit regs
xchg var1,var2 ; error: two memory operands
XCHG exchanges the values of two operands. At least one operand must be a register. No immediate operands are permitted.
Irvine, Kip R. Assembly Language for Intel-Based Computers 5/e, 2007. Web site Examples 8
Direct-Offset OperandsDirect-Offset Operands
.dataarrayB db 10h,20h,30h,40h.codemov al,arrayB+1 ; AL = 20hmov al,[arrayB+1] ; alternative notation
A constant offset is added to a data label to produce an effective address (EA). The address is dereferenced to get the value inside its memory location.
Irvine, Kip R. Assembly Language for Intel-Based Computers 5/e, 2007. Web site Examples 9
Direct-Offset Operands Direct-Offset Operands (cont)(cont)
.dataarrayW dw 1000h,2000h,3000harrayD dd 1,2,3,4.codemov ax,[arrayW+2] ; AX = 2000hmov ax,[arrayW+4] ; AX = 3000hmov eax,[arrayD+4] ; EAX = 00000002h
A constant offset is added to a data label to produce an effective address (EA). The address is dereferenced to get the value inside its memory location.
; Will the following statements assemble?mov ax,[arrayW-2] ; ??mov eax,[arrayD+16] ; ??
What will happen when they run?
Irvine, Kip R. Assembly Language for Intel-Based Computers 5/e, 2007. Web site Examples 10
Your turn. . .Your turn. . .
Write a program that rearranges the values of three doubleword values in the following array as: 3, 1, 2.
.dataarrayD dd 1,2,3
• Step 2: Exchange EAX with the third array value and copy the value in EAX to the first array position.
• Step1: copy the first value into EAX and exchange it with the value in the second position.
mov eax,arrayDxchg eax,[arrayD+4]
xchg eax,[arrayD+8]mov arrayD,eax
Irvine, Kip R. Assembly Language for Intel-Based Computers 5/e, 2007. Web site Examples 11
Addition and SubtractionAddition and Subtraction
• INC and DEC Instructions• ADD and SUB Instructions• NEG Instruction• Implementing Arithmetic Expressions• Flags Affected by Arithmetic
• Zero• Sign• Carry• Overflow
Irvine, Kip R. Assembly Language for Intel-Based Computers 5/e, 2007. Web site Examples 12
INC and DEC InstructionsINC and DEC Instructions
• Add 1, subtract 1 from destination operand• operand may be register or memory
• INC destination• destination destination + 1
• DEC destination• destination destination – 1
Irvine, Kip R. Assembly Language for Intel-Based Computers 5/e, 2007. Web site Examples 13
INC and DEC ExamplesINC and DEC Examples
.datamyWord dw 1000hmyDword dd 10000000h.code
inc myWord ; 1001hdec myWord ; 1000hinc myDword ; 10000001h
mov ax,00FFhinc ax ; AX = 0100hmov ax,00FFhinc al ; AX = 0000h
Irvine, Kip R. Assembly Language for Intel-Based Computers 5/e, 2007. Web site Examples 14
Your turn...Your turn...
Show the value of the destination operand after each of the following instructions executes:
.datamyByte db 0FFh, 0.code
mov al,myByte ; AL =mov ah,[myByte+1] ; AH =dec ah ; AH =inc al ; AL =dec ax ; AX =
FFh00hFFh00hFEFF
Irvine, Kip R. Assembly Language for Intel-Based Computers 5/e, 2007. Web site Examples 15
ADD and SUB InstructionsADD and SUB Instructions
• ADD destination, source• destination destination + source
• SUB destination, source• destination destination – source
• Same operand rules as for the MOV instruction
Irvine, Kip R. Assembly Language for Intel-Based Computers 5/e, 2007. Web site Examples 16
ADD and SUB ExamplesADD and SUB Examples
.datavar1 DD 10000hvar2 DD 20000h.code ; ---EAX---
mov eax,var1 ; 00010000hadd eax,var2 ; 00030000hadd ax,0FFFFh ; 0003FFFFhadd eax,1 ; 00040000hsub ax,1 ; 0004FFFFh
Irvine, Kip R. Assembly Language for Intel-Based Computers 5/e, 2007. Web site Examples 17
NEG (negate) InstructionNEG (negate) Instruction
.datavalB db -1valW dw +32767.code
mov al,valB ; AL = -1neg al ; AL = +1neg valW ; valW = -32767
Reverses the sign of an operand. Operand can be a register or memory operand.
Irvine, Kip R. Assembly Language for Intel-Based Computers 5/e, 2007. Web site Examples 18
NEG Instruction and the FlagsNEG Instruction and the Flags
.datavalB db 1,0valC db -128.code
neg valB ; CF = 1, OF = 0neg [valB + 1] ; CF = 0, OF = 0neg valC ; CF = 1, OF = 1
The processor implements NEG using the following internal operation:
SUB 0,operand
Any nonzero operand causes the Carry flag to be set.
Irvine, Kip R. Assembly Language for Intel-Based Computers 5/e, 2007. Web site Examples 19
Implementing Arithmetic ExpressionsImplementing Arithmetic Expressions
Rval dd ?Xval dd 26Yval dd 30Zval dd 40.code
mov eax,Xvalneg eax ; EAX = -26mov ebx,Yvalsub ebx,Zval ; EBX = -10add eax,ebxmov Rval,eax ; -36
HLL compilers translate mathematical expressions into assembly language. You can do it also. For example:
Rval = -Xval + (Yval – Zval)
Irvine, Kip R. Assembly Language for Intel-Based Computers 5/e, 2007. Web site Examples 20
Flags Affected by ArithmeticFlags Affected by Arithmetic
• The ALU has a number of status flags that reflect the outcome of arithmetic (and bitwise) operations• based on the contents of the destination operand
• Essential flags:• Zero flag – set when destination equals zero• Sign flag – set when destination is negative• Carry flag – set when unsigned value is out of range• Overflow flag – set when signed value is out of range
• The MOV instruction never affects the flags.
Irvine, Kip R. Assembly Language for Intel-Based Computers 5/e, 2007. Web site Examples 21
Concept MapConcept Map
status flags
ALUconditional jumps
branching logic
arithmetic & bitwise operations
part of
used by provideattached to
affect
CPU
You can use diagrams such as these to express the relationships between assembly language concepts.
executes
executes
Irvine, Kip R. Assembly Language for Intel-Based Computers 5/e, 2007. Web site Examples 22
Zero Flag (ZF)Zero Flag (ZF)
mov cx,1sub cx,1 ; CX = 0, ZF = 1mov ax,0FFFFhinc ax ; AX = 0, ZF = 1inc ax ; AX = 1, ZF = 0
The Zero flag is set when the result of an operation produces zero in the destination operand.
Remember...• A flag is set when it equals 1. • A flag is clear when it equals 0.
Irvine, Kip R. Assembly Language for Intel-Based Computers 5/e, 2007. Web site Examples 23
Sign Flag (SF)Sign Flag (SF)
mov cx,0sub cx,1 ; CX = -1, SF = 1add cx,2 ; CX = 1, SF = 0
The Sign flag is set when the destination operand is negative. The flag is clear when the destination is positive.
The sign flag is a copy of the destination's highest bit:
mov al,0sub al,1 ; AL = 11111111b, SF = 1add al,2 ; AL = 00000001b, SF = 0
Irvine, Kip R. Assembly Language for Intel-Based Computers 5/e, 2007. Web site Examples 24
Signed and Unsigned IntegersSigned and Unsigned IntegersA Hardware ViewpointA Hardware Viewpoint
• All CPU instructions operate exactly the same on signed and unsigned integers
• The CPU cannot distinguish between signed and unsigned integers
• YOU, the programmer, are responsible for using the correct data type with each instruction
Added Slide. Gerald Cahill, Antelope Valley College
Irvine, Kip R. Assembly Language for Intel-Based Computers 5/e, 2007. Web site Examples 25
Carry Flag (CF)Carry Flag (CF)
The Carry flag is set when the result of an operation generates an unsigned value that is out of range (too big or too small for the destination operand).
mov al,0FFhadd al,1 ; CF = 1, AL = 00
; Try to go below zero:
mov al,0sub al,1 ; CF = 1, AL = FF
Irvine, Kip R. Assembly Language for Intel-Based Computers 5/e, 2007. Web site Examples 26
Your turn . . .Your turn . . .
mov ax,00FFhadd ax,1 ; AX= SF= ZF= CF=sub ax,1 ; AX= SF= ZF= CF=add al,1 ; AL= SF= ZF= CF=mov bh,6Chadd bh,95h ; BH= SF= ZF= CF=
mov al,2sub al,3 ; AL= SF= ZF= CF=
For each of the following marked entries, show the values of the destination operand and the Sign, Zero, and Carry flags:
0100h 0 0 000FFh 0 0 000h 0 1 1
01h 0 0 1
FFh 1 0 1
Irvine, Kip R. Assembly Language for Intel-Based Computers 5/e, 2007. Web site Examples 27
Overflow Flag (OF)Overflow Flag (OF)
The Overflow flag is set when the signed result of an operation is invalid or out of range.
; Example 1mov al,+127add al,1 ; OF = 1, AL = ??
; Example 2mov al,7Fh ; OF = 1, AL = 80hadd al,1
The two examples are identical at the binary level because 7Fh equals +127. To determine the value of the destination operand, it is often easier to calculate in hexadecimal.
Irvine, Kip R. Assembly Language for Intel-Based Computers 5/e, 2007. Web site Examples 28
A Rule of ThumbA Rule of Thumb
• When adding two integers, remember that the Overflow flag is only set when . . .• Two positive operands are added and their sum is
negative• Two negative operands are added and their sum is
positive
What will be the values of the Overflow flag?mov al,80hadd al,92h ; OF =
mov al,-2add al,+127 ; OF =
1
0
Irvine, Kip R. Assembly Language for Intel-Based Computers 5/e, 2007. Web site Examples 29
Data-Related Operators and DirectivesData-Related Operators and Directives
• OFFSET Operator• PTR Operator
Irvine, Kip R. Assembly Language for Intel-Based Computers 5/e, 2007. Web site Examples 30
OFFSET OperatorOFFSET Operator
• OFFSET returns the distance in bytes, of a label from the beginning of its enclosing segment
• Protected mode: 32 bits• Real mode: 16 bits
Irvine, Kip R. Assembly Language for Intel-Based Computers 5/e, 2007. Web site Examples 31
OFFSET ExamplesOFFSET Examples
.databVal db ?wVal dw ?dVal dd ?dVal2 dd ?
.codemov si,OFFSET bVal ; SI = 4000mov si,OFFSET wVal ; SI = 4001mov si,OFFSET dVal ; SI = 4003mov si,OFFSET dVal2 ; SI = 4007
Irvine, Kip R. Assembly Language for Intel-Based Computers 5/e, 2007. Web site Examples 32
Relating to C/C++Relating to C/C++
; C++ version:char array[1000];char * p = array;
The value returned by OFFSET is a pointer. Compare the following code written for both C++ and assembly language:
.dataarray db 1000 DUP(?).codemov si,OFFSET array ; SI is p
Irvine, Kip R. Assembly Language for Intel-Based Computers 5/e, 2007. Web site Examples 33
PTR OperatorPTR Operator
.datamyDouble dd 12345678h.codemov ax,myDouble ; error – why?
mov ax,WORD PTR myDouble ; loads 5678h
mov WORD PTR myDouble,4321h ; saves 4321h
Overrides the default type of a label (variable). Provides the flexibility to access part of a variable.
Irvine, Kip R. Assembly Language for Intel-Based Computers 5/e, 2007. Web site Examples 34
Little Endian OrderLittle Endian Order
• Little endian order refers to the way Intel stores integers in memory.
• Multi-byte integers are stored in reverse order, with the least significant byte stored at the lowest address
• For example, the doubleword 12345678h would be stored as:
Irvine, Kip R. Assembly Language for Intel-Based Computers 5/e, 2007. Web site Examples 35
PTR Operator ExamplesPTR Operator Examples
.datamyDouble dd 12345678h
mov al,BYTE PTR myDouble ; AL = 78hmov al,BYTE PTR [myDouble+1] ; AL = 56hmov al,BYTE PTR [myDouble+2] ; AL = 34hmov ax,WORD PTR myDouble ; AX = 5678hmov ax,WORD PTR [myDouble+2] ; AX = 1234h
Irvine, Kip R. Assembly Language for Intel-Based Computers 5/e, 2007. Web site Examples 36
PTR Operator PTR Operator (cont)(cont)
.datamyBytes db 12h,34h,56h,78h
.codemov ax,WORD PTR [myBytes] ; AX = 3412hmov ax,WORD PTR [myBytes+2] ; AX = 7856hmov eax,DWORD PTR myBytes ; EAX = 78563412h
PTR can also be used to combine elements of a smaller data type and move them into a larger operand. The CPU will automatically reverse the bytes.
Irvine, Kip R. Assembly Language for Intel-Based Computers 5/e, 2007. Web site Examples 37
Your turn . . .Your turn . . .
.datavarB db 65h,31h,02h,05hvarW dw 6543h,1202hvarD dd 12345678h
.codemov ax,WORD PTR [varB+2] ; a.mov bl,BYTE PTR varD ; b.mov bl,BYTE PTR [varW+2] ; c.mov ax,WORD PTR [varD+2] ; d.mov eax,DWORD PTR varW ; e.
Write down the value of each destination operand:
0502h78h02h1234h12026543h
Irvine, Kip R. Assembly Language for Intel-Based Computers 5/e, 2007. Web site Examples 38
Indirect AddressingIndirect Addressing
• Indirect Operands• Array Sum Example• Indexed Operands• Pointers
Irvine, Kip R. Assembly Language for Intel-Based Computers 5/e, 2007. Web site Examples 39
Indirect Operands Indirect Operands (1 of 2)(1 of 2)
.dataval1 db 10h,20h,30h.codemov si,OFFSET val1mov al,[si] ; dereference SI (AL = 10h)
inc simov al,[si] ; AL = 20h
inc simov al,[si] ; AL = 30h
An indirect operand holds the address of a variable, usually an array or string. It can be dereferenced (just like a pointer).
Irvine, Kip R. Assembly Language for Intel-Based Computers 5/e, 2007. Web site Examples 40
Indirect Operands Indirect Operands (2 of 2)(2 of 2)
.datamyCount dw 0
.codemov si,OFFSET myCountinc [si] ; error: ambiguousinc WORD PTR [si] ; ok
Use PTR to clarify the size attribute of a memory operand.
Should PTR be used here?
add [esi],20
yes, because [esi] could point to a byte, word, or doubleword
Irvine, Kip R. Assembly Language for Intel-Based Computers 5/e, 2007. Web site Examples 41
Array Sum ExampleArray Sum Example
.dataarrayW dw 1000h,2000h,3000h.code
mov si,OFFSET arrayWmov ax,[si]add si,2add ax,[si]add si,2add ax,[si] ; AX = sum of the array
Indirect operands are ideal for traversing an array. Note that the register in brackets must be incremented by a value that matches the array type.
ToDo: Modify this example for an array of doublewords.
Irvine, Kip R. Assembly Language for Intel-Based Computers 5/e, 2007. Web site Examples 42
Indexed OperandsIndexed Operands
.dataarrayW dw 1000h,2000h,3000h.code
mov esi,0mov ax,[arrayW + si] ; AX = 1000hmov ax,arrayW[si] ; alternate formatadd si,2add ax,[arrayW + si]etc.
An indexed operand adds a constant to a register to generate an effective address. There are two notational forms:
[label + reg] label[reg]
ToDo: Modify this example for an array of doublewords.
Irvine, Kip R. Assembly Language for Intel-Based Computers 5/e, 2007. Web site Examples 43
JMP and LOOP InstructionsJMP and LOOP Instructions
• JMP Instruction• LOOP Instruction• LOOP Example• Summing an Integer Array• Copying a String
Irvine, Kip R. Assembly Language for Intel-Based Computers 5/e, 2007. Web site Examples 44
JMP InstructionJMP Instruction
top:..jmp top
• JMP is an unconditional jump to a label that is usually within the same procedure.
• Syntax: JMP target
• Logic: EIP target
• Example:
Irvine, Kip R. Assembly Language for Intel-Based Computers 5/e, 2007. Web site Examples 45
LOOP InstructionLOOP Instruction
• The LOOP instruction creates a counting loop• Syntax: LOOP target• Logic:
• ECX ECX – 1• if ECX != 0, jump to target
Irvine, Kip R. Assembly Language for Intel-Based Computers 5/e, 2007. Web site Examples 46
LOOP ExampleLOOP Example
mov ax,0 mov ecx,5
L1: add ax,cxloop L1
The following loop calculates the sum of the integers 5 + 4 + 3 +2 + 1:
Irvine, Kip R. Assembly Language for Intel-Based Computers 5/e, 2007. Web site Examples 47
Your turn . . .Your turn . . .
What will be the final value of AX?
mov ax,6mov ecx,4
L1:inc axloop L1
How many times will the loop execute?
mov ecx,0X2:
inc axloop X2
10
4,294,967,296
Irvine, Kip R. Assembly Language for Intel-Based Computers 5/e, 2007. Web site Examples 48
Nested LoopNested LoopIf you need to code a loop within a loop, you must save the outer loop counter's ECX value. In the following example, the outer loop executes 100 times, and the inner loop 20 times.
.datacount dd ?.code
mov ecx,100 ; set outer loop countL1:
mov count,ecx ; save outer loop countmov ecx,20 ; set inner loop count
L2: ..loop L2 ; repeat the inner loopmov ecx,count ; restore outer loop countloop L1 ; repeat the outer loop
Irvine, Kip R. Assembly Language for Intel-Based Computers 5/e, 2007. Web site Examples 49
Summing an Integer ArraySumming an Integer Array
.dataintarray dw 100h,200h,300h,400h.code
mov di,OFFSET intarray ; address of intarraymov ecx,4 ; loop countermov ax,0 ; zero the accumulator
L1:add ax,[di] ; add an integeradd di,2 ; point to next integerloop L1 ; repeat until ECX = 0
The following code calculates the sum of an array of 16-bit integers.
Irvine, Kip R. Assembly Language for Intel-Based Computers 5/e, 2007. Web site Examples 50
Copying a StringCopying a String
.datasource db "This is the source string",0target db 26 DUP(0)
.codemov si,0 ; index registermov ecx, 26 ; loop counter
L1:mov al,source[si] ; get char from sourcemov target[si],al ; store it in the targetinc si ; move to next characterloop L1 ; repeat for entire string
The following code copies a string from source to target: