# **Complex Instruction Sets** CSE 410 - Computer Systems November 2, 2001 # Readings and References - · Reading - Sections 3.12 3.15, Computer Organization & Design, Patterson and Hennessy - Other References - VAX MACRO and Instruction Set Reference Manual (Compaq) - PowerPC Assembly Language Reference (IBM) 2-Nov-2001 CSE 410 - CISC and RISC #### CISC and RISC - Complex Instruction Set Computer - VAX: about 325 instructions - uses very powerful instructions to do the work of the program - Reduced Instruction Set Computer - MIPS: about 100 instructions - use many simple instructions to do the same amount of work faster 2-Nov-2001 CSE 410 - CISC and RISC 3 ## Digital Equipment VAX - 1977 - Advances in microcode technology made complex instructions possible - Memory was expensive - so small program size = good - Compilers had a long way to go - so ease of translation from high-level language to assembly = good 2-Nov-2001 CSE 410 - CISC and RISC #### VAX Instructions - Stack manipulation - pop, push registers - Queue manipulation - insert, remove entries at head or tail - Cyclic Redundancy Check - · Character Strings - Compare, Locate, Match, Move, Scan, Skip, Span - Move Translated Until Character 2-Nov-2001 CSE 410 - CISC and RISC # MOVTUC Move Translated Until Character The **source string** specified by the source length and source address operands **is translated**. It replaces the destination string specified by the destination length and destination address operands. Translation is accomplished by using **each byte of the source string as an index into a 256-byte table** whose first entry address (entry number 0) is specified by the table address operand. The **byte selected replaces the byte** of the destination string. Translation continues until a translated byte is equal to the escape byte, or until the source string or destination string is exhausted. If translation is terminated because of escape, the condition code V-bit is set; otherwise, it is cleared. Nov-2001 CSE 410 - CISC and RISC # Impact on a pipeline? "During the execution of the character string instructions, pending interrupt conditions are tested. If any conditions are found, the control block is updated, a first-part-done bit is set in the processor status longword (PSL), and the instruction is interrupted. After the interruption, the instruction resumes transparently." 2-Nov-2001 CSE 410 - CISC and RISC # Intel 80x86 - 1978 and on - 8086 - extension of the 8-bit 8080 microprocessor - dedicated register usage requirements - 80286, 80386 - segmented 32-bit address space - general purpose registers - 80486, Pentiums - performance, parallelism 2-Nov-2001 CSE 410 - CISC and RISC ## **Backward Compatible** - A key feature and a major problem is that the Intel architecture has maintained backward compatibility - Good: old programs run on new CPUs - Bad: new CPUs must implement the complex designs of the past - Intel has repeatedly pushed the architecture beyond what was thought possible 2-Nov-200 CSE 410 - CISC and RISC #### The RISC Reaction - Complex instructions - Take longer to execute - Take more hardware to implement - Idea: compose simple, fast instructions - Less hardware is required - Faster execution speed if done right - PUSHR vs. sw + sw + sw 2-Nov-20 CSE 410 - CISC and RISC ## Born to Pipeline - MIPS 1985 - Instructions all one length - simplifies Instruction Fetch stage - Regular format - simplifies Instruction Decode - Few memory operands, only registers - only lw and sw instructions access memory - Aligned memory operands - only one memory access per operand 11 #### PowerPC - 1993 and on - Branch unit - prefetch instructions and analyze - substitute destination instruction for branch - Superscalar - multiple functional units operating in parallel - · Memory access - unaligned load / store - big-endian, little-endian 2-Nov-2001 CSE 410 - CISC and RISC 2 # Clever isn't always Good "While the PowerPC architecture provides both load and store multiple instructions for GPRs, it discourages their use because their implementation on some machines may not be optimal. In fact, use of the load and store multiple instructions on some future implementations may be significantly slower than the equivalent series of single word loads or stores." 13 2-Nov-2001 CSE 410 - CISC and RISC ``` int strlen(char *s) int strlen(char *s) { char *p = s; while (*p != '\0') p++; return p-s; } ``` ## strlen on a Pentium ``` Address Machine Code Assembly Code 00000 8b 4c 24 04 mov ecx, DWORD PTR _s$[esp-4] 00006 80 39 00 cmp BYTE PTR [ecx], 0 je SHORT $L36 00009 74 06 $L35: 0000b inc eax cmp BYTE PTR [eax], 0 80 38 00 0000c 0000f jne SHORT $L35 75 fa $L36: 00011 2b cl sub eax, ecx 00013 c3 ret 0 CSE 410 - CISC and RISC 15 ``` #### strlen on a PowerPC CSE 410 - CISC and RISC | <u>Address</u> | Machine Code | Assembly | Code | |----------------|--------------|----------|------------| | | | | | | 00000000 | 93E1FFFC | stw | r31,-4(SP) | | 00000004 | 7C7F1B78 | mr | r31,r3 | | 80000008 | 48000008 | b | *+8 | | 000000C | 3BFF0001 | addi | r31,r31,1 | | 00000010 | 889F0000 | lbz | r4,0(r31) | | 00000014 | 7C840774 | extsb | r4,r4 | | 00000018 | 2C040000 | cmpwi | r4,0 | | 000001C | 4082FFF0 | bne | *-16 | | 00000020 | 7C63F850 | sub | r3,r31,r3 | | 00000024 | 83E1FFFC | lwz | r31,-4(SP) | | 00000028 | 4E800020 | blr | | | | | | | | | | | | #### strlen on a MIPS | Address | Machine Code | Assembly C | <u>'ode</u> | | |------------|--------------|---------------------|----------------|----| | | | | | | | 00000000 | 80820000 | 1b | \$v0,0(\$a0) | | | 00000004 | 00041821 | move | \$v1,\$a0 | | | 80000008 | 10400006 | beqz | \$v0,done | | | 0000000c | 24630001 | addu | \$v1,\$v1,1 | | | loop: | | | | | | 00000010 | 80620000 | 1b | \$v0,0(\$v1) | | | 00000014 | 24630001 | addu | \$v1,\$v1,1 | | | 00000018 | 1440fffe | bnez | \$v0,loop | | | 0000001c | 2463ffff | subu | \$v1,\$v1,1 | | | done: | | | | | | 00000020 | 00641023 | subu | \$v0,\$v1,\$a0 | | | 00000024 | 03e00008 | j | \$ra | | | 2-Nov-2001 | C | SE 410 - CISC and I | RISC | 17 |