Table 1:
Load |
26% |
Compare |
14% |
Shift left and shift right |
4% |
Store |
9% |
Load immediate |
4% |
AND |
3% |
Add |
14% |
Conditional branch |
17% |
OR |
5% |
Sub |
0% |
Jump |
1% |
Other register-register instructions (XOR, NOT, etc.) |
1% |
Multiply |
0% |
Call |
1% |
||
Divide |
0% |
Return |
1% |
Using the data in Table 1, which of the following two
enhancements will result in faster execution of the five benchmark
programs that are described by the instruction frequency data?
Assume that the computer used to gather the instruction frequency
data in Table 1 had no cache memory and used main memory for all
fetch and store memory accesses.
Enhancement 1: Buy a faster,
more expensive processor. The time to complete the Fetch-Execute
cycle for every instruction except those accessing a memory address
is reduced by 10% by using faster, higher-priced MOSFET transistors
for the processor circuit.
Enhancement 2: Buy faster, more
expensive DRAM memory. The average access time for data accesses
only is reduced by one third and is the same for either fetch or
store access.
Hint: When reduction in time is given for an enhancement, that
datum must be converted to a speedup for the enhancement before the
conventional formulation of Amdahl’s Law can be employed to compute
overall speedup.
Given that
we have to Using the data in Table 1, which of the following two enhancements will result in faster execution of the five benchmark programs that are described by the instruction frequency data? Assume that the computer used to gather the instruction frequency data in Table 1 had no cache memory and used main memory for all fetch and store memory accesses.
ALU register-register |
add, sub, multiply, divide, shift left and shift right, AND, OR, Other register-register (xor, not, etc.) |
Memory access |
load, store, load immediate |
Conditional branch |
conditional branch, |
Jump | call, return, jump |
Table 1: Load 26% Compare 14% Shift left and shift right 4% Store 9% Load i...
Add 9 MUX 4 4 Addresult ALU Shift left 2 RegDst Branch MemRead Instruction (31-26) Control Memto Reg ALUOD MemWrite ALUSC RegWrite Instruction [25-21) Read PC Read address register 1 Read Instruction (20-16] MUX1 MUX Zero ALU ALU MUX3 M Instruction (31-0) Instruction memory Road Address data Read data 1 register 2 Write Read register data 2 Write data Registers result Instruction (15-11] Fox SX) Data Write data memory 16 32 Instruction (150) Sign- extend ALU control Instruction (5-0)
1. (10 points) Suppose you have a load-store computer with the following instruction mix Operation Frequency Number of clock cycles ALU ops Loads Stores Branches 40 % 20 % 18% 22 % 4 4 The ALU ops (arithmetic logic unit ops) typically use operands in CPU registers and hence they take fewer clock cycles to execute. However, if you want to add a memory operand to a CPU register, then you would have to explicitly load it into a CPU...
Questions1. The function L is defined as L(1) = 2,L(2) = 1,L(3) = 3,L(4) = 4 and for n ≥ 4,L(n + 1) = L(n) + L(n − 1) + L(n − 2)L(n − 3)i.e., the (n + 1)-th value is given by the sum of the n-th, n − 1-th and n − 2-th values divided by the n − 3-th value.(a) Write an assembly program for computing the k-th value L(k), where k is an integer bigger than...
Question 5 0.25 pts What is the value of the MemWrite control signal? Question 6 0.25 pts What is the value of the ALUSrc control signal? Add Add Sum--(1 4 Shift left 1 Branch MemRead Instruction [6-0] ControMemtoReg MemWrite ALUSrc RegWrite Instruction [19-15]Read Read register 1 Read Read data! PCaddress Instruction [24-20] Zero ALU ALU result register 2 Instruction 31-0 Instruction [11-7 Read1 Address data | Write Read register daiaALU | M Instruction memory Write data Registers Write Data data...
T F Xilinx's SDK assembler supports both FOR statements, but not wHILE statements T F In the ARM processor, immediate operands are stored in data memory, and not in the opcode T F In ARM processor instructions, one but not both operands can come from main T F In the ARM processor, a single load/store instruction T F It is possible for a microprocessor to use a virtual TCache memory is typically much faster and much larger than main memory...
1.) a.) Using the simplified instruction set shown for part b, write code for the following. Suppose memory locations 1400 to 1449 contain 16-bit words. Each word represents 2 ASCII characters. Write code to read in and write out these 100 characters. Left-side character from location 1400 should be first, right-side character from location 1400 should be second, and remaining characters follow in numeric order. Assume you have access to 4 registers: R1, R2, R3, R4. Each register holds one...
1. Introduced by IBM with its System/360, the _________ is a set of computers offered with different price and performance characteristics that presents the same architecture to the user. 2. A large number of general-purpose registers, and/or the use of compiler technology to optimize register usage, a limited and simple instruction set, and an emphasis on optimizing the instruction pipeline are all key elements of _________ architectures. 3. The difference between the operations provided in high-level languages (HLLs) and those...
Group Project 1 The Micro-1 Processor Simulation <Micro-1 Computer> Here's the organization of a computer equipped with a Micro-1 processor Memory contains an array of integer cells: int cell[] = new int[CAP]; where CAP is the capacity of memory. Initially this is set to 256. Internally, the Micro-1 processor is equipped with eight 32-bit data/address registers and two 32 bit control registers: PC, the program counter, contains the address of the next instruction to execute. IR, the instruction register, contains...