Answer (a) :
Average execution time of each instruction (except floating point) = 10ns
In question we need to calculate the average execution time of floating point instruction and we have the average execution time of overall program B so we use simple average formula.
average execution time of overall program B =18.1ns
So average formula :
where = Average execution time of each instruction
n=Number of instructions
x=Average execution time of floating point instruction
Answer) On calculation , x=2.172ns
Answer (b) :
Given system's clock frequency(clock rate)=100MHz
We need to calculate CPI in this question for program B. Now what is CPI ?
CPI : It is the average number of clock cycles per instruction. If for each instruction we are aware of its frequency and its number of clock cycles it will take for execution , then we can calculate the CPI of overall code.
CPI formula :
where, F = frequency of each instruction (i.e the probability of occurence of each instruction as given in question
N = Number of clock cycles associated with each instruction iof the program
From the average execution time of each instructions we can take out the number of clock cycles per instruction by formula :
Clock rate is the clock frequency so will give us the clock cycle time preferrably in (seconds).
Given, System's clock rate = 100MHz
Execution time of floating point instruction = 2.172 ns
Execution time of all other instructions(except floating point) = 10 ns
For floating point instruction ,
N (no of clock cycles) = 4.6
For all other instructions ,
N (no of clock cycles) = 1
Now CPI ,
Answer) CPI=1.684 clock cycles/instruction
Problem 2. Probability of occurrence Program A Program B (commercial) (scientific) Instruction type Consider the instruction...
Assume that for a program, compiler A results in a dynamic instruction count of 8.0E8 and has an execution time of 2.4 s, while compiler B results in a dynamic instruction count of 1.25E9 and an execution time of 1.8 s. a. [10] Find the average CPI for each program given that the processor has a clock cycle time of 2 ns. b. [10] Assume the compiled programs run on two different processors. If the execution times on the two...
urgent help needed in computer architecture ! Thanx Question 4 (9 pts total) program consists of 8 instructions. Each instruction takes an average of 140 ns to access memory and decode the instruction. It takes an additional 20 ns for the CPU to process each instruction. 4.1 How long does the program take to run? (3 pts) 4.2 An interrupt occurs during the program that delays the program by 240 ns. What percentage of time is spent processing data by...
Problem 4 (15pts): (a) (5pts) Consider the following MIPS memory with data shown in hex, which are located in memory from address 0 through 15. Show the result of the MIPS instruction "lw Ss0,4(Sa0)" for machines in little-endian byte orders, where Sa0 4. Address Contents Address Contents 9b lb 2 4 6 10 b4 c5 12 13 14 15 3d 5f 70 7 8f (b) (10pts)Assume we have the following time, performance and architecture parameters in the specified units Ec-...
Problem 3. (25 pts.) Compilers can have a profound impact on the performance of an application. Assume that for a program, compiler A results in a dynamic instruction count of 1 billion instructions and has an execution time of 1.1 seconds, while compiler B results in a dynamic instruction count of 1.2 billion instructions and an execution time of 1.5 seconds. A) Find the average CPI for each program given that the processor has a clock cycle time of 1...
Problem 4 (15pts): (a) (5pts) Consider the following MIPS memory with data shown in hex, which are located in memory from address 0 through 15. Show the result of the MIPS instruction "lw Ss0,4(Sa0)" for machines in little-endian byte orders, where Sa0 4. Address Contents Address Contents 9b lb 2 4 6 10 b4 c5 12 13 14 15 3d 5f 70 7 8f (b) (10pts)Assume we have the following time, performance and architecture parameters in the specified units Ec-...
Problem 4 (15pts): (a) (5pts) Consider the following MIPS memory with data shown in hex, which are located in memory from address 0 through 15, Show the result of the MIPS instruction "Iw Ss0, 4(Sa0)" for machines in little-endian byte orders, where Sa0 8 Address Contents Address Contents 4¢ 8 c5 6d 1 9 2 7e 8f 66 10 70 11 8a Oa 12 13 14 15 1b a3 b4 2c 6 3d 7 (b) (10pts)Assume we have the following...
Problem 4 (15pts): hines iom address oing MIPS memory with data shown in hex, which are located in little-endian byte on rough 15. Show the result of the MIPS instruction "w Ss0, 4(Sa0)" for an byte orders, where $a0 4 Address Contents Address Contents 0a 1 b 2c 3d 8a 9b 10 b4 c5 6d 7e 8f 5f 13 14 15 70 (b) (10pts)Ass specified units. ume we have the following time, performance and architecture parameters in the Ec execution...
Problem 0 Consider three different processors P, P2, and P3 executing the same instruction set. PI has a 3 GHz clock rate and a CPI of 1.5. P2 has a 2.5 GHz clock rate and a CPI of 1.0. P3 has a 4.0 GHz clock rate and has a CPI of 2.2. a. Which processor has the highest performance expressed in instructions per second? b. Ifthe processors ecach execute a program in 10 seconds, find the number of eycles and...
Topics 1. MIPS instruction set architecture (ISA). 2. Performance. 3. MIPS datapath and control. Exercise 1 Consider the memory and register contents shown below. Registers Ox0100 FFF8 13 ($t 5) 14 ($t6) 0x0100 FFFC 0x0101 0000 Memory 0x0000 0000 0x0001 1100 0x0A00 со00 0x1234 4321 OxBAOO OOBB 15 OXAAAA 0000 0x1111 1010 0x7FFF FFFD 0x0100 FFFO 0x0101 0008 (St7) Ox0101 0004 16 ($80) 0x0101 0008 17 ($sl) Show what changes and give the new values in hexadecimal after the following...
(6 pts) A program P2 is executed on computer system B, which has a 2.8571429 GHz clock frequency. A programmer writes code to read processor performance monitoring registers after the program completes (Intel refers to the part of the CPU that can be used to collect run-time information as the Performance Monitoring Unit or PMU). The performance monitoring unit indicates that 23,118,471,971 user instructions were executed (just the instructions of P2 excluding OS and overhead instructions) in 53,866,039,692 clock cycles....