[컴퓨터 구조] 챕터 4리뷰

컴퓨터 구조

klee9 2024. 11. 15. 16:30

외우지 말고 이해하라고 하신다. 중간고사가 끝나고 보니까 너무 잘 이해된다.

Jump (unconditional)
- ex) j F
  1. read instruction stored in the PC address from the instruction memory.
  2. 26-bit address is multiplied by 4 (shift left 2) and the top 4 bits of (PC + 4) is concatenated to the front of the shifted value (= jump address)
  3. the value of PC changes to the jump address.

Improves performance by increasing throughput (latency may decrease)
Ideal speedup == # of stages
Consists of 5 stages
1. IF: fetch instruction from memory
2. ID: decode instruction & read registers
3. EX: execute operation or calculate address
4. MEM: access memory operand
5. WB: write the result back to the register.
Examples of needed steps
- lw: IF - ID - EX - MEM - WB
- sw: IF - ID - EX - MEM
- R-format: IF - ID - EX - WB
Pipeline performance
- single cycle -> total time for N instructions = 800N ps (800 ps for each stage)
- pipelined -> total time for N instructions = 800 + 200N ps
- if N -> inf, then speedup = 4 (not 5 as we're wasting some time)

Structure hazard
- A required resource is busy -> solved by using multiple memories
Data hazard
- Need to wait for previous instruction to complete read/write
- ex) add s0, t0, t1
  sub t2, s0, t3

Forwarding
- This can solve the issue above.
- Uses the ALU result immediately after it's computed.
- One extra connection is needed in the datapath as pipelining is done in one circuit.

Load-use data hazard
- Cannot be solved by forwarding. (We can't go back in time and fetch the old value)
- MUST stall. However, this can be avoided to some extent by reordering the code (code rescheduling; done by the compiler)
- ex) lw s0, 20(t1)
  sub t2, s0, t3

Control hazard
- Fetching next instruction depends on branch outcome
- Must stall until outcome is determined
- To avoid stall, predict branch outcome (MIPS always branches)
- Predicting an average of 0.5 cycles.
Pipeline Registers
- Used for holding information produced in the previous cycle

Simplified pipelined "control"
- Control signals are the same, but the timing of when to use control.
- Control signals are passed down the pipeline.

Stalls and performance
- Stalls reduce performance but are required for correct results.
- Compilers can rearrage code to avoid hazards / stalls.
Instruction-level parallelism (ILP)
- Pipelining != parallel, but true to some extent.
- To increase ILP:
  1. Deeper pipeline: shorter clock cycle
  2. Multiple issues: multiple datapaths
  3. Loop unrolling: copy and paste the same code instead of looping. It's actually faster

[컴퓨터 구조] 챕터5 리뷰 (3)	2024.12.20
[컴퓨터 구조] 챕터1 리뷰 (3)	2024.10.25

klee9 님의 블로그

klee9 님의 블로그 입니다.

klee9 님의 블로그