
Conversation


@yy214123 yy214123 commented Oct 29, 2025

Extend the existing architecture to cache the last fetched PC instruction, improving the instruction fetch hit rate by approximately 2%, based on the performance metrics introduced in #99 (MMU Cache Statistics):

=== MMU Cache Statistics ===

Hart 0:
   Fetch: 443724828 hits,    7671580 misses (98.30% hit rate)
   Load:   65003296 hits,   34592372 misses (2-way) (65.27% hit rate)
   Store:  59967516 hits,   11483892 misses (83.93% hit rate)

Also includes clang-format fixes for several expressions.


Summary by cubic

Implemented a direct-mapped instruction fetch cache with a small victim cache to reduce conflict misses and speed up fetches. Improves instruction fetch hit rate by ~2% (per MMU cache stats from #99) and reduces fetch translations by ~7%.

  • New Features

    • Direct-mapped I-cache (256 blocks × 256 B).
    • 16-block victim cache with swap-on-hit to mitigate conflict misses (see the structural sketch after this summary).
    • I-cache invalidation on MMU invalidate.
    • 2-entry direct-mapped fetch page cache with parity hash to avoid thrashing.
  • Refactors

    • Documented I-cache macros/masks and renamed IC/VC prefixes to ICACHE/VCACHE.
    • Fixed tag calculation, victim-cache fill, and hit/miss accounting in mmu_fetch.

Written for commit db3a37f. Summary will update automatically on new commits.
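
For orientation, the bullets above map onto roughly the following layout. This is an illustrative sketch, not the PR's code: the sizes come from the summary, the 16-bit/24-bit tag split from the review discussion near the end of the thread, and every identifier except icache_block_t and i_block (which appear later in the conversation) is an assumption.

  #include <stdbool.h>
  #include <stdint.h>

  /* Illustrative sketch only; names and exact field layout are assumptions. */
  #define ICACHE_BLOCK_SIZE 256 /* 256 B per block                        */
  #define ICACHE_BLOCKS 256     /* direct-mapped primary I-cache          */
  #define VCACHE_BLOCKS 16      /* small victim cache for conflict misses */

  typedef struct {
      uint32_t tag; /* PA[31:16]: 16-bit tag; PA[15:8] selects the block */
      bool valid;
      uint8_t data[ICACHE_BLOCK_SIZE];
  } icache_block_t;

  typedef struct {
      uint32_t tag; /* PA[31:8]: 24-bit [tag | index], since the victim
                     * cache has no index of its own and is searched linearly */
      bool valid;
      uint8_t data[ICACHE_BLOCK_SIZE];
  } vcache_block_t;

  typedef struct {
      icache_block_t i_block[ICACHE_BLOCKS]; /* primary, indexed by PA[15:8] */
      vcache_block_t v_block[VCACHE_BLOCKS]; /* probed on a primary miss     */
  } icache_t;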

@yy214123 yy214123 marked this pull request as draft October 29, 2025 18:56
cubic-dev-ai[bot]

This comment was marked as outdated.

@jserv

This comment was marked as outdated.

@yy214123 yy214123 force-pushed the direct-mapped-cache branch 2 times, most recently from 98114a7 to 0e4f67b Compare October 30, 2025 05:32
@jserv jserv (Collaborator) commented Oct 30, 2025

Consider a 2-way page cache: given that the current page-level cache already achieves a 98.30% hit rate, it may be simpler to upgrade that instead:

  // Current: single-entry page cache
  mmu_fetch_cache_t cache_fetch;

  // Proposed: 2-way set-associative page cache
  mmu_fetch_cache_t cache_fetch[2];  // Only +16 bytes overhead

Use a parity hash (like the load/store caches):

  uint32_t idx = __builtin_parity(vpn) & 0x1;
  if (unlikely(vpn != vm->cache_fetch[idx].n_pages)) {
      // ... fill
  }
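
To make the suggestion concrete, here is a minimal, self-contained sketch of what the 2-way lookup could look like. Only cache_fetch, n_pages, and the parity-hash index are taken from the snippet above; the struct members, the translate_fetch_page() stub, the vm_t name, and the 4 KiB page size are assumptions standing in for semu's actual MMU code.

  #include <stdint.h>

  /* Toy model only: the types and the translation stub are hypothetical. */
  typedef struct {
      uint32_t n_pages; /* cached virtual page number (field name from the snippet) */
      uint32_t ppn;     /* assumed: cached physical page number                     */
  } mmu_fetch_cache_t;

  typedef struct {
      mmu_fetch_cache_t cache_fetch[2]; /* 2-way page cache (cold-start init omitted) */
  } vm_t;

  /* Stand-in for the real page-table walk; identity-maps for the demo. */
  static uint32_t translate_fetch_page(vm_t *vm, uint32_t vpn)
  {
      (void) vm;
      return vpn;
  }

  static uint32_t fetch_paddr(vm_t *vm, uint32_t vaddr)
  {
      uint32_t vpn = vaddr >> 12;                 /* 4 KiB pages assumed     */
      uint32_t idx = __builtin_parity(vpn) & 0x1; /* same hash as load/store */

      if (vpn != vm->cache_fetch[idx].n_pages) {
          /* Miss in this way: walk the page table and refill only this entry. */
          vm->cache_fetch[idx].n_pages = vpn;
          vm->cache_fetch[idx].ppn = translate_fetch_page(vm, vpn);
      }
      return (vm->cache_fetch[idx].ppn << 12) | (vaddr & 0xFFF);
  }

Note that __builtin_parity() already returns 0 or 1, so the & 0x1 mask is redundant, though keeping it mirrors the existing load/store cache code.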

@yy214123 yy214123 force-pushed the direct-mapped-cache branch 2 times, most recently from bb9e6cb to 74e3b99 Compare October 30, 2025 19:24
@yy214123 yy214123 closed this Oct 31, 2025
@yy214123

This comment was marked as outdated.

@yy214123 yy214123 reopened this Oct 31, 2025
jserv

This comment was marked as resolved.

@yy214123 yy214123 force-pushed the direct-mapped-cache branch from 686cede to 5478710 Compare November 1, 2025 06:40
@yy214123 yy214123 requested a review from jserv November 2, 2025 09:08
@jserv jserv (Collaborator) left a comment


Unify the naming scheme.

Extend the existing architecture to cache the last fetched PC instruction,
improving instruction fetch hit rate by approximately 2%.

Also includes clang-format fixes for several expressions.

- Rename I-cache structures from ic to icache to avoid ambiguity.
- Add explanations for instruction-cache definitions and masks, and align
  the macro names with the terminology used in the comments.
- Fix tag calculation to use the precomputed tag rather than shifting the
  physical address.

Replace the previous 1-entry direct-mapped design with a 2-entry
direct-mapped cache using hash-based indexing (same parity hash as
cache_load). This allows two hot virtual pages to coexist without
thrashing.

Measurement shows that the number of virtual-to-physical translations
during instruction fetch (mmu_translate() calls) decreased by ~10%.

Introduce a small victim cache to reduce conflict misses in the
direct-mapped instruction cache. On an I-cache miss, probe the victim
cache; on a hit, swap the victim block with the current I-cache block
and return the data.

Also rename ic.block → ic.i_block to distinguish between primary
I-cache blocks and victim cache blocks.
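
A compact, tags-only sketch of the miss path described above (probe the victim cache on a primary miss, swap on a victim hit). Every identifier and the helper itself are illustrative, not the PR's code; the 16-bit versus 24-bit tag widths follow the review discussion further down.

  #include <stdbool.h>
  #include <stddef.h>
  #include <stdint.h>

  /* Simplified model: blocks carry only tags, no data payload. */
  #define ICACHE_BLOCKS 256
  #define VCACHE_BLOCKS 16

  typedef struct {
      bool valid;
      uint32_t tag;
  } block_t;

  static block_t i_block[ICACHE_BLOCKS]; /* primary: tag = PA[31:16]          */
  static block_t v_block[VCACHE_BLOCKS]; /* victim:  tag = PA[31:8] (tag|idx) */

  /* Returns true when the block for 'paddr' is resident after probing the
   * primary I-cache and, on a conflict, the victim cache. */
  static bool icache_lookup(uint32_t paddr)
  {
      uint32_t idx = (paddr >> 8) & 0xFF; /* block index in the primary cache */
      uint32_t tag = paddr >> 16;         /* 16-bit primary tag               */
      uint32_t vtag = paddr >> 8;         /* 24-bit victim tag                */

      if (i_block[idx].valid && i_block[idx].tag == tag)
          return true; /* primary hit */

      for (size_t i = 0; i < VCACHE_BLOCKS; i++) {
          if (v_block[i].valid && v_block[i].tag == vtag) {
              /* Victim hit: promote the victim block into the primary slot and
               * demote the conflicting primary block, widening its tag to the
               * 24-bit form the victim cache is searched with. */
              block_t evicted = i_block[idx];
              i_block[idx].valid = true;
              i_block[idx].tag = tag;
              v_block[i].valid = evicted.valid;
              v_block[i].tag = (evicted.tag << 8) | idx;
              return true;
          }
      }
      return false; /* miss in both: the caller refills from memory */
  }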

Adjust instruction-cache-related defines and identifiers:
- Rename the IC/ic prefix to ICACHE/icache
- Rename the VC/vc prefix to VCACHE/vcache

The previous implementation did not correctly place the evicted I-cache
block into the victim cache, leaving all victim entries empty so they
could never hit. This patch properly stores the replaced I-cache block
into the victim cache before the refill, allowing victim hits to
function as intended.

Measurement shows that the number of virtual-to-physical translations
during instruction fetch (mmu_translate() calls) decreased by ~7%.
@yy214123 yy214123 force-pushed the direct-mapped-cache branch from fe2d95e to f657fb2 Compare November 4, 2025 16:55
@jserv jserv (Collaborator) left a comment


Rebase onto the latest 'master' branch and resolve the build errors.

Adjust expressions to align with the new 2-entry cache_fetch design
introduced in "Adopt 2-entry direct-mapped page cache".
@yy214123 yy214123 force-pushed the direct-mapped-cache branch from aab465c to db3a37f Compare November 4, 2025 18:20
@sysprog21 sysprog21 deleted a comment from yy214123 Nov 4, 2025
@jserv jserv (Collaborator) left a comment


Squash commits and refine commit message.

@visitorckw visitorckw (Collaborator) left a comment


This series appears to contain several "fix-up," "refactor," or "build-fix" commits that correct or adjust a preceding patch.

To maintain a clean history and ensure the project is bisectable, each patch in a series should be complete and correct on its own.

@visitorckw visitorckw (Collaborator) commented:

As a friendly reminder regarding project communication:

Please ensure that when you quote-reply to others' comments, you do not translate the quoted text into any language other than English.

This is an open-source project, and it's important that we keep all discussions in English. This ensures that the conversation remains accessible to everyone in the community, including current and future participants who may not be familiar with other languages.

  icache_block_t tmp = *blk;
  *blk = *vblk;
  *vblk = tmp;
  blk->tag = tag;
A collaborator commented on the snippet above:

This code looks suspicious to me.

When you move the evicted I-cache block (tmp) back into the victim cache, you are setting the vblk->tag to tmp.tag, which is the 16-bit I-cache tag.

Won't this corrupt the victim cache entry? The VC search logic requires a 24-bit tag ([ICache Tag | ICache Index]) to function. Because you're only storing the 16-bit tag, this VCache entry will never be hit again.
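
If that reading is right, one possible fix is to rebuild the 24-bit victim tag from the demoted block's 16-bit tag and its set index before leaving it in the victim cache. This is only a sketch against the quoted lines: it assumes the surrounding code has the set index available as idx and that the two block types share a layout, neither of which is visible in the excerpt.

  /* Sketch of a possible fix; 'idx' is assumed to be the set the evicted
   * block came from, and identifiers otherwise follow the quoted snippet. */
  icache_block_t tmp = *blk;
  *blk = *vblk;                     /* promote the victim block              */
  blk->tag = tag;                   /* 16-bit tag for its new primary slot   */
  *vblk = tmp;                      /* demote the evicted primary block      */
  vblk->tag = (tmp.tag << 8) | idx; /* rebuild the 24-bit [tag | index] form */

Whether the shift amount and field widths match would need to be checked against the ICACHE/VCACHE macro definitions in the patch.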

