Adopt CSS-style drop shadow mask #146

jserv · 2025-10-26T16:04:20Z

This commit generates shadow_gaussian lookup table during the build, so the renderer stops depending on runtime math. It replaces the stack-blur border writes with a lookup-table renderer that matches CSS behavior and stays efficient.

For performance considerations, it caches the vertical and bottom falloff tables and rely on memset() or pointer writes to trim per-frame work. It exposes a small configurable fade tail and update defaults plus the API, so it aligns with CSS box-shadow semantics.

Summary by cubic

Switches drop shadow rendering to a CSS-style box-shadow mask using a precomputed Gaussian LUT. Improves visual match and speeds up rendering while simplifying the shadow API.

New Features
- Generate a fixed-point Gaussian weight LUT at build time (scripts/gen-shadow-lut.py → src/shadow-gaussian-lut.h), removing runtime math.
- Replace stack-blur edge writes with a LUT-based renderer for right, bottom, and corner falloffs to match CSS behavior.
- Cache vertical and bottom falloff tables and use fast clears to cut per-frame work.
- Add SHADOW_FADE_TAIL (default 2) and update defaults: HORIZONTAL_OFFSET 6, VERTICAL_OFFSET 6, SHADOW_BLUR 12; set default shadow color to 50% black (0x80000000).
Migration
- Update calls to twin_shadow_border(shadow, color); remove shift_x and shift_y as offsets now come from Kconfig.

cubic-dev-ai

No issues found across 7 files

src/draw-common.c

weihsinyeh · 2025-10-28T10:30:10Z

src/draw-common.c

+#define SHADOW_LUT_Y_LEN (CONFIG_VERTICAL_OFFSET + CONFIG_SHADOW_FADE_TAIL)
+
+/* Fast lookup: 17-entry approximation of (1 - t^2)^2 * 0.92 + 0.08. */
+static inline twin_fixed_t shadow_gaussian_weight(twin_fixed_t t)


It seems that the shadow_gaussian_weight() function has the same effect as twin_stack_blur(). Removing the call to twin_stack_blur() still produces a gradient color effect.

It seems that the shadow_gaussian_weight() function has the same effect as twin_stack_blur().

No, these two functions have completely different purposes and effects.

The purpose of twin_stack_blur:

Blurs an existing image in-place

Uses the Stack Blur algorithm (2-pass separable convolution)

Horizontal scan + vertical scan

Creates a temporary pixmap for two-directional processing

Complexity: O(2 × width × height × k)

The purpose of shadow_gaussian_weight:

Looks up a single weight value from a precomputed LUT

Input: t ∈ [0, 1] (fixed-point format)

Output: Gaussian weight (16.16 fixed-point)

Uses formula: (1 - t²)² × 0.92 + 0.08

Complexity: O(1) lookup

Comparisons:

twin_stack_blur is like taking a photograph and applying a Photoshop blur filter to it. You're processing what's already there.

shadow_gaussian_weight is like looking up paint opacity values from a chart to paint a gradient from scratch. You're creating something new.

src/draw-common.c

weihsinyeh · 2025-10-29T06:28:04Z

src/draw-common.c

+        twin_coord_t bottom_start = win_height;
+        if (bottom_start < 0)
+            bottom_start = 0;
+        twin_coord_t bottom_end = win_height + bottom_extent;


twin_coord_t win_height = shadow->height - shadow->window->shadow_y;
twin_coord_t bottom_extent = shadow->height - win_height;
Therefore, bottom_end == shadow->height

weihsinyeh · 2025-10-29T06:28:42Z

src/draw-common.c

+        if (bottom_start < 0)
+            bottom_start = 0;
+        twin_coord_t bottom_end = win_height + bottom_extent;
+        if (bottom_end > shadow->height)


There is no need to compare.

There is no need to compare.

Scenario where bounds check matters:
CONFIG_VERTICAL_OFFSET = 6, FADE_TAIL = 2
∴ lut_y_len = 8

Case 1: shadow_y = 10
→ bottom_extent limited to 8
→ bottom_end = win_height + 8
→ Need bounds check if win_height + 8 > shadow->height

Case 2: shadow_y = 20 (large window)
→ bottom_extent limited to 8
→ bottom_end = win_height + 8 (definitely < shadow->height)

Bounds check is REQUIRED because bottom_extent gets clamped to LUT size.

weihsinyeh · 2025-10-29T06:36:52Z

src/draw-common.c

+                if (bottom_width <= 0)
+                    continue;
+
+                twin_coord_t dist_y = y - win_height;


twin_coord_t dist_y = y - bottom_start; this is clearer.

twin_coord_t dist_y = y - bottom_start; this is clearer.

Current implementation is correct for LUT indexing semantics.

alpha_lut_y is built with indices [0, y_offset)

LUT entry 0 = maximum alpha (at window edge)

LUT entry y_offset-1 = minimum alpha (at shadow fade end)

dist_y must represent distance from window edge (win_height)

NOT distance from loop start (bottom_start)

weihsinyeh · 2025-10-29T06:40:33Z

Only need to keep one variable for the same function.

This commit generates shadow_gaussian lookup table during the build, so the renderer stops depending on runtime math. It replaces the stack-blur border writes with a lookup-table renderer that matches CSS behavior and stays efficient. For performance considerations, it caches the vertical and bottom falloff tables and rely on memset() or pointer writes to trim per-frame work. It exposes a small configurable fade tail and update defaults plus the API, so it aligns with CSS box-shadow semantics.

jserv requested review from jouae, ndsl7109256 and weihsinyeh October 26, 2025 16:04

cubic-dev-ai bot reviewed Oct 26, 2025

View reviewed changes