perf(stark): skip fixed 0/1 muls in LogUp fingerprint accumulation by diegokingston · Pull Request #696 · yetanotherco/lambda_vm

diegokingston · 2026-06-22T15:39:19Z

In the fingerprint hot loop (prover aux-build + constraint-eval + verifier):

Bus-id term: alpha_powers[0] = alpha^0 = 1, so embed the bus id into the extension field directly instead of multiplying by 1 (drops one F*E mul per interaction per row, hoisted out of the row loop on the aux path).
Fixed-zero bus elements (the ~235 constant(0) used for bus-width padding) contribute nothing: skip the F*E multiply + accumulate entirely. Variable elements that happen to be zero on a row also benefit.

Value-identical (field addition is exactly associative): stark lib 128/128 (default + parallel), prover bus/logup tests pass, clippy clean. Net effect on prove time is what we want to measure on the 32-core bench.

In the fingerprint hot loop (prover aux-build + constraint-eval + verifier): - Bus-id term: alpha_powers[0] = alpha^0 = 1, so embed the bus id into the extension field directly instead of multiplying by 1 (drops one F*E mul per interaction per row, hoisted out of the row loop on the aux path). - Fixed-zero bus elements (the ~235 constant(0) used for bus-width padding) contribute nothing: skip the F*E multiply + accumulate entirely. Variable elements that happen to be zero on a row also benefit. Value-identical (field addition is exactly associative): stark lib 128/128 (default + parallel), prover bus/logup tests pass, clippy clean. Net effect on prove time is what we want to measure on the 32-core bench.

diegokingston · 2026-06-22T15:39:26Z

/bench 5

github-actions · 2026-06-22T15:41:39Z

Benchmark — ethrex 20 transfers (median of 3)

_{Table parallelism: auto (cores / 3)}

Metric	main	PR	Δ
Peak heap	80284 MB	80139 MB	-145 MB (-0.2%) ⚪
Prove time	50.346s	49.795s	-0.551s (-1.1%) ⚪

✅ No significant change.

✅ Low variance (time: 0.1%, heap: 1.7%)

_{Commit: e6524ef · Baseline: cached · Runner: self-hosted bench}

MauroToscano · 2026-06-25T14:35:19Z

/bench

MauroToscano · 2026-06-25T17:44:03Z

I have been doing a better statistic analysis for this kind of smaller opts, sharing the results, it's a small speedup but it's there. I will make a follow up to update our measurement methods

- compute_fingerprint_from_step: drop the vestigial *α^0 from the doc formula so it mirrors the code (and matches docs/cryptography/lookup.md and spec/logup.typ). - accumulate_fingerprint{,_from_step}: the zero-skip also covers variable elements that are zero on a row, not just the constant(0) padding — reword the inline comments to say so.

…agnostics Confirmed findings from the multi-model review: - critical/high: SHA-aware binary cache — rebuild when cli_{A,B}.sha don't match the requested SHAs (was existence-only, so a persistent /tmp on the self-hosted runner could silently benchmark a previous PR's binaries). - high: fork-PR head resolution — workflow now resolves headRefOid + fetches pull/N/head and passes the SHA (origin/<branch> doesn't exist for forks). - high: clamp /bench-abba N to [2,40] in the workflow (was unbounded -> DoS). - high: build output -> per-binary log, surfaced on failure (was >/dev/null). - high: prove runs capture stderr (2>&1) so prover failures are diagnosable. - medium: add timeout-minutes: 120 so a hang can't strand the bench runner. - medium: louder warning on git fetch failure. - low: REF_A is now required (dropped the hardcoded PR #696 default). - low: fail fast if python3 is missing (before the ~30-min build). Deliberately kept: shared cargo target across the two worktree builds (incremental 2nd build; cargo recompiles on source change, REBUILD=1 covers dep changes).

…t-constants # Conflicts: # crypto/stark/src/lookup.rs

diegokingston marked this pull request as ready for review June 22, 2026 19:54

diegokingston and others added 2 commits June 24, 2026 12:18

Merge branch 'main' into perf/logup-fingerprint-constants

469425f

Merge branch 'main' into perf/logup-fingerprint-constants

e6524ef

This was referenced Jun 25, 2026

ci(bench): two-tier benchmarking — cheap-tier knobs + on-demand /bench-abba tiebreaker #710

Closed

ci(bench): two-tier benchmarking — cheap-tier knobs + on-demand /bench-abba tiebreaker #712

Merged

Merge remote-tracking branch 'origin/main' into perf/logup-fingerprin…

c2a3c82

…t-constants # Conflicts: # crypto/stark/src/lookup.rs

MauroToscano enabled auto-merge June 26, 2026 19:36

MauroToscano approved these changes Jun 26, 2026

View reviewed changes

MauroToscano added this pull request to the merge queue Jun 26, 2026

Merged via the queue into main with commit be5c4c2 Jun 26, 2026
12 checks passed

MauroToscano deleted the perf/logup-fingerprint-constants branch June 26, 2026 19:56

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

perf(stark): skip fixed 0/1 muls in LogUp fingerprint accumulation#696

perf(stark): skip fixed 0/1 muls in LogUp fingerprint accumulation#696
MauroToscano merged 5 commits into
mainfrom
perf/logup-fingerprint-constants

diegokingston commented Jun 22, 2026

Uh oh!

diegokingston commented Jun 22, 2026

Uh oh!

github-actions Bot commented Jun 22, 2026 •

edited

Loading

Uh oh!

MauroToscano commented Jun 25, 2026

Uh oh!

MauroToscano commented Jun 25, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

diegokingston commented Jun 22, 2026

Uh oh!

diegokingston commented Jun 22, 2026

Uh oh!

github-actions Bot commented Jun 22, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Benchmark — ethrex 20 transfers (median of 3)

Uh oh!

MauroToscano commented Jun 25, 2026

Uh oh!

MauroToscano commented Jun 25, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

github-actions Bot commented Jun 22, 2026 •

edited

Loading