Final comparison: all 6 proposals combined by VoX · Pull Request #8 · VoX/Bob-Rust-Java

VoX · 2026-04-04T23:23:30Z

Summary

Adds a comprehensive comparison test (FinalComparisonTest.java) that benchmarks all features OFF vs all features ON across 6 test images (3 synthetic + 3 real photos)
Runs each configuration 5 times for statistical rigor with paired t-test analysis
Generates before/after images, target references, and diff heatmaps in test-results/final-comparison/
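As a rough illustration of the paired t-test mentioned above, the sketch below computes the t statistic from per-run energy pairs and compares it with the one-sided critical value for 5 runs (4 degrees of freedom, alpha = 0.05). The class name, method name, the one-sided test, and the sample energies are assumptions made for illustration; they are not taken from FinalComparisonTest.java.

```java
/** Hypothetical helper; not part of FinalComparisonTest. */
final class PairedTTestSketch {

    /** One-sided critical value of Student's t for df = 4 at alpha = 0.05. */
    static final double T_CRIT_DF4_ONE_SIDED_05 = 2.132;

    /**
     * Paired t statistic for the differences d[i] = before[i] - after[i].
     * A positive value means the "after" energies are lower on average.
     * If all differences are identical the standard deviation is zero and
     * the result is infinite or NaN.
     */
    static double tStatistic(double[] before, double[] after) {
        if (before.length != after.length || before.length < 2) {
            throw new IllegalArgumentException("need at least two paired runs");
        }
        int n = before.length;
        double[] d = new double[n];
        double mean = 0.0;
        for (int i = 0; i < n; i++) {
            d[i] = before[i] - after[i];
            mean += d[i];
        }
        mean /= n;

        // Sample variance of the differences (n - 1 in the denominator).
        double sumSq = 0.0;
        for (double v : d) {
            sumSq += (v - mean) * (v - mean);
        }
        double sd = Math.sqrt(sumSq / (n - 1));

        return mean / (sd / Math.sqrt(n));
    }

    public static void main(String[] args) {
        // Made-up energies for five paired runs, for illustration only.
        double[] before = {0.0621, 0.0619, 0.0622, 0.0620, 0.0618};
        double[] after  = {0.0609, 0.0607, 0.0612, 0.0606, 0.0608};

        double t = tStatistic(before, after);
        System.out.printf("t = %.3f, significant at 0.05 (one-sided): %b%n",
                t, t > T_CRIT_DF4_ONE_SIDED_05);
    }
}
```

Pairing the runs only makes sense if run i of the OFF configuration and run i of the ON configuration share something, such as the same random seed; otherwise an unpaired test would be the more natural choice.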

Results

Proposals 1-5 (SA + error-guided + adaptive size + batch-parallel + TSP)

Image	Energy before	Energy after	Improvement	p-value	Significant?
photo_detail	0.250774	0.250016	+0.30%	0.1051	No
nature	0.058198	0.058940	-1.27%	1.0000	No
edges	0.244876	0.245867	-0.40%	1.0000	No
river	0.090141	0.089522	+0.69%	0.0410	Yes
portrait	0.062045	0.060768	+2.06%	0.0429	Yes
landscape	0.063394	0.063036	+0.56%	0.0695	No

Aggregate: +0.17% reduction in summed energy across all six images (real photos benefit most)
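The percentages above can be reproduced from the energy columns. The sketch below recomputes the per-image improvement as (before - after) / before and shows that the reported aggregate matches the relative reduction in summed energy, not the mean of the per-image percentages. The class and method names are hypothetical; only the energy values come from the table.

```java
/** Hypothetical helper that re-derives the percentages in the table. */
final class AggregateImprovementSketch {

    // Energies copied from the table, in row order:
    // photo_detail, nature, edges, river, portrait, landscape.
    static final double[] BEFORE =
            {0.250774, 0.058198, 0.244876, 0.090141, 0.062045, 0.063394};
    static final double[] AFTER =
            {0.250016, 0.058940, 0.245867, 0.089522, 0.060768, 0.063036};

    /** Relative energy reduction in percent; positive means "after" is lower. */
    static double improvementPercent(double before, double after) {
        return (before - after) / before * 100.0;
    }

    /** Improvement computed on the summed energies of all images. */
    static double aggregatePercent(double[] before, double[] after) {
        double sumBefore = 0.0;
        double sumAfter = 0.0;
        for (int i = 0; i < before.length; i++) {
            sumBefore += before[i];
            sumAfter += after[i];
        }
        return improvementPercent(sumBefore, sumAfter);
    }

    /** Unweighted mean of the per-image percentages, for comparison. */
    static double meanOfPercents(double[] before, double[] after) {
        double total = 0.0;
        for (int i = 0; i < before.length; i++) {
            total += improvementPercent(before[i], after[i]);
        }
        return total / before.length;
    }

    public static void main(String[] args) {
        // Rounds to the reported +0.17%.
        System.out.printf("aggregate on summed energy: %+.2f%%%n",
                aggregatePercent(BEFORE, AFTER));
        // Noticeably larger, because low-energy images weigh equally here.
        System.out.printf("mean of per-image values:   %+.2f%%%n",
                meanOfPercents(BEFORE, AFTER));
    }
}
```

Summing energies lets the two high-energy synthetic images (photo_detail and edges) dominate the aggregate, which is worth keeping in mind when reading the single headline number.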

All 6 Proposals (with Progressive Resolution)

Progressive resolution gives up about 5% in energy score in exchange for faster generation. This is expected, because it generates 40% of shapes at reduced resolution for speed.

Key Findings

Real photographs show the clearest improvement from proposals 1-5 (portrait: +2.06%, river: +0.69%)
Synthetic test images show mixed results; the baseline was already near-optimal for simple patterns
Progressive resolution adds speed at a quality cost of about 5% in energy score; appropriate for interactive use
Batch-parallel energy (Proposal 4) and TSP optimization (Proposal 5) primarily improve performance/drawing efficiency rather than energy scores

See test-results/final-comparison/COMPARISON.md for full statistical analysis with confidence intervals.

Test plan

./gradlew clean build -x test passes
FinalComparisonTest generates all images and statistics
All comparison images committed to repo

🤖 Generated with Claude Code

Generates before/after images for 6 test images (3 synthetic + 3 real photos) with 5 runs each for statistical rigor. Includes paired t-test results, diff heatmaps, and a summary COMPARISON.md documenting energy scores across all configurations (all-off, proposals 1-5, all 6 proposals).

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Copilot

Pull request overview

Adds a new end-to-end “final comparison” benchmark/test that runs the generator with all features OFF vs all features ON (including progressive resolution), repeats runs for basic statistical analysis, and writes a Markdown report plus before/after/target/diff images under test-results/final-comparison/.
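For readers unfamiliar with diff heatmaps, here is a minimal sketch of how one could be produced from a target and a rendered image with java.awt.image.BufferedImage. The error metric (mean absolute channel difference) and the blue-to-red colour ramp are assumptions chosen for illustration; the PR does not show how FinalComparisonTest builds its heatmaps, and the class name is hypothetical.

```java
import java.awt.image.BufferedImage;

/** Hypothetical helper; the actual test may compute its heatmaps differently. */
final class DiffHeatmapSketch {

    /**
     * Returns an image where each pixel encodes the mean absolute RGB
     * difference between target and render: blue for no error, red for
     * maximum error.
     */
    static BufferedImage heatmap(BufferedImage target, BufferedImage render) {
        int w = target.getWidth();
        int h = target.getHeight();
        if (render.getWidth() != w || render.getHeight() != h) {
            throw new IllegalArgumentException("images must have the same size");
        }
        BufferedImage out = new BufferedImage(w, h, BufferedImage.TYPE_INT_RGB);
        for (int y = 0; y < h; y++) {
            for (int x = 0; x < w; x++) {
                int a = target.getRGB(x, y);
                int b = render.getRGB(x, y);
                int dr = Math.abs(((a >> 16) & 0xFF) - ((b >> 16) & 0xFF));
                int dg = Math.abs(((a >> 8) & 0xFF) - ((b >> 8) & 0xFF));
                int db = Math.abs((a & 0xFF) - (b & 0xFF));
                int err = (dr + dg + db) / 3;          // 0..255
                int rgb = (err << 16) | (255 - err);   // red rises, blue falls
                out.setRGB(x, y, rgb);
            }
        }
        return out;
    }

    public static void main(String[] args) {
        // Tiny synthetic example: one white pixel in the target, black render.
        BufferedImage target = new BufferedImage(2, 2, BufferedImage.TYPE_INT_RGB);
        BufferedImage render = new BufferedImage(2, 2, BufferedImage.TYPE_INT_RGB);
        target.setRGB(0, 0, 0xFFFFFF);

        BufferedImage diff = heatmap(target, render);
        System.out.printf("pixel (0,0): %06X%n", diff.getRGB(0, 0) & 0xFFFFFF);
        System.out.printf("pixel (1,1): %06X%n", diff.getRGB(1, 1) & 0xFFFFFF);
    }
}
```

The resulting image could be written to disk with javax.imageio.ImageIO.write(diff, "png", file) if a file such as edges_diff.png is wanted.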

Changes:

Add FinalComparisonTest to generate images, compute summary stats, and emit COMPARISON.md.
Commit the resulting comparison artifacts (targets, before/after renders, diff heatmaps, and photo copies).
Add a gradlew wrapper script to the repo.

Reviewed changes

Copilot reviewed 2 out of 30 changed files in this pull request and generated 5 comments.


File	Description
test-results/final-comparison/COMPARISON.md	Generated report summarizing per-image scores, significance, and linking rendered artifacts
test-results/final-comparison/edges_after.png	Generated “all ON” output image for edges case
test-results/final-comparison/edges_before.png	Generated “all OFF” output image for edges case
test-results/final-comparison/edges_diff.png	Generated diff heatmap for edges case
test-results/final-comparison/edges_target.png	Generated target/reference image for edges case
test-results/final-comparison/landscape_after.png	Generated “all ON” output image for landscape case
test-results/final-comparison/landscape_before.png	Generated “all OFF” output image for landscape case
test-results/final-comparison/landscape_diff.png	Generated diff heatmap for landscape case
test-results/final-comparison/landscape_target.png	Generated target/reference image for landscape case
test-results/final-comparison/nature_after.png	Generated “all ON” output image for nature case
test-results/final-comparison/nature_before.png	Generated “all OFF” output image for nature case
test-results/final-comparison/nature_diff.png	Generated diff heatmap for nature case
test-results/final-comparison/nature_target.png	Generated target/reference image for nature case
test-results/final-comparison/photo_detail_after.png	Generated “all ON” output image for photo_detail case
test-results/final-comparison/photo_detail_before.png	Generated “all OFF” output image for photo_detail case
test-results/final-comparison/photo_detail_diff.png	Generated diff heatmap for photo_detail case
test-results/final-comparison/photo_detail_target.png	Generated target/reference image for photo_detail case
test-results/final-comparison/portrait_after.png	Generated “all ON” output image for portrait case
test-results/final-comparison/portrait_before.png	Generated “all OFF” output image for portrait case
test-results/final-comparison/portrait_diff.png	Generated diff heatmap for portrait case
test-results/final-comparison/portrait_target.png	Generated target/reference image for portrait case
test-results/final-comparison/river_after.png	Generated “all ON” output image for river case
test-results/final-comparison/river_before.png	Generated “all OFF” output image for river case
test-results/final-comparison/river_diff.png	Generated diff heatmap for river case
test-results/final-comparison/river_target.png	Generated target/reference image for river case
test-results/final-comparison/photos/landscape.png	Committed photo input copy used by final comparison
test-results/final-comparison/photos/portrait.png	Committed photo input copy used by final comparison
test-results/final-comparison/photos/river.png	Committed photo input copy used by final comparison
src/test/java/com/bobrust/generator/FinalComparisonTest.java	New benchmark-style test that runs comparisons, computes stats, and writes artifacts
gradlew	Adds Gradle wrapper start script


Copilot · 2026-04-04T23:28:43Z