chore(execute): add gas validation for benchmark tests in execute mode #2219

CPerezz · 2025-09-29T09:01:26Z

🗒️ Description

Previously, execute mode was not validating that transactions consumed the expected amount of gas when expected_benchmark_gas_used was set. This could cause benchmark tests to incorrectly pass even when consuming significantly less gas than expected (e.g., due to missing factory contracts).

This feature is needed by benchmark tests like the ones in #2186 in order to make sure that the benchmarks are indeed consuming all gas available or causing a failure otherwise when the flag is set.

Changes:

Add expected_benchmark_gas_used and skip_gas_used_validation fields to TransactionPost
Implement gas validation logic in TransactionPost.execute() using transaction receipts
Pass gas validation parameters from StateTest and BlockchainTest to TransactionPost
Add eth_getTransactionReceipt RPC method to fetch gas used from receipts

This ensures benchmark tests fail appropriately when gas consumption doesn't match expectations, preventing false positives in performance testing.

🔗 Related Issues or PRs

Se this discussion: #2186 (comment). for more context.

✅ Checklist

All: Ran fast tox checks to avoid unnecessary CI fails, see also Code Standards and Enabling Pre-commit Checks:
```
uvx --with=tox-uv tox -e lint,typecheck,spellcheck,markdownlint
```
All: PR title adheres to the repo standard - it will be used as the squash commit message and should start type(scope):.
All: Considered adding an entry to CHANGELOG.md.
All: Considered updating the online docs in the ./docs/ directory.
All: Set appropriate labels for the changes (only maintainers can apply labels).
Tests: Ran mkdocs serve locally and verified the auto-generated docs for new tests in the Test Case Reference are correctly formatted.
Tests: For PRs implementing a missed test case, update the post-mortem document to add an entry the list.
Ported Tests: All converted JSON/YML tests from ethereum/tests or tests/static have been assigned @ported_from marker.

CPerezz · 2025-09-29T09:02:10Z

Just unsure if tests should be added to prove that this works or not. I haven't seen many tests for the core of the lib. So LMK if you want me to add them and where (as I locally tested with a python script).

LouisTsai-Csie

Thanks for the fix, it would be nice if we align the behavior between fill and execute mode, i left some suggested changes.

src/ethereum_test_execution/transaction_post.py

Previously, execute mode was not validating that transactions consumed the expected amount of gas when expected_benchmark_gas_used was set. This could cause benchmark tests to incorrectly pass even when consuming significantly less gas than expected (e.g., due to missing factory contracts). This feature is needed by benchmark tests like the ones in ethereum#2186 in order to make sure that the benchmarks are indeed consuming all gas available or causing a failure otherwise when the flag is set. Changes: - Add expected_benchmark_gas_used and skip_gas_used_validation fields to TransactionPost - Implement gas validation logic in TransactionPost.execute() using transaction receipts - Pass gas validation parameters from StateTest and BlockchainTest to TransactionPost - Add eth_getTransactionReceipt RPC method to fetch gas used from receipts This ensures benchmark tests fail appropriately when gas consumption doesn't match expectations, preventing false positives in performance testing.

CPerezz · 2025-09-29T22:49:06Z

Unsure why I needed to force-push. Anyways, I refactored the code to mimic fill behaviour. LMK if that's ok according to your comments @LouisTsai-Csie.

BTW, nice review. Your suggestions indeed were nice!

LouisTsai-Csie

I left a comment for some refactoring, inside the base_test_parametrizer_func function in execute.py file, we should configure default value for expected_benchmark_gas_used, not gas_benchmark_value.

Based on this, we do not need to pass the gas_benchmark_value from BlockchainTest and StateTest to the TransactionPost object, as it is already configured during test env configuration phase.

Please let me know if it is not clear.

src/pytest_plugins/execute/execute.py

src/ethereum_test_specs/state.py

src/ethereum_test_specs/blockchain.py

src/ethereum_test_specs/base.py

src/ethereum_test_execution/transaction_post.py

Addresses review comment to make execute mode gas validation cleaner: - Set expected_benchmark_gas_used to gas_benchmark_value as default in execute parametrizer - Remove gas_benchmark_value parameter from TransactionPost, StateTest, BlockchainTest, and BaseTest - Simplify gas validation logic in TransactionPost This ensures consistent gas validation behavior between fill and execute modes with a cleaner implementation that sets defaults at the parametrizer level.

CPerezz · 2025-09-30T08:25:02Z

@LouisTsai-Csie thanks so much for the tips. I missunderstood the approach at first. I hope this iteration is closer to what you wanted!

LouisTsai-Csie

LGTM! Thanks for the fix. I've testes it locally using Anvil, and it works as expected.

CPerezz · 2025-09-30T12:04:04Z

@LouisTsai-Csie should we merge? All checks passing and your test locally succeeds.

spencer-tb

LGTM from my side! Thanks :)

CPerezz mentioned this pull request Sep 29, 2025

feat(tests): multi opcode bloatnet ext cases #2186

Merged

8 tasks

LouisTsai-Csie requested changes Sep 29, 2025

View reviewed changes

src/ethereum_test_execution/transaction_post.py Outdated Show resolved Hide resolved

CPerezz force-pushed the fix/execute-mode-gas-validation branch from 9d894e2 to 5e4fe91 Compare September 29, 2025 22:47

CPerezz requested a review from LouisTsai-Csie September 29, 2025 22:49

LouisTsai-Csie requested changes Sep 30, 2025

View reviewed changes

LouisTsai-Csie approved these changes Sep 30, 2025

View reviewed changes

spencer-tb approved these changes Oct 1, 2025

View reviewed changes

spencer-tb merged commit 0ef435a into ethereum:main Oct 1, 2025
15 checks passed

spencer-tb added type:chore Type: Chore scope:execute Scope: Changes to the execute command labels Oct 1, 2025

spencer-tb changed the title ~~fix(execute): add gas validation for benchmark tests in execute mode~~ chore(execute): add gas validation for benchmark tests in execute mode Oct 1, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

chore(execute): add gas validation for benchmark tests in execute mode #2219

chore(execute): add gas validation for benchmark tests in execute mode #2219

CPerezz commented Sep 29, 2025

Uh oh!

CPerezz commented Sep 29, 2025

Uh oh!

LouisTsai-Csie left a comment

Uh oh!

Uh oh!

CPerezz commented Sep 29, 2025

Uh oh!

LouisTsai-Csie left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

CPerezz commented Sep 30, 2025

Uh oh!

LouisTsai-Csie left a comment

Uh oh!

CPerezz commented Sep 30, 2025

Uh oh!

spencer-tb left a comment

Uh oh!

Uh oh!

Uh oh!

chore(execute): add gas validation for benchmark tests in execute mode #2219

chore(execute): add gas validation for benchmark tests in execute mode #2219

Conversation

CPerezz commented Sep 29, 2025

🗒️ Description

🔗 Related Issues or PRs

✅ Checklist

Uh oh!

CPerezz commented Sep 29, 2025

Uh oh!

LouisTsai-Csie left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

CPerezz commented Sep 29, 2025

Uh oh!

LouisTsai-Csie left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

CPerezz commented Sep 30, 2025

Uh oh!

LouisTsai-Csie left a comment

Choose a reason for hiding this comment

Uh oh!

CPerezz commented Sep 30, 2025

Uh oh!

spencer-tb left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!