fix: bound QuestionHistory per-entry known-answer payload by bdraco · Pull Request #1755 · python-zeroconf/python-zeroconf

bdraco · 2026-05-20T14:49:38Z

Summary

QuestionHistory._history[question] = (now, known_answers) stores the incoming known-answers set by reference. The dict's entry count is capped at _MAX_QUESTION_HISTORY_ENTRIES = 10000 (PR #1733), but each entry's set was unbounded. QueryHandler.async_response builds the set from the union of every TC-deferred fragment's answers; up to _MAX_DEFERRED_PER_ADDR = 16 packets x ~750 records each, so a LAN peer sustaining TC-fragmented queries with large known-answer lists can pin hundreds of MB across the _MAX_QUESTION_HISTORY_ENTRIES dimension. The records never enter the DNS cache, so _MAX_CACHE_RECORDS does not bound this path either.

Fixes #1754.

Details

_MAX_KNOWN_ANSWERS_PER_HISTORY_ENTRY = 256 (src/zeroconf/const.py); well above any RFC-realistic single-question known-answer list; cdef'd in _history.pxd so the bound check is a direct C integer compare under Cython.
QuestionHistory.add_question_at_time (src/zeroconf/_history.py) drops the insert when len(known_answers) > _MAX_KNOWN_ANSWERS_PER_HISTORY_ENTRY. Any pre-existing smaller entry for the same question key is left untouched, so legitimate suppression continues to work even while the attacker hammers the same question with oversized payloads.
Why drop instead of truncate: suppresses() returns not previous_known_answers - known_answers, i.e., it suppresses when the stored set is a subset of the incoming known-answers. Truncating the stored set to a smaller subset would make the subset relation hold more easily, which means over-suppression — we would drop responses that the legitimate querier was entitled to. The conservative direction under uncertainty is to forgo suppression for the one oversized query (cost: one extra response), not to retain a partial snapshot (cost: silent suppression of legitimate follow-ups).

Test plan

New tests in tests/test_history.py:
- test_question_history_oversized_known_answers_dropped: a set of _MAX_KNOWN_ANSWERS_PER_HISTORY_ENTRY + 1 known-answers does not appear in _history.
- test_question_history_oversized_preserves_existing_entry: a pre-existing 2-record entry survives an oversized follow-up call for the same question; the stored set object is unchanged.
- test_question_history_at_cap_known_answers_is_stored: a set of exactly _MAX_KNOWN_ANSWERS_PER_HISTORY_ENTRY records is retained (boundary check).
REQUIRE_CYTHON=1 poetry install --only=main,dev && poetry run pytest tests/ --no-cov --timeout=60: 384 passed, 3 skipped.
SKIP_CYTHON=1 poetry run pytest tests/ --no-cov --timeout=60: 384 passed, 3 skipped.
poetry run pre-commit run on touched files: ruff, ruff format, mypy, codespell, flake8, cython-lint all green.
cython -a confirms the bound compare lands as __pyx_t_8 = (__pyx_t_7 > __pyx_v_8zeroconf_8_history__MAX_KNOWN_ANSWERS_PER_HISTORY_ENTRY) (pure C int compare against the cdef'd unsigned int).

`QuestionHistory._history[question] = (now, known_answers)` stores the incoming known-answers set by reference. The dict's entry count is capped at `_MAX_QUESTION_HISTORY_ENTRIES = 10000`, but each entry's set was unbounded. `QueryHandler.async_response` builds the set from the union of every TC-deferred fragment's answers — up to `_MAX_DEFERRED_PER_ADDR = 16` packets x ~750 records each — so a LAN peer sustaining TC-fragmented queries with large known-answer lists can pin hundreds of MB across the `_MAX_QUESTION_HISTORY_ENTRIES` dimension. The records never enter the DNS cache, so `_MAX_CACHE_RECORDS` does not bound this path. `add_question_at_time` now drops the insert when the known-answers set exceeds `_MAX_KNOWN_ANSWERS_PER_HISTORY_ENTRY = 256` (well above any RFC-realistic single-question known-answer list). Truncating to a subset was considered and rejected: `suppresses()` returns True when the stored set is a subset of the incoming known-answers, so a smaller stored set matches more easily and would over-suppress legitimate follow-up queries — the conservative direction is to forgo suppression for the oversized query, not to retain a partial snapshot. Any pre-existing smaller entry for the same question key is left untouched. The new constant is `cdef unsigned int` in `_history.pxd` so the bound check compiles to a direct C integer compare.

bdraco · 2026-05-20T14:50:38Z

@bluetoothbot review

codspeed-hq · 2026-05-20T14:51:49Z

Merging this PR will not alter performance

✅ 14 untouched benchmarks

_{Comparing fix/bound-question-history-payloads (da508d8) with master (b22c8ff)}

codecov · 2026-05-20T14:52:25Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 99.77%. Comparing base (b22c8ff) to head (da508d8).
⚠️ Report is 1 commits behind head on master.

Additional details and impacted files

@@           Coverage Diff           @@
##           master    #1755   +/-   ##
=======================================
  Coverage   99.77%   99.77%           
=======================================
  Files          33       33           
  Lines        3497     3500    +3     
  Branches      489      490    +1     
=======================================
+ Hits         3489     3492    +3     
  Misses          5        5           
  Partials        3        3

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.
📦 JS Bundle Analysis: Save yourself from yourself by tracking and limiting bundle sizes in JS merges.

Copilot

Pull request overview

This pull request closes a memory-retention DoS vector in duplicate-question suppression by adding a per-entry size bound on the stored known-answer set in QuestionHistory. It prevents attacker-supplied, TC-fragment-amplified known-answer payloads from being retained in _history while preserving existing suppression behavior for normal-sized payloads.

Changes:

Introduces _MAX_KNOWN_ANSWERS_PER_HISTORY_ENTRY = 256 as a hard per-entry bound for retained known-answers.
Updates QuestionHistory.add_question_at_time() to drop (not truncate) oversized known-answer payloads to avoid over-suppression.
Adds targeted unit tests covering oversized-drop behavior, preservation of existing entries, and the boundary-at-cap case.

Reviewed changes

Copilot reviewed 4 out of 4 changed files in this pull request and generated no comments.

File	Description
`src/zeroconf/const.py`	Adds the per-entry known-answer cap constant with rationale.
`src/zeroconf/_history.py`	Enforces the per-entry cap by ignoring oversized known-answer inserts.
`src/zeroconf/_history.pxd`	Declares the new constant for the Cython-optimized hot path.
`tests/test_history.py`	Adds coverage for oversized known-answer handling and boundary behavior.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

bdraco requested a review from Copilot May 20, 2026 14:50

Copilot started reviewing on behalf of bdraco May 20, 2026 14:52 View session

bdraco temporarily deployed to release May 20, 2026 14:54 — with GitHub Actions Inactive

Copilot AI reviewed May 20, 2026

View reviewed changes

bdraco marked this pull request as ready for review May 20, 2026 15:00

bdraco merged commit 4ff6540 into master May 20, 2026
40 checks passed

bdraco deleted the fix/bound-question-history-payloads branch May 20, 2026 15:01

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: bound QuestionHistory per-entry known-answer payload#1755

fix: bound QuestionHistory per-entry known-answer payload#1755
bdraco merged 1 commit into
masterfrom
fix/bound-question-history-payloads

bdraco commented May 20, 2026

Uh oh!

bdraco commented May 20, 2026

Uh oh!

codspeed-hq Bot commented May 20, 2026

Uh oh!

codecov Bot commented May 20, 2026 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

bdraco commented May 20, 2026

Summary

Details

Test plan

Uh oh!

bdraco commented May 20, 2026

Uh oh!

codspeed-hq Bot commented May 20, 2026

Merging this PR will not alter performance

Uh oh!

codecov Bot commented May 20, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

codecov Bot commented May 20, 2026 •

edited

Loading