mirror of
https://github.com/hickory-dns/hickory-dns.git
synced 2026-04-25 11:15:54 +03:00
[GH-ISSUE #3151] CI test flakiness: resolver::tests::test_sec_lookup and resolver::tests::test_sec_lookup_fails #1144
Labels
No labels
blocked
breaking-change
bug
bug:critical
bug:tests
cleanup
compliance
compliance
compliance
crate:all
crate:client
crate:native-tls
crate:proto
crate:recursor
crate:resolver
crate:resolver
crate:rustls
crate:server
crate:util
dependencies
docs
duplicate
easy
easy
enhance
enhance
enhance
feature:dns-over-https
feature:dns-over-quic
feature:dns-over-tls
feature:dnsssec
feature:global_lb
feature:mdns
feature:tsig
features:edns
has workaround
ops
perf
platform:WASM
platform:android
platform:fuchsia
platform:linux
platform:macos
platform:windows
pull-request
question
test
tools
tools
trust
unclear
wontfix
No milestone
No project
No assignees
1 participant
Notifications
Due date
No due date set.
Dependencies
No dependencies set.
Reference
starred/hickory-dns#1144
Loading…
Add table
Add a link
Reference in a new issue
No description provided.
Delete branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Originally created by @divergentdave on GitHub (Jul 23, 2025).
Original GitHub issue: https://github.com/hickory-dns/hickory-dns/issues/3151
The
resolver::tests::test_sec_lookupandresolver::tests::test_sec_lookup_failstests have been failing frequently in CI of late. It looks like the cause is a timeout error while fetching DNSSEC records causing validation failures.@divergentdave commented on GitHub (Jul 23, 2025):
I wanted to test whether recent changes have affected retry behavior, worsening the impact of dropped packets on this test. I ran the following commands to set up simulated packet loss in a network namespace. (loosely assembled from https://ilmanzo.github.io/post/faulty_network_simulation/ and https://josephmuia.ca/2018-05-16-net-namespaces-veth-nat/)
I ran tests with the following commands:
sudo ip netns exec testing sudo -u $USER bash -c "for i in {1..25}; do $(which cargo) test -p hickory-resolver --features dnssec-aws-lc-rs --lib -- resolver::tests::test_sec_lookup; done"sudo ip netns exec testing sudo -u $USER bash -c "RUST_LOG=debug $(which cargo) test -p hickory-resolver --features dnssec-aws-lc-rs --lib -- resolver::tests::test_sec_lookup --exact --nocapture"I did not get any test failures, but sometimes the tests took five, ten, or twenty seconds to run, instead of the usual half second. This suggests that our retry logic is effective in this test, but some of our Mac/Windows CI runs are exhausting retries due to even worse network conditions than the 10% packet loss I chose. (or rate limiting by Google Public DNS) Logs from the second test command show the internal workings of the resolver as it hits timeouts and resends requests. Note that some DNSKEY responses were truncated, so fallback to TCP had to be used as well.
@divergentdave commented on GitHub (Jul 31, 2025):
#3157 ignores these tests, and I don't think there are any further underlying issues to address. so I'm going to close this issue.