Conversation

skshetry
Collaborator

We set per-ref rwlocks, but they do not prevent conflicts when refs are stored in packed-refs, because multiple processes can modify that one shared file simultaneously.

In #10673, this triggers `FileLocked` errors.

Wrap ref removal in a retry loop (10 attempts, 0.1s delay) to mitigate these race conditions.
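
For illustration, the retry loop could look roughly like the sketch below. This is not the code from the PR: `remove_ref_with_retry`, the `remove_ref` callable, and the local `FileLocked` class are hypothetical stand-ins, and only the 10 attempts and 0.1s delay come from the description above.

```python
import time


class FileLocked(Exception):
    """Hypothetical stand-in for the lock error raised by the Git backend."""


def remove_ref_with_retry(remove_ref, name, attempts=10, delay=0.1, sleep=time.sleep):
    """Call remove_ref(name), retrying while another process holds the lock.

    remove_ref is any callable that deletes a ref and raises FileLocked
    when the underlying file (e.g. packed-refs) is locked.
    """
    for attempt in range(1, attempts + 1):
        try:
            return remove_ref(name)
        except FileLocked:
            # Out of attempts: surface the original error to the caller.
            if attempt == attempts:
                raise
            # Give the other process time to release the lock.
            sleep(delay)
```

With these defaults a removal waits at most about 0.9s (nine sleeps of 0.1s) before the error propagates, so a lock that is never released still fails rather than hanging.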

@github-project-automation github-project-automation bot moved this to Backlog in DVC Aug 14, 2025
@skshetry skshetry moved this from Backlog to Review In Progress in DVC Aug 14, 2025
@skshetry skshetry force-pushed the 10673-retry-remove-ref branch from d1b2573 to 697472c Compare August 14, 2025 13:13
@skshetry skshetry self-assigned this Aug 15, 2025
@skshetry skshetry added A: experiments Related to dvc exp bugfix fixes bug labels Aug 15, 2025
@skshetry skshetry force-pushed the 10673-retry-remove-ref branch from 697472c to e8424fb Compare October 12, 2025 03:45
@skshetry skshetry merged commit 8912ca7 into main Oct 12, 2025
40 checks passed
@skshetry skshetry deleted the 10673-retry-remove-ref branch October 12, 2025 04:20
@github-project-automation github-project-automation bot moved this from Review In Progress to Done in DVC Oct 12, 2025
@skshetry skshetry linked an issue Oct 12, 2025 that may be closed by this pull request
Labels

A: experiments Related to dvc exp bugfix fixes bug

Projects

Status: Done

Development

Successfully merging this pull request may close these issues.

exp-workers failing without logs

1 participant