database: treat database as read-only after creation and use shared locks by fischeti · Pull Request #315 · pulp-platform/bender

fischeti · 2026-06-10T11:47:24Z

In #307 we introduced database locks and separated the database from the checkout in order to share the database as a cache across projects. This is also relevant in the context of CI, since the git databases are only fetched once and can then be reused. However, the exclusive locks on the databases essentially prevent any kind of parallelization, and every invocation of bender becomes serialized across CI jobs.

This PR aims to relax the locks to use exclusive locks only when writing to the database, and use shared locks when reading from it.

The flow of fetching and checking out a git repository was slightly adapted to reduce the number of write operations on a shared database:

git init and git fetch [<rev>] are write operations and still acquire an exclusive lock
The creation of a temporary tag tmp-<hash> in the database, followed by a checkout with git clone --branch tmp-<hash>, has been replaced with git clone --shared --no-checkout && git checkout <rev>
The database is configured to disable git gc, since revisions might no longer be tracked by refs, i.e. the previous tmp-<hash> tags. This is maybe overly cautious.
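
The adapted flow can be sketched as the sequence of git invocations below. This is only an illustration, not bender's actual code: the function names, paths, and the use of Python are hypothetical, `git init --bare` assumes the database is a bare repository, and `git config gc.auto 0` is one assumed way of disabling automatic garbage collection. Only the `git fetch`, `git clone --shared --no-checkout`, and `git checkout <rev>` steps are taken from the description above.

```python
from typing import List, Optional

Cmd = List[str]


def database_write_cmds(db: str, url: str, rev: Optional[str] = None) -> List[Cmd]:
    """Commands that modify the shared database (would run under an exclusive lock)."""
    fetch = ["git", "-C", db, "fetch", url]
    if rev is not None:
        # Optionally fetch one specific revision, as in `git fetch [<rev>]`.
        fetch.append(rev)
    return [
        ["git", "init", "--bare", db],                # assumption: bare database
        ["git", "-C", db, "config", "gc.auto", "0"],  # assumption: how gc is disabled
        fetch,
    ]


def checkout_cmds(db: str, checkout: str, rev: str) -> List[Cmd]:
    """Commands that only read the database (a shared lock would suffice)."""
    return [
        # The clone borrows objects from the database instead of copying them.
        ["git", "clone", "--shared", "--no-checkout", db, checkout],
        # Check out the revision directly; no temporary tag is written to the database.
        ["git", "-C", checkout, "checkout", rev],
    ]
```

Each returned list could be passed to `subprocess.run(cmd, check=True)`. Keeping the write commands and the read commands in separate functions mirrors the split between exclusive and shared locking.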

Furthermore, file locking is now more fine-grained. Instead of locking the whole git_database call exclusively, only the part that requires writes (git init and git fetch) acquires an exclusive lock (if the database is not ready yet).
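
The locking pattern described here can be sketched roughly as follows. It is a minimal, Unix-only illustration built on Python's `fcntl.flock`, not the code in this PR: `locked`, `ensure_database`, `is_ready`, and `populate` are hypothetical names, and the lock file path is an assumption.

```python
import fcntl
import os
from contextlib import contextmanager


@contextmanager
def locked(lock_path, exclusive):
    """Hold a blocking advisory lock on lock_path for the duration of the block."""
    fd = os.open(lock_path, os.O_RDWR | os.O_CREAT, 0o644)
    try:
        # LOCK_SH can be held by many processes at once; LOCK_EX by only one.
        fcntl.flock(fd, fcntl.LOCK_EX if exclusive else fcntl.LOCK_SH)
        yield
    finally:
        os.close(fd)  # closing the descriptor releases the lock


def ensure_database(lock_path, is_ready, populate):
    """Run populate() under an exclusive lock only if the database is not ready.

    Returns True if this call performed the write, False otherwise.
    """
    # Fast path: readers only need a shared lock to check readiness.
    with locked(lock_path, exclusive=False):
        if is_ready():
            return False
    # Slow path: take the exclusive lock for the write operations.
    with locked(lock_path, exclusive=True):
        # Re-check, since another process may have populated the database
        # between releasing the shared lock and acquiring the exclusive one.
        if is_ready():
            return False
        populate()  # stands in for git init / git fetch
        return True
```

Because `flock` blocks until the lock is available, a second process waits for the writer to finish instead of failing, and then takes the fast path.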

fischeti · 2026-06-16T18:58:10Z

Actually, I am not sure anymore if the locks are needed. I have the impression that git should be able to handle concurrent access to a shared git database itself, i.e. objects are immutable and do not need to be locked, while other things like refs seem to have fine-grained lock files:

https://stackoverflow.com/questions/19962024/locking-strategy-of-git-to-achieve-concurrency
https://stackoverflow.com/questions/750765/concurrency-in-a-git-repo-on-a-network-shared-folder#answer-751026

Edit: I guess the git init cannot be locked properly by git and could cause problems, and apparently git also errors out when failing to acquire locks, so it could make sense after all 🤓

micprog · 2026-06-16T19:21:16Z

> Actually, I am not sure anymore if the locks are needed. I have the impression that git should be able to handle concurrent access to a shared git database itself, i.e. objects are immutable and do not need to be locked, while other things like refs seem to have fine-grained lock files:
>
> https://stackoverflow.com/questions/19962024/locking-strategy-of-git-to-achieve-concurrency
> https://stackoverflow.com/questions/750765/concurrency-in-a-git-repo-on-a-network-shared-folder#answer-751026

I think we should preserve locks, and I like your solution to speed things up (still need to review the details). The main issue I'm attempting to solve is with multiple repos fetching the same bare ref. Concurrent fetches racing on the same ref would produce fatal: cannot lock ref ...: File exists from one of them. Git handles it cleanly but bender has no retry layer, so we'd just propagate that as a CI failure. Rare, but the failure mode is "this CI job randomly fails once a week". Keeping locks fixes this.

fischeti · 2026-06-16T19:24:41Z

Yes, you are right🤓 I just realized that git locks are not blocking but just error out
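
The difference between a non-blocking lock file and a blocking lock can be illustrated with a simplified model of a git-style `.lock` file. This is a sketch and not git's implementation: the function names are hypothetical, and the only behaviour modelled is that the lock is taken by creating a file exclusively, so a second attempt fails immediately instead of waiting.

```python
import os


def try_lockfile(path):
    """Try to take the lock by creating `path` exclusively.

    Returns a file descriptor on success, or None if the lock file
    already exists. The call never waits for the holder to finish.
    """
    try:
        return os.open(path, os.O_WRONLY | os.O_CREAT | os.O_EXCL, 0o644)
    except FileExistsError:
        # Comparable to "cannot lock ref ...: File exists": the caller
        # has to retry or give up, nothing blocks.
        return None


def release_lockfile(path, fd):
    """Release the lock by closing and removing the lock file."""
    os.close(fd)
    os.unlink(path)
```

Without a retry loop around `try_lockfile`, the loser of a race simply reports an error, which is the failure mode discussed above. A blocking file lock held around the fetch avoids the race in the first place.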

fischeti force-pushed the shared-locks branch 2 times, most recently from a6ac0ee to 9a16348 Compare June 16, 2026 08:31

fischeti marked this pull request as ready for review June 16, 2026 08:46

fischeti force-pushed the shared-locks branch from 9a16348 to 95ce186 Compare June 16, 2026 09:03

fischeti added 3 commits June 16, 2026 21:29

database: treat database as read-only after creation and use shared locks

71db9e9

database: only acquire exclusive lock for write operations

8073a9a

sess: only report checkout stage completion once

69a0252

fischeti force-pushed the shared-locks branch from 95ce186 to 69a0252 Compare June 16, 2026 19:29

fischeti wants to merge 3 commits into master from shared-locks


2 participants
