Don't re-download rpms if they exist on disk
NOTE: Pulp addressed this issue upstream with documentation updates. Should similar documentation updates be made for Katello? The initial description is below; refer to Bugzilla for full details.
Description of problem:
We (Katello) have received many requests from users who try out Katello and redeploy it multiple times: after the DB is reset, Pulp appears to re-download the rpms when syncing the same repository. Ideally, Pulp would check whether each file is already on the filesystem before downloading it.
Version-Release number of selected component (if applicable):
Steps to Reproduce:
1. Sync a large repo
2. Watch it take a while
3. Clear your mongo db
4. Sync the same repo
5. Watch it take the same amount of time (all the files are already on the file system, so it should take much less time)
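
The behavior requested in the description could look roughly like the sketch below. This is not Pulp's code or API: the function names, the `download` callable, and the use of SHA-256 are assumptions for illustration only. It assumes the expected size and checksum of each rpm are available from the repository metadata.

```python
import hashlib
import os


def file_matches(path, expected_size, expected_sha256):
    """Return True if path exists and matches the expected size and SHA-256."""
    if not os.path.isfile(path):
        return False
    # Cheap check first: a size mismatch means the file must be re-downloaded.
    if os.path.getsize(path) != expected_size:
        return False
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        # Read in 1 MiB chunks so large rpms are not loaded into memory at once.
        for chunk in iter(lambda: handle.read(1024 * 1024), b""):
            digest.update(chunk)
    return digest.hexdigest() == expected_sha256


def sync_package(path, expected_size, expected_sha256, download):
    """Call download(path) only when the file on disk is missing or stale.

    `download` is a hypothetical callable supplied by the caller.
    Returns "skipped" or "downloaded".
    """
    if file_matches(path, expected_size, expected_sha256):
        return "skipped"
    download(path)
    return "downloaded"
```

With a check like this, step 4 above would still read and verify each file on disk, but it would skip the network transfer for rpms that are already present and intact.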