Project

General

Profile

Bug #6005

Content reindexing Incredibly Inefficient

Added by Justin Sherrill almost 5 years ago. Updated 9 months ago.

Status:
Closed
Priority:
Normal
Category:
Repositories
Target version:
Difficulty:
Triaged:
Yes
Bugzilla link:
Pull request:
Team Backlog:
Fixed in Releases:
Found in Releases:

Description

Currently when running 'rake katello:reindex' we loop over each repo and index packages, errata, etc.. on each repo. While this is simpler it is incredibly inefficient especially if you many content views with many versions. The same set of packages will end up being reindex over and over and over again.

A more efficient way would be to just go through the entire $ITEM list in pulp and index that list once.

Associated revisions

Revision 2c7d6aae (diff)
Added by Justin Sherrill almost 5 years ago

Fixes #6005 - speeding up content reindexing

Instead of indexing per repo, we can index all of each content
and it will be much faster. Sample times with a content view with about
5 versions:

Old Reindex: 7m05s
New Reindex: 3m13s

the more content views and versions you have the greater the difference

Revision ede3a9b2
Added by Justin Sherrill almost 5 years ago

Merge pull request #4190 from jlsherrill/index

Fixes #6005 - speeding up content reindexing

History

#1 Updated by Eric Helms almost 5 years ago

  • Target version set to 45
  • Triaged changed from No to Yes

#2 Updated by Justin Sherrill almost 5 years ago

  • Status changed from Assigned to Closed
  • % Done changed from 0 to 100

Applied in changeset katello|commit:2c7d6aaeff5fca84347bbd96fe9f8096eae29ef7.

#3 Updated by Eric Helms almost 5 years ago

  • Bugzilla link set to https://bugzilla.redhat.com/show_bug.cgi?id=1112749

#4 Updated by Eric Helms over 4 years ago

  • Legacy Backlogs Release (now unused) set to 13

Also available in: Atom PDF