Bug #7162
closedcapsule: synchronize command never times out/silently fails.
Description
Cloned from https://bugzilla.redhat.com/show_bug.cgi?id=1102763
Description of problem:
if a capsule runs into some issue that keeps syncs from completing, there is nothing to indicate this
Version-Release number of selected component (if applicable):
Satellite-6.0.3-RHEL-6-20140528.4
How reproducible:
unsure.
Steps to Reproduce:
1. Attempt to sync sat content to a capsule. It may (or may not) help to reproduce this if you have two servers over wide geographical locations
2. Wait
3. View results.
Actual results:
In the synchronize process... user sees really nothing, other than the progress bar never moving -- in my case at 50%.
In the pulp logs on sat server we see:
May 29 15:48:44 ibm-x3550m3-07 pulp: pulp.server.async.scheduler:ERROR: Workers 'reserved_resource_worker-23@ibm-x3550m3-07.lab.eng.brq.redhat.com' has gone missing, removing from list of workers
May 29 15:48:52 ibm-x3550m3-07 pulp: gofer.transport.qpid.consumer:ERROR: 7461b684-4048-4e72-94dd-3b82956c6fab
May 29 15:48:52 ibm-x3550m3-07 pulp: gofer.transport.qpid.consumer:ERROR: Traceback (most recent call last):
May 29 15:48:52 ibm-x3550m3-07 pulp: gofer.transport.qpid.consumer:ERROR: File "/usr/lib/python2.6/site-packages/gofer/transport/qpid/consumer.py", line 113, in get
May 29 15:48:52 ibm-x3550m3-07 pulp: gofer.transport.qpid.consumer:ERROR: return self.__receiver.fetch(timeout=timeout)
May 29 15:48:52 ibm-x3550m3-07 pulp: gofer.transport.qpid.consumer:ERROR: File "<string>", line 6, in fetch
May 29 15:48:52 ibm-x3550m3-07 pulp: gofer.transport.qpid.consumer:ERROR: File "/usr/lib/python2.6/site-packages/qpid/messaging/endpoints.py", line 1030, in fetch
May 29 15:48:52 ibm-x3550m3-07 pulp: gofer.transport.qpid.consumer:ERROR: self._ecwait(lambda: self.linked)
May 29 15:48:52 ibm-x3550m3-07 pulp: gofer.transport.qpid.consumer:ERROR: File "/usr/lib/python2.6/site-packages/qpid/messaging/endpoints.py", line 50, in _ecwait
May 29 15:48:52 ibm-x3550m3-07 pulp: gofer.transport.qpid.consumer:ERROR: result = self._ewait(lambda: self.closed or predicate(), timeout)
May 29 15:48:52 ibm-x3550m3-07 pulp: gofer.transport.qpid.consumer:ERROR: File "/usr/lib/python2.6/site-packages/qpid/messaging/endpoints.py", line 994, in _ewait
May 29 15:48:52 ibm-x3550m3-07 pulp: gofer.transport.qpid.consumer:ERROR: self.check_error()
May 29 15:48:52 ibm-x3550m3-07 pulp: gofer.transport.qpid.consumer:ERROR: File "/usr/lib/python2.6/site-packages/qpid/messaging/endpoints.py", line 983, in check_error
May 29 15:48:52 ibm-x3550m3-07 pulp: gofer.transport.qpid.consumer:ERROR: raise self.error
May 29 15:48:52 ibm-x3550m3-07 pulp: gofer.transport.qpid.consumer:ERROR: NotFound: no such queue: pulp.task
Expected results:
The log above may have something to do with what's causing this, but is not the main issue; rather, the issue is the synchronizer never "gives up" or indicates an issue
Additional info:
Updated by Brad Buckingham over 10 years ago
- Assignee set to Brad Buckingham
- Triaged changed from No to Yes
Updated by The Foreman Bot over 10 years ago
- Status changed from New to Ready For Testing
- Target version set to 63
- Pull request https://github.com/Katello/katello/pull/4595 added
- Pull request deleted (
)
Updated by Brad Buckingham over 10 years ago
- Status changed from Ready For Testing to Closed
- % Done changed from 0 to 100
Applied in changeset katello|f778714b1e934857301ec977c31ec3e9075a3c4a.
Updated by Eric Helms over 10 years ago
- Translation missing: en.field_release set to 13
Updated by Steve Loranz about 10 years ago
- Related to Bug #10009: Trying to add a repo with a missing/unresponsive capsule hangs indefinitely/for a very long time. added
Updated by Steve Loranz about 10 years ago
- Status changed from Closed to Assigned
- Assignee changed from Brad Buckingham to Steve Loranz
- Target version changed from 55 to 69
- % Done changed from 100 to 30
- Translation missing: en.field_release changed from 13 to 23
The fix failed acceptance testing and was reopened.
Updated by Eric Helms almost 10 years ago
- Translation missing: en.field_release changed from 23 to 51
Updated by Eric Helms almost 10 years ago
- Related to Bug #10295: Capsule syncing should timeout if it is not picked up within a certain amount of time added
Updated by Eric Helms almost 10 years ago
- Translation missing: en.field_release changed from 51 to 55
Updated by The Foreman Bot almost 10 years ago
- Status changed from Assigned to Ready For Testing
Updated by dustin tsang almost 10 years ago
- Status changed from Ready For Testing to Closed
- % Done changed from 30 to 100
Applied in changeset katello|f6405f285efe6c40622ee1aa6ebc2c3d7307f4ba.
Updated by Eric Helms almost 10 years ago
- Assignee changed from Steve Loranz to dustin tsang
Updated by dustin tsang almost 10 years ago
new pull request to ensure that the task times out based on configurations (default 12hrs)
https://github.com/Katello/katello/pull/5278