Bug #7162
closedcapsule: synchronize command never times out/silently fails.
Description
Cloned from https://bugzilla.redhat.com/show_bug.cgi?id=1102763
Description of problem:
if a capsule runs into some issue that keeps syncs from completing, there is nothing to indicate this
Version-Release number of selected component (if applicable):
Satellite-6.0.3-RHEL-6-20140528.4
How reproducible:
unsure.
Steps to Reproduce:
1. Attempt to sync sat content to a capsule. It may (or may not) help to reproduce this if you have two servers over wide geographical locations
2. Wait
3. View results.
Actual results:
In the synchronize process... user sees really nothing, other than the progress bar never moving -- in my case at 50%.
In the pulp logs on sat server we see:
May 29 15:48:44 ibm-x3550m3-07 pulp: pulp.server.async.scheduler:ERROR: Workers 'reserved_resource_worker-23@ibm-x3550m3-07.lab.eng.brq.redhat.com' has gone missing, removing from list of workers
May 29 15:48:52 ibm-x3550m3-07 pulp: gofer.transport.qpid.consumer:ERROR: 7461b684-4048-4e72-94dd-3b82956c6fab
May 29 15:48:52 ibm-x3550m3-07 pulp: gofer.transport.qpid.consumer:ERROR: Traceback (most recent call last):
May 29 15:48:52 ibm-x3550m3-07 pulp: gofer.transport.qpid.consumer:ERROR: File "/usr/lib/python2.6/site-packages/gofer/transport/qpid/consumer.py", line 113, in get
May 29 15:48:52 ibm-x3550m3-07 pulp: gofer.transport.qpid.consumer:ERROR: return self.__receiver.fetch(timeout=timeout)
May 29 15:48:52 ibm-x3550m3-07 pulp: gofer.transport.qpid.consumer:ERROR: File "<string>", line 6, in fetch
May 29 15:48:52 ibm-x3550m3-07 pulp: gofer.transport.qpid.consumer:ERROR: File "/usr/lib/python2.6/site-packages/qpid/messaging/endpoints.py", line 1030, in fetch
May 29 15:48:52 ibm-x3550m3-07 pulp: gofer.transport.qpid.consumer:ERROR: self._ecwait(lambda: self.linked)
May 29 15:48:52 ibm-x3550m3-07 pulp: gofer.transport.qpid.consumer:ERROR: File "/usr/lib/python2.6/site-packages/qpid/messaging/endpoints.py", line 50, in _ecwait
May 29 15:48:52 ibm-x3550m3-07 pulp: gofer.transport.qpid.consumer:ERROR: result = self._ewait(lambda: self.closed or predicate(), timeout)
May 29 15:48:52 ibm-x3550m3-07 pulp: gofer.transport.qpid.consumer:ERROR: File "/usr/lib/python2.6/site-packages/qpid/messaging/endpoints.py", line 994, in _ewait
May 29 15:48:52 ibm-x3550m3-07 pulp: gofer.transport.qpid.consumer:ERROR: self.check_error()
May 29 15:48:52 ibm-x3550m3-07 pulp: gofer.transport.qpid.consumer:ERROR: File "/usr/lib/python2.6/site-packages/qpid/messaging/endpoints.py", line 983, in check_error
May 29 15:48:52 ibm-x3550m3-07 pulp: gofer.transport.qpid.consumer:ERROR: raise self.error
May 29 15:48:52 ibm-x3550m3-07 pulp: gofer.transport.qpid.consumer:ERROR: NotFound: no such queue: pulp.task
Expected results:
The log above may have something to do with what's causing this, but is not the main issue; rather, the issue is the synchronizer never "gives up" or indicates an issue
Additional info:
Updated by Brad Buckingham over 10 years ago
- Assignee set to Brad Buckingham
- Triaged changed from No to Yes
Updated by The Foreman Bot over 10 years ago
- Status changed from New to Ready For Testing
- Target version set to 63
- Pull request https://github.com/Katello/katello/pull/4595 added
- Pull request deleted (
)
Updated by Brad Buckingham over 10 years ago
- Status changed from Ready For Testing to Closed
- % Done changed from 0 to 100
Applied in changeset katello|f778714b1e934857301ec977c31ec3e9075a3c4a.
Updated by Eric Helms about 10 years ago
- Translation missing: en.field_release set to 13
Updated by Steve Loranz over 9 years ago
- Related to Bug #10009: Trying to add a repo with a missing/unresponsive capsule hangs indefinitely/for a very long time. added
Updated by Steve Loranz over 9 years ago
- Status changed from Closed to Assigned
- Assignee changed from Brad Buckingham to Steve Loranz
- Target version changed from 55 to 69
- % Done changed from 100 to 30
- Translation missing: en.field_release changed from 13 to 23
The fix failed acceptance testing and was reopened.
Updated by Eric Helms over 9 years ago
- Translation missing: en.field_release changed from 23 to 51
Updated by Eric Helms over 9 years ago
- Related to Bug #10295: Capsule syncing should timeout if it is not picked up within a certain amount of time added
Updated by Eric Helms over 9 years ago
- Translation missing: en.field_release changed from 51 to 55
Updated by The Foreman Bot over 9 years ago
- Status changed from Assigned to Ready For Testing
Updated by dustin tsang over 9 years ago
- Status changed from Ready For Testing to Closed
- % Done changed from 30 to 100
Applied in changeset katello|f6405f285efe6c40622ee1aa6ebc2c3d7307f4ba.
Updated by Eric Helms over 9 years ago
- Assignee changed from Steve Loranz to dustin tsang
Updated by dustin tsang over 9 years ago
new pull request to ensure that the task times out based on configurations (default 12hrs)
https://github.com/Katello/katello/pull/5278