OpenShift API for Data Protection / OADP-646

Restore using restic fails with "error: timed out waiting for all PodVolumeRestores to complete"


    • Type: Bug
    • Resolution: Not a Bug
    • Priority: Normal
    • Component: restic

      Restore using restic as the plugin failed on the following setup:
      cloud33 BM (12 workers)

      The restic pods have 16 GB memory request / 32 GB limit and 1 CPU request / 2 CPU limit.

      From looking at Grafana for the restic pods during the restore process,
      we did not hit any memory limitation.
      The restore failed after around 60 minutes.
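      For reference, the restic daemonset pods' live usage can also be spot-checked from the CLI while the restore is running; a minimal sketch, assuming the pods carry the upstream name=restic label in the openshift-adp namespace:

          oc -n openshift-adp adm top pods -l name=restic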

      From the restore log:

       [root@f07-h28-000-r640 tzahi]# velero restore logs  perf-2.1.0-nfs-restic-32gb  --insecure-skip-tls-verify |grep -i error
      time="2022-07-26T11:31:12Z" level=error msg="unable to successfully complete restic restores of pod's volumes" error="timed out waiting for all PodVolumeRestores to complete" logSource="pkg/restore/restore.go:1560" restore=openshift-adp/perf-2.1.0-nfs-restic-32gb
      time="2022-07-26T11:31:12Z" level=error msg="Velero restore error: timed out waiting for all PodVolumeRestores to complete" logSource="pkg/controller/restore_controller.go:497" restore=openshift-adp/perf-2.1.0-nfs-restic-32gb
      

       

      Name:         perf-2.1.0-nfs-restic-32gb
      Namespace:    openshift-adp
      Labels:       <none>
      Annotations:  <none>

      Phase:  PartiallyFailed (run 'velero restore logs perf-2.1.0-nfs-restic-32gb' for more information)

      Total items to be restored:  29
      Items restored:              29

      Started:    2022-07-26 10:31:11 +0000 UTC
      Completed:  2022-07-26 11:31:12 +0000 UTC

      Warnings:
        Velero:   <none>
        Cluster:  could not restore, CustomResourceDefinition "clusterserviceversions.operators.coreos.com" already exists. Warning: the in-cluster version is different than the backed-up version.
        Namespaces:
          perf-datagen-case1-nfs:  could not restore, RoleBinding "system:deployers" already exists. Warning: the in-cluster version is different than the backed-up version.
                                   could not restore, RoleBinding "system:image-builders" already exists. Warning: the in-cluster version is different than the backed-up version.
                                   could not restore, RoleBinding "system:image-pullers" already exists. Warning: the in-cluster version is different than the backed-up version.

      Errors:
        Velero:     timed out waiting for all PodVolumeRestores to complete
        Cluster:    <none>
        Namespaces: <none>

      Backup:  perf-2.1.0-nfs-restic-32gb

      Namespaces:
        Included:  all namespaces found in the backup
        Excluded:  <none>

      Resources:
        Included:        *
        Excluded:        nodes, events, events.events.k8s.io, backups.velero.io, restores.velero.io, resticrepositories.velero.io
        Cluster-scoped:  auto

      Namespace mappings:  <none>

      Label selector:  <none>

      Restore PVs:  auto

      Restic Restores:
        New:
          perf-datagen-case1-nfs/perf-datagen-case1-8495854c4d-gmkq4: vol-0

      Existing Resource Policy:   <none>
      Preserve Service NodePorts:  auto
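      The Started/Completed timestamps above are almost exactly one hour apart, which matches the Velero server's default restic timeout of 1h (the --restic-timeout server flag). A quick way to check whether the flag is set explicitly on this cluster (a sketch, assuming the Velero deployment is named velero in openshift-adp and the server container is the first one in the pod spec):

          oc -n openshift-adp get deployment velero \
            -o jsonpath='{.spec.template.spec.containers[0].args}'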
      

       

      Restic pods CPU/memory resource allocations:

        configuration:
          restic:
            enable: true
            podConfig:
              resourceAllocations:
                limits:
                  cpu: 2
                  memory: 32768Mi
                requests:
                  cpu: 1
                  memory: 16384Mi 
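      For context, the fragment above sits under spec.configuration.restic in the DataProtectionApplication CR; a minimal sketch of the surrounding object, with the name and plugin list as illustrative placeholders and the backup location omitted:

          apiVersion: oadp.openshift.io/v1alpha1
          kind: DataProtectionApplication
          metadata:
            name: dpa-sample              # illustrative name
            namespace: openshift-adp
          spec:
            # backupLocations omitted for brevity
            configuration:
              velero:
                defaultPlugins:
                  - openshift             # illustrative plugin set
                  - aws
              restic:
                enable: true
                podConfig:
                  resourceAllocations:
                    limits:
                      cpu: 2
                      memory: 32768Mi
                    requests:
                      cpu: 1
                      memory: 16384Mi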

       

      Versions:

      OCP:  4.10.21
      OADP: 1.1.0 (iib: 279750)

       

       

            Assignee: Scott Seago (sseago)
            Reporter: Tzahi Ashkenazi (tzahia)
