Skip to content

libpcp: recover from corrupted archives in multi-archive contexts#2586

Open
kurik wants to merge 1 commit intoperformancecopilot:mainfrom
kurik:corrupted-archive-recovery
Open

libpcp: recover from corrupted archives in multi-archive contexts#2586
kurik wants to merge 1 commit intoperformancecopilot:mainfrom
kurik:corrupted-archive-recovery

Conversation

@kurik
Copy link
Copy Markdown
Contributor

@kurik kurik commented May 7, 2026

When reading multi-archive PCP logs (e.g. Cockpit metrics history), a corrupted volume no longer terminates the entire replay. Instead, corrupted archives are flagged and skipped so that data from healthy volumes can still be read.

  • Add 'corrupted' field to __pmMultiLogCtl to track damaged archives
  • Skip corrupt/unreadable archives during multi-archive context init with a warning rather than failing outright
  • Rebuild shared __pmLogCtl when lost due to failed archive opens
  • On PM_ERR_LOGREC in __pmLogRead, advance to the next (or previous) archive instead of returning an error in multi-archive mode
  • Skip known-corrupted archives in LogChangeToNextArchive() and LogChangeToPreviousArchive()
  • Update QA tests 722 and 1671 for new recovery behaviour

Resolves: RHEL-169855

When reading multi-archive PCP logs (e.g. Cockpit metrics history),
a corrupted volume no longer terminates the entire replay.  Instead,
corrupted archives are flagged and skipped so that data from healthy
volumes can still be read.

- Add 'corrupted' field to __pmMultiLogCtl to track damaged archives
- Skip corrupt/unreadable archives during multi-archive context init
  with a warning rather than failing outright
- Rebuild shared __pmLogCtl when lost due to failed archive opens
- On PM_ERR_LOGREC in __pmLogRead, advance to the next (or previous)
  archive instead of returning an error in multi-archive mode
- Skip known-corrupted archives in LogChangeToNextArchive() and
  LogChangeToPreviousArchive()
- Update QA tests 722 and 1671 for new recovery behaviour

Resolves: RHEL-169855
@kurik kurik requested review from kmcdonell and natoscott May 7, 2026 08:54
@kurik
Copy link
Copy Markdown
Contributor Author

kurik commented May 7, 2026

@natoscott , @kmcdonell May I ask you for a review, please?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant