HDDS-8982. Infinite loop in WritableRatisContainerProvider if pipeline's nodes are not found#5742
HDDS-8982. Infinite loop in WritableRatisContainerProvider if pipeline's nodes are not found#5742DaveTeng0 wants to merge 7 commits intoapache:masterfrom
Conversation
adoroszlai
left a comment
There was a problem hiding this comment.
Thanks @DaveTeng0 for working on this.
...hdds/server-scm/src/main/java/org/apache/hadoop/hdds/scm/container/ContainerManagerImpl.java
Outdated
Show resolved
Hide resolved
| if (containerInfo.getContainerID() != -1) { | ||
| return containerInfo; | ||
| } else { | ||
| excludeList.addPipeline(containerInfo.getPipelineID()); |
There was a problem hiding this comment.
Trying to exclude the faulty pipeline may not work, because:
There was a problem hiding this comment.
yeah, the reason I add it in the first part of findPipelinesByState(Line 100) of this method is to prevent that broken pipeline from being selected in the second part of findPipelinesByState (Line 164 , after pipelineManager.createPipeline) I just feel it's nice to have that, but it could be removed too. I'm open to any suggestion whether to keep it or not here. Thanks!
….container.max.retry to private attribute in WritableRatisContainerProvider
|
Please check CI run in fork before starting PR workflow. If there is a failure caused by the change (here |
…bleRatisContainerProvider
|
Thanks again @DaveTeng0 for working on this. I prefer #5911 to this one. Marking as draft for now. Will close if #5911 gets merged. |
What changes were proposed in this pull request?
HDDS-8982. Prevent infinite loop in getContainer which causes log flooded by WritableRatisContainerProvider if pipeline's nodes are not found
Please describe your PR in detail:
What is the link to the Apache JIRA
https://issues.apache.org/jira/browse/HDDS-8982
How was this patch tested?
unit test