Recovering from broker failures in MSK

My cluster is set up as follows: Brokers: 3, AZs: 3, RF: 3, MinISR: 1, acks: all

Q1: If a broker is being upgraded, Kafka will reassign the leadership of some partitions. After the upgrade, will leadership be reassigned again so that all brokers are used as before?

Q2: If one AZ (AZ1) goes down, I understand that Kafka will automatically move leadership of the affected partitions to the brokers in the other two AZs without impacting producers and consumers. When AZ1 comes back, will MSK automatically create/restart the failed broker and redistribute the partitions?

1 answer
Accepted answer

Please find answers inline:

Q1: If a broker is being upgraded, Kafka will reassign the leadership of some partitions. After the upgrade, will leadership be reassigned again so that all brokers are used as before?

  • Upgrades are done in a rolling fashion, one broker at a time. For example, in a 3-broker cluster, while broker 1 is being upgraded, all partition leadership held by broker 1 moves to broker 2 and broker 3. Once the upgrade is complete and all 3 brokers are active, the current leadership distribution is checked against the broker configuration parameter 'leader.imbalance.per.broker.percentage' (default: 10%). If a broker's imbalance exceeds that threshold, leadership is moved back to the preferred replicas, so all brokers hold leaders again after the upgrade.
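
To make the imbalance check concrete, here is a minimal Python sketch of the comparison as I understand it: for each broker, take the share of partitions where it is the preferred (first-listed) replica but not the current leader, and compare that share with the threshold. The function name, the topic name "t", and the sample assignment are made up for illustration; this is not MSK or Kafka code.

```python
from collections import defaultdict


def brokers_needing_rebalance(assignments, current_leaders, threshold_pct=10):
    """Return {broker: imbalance_ratio} for brokers above the threshold.

    assignments: partition -> replica list (first entry = preferred leader)
    current_leaders: partition -> broker id currently leading it
    threshold_pct: stands in for leader.imbalance.per.broker.percentage
    """
    preferred = defaultdict(list)
    for partition, replicas in assignments.items():
        preferred[replicas[0]].append(partition)

    over_threshold = {}
    for broker, partitions in preferred.items():
        # Partitions this broker should lead but currently does not.
        moved = [p for p in partitions if current_leaders[p] != broker]
        ratio = len(moved) / len(partitions)
        if ratio > threshold_pct / 100:
            over_threshold[broker] = ratio
    return over_threshold


# Hypothetical layout: six partitions, RF 3, spread over brokers 1..3.
assignments = {
    "t-0": [1, 2, 3], "t-1": [2, 3, 1], "t-2": [3, 1, 2],
    "t-3": [1, 3, 2], "t-4": [2, 1, 3], "t-5": [3, 2, 1],
}
# Broker 1 has just rejoined; brokers 2 and 3 still lead its partitions.
leaders_after_upgrade = {
    "t-0": 2, "t-1": 2, "t-2": 3, "t-3": 3, "t-4": 2, "t-5": 3,
}

print(brokers_needing_rebalance(assignments, leaders_after_upgrade))  # {1: 1.0}
```

Here broker 1 leads none of the two partitions it is preferred for, so its ratio is 1.0, well above 10%, and leadership for those partitions would be handed back to it.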

Q2: If one AZ (AZ1) goes down, I understand that Kafka will automatically move leadership of the affected partitions to the brokers in the other two AZs without impacting producers and consumers. When AZ1 comes back, will MSK automatically create/restart the failed broker and redistribute the partitions?

  • That's correct. Once the AZ comes back, the failed brokers are relaunched and added back to the existing cluster topology, and partition leadership is then redistributed automatically.
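
The outage and recovery sequence can be sketched as a small simulation. This is a simplified model, not MSK behavior verbatim: it assumes every replica is in sync when the AZ fails, that broker 1 is the broker in AZ1, and that leadership goes to the first surviving replica in the assignment list. The helper names and the topic name "t" are hypothetical.

```python
def fail_over(assignments, leaders, down):
    """Move leadership off brokers in `down` to the first surviving replica.

    Assumes all replicas are in sync; None means no replica survived.
    """
    new_leaders = {}
    for partition, replicas in assignments.items():
        if leaders[partition] in down:
            survivors = [b for b in replicas if b not in down]
            new_leaders[partition] = survivors[0] if survivors else None
        else:
            new_leaders[partition] = leaders[partition]
    return new_leaders


def preferred_election(assignments, leaders, live):
    """Hand leadership back to the preferred (first-listed) replica if it is live."""
    return {
        partition: replicas[0] if replicas[0] in live else leaders[partition]
        for partition, replicas in assignments.items()
    }


# Hypothetical layout: one broker per AZ, RF 3, six partitions.
assignments = {
    "t-0": [1, 2, 3], "t-1": [2, 3, 1], "t-2": [3, 1, 2],
    "t-3": [1, 3, 2], "t-4": [2, 1, 3], "t-5": [3, 2, 1],
}
steady_state = {p: replicas[0] for p, replicas in assignments.items()}

# AZ1 (broker 1) goes down: brokers 2 and 3 take over its partitions.
during_outage = fail_over(assignments, steady_state, down={1})

# AZ1 returns and broker 1 rejoins: leadership goes back to preferred replicas.
after_recovery = preferred_election(assignments, during_outage, live={1, 2, 3})

print(during_outage)
print(after_recovery == steady_state)
```

With RF 3 across three AZs, every partition still has two surviving replicas during the outage, which is why leadership can move without any partition going offline.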
AWS Support Engineer, answered 2 years ago
AWS Expert, reviewed 2 years ago
