EMR Serverless consume different amounts of resources in the different regions for the same data size

0

We are using EMR serverless to process the data, but we find for the same data size, EMR serverless consume different amounts of resource, such as vCPU-hours, memoryGB-hours and storageGB-hours.

I testing in us-east-1, us-east-2 and ap-northeast-1, I found the cheapest one is us-east-2, and then us-east-1 and ap-northeast-1. The difference between the two regions is more than 20%.

What's the reason of the difference?

Thanks

질문됨 6달 전191회 조회
1개 답변
3

Hello,

EMR Serverless automatically scales workers up or down based on the workload and parallelism required at every stage of the job. The worker container's resource requirement is availed based on the availability resource pool which might varies depends on the instance type tenancy. I request you to compare the application logs for an instance (tez, spark container logs to see if any stages skipped or reducer optimized the steps etc).

AWS
지원 엔지니어
답변함 6달 전

로그인하지 않았습니다. 로그인해야 답변을 게시할 수 있습니다.

좋은 답변은 질문에 명확하게 답하고 건설적인 피드백을 제공하며 질문자의 전문적인 성장을 장려합니다.

질문 답변하기에 대한 가이드라인

관련 콘텐츠