1 réponse
- Le plus récent
- Le plus de votes
- La plupart des commentaires
4
I would recommend using Amazon EMR to run your Spark applications. Amazon EMR is a managed cluster platform that simplifies running big data frameworks, such as Apache Hadoop and Apache Spark, on AWS. It's designed for data processing tasks and is a good fit for your use case.\
ERM Advantages
- EMR can scale your cluster up or down depending on your data processing needs. It also integrates well with Amazon S3, which can be used as a data lake to store your input and output data.
- EMR supports running Spark applications written in various programming languages such as Scala, Python, and Java. It also provides integration with Apache Kafka and other AWS services.
- You can use EC2 Spot Instances to save on costs when running your EMR clusters. Additionally, EMR has an auto-termination feature that automatically terminates idle clusters to save costs.
- EMR integrates with AWS Identity and Access Management (IAM), allowing you to control access to your Spark applications and data.
Contenus pertinents
- demandé il y a 7 mois
- demandé il y a un an
- demandé il y a 8 mois
- AWS OFFICIELA mis à jour il y a un an
- AWS OFFICIELA mis à jour il y a un an
- AWS OFFICIELA mis à jour il y a un an
- AWS OFFICIELA mis à jour il y a 2 ans