Questions tagged with Amazon EMR
Content language: English
Select up to 5 tags to filter
Sort by most recent
Browse through the questions and answers listed below or filter and sort to narrow down your results.
This [documentation page][1] shows how to set the `JAVA_HOME` environment variable for EMR. I'm experimenting with running another version of Java, and I want to try passing some more command line...
0
answers
0
votes
123
views
asked a year agolg...
I have a 3TB Data kept in S3 location , I have created a Presto cluster in EMR and it queries the S3 data with every kind of filter condition ,,, but it takes 5-10 min (approx) to give results.
Now I...
1
answers
0
votes
275
views
asked a year agolg...
Hi team,
I see that deleting the EMR cluster sometimes does not necessarily delete the virtual clusters that they create and they remain running stale.
Everytime I see this issue, I can only use the...
0
answers
0
votes
57
views
asked a year agolg...
I've followed the methods for adding Python libraries. Documentation here:
https://docs.aws.amazon.com/emr/latest/EMR-Serverless-UserGuide/using-python-libraries.html
Boto installs and loads...
1
answers
0
votes
397
views
asked a year agolg...
Can we use Python 3.9 with EMR 6.9 / 6.10 ? Can we load multiple Python versions in EMR?
1
answers
0
votes
297
views
asked a year agolg...
Hi,
I have a workspace that successfully attaches to a EMR (Spark cluster with `applications = ["Spark", "JupyterEnterpriseGateway"]` ) cluster. But when i run any commands from the notebook, i get...
1
answers
0
votes
813
views
asked a year agolg...
**Problem**
I'd like to process my data using Spark.
How can I consume Lake Formation data assets that I'm subscribed to, using Glue ETL Jobs or EMR?
**Context**
I created following domains:...
2
answers
0
votes
1146
views
asked a year agolg...
Hi,
Looking to confirm how things work!
- I have created a EMR cluster with ec2 which closes down after no use
- I have created a EMR Studio using the terraform module:...
1
answers
0
votes
1306
views
asked a year agolg...
My job is still running even though I have received the results. why is that ??
I have successfully go the results.
Can I manually set the "Run status" as "Success" after I got result ? I mean are...
1
answers
0
votes
576
views
asked a year agolg...
I want to configure jar (deequ-2.0.1-spark-3.2.jar) on EMR serverless arm64.
This works for x86_64 but doesn't work for arm64 architecture.
Could you please consider this matter it gave this error...
2
answers
0
votes
541
views
asked a year agolg...
Hi,
trying to attach a emr studio and workspace to a emr cluster via terraform. But get an error saying:
```
Error: creating EMR Studio: InvalidRequestException: The service role does not have...
5
answers
0
votes
1905
views
asked a year agolg...
I'm trying to deploy Hadoop on EMR EC2 Cluster and having an issue with **local disk encryption error** on AWS Console.
AWS doesn't provide me any deeper logs nor info about the issue (S3 bucket log...
3
answers
0
votes
409
views
asked a year agolg...