Articles tagged with Amazon EMR
Amazon EMR is a cloud big data platform for running large-scale distributed data processing jobs, interactive SQL queries, and machine learning (ML) applications using open-source analytics frameworks such as Apache Spark, Apache Hive, and Presto.
Content language: English
Filter articles
Select tags to filter
Sort by
Sort by most recent
Browse through articles or filter your results using the tools displayed.
13 results
Ram AchantaEXPERT
published 14 days ago1 votes103 views
Enterprises struggle with EMR version upgrades, facing challenges like production downtime, performance degradation, and compliance risks. Without a structured approach, organizations often experience...
GaganBrahmiAWSEXPERT
published a year ago2 votes532 views
Performance testing for big data analytics tools and engines at petabyte scale is an increasingly challenging avenue. Using traditional sample test datasets may not reflect the actual production-grade...
Yokesh NKSUPPORT ENGINEER
published a year ago3 votes1.6K views
This article offers instructions on how to set up and access Delta tables from SQL Explorer in EMR JupyterHub. SQL Explorer utilizes the Presto engine configured within the EMR cluster to process data...
ARTICLE
Amazon EMRYokesh NKSUPPORT ENGINEER
published a year ago2 votes3.1K views
This article offers instructions on how to configure additional Elastic Block Store (EBS) volumes for HDFS or YARN to increase the storage capacity of a running Amazon EMR cluster.
ARTICLE
Amazon EMRYokesh NKSUPPORT ENGINEER
published 2 years ago2 votes2.1K views
This article might provide guidance on configuring and accessing the Spark application UI for Interactive Endpoints that are either self-hosted notebooks or EMR Studio managed notebooks.
Yokesh NKSUPPORT ENGINEER
published 2 years ago3 votes1.5K views
The guidance provided in the article could prove instrumental in conducting a comprehensive and systematic evaluation of the log data, potentially leading to the identification and resolution of the u...
ARTICLE
Amazon EMRYokesh NKSUPPORT ENGINEER
published 2 years ago3 votes2.1K views
This article might help to investigate the EMR cluster that terminated with error mentioned as "On the master instance, application provisioning failed".
ARTICLE
Amazon EMRYokesh NKSUPPORT ENGINEER
published 2 years ago3 votes1.6K views
This article might help to investigate the EMR cluster that terminated with error mentioned as "Master instance startup failed due to an internal error" especially when using custom AMI image.
ARTICLE
Amazon EMRYokesh NKSUPPORT ENGINEER
published 2 years ago3 votes2K views
This article might help to investigate the EMR cluster that terminated with error mentioned as "Failed to start the job flow due to an internal error" especially when using custom AMI image.
ARTICLE
Amazon EMRYokesh NKSUPPORT ENGINEER
published 2 years ago4 votes1.7K views
The Instance-state log available in Amazon EMR on EC2 that provides valuable information for troubleshooting application failures or investigating system details. This article describes the detailed i...
ARTICLE
Amazon EMRBehrens, IsaacEXPERT
published 2 years ago0 votes2.1K views
Assist with build and install of prerequisite software for TensorFlow on Amazon Linux 2023 for Graviton
Yokesh NKSUPPORT ENGINEER
published 2 years ago3 votes1.6K views
This article describes the high level procedure on how to integrate the tableau application with kerberized EMR cluster.
ARTICLE
Amazon EMR