Questions tagged with Data Lakes

96 results
Hi Team, I couldn't find a list or details of the tools that Amazon S3 integrates with by using an S3 connector. Which tools integrate with S3 to provide in-place querying of S3 data (i.e. data shouldn'...
1 answer, 0 votes, 438 views, asked 4 years ago
I need to load data from multiple tables in SQL Server to S3 for some batch processing. Can AWS Glue read data from different SQL Server tables, generate CSV files, and zip them to S3? And can AWS G...
1 answer, 0 votes, 796 views, AWS, asked 4 years ago
Hey, I am trying to learn what others are doing and what the best practices are with Glue for development automation and testing/validation.
1 answer, 0 votes, 443 views, asked 4 years ago
I have a large dataset (table) with >1e9 records (rows) in Glue. The tables are partitioned by column A, which is an n-letter substring of column B. For example: | A (partition key) | B | ... | | --- ...
1 answer, 0 votes, 422 views, asked 4 years ago
Hi, I'm relatively new to AWS Glue and am having trouble with the following transformation code: ``` DataSource4 = glueContext.create_dynamic_frame.from_catalog(database = "beta", table_name = "[tabl...
1 answer, 0 votes, 4.8K views, asked 4 years ago
We are exploring use cases where we want to achieve in-place transformation and querying of S3 data lake data. We don't want to provision a database and create tables (so we are not keen to consider Reds...
5 answers, 0 votes, 1.6K views, asked 4 years ago
I have a table with columns A, B, C, D, ..., where A is a partition key. In a Glue job I want to group records of this table by column A. Is there a way to make the Glue workers aware of the partition...
1 answer, 0 votes, 884 views, asked 4 years ago
What is the best way to scale cross-account AWS KMS–encrypted Amazon S3 bucket access using ABAC? Tag Name – scaling-cross-account-kms-encrypted-s3-access-using-ABAC
1 answer, 0 votes, 577 views, AWS, asked 4 years ago
My blueprint needs an S3 PutObject event to start a Glue ETL job. I only see on-demand and schedule-based triggers as options when creating a blueprint. Does anyone know of a method to creat...
1 answer, 0 votes, 503 views, asked 4 years ago
I have an AWS Data Lake that is ready to be used at the moment. My use case for the Data Lake is to be able to ingest data from different API connectors (coming from other data vendors and service ...
1 answer, 0 votes, 2.2K views, asked 4 years ago
I ran a Lake Formation blueprint and the result was as follows: | method to run | result | | --- | --- | | by hand | COMPLETED | | scheduled | IMPORT FAILED(NoCredentialsError: Unable to locate credent...
0 answers, 0 votes, 273 views, asked 4 years ago
Hi all, I'm using a Glue job that reads CSV files from S3 and injects data into MySQL RDS. At the end of my PySpark Glue script, I call a stored procedure. The issue is that the Glue job status ends with succ...
1 answer, 0 votes, 1.9K views, asked 4 years ago