Skip to content

Questions tagged with Data Lakes

Content language: English

Filter questions
Select tags to filter
Sort by
Sort by most recent
Filter Questions by:

Browse through the questions and answers listed below or filter and sort to narrow down your results.

96 results
I've got a fairly simple ETL job that reads several catalog tables or views and does some joins. the job errors out with the following error: ``` Error Category: UNCLASSIFIED_ERROR; An error occurr...
1
answers
0
votes
397
views
asked 2 years ago
Is it possible to integrate opentelemetry(ADOT) and promethus managed by AWS with a data lake. The data lake could be both on edge or on AWS? If yes then how does it work? How can oepntemetry and Pro...
1
answers
0
votes
375
views
asked 2 years ago
At every 30 minutes it saying "ICEBERG_VACUUM_MORE_RUNS_NEEDED: Removed 20000 files in this round of vacuum" but when I calculate the my table metadata size it didn't changed before and after the vacu...
2
answers
0
votes
1.1K
views
asked 2 years ago
Hi! I have been searching and playing around with services and cannot seem to find what I need. I am using the following architecture to guide me in building out my end-to-end solution: https://aws....
2
answers
0
votes
492
views
asked 2 years ago
When executing a task the last step is validating the data migrated with the source against target apparently using Athena, I have the following error: 2023-11-07T22:09:04 [VALIDATOR_TARGE ]E: Not r...
1
answers
1
votes
1.6K
views
asked 2 years ago
Hello I have created a resource link to a shared database from a different account. I am able to query the tables within the database but "Show tables from <database>" and "View Tables" on the AWS Lak...
1
answers
0
votes
975
views
asked 2 years ago
My Glue 4.0 jobs have suddenly stopped working with error message below. As it is related to boto3, I am unable to make any changes to library config. Pls advise. NB: I noticed that urllib3 released ...
0
answers
0
votes
312
views
asked 2 years ago
Hello everyone, ### 1. Context We have a Delta Lake where we write our tables in **S3** in **Delta format**, and we use the **Glue Catalog** for queries in **Athena**. Tables are created both with an...
1
answers
1
votes
1.3K
views
asked 2 years ago
I am trying to create a Delta Table from spark sql using the Glue meta catalog. I can correctly query a Delta table using the Glue metastore: ``` %%sql select * from `my_table` VERSION AS OF 1 limit ...
2
answers
0
votes
3.3K
views
asked 3 years ago
Looks like attempting to write to a Delta Lake table from a DynamicFrame is not working. The Visual Glue interface generates a script like: ``` s3 = glueContext.write_dynamic_frame.from_optio...
2
answers
0
votes
937
views
asked 3 years ago
Hello team, We are planning to build a data lake in AWS that will contain regularly extracted data from an on-prem data warehouse. The purpose of this data lake is to serve the following purposes in ...
2
answers
0
votes
1.1K
views
asked 3 years ago
This CDK code produces a "Resource did not stabilize" error: ``` data_location = lakeformation.CfnPrincipalPermissions.DataLocationResourceProperty( catalog_id=Aws.ACCOUNT_ID, res...
1
answers
0
votes
695
views
asked 3 years ago
  • 1
  • 2
  • 3
  • 4
  • 5
  • •••
  • 8
  • Page size
    12 / page