Skip to content

Questions tagged with Data Lakes

Content language: English

Filter questions
Select tags to filter
Sort by
Sort by most recent
Filter Questions by:

Browse through the questions and answers listed below or filter and sort to narrow down your results.

96 results
I am attempting to use Hudi 1.1.1 in an AWS 5.0 Glue job. To avoid conflicts with the pre-installed Glue Hudi libraries, I have set the job parameter `--datalake-formats` to an empty string (""). Howe...
2
answers
0
votes
63
views
asked 12 days ago
We keep getting following error sometimes when we are wrting data to S3 table. Can anyone let me know what is the issue and work around? Its not happening for all tables but only specific tables only....
1
answers
0
votes
241
views
asked 5 months ago
I have existing Iceberg tables in gov cloud. I have created them using AWS Glue ETL spark py jobs. My dbt debug command says "All checks have passed" But my dbt run command gives me the below error: ...
1
answers
0
votes
201
views
asked 7 months ago
Hi, We are planning to use S3 table for storing our clients' data but we want to have RBAC-based feature so that we can make sure that data is access based on the permission. We are planning to crea...
1
answers
0
votes
81
views
asked 7 months ago
Using Airbyte "AWS Datalake" destination to push data into AWS Glue/S3. I have to use the S3 accesspoint ARN, since the location where Airbyte is running does not have open TCP:443 egress. ....so I...
1
answers
0
votes
118
views
asked 8 months ago
Hi all, I'm trying to create a Iceberg table through Athena. The detailed S3 bucket path looks like this 's3://my_bucket/company_id={company_id}/date={YYYY-MM-DD}/data.parquet However, the 'date' ...
1
answers
0
votes
790
views
asked a year ago
I am creating a data mesh architecture on datazone domain with associating multiple accounts from the Central Account but when I am adding IAM role as a user into root domain of Datazone it does not a...
1
answers
0
votes
457
views
asked a year ago
In AWS DMS, I have a Serverless replication, but I want to modify it now justs to add an extra table. No matter what I change, I get this error: Task Settings CloudWatchLogGroup or CloudWatchLogStream...
1
answers
0
votes
144
views
asked a year ago
This is the third time I've run into this error. I don't know is it a real issue or I just forget to set something up. So basically when I try to add data permission to all table in Lakeformation ba...
0
answers
0
votes
104
views
asked a year ago
Config of Redshift Cluster: - Enhanced VPC routing has enabled - Redshift subnet in the same subnet as S3 vpc endpoint Config of S3 - VPC endpoints created for S3 - Routing has configured to rout...
1
answers
0
votes
193
views
asked a year ago
hello, in planning phase of a Datalake project and came across LakeFormation which seems to be the preferred way. I understand that essentially it is a group of S3 buckets so resiliency & durability i...
1
answers
0
votes
496
views
asked 2 years ago
HIVE_CURSOR_ERROR: incorrect data check This query ran against the "dbreport" database, unless qualified by the query. Please post the error message on our forum or contact customer support with Que...
1
answers
0
votes
366
views
asked 2 years ago
  • 1
  • 2
  • 3
  • 4
  • 5
  • •••
  • 8
  • Page size
    12 / page