Questions in Analytics
Why does Amazon EMR create inbound rule entries for master and core security groups?
![Core SG](/media/postImages/original/IM6Mggxg_vTQSTJFNCM0FRPA)
![Master...
1 answer, 0 votes, 145 views, asked 7 days ago
Hi,
I have seen a lot of examples where data records are sent from Kinesis Data Streams (KDS) to Kinesis Data Firehose (KDF) even when no real-time processing is required. Why can't we ingest data directly into KDF and store the records in data...
1 answer, 0 votes, 127 views, asked 7 days ago
Hello,
We set up AWS DMS with MS SQL Server 2019 as the source and S3 (Parquet) as the target, and we are configuring CDC replication. It is important for us to check that DDL statements on the source are handled as well:
1)...
0 answers, 0 votes, 205 views, asked 7 days ago
I am receiving JSON files in my S3 bucket. For example:
```
{
"name" : "John",
"lastname": "Doe",
"meta" : {
"x": "a",
"y": "b",
"unwanted_field": {
"some":...
```
1 answer, 0 votes, 71 views, asked 7 days ago
Environment variables for PySpark executor in AWS EMR Serverless and Env key limitations with EMR
Hello, I have gone through the documentation and observed in practice the limitation on the environment keys `spark.emr-serverless.driverEnv` and `spark.emr-serverless.executorEnv` in EMR Serverless, which is limited to 50...
0 answers, 0 votes, 56 views, asked 7 days ago
Adding tag to EDP
Is there any way to tag EDP? When I create QuickSight dashboards and filter by tag/costs, EDP just shows up as empty. I would have to add a filter for charge type to show the amount related to...
0 answers, 0 votes, 74 views, asked 8 days ago
I'm trying to remove a database, but I'm getting this error:
SQL Error [1010] [HY000]: Error dropping database (can't rmdir './databasename', errno: 39)
I've been reading solutions, but it looks like...
1 answer, 0 votes, 124 views, asked 8 days ago
Say I have a couple of JSON files in S3. I would like to set up a crawler or a Glue job so that I can create tables in AWS RDS (MySQL or PostgreSQL), such that in table 1 it creates an autogenerated ID and...
1 answer, 0 votes, 460 views, asked 8 days ago
I need to find the difference between two timestamps. I tried using DATEDIFF but am not getting the exact result. Below is the query I am using:
```
select
rtrim(datediff(hour,'2024-05-15...
```
1 answer, 0 votes, 253 views, asked 8 days ago
I'm having the same issue. The data is stored in S3 in the format below, as a JSON array with partitions.
S3 path - s3://fleet-fuelcard-data-import-dev/lambda/fuelsoft-morgan/660306/2024/Apr/03-Apr-2024.json....
1 answer, 0 votes, 43 views, asked 8 days ago
How can I build AWS Glue ETL jobs or Data Quality jobs if access to the console is not allowed by company policy? Does not having AWS Console access defeat the purpose of AWS Glue? What features cannot be...
2 answers, 0 votes, 119 views, asked 9 days ago
I've been trying to test out Iceberg tables with Amazon Redshift Spectrum and have come across a major issue.
Here is my setup:
1. I create an Iceberg table via Spark (EMR 7.0) and insert data across...
0 answers, 1 vote, 182 views, asked 9 days ago