Questions in Analytics
Content language: English
Select up to 5 tags to filter
Sort by most recent
Browse through the questions and answers listed below or filter and sort to narrow down your results.
Hello,
We set up AWS DMS, where the source is MS SQL Server 2019, and the target is S3 (with parquet). Setting up CDC copying. And it is important for us to check that DDLs on source work as well:
1)...
0
answers
0
votes
209
views
asked 8 days agolg...
I am getting json files to my s3. For example:
```
{
"name" : "John",
"lastname": "Doe",
"meta" : {
"x": "a",
"y": "b",
"unwanted_field": {
"some":...
1
answers
0
votes
72
views
asked 8 days agolg...
Environment variables for PySpark executor in AWS EMR Serverless and Env key limitations with EMRlg...
Hello, I have gone documentation and practically observed the limitation for ENV Keys `spark.emr-serverless.driverEnv` and `spark.emr-serverless.executorEnv` with EMR Serverless which is limited to 50...
0
answers
0
votes
56
views
asked 8 days agolg...
Adding tag to EDPlg...
Is there any way to tag EDP? When I create some quick sight dashboards and filter by tag/costs, EDP just shows up as empty. I would have to add a filter for charge type to show the amount related to...
0
answers
0
votes
74
views
asked 8 days agolg...
I'm trying to remove a database but i'm getting this error:
SQL Error [1010] [HY000]: Error dropping database (can't rmdir './databasename', errno: 39)
I've been reading solutions, but it looks like...
1
answers
0
votes
125
views
asked 9 days agolg...
say i have couple of json files in s3, I would to set up a crawler or a glue job, such that i can create table in aws rds (mysql or postgre) , such that in table 1, it creates a autogenerated id and...
1
answers
0
votes
498
views
asked 9 days agolg...
I need to find the difference between two timestamp range. I tried using DATEDIFF but not getting the exact result. Below is the query I am using:
```
select
rtrim(datediff(hour,'2024-05-15...
1
answers
0
votes
255
views
asked 9 days agolg...
I'm having the same issue. Data is stored in below format in s3 as JSON array with partitions
S3 path - s3://fleet-fuelcard-data-import-dev/lambda/fuelsoft-morgan/660306/2024/Apr/03-Apr-2024.json....
1
answers
0
votes
44
views
asked 9 days agolg...
How to build AWS Glue ETL Jobs or Data Quality Jobs, if access to console is not allowed as per company policy. Does not having AWS Console access defeats the purpose AWS Glue? What features cannot be...
2
answers
0
votes
120
views
asked 9 days agolg...
I've been trying to test out Iceberg tables with Amazon Redshift Spectrum and have come across a major issue.
Here is my setup:
1. I create an iceberg table via spark (emr 7.0) and insert data across...
0
answers
1
votes
194
views
asked 10 days agolg...
when I followed this document https://docs.amazonaws.cn/en_us/redshift/latest/mgmt/jdbc20-configuration-options.html#jdbc20-plugin_name-option to connect redshift with IdpTokenAuthPlugin, I got an...
0
answers
0
votes
161
views
asked 10 days agolg...
I have deployed opensearch serverless collection. The collection type is VectorSearch. I have also defined all the security policies, like data access policies, encryption policy (Using KMS key),...
0
answers
0
votes
69
views
asked 10 days agolg...