Handling Json Data [AWS Glue]

0

Hi folks!

right now i'm working on project to do ETL Process using AWS Glue. I already create the query script to process the data, but my source data is in Json format. I need help to transform the json data into table form.

I need solution in Visual ETL Job

Note: right now i use s3 but need solution if using DynamoDB as the source DB

Ivan
已提問 4 個月前檢視次數 237 次
1 個回答
0

In a visual job, you can use an S3 source and specify JSON format, with an optional JsonPath, or due the same reading from a JSON table that you can build using a crawler.
Once the source reads the data as you need, then you can store it on DynamoDB to use as the source of other jobs.

profile pictureAWS
專家
已回答 4 個月前
profile picture
專家
已審閱 1 個月前
  • Thanks for your respon! but sorry i think i will give you my example flow if using table form (Structured data)

    S3 (csv) [Sources] -> Select fields [Transforms] -> SQL Query [Transforms] -> S3 (csv) [targets]

    but now the source data is using Json format, which is i need too transform it into table/structured data so i can do data transformation using my SQL Query.

    I'm new with AWS, usually i just use ETL tools like pentaho/talend. so i prefer use visual job than the script one.

  • the "source" transfoms the data into a structured in memory table, which you can see in the "Data preview" panel and from then transform, run SQL or whatever you need

您尚未登入。 登入 去張貼答案。

一個好的回答可以清楚地回答問題並提供建設性的意見回饋,同時有助於提問者的專業成長。

回答問題指南