1 Answer
Hi there,
Another option for inferring schemas of files that reside in S3 is to use an AWS Glue Crawler.
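As a rough illustration of setting up such a crawler from code, here is a minimal sketch using boto3's Glue client (create_crawler and start_crawler). The crawler name, role ARN, database name, and S3 path are all placeholders I made up for the example, so substitute your own values.

```python
def crawler_request(name, role_arn, database, s3_path):
    """Build the keyword arguments for the Glue create_crawler call."""
    return {
        "Name": name,
        "Role": role_arn,            # IAM role the crawler assumes to read S3
        "DatabaseName": database,    # Glue database that receives the tables
        "Targets": {"S3Targets": [{"Path": s3_path}]},
    }


def create_and_start(request):
    """Create the crawler and kick off its first run."""
    import boto3  # imported lazily so crawler_request works without boto3

    glue = boto3.client("glue")
    glue.create_crawler(**request)
    glue.start_crawler(Name=request["Name"])


# All values below are hypothetical placeholders.
request = crawler_request(
    name="sales-crawler",
    role_arn="arn:aws:iam::123456789012:role/MyGlueCrawlerRole",
    database="sales_db",
    s3_path="s3://my-bucket/sales/",
)
# create_and_start(request)  # uncomment to run against your own AWS account
```

The same crawler can of course be created from the Glue console instead; the code is only one way to do it.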
Once the S3 files have been crawled, table entries appear in the AWS Glue Data Catalog. You can make them visible in Redshift by creating an external schema with CREATE EXTERNAL SCHEMA ... FROM DATA CATALOG, pointing it at the Glue database the crawler wrote to.
Once the external schema is created, you can query the crawled tables from inside Redshift. To create a physical copy of an external table in Redshift, you can run a CTAS (CREATE TABLE AS) statement.
Tables crawled later will also appear in Redshift for querying, as long as the crawler writes them to the same Glue database that the external schema references.
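To make the two Redshift statements concrete, here is a small sketch that renders them as SQL strings you could paste into a query editor or run through whatever client you use. The schema, database, table, and role names are placeholders I invented, and the helpers do no escaping, so only use them with values you trust.

```python
def external_schema_sql(schema, glue_database, iam_role_arn):
    """Render CREATE EXTERNAL SCHEMA ... FROM DATA CATALOG for Redshift."""
    return (
        f"CREATE EXTERNAL SCHEMA IF NOT EXISTS {schema}\n"
        f"FROM DATA CATALOG\n"
        f"DATABASE '{glue_database}'\n"
        f"IAM_ROLE '{iam_role_arn}';"
    )


def ctas_sql(target_table, schema, external_table):
    """Render a CTAS that copies an external table into a local table."""
    return (
        f"CREATE TABLE {target_table} AS\n"
        f"SELECT * FROM {schema}.{external_table};"
    )


# All names below are hypothetical placeholders.
schema_stmt = external_schema_sql(
    schema="spectrum_sales",
    glue_database="sales_db",
    iam_role_arn="arn:aws:iam::123456789012:role/MyRedshiftSpectrumRole",
)
copy_stmt = ctas_sql("public.orders_local", "spectrum_sales", "orders")

print(schema_stmt)
print(copy_stmt)
```

The IAM role here is the one attached to the Redshift cluster, and it needs permission to read both the Glue Data Catalog and the underlying S3 files.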
I hope this helps!