Querying external data using Amazon Redshift Spectrum
Using Redshift Spectrum, you can efficiently query and retrieve structured and semi-structured data from files in Amazon S3 without having to load the data into Redshift tables.
Redshift Spectrum queries employ massive parallelism to run quickly against large datasets.
Much of the processing occurs in the Redshift Spectrum layer, and most of the data remains in S3.
Multiple clusters can concurrently query the same dataset in S3 without the need to make copies of the data for each cluster.
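To make the workflow concrete, the following is a minimal sketch in Python that only renders the SQL statements typically involved: registering an external schema, defining an external table over files in S3, and querying that table in place. It does not connect to a cluster. The schema, database, table, and column names, the bucket path, and the IAM role ARN are all hypothetical placeholders, and the Parquet file format is an assumption chosen for illustration.

```python
from typing import List, Tuple


def external_schema_sql(schema: str, catalog_db: str, iam_role_arn: str) -> str:
    """Render a CREATE EXTERNAL SCHEMA statement backed by the data catalog."""
    return (
        f"CREATE EXTERNAL SCHEMA {schema}\n"
        f"FROM DATA CATALOG\n"
        f"DATABASE '{catalog_db}'\n"
        f"IAM_ROLE '{iam_role_arn}'\n"
        f"CREATE EXTERNAL DATABASE IF NOT EXISTS;"
    )


def external_table_sql(
    schema: str,
    table: str,
    columns: List[Tuple[str, str]],
    s3_location: str,
) -> str:
    """Render a CREATE EXTERNAL TABLE statement; the data itself stays in S3."""
    cols = ",\n".join(f"  {name} {sqltype}" for name, sqltype in columns)
    return (
        f"CREATE EXTERNAL TABLE {schema}.{table} (\n{cols}\n)\n"
        f"STORED AS PARQUET\n"  # assumed file format for this sketch
        f"LOCATION '{s3_location}';"
    )


# Hypothetical names and locations, used only for illustration.
statements = [
    external_schema_sql(
        "spectrum_schema",
        "spectrum_db",
        "arn:aws:iam::123456789012:role/ExampleSpectrumRole",
    ),
    external_table_sql(
        "spectrum_schema",
        "sales",
        [("salesid", "INTEGER"), ("eventid", "INTEGER"), ("pricepaid", "DECIMAL(8,2)")],
        "s3://example-bucket/sales/",
    ),
    # The external table is queried like a local table; no load step is needed.
    "SELECT eventid, SUM(pricepaid) AS total_paid\n"
    "FROM spectrum_schema.sales\n"
    "GROUP BY eventid;",
]

for statement in statements:
    print(statement, end="\n\n")
```

Because the external table is only metadata pointing at the S3 location, a second cluster could register the same schema and run the same query without copying the underlying files.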