  • Athena
    Cloud/AWS 2022. 6. 17. 10:56

    Overview

    • Interactive query service for S3 (SQL)
    • No need to load data, it stays in S3
    • Presto under the hood
    • Serverless
    • Works with unstructured, semi-structured, or structured data
    • Supports many data formats
      • CSV
      • JSON
      • ORC
      • Parquet
      • Avro
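
    As a rough illustration of "the data stays in S3", the sketch below builds the DDL for an external table that only points at an S3 prefix. The table name, columns, and bucket path are hypothetical, and the helper `build_external_table_ddl` is made up for this note; only the self-describing formats that work with a plain `STORED AS` clause are covered.

```python
# Formats handled by a plain STORED AS clause in this sketch.
# CSV and JSON usually need a ROW FORMAT / SerDe clause and are left out.
SIMPLE_FORMATS = {"PARQUET", "ORC", "AVRO"}


def build_external_table_ddl(table, columns, data_format, s3_location):
    """Return a CREATE EXTERNAL TABLE statement for data already in S3.

    `columns` is a list of (name, type) pairs. Nothing is copied or loaded:
    the table is only metadata that points at `s3_location`.
    """
    fmt = data_format.upper()
    if fmt not in SIMPLE_FORMATS:
        raise ValueError(f"format not covered by this sketch: {data_format}")
    column_lines = ",\n".join(f"  {name} {col_type}" for name, col_type in columns)
    return (
        f"CREATE EXTERNAL TABLE IF NOT EXISTS {table} (\n"
        f"{column_lines}\n"
        f")\n"
        f"STORED AS {fmt}\n"
        f"LOCATION '{s3_location}'"
    )


# Hypothetical table over a hypothetical bucket.
ddl = build_external_table_ddl(
    "weblogs",
    [("request_time", "string"), ("status", "int")],
    "parquet",
    "s3://example-bucket/weblogs/",
)
```

    Running the resulting statement in Athena registers the table; the files under the S3 prefix are not moved or modified.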

    Examples

    • Ad-hoc queries of web logs
    • Querying staging data before loading it into Redshift
    • Analyzing CloudTrail/CloudFront/VPC/ELB etc. logs in S3
    • Integration with Jupyter, Zeppelin, and RStudio notebooks
    • Integration with QuickSight
    • Integration via ODBC/JDBC with other visualization tools
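
    An ad-hoc query can also be submitted from code. The following is a minimal sketch using boto3, assuming AWS credentials and a default region are configured; the database name, SQL, and results bucket are placeholders, and `build_query_request` / `run_query` are helper names invented for this note.

```python
import time


def build_query_request(sql, database, output_location):
    """Build the keyword arguments for Athena's StartQueryExecution call."""
    return {
        "QueryString": sql,
        "QueryExecutionContext": {"Database": database},
        # Athena writes query results to this S3 location.
        "ResultConfiguration": {"OutputLocation": output_location},
    }


def run_query(sql, database, output_location, poll_seconds=1.0):
    """Submit a query, wait for it to finish, and return the result rows."""
    import boto3  # imported here so the request builder works without boto3

    athena = boto3.client("athena")
    request = build_query_request(sql, database, output_location)
    query_id = athena.start_query_execution(**request)["QueryExecutionId"]

    # Queries run asynchronously, so poll until a terminal state is reached.
    while True:
        execution = athena.get_query_execution(QueryExecutionId=query_id)
        state = execution["QueryExecution"]["Status"]["State"]
        if state in ("SUCCEEDED", "FAILED", "CANCELLED"):
            break
        time.sleep(poll_seconds)

    if state != "SUCCEEDED":
        raise RuntimeError(f"query {query_id} ended in state {state}")
    # Only the first page of results is returned in this sketch.
    return athena.get_query_results(QueryExecutionId=query_id)["ResultSet"]["Rows"]


# Placeholder names: a "weblogs" table in a "logs_db" database.
request = build_query_request(
    "SELECT status, count(*) AS hits FROM weblogs GROUP BY status",
    "logs_db",
    "s3://example-bucket/athena-results/",
)
```

    Calling `run_query(...)` with the same arguments would execute the query against a real account; larger result sets would need pagination.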

    Integration with Glue

    • A Glue Crawler populates the Glue Data Catalog for data in S3
    • Athena sees the Glue Data Catalog and builds tables from it automatically
    • Athena provides a SQL interface over the underlying Glue structure
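
    The crawler-to-catalog flow above can be sketched with boto3 as follows. This assumes credentials are configured and that an IAM role with Glue and S3 access already exists; the crawler name, role ARN, database, and S3 path are all placeholders, and the helper names are invented for this note.

```python
import time


def build_crawler_request(name, role_arn, database, s3_path):
    """Build the keyword arguments for Glue's CreateCrawler call."""
    return {
        "Name": name,
        "Role": role_arn,
        # Tables discovered by the crawler are written to this catalog database.
        "DatabaseName": database,
        "Targets": {"S3Targets": [{"Path": s3_path}]},
    }


def crawl_and_list_tables(name, role_arn, database, s3_path, poll_seconds=10.0):
    """Create and run a crawler, then list the tables now in the catalog."""
    import boto3  # imported here so the request builder works without boto3

    glue = boto3.client("glue")
    glue.create_crawler(**build_crawler_request(name, role_arn, database, s3_path))
    glue.start_crawler(Name=name)

    # The crawler returns to the READY state once the run has finished.
    while glue.get_crawler(Name=name)["Crawler"]["State"] != "READY":
        time.sleep(poll_seconds)

    # These catalog tables are what Athena can then query with SQL.
    # Only the first page of tables is returned in this sketch.
    tables = glue.get_tables(DatabaseName=database)["TableList"]
    return [table["Name"] for table in tables]


# Placeholder values for illustration only.
crawler_request = build_crawler_request(
    "weblogs-crawler",
    "arn:aws:iam::123456789012:role/ExampleGlueRole",
    "logs_db",
    "s3://example-bucket/weblogs/",
)
```

    Once the crawler has run, the tables it created in the catalog database can be queried from Athena without writing any DDL by hand.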

    Cross-region concerns

    • Athena cannot query across regions on its own, but a Glue Crawler can
    • So you can query S3 data across regions, provided you query a Glue Data Catalog in the same region as Athena
    • The Glue Crawler that created that Data Catalog must also have spanned multiple regions
