Connect your Athena

To connect to Athena, Whaly needs some credentials. This guide details the necessary steps:

  1. Create an IAM User and generate an Access Key (+secret)

  2. Select your region & workgroup

Prerequisites

To connect Athena to Whaly, you need the following:

To save costs on the output bucket, you can configure a bucket lifecycle rule that deletes every file after 1 day, since the results are no longer used once a query has completed.
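As an illustration of the lifecycle rule described above, here is a minimal Python sketch that builds a 1-day expiration rule in the JSON shape the S3 lifecycle API expects. The function name `build_expiration_rule` and the rule ID are hypothetical and chosen only for this example; the empty prefix assumes the bucket is dedicated to query results.

```python
import json


def build_expiration_rule(days: int = 1, rule_id: str = "expire-athena-results") -> dict:
    """Build an S3 lifecycle configuration that expires every object after `days` days."""
    if days < 1:
        raise ValueError("days must be a positive integer")
    return {
        "Rules": [
            {
                "ID": rule_id,              # hypothetical rule name, pick your own
                "Status": "Enabled",
                "Filter": {"Prefix": ""},   # empty prefix: the rule covers the whole bucket
                "Expiration": {"Days": days},
            }
        ]
    }


if __name__ == "__main__":
    # Print the configuration so it can be reviewed or saved to a file.
    print(json.dumps(build_expiration_rule(), indent=2))
```

The resulting dictionary can be passed as `LifecycleConfiguration` to boto3's `put_bucket_lifecycle_configuration`, or saved as JSON and applied with `aws s3api put-bucket-lifecycle-configuration`. If the bucket also holds other data, restrict the rule with a non-empty prefix instead.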

Create an IAM User and generate an Access Key (+secret)

To connect to AWS Athena, Whaly needs an IAM user and its credentials (an access key). To create this user, please follow this guide.

When asked which permissions and policies the user should have, create a custom policy that grants the following rights:

In the policy definition, you need to fill in the ARNs of the S3 buckets that Whaly will have access to.

The Whaly user needs access to 2 kinds of S3 buckets:

  • Input buckets: the buckets that hold the data queried by Athena

  • A single output bucket: the bucket that Whaly will use to store the query results
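To make the two kinds of buckets concrete, the sketch below shows how their ARNs could be filled into the S3 part of a policy. It is an illustration only: the action lists are an assumption based on common Athena setups, not Whaly's official list of rights, and the Athena and Glue permissions are deliberately left out. The bucket names and the helper names `bucket_arns` and `build_s3_statements` are hypothetical, and the `arn:aws:s3:::` prefix assumes the standard AWS partition.

```python
import json


def bucket_arns(bucket: str) -> list:
    """Return the bucket-level and object-level ARNs for an S3 bucket."""
    return [f"arn:aws:s3:::{bucket}", f"arn:aws:s3:::{bucket}/*"]


def build_s3_statements(input_buckets: list, output_bucket: str) -> list:
    """Build read-only statements for input buckets and read/write for the output bucket."""
    # Assumed action sets; adjust them to the rights listed in this guide.
    read_actions = ["s3:GetBucketLocation", "s3:ListBucket", "s3:GetObject"]
    write_actions = read_actions + ["s3:PutObject", "s3:AbortMultipartUpload"]
    input_resources = [arn for b in input_buckets for arn in bucket_arns(b)]
    return [
        {
            "Sid": "ReadInputBuckets",
            "Effect": "Allow",
            "Action": read_actions,
            "Resource": input_resources,
        },
        {
            "Sid": "ReadWriteOutputBucket",
            "Effect": "Allow",
            "Action": write_actions,
            "Resource": bucket_arns(output_bucket),
        },
    ]


if __name__ == "__main__":
    policy = {
        "Version": "2012-10-17",
        # Hypothetical bucket names used only for this example.
        "Statement": build_s3_statements(["my-data-lake"], "my-athena-results"),
    }
    print(json.dumps(policy, indent=2))
```

Each bucket appears twice in `Resource` because bucket-level actions such as `s3:ListBucket` apply to the bucket ARN, while object-level actions such as `s3:GetObject` apply to the `/*` ARN.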

Select your region & workgroup

To query your Athena data, Whaly needs to know in which region you want to run the compute. It must be one of the AWS Regions (e.g. us-east-1). A good practice is to use the same region you use when running SQL queries in the Athena console.

You also need to select an existing workgroup or create one. Inside Whaly, enter the name of the workgroup you want to use when querying your data.
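Region names are easy to mistype, so it can help to sanity-check both values before entering them in Whaly. The sketch below is a hypothetical helper, not part of Whaly: the `AthenaSettings` class and the regular expression are assumptions for illustration. The pattern only checks the general shape of a region code and does not confirm that the region exists or offers Athena.

```python
import re
from dataclasses import dataclass

# Shape of an AWS region code, e.g. "us-east-1" or "us-gov-west-1".
REGION_PATTERN = re.compile(r"^[a-z]{2}(-[a-z]+)+-\d+$")


@dataclass(frozen=True)
class AthenaSettings:
    """Hypothetical container for the region and workgroup values."""

    region: str
    workgroup: str

    def __post_init__(self):
        # Reject values such as "us-east1" that lack the final hyphen.
        if not REGION_PATTERN.match(self.region):
            raise ValueError(f"'{self.region}' does not look like an AWS region code")
        # The workgroup must be a non-empty name.
        if not self.workgroup.strip():
            raise ValueError("workgroup name must not be empty")


if __name__ == "__main__":
    # "primary" is the name of the default workgroup in Athena.
    print(AthenaSettings(region="us-east-1", workgroup="primary"))
```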
