First, write down the names or path locations for the S3 bucket with your data and the S3 bucket where the query results are stored. You’ll need both ARNs to set up the policies.
- Open your AWS Management Console then click Services > IAM from the left of the top navigation bar.
- From the left sidebar, click Policies, then click Create Policy from the top of the page.
- Stay on the Visual Editor tab. For the Service, search for S3 in the search bar and select it from the results.
- For Actions, select the following:
- List > ListBucket
- Read > GetObject
- For Resources, provide the following:
- Bucket: Click Add ARN then provide the path location of your S3 bucket. Click Add.
- Object: Click Add ARN then provide the path location of your S3 bucket for the Bucket name and select Any for the Object name. Click Add.
- Click Review policy
- Provide a name (e.g., bigeye_read_access) and description for your new policy. Click Create policy when you’re done.
- From the left sidebar of the IAM page, click Users, then click Add User from the top of the page. You’ll use this user to connect to Bigeye.
- Provide a username for the new user (e.g., bigeye_user) and select Programmatic access for the access type. Click Next:Permissions to continue.
- Click Attach existing policies directly then add select the following policies:
- The read access policy you created
- Click Next:Tags, add your tags (optional), then Next:Review. Review your user details then click Create user.
- Upon creating the user, an access key will be generated. Copy the Access key ID and the generated Secret access key to a secure location (or download and securely store the info as a CSV file); you’ll need those to connect Athena to Bigeye. The secret key can only be accessed from this user creation screen. If you leave the screen without copying the secret key, you’ll need to create a new one.
Host: name of the server that hosts the database you want to connect to. The format should be as follows: athena.[region].amazonaws.com. For example: athena.us-east-1.amazonaws.com
Name of the S3 staging directory: for example, s3://aws-athena-query-results-123456785678-us-eastexample-2/
From the Catalog tab in the sidebar, select Add Source. This will take you to a page where you can configure the connection to your database.
To connect a database, you will need to input the following information:
- Vendor - AWS Athena
- Name - any string
- Host - host from step 3
- Port - 443 (this is also the default port if left blank)
- Database - S3 staging directory from step 3
- Username - AWS access key ID from step 4
- Password - AWS secret access key from step 4
Once filled out, the form should look something like this
Bigeye will now start loading and profiling your tables. It can take up to 24 hours for the profiling to complete and your autometrics/autothresholds to populate. You can now proceed to deploying autometrics in the Getting Started guide.
Updated 5 months ago