Objectives:
• Get familiar with S3 and Athena
• Use QuickSight to visualize data in S3 via Athena
Data:
• Elastic Load Balancing Log, more details: docs.aws.amazon.com/elasticlo...
• NOAA Global Historical Climatology Network Daily (GHCN-D) data, more details: docs.opendata.aws/noaa-ghcn-p...
Steps:
1. Create a bucket in S3.
1.1. Log in to AWS Educate. In the classroom of this course, go to the AWS Console.
1.2. In AWS Console, choose S3, create a bucket, and name it lab5-your_last_name.
1.3. Optional: try to upload some files to the bucket.
2. Use Athena to query the ELB log data in S3
2.1. Go back to the AWS console, and choose Athena.
2.2. Click Get Started, and set the query result location to the bucket created in step1.
2.3. In the Query Editor, type the codes from github.com/xbwei/data-analysi...
• Load data from the Athena-examples bucket to your Athena services. You will see a table named elb_logs_orc being created,
• Run some simple queries on the loaded data.
3. Visualzie S3 data in QuickSight.
3.1. In QuickSight, create a new analysis and a new data set.
3.2. Choose Athena as the data source, and select blogdb/orginal_csv.
3.3. Choose Edit/Preview data, and uncheck obstime, mlfg, aflg and sflag filed.
3.4. Convert the date filed to date data type
3.5. Don’t use SPICE because the dataset is too huge.
3.6. Create a filter to show snow and prcp records in the element field.
3.7. Create two visualizations showing the number of records for snow and precipitation and the recorded snows and precipitations over the past years.
3.8. Publish your dashboard as your_last_name_first_name_lab5.
3.9. Share your dashboard with the instructor’s QuickSight account (weixx@jmu.edu).
Негізгі бет Use
Пікірлер: 2