【Day 15】做題庫小試身手 - 2 - iT 邦幫忙::一起幫忙解決難題，拯救 IT 人的一天

2024 iThome 鐵人賽

DAY 15

佛心分享-我的證照是這樣攻略的

【Day 15】做題庫小試身手 - 2

16th鐵人賽 aws dea-c01 data engineer

611 瀏覽

[ ] A. Use Amazon Aurora for data storage. Use an Amazon Redshift provisioned cluster for data analysis.
[x] B. Use Amazon S3 for data storage. Use Amazon Athena for data analysis.
[ ] C. Use AWS Glue DataBrew for centralized data governance and access control.
[ ] D. Use Amazon RDS for data storage. Use Amazon EMR for data analysis.
[x] E. Use AWS Lake Formation for centralized data governance and access control.

選項A，Aurora 是 RDS 的一種，可以作為 PostgreSQL 或是 MySQL 的取代。不論哪種，都是關聯式資料庫。
選項B，用 S3 來集中存放資料，較符合 data mesh 這種不知道會給你什麼資料，通通都可以存。而 Athena 可以針對 S3 存放的資料建立表格，提供 SQL 語法讓使用者去撈。
選項C，提到了 AWS Glue DataBrew 是用來做 ETL 的工具，而不是用來做權限管控的工具。
選項D，與選項A都是關聯式資料儲存，不合用。
選項E，AWS Lake Formation 可以去讀取 RDS / DynamoDB / S3 集中菜渣，並提供權限管控。
https://d1.awsstatic.com/diagrams/Lake-formation-HIW.9ea3fab3b2ac697a42ae7a805b986278ffd4f41e.png

[ ] A. Store a pointer to the custom Python scripts in the execution context object in a shared Amazon S3 bucket.
[x] B. Package the custom Python scripts into Lambda layers. Apply the Lambda layers to the Lambda functions.
[ ] C. Store a pointer to the custom Python scripts in environment variables in a shared Amazon S3 bucket.
[ ] D. Assign the same alias to each Lambda function. Call reach Lambda function by specifying the function's alias.

選項A，的 Step Functions 是「無伺服器工作微服務工作編排」，其主要的用途是用來觸發 AWS 上的服務、Lambda function，並且接收觸發 function 的回傳結果，根據狀態不同去分別觸發不同任務。
- 原則上可以串 AWS 的服務，但是要去和 MS SQL Srever 對接，有額外的工（需要寫程式）
- 費用，可以免費用四千次，之後每一千次收費 0.025： https://aws.amazon.com/tw/step-functions/pricing/
選項B，相較於 A，Glue 可以透過 connector 和 MS SQL Server 對接。
選項C，Glue Studio 指的是 Glue 底下的一個可視覺化編輯的開發介面。
- 可參考文件 https://docs.aws.amazon.com/zh_tw/glue/latest/dg/what-is-glue.html
Amazon Managed Workflows for Apache Airflow (Amazon MWAA) 算是第三方的服務託管在 AWS 上，然後蠻貴。
- 費用，一小時至少要花掉 0.5 USD
- https://aws.amazon.com/tw/managed-workflows-for-apache-airflow/pricing/