Inspirational journeys

Follow the stories of academics and their research expeditions

AWS Certified Machine Learning Specialty Questions 2022 - Part 26

Mary Smith

Sat, 18 Apr 2026

AWS Certified Machine Learning Specialty Questions 2022 - Part 26

1. Which of the following statements about Amazon FSx for Lustre is true?

A) Amazon FSx for Lustre provides a managed, high-performance file system that is optimized for compute-intensive workloads.
B) Amazon FSx for Lustre provides a managed, high-performance file system that is optimized for general-purpose workloads.
C) Amazon FSx for Lustre is a self-managed file system that requires manual configuration and optimization for compute-intensive workloads.
D) Amazon FSx for Lustre is not recommended for use with compute-intensive workloads due to potential performance issues.



2. Which of the following options is NOT a valid method for encrypting objects in Amazon S3?

A) SSE-KMS (Server-Side Encryption with AWS Key Management Service)
B) SSE-S3 (Server-Side Encryption with Amazon S3-managed keys)
C) Client-Side Encryption with AWS KMS-managed customer master key (CMK)
D) Client-Side Encryption with client-managed keys (CSE-C)



3. Which of the following statements accurately describe the capabilities of Amazon S3 Select for machine learning workloads?

A) Amazon S3 Select allows you to query data stored in S3 using SQL without having to load the entire object, making it a useful tool for machine learning workloads.
B) Amazon S3 Select allows you to train machine learning models directly on data stored in S3, eliminating the need for data transfer to other storage systems.
C) Amazon S3 Select provides real-time streaming of data stored in S3 for machine learning workloads.
D) Amazon S3 Select supports querying data stored in any file format for machine learning workloads.
E) Amazon S3 Select allows you to directly run Python code on data stored in S3 for machine learning workloads.


4. As a data engineer, you need to design a pipeline that extracts data from an Amazon RDS instance, processes the data using an AWS Lambda function, and stores the results in an Amazon S3 bucket. Which of the following statements is true regarding AWS Data Pipeline?

A) AWS Data Pipeline supports real-time data processing and is ideal for use cases where you need to process streaming data.
B) AWS Data Pipeline allows you to execute AWS Lambda functions as part of your data processing workflow.
C) AWS Data Pipeline only supports data sources that are stored in Amazon S3.
D) AWS Data Pipeline provides a graphical interface for creating data processing workflows, but you cannot modify the workflows using code.
E) AWS Data Pipeline requires you to define the input and output data formats explicitly, which can limit the flexibility of your data processing workflow.


5. In Amazon Machine Learning, what is the recommended approach for dealing with imbalanced datasets in binary classification tasks?

A) Oversampling the minority class with SMOTE
B) Undersampling the majority class with RandomUnderSampler
C) Using the default threshold for the binary classifier
D) Adjusting the threshold for the binary classifier to prioritize precision over recall
E) Using the default imbalanced dataset handling options in Amazon Machine Learning


1. Right Answer: A
Explanation:

2. Right Answer: D
Explanation:

3. Right Answer: A
Explanation:

4. Right Answer: B
Explanation:

5. Right Answer: D
Explanation:

0 Comments

Leave a comment