Amazon AWS Certified Machine Learning - Specialty Exam Practice Questions (P. 2)
Question #11
A Machine Learning Specialist receives customer data for an online shopping website. The data includes demographics, past visits, and locality information. The Specialist must develop a machine learning approach to identify the customer shopping patterns, preferences, and trends to enhance the website for better service and smart recommendations.
Which solution should the Specialist recommend?
- A. Latent Dirichlet Allocation (LDA) for the given collection of discrete data to identify patterns in the customer database.
- B. A neural network with a minimum of three layers and random initial weights to identify patterns in the customer database.
- C. Collaborative filtering based on user interactions and correlations to identify patterns in the customer database. (Most Voted)
- D. Random Cut Forest (RCF) over random subsamples to identify patterns in the customer database.
Correct Answer:
C

Collaborative filtering, consistent with the consensus among user comments, is the most appropriate approach for this scenario, where the goal is to identify customer shopping patterns, trends, and preferences from user interactions and correlations. LDA and Random Cut Forest target different problem types (topic modeling and anomaly detection, respectively), and a neural network, although versatile, would need more problem-specific design and tuning, so it is not the default choice here. Collaborative filtering stands out because it directly uses user interaction data to generate personalized recommendations.
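To make the idea concrete, here is a minimal sketch of user-based collaborative filtering in plain Python. The users, items, and ratings are invented toy data, and the function names are hypothetical; a production recommender would more likely use a managed service or a matrix factorization library.

```python
from math import sqrt

# Hypothetical toy interaction data: user -> {item: rating}.
ratings = {
    "ana":   {"laptop": 5, "mouse": 4, "desk": 1},
    "ben":   {"laptop": 4, "mouse": 5, "monitor": 5},
    "chloe": {"desk": 5, "lamp": 4, "laptop": 1},
}

def cosine(u, v):
    """Cosine similarity between two sparse rating dicts."""
    shared = set(u) & set(v)
    if not shared:
        return 0.0
    dot = sum(u[i] * v[i] for i in shared)
    norm_u = sqrt(sum(x * x for x in u.values()))
    norm_v = sqrt(sum(x * x for x in v.values()))
    return dot / (norm_u * norm_v)

def recommend(user, ratings, top_n=2):
    """Rank items the user has not rated by similarity-weighted ratings of other users."""
    scores, weights = {}, {}
    for other, other_ratings in ratings.items():
        if other == user:
            continue
        sim = cosine(ratings[user], other_ratings)
        if sim <= 0:
            continue
        for item, rating in other_ratings.items():
            if item in ratings[user]:
                continue  # only recommend unseen items
            scores[item] = scores.get(item, 0.0) + sim * rating
            weights[item] = weights.get(item, 0.0) + sim
    ranked = sorted(scores, key=lambda i: scores[i] / weights[i], reverse=True)
    return ranked[:top_n]

print(recommend("ana", ratings))
```

In this toy data "ana" rates items much like "ben" does, so the item only "ben" has rated ranks ahead of the item that comes from the less similar "chloe".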
Question #12
A Machine Learning Specialist is working with a large company to leverage machine learning within its products. The company wants to group its customers into categories based on which customers will and will not churn within the next 6 months. The company has labeled the data available to the Specialist.
Which machine learning model type should the Specialist use to accomplish this task?
- A. Linear regression
- B. Classification (Most Voted)
- C. Clustering
- D. Reinforcement learning
Correct Answer:
B
The goal of classification is to determine the class or category to which a data point (a customer, in this case) belongs. For classification problems, data scientists use historical data with predefined target variables, also known as labels (churner/non-churner), which are the answers to be predicted, to train an algorithm. With classification, businesses can answer questions such as:
✑ Will this customer churn or not?
✑ Will a customer renew their subscription?
✑ Will a user downgrade a pricing plan?
✑ Are there any signs of unusual customer behavior?
Reference:
https://www.kdnuggets.com/2019/05/churn-prediction-machine-learning.html
Question #13
The displayed graph is from a forecasting model for testing a time series.

Considering the graph only, which conclusion should a Machine Learning Specialist make about the behavior of the model?

- A. The model predicts both the trend and the seasonality well. (Most Voted)
- B. The model predicts the trend well, but not the seasonality.
- C. The model predicts the seasonality well, but not the trend.
- D. The model does not predict the trend or the seasonality well.
Correct Answer:
A
Question #14
A company wants to classify user behavior as either fraudulent or normal. Based on internal research, a Machine Learning Specialist would like to build a binary classifier based on two features: age of account and transaction month. The class distribution for these features is illustrated in the figure provided.

Based on this information, which model would have the HIGHEST accuracy?

- A. Long short-term memory (LSTM) model with scaled exponential linear unit (SELU)
- B. Logistic regression
- C. Support vector machine (SVM) with non-linear kernel (Most Voted)
- D. Single perceptron with tanh activation function
Correct Answer:
C
Question #15
A Machine Learning Specialist at a company sensitive to security is preparing a dataset for model training. The dataset is stored in Amazon S3 and contains Personally Identifiable Information (PII).
The dataset:
✑ Must be accessible from a VPC only.
✑ Must not traverse the public internet.
How can these requirements be satisfied?
- A. Create a VPC endpoint and apply a bucket access policy that restricts access to the given VPC endpoint and the VPC. (Most Voted)
- B. Create a VPC endpoint and apply a bucket access policy that allows access from the given VPC endpoint and an Amazon EC2 instance.
- C. Create a VPC endpoint and use Network Access Control Lists (NACLs) to allow traffic between only the given VPC endpoint and an Amazon EC2 instance.
- D. Create a VPC endpoint and use security groups to restrict access to the given VPC endpoint and an Amazon EC2 instance.
Correct Answer:
A

Because the dataset contains PII and is stored in Amazon S3, a VPC endpoint is essential so that traffic to S3 does not traverse the public internet. A bucket access policy that restricts access to that VPC endpoint and the VPC then ensures the data is reachable only from inside the VPC, which addresses both requirements. No EC2-specific rules are needed, and NACLs and security groups filter network traffic rather than govern who may access the bucket, so a restrictive bucket policy gives a streamlined and secure access route that adheres strictly to the given constraints.
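As an illustration of what such a restrictive bucket policy might look like, the sketch below builds one as a Python dictionary and prints it as JSON. It follows the commonly documented pattern of denying every request that does not arrive through a given endpoint, using the `aws:SourceVpce` condition key. The bucket name and endpoint ID are placeholders, and the helper name is hypothetical.

```python
import json

def build_vpce_only_policy(bucket_name, vpce_id):
    """Return a bucket policy that denies all S3 actions unless the
    request comes through the given VPC endpoint."""
    return {
        "Version": "2012-10-17",
        "Statement": [
            {
                "Sid": "DenyAccessOutsideVpcEndpoint",
                "Effect": "Deny",
                "Principal": "*",
                "Action": "s3:*",
                "Resource": [
                    f"arn:aws:s3:::{bucket_name}",
                    f"arn:aws:s3:::{bucket_name}/*",
                ],
                # Requests that do not arrive via this endpoint are denied.
                "Condition": {"StringNotEquals": {"aws:SourceVpce": vpce_id}},
            }
        ],
    }

# Placeholder values for illustration only.
policy = build_vpce_only_policy("example-pii-bucket", "vpce-1a2b3c4d")
print(json.dumps(policy, indent=2))
```

Note that a Deny statement like this also blocks console and administrative access from outside the endpoint, so it is usually tested carefully before being applied.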
Question #16
During mini-batch training of a neural network for a classification problem, a Data Scientist notices that training accuracy oscillates.
What is the MOST likely cause of this issue?
- A. The class distribution in the dataset is imbalanced.
- B. Dataset shuffling is disabled.
- C. The batch size is too big.
- D. The learning rate is very high. (Most Voted)
Correct Answer:
D
Reference:
https://towardsdatascience.com/deep-learning-personal-notes-part-1-lesson-2-8946fe970b95
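A small numeric sketch can show why a very high learning rate produces oscillation. It runs plain gradient descent on the one-dimensional loss f(w) = w^2, which is a stand-in chosen for illustration and not the neural network from the question; the function name and the learning rates are assumptions.

```python
def descend(learning_rate, steps=6, w=1.0):
    """Run gradient descent on f(w) = w**2 and return the visited weights."""
    history = [w]
    for _ in range(steps):
        gradient = 2 * w              # derivative of w**2
        w = w - learning_rate * gradient
        history.append(w)
    return history

# Small step size: the weight shrinks steadily toward the minimum at 0.
print("lr=0.10:", [round(w, 3) for w in descend(0.10)])

# Very large step size: each update overshoots the minimum, so the
# weight flips sign on every step and the loss bounces around.
print("lr=0.95:", [round(w, 3) for w in descend(0.95)])
```

With the small step size the weight moves smoothly toward zero, while the large step size overshoots on every update, which is the same mechanism that makes training metrics swing back and forth.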
Question #17
An employee found a video clip with audio on a company's social media feed. The language used in the video is Spanish. English is the employee's first language, and they do not understand Spanish. The employee wants to do a sentiment analysis.
What combination of services is the MOST efficient to accomplish the task?
- A. Amazon Transcribe, Amazon Translate, and Amazon Comprehend (Most Voted)
- B. Amazon Transcribe, Amazon Comprehend, and Amazon SageMaker seq2seq
- C. Amazon Transcribe, Amazon Translate, and Amazon SageMaker Neural Topic Model (NTM)
- D. Amazon Transcribe, Amazon Translate and Amazon SageMaker BlazingText
Correct Answer:
A

Amazon Transcribe, Amazon Translate, and Amazon Comprehend together provide the most seamless path to achieving sentiment analysis from a Spanish video for an English speaker. First, Transcribe converts spoken language in the video into written Spanish. Then, Translate converts the Spanish text into English. Finally, Comprehend analyzes the English text to determine sentiment. This sequence provides a completely managed service experience without the need for additional coding or infrastructure setup, ensuring an efficient solution.
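The last two steps of that sequence can be sketched as follows. The sketch assumes the Spanish transcript has already been produced by a transcription job, and it takes the service clients as parameters so it can run offline; `translate_text` and `detect_sentiment` are the boto3 operation names, while the function name and the stub classes are hypothetical stand-ins that return canned responses.

```python
def sentiment_of_spanish_text(spanish_text, translate_client, comprehend_client):
    """Translate Spanish text to English, then detect its sentiment."""
    translated = translate_client.translate_text(
        Text=spanish_text,
        SourceLanguageCode="es",
        TargetLanguageCode="en",
    )
    english_text = translated["TranslatedText"]
    result = comprehend_client.detect_sentiment(Text=english_text, LanguageCode="en")
    return english_text, result["Sentiment"]

# Offline stand-ins (assumptions) so the sketch runs without AWS credentials.
class StubTranslate:
    def translate_text(self, Text, SourceLanguageCode, TargetLanguageCode):
        return {"TranslatedText": "I love this product"}

class StubComprehend:
    def detect_sentiment(self, Text, LanguageCode):
        return {"Sentiment": "POSITIVE"}

print(sentiment_of_spanish_text("Me encanta este producto",
                                StubTranslate(), StubComprehend()))
```

Against the real services, the stubs would be replaced with `boto3.client("translate")` and `boto3.client("comprehend")`, given suitable credentials and a region.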
Question #18
A Machine Learning Specialist is packaging a custom ResNet model into a Docker container so the company can leverage Amazon SageMaker for training. The Specialist is using Amazon EC2 P3 instances to train the model and needs to properly configure the Docker container to leverage the NVIDIA GPUs.
What does the Specialist need to do?
- A. Bundle the NVIDIA drivers with the Docker image.
- B. Build the Docker container to be NVIDIA-Docker compatible. (Most Voted)
- C. Organize the Docker container's file structure to execute on GPU instances.
- D. Set the GPU flag in the Amazon SageMaker CreateTrainingJob request body.
Correct Answer:
B

According to the AWS documentation referenced by users in their comments, using NVIDIA GPUs from a Docker container on Amazon EC2 P3 instances requires that the container be NVIDIA-Docker compatible (option B). The NVIDIA drivers should not be bundled with the Docker image; the image should include only the CUDA toolkit. This follows the recommended practice for configuring GPU training containers in Amazon SageMaker.
Question #19
A Machine Learning Specialist is building a logistic regression model that will predict whether or not a person will order a pizza. The Specialist is trying to build the optimal model with an ideal classification threshold.
What model evaluation technique should the Specialist use to understand how different classification thresholds will impact the model's performance?
- A. Receiver operating characteristic (ROC) curve (Most Voted)
- B. Misclassification rate
- C. Root Mean Square Error (RMSE)
- D. L1 norm
Correct Answer:
A
Reference:
https://docs.aws.amazon.com/machine-learning/latest/dg/binary-model-insights.html
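To show how a ROC curve exposes the effect of the classification threshold, the sketch below computes the false positive rate and true positive rate at each candidate threshold in plain Python. The labels and scores are invented toy values, and the function name is hypothetical.

```python
def roc_points(labels, scores):
    """Return (threshold, false positive rate, true positive rate) for each
    distinct score used as a threshold; predictions are positive when
    score >= threshold."""
    positives = sum(labels)
    negatives = len(labels) - positives
    points = []
    for threshold in sorted(set(scores), reverse=True):
        tp = sum(1 for y, s in zip(labels, scores) if s >= threshold and y == 1)
        fp = sum(1 for y, s in zip(labels, scores) if s >= threshold and y == 0)
        points.append((threshold, fp / negatives, tp / positives))
    return points

# Toy data: 1 = ordered a pizza, 0 = did not; scores are model probabilities.
labels = [1, 1, 0, 1, 0, 0]
scores = [0.9, 0.8, 0.7, 0.6, 0.4, 0.2]

for threshold, fpr, tpr in roc_points(labels, scores):
    print(f"threshold={threshold:.1f}  FPR={fpr:.2f}  TPR={tpr:.2f}")
```

Lowering the threshold raises the true positive rate at the cost of more false positives, and the ROC curve is the plot of these pairs across all thresholds.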
Question #20
An interactive online dictionary wants to add a widget that displays words used in similar contexts. A Machine Learning Specialist is asked to provide word features for the downstream nearest neighbor model powering the widget.
What should the Specialist do to meet these requirements?
- A. Create one-hot word encoding vectors.
- B. Produce a set of synonyms for every word using Amazon Mechanical Turk.
- C. Create word embedding vectors that store edit distance with every other word.
- D. Download word embeddings pre-trained on a large corpus. (Most Voted)
Correct Answer:
D
Reference:
https://aws.amazon.com/blogs/machine-learning/amazon-sagemaker-object2vec-adds-new-features-that-support-automatic-negative-sampling-and-speed-up-training/
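The difference between dense word embeddings and one-hot vectors for a nearest neighbor model can be seen in a few lines of Python. The three-dimensional embedding values below are made up for illustration (real pre-trained embeddings have hundreds of dimensions), and the helper names are hypothetical.

```python
from math import sqrt

def cosine(a, b):
    """Cosine similarity between two dense vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    return dot / (sqrt(sum(x * x for x in a)) * sqrt(sum(y * y for y in b)))

# Made-up toy embeddings: words used in similar contexts get similar vectors.
embeddings = {
    "happy":  [0.9, 0.1, 0.0],
    "joyful": [0.8, 0.2, 0.1],
    "table":  [0.0, 0.1, 0.9],
}

# One-hot vectors for the same vocabulary: one dimension per word.
words = list(embeddings)
one_hot = {w: [1.0 if i == j else 0.0 for j in range(len(words))]
           for i, w in enumerate(words)}

def nearest(word, vectors):
    """Return the other word whose vector is most similar to the given word."""
    return max((w for w in vectors if w != word),
               key=lambda w: cosine(vectors[word], vectors[w]))

print(nearest("happy", embeddings))                     # neighbor by embedding
print(cosine(one_hot["happy"], one_hot["joyful"]))      # one-hot similarity
```

Every pair of distinct one-hot vectors has zero similarity, so a nearest neighbor model has nothing to rank, whereas embeddings place words from similar contexts close together.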