Amazon AWS Certified Machine Learning - Specialty Exam Practice Questions (P. 5)
Question #41
A Machine Learning Specialist is building a prediction model for a large number of features using linear models, such as linear regression and logistic regression.
During exploratory data analysis, the Specialist observes that many features are highly correlated with each other. This may make the model unstable.
What should be done to reduce the impact of having such a large number of features?
- A. Perform one-hot encoding on highly correlated features.
- B. Use matrix multiplication on highly correlated features.
- C. Create a new feature space using principal component analysis (PCA). (Most Voted)
- D. Apply the Pearson correlation coefficient.
Correct Answer: C
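Answer C addresses the root problem: PCA projects the correlated features onto a smaller set of orthogonal components, which stabilizes linear models. A minimal scikit-learn sketch (the synthetic feature matrix and variance threshold are illustrative, not from the question):

```python
import numpy as np
from sklearn.decomposition import PCA
from sklearn.preprocessing import StandardScaler

# Hypothetical feature matrix: 1,000 rows, 50 highly correlated columns
rng = np.random.default_rng(42)
base = rng.normal(size=(1000, 5))
X = np.hstack([base + 0.05 * rng.normal(size=(1000, 5)) for _ in range(10)])

# Standardize first; PCA is sensitive to feature scale
X_scaled = StandardScaler().fit_transform(X)

# Keep enough components to explain 95% of the variance
pca = PCA(n_components=0.95)
X_reduced = pca.fit_transform(X_scaled)

print(X.shape, "->", X_reduced.shape)          # e.g. (1000, 50) -> (1000, 5)
print(pca.explained_variance_ratio_.round(3))  # variance captured per component
```

The reduced components are uncorrelated by construction, so the downstream linear regression or logistic regression no longer suffers from multicollinearity.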
Question #42
A Machine Learning Specialist is implementing a full Bayesian network on a dataset that describes public transit in New York City. One of the random variables is discrete and represents the number of minutes New Yorkers wait for a bus, given that buses cycle every 10 minutes, with a mean wait of 3 minutes.
Which prior probability distribution should the ML Specialist use for this variable?
- A. Poisson distribution (Most Voted)
- B. Uniform distribution
- C. Normal distribution
- D. Binomial distribution
Correct Answer: D
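The community vote favors A: a Poisson distribution models a discrete count with a given mean rate. A quick sketch of what that prior looks like with SciPy (the mean of 3 comes from the question; the rest is illustrative):

```python
from scipy.stats import poisson

mu = 3  # mean wait in minutes, per the question

# Probability mass for each possible discrete wait of 0..10 minutes
for k in range(11):
    print(f"P(wait = {k} min) = {poisson.pmf(k, mu):.4f}")

# Sanity check: the mean and variance of a Poisson both equal mu
print("mean:", poisson.mean(mu), "variance:", poisson.var(mu))
```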
Question #43
A Data Science team within a large company uses Amazon SageMaker notebooks to access data stored in Amazon S3 buckets. The IT Security team is concerned that internet-enabled notebook instances create a security vulnerability where malicious code running on the instances could compromise data privacy.
The company mandates that all instances stay within a secured VPC with no internet access, and data communication traffic must stay within the AWS network.
How should the Data Science team configure the notebook instance placement to meet these requirements?
- A. Associate the Amazon SageMaker notebook with a private subnet in a VPC. Place the Amazon SageMaker endpoint and S3 buckets within the same VPC.
- B. Associate the Amazon SageMaker notebook with a private subnet in a VPC. Use IAM policies to grant access to Amazon S3 and Amazon SageMaker.
- C. Associate the Amazon SageMaker notebook with a private subnet in a VPC. Ensure the VPC has S3 VPC endpoints and Amazon SageMaker VPC endpoints attached to it. (Most Voted)
- D. Associate the Amazon SageMaker notebook with a private subnet in a VPC. Ensure the VPC has a NAT gateway and an associated security group allowing only outbound connections to Amazon S3 and Amazon SageMaker.
Correct Answer: C
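With answer C, traffic to S3 stays on the AWS network through a gateway endpoint in the VPC, so no NAT gateway or internet route is needed. A hedged boto3 sketch (the region, VPC ID, and route table ID are placeholders):

```python
import boto3

ec2 = boto3.client("ec2", region_name="us-east-1")  # region is illustrative

# A gateway endpoint keeps S3 traffic inside the AWS network (no internet path)
s3_endpoint = ec2.create_vpc_endpoint(
    VpcId="vpc-0123456789abcdef0",             # placeholder VPC
    ServiceName="com.amazonaws.us-east-1.s3",  # S3 gateway endpoint service
    VpcEndpointType="Gateway",
    RouteTableIds=["rtb-0123456789abcdef0"],   # placeholder route table
)
print(s3_endpoint["VpcEndpoint"]["VpcEndpointId"])
```

A matching SageMaker interface endpoint (see Question #47 below) completes the private-network setup.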
Question #44
A Machine Learning Specialist has created a deep learning neural network model that performs well on the training data but performs poorly on the test data.
Which of the following methods should the Specialist consider using to correct this? (Choose three.)
- A. Decrease regularization.
- B. Increase regularization. (Most Voted)
- C. Increase dropout. (Most Voted)
- D. Decrease dropout.
- E. Increase feature combinations.
- F. Decrease feature combinations. (Most Voted)
Correct Answer: B, C, F
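Good training accuracy with poor test accuracy is overfitting, and answers B and C fight it directly. A minimal Keras sketch combining L2 regularization and dropout (the layer sizes, input width, and 0.5 rate are arbitrary illustrations):

```python
import tensorflow as tf
from tensorflow.keras import layers, regularizers

# Small network with two overfitting countermeasures baked in
model = tf.keras.Sequential([
    layers.Input(shape=(100,)),  # 100 input features (illustrative)
    layers.Dense(64, activation="relu",
                 kernel_regularizer=regularizers.l2(1e-3)),  # B: increase regularization
    layers.Dropout(0.5),                                     # C: increase dropout
    layers.Dense(32, activation="relu",
                 kernel_regularizer=regularizers.l2(1e-3)),
    layers.Dropout(0.5),
    layers.Dense(1, activation="sigmoid"),
])
model.compile(optimizer="adam", loss="binary_crossentropy", metrics=["accuracy"])
model.summary()
```

Answer F works the same direction: fewer engineered feature combinations means a simpler hypothesis space and less capacity to memorize the training set.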
Question #45
A Data Scientist needs to create a serverless ingestion and analytics solution for high-velocity, real-time streaming data.
The ingestion process must buffer and convert incoming records from JSON to a query-optimized, columnar format without data loss. The output datastore must be highly available, and Analysts must be able to run SQL queries against the data and connect to existing business intelligence dashboards.
Which solution should the Data Scientist build to satisfy the requirements?
- A. Create a schema in the AWS Glue Data Catalog of the incoming data format. Use an Amazon Kinesis Data Firehose delivery stream to stream the data and transform the data to Apache Parquet or ORC format using the AWS Glue Data Catalog before delivering to Amazon S3. Have the Analysts query the data directly from Amazon S3 using Amazon Athena, and connect to BI tools using the Athena Java Database Connectivity (JDBC) connector. (Most Voted)
- B. Write each JSON record to a staging location in Amazon S3. Use the S3 Put event to trigger an AWS Lambda function that transforms the data into Apache Parquet or ORC format and writes the data to a processed data location in Amazon S3. Have the Analysts query the data directly from Amazon S3 using Amazon Athena, and connect to BI tools using the Athena Java Database Connectivity (JDBC) connector.
- C. Write each JSON record to a staging location in Amazon S3. Use the S3 Put event to trigger an AWS Lambda function that transforms the data into Apache Parquet or ORC format and inserts it into an Amazon RDS PostgreSQL database. Have the Analysts query and run dashboards from the RDS database.
- D. Use Amazon Kinesis Data Analytics to ingest the streaming data and perform real-time SQL queries to convert the records to Apache Parquet before delivering to Amazon S3. Have the Analysts query the data directly from Amazon S3 using Amazon Athena and connect to BI tools using the Athena Java Database Connectivity (JDBC) connector.
Correct Answer: A
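Answer A leans on Kinesis Data Firehose's built-in record format conversion, which reads the schema from the Glue Data Catalog and emits Parquet without custom code. A hedged boto3 sketch (all ARNs, the stream name, and the Glue database/table are placeholders):

```python
import boto3

firehose = boto3.client("firehose", region_name="us-east-1")  # region illustrative

firehose.create_delivery_stream(
    DeliveryStreamName="json-to-parquet-stream",  # placeholder name
    DeliveryStreamType="DirectPut",
    ExtendedS3DestinationConfiguration={
        "RoleARN": "arn:aws:iam::123456789012:role/firehose-role",  # placeholder
        "BucketARN": "arn:aws:s3:::analytics-bucket",               # placeholder
        "BufferingHints": {"IntervalInSeconds": 300, "SizeInMBs": 64},
        "DataFormatConversionConfiguration": {
            "Enabled": True,
            # Incoming JSON is deserialized, then written out as Parquet
            "InputFormatConfiguration": {"Deserializer": {"OpenXJsonSerDe": {}}},
            "OutputFormatConfiguration": {"Serializer": {"ParquetSerDe": {}}},
            # The schema comes from the Glue Data Catalog, as the answer describes
            "SchemaConfiguration": {
                "DatabaseName": "streaming_db",  # placeholder Glue database
                "TableName": "transit_events",   # placeholder Glue table
                "RoleARN": "arn:aws:iam::123456789012:role/firehose-role",
            },
        },
    },
)
```

Athena then queries the Parquet files in S3 directly, and BI dashboards attach through the Athena JDBC connector.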
Question #46
An online reseller has a large, multi-column dataset with one column missing 30% of its data. A Machine Learning Specialist believes that certain columns in the dataset could be used to reconstruct the missing data.
Which reconstruction approach should the Specialist use to preserve the integrity of the dataset?
- A. Listwise deletion
- B. Last observation carried forward
- C. Multiple imputation (Most Voted)
- D. Mean substitution
Correct Answer: C
Reference: https://worldwidescience.org/topicpages/i/imputing+missing+values.html
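Multiple imputation estimates each missing value from the other columns rather than dropping rows or substituting a single mean. scikit-learn's IterativeImputer (inspired by MICE) is one common way to sketch the idea; the toy DataFrame below is hypothetical, with 30% of one column missing as in the question:

```python
import numpy as np
import pandas as pd
from sklearn.experimental import enable_iterative_imputer  # noqa: F401
from sklearn.impute import IterativeImputer

# Toy dataset: 'price' is missing 30% of its values but correlates with the others
rng = np.random.default_rng(0)
n = 200
cost = rng.uniform(10, 100, n)
df = pd.DataFrame({
    "cost": cost,
    "weight": cost * 0.1 + rng.normal(0, 1, n),
    "price": cost * 1.5 + rng.normal(0, 2, n),
})
df.loc[rng.choice(n, size=60, replace=False), "price"] = np.nan

# Each missing 'price' is modeled from 'cost' and 'weight'
imputer = IterativeImputer(random_state=0, max_iter=10)
df_imputed = pd.DataFrame(imputer.fit_transform(df), columns=df.columns)

print(df["price"].isna().sum(), "missing before;",
      df_imputed["price"].isna().sum(), "after")
```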
Question #47
A company is setting up an Amazon SageMaker environment. The corporate data security policy does not allow communication over the internet.
How can the company enable the Amazon SageMaker service without enabling direct internet access to Amazon SageMaker notebook instances?
- A. Create a NAT gateway within the corporate VPC.
- B. Route Amazon SageMaker traffic through an on-premises network.
- C. Create Amazon SageMaker VPC interface endpoints within the corporate VPC. (Most Voted)
- D. Create VPC peering with Amazon VPC hosting Amazon SageMaker.
Correct Answer: A
Reference: https://docs.aws.amazon.com/sagemaker/latest/dg/sagemaker-dg.pdf (p. 46)
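The community vote favors C: interface endpoints (AWS PrivateLink) let the notebook instances reach the SageMaker API and runtime without any internet path. A hedged boto3 sketch (the region, VPC, subnet, and security group IDs are placeholders):

```python
import boto3

ec2 = boto3.client("ec2", region_name="us-east-1")  # region illustrative

# One interface endpoint each for the SageMaker API and SageMaker Runtime
for service in ("com.amazonaws.us-east-1.sagemaker.api",
                "com.amazonaws.us-east-1.sagemaker.runtime"):
    ec2.create_vpc_endpoint(
        VpcId="vpc-0123456789abcdef0",           # placeholder VPC
        ServiceName=service,
        VpcEndpointType="Interface",
        SubnetIds=["subnet-0123456789abcdef0"],  # placeholder private subnet
        SecurityGroupIds=["sg-0123456789abcdef0"],
        PrivateDnsEnabled=True,  # resolve SageMaker DNS names privately in the VPC
    )
```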
Question #48
A Machine Learning Specialist is training a model to identify the make and model of vehicles in images. The Specialist wants to use transfer learning and an existing model trained on images of general objects. The Specialist collated a large custom dataset of pictures containing different vehicle makes and models.
What should the Specialist do to initialize the model to re-train it with the custom data?
- A. Initialize the model with random weights in all layers including the last fully connected layer.
- B. Initialize the model with pre-trained weights in all layers and replace the last fully connected layer. (Most Voted)
- C. Initialize the model with random weights in all layers and replace the last fully connected layer.
- D. Initialize the model with pre-trained weights in all layers including the last fully connected layer.
Correct Answer: B
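Answer B is the standard transfer-learning recipe: keep the pre-trained weights, swap only the classification head. A minimal Keras sketch using ResNet50 as the base (the 120 vehicle make/model classes are a hypothetical count):

```python
import tensorflow as tf
from tensorflow.keras import layers

# Pre-trained weights in all layers, minus the original classification head
base = tf.keras.applications.ResNet50(weights="imagenet", include_top=False,
                                      input_shape=(224, 224, 3), pooling="avg")
base.trainable = False  # optionally freeze the base for the first training phase

# Replace the last fully connected layer with one sized for the custom classes
num_classes = 120  # hypothetical number of make/model combinations
model = tf.keras.Sequential([
    base,
    layers.Dense(num_classes, activation="softmax"),
])
model.compile(optimizer="adam", loss="sparse_categorical_crossentropy",
              metrics=["accuracy"])
```

Retraining then only has to learn the new head (and optionally fine-tune the base), which is why the large custom dataset converges far faster than training from random weights.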
Question #49
An office security agency conducted a successful pilot using 100 cameras installed at key locations within the main office. Images from the cameras were uploaded to Amazon S3 and tagged using Amazon Rekognition, and the results were stored in Amazon ES. The agency is now looking to expand the pilot into a full production system using thousands of video cameras in its office locations globally. The goal is to identify activities performed by non-employees in real time.
Which solution should the agency consider?
- A. Use a proxy server at each local office and for each camera, and stream the RTSP feed to a unique Amazon Kinesis Video Streams video stream. On each stream, use Amazon Rekognition Video and create a stream processor to detect faces from a collection of known employees, and alert when non-employees are detected. (Most Voted)
- B. Use a proxy server at each local office and for each camera, and stream the RTSP feed to a unique Amazon Kinesis Video Streams video stream. On each stream, use Amazon Rekognition Image to detect faces from a collection of known employees and alert when non-employees are detected.
- C. Install AWS DeepLens cameras and use the DeepLens_Kinesis_Video module to stream video to Amazon Kinesis Video Streams for each camera. On each stream, use Amazon Rekognition Video and create a stream processor to detect faces from a collection on each stream, and alert when non-employees are detected.
- D. Install AWS DeepLens cameras and use the DeepLens_Kinesis_Video module to stream video to Amazon Kinesis Video Streams for each camera. On each stream, run an AWS Lambda function to capture image fragments and then call Amazon Rekognition Image to detect faces from a collection of known employees, and alert when non-employees are detected.
Correct Answer: D
Reference: https://aws.amazon.com/blogs/machine-learning/video-analytics-in-the-cloud-and-at-the-edge-with-aws-deeplens-and-kinesis-video-streams/
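The community vote favors A, which pairs Kinesis Video Streams with a Rekognition Video stream processor for real-time face search against an employee collection. A hedged boto3 sketch (all ARNs, the collection ID, and the threshold are placeholders):

```python
import boto3

rekognition = boto3.client("rekognition", region_name="us-east-1")  # illustrative

# Stream processor: faces in the live video are searched against a collection
rekognition.create_stream_processor(
    Name="office-entry-processor",  # placeholder name
    Input={"KinesisVideoStream": {
        "Arn": "arn:aws:kinesisvideo:us-east-1:123456789012:stream/camera-01/1"}},
    Output={"KinesisDataStream": {
        "Arn": "arn:aws:kinesis:us-east-1:123456789012:stream/face-matches"}},
    RoleArn="arn:aws:iam::123456789012:role/rekognition-stream-role",
    Settings={"FaceSearch": {
        "CollectionId": "known-employees",  # collection of employee faces
        "FaceMatchThreshold": 85.0,         # alert on faces below this match score
    }},
)
rekognition.start_stream_processor(Name="office-entry-processor")
```

A downstream consumer of the output Kinesis data stream would then raise alerts whenever a detected face matches no one in the employee collection.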
Question #50
A Marketing Manager at a pet insurance company plans to launch a targeted marketing campaign on social media to acquire new customers. Currently, the company has the following data in Amazon Aurora:
- Profiles for all past and existing customers
- Profiles for all past and existing insured pets
- Policy-level information
- Premiums received
- Claims paid
What steps should be taken to implement a machine learning model to identify potential new customers on social media?
- A. Use regression on customer profile data to understand key characteristics of consumer segments. Find similar profiles on social media.
- B. Use clustering on customer profile data to understand key characteristics of consumer segments. Find similar profiles on social media. (Most Voted)
- C. Use a recommendation engine on customer profile data to understand key characteristics of consumer segments. Find similar profiles on social media.
- D. Use a decision tree classifier engine on customer profile data to understand key characteristics of consumer segments. Find similar profiles on social media.
Correct Answer: C
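The community vote favors B: with no labels for "potential customer", clustering is the unsupervised way to surface customer segments whose characteristics can then be matched on social media. A toy scikit-learn sketch (the profile features and cluster count are hypothetical):

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.preprocessing import StandardScaler

# Hypothetical customer-profile features: age, premiums paid, insured pets
rng = np.random.default_rng(7)
profiles = np.column_stack([
    rng.normal(45, 12, 500),    # customer age
    rng.normal(600, 150, 500),  # premiums received
    rng.poisson(1.5, 500),      # number of insured pets
])

scaler = StandardScaler()
X = scaler.fit_transform(profiles)

# Four segments is an arbitrary illustrative choice; validate with a silhouette score
kmeans = KMeans(n_clusters=4, n_init=10, random_state=7).fit(X)

# Cluster centers, back in original units, describe each segment's key characteristics
centers = scaler.inverse_transform(kmeans.cluster_centers_)
print(np.round(centers, 1))
```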