
Explanation:
The most suitable ML models for the fraud detection system, considering the requirements, are:
Boosted Tree - XGBoost (C):
Random Forest (E):
These models are well-suited for real-time processing, outlier detection, and require minimal labeling. They can be periodically trained on aggregated data to identify anomalies in transaction patterns.
K-means, a clustering algorithm, may not be ideal for fraud detection as it doesn't explicitly model fraud or non-fraud. Decision Trees and Matrix Factorization are less suitable due to their limitations with complex patterns and large datasets.
Ultimate access to all questions.
No comments yet.
Your team is developing a real-time fraud detection system for a major international bank. The system must meet the following requirements: Process transactions in real-time from various banking applications in a standardized format, store data in real-time with some statistical aggregations, periodically train an ML model for outlier detection, provide a probability of fraud for each transaction, and minimize labeling and software development efforts. Additionally, the system must comply with international banking regulations, handle high transaction volumes without significant latency, and be scalable to accommodate future growth. Given these constraints, which two ML models would best meet these requirements? Choose two correct options.
A
Decision Tree
B
Matrix Factorization
C
Boosted Tree - XGBoost
D
K-means
E
Random Forest