Click4Ai

434.

Medium

Contextual Bandit Problem

**Example:** Imagine you are a personal shopping assistant recommending products to a customer. The customer's past purchases and preferences form the context, each candidate product is an action (an arm), and the customer's satisfaction with the recommended product is the reward. The goal is to maximize satisfaction by recommending the best product for the given context.

**Constraints:** The customer can purchase only one product at a time, and the customer's satisfaction is determined by the rating of the purchased product.
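To make the setup concrete, here is a minimal sketch of one common approach, a tabular epsilon-greedy contextual bandit. It is not the reference solution for this problem. Everything named here is an assumption for illustration: the class `EpsilonGreedyContextualBandit`, the context labels `"sporty"` and `"formal"`, and the `RATINGS` table are hypothetical, and the sketch assumes contexts are discrete and ratings are deterministic.

```python
import random
from collections import defaultdict


class EpsilonGreedyContextualBandit:
    """Hypothetical tabular bandit: one running mean reward per (context, arm)."""

    def __init__(self, n_arms, epsilon=0.1, seed=0):
        self.n_arms = n_arms
        self.epsilon = epsilon
        self.rng = random.Random(seed)
        # Per-context pull counts and mean-reward estimates, created lazily.
        self.counts = defaultdict(lambda: [0] * n_arms)
        self.values = defaultdict(lambda: [0.0] * n_arms)

    def best_arm(self, context):
        """Return the arm with the highest estimated reward for this context."""
        estimates = self.values[context]
        return max(range(self.n_arms), key=estimates.__getitem__)

    def select_arm(self, context):
        """Pick one product (arm) to recommend for the given context."""
        counts = self.counts[context]
        # Try every arm once per context before trusting the estimates.
        for arm in range(self.n_arms):
            if counts[arm] == 0:
                return arm
        # Explore with probability epsilon, otherwise exploit.
        if self.rng.random() < self.epsilon:
            return self.rng.randrange(self.n_arms)
        return self.best_arm(context)

    def update(self, context, arm, reward):
        """Fold the observed reward into the running mean for (context, arm)."""
        self.counts[context][arm] += 1
        n = self.counts[context][arm]
        self.values[context][arm] += (reward - self.values[context][arm]) / n


# Hypothetical ratings: RATINGS[context][arm] is the satisfaction (reward)
# the customer reports when product `arm` is recommended in that context.
RATINGS = {"sporty": [1.0, 4.0, 2.0], "formal": [5.0, 2.0, 3.0]}

bandit = EpsilonGreedyContextualBandit(n_arms=3, epsilon=0.1, seed=0)
contexts = list(RATINGS)
for t in range(200):
    context = contexts[t % len(contexts)]   # alternate between the contexts
    arm = bandit.select_arm(context)        # one recommendation per round
    bandit.update(context, arm, RATINGS[context][arm])

learned = {c: bandit.best_arm(c) for c in contexts}
print(learned)  # {'sporty': 1, 'formal': 0}
```

Because the assumed ratings are deterministic and each arm is tried once per context, the learned recommendation matches the highest-rated product in each context. With noisy ratings or continuous context vectors, a model-based method such as LinUCB would be a more typical choice.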

Test Cases

Test Case 1
Input: [[1, 2, 3], [4, 5, 6]]
Expected: [1, 1]
Test Case 2
Input: [[7, 8, 9], [10, 11, 12]]
Expected: [2, 2]
+ 3 hidden test cases