
Ultimate access to all questions.
You are developing a deep neural network classification model, and your dataset contains several columns with categorical input values. Some of these columns have a high cardinality, with more than 10,000 unique values. Considering the complexity and the amount of unique categorical values, what is the most suitable method to encode these categorical values for input into the model?
A
Convert each categorical value into an integer value.
B
Convert the categorical string data to one-hot hash buckets.
C
Map the categorical variables into a vector of boolean values.
D
Convert each categorical value into a run-length encoded string.