
Answer-first summary for fast verification
Answer: The argument to the subset parameter of fill() should be a string column name or a list of string column names rather than a Column object.
The code fails because it passes a Column object as the `subset` argument of `na.fill()`. The `subset` parameter expects a string column name or a list of string column names, so `col("sqft")`, which returns a Column object, is invalid there. The fix is to pass the column name as a string (e.g., `"sqft"`). Option B is incorrect because `na.fill()` is the right method for replacing missing values, whereas `dropna()` removes rows that contain nulls instead of filling them. Option C is incorrect because `subset` does not accept numerical positions. Option D is incorrect because `nafill()` does not exist. Option E is incorrect because `fillna()` is an alias of `na.fill()`, so switching to it while still passing a Column object would fail in the same way.
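As an illustration of the corrected call, here is a minimal sketch, assuming the question targets PySpark. The helper name `fill_missing_sqft`, the `demo` function, the local Spark session settings, and the `storeId` column spelling are assumptions made for this example and are not part of the original question.

```python
def fill_missing_sqft(stores_df, value=30000):
    """Return a new DataFrame with nulls in the sqft column replaced by value."""
    # subset takes column names as strings; passing col("sqft") here is rejected.
    return stores_df.na.fill(value, subset=["sqft"])


def demo():
    # Imported lazily so the helper above can be defined without a Spark install.
    from pyspark.sql import SparkSession

    spark = SparkSession.builder.master("local[1]").appName("fill-demo").getOrCreate()
    # Hypothetical rows mirroring the sample of storesDF shown in the question.
    stores_df = spark.createDataFrame(
        [(0, 43161), (1, 51200), (2, None), (3, 78367), (4, None)],
        schema="storeId INT, sqft INT",
    )
    # DataFrames are immutable, so the filled result is a new DataFrame.
    fill_missing_sqft(stores_df).show()
    spark.stop()
```

Calling `demo()` on a machine with PySpark installed should display the sample rows with the two null `sqft` values replaced by 30000. A single column can also be given as a plain string, as in `storesDF.na.fill(30000, "sqft")`.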
Author: LeetQuiz Editorial Team
Identify the error in the following code block intended to replace missing values in the sqft column of DataFrame storesDF with the value 30,000:
storesDF.na.fill(30000, col("sqft"))
Sample of DataFrame storesDF:
storeId  sqft
0        43161
1        51200
2        null
3        78367
4        null
...
A. The argument to the subset parameter of fill() should be a string column name or a list of string column names rather than a Column object.
B. The na.fill() operation does not work and should be replaced by the dropna() operation.
C. The argument to the subset parameter of fill() should be the numerical position of the column rather than a Column object.
D. The na.fill() operation does not work and should be replaced by the nafill() operation.
E. The na.fill() operation does not work and should be replaced by the fillna() operation.