
Databricks Certified Data Engineer - Professional
Get started today
Ultimate access to all questions.
Review the following error traceback:
AnalysisException
<command-3293767849433948> in <module>
---> 1 display(df.select(3 * "heartrate"))
Traceback (most recent call last):
File "/databricks/spark/python/pyspark/sql/dataframe.py", line 1692, in select
jdf = self._jdf.select(self._jcols(*cols))
File "/databricks/spark/python/lib/py4j-0.10.9-grc.zip/py4j/java_gateway.py", line 1304, in __call__
answer = self.gateway_client.send_command(command)
File "/databricks/spark/python/pyspark/sql/utils.py", line 123, in deco
raise converted from None
AnalysisException: cannot resolve 'heartrateheartrateheartrate' given input columns: [spark_catalog.database.table.device_id, spark_catalog.database.table.heartrate, spark_catalog.database.table.mrn, spark_catalog.database.table.time]:
'Project ['heartrateheartrateheartrate]
+- SubqueryAlias spark_catalog.database.table
+- Relation[device_id#75L,heartrate#76,mrn#77L,time#78] parquet
Which statement describes the error being raised?
Review the following error traceback:
AnalysisException
<command-3293767849433948> in <module>
---> 1 display(df.select(3 * "heartrate"))
Traceback (most recent call last):
File "/databricks/spark/python/pyspark/sql/dataframe.py", line 1692, in select
jdf = self._jdf.select(self._jcols(*cols))
File "/databricks/spark/python/lib/py4j-0.10.9-grc.zip/py4j/java_gateway.py", line 1304, in __call__
answer = self.gateway_client.send_command(command)
File "/databricks/spark/python/pyspark/sql/utils.py", line 123, in deco
raise converted from None
AnalysisException: cannot resolve 'heartrateheartrateheartrate' given input columns: [spark_catalog.database.table.device_id, spark_catalog.database.table.heartrate, spark_catalog.database.table.mrn, spark_catalog.database.table.time]:
'Project ['heartrateheartrateheartrate]
+- SubqueryAlias spark_catalog.database.table
+- Relation[device_id#75L,heartrate#76,mrn#77L,time#78] parquet
Which statement describes the error being raised?
Exam-Like
Explanation:
The error occurs because the code 3*"heartrate"
is using a string literal instead of a column reference. In PySpark, multiplying a string by an integer results in string repetition (e.g., 3 * 'a'
becomes 'aaa'
). Here, 3*"heartrate"
is interpreted as the string 'heartrateheartrateheartrate'
, which is treated as a column name. Since no such column exists in the DataFrame, Spark raises AnalysisException
. The correct syntax would use col("heartrate")
to reference the column, such as 3 * col("heartrate")
.