Unable to do count or isempty call in dataframe with exceptall

0

D1= spark.createDataframe("a",["b"]) Dd=d1.exceptAll(D1).count() Running the above in emr.6.8 throwing below error,

Java.lang.IllegalException: couldn't find value #400

Vaas
asked 8 months ago314 views
2 Answers
3
Accepted Answer

Hey,

Basically this is a bug in spark 3.3.0 as mentioned in this jira - https://issues.apache.org/jira/browse/SPARK-39612 caused by Prune unrequired child index - https://issues.apache.org/jira/browse/SPARK-38531.

I tested and verified at my end as well. The fix is available in park 3.3.1, 3.4.0. EMR 6.10.1 release has spark 3.3.1 that natively support this. - https://docs.aws.amazon.com/emr/latest/ReleaseGuide/emr-6101-release.html

AWS
SUPPORT ENGINEER
answered 8 months ago
0

It worked in spark 3.3.1. Thanks 😊

Vaas
answered 8 months ago

You are not logged in. Log in to post an answer.

A good answer clearly answers the question and provides constructive feedback and encourages professional growth in the question asker.

Guidelines for Answering Questions