Sort-merge join in Spark SQL - waitingforcode.com
www.waitingforcode.com › apache-spark-sql › sortAug 12, 2017 · The sort-merge join can be activated through spark.sql.join.preferSortMergeJoin property that, when enabled, will prefer this type of join over shuffle one. Among the most important classes involved in sort-merge join we should mention org.apache.spark.sql.execution.joins.SortMergeJoinExec. This is the central point dispatching code generation according to defined join type.