pyspark.sql.DataFrame.join — PySpark 3.3.0 documentation
spark.apache.org › pysparkDataFrame.join(other: pyspark.sql.dataframe.DataFrame, on: Union [str, List [str], pyspark.sql.column.Column, List [pyspark.sql.column.Column], None] = None, how: Optional[str] = None) → pyspark.sql.dataframe.DataFrame [source] ¶ Joins with another DataFrame, using the given join expression. New in version 1.3.0. Parameters other DataFrame
pyspark.pandas.DataFrame.join — PySpark 3.3.1 documentation
spark.apache.org › docs › latestJoin columns of another DataFrame. Join columns with right DataFrame either on index or on a key column. Efficiently join multiple DataFrame objects by index at once by passing a list. Parameters right: DataFrame, Series on: str, list of str, or array-like, optional. Column or index level name(s) in the caller to join on the index in right, otherwise joins index-on-index. If multiple values given, the right DataFrame must have a MultiIndex. Can pass an array as the join key if it is not ...