org.apache.spark.api.java.JavaPairRDD ... - Tabnine
www.tabnine.com › code › javaJavaPairRDD.repartitionAndSortWithinPartitions (Showing top 18 results out of 315) origin: apache / drill @Override public JavaPairRDD<HiveKey, BytesWritable> shuffle( JavaPairRDD<HiveKey, BytesWritable> input, int numPartitions) { if (numPartitions < 0 ) { numPartitions = 1 ; } return input. repartitionAndSortWithinPartitions ( new HashPartitioner(numPartitions)); }
pyspark.RDD.repartitionAndSortWithinPartitions — PySpark 3.3. ...
spark.apache.org › docs › latestRDD.repartitionAndSortWithinPartitions(numPartitions: Optional [int] = None, partitionFunc: Callable [ [Any], int] = <function portable_hash>, ascending: bool = True, keyfunc: Callable [ [Any], Any] = <function RDD.<lambda>>) → pyspark.rdd.RDD [ Tuple [ Any, Any]] [source] ¶. Repartition the RDD according to the given partitioner and, within each resulting partition, sort records by their keys.