daft.DataFrame.repartition

DataFrame.repartition(num: int, *partition_by: Union[daft.expressions.Expression, str]) → daft.dataframe.dataframe.DataFrame

Repartitions the DataFrame into num partitions.

If columns or expressions are passed in, the DataFrame is repartitioned by them; otherwise, it is repartitioned randomly.
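The difference between the two modes can be illustrated with a small plain-Python sketch. This is a conceptual illustration only, not Daft's implementation: the helpers `partition_by_key` and `partition_randomly` are hypothetical names, and hashing the key modulo the partition count is an assumed strategy chosen for simplicity.

```python
import random


def partition_by_key(rows, num, key):
    """Place each row in the partition chosen by hashing its key (assumed strategy)."""
    parts = [[] for _ in range(num)]
    for row in rows:
        # Rows with equal keys always hash to the same partition.
        parts[hash(key(row)) % num].append(row)
    return parts


def partition_randomly(rows, num, seed=0):
    """Place each row in a randomly chosen partition, ignoring its contents."""
    rng = random.Random(seed)
    parts = [[] for _ in range(num)]
    for row in rows:
        parts[rng.randrange(num)].append(row)
    return parts


rows = [{"x": i % 3, "y": i} for i in range(12)]
by_key = partition_by_key(rows, 4, key=lambda r: r["x"])
shuffled = partition_randomly(rows, 4)
```

In the keyed case, all rows sharing a value of `x` end up in the same partition, which is the property that makes key-based repartitioning useful ahead of grouped operations. In the random case, rows are spread across partitions without regard to their values.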

Example

>>> from daft import col
>>> random_repart_df = df.repartition(4)
>>> part_by_df = df.repartition(4, 'x', col('y') + 1)

Parameters
  • num (int) – target number of partitions.

  • *partition_by (Union[str, Expression]) – optional column names or expressions to partition by.

Returns

Repartitioned DataFrame.

Return type

DataFrame