• Aggregate function: returns a set of objects with duplicate elements eliminated.

    Parameters

    Returns Column

    The function is non-deterministic because the order of collected results depends on the order of the rows which may be non-deterministic after a shuffle.