DataFrame.Collect Method
Definition
Important
Some information relates to prerelease product that may be substantially modified before it’s released. Microsoft makes no warranties, express or implied, with respect to the information provided here.
Returns an array that contains all rows in this DataFrame.
public System.Collections.Generic.IEnumerable<Microsoft.Spark.Sql.Row> Collect ();
member this.Collect : unit -> seq<Microsoft.Spark.Sql.Row>
Public Function Collect () As IEnumerable(Of Row)
Returns
The Row objects contained in this DataFrame.
Remarks
This method moves all the data into the application's driver process. On a very large dataset, doing so can crash the driver process with an OutOfMemoryError.
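One way to reduce the memory risk described above is to shrink the DataFrame before collecting it. The following is a minimal sketch, not an official sample: it assumes a working .NET for Apache Spark environment, and the application name, the sample data from `Range`, and the row limit of 10 are all illustrative choices.

```csharp
using System;
using System.Collections.Generic;
using Microsoft.Spark.Sql;

class CollectExample
{
    static void Main()
    {
        // Assumes Spark and the Microsoft.Spark package are installed and configured.
        // The application name is a placeholder.
        SparkSession spark = SparkSession
            .Builder()
            .AppName("collect-example")
            .GetOrCreate();

        // Illustrative data: a single "id" column of type long holding 0..999.
        DataFrame df = spark.Range(0, 1000);

        // Apply Limit before Collect so that at most 10 rows
        // are moved into the driver process.
        IEnumerable<Row> rows = df.Limit(10).Collect();

        foreach (Row row in rows)
        {
            // Read the first column of each row as a long.
            Console.WriteLine(row.GetAs<long>(0));
        }

        spark.Stop();
    }
}
```

Narrowing the data with `Limit` (or a filter or aggregation) before calling `Collect` keeps the amount of data sent to the driver bounded, whatever the size of the source DataFrame.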