Interface used to load a Dataset from external storage systems (e.g. file systems, key-value stores, etc.). Use SparkSession.read() to access this.

Since: 1.0.0

Author: Kent Yao <yao@apache.org>

TODO: Some features are not implemented yet:

  • [ ] Add support of legacy char/varchar as string type for schema method
  • [ ] Add support of json(jsonDataset: Dataset[string]) method
  • [ ] Add support of csv(csvDataset: Dataset[string]) method
  • [ ] Add support of xml(xmlDataset: Dataset[string]) method
  • [ ] Add support of textFile(path: string, ...paths: string[]): Dataset[string] method

Constructors

Properties

Methods

  • Parameters

    • url: string
    • table: string
    • columnName: undefined | string = undefined
    • lowerBound: undefined | number = undefined
    • upperBound: undefined | number = undefined
    • numPartitions: undefined | number = undefined
    • predicates: string[] = []
    • properties: { [key: string]: string }

    Returns DataFrame
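
The parameter list above does not say how columnName, lowerBound, upperBound and numPartitions relate to predicates. In JDBC-style partitioned reads, the usual approach is to split the numeric range into one WHERE clause per partition, so that the bounds only set the stride and do not filter out any rows. The following is a minimal sketch of that idea, not this library's implementation: `rangePredicates` is a hypothetical helper, and the exact clause format is an assumption.

```typescript
// Hypothetical helper (not part of the documented API): turns a numeric
// range into one WHERE clause per partition. Rows outside the bounds and
// NULL values still land in the first or last partition.
function rangePredicates(
  columnName: string,
  lowerBound: number,
  upperBound: number,
  numPartitions: number,
): string[] {
  if (!Number.isInteger(numPartitions) || numPartitions < 1) {
    throw new RangeError("numPartitions must be a positive integer");
  }
  if (lowerBound >= upperBound) {
    throw new RangeError("lowerBound must be strictly less than upperBound");
  }

  // Never create more partitions than there are whole steps in the range.
  const parts = Math.min(
    numPartitions,
    Math.max(1, Math.floor(upperBound - lowerBound)),
  );
  // A single partition needs no filter at all.
  if (parts === 1) {
    return [];
  }

  const stride = Math.floor((upperBound - lowerBound) / parts);
  const clauses: string[] = [];
  for (let i = 0; i < parts; i++) {
    const lo = lowerBound + i * stride;
    const hi = lo + stride;
    if (i === 0) {
      // First partition is open below and also collects NULLs.
      clauses.push(`${columnName} < ${hi} OR ${columnName} IS NULL`);
    } else if (i === parts - 1) {
      // Last partition is open above.
      clauses.push(`${columnName} >= ${lo}`);
    } else {
      clauses.push(`${columnName} >= ${lo} AND ${columnName} < ${hi}`);
    }
  }
  return clauses;
}

// Example: four partitions over the range 0..100 of a hypothetical "id" column.
const examplePredicates = rangePredicates("id", 0, 100, 4);
console.log(examplePredicates.join("\n"));
```

An array built this way has the same shape as the predicates: string[] parameter, one entry per partition. Whether the method accepts both the bounds and explicit predicates in the same call is not stated in the source.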