Class

org.apache.spark.ml.source.libsvm

LibSVMDataSource

Related Doc: package libsvm

Permalink

class LibSVMDataSource extends AnyRef

libsvm package implements Spark SQL data source API for loading LIBSVM data as DataFrame. The loaded DataFrame has two columns: label containing labels stored as doubles and features containing feature vectors stored as Vectors.

To use LIBSVM data source, you need to set "libsvm" as the format in DataFrameReader and optionally specify options, for example:

// Scala
val df = spark.read.format("libsvm")
  .option("numFeatures", "780")
  .load("data/mllib/sample_libsvm_data.txt")

// Java
Dataset<Row> df = spark.read().format("libsvm")
  .option("numFeatures, "780")
  .load("data/mllib/sample_libsvm_data.txt");

LIBSVM data source supports the following options:

Source
LibSVMDataSource.scala
Note

This class is public for documentation purpose. Please don't use this class directly. Rather, use the data source API as illustrated above.

See also

LIBSVM datasets

Linear Supertypes
AnyRef, Any
Ordering
  1. Alphabetic
  2. By Inheritance
Inherited
  1. LibSVMDataSource
  2. AnyRef
  3. Any
  1. Hide All
  2. Show All
Visibility
  1. Public
  2. All