Spark ML -- Principal Components Analysis
Perform principal components analysis on a Spark DataFrame.
ml_pca(x, features = tbl_vars(x), k = length(features),
ml.options = ml_options(), ...)Arguments
| x | An object coercable to a Spark DataFrame (typically, a
|
| features | The columns to use in the principal components
analysis. Defaults to all columns in |
| k | The number of principal components. |
| ml.options | Optional arguments, used to affect the model generated. See
|
| ... | Optional arguments. The |
See also
Other Spark ML routines: ml_als_factorization,
ml_decision_tree,
ml_generalized_linear_regression,
ml_gradient_boosted_trees,
ml_kmeans, ml_lda,
ml_linear_regression,
ml_logistic_regression,
ml_multilayer_perceptron,
ml_naive_bayes,
ml_one_vs_rest,
ml_random_forest,
ml_survival_regression