Random Forest Classifier in Spark ML

MLlib is Spark’s machine learning (ML) library. Its goal is to make practical machine learning scalable and easy.

I put together a complete, step-by-step classification example on the Iris flower data set, running in the BeakerX Jupyter kernel. It covers the following steps:

Setup
Data Preparation
Testing and Prediction
Validation

The example is written in Scala, but you could use any other language that runs on the JVM.

My example can be found in this Gist.
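
The Gist itself is not reproduced here. As a rough orientation, the four steps listed above might look like the following minimal sketch. It is not the author's code: the file name `iris.csv`, the column names (`sepal_length`, `sepal_width`, `petal_length`, `petal_width`, `species`), the 70/30 split, the seed and the number of trees are all assumptions for illustration. It also presumes that Spark's SQL and MLlib modules are on the classpath.

```scala
import org.apache.spark.ml.{Pipeline, PipelineStage}
import org.apache.spark.ml.classification.RandomForestClassifier
import org.apache.spark.ml.evaluation.MulticlassClassificationEvaluator
import org.apache.spark.ml.feature.{StringIndexer, VectorAssembler}
import org.apache.spark.sql.{DataFrame, SparkSession}

object IrisRandomForest {
  // Assumed column names; adjust them to match your copy of the data set.
  val featureCols: Array[String] =
    Array("sepal_length", "sepal_width", "petal_length", "petal_width")
  val speciesCol = "species"

  // Setup: a local Spark session that uses all available cores.
  def localSession(): SparkSession =
    SparkSession.builder()
      .appName("IrisRandomForest")
      .master("local[*]")
      .getOrCreate()

  // Data Preparation: read a CSV file with a header row and inferred column types.
  def load(spark: SparkSession, path: String): DataFrame =
    spark.read
      .option("header", "true")
      .option("inferSchema", "true")
      .csv(path)

  // Data Preparation (continued): index the string label, assemble the
  // numeric columns into one feature vector, and append the classifier.
  def pipeline(): Pipeline = {
    val labelIndexer = new StringIndexer()
      .setInputCol(speciesCol)
      .setOutputCol("label")
    val assembler = new VectorAssembler()
      .setInputCols(featureCols)
      .setOutputCol("features")
    val forest = new RandomForestClassifier()
      .setLabelCol("label")
      .setFeaturesCol("features")
      .setNumTrees(20) // assumed value, not tuned
      .setSeed(42L)
    new Pipeline().setStages(Array[PipelineStage](labelIndexer, assembler, forest))
  }

  // Testing and Prediction, then Validation: train on one split, predict on
  // the held-out split, and return the accuracy of those predictions.
  def accuracy(data: DataFrame, seed: Long = 42L): Double = {
    val Array(train, test) = data.randomSplit(Array(0.7, 0.3), seed)
    val model = pipeline().fit(train)
    val predictions = model.transform(test)
    new MulticlassClassificationEvaluator()
      .setLabelCol("label")
      .setPredictionCol("prediction")
      .setMetricName("accuracy")
      .evaluate(predictions)
  }

  def main(args: Array[String]): Unit = {
    val spark = localSession()
    try {
      // "iris.csv" is a hypothetical local path.
      val data = load(spark, "iris.csv")
      println(s"Accuracy on the held-out split: ${accuracy(data)}")
    } finally {
      spark.stop()
    }
  }
}
```

Putting the label indexer and the feature assembler into the same `Pipeline` as the classifier means that the fitted model applies identical preparation to the training and the test split. In a notebook, the bodies of these methods could just as well be run cell by cell.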

I leave it up to you to replace the classifier with, for example, NaiveBayes or MultilayerPerceptronClassifier. The MLlib Programming Guide contains the right level of information and is easy to use.
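
Swapping the classifier can be as small as exchanging one pipeline stage. The sketch below is an illustration under assumptions, not the author's code: the column names (`sepal_length`, `sepal_width`, `petal_length`, `petal_width`, `species`), the helper `pipelineWith`, and the hidden layer size, iteration count and seed of the perceptron are hypothetical choices.

```scala
import org.apache.spark.ml.{Pipeline, PipelineStage}
import org.apache.spark.ml.classification.{MultilayerPerceptronClassifier, NaiveBayes}
import org.apache.spark.ml.feature.{StringIndexer, VectorAssembler}

object ClassifierSwap {
  // Assumed column names; adjust them to match your copy of the data set.
  val featureCols: Array[String] =
    Array("sepal_length", "sepal_width", "petal_length", "petal_width")

  // Hypothetical helper: the data preparation stays the same,
  // only the final stage (the classifier) is passed in.
  def pipelineWith(classifier: PipelineStage): Pipeline = {
    val labelIndexer = new StringIndexer()
      .setInputCol("species")
      .setOutputCol("label")
    val assembler = new VectorAssembler()
      .setInputCols(featureCols)
      .setOutputCol("features")
    new Pipeline().setStages(Array[PipelineStage](labelIndexer, assembler, classifier))
  }

  // The default (multinomial) NaiveBayes model requires non-negative
  // feature values, which holds for the Iris measurements.
  val naiveBayes: NaiveBayes = new NaiveBayes()
    .setLabelCol("label")
    .setFeaturesCol("features")

  // The first layer must match the number of features (4) and the last
  // layer the number of classes (3); the hidden layer size is an assumption.
  val perceptron: MultilayerPerceptronClassifier = new MultilayerPerceptronClassifier()
    .setLayers(Array(4, 8, 3))
    .setMaxIter(100)
    .setSeed(42L)
    .setLabelCol("label")
    .setFeaturesCol("features")
}
```

With a training DataFrame `train` at hand, `ClassifierSwap.pipelineWith(ClassifierSwap.naiveBayes).fit(train)` would train the alternative model. Prediction and validation would not need to change, because every classifier here writes its result to the same `prediction` column.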


Published by pschatzmann on 2. November 2018
