0 purchases
scikitmultiflow 0.5.3
scikit-multiflow is a machine learning package for streaming data in Python.
Quick links
Webpage
Documentation
Community
Features
Incremental Learning
Stream learning models are created incrementally and are updated continuously. They are suitable
for big data applications where real-time response is vital.
Adaptive learning
Changes in data distribution harm learning. Adaptive methods are specifically designed to be
robust to concept drift changes in dynamic environments.
Resource-wise efficient
Streaming techniques efficiently handle resources such as memory and processing time given the
unbounded nature of data streams.
Easy to use
scikit-multiflow is designed for users with any experience level. Experiments are easy to design,
setup, and run. Existing methods are easy to modify and extend.
Stream learning tools
In its current state, scikit-multiflow contains data generators, multi-output/multi-target stream
learning methods, change detection methods, evaluation methods, and more.
Open source
Distributed under the
BSD 3-Clause,
scikit-multiflow is developed and maintained by an active, diverse and growing community.
Use cases
The following tasks are supported in scikit-multiflow:
Supervised learning
When working with labeled data. Depending on the target type can be either classification
(discrete values) or regression (continuous values)
Single/multi output
Single-output methods predict a single target-label (binary or multi-class) for classification or
a single target-value for regression. Multi-output methods simultaneously predict multiple
variables given an input.
Concept drift detection
Changes in data distribution can harm learning. Drift detection methods are designed to rise an
alarm in the presence of drift and are used alongside learning methods to improve their robustness
against this phenomenon in evolving data streams.
Unsupervised learning
When working with unlabeled data. For example, anomaly detection where the goal is the
identification of rare events or samples which differ significantly from the majority of the data.
Jupyter Notebooks
In order to display plots from scikit-multiflow within a Jupyter Notebook we need to define
the proper mathplotlib backend to use. This is done by including the following magic command at the
beginning of the Notebook:
%matplotlib notebook
JupyterLab is the next-generation user interface
for Jupyter, currently in beta, it can display interactive plots with some caveats. If you use
JupyterLab then the current solution is to use the
jupyter-matplotlib extension:
%matplotlib widget
Citing scikit-multiflow
If scikit-multiflow has been useful for your research and you would like to cite it in a academic
publication, please use the following Bibtex entry:
@article{skmultiflow,
author = {Jacob Montiel and Jesse Read and Albert Bifet and Talel Abdessalem},
title = {Scikit-Multiflow: A Multi-output Streaming Framework },
journal = {Journal of Machine Learning Research},
year = {2018},
volume = {19},
number = {72},
pages = {1-5},
url = {http://jmlr.org/papers/v19/18-251.html}
}
For personal and professional use. You cannot resell or redistribute these repositories in their original state.
There are no reviews.