Home
Softono
LightAutoML

LightAutoML

Open source Apache-2.0 Python
1.5K
Stars
65
Forks
17
Issues
23
Watchers
5 months
Last Commit

About LightAutoML

Fast and customizable framework for automatic ML model creation (AutoML)

Platforms

Web Self-hosted

Languages

Python

Links

PyPI - Python Version PyPI - Version pypi - Downloads GitHub Workflow Status (with event) Read the Docs

Documentation | Installation | Examples | Telegram chat | Telegram channel

LightAutoML (LAMA) allows you create machine learning models using just a few lines of code, or build your own custom pipeline using ready blocks. It supports tabular, time series, image and text data.

Authors: Alexander Ryzhkov, Anton Vakhrushev, Dmitry Simakov, Rinchin Damdinov, Vasilii Bunakov, Alexander Kirilin, Pavel Shvets.

Quick tour

There are two ways to solve machine learning problems using LightAutoML:

  • Ready-to-use preset:

      from lightautoml.automl.presets.tabular_presets import TabularAutoML
      from lightautoml.tasks import Task
    
      automl = TabularAutoML(task = Task(name = 'binary', metric = 'auc'))
      oof_preds = automl.fit_predict(train_df, roles = {'target': 'my_target', 'drop': ['column_to_drop']}).data
      test_preds = automl.predict(test_df).data
  • As a framework:
    LightAutoML framework has a lot of ready-to-use parts and extensive customization options, to learn more check out the resources section.

Resources

Kaggle kernel examples of LightAutoML usage:

Google Colab tutorials and other examples:

Note 1: for production you have no need to use profiler (which increase work time and memory consomption), so please do not turn it on - it is in off state by default

Note 2: to take a look at this report after the run, please comment last line of demo with report deletion command.

Courses, videos

Installation

To install LAMA framework on your machine from PyPI:

# Base functionality:
pip install -U lightautoml

# For partial installation use corresponding option
# Extra dependencies: [nlp, cv, report] or use 'all' to install all dependencies
pip install -U lightautoml[nlp]
# Or extra dependencies with specific version
pip install 'lightautoml[all]==0.4.0'

Additionally, run following commands to enable pdf report generation:

# MacOS
brew install cairo pango gdk-pixbuf libffi

# Debian / Ubuntu
sudo apt-get install build-essential libcairo2 libpango-1.0-0 libpangocairo-1.0-0 libgdk-pixbuf2.0-0 libffi-dev shared-mime-info

# Fedora
sudo yum install redhat-rpm-config libffi-devel cairo pango gdk-pixbuf2

# Windows
# follow this tutorial https://weasyprint.readthedocs.io/en/stable/install.html#windows

Advanced features

GPU and Spark pipelines

Full GPU and Spark pipelines for LightAutoML currently available for developers testing (still in progress). The code and tutorials for:

Contributing to LightAutoML

If you are interested in contributing to LightAutoML, please read the Contributing Guide to get started.

Support and feature requests

Citation

If you mention LightAutoML in your publications, please cite our paper: Vakhrushev, et al. "LightAutoML: AutoML Solution for a Large Financial Services Ecosystem" arXiv:2109.01528, 2021.

BibTeX entry:

@article{vakhrushev2021lightautoml,
  title={Lightautoml: Automl solution for a large financial services ecosystem},
  author={Vakhrushev, Anton and Ryzhkov, Alexander and Savchenko, Maxim and Simakov, Dmitry and Damdinov, Rinchin and Tuzhilin, Alexander},
  journal={arXiv preprint arXiv:2109.01528},
  year={2021}
}

License

This project is licensed under the Apache License, Version 2.0. See LICENSE file for more details.

Back to top