Yandex open sources CatBoost machine learning library

Russian search engine creator Yandex has joined the ranks of Google, Amazon, and Microsoft by releasing its own open source machine learning library, CatBoost.

The Apache-licensed CatBoost is for “open-source gradient boosting on decision trees,” according to its GitHub repository’s README. It provides a way to perform classifications and rankings of data by using a collection of decision-making mechanisms, or “learners,” rather than a single one. Results generated by the learners are weighted and classified based on the strengths and weaknesses of each learner. By combining many learners, CatBoost can yield better results than decision-making systems that rely on individual learners.

CatBoost comes with support for Python and R, as well as a command-line interface to drive the machine learning library. The Python packages for CatBoost also include data visualization tools for plotting statistics of the training process. The resulting plots can be viewed in a Jupyter notebook or in CatBoost’s own data viewer application.

Many machine learning libraries already implement some manner of gradient boosting algorithm. Python’s Scikit-learn package has one versionXGBoost is available for multiple languages and data platforms; and Microsoft has the LightGBM library as part of its Distributed Machine Learning Toolkit project.

CatBoost is meant to stand apart from those projects, according to Yandex, by being pre-tuned to perform at scale for Yandex’s own services. Yandex noted that it uses CatBoost to deliver predictions for its weather services, and that CatBoost has been deployed at the European Organization for Nuclear Research (CERN) to refine results from the particle experiments conducted there. 

Trained models created in CatBoost can be deployed in Apple’s Core ML format, for use in MacOS, iOS, tvOS, and watchOS apps backed by machine learning.

IDG Insider


« How to pick the right collaboration tools


Bluetooth devices could soon have mesh networking capabilities »
IDG News Service

The IDG News Service is the world's leading daily source of global IT news, commentary and editorial resources. The News Service distributes content to IDG's more than 300 IT publications in more than 60 countries.

  • Mail

Recommended for You

Trump hits partial pause on Huawei ban, but 5G concerns persist

Phil Muncaster reports on China and beyond

FinancialForce profits from PSA investment

Martin Veitch's inside track on today’s tech trends

Future-proofing the Middle East

Keri Allan looks at the latest trends and technologies


Do you think your smartphone is making you a workaholic?