A curated list of data tools to help you make better product decisions

PredictionIO is an open source machine learning server for software developers to create predictive features, such as personalization, recommendation and content discovery.


MLlib is Spark’s machine learning (ML) library. Its goal is to make practical machine learning scalable and easy. It consists of common learning algorithms and utilities, including classification, regression, clustering, collaborative filtering, dimensionality reduction, as well as lower-level optimization primitives and higher-level pipeline APIs.


Scikit-learn is an open source machine learning library for the Python programming language. It features various classification, regression and clustering algorithms including support vector machines, random forests, gradient boosting, k-means and DBSCAN, and is designed to interoperate with the Python numerical and scientific libraries NumPy and SciPy.


BigML offers a highly scalable, cloud based machine learning service that is easy to use, seamless to integrate and instantly actionable. Through a simple to use interface, users can quickly analyze their data and build predictive models without any prior expertise. The user can explore these models for new insights and use them to make predictions.


Amazon Machine Learning is a service that makes it easy for developers of all skill levels to use machine learning technology. Amazon Machine Learning provides visualization tools and wizards that guide you through the process of creating machine learning (ML) models without having to learn complex ML algorithms and technology.