Mahout – Data Mining, Machine Learning

Mahout is  machine learning library that can be used on top of Hadoop HDFS.

It uses the MapReduce paradigm.

Mahout features:

  • User and Item based recommenders
  • Matrix factorization based recommenders
  • K-Means, Fuzzy K-Means clustering
  • Latent Dirichlet Allocation
  • Singular value decomposition
  • Logistic regression based classifier
  • Complementary Naive Bayes classifier
  • Random forest decision tree based classifier
  • High performance java collections (previously colt collections)