DataCloud Blog

Cloud, IoT, Big data tools, Data analytics techniques and tutorials

Author: Peter Kortvelyesi

Posts Tagged: MapReduce

Environment setup for big data analytics

This article covers the basic tools and technologies to use when taking the first steps on…

MongoDB NoSQL Database

MongoDB is a NoSQL database designed to store huge amounts of data. It supports dynamic…

Ambari – Hadoop management

Ambari simplifies Hadoop management by providing an easy-to-use provisioning and monitoring interface for…

Hue – Web UI for Hadoop

Hue is an easy-to-use, user-friendly web UI for Hadoop, featuring File browser…

Spark – Cyclic, high-performance data processing on top of Hadoop

Spark is a high-performance, cyclic data-flow, in-memory computing platform that proves to be a lot…

Pig – Hadoop Query Language

Pig is a platform for large dataset analysis, consisting of a language called Pig Latin.…

Hive – Hadoop Query Language

Hive is used to query and manage large datasets on a Hadoop cluster using an SQL-like…

MapReduce – Hadoop’s essential concept

MapReduce is a programming model used for processing large datasets with Hadoop. Map: to filter…

Hadoop big data framework – Hadoop virtual machines

Hadoop is an open-source framework for processing large amounts of data across clusters of computers…