DataCloud Blog

Cloud, IoT, Big data tools, Data analytics techniques and tutorials

Author: Peter Kortvelyesi

Hadoop query tools

Hadoop’s open-source query tools

Configure Apache Kylin with ODBC to work with MS PowerBI

PowerBI and Kylin – reporting from Hadoop via ODBC. This article discusses how to set up…

Performance test of Pig vs Hive with code examples

Performance testing of high-level Hadoop query languages with example scripts. Analysis of NOAA weather data:…

Spark – Cyclic, high-performance data processing on top of Hadoop

Spark is a high-performance, in-memory computing platform with a cyclic data flow that proves to be a lot…

Pig – Hadoop Query Language

Pig is a platform for large dataset analysis, consisting of a language called Pig Latin.…

Hive – Hadoop Query Language

Hive is used to query and manage large datasets on a Hadoop cluster using an SQL-like…

MapReduce – Hadoop’s essential concept

MapReduce is a programming model used for processing large datasets with Hadoop. Map: to filter…