DataCloud Blog

Cloud, IoT, Big data tools, Data analytics techniques and tutorials

  • Home
  • Big data tutorials
  • Big data datasets
  • Big data tools
    • Hadoop
      • Hadoop query tools
      • Hadoop infrastructure
    • Data visualization
    • Analysis tools
    • Data store
    • Analysis environment
  • Data governance
  • Data security
  • Project & knowledge mgm.
  • About

Author: Peter Kortvelyesi

Hadoop infrastructure tools

Hadoop’s open source infrastructure tools for big data analysis

Configure Apache Kylin with ODBC to work with MS PowerBI

Configure Apache Kylin with ODBC to work with MS PowerBI

PowerBI and Kylin – reporting from Hadoop via ODBC This article discusses how to setup…

Evaluation of Apache Kylin 1.5.4.1 with HDP 2.5, performance comparison w Hive

Evaluation of Apache Kylin 1.5.4.1 with HDP 2.5, performance comparison w Hive

Apache Kylin is a data cube solution on top of Hadoop providing an ODBC interface…

Create a Hadoop Cluster easily by using PXE boot, Kickstart, Puppet and Ambari to auto-deploy nodes

Create a Hadoop Cluster easily by using PXE boot, Kickstart, Puppet and Ambari to auto-deploy nodes

This tutorial is to showcase unattended and automatic install of multiple CentOS 6.5 x86_64 Hadoop…

A multi-tiered Big Data warehouse & processing facility

A multi-tiered Big Data warehouse & processing facility

The article details an exemplary setup for a multi-tiered data warehouse and processing facility using…

Environment setup for big data analytics

Environment setup for big data analytics

This articles covers basic tools and technologies to use when conducting the first steps on…

Ambari – Hadoop management

Ambari – Hadoop management

Ambari simplifies Hadoop management by providing an easy to use provisioning and monitoring interface for…

Hue – Web UI for Hadoop

Hue – Web UI for Hadoop

Hue is an easy to use, user friendly web UI for Hadoop, featuring File browser…

Zookeper – Hadoop coordination and configuration management

Zookeper – Hadoop coordination and configuration management

Zookeper provides coordination, configuration management, naming, synchronization and group services for large Hadoop clusters. Zookeeper itself…