
Configure Apache Kylin with ODBC to work with MS PowerBI
PowerBI and Kylin – reporting from Hadoop via ODBC This article discusses how to setup…
PowerBI and Kylin – reporting from Hadoop via ODBC This article discusses how to setup…
Apache Kylin is a data cube solution on top of Hadoop providing an ODBC interface…
Performance testing high level Hadoop query languages with example scripts. Analysis of NOAA weather data:…
This tutorial is to showcase unattended and automatic install of multiple CentOS 6.5 x86_64 Hadoop…
Hadoop is not only a data processing but a data warehouse solution. When we are…
The article details an exemplary setup for a multi-tiered data warehouse and processing facility using…
This articles covers basic tools and technologies to use when conducting the first steps on…
Sqoop is an efficient data transfer tool between Hadoop and structured datastores, such as relational…
Storm real-time data processor: while Hadoop is mainly used for batch processing of data Storm…
Mahout is machine learning library that can be used on top of Hadoop HDFS. It…
Ambari simplifies Hadoop management by providing an easy to use provisioning and monitoring interface for…
Hue is an easy to use, user friendly web UI for Hadoop, featuring File browser…
Spark is a high-performance cyclic data-flow in memory computing platform that proves to be lot…
Zookeper provides coordination, configuration management, naming, synchronization and group services for large Hadoop clusters. Zookeeper itself…
HBase is a big data storage for tables with random read/write access needs. Billions of…
Pig is a platform for large dataset analysis, consisting of a language called Pig Latin.…
Hive is to query and manage large datasets of a Hadoop cluster using an SQL like…
MapReduce is a programming model used for processing large datasets with Hadoop. Map: to filter…
Hadoop is an open-source framework for processing large amount of data across clusters of computers…