DataCloud Blog

Cloud, IoT, Big data tools, Data analytics techniques and tutorials

Author: Peter Kortvelyesi
Data Governance in the Big Data and Hadoop world – Whitepaper

My full whitepaper is published and downloadable at EPAM.com. Amidst vast data lakes and a high…

Air quality monitoring IOT – Arduino & sensors connected to a Raspberry Pi

I was very interested in monitoring the surrounding air’s quality, so I built a box…

Configure Apache Kylin with ODBC to work with MS PowerBI

PowerBI and Kylin – reporting from Hadoop via ODBC. This article discusses how to set up…

Evaluation of Apache Kylin 1.5.4.1 with HDP 2.5, performance comparison w Hive

Apache Kylin is a data cube solution on top of Hadoop providing an ODBC interface…

Top 7 challenges of building a data lake

While from the technical perspective, deployment, management and provisioning tools are available to quickly set…

IT support for project management processes

I co-authored the lecture notes for an e-learning book on IT support for project management processes. For a…

Performance test of Pig vs Hive with code examples

Performance testing high level Hadoop query languages with example scripts. Analysis of NOAA weather data:…

Create a Hadoop Cluster easily by using PXE boot, Kickstart, Puppet and Ambari to auto-deploy nodes

This tutorial showcases the unattended, automatic installation of multiple CentOS 6.5 x86_64 Hadoop…

Analysis tutorial with Tableau Desktop

Tableau Desktop supports visual analysis and data discovery, converts the raw information to easy to…

Setting up a firewall to secure a Hadoop cluster’s network with Shorewall

Shorewall is a tool for configuring Linux's built-in iptables in an easy and understandable way.…

Security of a Hadoop cluster

Hadoop is not only a data processing solution but also a data warehouse solution. When we are…

Data mining from webpages with Python Mechanize Browser Automation – a Big data tutorial

Data mining from the web with Python Mechanize Browser Automation – Big data tutorial In…

Data mining from webpages with Selenium Python WebDriver Browser Automation – a Big data tutorial

Data mining from the web with Selenium Python WebDriver Browser Automation – Big data tutorial In…

Send to Kindle via email – using browser automation

I often read web articles on the Kindle e-reader for less eye strain. In order…

Weather data analysis and visualization – Big data tutorial Part 1/9 – Fundamentals

Tutorial big data analysis: Weather changes in the Carpathian-Basin from 1900 to 2014 – Part…

Weather data analysis and visualization – Big data tutorial Part 2/9 – Dataset

Tutorial big data analysis: Weather changes in the Carpathian-Basin from 1900 to 2014 – Part…

Weather data analysis and visualization – Big data tutorial Part 3/9 – Environment

Tutorial big data analysis: Weather changes in the Carpathian-Basin from 1900 to 2014 – Part…

Weather data analysis and visualization – Big data tutorial Part 4/9 – Hadoop & Pig

Tutorial big data analysis: Weather changes in the Carpathian-Basin from 1900 to 2014 – Part…

Weather data analysis and visualization – Big data tutorial Part 5/9 – Visualizing: GIS & map

Tutorial big data analysis: Weather changes in the Carpathian-Basin from 1900 to 2014 – Part…

Weather data analysis and visualization – Big data tutorial Part 6/9 – SED example

Tutorial big data analysis: Weather changes in the Carpathian-Basin from 1900 to 2014 – Part…
