DataCloud Blog

Cloud, IoT, Big data tools, Data analytics techniques and tutorials

  • Home
  • Big data tutorials
  • Big data datasets
  • Big data tools
    • Hadoop
      • Hadoop query tools
      • Hadoop infrastructure
    • Data visualization
    • Analysis tools
    • Data store
    • Analysis environment
  • Data governance
  • Data security
  • Project & knowledge mgm.
  • About

Author: Peter Kortvelyesi

Posts Tagged: query

Performance test of Pig vs Hive with code examples

Performance test of Pig vs Hive with code examples

Performance testing high level Hadoop query languages with example scripts. Analysis of NOAA weather data:…

Elasticsearch – Search and analyze in real-time

Elasticsearch – Search and analyze in real-time

Elasticsearch is a real-time search and analytics engine. It is scalable, distributed and reliable. Entities…

Lucene and Solr – Search indexer and provider

Lucene and Solr – Search indexer and provider

Lucene and Solr provides leading search capabilities in NoSQL databases (XML, JSON). Lucene core is…

Pig – Hadoop Query Language

Pig – Hadoop Query Language

Pig is a platform for large dataset analysis, consisting of a language called Pig Latin.…

Hive – Hadoop Query Language

Hive – Hadoop Query Language

Hive is to query and manage large datasets of a Hadoop cluster using an SQL like…

MapReduce – Hadoop’s essential concept

MapReduce – Hadoop’s essential concept

MapReduce is a programming model used for processing large datasets with Hadoop. Map: to filter…