Unable to manage the query process of small files with Hadoop Hive
Published: 19 Feb 2021 Last Modified Date: 08 Jul 2022
Issue
Attempting to use Tableau's Data Connector to manage a Hadoop Hive Connection
Environment
Tableau Desktop
Windows 10
Resolution
As a potential workaround, it may be able to possible to run an Initial SQL statement.
For more information please see the following article from our online Product Help Guide: Run Initial SQL Statements
Cause
The ability to use Tableau's Data Connector to manage the query process for Small Files from Hadoop Hive is not built into the product.
Additional Information
Hive is a batch-oriented system and is not yet capable of answering simple queries with very quick turnaround. This limitation can make it difficult to explore a new data set or experiment with calculated fields. Some of the newer SQL-on-Hadoop technologies (for example, Cloudera's Impala and Hortonworks' Stringer project) are designed to address this limitation. For more information, please see the guide below. + Hortonworks Hadoop Hive
Thank you for providing your feedback on the effectiveness of the article.