Cloudera Hive Odbc Driver

Under Hortonworks ODBC Driver for Apache Hive, download the driver for Windows (64-bit). For installation instructions, expand the Hortonworks ODBC Driver for Apache Hive (v2.6.1.1) entry and click Documentation under the list of operating systems to open the 'Hortonworks Hive ODBC Driver User Guide.' Follow the instructions for the Windows driver. The driver achieves this by translating Open Database Connectivity (ODBC) calls from Tableau into SQL and passing the SQL queries to the underlying Impala or Hive engines. The driver supports Cloudera Impala 1.0 (and later) and the Apache Hive versions supplied with CDH 4.2 and later. The Cloudera ODBC Driver for Hive enables your enterprise users to access Hadoop data through Business Intelligence (BI) applications with ODBC support. The driver achieves this by translating Open Database Connectivity (ODBC) calls from the application into SQL and passing the SQL queries to the underlying Hive engine.

  1. Cloudera Hive Odbc Driver For Linux
  2. Cloudera Odbc Driver For Apache Hive
  3. Odbc Driver Install
Active5 years, 9 months ago

I downloaded and installed cloudera odbc hive drivers provided in the link https://ccp.cloudera.com/display/con/Cloudera+Connector+for+Tableau. But when running odbcad32.exe (ODBC Data Source Administrator) the Drivers tab does not show anything new related to this driver. I have tried it on Window Server 2008 R2 and Windows 7. Anyone experiencing the same and do have a solution?Thanks, Cees

Cees WesselingCees Wesseling

1 Answer

Which version of the ODBC manager are you using? If your platform is 64-bit, the Cloudera ODBC driver will not show up using the 64-bit manager (the Driver is 32-bit). Therefore, you must use the 32-bit manager which can be found under c:/windows/sysWoW64/odbcad32.exe.

Keep in mind that the 32-bit adapter can only be used with 32-bit applications.

Hope this is helpful.

Matt HarrisMatt Harris

Not the answer you're looking for? Browse other questions tagged windowsodbchiveclouderatableau or ask your own question.

-->

Microsoft's Big Data solution integrates Microsoft Business Intelligence (BI) components with Apache Hadoop clusters that have been deployed in Azure HDInsight. An example of this integration is the ability to connect Excel to the Hive data warehouse of a Hadoop cluster in HDInsight using the Microsoft Hive Open Database Connectivity (ODBC) Driver.

It is also possible to connect the data associated with an HDInsight cluster and other data sources, including other (non-HDInsight) Hadoop clusters, from Excel using the Microsoft Power Query add-in for Excel. For information on installing and using Power Query, see Connect Excel to HDInsight with Power Query.

Prerequisites

Before you begin this article, you must have the following items:

  • An HDInsight Hadoop cluster. To create one, see Get started with Azure HDInsight.
  • A workstation with Office 2010 Professional Plus or later, or Excel 2010 or later.

Install Microsoft Hive ODBC driver

Download and install the Microsoft Hive ODBC Driver version that matches the version of the application where you will be using the ODBC driver. For this article, the driver is used for Office Excel.

Create Apache Hive ODBC data source

The following steps show you how to create a Hive ODBC Data Source.

  1. From Windows, navigate to Start > Windows Administrative Tools > ODBC Data Sources (32-bit)/(64-bit). This opens the ODBC Data Source Administrator window.

  2. From the User DSN tab, select Add to open the Create New Data Source window.

  3. Select Microsoft Hive ODBC Driver, and then select Finish to open the Microsoft Hive ODBC Driver DSN Setup window.

  4. Type or select the following values:

    PropertyDescription
    Data Source NameGive a name to your data source
    Host(s)Enter HDInsightClusterName.azurehdinsight.net. For example, myHDICluster.azurehdinsight.net
    PortUse 443. (This port has been changed from 563 to 443.)
    DatabaseUse default.
    MechanismSelect Windows Azure HDInsight Service
    User NameEnter HDInsight cluster HTTP user username. The default username is admin.
    PasswordEnter HDInsight cluster user password. Select the checkbox Save Password (Encrypted).
  5. Optional: Select Advanced Options...

    ParameterDescription
    Use Native QueryWhen it is selected, the ODBC driver does NOT try to convert TSQL into HiveQL. You shall use it only if you are 100% sure you are submitting pure HiveQL statements. When connecting to SQL Server or Azure SQL Database, you should leave it unchecked.
    Rows fetched per blockWhen fetching a large number of records, tuning this parameter may be required to ensure optimal performances.
    Default string column length, Binary column length, Decimal column scaleThe data type lengths and precisions may affect how data is returned. They cause incorrect information to be returned due to loss of precision and/or truncation.
  6. Select Test to test the data source. When the data source is configured correctly, the test result shows SUCCESS!.

  7. Select OK to close the Test window.

  8. Select OK to close the Microsoft Hive ODBC Driver DSN Setup window.

  9. Select OK to close the ODBC Data Source Administrator window.

Import data into Excel from HDInsight

Cloudera Hive Odbc Driver For Linux

The following steps describe the way to import data from a Hive table into an Excel workbook using the ODBC data source that you created in the previous section.

  1. Open a new or existing workbook in Excel.

  2. From the Data tab, navigate to Get Data > From Other Sources > From ODBC to launch the From ODBC window.

  3. From the drop-down list, select the data source name that you created in the last section and then select OK.

  4. For the first use, an ODBC driver dialog will open. Select Windows from the left menu. Then select Connect to open the Navigator window.

  5. From Navigator, navigate to HIVE > default > hivesampletable, and then select Load. It takes a few moments before data gets imported to Excel.

Hive

Cloudera Odbc Driver For Apache Hive

Next steps

In this article, you learned how to use the Microsoft Hive ODBC driver to retrieve data from the HDInsight Service into Excel. Similarly, you can retrieve data from the HDInsight Service into SQL Database. It is also possible to upload data into an HDInsight Service. To learn more, see:

Odbc Driver Install

  • Visualize Apache Hive data with Microsoft Power BI in Azure HDInsight.
  • Visualize Interactive Query Hive data with Power BI in Azure HDInsight.
  • Use Apache Zeppelin to run Apache Hive queries in Azure HDInsight.
  • Connect Excel to Apache Hadoop by using Power Query.
  • Connect to Azure HDInsight and run Apache Hive queries using Data Lake Tools for Visual Studio.
  • Use Azure HDInsight Tool for Visual Studio Code.
  • Upload data to HDInsight.