Overview
After Hive data sources are configured, you can use Data Fusion > Task Management function to import its data into the divine data table or entity, easy to use in the report, analysis model, intelligent operations and other modules.
Before configuring the Hive data source, check whether your Hive data source meets the requirements by referring to the following:
Data Source Type | Data Source Name | Version Requirement | User Permission Requirement | Other Requirements |
---|---|---|---|---|
Database Class | Hive | 3.x or 2.x version | SELECT Permission | None |
Add Hive Data Source
- Select Data Fusion > Universal Data Access > Data Source Management.
- Click All Data Source Tab page.
- Click Hive Data Source.
- Click on the top right corner Create Button.
- Fill in Hive connection information.
- Data source connection name : It is customized by you and is the unique identification of the data source connection in the platform.
- Server : IP address to which the data source is connected.
- Port : indicates the port number to which the data source is connected.
- Database name : Optional. You can specify a specific database. If you do not specify a specific database, all databases under the connection will be obtained.
- Login mode:support Anonymous login and User name Password Indicates the login.
- Authentication mode:
- Authentication-free mode : indicates the configuration item in the hive configuration file hive-site.xmlhive.server2.authentication is NONE.In non-authentication mode, anonymous users access Hive in JDBC mode. Hive must have read permission for any user.
- KERBEROS:
- Client Principal: indicates the name of the client that has registered with the KDC.
- Krb5 Configuration File: Only supportkrb5.conf File upload with name and suffix.
- Keytab Secret key: Only support .keytab Format file.
- Data Connection URL: Automatically generated based on the above information.
- Click Test Connection button.
- Click Submit button.
Manage Hive Data Source
- Select Data Fusion > Universal Data Access > Data Source Management.
- Click Added data source Tab page.
- Click Hive data source.
- Edit: Supports modifying all configuration parameters of data connection.
- Delete: Delete the current connection.
If the current data connection is used by the task, modifying the parameters or deleting the connection will cause the task to fail.
Table field type mapping rules
Import data from the Hive data source to the Sensor Data table. Incorrect field type mapping may result in content conversion errors or task execution failures. Configure the field mapping according to the following rules to ensure data conversion is safe:
Original field type | Sensor Data table field type |
---|---|
TINYINT | NUMBER / INT / BIGINT |
SMALLINT | NUMBER / INT / BIGINT |
INT/ INTEGER | NUMBER / INT / BIGINT |
BIGINT | NUMBER / BIGINT |
FLOAT | NUMBER |
DOUBLE | NUMBER |
DECIMAL | NUMBER |
NUMERIC | NUMBER |
STRING | STRING |
VARCHAR | STRING |
CHAR | STRING |
BOOLEAN | BOOLEAN |
TIMESTAMP | TIMESTAMP |
DATE | TIMESTAMP |
Description
- The above mentioned types are the currently supported Hive field types.
- The configuration of unsupported type fields after task import will cause task execution to fail.