Table Names
In this array, a list of keys must be specified to precise where data should be generated.
ADLS
- ADLS_CONTAINER :
- ADLS_DIRECTORY :
- ADLS_FILE_NAME :
- ADLS_LOCAL_FILE_PATH :
GCS
- GCS_BUCKET :
- GCS_DIRECTORY :
- GCS_OBJECT_NAME :
- GCS_LOCAL_FILE_PATH :
HDFS
- HDFS_FILE_PATH : Directory where files will be generated
- HDFS_FILE_NAME : Suffix for files generated
Full file path + name will be: HDFS_FILE_PATHHDFS_FILE_NAME-XXXXXXXXXX.extension ; where XXXXXXXXXX a 10-digit number representing order of the file generated and extension depends on file type (.json, .csv, .orc, .parquet, .avro).
HBase
- HBASE_TABLE_NAME :
- HBASE_NAMESPACE :
HIVE
- HIVE_DATABASE :
-
HIVE_TABLE_NAME :
- HIVE_HDFS_FILE_PATH :
- HIVE_TEMPORARY_TABLE_NAME :
KAFKA
- KAFKA_TOPIC :
OZONE
- OZONE_VOLUME :
- OZONE_BUCKET :
- OZONE_KEY_NAME :
- OZONE_LOCAL_FILE_PATH :
SOLR
- SOLR_COLLECTION ;
KUDU
- KUDU_TABLE_NAME :
LOCAL
- LOCAL_FILE_PATH :
- LOCAL_FILE_NAME :
S3
- S3_BUCKET :
- S3_DIRECTORY :
- S3_KEY_NAME :
- S3_LOCAL_FILE_PATH :
Optional: Specific for Avro format
- AVRO_NAME :
Example
Below a Full example with all possibilities that can be passed:
"Table_Names": {
"HDFS_FILE_PATH": "/user/datagen/hdfs/full/",
"HDFS_FILE_NAME": "full",
"HBASE_TABLE_NAME": "full",
"HBASE_NAMESPACE": "datagen",
"KAFKA_TOPIC": "datagen_full",
"OZONE_VOLUME": "datagen",
"OZONE_BUCKET": "full",
"OZONE_KEY_NAME": "full",
"OZONE_LOCAL_FILE_PATH": "/tmp/datagen/temp/full/",
"SOLR_COLLECTION": "datagen_full",
"HIVE_DATABASE": "datagen",
"HIVE_TABLE_NAME": "full",
"HIVE_TEMPORARY_TABLE_NAME": "full_tmp",
"HIVE_HDFS_FILE_PATH": "/user/datagen/hive/full/",
"KUDU_TABLE_NAME": "datagen.full",
"LOCAL_FILE_PATH": "/tmp/datagen/full/",
"LOCAL_FILE_NAME": "datagen-full",
"S3_BUCKET": "datagen-test-fri",
"S3_DIRECTORY": "datagen/full",
"S3_KEY_NAME": "full-key",
"S3_LOCAL_FILE_PATH": "/tmp/datagen/temp/full/",
"ADLS_CONTAINER": "dgtest",
"ADLS_DIRECTORY": "datagen/full",
"ADLS_FILE_NAME": "full",
"ADLS_LOCAL_FILE_PATH": "/tmp/datagen/temp/full/",
"GCS_BUCKET": "datagenfri",
"GCS_DIRECTORY": "datagen/full",
"GCS_OBJECT_NAME": "full",
"GCS_LOCAL_FILE_PATH": "/tmp/datagen/temp/full/",
"AVRO_NAME": "datagenfull"
},