Datagen generate customizable data into various (Big) Data services (Databases, File Systems, Indexers, Queues) in many formats, at scale.
It works with:
- Hadoop (HDFS, Hive, HBase etc…)
- AWS S3
- Azure DLS
- Google Cloud Storage
- Local file systems
- Different File Formats: Avro, Parquet, ORC, JSON, CSV
- Kafka
- SolR
It deploys natively on:
- Cloudera Data Platform (fully integrated)
- Any machine in the Cloud or On-Prem
- Kubernetes platform (coming soon)