Datagen generates customizable data and writes it to various (Big) Data services (databases, file systems, indexers, queues) in many formats, at scale.

It works with:

  • Hadoop (HDFS, Hive, HBase, etc.)
  • AWS S3
  • Azure Data Lake Storage (ADLS)
  • Google Cloud Storage
  • Local file systems
  • File formats: Avro, Parquet, ORC, JSON, CSV
  • Kafka
  • Solr

It deploys natively on:

  • Cloudera Data Platform (fully integrated)
  • Any machine, in the cloud or on-premises
  • Kubernetes (coming soon)

Table of contents