Hi John,
1. The files returned from the /bulk POST endpoint are delivered in Parquet format, a widely used columnar storage format optimized for Big Data workflows.
If the downloaded file appears unreadable or lacks a file extension, it's possible that CloudFront is stripping the extension during the download. Nonetheless it will still be readable as Parquet format with the schema conforms to Avro (meaning schema and data come in together).
2. Regarding your query on importing to data warehouses etc. We are preparing some enhancements to the developer documentation that will outline a pattern for ingestion. Please stay tuned for updates.
3. Currently, the API is focused on conversation-level data only, and there are no plans at this time to expose aggregate Performance, Status, or Evaluation data through the Lakehouse API.
That said, aggregates can typically be derived from the raw conversation-level metrics, similar to how Genesys Cloud builds reporting views. We'll also be sharing more details about roadmap priorities soon so do keep an eye on the Ideas Portal for upcoming enhancements.
Hope this helps.
------------------------------
Austin Keogh
Principal Product Manager
------------------------------