Add support for JSON event output.
It coexists with regular event output, and is controlled by an optional parameter. By default it runs with event_record_type equal to 'EventRecord', but can be overridden by running with --event-record-type 'JsonEventRecord'. Includes bug fix to timestamp handling: Add validation screening dates < 1900. Also includes support for event loading to BigQuery, by adding support for partitioning to bigquery_load. * Use records for warehouse loading where defined. * Check bigquery availability in load code. Add support for loading to S3 by interval or by date. PerDate loading checks whether the data already exists, which is good for incremental runs. Bulk loading just runs over an interval, and assumes the data isn't already present on S3. This is better for processing many days more efficiently. To address issue with loading into BigQuery, null characters in column values are encoded as the string '\0'.
Showing
This diff is collapsed.
Click to expand it.
This diff is collapsed.
Click to expand it.
This diff is collapsed.
Click to expand it.
This diff is collapsed.
Click to expand it.
Please
register
or
sign in
to comment