- 21 Nov, 2017 2 commits
-
-
Update boto version.
brianhw committed -
Add host argument to s3_connect. Update unit test patches to match. Host argument is now required by boto. Latest version of boto provides a hook to read from a .boto file, and there is a way for remote-task to set such a file up and then read from it. However, it's not clear how to get the .boto file to the task instances in a multi-instance cluster so that the file would be read by boto when reducers open their own streams to output to an S3 file.
Brian Wilson committed
-
- 15 Nov, 2017 1 commit
-
-
use video duration from events
Muhammad Ammar committed
-
- 14 Nov, 2017 2 commits
-
-
EDUCATOR-1411
muhammad-ammar committed -
Added parameter import_credentials to HistogramFromSqoopToMySQLWorkflowBase
Jillian Vogel committed
-
- 13 Nov, 2017 2 commits
-
-
Allow empty inserts for ModuleEngagementSummaryMetricRangesMysqlTask
Jillian Vogel committed -
for sites with periods of low activity.
Jillian Vogel committed
-
- 06 Nov, 2017 1 commit
-
-
Inspect traceback for handling end of output error.
Hassan committed
-
- 02 Nov, 2017 1 commit
-
-
Hassan Javeed committed
-
- 01 Nov, 2017 1 commit
-
-
Add missing backslash for null ascii.
brianhw committed
-
- 31 Oct, 2017 1 commit
-
-
Brian Wilson committed
-
- 26 Oct, 2017 1 commit
-
-
Brian/also load json events
brianhw committed
-
- 25 Oct, 2017 1 commit
-
-
Translation of MySQL's LONGBLOB to Vertica's LONG VARBINARY.
Andrew Zafft committed
-
- 24 Oct, 2017 1 commit
-
-
It coexists with regular event output, and is controlled by an optional parameter. By default it runs with event_record_type equal to 'EventRecord', but can be overridden by running with --event-record-type 'JsonEventRecord'. Includes bug fix to timestamp handling: Add validation screening dates < 1900. Also includes support for event loading to BigQuery, by adding support for partitioning to bigquery_load. * Use records for warehouse loading where defined. * Check bigquery availability in load code. Add support for loading to S3 by interval or by date. PerDate loading checks whether the data already exists, which is good for incremental runs. Bulk loading just runs over an interval, and assumes the data isn't already present on S3. This is better for processing many days more efficiently. To address issue with loading into BigQuery, null characters in column values are encoded as the string '\0'.
Brian Wilson committed
-
- 23 Oct, 2017 5 commits
-
-
Check Sqoop job completion before running.
brianhw committed -
Brian Wilson committed
-
Make sure enrollment output is encoded
brianhw committed -
Brian Wilson committed
-
Including a best effort translation of MySQL's LONGBLOB to Vertica's LONG VARBINARY. Data loss is possible here.
Andrew Zafft committed
-
- 19 Oct, 2017 1 commit
-
-
Fix enrollment validation to use unicode version of course_id. Add non-ASCII testing for course-id and related values to ensure that encoding is properly done elsewhere.
Brian Wilson committed
-
- 17 Oct, 2017 1 commit
-
-
Also check for the marker file when checking for completeness.
Brian Wilson committed
-
- 16 Oct, 2017 4 commits
-
-
Incremental video.
Hassan committed -
Added tasks to load warehouse data into BigQuery.
Hassan committed -
Hassan Javeed committed
-
Hassan Javeed committed
-
- 04 Oct, 2017 1 commit
-
-
and tidy up the confusion between credentials and import_credentials.
David Adams committed
-
- 15 Sep, 2017 2 commits
-
-
Use merchant close date to compute interval.
Hassan committed -
Hassan Javeed committed
-
- 13 Sep, 2017 1 commit
-
-
Upgrade setuptools, and other supporting packages.
brianhw committed
-
- 12 Sep, 2017 1 commit
-
-
Brian Wilson committed
-
- 01 Sep, 2017 1 commit
-
-
Cleaned up and added logic to figure out the latest completion date.
Hassan committed
-
- 30 Aug, 2017 1 commit
-
-
Hassan Javeed committed
-
- 16 Aug, 2017 2 commits
-
-
Default to empty function for hadoop counter callback.
Hassan committed -
Hassan Javeed committed
-
- 15 Aug, 2017 2 commits
-
-
Speed up finance reporting workflow.
Hassan committed -
Hassan Javeed committed
-
- 11 Aug, 2017 1 commit
-
-
Event Exporter Improved Encryption Logging & Error Handling
Andrew Zafft committed
-
- 09 Aug, 2017 1 commit
-
-
Improving logging and adding a counter when there is a failure during the event-exporter encryption process using gpg
Andrew Zafft committed
-
- 03 Aug, 2017 1 commit
-
-
Workaround for travis boto issue with GCE.
brianhw committed
-
- 02 Aug, 2017 1 commit
-
-
Changed active users computation to a per week basis.
Hassan committed
-