1. 21 Nov, 2017 2 commits
    • Merge pull request #451 from edx/brian/upgrade-boto · 30d00b90
      Update boto version.
      brianhw committed
    • Update boto version. · 9d78c0f6
      Add host argument to s3_connect.  Update unit test patches to match.
      Host argument is now required by boto.  Latest version of boto provides a
      hook to read from a .boto file, and there is a way for remote-task
      to set such a file up and then read from it.  However, it's not
      clear how to get the .boto file to the task instances in a multi-instance
      cluster so that the file would be read by boto when reducers open
      their own streams to output to an S3 file.
      Brian Wilson committed
  2. 15 Nov, 2017 1 commit
  3. 14 Nov, 2017 2 commits
  4. 13 Nov, 2017 2 commits
  5. 06 Nov, 2017 1 commit
  6. 02 Nov, 2017 1 commit
  7. 01 Nov, 2017 1 commit
  8. 31 Oct, 2017 1 commit
  9. 26 Oct, 2017 1 commit
  10. 25 Oct, 2017 1 commit
  11. 24 Oct, 2017 1 commit
    • Add support for JSON event output. · 940fd7cb
      It coexists with regular event output, and is controlled by
      an optional parameter.  By default it runs with
      event_record_type equal to 'EventRecord', but can
      be overridden by running with --event-record-type 'JsonEventRecord'.
      
      Includes bug fix to timestamp handling:  Add validation screening dates < 1900.
      
      Also includes support for event loading to BigQuery, by adding
      support for partitioning to bigquery_load.
      
      * Use records for warehouse loading where defined.
      * Check bigquery availability in load code.
      
      Add support for loading to S3 by interval or by date.  PerDate loading
      checks whether the data already exists, which is good for incremental runs.
      Bulk loading just runs over an interval, and assumes the data isn't already
      present on S3.  This is better for processing many days more efficiently.
      
      To address issue with loading into BigQuery, null characters in column values
      are encoded as the string '\0'.
      Brian Wilson committed
  12. 23 Oct, 2017 5 commits
  13. 19 Oct, 2017 1 commit
  14. 17 Oct, 2017 1 commit
  15. 16 Oct, 2017 4 commits
  16. 04 Oct, 2017 1 commit
  17. 15 Sep, 2017 2 commits
  18. 13 Sep, 2017 1 commit
  19. 12 Sep, 2017 1 commit
  20. 01 Sep, 2017 1 commit
  21. 30 Aug, 2017 1 commit
  22. 16 Aug, 2017 2 commits
  23. 15 Aug, 2017 2 commits
  24. 11 Aug, 2017 1 commit
  25. 09 Aug, 2017 1 commit
  26. 03 Aug, 2017 1 commit
  27. 02 Aug, 2017 1 commit