Overview

Background

University of Edinburgh Distance Learning at Scale courses are to be delivered via a partner platform. 

Learner data from this platform needs to be accessed in order to:

  • Support learning analytics tools that provide coaching feedback to learners based on actions taken in the MicroMasters modules;

  • Support course-based administration processes (e.g. identification of learners who have passed all modules in the MicroMasters and are eligible for the capstone);

  • Generate reports on course usage and performance.

This project will design and implement an automated process and infrastructure to download and decrypt the data package and make it available via a Relational Database Management System (RDBMS).
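As a rough illustration of the process described above, the sketch below chains download, decrypt and import stages for each data package and records failures instead of aborting the run, in the spirit of the error handling deliverable (D2.8). The `run_pipeline` helper, the `RunReport` structure, the stage functions and the package names are all hypothetical placeholders, not the project's actual implementation; real stages would call the S3 download, the decryption tool and the database loader.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, Iterable, List, Tuple

# A stage takes a package name and raises an exception on failure.
Stage = Callable[[str], None]


@dataclass
class RunReport:
    """Outcome of one pipeline run (hypothetical structure)."""
    succeeded: List[str] = field(default_factory=list)
    failed: Dict[str, str] = field(default_factory=dict)  # package -> error text


def run_pipeline(packages: Iterable[str], stages: List[Tuple[str, Stage]]) -> RunReport:
    """Run every stage for every package, flagging errors without stopping."""
    report = RunReport()
    for package in packages:
        stage_name = "start"
        try:
            for stage_name, stage in stages:
                stage(package)
        except Exception as exc:
            # Flag the failing stage and carry on with the next package.
            report.failed[package] = f"{stage_name}: {exc}"
        else:
            report.succeeded.append(package)
    return report


# Placeholder stages; the ".bad" suffix simulates a package that fails to decrypt.
def download(package: str) -> None:
    pass


def decrypt(package: str) -> None:
    if package.endswith(".bad"):
        raise ValueError("decryption failed")


def load(package: str) -> None:
    pass


report = run_pipeline(
    ["weekly-snapshot", "events-day1.bad", "events-day2"],
    [("download", download), ("decrypt", decrypt), ("import", load)],
)
print(report.succeeded)  # ['weekly-snapshot', 'events-day2']
print(report.failed)     # {'events-day1.bad': 'decrypt: decryption failed'}
```

Keeping each stage as a plain function makes it possible to test the control flow separately from the real download, decryption and import steps.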

Scope

The scope of the project is to:

  • Design the automated process by which data is downloaded, decrypted and made available via an RDBMS. 

  • Build the infrastructure and workflows that will deliver this process, using a resilient design that can be mirrored across both Test and Live environments, and test that the data is accurate.

  • Document the data flow processes and ensure they meet legal requirements and university policy. 

The scope of the project does not include: 

  • Building the environments for the OnTask learning analytics tool or identifying how the use of OnTask should be incorporated into pilot DLAS programmes. 

  • The gathering of student data for UoE admissions systems during registration for the capstone assessment. (We assume that the data edX holds on learners is so minimal that the benefits of part-populating UoE systems from edX data would not be worth the effort required.)

Out of Scope 

Integration with EDW. Whilst there may be a longer-term requirement, the main initial requirement is for operational reporting and for improving understanding of the edX data set. DLAS is currently a 3-year pilot; if it is approved as a service, consideration should be given to identifying a subset of the data that could be included in EDW. This would be scheduled for 20/21.

Objectives

Objective / Deliverable | Priority | Owner
O1. Ensure Data Protection requirements are identified
D1.1 Produce a Data Privacy Impact Assessment (DPIA), plus any other documentation identified as required as a result of writing the DPIA. | Must | Business Lead
O2. Deliver a Platform Data Store for DLAS
D2.1 Creation of both Test and Live Production environments. | Must | Technical Lead
D2.2 A resilient infrastructure ensuring data is always available. | Must | Technical Lead
D2.3 An automated mechanism for downloading and storing encrypted data packages (weekly database snapshot of EdinburghX courses and daily clickstream/event log data) from edx.org Amazon S3 cloud storage to UoE IS locally hosted infrastructure. | Must | Software Development Lead
D2.4 Automated decryption of data packages based on the existing Data Czar data key. | Must | Technical Lead
D2.5 An automated process for importing the weekly database snapshot into an RDBMS. | Must | Software Development Lead
D2.6 Creation of additional tables for designated event log/clickstream data. | Must | Software Development Lead
D2.7 An automated process for importing designated event log/clickstream data. | Must | Software Development Lead
D2.8 An error handling mechanism for the automated decryption and import of data packages (allowing the process to complete with any errors flagged). | Must | Software Development Lead
D2.9 Documentation of the database schema, automated processes and error handling processes. | Must | Software Development Lead
D2.10 Consultation on and creation of basic reporting of the EdinburghX data (e.g. which courses user x is enrolled on, which users are enrolled on course x). | Must | Software Development Lead / Business Lead / Project Sponsor
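Deliverables D2.6 and D2.7 could be approached along the lines of the sketch below, which filters event log lines down to a set of designated event types and loads them into a dedicated table. It assumes the event logs arrive as one JSON object per line with `username`, `event_type` and `time` fields and a nested `context.course_id`; that layout, the `designated_events` table, the list of designated event types and the sample course ID are assumptions made for illustration. An in-memory SQLite database stands in for the project's RDBMS.

```python
import json
import sqlite3

# Hypothetical set of designated event types; the real list would be agreed with the business.
DESIGNATED_EVENT_TYPES = {"play_video", "problem_check"}


def import_designated_events(conn, lines):
    """Load designated events into a table; return (imported, malformed) counts."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS designated_events ("
        "username TEXT, event_type TEXT, course_id TEXT, event_time TEXT)"
    )
    imported = malformed = 0
    for line in lines:
        try:
            event = json.loads(line)
            event_type = event["event_type"]
        except (json.JSONDecodeError, KeyError, TypeError):
            # Count the bad line and keep going so the import can complete.
            malformed += 1
            continue
        if event_type not in DESIGNATED_EVENT_TYPES:
            continue  # not a designated event, so it is not stored
        course_id = (event.get("context") or {}).get("course_id")
        conn.execute(
            "INSERT INTO designated_events VALUES (?, ?, ?, ?)",
            (event.get("username"), event_type, course_id, event.get("time")),
        )
        imported += 1
    conn.commit()
    return imported, malformed


# Made-up sample lines: one designated event, one other event, one malformed line.
sample_lines = [
    '{"username": "learner1", "event_type": "play_video", '
    '"time": "2019-03-01T10:00:00", "context": {"course_id": "COURSE-A"}}',
    '{"username": "learner1", "event_type": "page_close", '
    '"time": "2019-03-01T10:05:00", "context": {"course_id": "COURSE-A"}}',
    "not json",
]
conn = sqlite3.connect(":memory:")
counts = import_designated_events(conn, sample_lines)
print(counts)  # (1, 1)
```

Returning the count of malformed lines gives the calling process something concrete to flag without halting the import.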

Benefits

  • Data from edx.org is accessible via a locally hosted database.

  • Data generated from edx.org can be incorporated into DLAS academic programmes (e.g. via the learning analytics tool, OnTask), improving the quality of the experience for users.

  • Schools can manage and monitor capstone access.

  • The DLAS project can analyse and produce reports on the usage of the edx.org platform.

Success Criteria

  • Creation of a database (RDBMS) that hosts the data generated by EdinburghX courses and meets business requirements.

  • Creation of robust automated processes that import the weekly database snapshot and defined clickstream data.

  • Decryption and import processes that run to completion and flag any issues via an agreed method.

  • DPIA created and approved with GDPR processes identified and established. 

  • Creation of basic reports that successfully query the EdinburghX data, including the ability to query user enrolment, status and assessment progress across all courses.
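The basic enrolment reports mentioned above might look something like the following sketch. The `users` and `enrolments` tables, their columns and the sample rows are hypothetical and heavily simplified; the real schema would come from the imported weekly snapshot. An in-memory SQLite database is used here only as a stand-in for the project's RDBMS.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE users (id INTEGER PRIMARY KEY, username TEXT);
    CREATE TABLE enrolments (user_id INTEGER, course_id TEXT, is_active INTEGER);
    INSERT INTO users VALUES (1, 'learner1'), (2, 'learner2');
    INSERT INTO enrolments VALUES
        (1, 'COURSE-A', 1), (1, 'COURSE-B', 1), (2, 'COURSE-A', 0);
    """
)


def courses_for_user(conn, username):
    """Return (course_id, is_active) for every course the user is enrolled on."""
    return conn.execute(
        "SELECT e.course_id, e.is_active "
        "FROM enrolments e JOIN users u ON u.id = e.user_id "
        "WHERE u.username = ? ORDER BY e.course_id",
        (username,),
    ).fetchall()


def users_on_course(conn, course_id):
    """Return (username, is_active) for every user enrolled on the course."""
    return conn.execute(
        "SELECT u.username, e.is_active "
        "FROM enrolments e JOIN users u ON u.id = e.user_id "
        "WHERE e.course_id = ? ORDER BY u.username",
        (course_id,),
    ).fetchall()


print(courses_for_user(conn, "learner1"))  # [('COURSE-A', 1), ('COURSE-B', 1)]
print(users_on_course(conn, "COURSE-A"))   # [('learner1', 1), ('learner2', 0)]
```

Both queries use bound parameters rather than string formatting, which keeps report inputs from being interpreted as SQL.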

Project Milestones

Stage | Milestone | Date
Plan | Planning | 11-Jan-2019
Design | Sign-Off Design Phase | 15-Feb-2019
Build | Sign-Off Build Phase | 18-Mar-2019
Accept | Sign-Off Acceptance Testing | 19-Apr-2019
Deliver | Go-Live | 26-Apr-2019
Deliver | Sign-Off DSOR | 03-May-2019
Close | Closure | 03-May-2019

 

Project Info

Project: Establishing Platform Data Store for DLAS
Code: DLAS012
Programme: Distance Learning at Scale (DLAS)
Management Office: ISG PMO
Project Manager: Andrew Stewart
Project Sponsor: Anne-Marie Scott
Current Stage: Close
Status: Closed
Project Classification: Grow
Start Date: 03-Sep-2018
Planning Date: 11-Jan-2019
Delivery Date: 23-Aug-2019
Close Date: 29-Nov-2019
Overall Priority: Higher
Category: Discretionary

Documentation

Plan