Overview
Background
University of Edinburgh Distance Learning at Scale courses are to be delivered via a partner platform.
Learner data from this platform needs to be accessed:
-
For learning analytic tools that provide coaching feedback to learners based on actions taken in the MicroMasters modules;
-
Course-based administration processes (e.g. identification of learners who have passed all modules in the MicroMasters and are eligible for the capstone).
-
To generate reports on course usage and performance
This project will design and implement an automated process and infrastructure to download, decrypt the data package and to make it available via a Relational Database Management System (RDBMS).
Scope
This scope of the project is to:
-
Design the automated process by which data is downloaded, decrypted and made available via an RDBMS.
-
Build the infrastructure and workflows that will deliver this process in a resilient infrastructure that can be mirrored on both Test and Live environments and test that the data is accurate.
-
Document the data flow processes and ensure they meet legal requirements and university policy.
The scope of the project does not include:
-
Building the environments for the OnTask learning analytics tool or identifying how the use of OnTask should be incorporated into pilot DLAS programmes.
-
The gathering of student data for UoE admissions systems during registration for the capstone assessment. (We assume that the data from edX on learners is so minimal that the benefits of part-populating UoE systems from edX data will not be worth the effort required).
Out of Scope
Integration with EDW. Whilst there may be a longer term requirement, initially the main requirement is the need is for operational reporting and to increase the understanding of the edX data set. DLAS is a 3 year pilot for now but if approved as a service, consideration should be given to identify a subset of the data that could be included in EDW. This would be scheduled for 20/21.
Objectives
| Phase | Priority | Owner |
| O1. Ensure Data Protection requirements are identified | ||
| D1.1 Produce a Data Privacy Impact Assessment (DPIA) plus any other documentation identified as being required as a result of writing the DPIA. | Must | Business Lead |
| O2. Deliver a Platform Data Store for DLAS | ||
| D2.1 The creation of both a Test and Live Production environments | Must | Technical Lead |
| D2.2 A resilient infrastructure ensuring data is always available | Must | Technical Lead |
| D2.3 An automated mechanism for downloading and storage of encrypted data packages (weekly database snapshot of Edinburgh X courses and daily clickstream\event log data) from edx.org Amazon S3 cloud storage to a UoE IS locally-hosted infrastructure. | Must | Software Development Lead |
| D2.4 An automated decryption of data packages based on the existing Data Czar data key. | Must | Technical Lead |
| D2.5 An automated process for importing the weekly database snapshot into an RDBMS. | Must | Software Development Lead |
| D2.6 Creation of additional tables of designated event log \ clickstream data | Must | Software Development Lead |
| D2.7 An automated process for importing designated event log \ clickstream data | Must | Software Development Lead |
| D2.8 Error handling mechanism for automated decryption and import processes of data packages (which allows the process to complete with any errors flagged). | Must | Software Development Lead |
| D2.9 Documentation of the database schema, automated processes and error handling processes. | Must | Software Development Lead |
| D2.10 Consultation & creation of basic reporting of the EdinburghX data (e.g. user x is enrolled on what courses, course x has the following user enrolled) | Must | Software Development Lead / Business Lead / Project Sponsor |
Benefits
-
Data from edX.org is accessible via a locally hosted database.
-
Data generated from edx.org can be incorporated into DLAS academic programmes, improving the quality of the experience for users. (e.g. learning analytics tool, OnTask)
-
Allows schools to manage and monitor capstone access.
-
Allow the DLAS project to analyse and produce reports on the usage of the edx.org platform
Success Criteria
-
Creation of an database (RDBMS) that hosts EdinburghX courses generated data that meets business requirements
-
Creation of robust automated processes that imports the weekly database snapshot and defined clickstream data.
-
A decryption and import processes that allows the import process to complete and flags issues via an agreed method.
-
DPIA created and approved with GDPR processes identified and established.
-
Creation of basic reports that successfully query the EdinburghX data including the ability to query user enrolment, status and assessment progress across all courses
Project Milestones
| Stage | Milestone | Date |
| Plan | Planning | 11-Jan-2019 |
| Design | Sign-Off Design Phase | 15-Feb-2019 |
| Build | Sign-Off Build Phase | 18-Mar-2019 |
| Accept | Sign-Off Acceptance Testing | 19-Apr-2019 |
| Deliver | Go-Live | 26-Apr-2019 |
| Deliver | Sign-Off DSOR | 03-May-2019 |
| Close | Closure | 03-May-2019 |
