Analytical Platform

The Analytical Platform provides users with a place to store, ingest and consume data.

This repository holds the published technical documentation for the Ministry of Justice’s Analytical Platform, along with the code that builds its infrastructure for our users.

If you are a user of the Analytical Platform, please see our User Guidance.

Overview

Architecture

Infrastructure

Services

These are the services that we maintain and support as a platform. To find out what we are responsible for, please see the Shared Responsibility Model.

Service | Description
Amazon Managed Workflows | Amazon Managed Workflows for Apache Airflow (Amazon MWAA) orchestrates your workflows using Directed Acyclic Graphs (DAGs) written in Python.
Control Panel | Provides users with data management and access to analytical tooling, including Airflow, Jupyter notebooks, RStudio and Visual Studio Code.
Create a Derived Table | A service that lets you schedule deployments of tables derived from data available on the Analytical Platform straight to Athena.
Data Sharing | Allows users to ingest data into their Analytical Platform data warehouse using protocols such as SFTP.
Jupyter Lab | Provides interactive notebooks that allow users to create and share documents containing live code, equations, visualisations and narrative text.
RStudio | An integrated development environment (IDE) for R that integrates with Git.
Visual Studio Code | Provides data scientists with powerful tools for coding, debugging and managing data, including integrated support for Jupyter notebooks.

Team information

Internal Processes

Please see the Analytical Platform Runbooks documentation.