data-science-across-disciplines

Main repository for the Data Science Across Disciplines module offered at the Centre for Interdisciplinary Methodologies at the University of Warwick

Home
Detailed Information

:: Sessions ::

Session-01
Session-02
Session-03
Session-04
Session-05
Session-06
Session-07
Session-08
Session-09


View the Project on GitHub cagatayTurkay/data-science-across-disciplines

Session-08: DATA SCIENCE WORKSHOP

BRIEF: Analysing Health Indicators

In this practical, your task is to use visualisation effectively to support a “stakeholder” in informing a decision using data. And you will design and conduct a tiny data science approach to support your stakeholders decision making process.

We will be using data made available through the Health Profiles Programme. On their web pages, Health Profiles (HP) programme is defined as:

Health Profiles is a programme to improve availability and accessibility for health and health-related information in England. The profiles give a snapshot overview of health for each local authority in England. Health Profiles are produced annually.

Designed to help local government and health services make decisions and plans to improve local people’s health and reduce health inequalities, the profiles present a set of health indicators that show how the area compares to the national average. The indicators are carefully selected each year to reflect important public health topics.*

In this exercise, your task is to inform the your selected stakeholder on the relations between health indicators and socio-economic and demographic indicators.

DATA

From Health Profiles Programme:

In this case, we are lucky that the authorities have nicely structured the data and made sure that it is easily possible to join the different data sets. We will be using the information collected and made available through the Health Profiles Portal.

The data is for each local authority in England and presented in the form of indicators that have been carefully processed and made available for analysis (e.g., grouped under categories such as “Our communities”, “Disease and poor health”, etc.).

Through this link, you can investigate the data visually and download data on a single indicator in either CSV or Excel format. Each file starts with a meta-data section where the indicator is described so these brief information should be sufficient to understand the contents of the file. However, if you want to get a deeper understanding of the indicators and read about how they have been collected, you can have a look at this detailed page about the indicators. Note that the actual indicator values are under the column “Value” in the joint files.

The indicators are available from the above link but you can also download a csv with all the indicators in a zipped file here.

TASKS

As also described above, the expectation to utilise the relation between the health indicators and socio-demographics information.

Step-1: Sketch a hypothetical scenario and a stakeholder

Potential stakeholder involved could be:

Public Health Officer, Local Authority Officer, Patient-facing Clinician (GP), Clinical researcher, Patient…

Potential situations could be:

Consider yourself/your team to be a data scientists aiming to support:

Tasks to complete

T1: Explore the datasets through the webpages above to get yourself familiar.

T2: Identify a stakeholder and agree on a situation

T3: Use this persona template from servicedesigntools.org [link to template page] to help outline a stakeholder persona. Consider how you can help your stakeholder through data when responding to these challenges.

Step-2: Identify analytical questions and data

Tasks to complete

T4: Develop one or two analytical questions that you want to address

T5: Clarify which data variables you will be using in your analysis in order to answer your research question

Step-3: Plan an analysis approach/strategy

Tasks to complete

T6: Develop an analysis plan on which techniques you want to use and which data variables you will incorporate.

Step-4: Conduct your analysis

Tasks to complete

T7: Conduct your analysis using Jupyter Notebook by documenting your decisions as you go along.

Step-5: Presenting the results (Note: To be presented on Week-10)

Tasks to complete

T8: Prepare to present your results

Note: We’ll tell you how to get your slides to us

Workshop Timeline

We have a total of 2 hours for the session, so roughly 120 minutes.

15 min: Introduction and breakout into groups

20 min: Step-1, Sketch a hypothetical scenario and a stakeholder

15 min: Step 2, Identify analytical questions and data

15 min: Step-3, Plan an analysis approach/strategy

10 min: Break / Reflect

40 min: Step-4, Conduct your analysis (if you have the time, plan and move to Step-5)

5 min: Groups reconvene, debrief

+ Week-10 reporting back, 2-3 min. from each group