Main repository for the Data Science Across Disciplines module offered at the Centre for Interdisciplinary Methodologies at the University of Warwick
Home
Session-01
Session-02
Session-03
Session-04
Session-05
Session-06
Session-07
Session-08
Session-09
View the Project on GitHub cagatayTurkay/data-science-across-disciplines
In this practical, your task is to use visualisation effectively to support a “stakeholder” in informing a decision using data. And you will design and conduct a tiny data science approach to support your stakeholders decision making process.
We will be using data made available through the Health Profiles Programme. On their web pages, Health Profiles (HP) programme is defined as:
Health Profiles is a programme to improve availability and accessibility for health and health-related information in England. The profiles give a snapshot overview of health for each local authority in England. Health Profiles are produced annually.
Designed to help local government and health services make decisions and plans to improve local people’s health and reduce health inequalities, the profiles present a set of health indicators that show how the area compares to the national average. The indicators are carefully selected each year to reflect important public health topics.*
In this exercise, your task is to inform the your selected stakeholder on the relations between health indicators and socio-economic and demographic indicators.
From Health Profiles Programme:
In this case, we are lucky that the authorities have nicely structured the data and made sure that it is easily possible to join the different data sets. We will be using the information collected and made available through the Health Profiles Portal.
The data is for each local authority in England and presented in the form of indicators that have been carefully processed and made available for analysis (e.g., grouped under categories such as “Our communities”, “Disease and poor health”, etc.).
Through this link, you can investigate the data visually and download data on a single indicator in either CSV or Excel format. Each file starts with a meta-data section where the indicator is described so these brief information should be sufficient to understand the contents of the file. However, if you want to get a deeper understanding of the indicators and read about how they have been collected, you can have a look at this detailed page about the indicators. Note that the actual indicator values are under the column “Value” in the joint files.
The indicators are available from the above link but you can also download a csv with all the indicators in a zipped file here.
As also described above, the expectation to utilise the relation between the health indicators and socio-demographics information.
Public Health Officer, Local Authority Officer, Patient-facing Clinician (GP), Clinical researcher, Patient…
Consider yourself/your team to be a data scientists aiming to support:
Tasks to complete
T1: Explore the datasets through the webpages above to get yourself familiar.
T2: Identify a stakeholder and agree on a situation
T3: Use this persona template from servicedesigntools.org [link to template page] to help outline a stakeholder persona. Consider how you can help your stakeholder through data when responding to these challenges.
Tasks to complete
T4: Develop one or two analytical questions that you want to address
T5: Clarify which data variables you will be using in your analysis in order to answer your research question
Tasks to complete
T6: Develop an analysis plan on which techniques you want to use and which data variables you will incorporate.
Tasks to complete
T7: Conduct your analysis using Jupyter Notebook by documenting your decisions as you go along.
Tasks to complete
T8: Prepare to present your results
Note: We’ll tell you how to get your slides to us
We have a total of 2 hours for the session, so roughly 120 minutes.
15 min: Introduction and breakout into groups
20 min: Step-1, Sketch a hypothetical scenario and a stakeholder
15 min: Step 2, Identify analytical questions and data
15 min: Step-3, Plan an analysis approach/strategy
10 min: Break / Reflect
40 min: Step-4, Conduct your analysis (if you have the time, plan and move to Step-5)
5 min: Groups reconvene, debrief
+ Week-10 reporting back, 2-3 min. from each group