data-science-across-disciplines

Main repository for the Data Science Across Disciplines module offered at the Centre for Interdisciplinary Methodologies at the University of Warwick

Home
Detailed Information

:: Sessions ::

Session-01
Session-02
Session-03
Session-04
Session-05
Session-06
Session-07
Session-08
Session-09


View the Project on GitHub cagatayTurkay/data-science-across-disciplines

teaser

Data Science Across Disciplines

This is the complementary page for the IM939 - Data Science Across Disciplines module that is being run at the Centre for Interdisciplinary Methodologies at the University of Warwick.

Module leader: Cagatay Turkay

Teaching staff: Zofia Bednarowska-Michaiel, James Tripp

What this module is about?

This module introduces students to the fundamental techniques, concepts and contemporary discussions across the broad field of data science. With data and data related artefacts becoming ubiquitous in all aspects of social life, data science gains access to new sources of data, is taken up across an expanding range of research fields and disciplines, and increasingly engages with societal challenges. The module provides an advanced introduction to the theoretical and scientific frameworks of data science, and to the fundamental techniques for working with data using appropriate procedures, algorithms and visualisation. Students learn how to critically approach data and data-driven artefacts, and engage with and critically reflect on contemporary discussions around the practice of data science, its compatibility with different analytics frameworks and disciplinary, and its relation to on-going digital transformations of society. As well as lectures discussing the theoretical, scientific and ethical frameworks of data science, the module features coding labs and workshops that expose students to the practice of working effectively with data, algorithms, and analytical techniques, as well as providing a platform for reflective and critical discussions on data science practices, resulting data artefacts and how they can be interpreted, actioned and influence society.

Module Schedule

Session-01: INTRODUCTION, HISTORICAL PERSPECTIVES & BASIC CONCEPTS

This week discusses data science as a field that cuts across disciplines and provides a historical perspective on the subject. We discuss the terms Data Science and Data Scientists, reflect on examples of Data Science projects, and discuss the research process at a methodological level.

Session details and material

Session-02: THINKING DATA: THEORETICAL AND PRACTICAL CONCERNS

This week explores the cultural, ethical, and critical challenges posed by data artefacts and data-intensive scientific processes. Engaging with Critical Data Studies, we discuss issues around data capture, curation, data quality, inclusion/exclusion and representativeness. The session also discusses the different kinds of data that one can encounter across disciplines, the underlying characteristics of data and how we can analytically and practically approach data quality issues and the challenge of identifying and curating appropriate data sets.

Session details and material

Session-03: ABSTRACTIONS & MODELS

This week discusses ways of abstracting data. We start by visiting statistics as a means of representing data and its inherent characteristics. The session moves on to discuss the notion of a “model” and visit the different schools of thought within model-ing, as well as a tour of fundamental statistical models that help abstract data and its inherent relations.

Session details and material

Session-04: STRUCTURES AND SPACES

This week explores the notion of structures and how data science can enable the extraction of “hidden” underlying groups – clusters – and hierarchical structures from data. We discuss the different techniques to surface and generate artificial boundaries and how the resulting artefacts can be interpreted. This session then investigates how artificial and abstract spaces can be constructed through different “projection” techniques, and how these spaces help us navigate data that are high-dimensional in nature and apply analytic frameworks to them.

Session details and material

Session-05: MULTI-MODEL THINKING AND RIGOUR IN DATA SCIENCE

This week we focus on multi-model approaches as a way of thinking and how critical, pluralistic thinking can improve our understanding of the underlying phenomena implicit in data. We also discuss how to adopt a comprehensive approach to the data science process, and investigate indicators of rigor in data science.

Session details and material

Session-06: RECOGNISING AND AVOIDING TRAPS

Data analysis and statistical routines and procedures are ingrained with several pitfalls and limitations – these range from methodological pitfalls in the processes and data that once can use, to cognitive and behavioural pitfalls that one can come across in making inferences from data and data artefacts. This week we discuss such theoretical and practical traps and pitfalls, how we can be aware of them and what approaches we have to avoid them.

Session details and material

Session-07: DATA SCIENCE & SOCIETY

We will engage with academic and practices discourse on the social, cultural and ethical aspects of data science, and discuss around how one can responsibly carry out data science research on social phenomena, whether data science can be a transformative power in society, and what ethical and social frameworks can help us to critically approach data science practices and its effects on society, and what are ethical practices for data scientists.

Session details and material

Session-08: DATA SCIENCE WORKSHOP - 1 (DESIGN THINKING IN DATA SCIENCE)

This week explores the question “Can we approach data science as a design problem?” and discusses how one can embrace a user-centred approach to design appropriate data science processes. We will do this through hands-on practical where we go through the data science process over an applied case.

Session details and material

Workshop brief

Session-09: DATA SCIENCE WORKSHOP - 2

This workshop week will involve you working hands-on towards your final assessments individually but in small coding groups. In the workshop session, we would like to hear from you on your ideas for your second coursework. You will also be able to use this session to start working on the dataset(s) that you have decided (or still considering) to analyse as part of your second assessment and discuss with your peers and with the staff members.

Session details and material

Some general resources