SCRC Project Diary -2

2020.07.06

Enriching graphs further with disease status and location information

Updates and thinking on 2020.07.06: The goal here is to associate state-transition information and location information with the nodes in the infection map. Some important questions to look into are:

What is the state of an individual when they are infecting another individual?
- There are two ways of surfacing this information. Let’s assume that A infected three others, B, C and D: A → [B, C, D]. These infections could all have happened while A was in different states, so it makes sense to associate this information with the edge.
- Alternatively, this can be information associated with a particular node, for instance the most frequent state in which a node infects others. This could be used in graph-based visualisations where we filter/colour by node.
What is the location of infections?
- This is similar to the above. Let’s start by associating this information with the edges.
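A minimal sketch of the two options above, in plain Python. The event tuples, state names and location names are made up for illustration and are not taken from the simulation output. Each infection is stored as an edge carrying the source's state and the location, and the per-node "most frequent state" is derived from those edges.

```python
from collections import Counter, defaultdict

# Hypothetical infection events: (source, target, source_state, location).
# Values are illustrative only, not real model output.
infections = [
    ("A", "B", "asymptomatic", "school"),
    ("A", "C", "asymptomatic", "restaurant"),
    ("A", "D", "symptomatic", "school"),
    ("B", "E", "presymptomatic", "small family setting"),
]

# Option 1: edge attributes. Every infection keeps the state of its
# source at the time of infection, plus the location.
edge_attrs = {
    (src, dst): {"state": state, "location": loc}
    for src, dst, state, loc in infections
}


def most_frequent_source_state(events):
    """Option 2: for each source node, the state in which it most often infected others."""
    per_node = defaultdict(Counter)
    for src, _dst, state, _loc in events:
        per_node[src][state] += 1
    return {node: counts.most_common(1)[0][0] for node, counts in per_node.items()}


node_state = most_frequent_source_state(infections)
```

Keeping the raw information on the edges means the node-level summary can always be recomputed, whereas the reverse is not possible.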

When all of these data sources are integrated, we are looking at some stats such as these:

Here are all the “states” of the infection sources. From these numbers, it looks like asymptomatic cases cause the highest number of infections, followed by pre-symptomatic cases.

And let’s have a look at the locations of the infections. A few locations stand out, such as schools, restaurants, and small family settings.

Now that all of this data is embedded within the graphs, let’s see if there is anything interesting to look at within them.

Only infections happening in schools	Infections happening in schools, restaurants and small family settings	The 30 largest chains, coloured according to infection location

[high-res pdf]	[high-res pdf]	[high-res pdf]
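The views above rely on two operations: keeping only the edges at chosen locations, and picking the largest infection chains. A rough sketch of both follows. It assumes that a "chain" is a weakly connected component of the infection map, and it uses a hypothetical edge list of (source, target, location) tuples.

```python
from collections import defaultdict, deque


def filter_by_location(edges, locations):
    """Keep only the infections that happened at one of the given locations."""
    return [(s, t, loc) for s, t, loc in edges if loc in locations]


def largest_chains(edges, k):
    """Return the k largest weakly connected components as sets of nodes."""
    neighbours = defaultdict(set)
    for s, t, _loc in edges:
        neighbours[s].add(t)
        neighbours[t].add(s)  # ignore direction when grouping into chains

    seen, components = set(), []
    for start in neighbours:
        if start in seen:
            continue
        # Breadth-first search collects everything reachable from `start`.
        component, queue = {start}, deque([start])
        seen.add(start)
        while queue:
            node = queue.popleft()
            for nxt in neighbours[node]:
                if nxt not in seen:
                    seen.add(nxt)
                    component.add(nxt)
                    queue.append(nxt)
        components.append(component)
    return sorted(components, key=len, reverse=True)[:k]


# Hypothetical edges, for illustration only.
edges = [
    ("A", "B", "school"),
    ("B", "C", "restaurant"),
    ("C", "D", "school"),
    ("E", "F", "school"),
    ("G", "H", "office"),
]
school_only = filter_by_location(edges, {"school"})
top_chain = largest_chains(edges, 1)
```

Note that filtering before extracting chains can split a chain into several smaller ones, so the order of the two steps matters for what the picture shows.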

And a closer look at some of the chains

Here is a school cluster leading to infections at different locations

And now let’s look at the disease states of those who are infecting others. Since an individual can be at a different stage each time they infect someone, we are colouring the edges rather than the nodes.

Infections caused by Asymptomatic nodes	Infections caused by nodes at their Symptomatic and Pre-Symptomatic stages

[high-res pdf]	[high-res pdf]

Incorporating Derived Data on the Graph

Generation times: One idea suggested by Louise was to use the generation times to colour the nodes. Here, the generation time of a node is the average time that an infected individual takes to infect each of the others they have infected. So if the average is low, the infections occurred within a very short time. The following graph is coloured according to generation time. We are looking at the 10 largest components from the infection map.

Darker means very short infection times. Size is mapped to outDegree, i.e., how many infections a node has been the source of. What this means is that if a node is large and dark, we are looking at several infections happening in a short time. Could these be what one might call super-spreaders?
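To make the node colouring and sizing concrete, here is a small sketch of how a per-node generation time and outDegree could be derived. It assumes each infection event carries a timestamp and that a node's clock starts when it was itself infected. The seed times, event tuples and function name are hypothetical; the actual pipeline may define these differently.

```python
from collections import Counter, defaultdict
from statistics import mean

# Hypothetical events: (source, target, time_of_infection), in arbitrary time units.
events = [
    ("A", "B", 3.0),
    ("A", "C", 5.0),
    ("B", "D", 6.0),
    ("B", "E", 10.0),
]
# Assumed infection times for seed cases that have no incoming edge.
seed_times = {"A": 0.0}


def generation_times(events, seed_times):
    """Mean delay between a node's own infection and each infection it caused."""
    infected_at = dict(seed_times)
    for _src, dst, t in events:
        infected_at[dst] = t

    gaps = defaultdict(list)
    for src, _dst, t in events:
        # Raises KeyError if a source has no known infection time.
        gaps[src].append(t - infected_at[src])
    return {node: mean(g) for node, g in gaps.items()}


gen_time = generation_times(events, seed_times)      # drives node colour
out_degree = Counter(src for src, _dst, _t in events)  # drives node size
```

Nodes that never infected anyone have no generation time under this definition, so they would need a neutral colour in the visualisation.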

If we then look at the distribution of generation times, it looks like this. For most nodes, the generation times are between 1 and 10.

mean = 6.523527, std = 2.866262, min = 2.0, max = 24.0

2020.07.10 – Discussion with Daniel:

It would be good to look at the degree vs. generation time
- Maybe remove the degree-1 nodes
Computing the R_0 value for different locations?
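One possible way to start on the degree vs. generation time comparison is sketched below. It assumes "degree" means outDegree and that per-node generation times are already available as a dictionary; both input dictionaries and the function name are hypothetical. Degree-1 nodes are dropped through the `min_degree` threshold.

```python
from collections import defaultdict
from statistics import mean


def generation_time_by_degree(out_degree, gen_time, min_degree=2):
    """Average generation time per outDegree, skipping nodes below min_degree."""
    grouped = defaultdict(list)
    for node, degree in out_degree.items():
        if degree >= min_degree and node in gen_time:
            grouped[degree].append(gen_time[node])
    return {degree: mean(times) for degree, times in sorted(grouped.items())}


# Hypothetical inputs, for illustration only.
out_degree = {"A": 3, "B": 1, "C": 2, "D": 3}
gen_time = {"A": 4.0, "B": 9.0, "C": 6.0, "D": 2.0}

by_degree = generation_time_by_degree(out_degree, gen_time)
```

Setting `min_degree=1` brings the degree-1 nodes back in, which makes it easy to check how much they affect the picture.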