Skip to the content.

SCRC Project Diary -2

2020.07.06

Enriching graphs further with disease status and location information

Updates and thinking on 2020.07.06 The goal here is to associate the state transition related information and location information to the nodes in the infection map. Some important questions to look into here are:

When all of these data sources are integrated, we are looking at some stats such as these:

Here are all the “states” of those infection sources. From these numbers, it looks like the asymptomatic cases are causing the highest number of infections, followed by the pre-symptomatic cases

And let’s have a look at the locations of the the infections. A few locations stand out, such as schools , restaurants, and small family settings.

Now all these data is embedded within the graphs, let’s see if there is anything interesting to look at within the graphs.

Only infections happening in school Infections happening School, Restaurants and Small Family Settings Looking at the 30 largest chains and colouring according to infection locations
[high-res pdf] [high-res pdf] [high-res pdf]

And a closer look at some of the chains

Here is a school cluster leading to infections at different locations  

And now let’s look at the disease states of those who are infecting others. Since each individual can be at a different stage when it comes to infecting others, we are colouring the edges

Infections caused by Asymptomatic nodes Infections caused by nodes at their Symptomatic and Pre-Sympotmatic stages
[high-res pdf] [high-res pdf]

Incorporating Derived Data on the Graph

Generation times — One idea that was suggested by Louise was to use the generation times to colour the nodes. generation time is the average time that it takes an infected individual to infect all the others that they’ve infected. So if the average time is low, it means that it took the infections a very short time to occur. The following graph is coloured according to generation time. We are looking at the 10 largest components from the infection map.

Darker means very short infection times. Size is mapped to outDegree, i.e., how many infections a node has been the source of. What this means is that if a node is large and big, we are looking at several infections happening in short time — this is what one might call super-spreaders?

And then if we look at the distribution of generation times, we see a distribution that looks like this. For most nodes, the generation times are between 1 to 10.

mean = 6.523527, std = 2.866262, min = 2.0, max = 24.0

2020.07.10 – Discussion with Daniel: