1 00:00:01,890 --> 00:00:02,723 Welcome back. 2 00:00:02,723 --> 00:00:04,290 Welcome back for a third week 3 00:00:04,290 --> 00:00:07,080 of Modeling in Complex Systems. 4 00:00:07,080 --> 00:00:08,760 Last week when I left you in the videos, 5 00:00:08,760 --> 00:00:11,100 I gave a very vague definition 6 00:00:11,100 --> 00:00:13,110 of the first type of modeling 7 00:00:13,110 --> 00:00:16,710 that we started doing using compartmental models. 8 00:00:16,710 --> 00:00:19,081 The definition I gave was something like, 9 00:00:19,081 --> 00:00:20,460 you know, compartmental models 10 00:00:20,460 --> 00:00:22,650 are reduction of complex systems 11 00:00:22,650 --> 00:00:25,890 in schematics of boxes and arrows 12 00:00:25,890 --> 00:00:29,640 where boxes represent subpopulations of your system, 13 00:00:29,640 --> 00:00:31,200 of the parts of your system, 14 00:00:31,200 --> 00:00:36,153 where approximation of homogeneity holds. 15 00:00:37,410 --> 00:00:40,710 And this approximation of homogeneity, really, 16 00:00:40,710 --> 00:00:42,150 I didn't really define. 17 00:00:42,150 --> 00:00:44,940 So I thought it would be nice to revisit in this video 18 00:00:44,940 --> 00:00:46,410 in light of all the good discussions 19 00:00:46,410 --> 00:00:48,723 that we have this week, so. 20 00:00:53,040 --> 00:00:56,820 Lemme share a little bit of slides here. 21 00:00:56,820 --> 00:00:58,440 So I really wanna revisit 22 00:00:58,440 --> 00:01:00,393 this idea of compartmental modeling. 23 00:01:04,740 --> 00:01:06,690 And maybe a clearer definition 24 00:01:06,690 --> 00:01:10,440 would be really that compartments are boxes 25 00:01:10,440 --> 00:01:13,500 where we put indistinguishable parts of the system. 26 00:01:13,500 --> 00:01:17,520 So it's not just that we can approximate 27 00:01:17,520 --> 00:01:19,230 that their homogeneous, 28 00:01:19,230 --> 00:01:21,450 really these approximation of homogeneity 29 00:01:21,450 --> 00:01:25,050 is the idealization that we make to get to the model. 30 00:01:25,050 --> 00:01:27,120 But within the model, 31 00:01:27,120 --> 00:01:29,010 parts that are put in the same boxes 32 00:01:29,010 --> 00:01:31,260 are literally indistinguishable. 33 00:01:31,260 --> 00:01:34,080 So if we are curious about an ecosystem, 34 00:01:34,080 --> 00:01:37,860 for example, with wolves eating rabbits 35 00:01:37,860 --> 00:01:39,393 and rabbits reproducing, 36 00:01:40,230 --> 00:01:43,590 we might have two boxes, one for wolves, one for rabbits. 37 00:01:43,590 --> 00:01:46,320 And we're saying that all wolves are exactly the same. 38 00:01:46,320 --> 00:01:47,670 They behave in the same way. 39 00:01:47,670 --> 00:01:50,970 They have no individual features, 40 00:01:50,970 --> 00:01:53,193 they have no memory of their past dates. 41 00:01:54,270 --> 00:01:55,623 And same with rabbits. 42 00:01:57,600 --> 00:02:00,660 So once we do this compression, really, 43 00:02:00,660 --> 00:02:02,430 what we're trying to do is to think about 44 00:02:02,430 --> 00:02:05,850 the different parts of our system, which is heterogeneous, 45 00:02:05,850 --> 00:02:07,830 sort of by nature of complex system. 46 00:02:07,830 --> 00:02:12,830 And then we try to reduce this heterogeneity into boxes 47 00:02:13,230 --> 00:02:16,290 where there's no heterogeneity within the boxes, 48 00:02:16,290 --> 00:02:19,860 only across, so discrete types of parts. 49 00:02:19,860 --> 00:02:21,660 And then once we have that, 50 00:02:21,660 --> 00:02:24,510 what we try and do is reduce all the interactions, 51 00:02:24,510 --> 00:02:27,720 all the flow of energy or information 52 00:02:27,720 --> 00:02:30,420 within that system as arrows between the boxes. 53 00:02:30,420 --> 00:02:33,000 We can have different types of arrows. 54 00:02:33,000 --> 00:02:36,630 But really they're just a way to formalize what we mean 55 00:02:36,630 --> 00:02:39,813 by interaction and flow of parts across the system. 56 00:02:41,340 --> 00:02:44,070 The example that we discussed at great length 57 00:02:44,070 --> 00:02:49,070 was inspired by, you know, current events in the world. 58 00:02:49,980 --> 00:02:52,170 And we thought about outbreaks of COVID 59 00:02:52,170 --> 00:02:54,450 on a university campus, right? 60 00:02:54,450 --> 00:02:57,210 And then we saw the classic disease models which say, 61 00:02:57,210 --> 00:03:00,420 well, we have this complex human population, 62 00:03:00,420 --> 00:03:02,670 different types of people and all that, 63 00:03:02,670 --> 00:03:04,380 but at the very minimum, 64 00:03:04,380 --> 00:03:07,740 what we care about in terms of building a disease model 65 00:03:07,740 --> 00:03:12,090 is their state regarding this infectious disease. 66 00:03:12,090 --> 00:03:15,690 So in the classic case, we would've three boxes, 67 00:03:15,690 --> 00:03:20,690 so Box S, I, and R to distinguish our population 68 00:03:20,700 --> 00:03:24,930 in terms of whether people are susceptible, meaning healthy, 69 00:03:24,930 --> 00:03:28,380 and potentially could be infected in the future. 70 00:03:28,380 --> 00:03:30,330 Currently infected and infectious. 71 00:03:30,330 --> 00:03:31,710 So this I box. 72 00:03:31,710 --> 00:03:33,690 And then recovered. 73 00:03:33,690 --> 00:03:35,040 And we simply have two arrows, 74 00:03:35,040 --> 00:03:38,490 one from S to I, and one from I to R. 75 00:03:38,490 --> 00:03:41,733 So those two arrows, really the way to think about this is, 76 00:03:42,990 --> 00:03:44,850 I mean, the way I like to think about it 77 00:03:44,850 --> 00:03:49,830 is that boxes are simply backed up 78 00:03:49,830 --> 00:03:53,130 that get filled with water representing a certain density 79 00:03:53,130 --> 00:03:55,560 or fraction of the total system. 80 00:03:55,560 --> 00:03:59,970 And then mechanisms, the arrows are simply pipes 81 00:03:59,970 --> 00:04:02,160 through which water can flow, in this case. 82 00:04:02,160 --> 00:04:03,930 So you have a lot of water initially 83 00:04:03,930 --> 00:04:07,170 in the susceptible bathtub or compartment, 84 00:04:07,170 --> 00:04:11,490 which flows at some rate that might change in time. 85 00:04:11,490 --> 00:04:13,450 But this pipe has a certain width 86 00:04:14,760 --> 00:04:17,910 through which water flows from the S to I bathtub 87 00:04:17,910 --> 00:04:20,880 and that models infections in the system. 88 00:04:20,880 --> 00:04:23,970 And then the I, we get a little bit of water 89 00:04:23,970 --> 00:04:25,770 that grows in the I bathtub 90 00:04:25,770 --> 00:04:28,800 and another pipe that flows with a different width 91 00:04:28,800 --> 00:04:32,790 into the R bathtub, representing recovered individual. 92 00:04:32,790 --> 00:04:33,632 The reason why I use this idea of water 93 00:04:33,632 --> 00:04:38,100 will become clearer in the next videos for this week 94 00:04:38,100 --> 00:04:42,930 when we start thinking about the mathematical description 95 00:04:42,930 --> 00:04:45,390 that we can get from these systems of box and arrows. 96 00:04:45,390 --> 00:04:48,510 And then we'll see that they're not necessarily ideal 97 00:04:48,510 --> 00:04:50,400 at dealing with discrete quantities. 98 00:04:50,400 --> 00:04:51,960 So it's often better to think of it 99 00:04:51,960 --> 00:04:53,883 as a density of water. 100 00:04:54,720 --> 00:04:56,820 Now the one thing I'll say at this point 101 00:04:56,820 --> 00:04:59,670 is that we have three boxes that we've labeled, 102 00:04:59,670 --> 00:05:01,140 you know, susceptible, infected, 103 00:05:01,140 --> 00:05:02,880 or infectious and recovered. 104 00:05:02,880 --> 00:05:06,450 And we have pipes that we label infection events 105 00:05:06,450 --> 00:05:07,803 or recovery events. 106 00:05:09,870 --> 00:05:11,460 And I discussed this 107 00:05:11,460 --> 00:05:14,070 with the entire class throughout the week 108 00:05:14,070 --> 00:05:17,520 but I wanted to put it on the record in one video. 109 00:05:17,520 --> 00:05:20,850 My number one modeling tip is to be very careful 110 00:05:20,850 --> 00:05:23,973 with the labels that we put on boxes and on arrows. 111 00:05:25,080 --> 00:05:27,990 Putting label is super useful to remind ourselves 112 00:05:27,990 --> 00:05:29,550 of, okay, this compartment, 113 00:05:29,550 --> 00:05:31,770 this box represent infected individual. 114 00:05:31,770 --> 00:05:32,790 And then we have an idea 115 00:05:32,790 --> 00:05:34,740 of what this means in terms of mechanisms 116 00:05:34,740 --> 00:05:37,920 and it help us think about the mathematical description. 117 00:05:37,920 --> 00:05:42,920 But when comes the time to discuss the result of the model, 118 00:05:44,070 --> 00:05:47,460 we have to be careful about not putting too much weight 119 00:05:47,460 --> 00:05:48,990 on the words that we chose 120 00:05:48,990 --> 00:05:51,600 to use to label these boxes and arrows, right? 121 00:05:51,600 --> 00:05:54,240 So if I take, you know, some other model, 122 00:05:54,240 --> 00:05:56,250 we might think of some arrows as, 123 00:05:56,250 --> 00:05:58,260 you know, brain complexity, 124 00:05:58,260 --> 00:06:00,660 or some boxes as individuals 125 00:06:00,660 --> 00:06:04,470 with more technological savviness, right? 126 00:06:04,470 --> 00:06:07,230 And then if our conclusions read something like, 127 00:06:07,230 --> 00:06:09,693 imagine headlines read something like, 128 00:06:10,920 --> 00:06:14,220 with increases in brain complexity 129 00:06:14,220 --> 00:06:17,970 or increases in technological savviness comes X, Y and Z. 130 00:06:17,970 --> 00:06:19,830 Well that's not quite right, right? 131 00:06:19,830 --> 00:06:21,870 Technological sevens or brain complexities 132 00:06:21,870 --> 00:06:24,270 are what our model is trying to describe, 133 00:06:24,270 --> 00:06:26,550 which really it's with this decomposition 134 00:06:26,550 --> 00:06:28,890 and all the idealization that we've made 135 00:06:28,890 --> 00:06:30,450 that comes X, Y, and Z. 136 00:06:30,450 --> 00:06:32,823 And so sometimes putting things on, 137 00:06:34,680 --> 00:06:37,440 putting things with realistic label 138 00:06:37,440 --> 00:06:39,030 can be a little misleading. 139 00:06:39,030 --> 00:06:41,760 'Cause we forget that we're really talking about 140 00:06:41,760 --> 00:06:44,700 our model at this point and not reality, right? 141 00:06:44,700 --> 00:06:45,840 So seeing it by the dip, 142 00:06:45,840 --> 00:06:47,820 the representation is not the real world 143 00:06:47,820 --> 00:06:50,400 and sometimes labels can lead us to forget that. 144 00:06:50,400 --> 00:06:51,480 And that's my number one tip 145 00:06:51,480 --> 00:06:54,153 about modeling to be careful about the labels. 146 00:06:56,640 --> 00:06:58,800 Oka, but for getting to this now, 147 00:06:58,800 --> 00:07:01,740 we have, you know, three boxes, S, I, and R, 148 00:07:01,740 --> 00:07:05,040 and two arrows labeled infection and recoveries. 149 00:07:05,040 --> 00:07:06,750 And this little description 150 00:07:06,750 --> 00:07:08,460 is the simplest possible disease model 151 00:07:08,460 --> 00:07:10,440 for something like COVID on campus 152 00:07:10,440 --> 00:07:14,430 where R would be, you know, people that have recovered, 153 00:07:14,430 --> 00:07:16,590 that are either dead or immune. 154 00:07:16,590 --> 00:07:19,230 In any case, they're not gonna participate, 155 00:07:19,230 --> 00:07:21,240 they're not gonna get infected anymore. 156 00:07:21,240 --> 00:07:23,640 So they're not gonna spread the disease further. 157 00:07:24,480 --> 00:07:27,390 And then we discussed ways to generalize this model. 158 00:07:27,390 --> 00:07:29,880 Really, the way we did that was to try and list 159 00:07:29,880 --> 00:07:33,240 all the possible idealization that had to be made 160 00:07:33,240 --> 00:07:35,850 by the modelers who came up with the S I R model 161 00:07:35,850 --> 00:07:38,640 to go from a complex system, a human population, 162 00:07:38,640 --> 00:07:41,200 and an infectious pathogen spreading therein 163 00:07:42,120 --> 00:07:44,943 to this simple, like three boxes, two arrow system. 164 00:07:45,780 --> 00:07:48,480 One idealization is that infectious individuals 165 00:07:48,480 --> 00:07:49,980 are all the same. 166 00:07:49,980 --> 00:07:52,290 In terms of COVID, we know some people are asymptomatic, 167 00:07:52,290 --> 00:07:53,123 some are not. 168 00:07:53,123 --> 00:07:55,110 So one improvement that we made 169 00:07:55,110 --> 00:07:58,200 was to introduce two boxes for two different types 170 00:07:58,200 --> 00:07:59,820 of infectious individuals, 171 00:07:59,820 --> 00:08:03,330 whether they are aware, IA, or unaware, IU, 172 00:08:03,330 --> 00:08:05,700 of their infectious state, right? 173 00:08:05,700 --> 00:08:09,960 So untested, asymptomatic individual would be an IU 174 00:08:09,960 --> 00:08:13,888 and you can imagine that those individuals contribute more 175 00:08:13,888 --> 00:08:17,640 to the flow of individual from S to IA or IU 176 00:08:17,640 --> 00:08:18,870 than IA individuals. 177 00:08:18,870 --> 00:08:21,510 Meaning that IU individual might be more contagious 178 00:08:21,510 --> 00:08:23,220 'cause they're not taking any precautions, 179 00:08:23,220 --> 00:08:25,350 they don't even know they're sick. 180 00:08:25,350 --> 00:08:29,910 Another important generalizations that we made 181 00:08:29,910 --> 00:08:33,330 was to not only consider recovered individuals, 182 00:08:33,330 --> 00:08:36,783 that all individuals previously infectious. 183 00:08:37,920 --> 00:08:40,770 One reason why we might wanna model disease 184 00:08:40,770 --> 00:08:43,293 is to be able to test different interventions. 185 00:08:44,460 --> 00:08:46,530 While in which case, you know, it really matters 186 00:08:46,530 --> 00:08:48,960 whether people recovered and are now immune, 187 00:08:48,960 --> 00:08:52,530 recovered and might have long-term consequences from COVID, 188 00:08:52,530 --> 00:08:53,850 or if they're dead, right? 189 00:08:53,850 --> 00:08:55,560 So disease individuals, 190 00:08:55,560 --> 00:08:57,540 what we wanna be able to evaluate 191 00:08:57,540 --> 00:08:59,790 is the cost of this disease burden. 192 00:08:59,790 --> 00:09:03,480 So then we split all of that up into a disease compartment, 193 00:09:03,480 --> 00:09:04,950 a recovered compartment, 194 00:09:04,950 --> 00:09:08,190 and a long-lasting consequence compartment. 195 00:09:08,190 --> 00:09:10,800 In terms of dynamics of the spreading, 196 00:09:10,800 --> 00:09:11,640 they're all the same, 197 00:09:11,640 --> 00:09:14,160 meaning they're not gonna get sick anymore, 198 00:09:14,160 --> 00:09:16,380 they're not gonna infected anyone else. 199 00:09:16,380 --> 00:09:18,570 But in terms of evaluating different outcomes, 200 00:09:18,570 --> 00:09:20,190 it really makes a big difference. 201 00:09:20,190 --> 00:09:23,340 So again, here we have a more complex model, 202 00:09:23,340 --> 00:09:24,480 a lot more arrows. 203 00:09:24,480 --> 00:09:26,370 And mathematically, that can be annoying. 204 00:09:26,370 --> 00:09:28,530 It's gonna be harder to analyze. 205 00:09:28,530 --> 00:09:33,060 But if the purpose of our model is to evaluate the quality 206 00:09:33,060 --> 00:09:35,550 of different outcomes with different interventions, 207 00:09:35,550 --> 00:09:37,233 well this is essential, right? 208 00:09:41,220 --> 00:09:43,920 The final generalization that I wanna talk about 209 00:09:43,920 --> 00:09:46,080 that was made during the discussion group 210 00:09:46,080 --> 00:09:50,070 is also to consider 211 00:09:50,070 --> 00:09:52,830 more different types of susceptible individuals. 212 00:09:52,830 --> 00:09:56,250 So we're not all as susceptible on campus, right? 213 00:09:56,250 --> 00:09:58,500 So if you're a remote student, 214 00:09:58,500 --> 00:09:59,370 so you're in town, 215 00:09:59,370 --> 00:10:02,880 you might get sick through some other venue 216 00:10:02,880 --> 00:10:04,770 but you're not physically on campus, 217 00:10:04,770 --> 00:10:06,180 you're a low risk individual. 218 00:10:06,180 --> 00:10:08,310 And we might call that SL. 219 00:10:08,310 --> 00:10:09,420 If you're on campus, 220 00:10:09,420 --> 00:10:10,800 but you take all your precautions, 221 00:10:10,800 --> 00:10:13,230 you might be medium risk, SM. 222 00:10:13,230 --> 00:10:16,530 If you're on campus and you have pre-existing conditions 223 00:10:16,530 --> 00:10:17,820 and a lot of contact, 224 00:10:17,820 --> 00:10:21,150 then you might be a high risk individual, SH. 225 00:10:21,150 --> 00:10:24,120 And as you can see, like it's easy, if you want. 226 00:10:24,120 --> 00:10:26,610 There are so many idealizations that we made 227 00:10:26,610 --> 00:10:29,280 to go from the real world 228 00:10:29,280 --> 00:10:31,410 to a system of three boxes and two arrows. 229 00:10:31,410 --> 00:10:34,140 It's easy to think of generalizations to make. 230 00:10:34,140 --> 00:10:38,400 The key part is in making the right generalization 231 00:10:38,400 --> 00:10:40,710 that don't introduce too much more, 232 00:10:40,710 --> 00:10:43,470 too much additional complexity in the model, 233 00:10:43,470 --> 00:10:47,010 but are required for the purpose of the model, right? 234 00:10:47,010 --> 00:10:51,420 So if we wanna consider the role of having remote student, 235 00:10:51,420 --> 00:10:55,050 then we have to distinguish on these three different types 236 00:10:55,050 --> 00:10:57,330 or at least two types of susceptible individual. 237 00:10:57,330 --> 00:10:58,930 It's all about the balance here. 238 00:11:00,270 --> 00:11:01,103 So to wrap it up, 239 00:11:01,103 --> 00:11:04,620 before we start going from boxes and arrows 240 00:11:04,620 --> 00:11:06,243 to mathematical description, 241 00:11:06,243 --> 00:11:10,800 I'll give yet another definition of compartmental model. 242 00:11:10,800 --> 00:11:12,210 One way to think about it 243 00:11:12,210 --> 00:11:15,210 is that it's a decomposition representation 244 00:11:15,210 --> 00:11:18,540 or compression of a real system 245 00:11:18,540 --> 00:11:21,030 into different types of homogeneous parts. 246 00:11:21,030 --> 00:11:23,640 Really, the key thing is we wanna acknowledge 247 00:11:23,640 --> 00:11:27,600 that per nature of complex system, 248 00:11:27,600 --> 00:11:29,730 we have a collection of parts, 249 00:11:29,730 --> 00:11:32,370 multiple interconnected parts, and they're not all the same. 250 00:11:32,370 --> 00:11:34,050 They're often heterogeneous. 251 00:11:34,050 --> 00:11:36,210 But we wanna ask what is the minimum, 252 00:11:36,210 --> 00:11:40,380 minimum number of of different types that I can distinguish. 253 00:11:40,380 --> 00:11:42,720 And then within those types, then I'll start ignoring 254 00:11:42,720 --> 00:11:45,810 all individual heterogeneity for convenience. 255 00:11:45,810 --> 00:11:49,173 And then if you can get to this minimum description, 256 00:11:50,820 --> 00:11:52,920 then you have a good compartmental model 257 00:11:52,920 --> 00:11:56,790 that's worth going through a mathematical description 258 00:11:56,790 --> 00:11:57,623 and solving. 259 00:11:57,623 --> 00:11:59,670 And then that's what we will be up to 260 00:11:59,670 --> 00:12:02,070 over the next few videos. 261 00:12:02,070 --> 00:12:03,630 So hopefully that clears it up. 262 00:12:03,630 --> 00:12:07,410 Really, this is the type of of modeling tool 263 00:12:07,410 --> 00:12:09,663 that only becomes obvious the more you do it.