1 00:00:00,540 --> 00:00:02,930 - [Instructor] Hi, and welcome to module six 2 00:00:02,930 --> 00:00:05,363 of advanced geographic information systems. 3 00:00:06,320 --> 00:00:08,580 This week we'll cover two primary topics 4 00:00:08,580 --> 00:00:11,090 spread out over four lectures. 5 00:00:11,090 --> 00:00:14,300 The first topic relates to density calculations. 6 00:00:14,300 --> 00:00:17,410 And we'll hear about that here in lecture one. 7 00:00:17,410 --> 00:00:19,800 From there, we'll move on to interpolation 8 00:00:19,800 --> 00:00:23,110 and we'll hear about that in lectures two, three, and four. 9 00:00:23,110 --> 00:00:24,023 Let's get started. 10 00:00:24,930 --> 00:00:28,410 A density calculation is a focal operation, 11 00:00:28,410 --> 00:00:32,990 meaning that it computes the value of a cell of interest 12 00:00:32,990 --> 00:00:35,550 based on the value of the cells 13 00:00:35,550 --> 00:00:38,100 within a neighborhood that surrounds that location. 14 00:00:39,220 --> 00:00:42,000 It's up to you to specify that search region, 15 00:00:42,000 --> 00:00:47,000 that area over which the algorithm looks for other values, 16 00:00:47,670 --> 00:00:50,260 and it's what is considered the neighborhood. 17 00:00:50,260 --> 00:00:52,520 So think about it this way. 18 00:00:52,520 --> 00:00:56,040 You've got a blank slate for your raster dataset, 19 00:00:56,040 --> 00:00:58,720 and you've got a point data set is your input 20 00:00:58,720 --> 00:01:01,520 that you want to use to calculate the density. 21 00:01:01,520 --> 00:01:04,400 For each one of those cells in the new raster 22 00:01:04,400 --> 00:01:08,550 ArcGIS draws a circle and sums the number of points 23 00:01:08,550 --> 00:01:11,320 and writes that information into that cell. 24 00:01:11,320 --> 00:01:12,880 So it's sort of like a buffer 25 00:01:12,880 --> 00:01:15,410 and select by location operation 26 00:01:15,410 --> 00:01:19,593 for identifying the neighbors for a specific cell. 27 00:01:21,320 --> 00:01:22,980 Cells with no neighbors are assigned 28 00:01:22,980 --> 00:01:25,700 a no data value in the output. 29 00:01:25,700 --> 00:01:27,450 Let's walk through an example here. 30 00:01:29,290 --> 00:01:32,190 As (indistinct) has populates the values of the raster 31 00:01:32,190 --> 00:01:34,460 that identifies a cell, 32 00:01:34,460 --> 00:01:36,523 draws a neighborhood around that cell, 33 00:01:37,570 --> 00:01:40,600 then counts or sums the observations 34 00:01:40,600 --> 00:01:42,800 within that neighborhood. 35 00:01:42,800 --> 00:01:43,963 Once that's done, 36 00:01:45,420 --> 00:01:48,540 that value from number two is divided 37 00:01:48,540 --> 00:01:50,660 by the area of the neighborhood. 38 00:01:50,660 --> 00:01:51,500 Again, that's something 39 00:01:51,500 --> 00:01:54,110 that you specify for the calculation. 40 00:01:54,110 --> 00:01:57,860 And then that result from step number three 41 00:01:57,860 --> 00:02:02,850 is assigned to that cell from number one. 42 00:02:02,850 --> 00:02:05,940 From there ArcGIS just continues across the raster 43 00:02:05,940 --> 00:02:08,053 and continues to make those calculations. 44 00:02:09,810 --> 00:02:11,630 So why do this? 45 00:02:11,630 --> 00:02:13,190 Density calculations are nice 46 00:02:13,190 --> 00:02:17,170 to help us identify areas of concentration 47 00:02:17,170 --> 00:02:20,250 and they can create a predicted distribution 48 00:02:20,250 --> 00:02:21,560 of the population. 49 00:02:21,560 --> 00:02:25,870 The total value, or the sum of all values of the neighbors 50 00:02:25,870 --> 00:02:27,403 for each individual cell. 51 00:02:28,290 --> 00:02:29,930 Now we've heard a little bit about the how 52 00:02:29,930 --> 00:02:32,060 in terms of summing those observations 53 00:02:32,060 --> 00:02:34,640 within the neighborhood and dividing by that area 54 00:02:34,640 --> 00:02:39,430 of the neighborhood to produce that density calculation. 55 00:02:39,430 --> 00:02:41,580 You can also use a kernel. 56 00:02:41,580 --> 00:02:45,574 And in this case we're fitting a curve over each cell. 57 00:02:45,574 --> 00:02:47,420 And the kernel supplies the weights. 58 00:02:47,420 --> 00:02:50,710 So closer to the center point 59 00:02:50,710 --> 00:02:53,370 that cell of interest, the weight is highest. 60 00:02:53,370 --> 00:02:56,640 And that weight diminishes with increased distance 61 00:02:56,640 --> 00:02:58,480 from that center point. 62 00:02:58,480 --> 00:03:01,140 If you were to compute the volume under that curve, 63 00:03:01,140 --> 00:03:02,470 and I urge caution here 64 00:03:02,470 --> 00:03:05,620 because now we're getting into the realm of calculus. 65 00:03:05,620 --> 00:03:07,850 That volume would equal the sum 66 00:03:07,850 --> 00:03:10,320 of the population within that search limit. 67 00:03:10,320 --> 00:03:12,770 So within the neighborhood that you specify. 68 00:03:12,770 --> 00:03:14,290 Let's look at a graphical example 69 00:03:14,290 --> 00:03:15,453 to see if that helps. 70 00:03:17,190 --> 00:03:20,347 First, here's a number of different types 71 00:03:20,347 --> 00:03:21,400 of curves for fitting. 72 00:03:21,400 --> 00:03:24,270 And in this case, we're talking about a quartic curve. 73 00:03:24,270 --> 00:03:27,330 So that's the cyan blue that you see 74 00:03:27,330 --> 00:03:29,033 about halfway down in the legend. 75 00:03:30,860 --> 00:03:32,660 If we were to look from above, 76 00:03:32,660 --> 00:03:35,760 imagine that the dark green cell in the center 77 00:03:35,760 --> 00:03:39,870 is our cell of interest, and we fit the curve over that. 78 00:03:39,870 --> 00:03:43,093 The lighter the green color, the lower the weight. 79 00:03:44,096 --> 00:03:47,660 You can see how that weight factor tapers off 80 00:03:47,660 --> 00:03:49,270 as you get further and further out 81 00:03:49,270 --> 00:03:51,220 closer to the edge of the neighborhood. 82 00:03:53,020 --> 00:03:54,850 Now, we'll look at three 83 00:03:54,850 --> 00:03:57,390 different density type calculations. 84 00:03:57,390 --> 00:04:01,253 A kernel density, a line density, and a point density. 85 00:04:02,800 --> 00:04:06,260 The kernel density measures the magnitude per unit area 86 00:04:06,260 --> 00:04:10,100 for point or line inputs using that kernel function 87 00:04:10,100 --> 00:04:13,623 that we just heard about fit to each one of the features. 88 00:04:15,200 --> 00:04:17,680 Line density measures a magnitude per area 89 00:04:17,680 --> 00:04:20,660 from a line feature within some user-specified radius 90 00:04:20,660 --> 00:04:22,640 around each cell. 91 00:04:22,640 --> 00:04:25,220 And then lastly, a point density 92 00:04:25,220 --> 00:04:27,150 measures the magnitude per unit area 93 00:04:27,150 --> 00:04:30,673 from a point feature within a user-specified neighborhood. 94 00:04:33,600 --> 00:04:36,940 Now, when you're working with the density operations 95 00:04:36,940 --> 00:04:41,510 you have the option for not specifying a population field. 96 00:04:41,510 --> 00:04:44,740 So in that case it really is just a hotspot analysis 97 00:04:44,740 --> 00:04:48,050 identifying clusters of the presence 98 00:04:48,050 --> 00:04:51,260 of some specific point feature. 99 00:04:51,260 --> 00:04:53,700 In this case, we'll work with the recreation sites dataset 100 00:04:53,700 --> 00:04:54,803 we've seen before. 101 00:04:56,700 --> 00:04:59,210 We'll use that as our input point features. 102 00:04:59,210 --> 00:05:02,260 For now we'll hold off on specifying a population field. 103 00:05:02,260 --> 00:05:05,000 So we'll leave the default value of none. 104 00:05:05,000 --> 00:05:08,340 And then we'll choose a circular neighborhood 105 00:05:09,230 --> 00:05:11,500 with a radius of 25,000. 106 00:05:11,500 --> 00:05:15,810 Now that 25,000 I should note, is reported in native units 107 00:05:15,810 --> 00:05:20,470 to the input dataset, which is also the coordinate system 108 00:05:20,470 --> 00:05:22,923 of the output raster that will be produced. 109 00:05:25,770 --> 00:05:28,853 Here's the distribution of the input points, 110 00:05:30,400 --> 00:05:34,143 and the resulting raster from that tool parameterization. 111 00:05:35,940 --> 00:05:37,390 Let's take a look at another. 112 00:05:39,240 --> 00:05:41,650 In this case, I've used the same setup 113 00:05:41,650 --> 00:05:44,300 except for I've specified a rectangular neighborhood. 114 00:05:45,220 --> 00:05:47,650 And based on some theory that I have 115 00:05:47,650 --> 00:05:49,930 about a trend in the data 116 00:05:49,930 --> 00:05:53,263 I've decided to create a rectangular neighborhood 117 00:05:53,263 --> 00:05:57,890 7,500 meters wide and 15,000 meters tall. 118 00:05:57,890 --> 00:05:59,140 Let's look at that output 119 00:06:00,950 --> 00:06:03,570 and then compare the two outputs. 120 00:06:03,570 --> 00:06:06,840 So you can see that the definition of the neighborhood, 121 00:06:06,840 --> 00:06:09,590 that size and shape of the neighborhood 122 00:06:09,590 --> 00:06:11,540 can have pretty dramatic impact 123 00:06:11,540 --> 00:06:16,150 on the resulting surface that you estimate. 124 00:06:16,150 --> 00:06:17,630 Of course, if you're going to specify 125 00:06:17,630 --> 00:06:20,090 something other than a circular pattern, 126 00:06:20,090 --> 00:06:23,370 it's better to know about those trends ahead of time 127 00:06:23,370 --> 00:06:27,850 or plan to do some work in data exploration mode 128 00:06:27,850 --> 00:06:30,120 to better understand the potential 129 00:06:30,120 --> 00:06:31,943 for those trends in your data set. 130 00:06:33,690 --> 00:06:38,170 If we zoom in on the same location in each image 131 00:06:38,170 --> 00:06:41,840 we also see that those values are drastically different 132 00:06:41,840 --> 00:06:43,243 even at the local scale. 133 00:06:45,780 --> 00:06:47,610 Now, let's do the same thing 134 00:06:48,480 --> 00:06:52,140 with a circular neighborhood in our 25,000 meter radius. 135 00:06:52,140 --> 00:06:54,310 And this time for our population field 136 00:06:54,310 --> 00:06:55,600 we'll use the acreage. 137 00:06:55,600 --> 00:06:57,360 And if you recall, that's the size 138 00:06:57,360 --> 00:06:59,230 of each one of the recreation sites 139 00:06:59,230 --> 00:07:01,363 that's recorded in that point dataset. 140 00:07:03,510 --> 00:07:08,010 So this leads us to believe that there's more acreage 141 00:07:08,010 --> 00:07:09,680 in the Southern recreation sites 142 00:07:09,680 --> 00:07:11,880 than there are in the Northern recreation sites. 143 00:07:11,880 --> 00:07:13,640 Again, the darker the purple, 144 00:07:13,640 --> 00:07:16,023 the higher the value of that particular cell. 145 00:07:18,010 --> 00:07:21,950 If we compare the two, the result is even more interesting. 146 00:07:21,950 --> 00:07:24,670 Not surprisingly by sheer numbers 147 00:07:24,670 --> 00:07:27,260 there are more recreation opportunities 148 00:07:27,260 --> 00:07:28,840 in the Burlington area, 149 00:07:28,840 --> 00:07:31,480 but by total acreage we see a definite pattern 150 00:07:31,480 --> 00:07:34,363 that emerges along the spine of the Green Mountains. 151 00:07:37,950 --> 00:07:40,720 If we move over to line density we'll start again 152 00:07:40,720 --> 00:07:43,840 with withholding a population factor. 153 00:07:43,840 --> 00:07:46,130 And in this case, we're using the matter 154 00:07:46,130 --> 00:07:48,843 of Valley trails data set that we've looked at before. 155 00:07:49,710 --> 00:07:52,700 We'll specify a search radius of 5,000 for this 156 00:07:54,850 --> 00:07:57,130 and we can see the output here. 157 00:07:57,130 --> 00:08:00,130 The greater number of trails and close proximity, 158 00:08:00,130 --> 00:08:03,520 the higher the density and the darker the purple 159 00:08:03,520 --> 00:08:06,523 value you see displayed here in the image on the left. 160 00:08:07,360 --> 00:08:09,350 Moving onto our last example, 161 00:08:09,350 --> 00:08:10,860 we'll look at the kernel density 162 00:08:10,860 --> 00:08:13,253 using the recreation sites data set again. 163 00:08:14,290 --> 00:08:17,210 Now in this case, we'll once again 164 00:08:17,210 --> 00:08:20,370 withhold the population specification here. 165 00:08:20,370 --> 00:08:24,020 For our neighborhood note that this can only be a circle 166 00:08:24,020 --> 00:08:27,030 and it's up to the user to define the radius. 167 00:08:27,030 --> 00:08:28,820 I'll set that to 25,000 168 00:08:28,820 --> 00:08:31,990 so it matches the same setting 169 00:08:31,990 --> 00:08:35,180 in the point density calculation we did earlier. 170 00:08:35,180 --> 00:08:38,430 Now, ArcGIS will compute a default radius 171 00:08:38,430 --> 00:08:40,340 by looking at that spatial configuration 172 00:08:40,340 --> 00:08:43,100 of the features and the total number of observations 173 00:08:43,100 --> 00:08:47,110 in your input point or line dataset. 174 00:08:47,110 --> 00:08:50,020 Lastly, like the others the area units 175 00:08:50,020 --> 00:08:52,393 assumes the units of the input dataset. 176 00:08:56,590 --> 00:09:00,320 Here's the output from the kernel density calculation. 177 00:09:00,320 --> 00:09:04,580 We see some pretty distinct clusters of locations 178 00:09:05,870 --> 00:09:08,680 in the raster surface that was produced. 179 00:09:08,680 --> 00:09:10,420 And if we compare the two, 180 00:09:10,420 --> 00:09:12,863 the point density and the kernel density, 181 00:09:13,870 --> 00:09:17,370 generally speaking we see some similar patterns 182 00:09:17,370 --> 00:09:20,350 but note the difference here is that the kernel density 183 00:09:20,350 --> 00:09:23,720 because of the weighting scheme that it applies 184 00:09:23,720 --> 00:09:27,350 really drops off to the low end values here. 185 00:09:27,350 --> 00:09:29,800 The darker the color, the higher the value. 186 00:09:29,800 --> 00:09:32,760 And we can see a lot more of the background 187 00:09:32,760 --> 00:09:33,900 of the State of Vermont 188 00:09:33,900 --> 00:09:36,640 and the kernel density output 189 00:09:36,640 --> 00:09:39,110 than we do in the point density output. 190 00:09:39,110 --> 00:09:41,580 Once again, that's due to that weighting scheme 191 00:09:41,580 --> 00:09:43,830 of the kernel that we apply. 192 00:09:43,830 --> 00:09:46,240 This won't be the last time we see kernels 193 00:09:46,240 --> 00:09:47,770 in our investigation this week. 194 00:09:47,770 --> 00:09:48,983 So keep that in mind. 195 00:09:50,110 --> 00:09:52,550 Well, that's it for density calculations. 196 00:09:52,550 --> 00:09:54,210 Join me in a few minutes to learn more 197 00:09:54,210 --> 00:09:56,903 about geostatistics and interpolation.