1 00:00:01,420 --> 00:00:03,200 - Welcome back to module six. 2 00:00:03,200 --> 00:00:04,920 It's come down to this. 3 00:00:04,920 --> 00:00:07,550 We're at the the last of the four lectures 4 00:00:07,550 --> 00:00:09,570 for this learning module. 5 00:00:09,570 --> 00:00:12,300 And I've saved the best for last. 6 00:00:12,300 --> 00:00:14,750 Well, depending on how you look at it. 7 00:00:14,750 --> 00:00:17,610 What I can guarantee you is this is the most technical 8 00:00:17,610 --> 00:00:19,910 of the approaches that we've looked at so far. 9 00:00:21,750 --> 00:00:26,730 So there was a time when Kriging was essentially 10 00:00:26,730 --> 00:00:28,800 geostatistical analysis. 11 00:00:28,800 --> 00:00:31,750 Now that branch of statistics has expanded 12 00:00:31,750 --> 00:00:35,160 in the recent past to cover a lot more ground 13 00:00:35,160 --> 00:00:36,750 than just Kriging. 14 00:00:36,750 --> 00:00:39,850 But just note that Kriging is the most well-developed 15 00:00:39,850 --> 00:00:44,340 of all of the approaches that we've seen so far. 16 00:00:44,340 --> 00:00:46,610 It's a collection of advanced interpolation 17 00:00:46,610 --> 00:00:48,810 methods that can assess the quality 18 00:00:48,810 --> 00:00:53,010 of the prediction and provide estimated errors. 19 00:00:53,010 --> 00:00:56,820 Now, kriging assumes that the distance or the direction 20 00:00:56,820 --> 00:01:00,250 between sample points reflects spatial autocorrelation 21 00:01:00,250 --> 00:01:04,450 that can be used to explain the variation in the data. 22 00:01:04,450 --> 00:01:07,180 So the spatial variation of the attributes is 23 00:01:07,180 --> 00:01:10,730 neither totally random nor deterministic 24 00:01:10,730 --> 00:01:14,320 but that auto correlation can be expected to explain 25 00:01:14,320 --> 00:01:15,803 some of that variation. 26 00:01:16,730 --> 00:01:20,430 There are three components to a Kriging model, 27 00:01:20,430 --> 00:01:24,270 the spatially correlated component, the drift structure 28 00:01:24,270 --> 00:01:26,170 or trends in the data 29 00:01:26,170 --> 00:01:30,680 something larger than local autocorrelation might capture 30 00:01:30,680 --> 00:01:32,253 and the random error term. 31 00:01:34,290 --> 00:01:37,340 Now, Kriging is a two-step process where first 32 00:01:37,340 --> 00:01:39,600 you need to get to know your data. 33 00:01:39,600 --> 00:01:42,300 Uncover any dependencies that might exist 34 00:01:42,300 --> 00:01:46,000 by using variograms and covariance functions 35 00:01:46,000 --> 00:01:48,120 to estimate the spatial autocorrelation 36 00:01:48,120 --> 00:01:49,573 that's present in the data. 37 00:01:50,580 --> 00:01:53,380 Then once you've uncovered those dependencies 38 00:01:53,380 --> 00:01:56,370 in the dataset, you're able to make predictions 39 00:01:56,370 --> 00:01:59,150 by predicting those unknown values 40 00:01:59,150 --> 00:02:00,913 within some area of interest. 41 00:02:01,800 --> 00:02:04,040 There are two different types of Kriging, 42 00:02:04,040 --> 00:02:06,190 ordinary and universal. 43 00:02:06,190 --> 00:02:09,880 In ordinary approach, this assumes that the constant mean 44 00:02:09,880 --> 00:02:12,000 is an unknown value. 45 00:02:12,000 --> 00:02:15,180 Universal assumes that there's some overriding trend 46 00:02:15,180 --> 00:02:16,520 in the data. 47 00:02:16,520 --> 00:02:19,230 Again, really important to understand the data 48 00:02:19,230 --> 00:02:22,420 that you're using and how you'll be using it 49 00:02:22,420 --> 00:02:24,973 in order to make decisions about these approaches. 50 00:02:27,130 --> 00:02:30,810 Now, the semivariogram is where things 51 00:02:30,810 --> 00:02:32,123 get really complicated. 52 00:02:33,100 --> 00:02:35,010 This is a function that describes the difference 53 00:02:35,010 --> 00:02:39,120 between the samples separated by distance. 54 00:02:39,120 --> 00:02:42,520 So, there's a low variance for small distances 55 00:02:42,520 --> 00:02:46,240 and a larger variance expected at greater distances. 56 00:02:46,240 --> 00:02:50,410 Again, indicator of positive spatial autocorrelation. 57 00:02:50,410 --> 00:02:52,460 This function is fit through the points 58 00:02:52,460 --> 00:02:55,960 of the model that's known as the semivariogram model. 59 00:02:55,960 --> 00:03:00,950 Now, again, this is mapping the distance between every point 60 00:03:00,950 --> 00:03:03,083 and every other point in the data set. 61 00:03:04,340 --> 00:03:06,510 In order to narrow down the computation time 62 00:03:06,510 --> 00:03:07,710 that would be required, 63 00:03:07,710 --> 00:03:11,550 the data can actually be binned into these distance ranges 64 00:03:11,550 --> 00:03:14,300 and we'll see an example of that here in a few minutes. 65 00:03:15,160 --> 00:03:16,980 Now, a few other terms to note 66 00:03:16,980 --> 00:03:19,950 about the semivariogram. There's the range, 67 00:03:19,950 --> 00:03:23,990 this is the distance over which the model flattens out. 68 00:03:23,990 --> 00:03:27,950 The sill, this is the value obtained at the range, 69 00:03:27,950 --> 00:03:32,810 so the Y axis value where that model fit 70 00:03:32,810 --> 00:03:34,253 has started to flatten out. 71 00:03:35,220 --> 00:03:37,550 And then lastly, the nugget, and this is the value 72 00:03:37,550 --> 00:03:41,210 where the semivariogram intercepts the Y axis. 73 00:03:41,210 --> 00:03:43,210 Now it's assumed to be zero 74 00:03:43,210 --> 00:03:45,033 but that's typically not the case. 75 00:03:47,090 --> 00:03:48,860 So, up on top we see an example 76 00:03:49,880 --> 00:03:52,560 of what a series semivariogram would look like 77 00:03:52,560 --> 00:03:54,563 with all the constituent parts labeled. 78 00:03:55,640 --> 00:03:58,290 Down below we see five different options 79 00:03:58,290 --> 00:04:00,350 for the types of curves we could try 80 00:04:00,350 --> 00:04:01,993 to fit through our data. 81 00:04:05,100 --> 00:04:08,020 Once again, we're using that recreational site's dataset 82 00:04:08,020 --> 00:04:11,110 over on the left-hand side, just to point locations 83 00:04:11,110 --> 00:04:13,730 over on the right hand side, graduated symbols 84 00:04:13,730 --> 00:04:16,983 by that acreage attribute contained within the dataset. 85 00:04:18,920 --> 00:04:21,260 Now, I can start out with ordinary kriging 86 00:04:21,260 --> 00:04:24,110 from the spatial analyst toolkit. 87 00:04:24,110 --> 00:04:27,710 Using my recreation sites and that acreage attribute, 88 00:04:27,710 --> 00:04:29,339 I'm inputting point features 89 00:04:29,339 --> 00:04:34,320 and specifying some numeric attribute for my Z value field. 90 00:04:34,320 --> 00:04:36,160 I need to choose between ordinary 91 00:04:36,160 --> 00:04:37,950 and universal for my methods 92 00:04:39,520 --> 00:04:43,343 and then specify the area of the neighborhood itself. 93 00:04:44,620 --> 00:04:46,840 The default lag size in this case, 94 00:04:46,840 --> 00:04:48,410 those bins that I was mentioning 95 00:04:48,410 --> 00:04:52,150 in the previous slide, is equal to the default cell size. 96 00:04:52,150 --> 00:04:55,333 You can of course change that value if you see fit. 97 00:04:57,330 --> 00:04:59,930 And then all of the advanced parameters, 98 00:04:59,930 --> 00:05:02,530 all those other blanks that we see 99 00:05:02,530 --> 00:05:07,110 in the Kriging dialog box are computed internally 100 00:05:07,110 --> 00:05:09,273 if the user doesn't specify those. 101 00:05:11,600 --> 00:05:15,173 Here's the output from my ordinary kriging approach. 102 00:05:18,640 --> 00:05:21,600 If I move on to the universal kriging, again, 103 00:05:21,600 --> 00:05:25,540 all the same parameters that need to be specified 104 00:05:25,540 --> 00:05:28,550 but in this case you choose the semivariogram model 105 00:05:28,550 --> 00:05:31,033 that you would like to produce as well. 106 00:05:32,660 --> 00:05:34,680 Here's the output from that 107 00:05:34,680 --> 00:05:36,973 universal Kriging implementation. 108 00:05:40,630 --> 00:05:43,090 Now, I can take this one step further 109 00:05:44,350 --> 00:05:46,290 and take the more complicated route 110 00:05:46,290 --> 00:05:48,253 by using the geostatistical wizard. 111 00:05:49,730 --> 00:05:52,380 I choose my Kriging, co-kriging option 112 00:05:52,380 --> 00:05:54,250 up in the geostatistical methods 113 00:05:54,250 --> 00:05:56,050 and input my recreation sites 114 00:05:57,040 --> 00:05:59,500 with acreage as my data field. 115 00:05:59,500 --> 00:06:02,750 Note that I have the option here to input a second data set. 116 00:06:02,750 --> 00:06:06,550 So if I know something about an overarching trend 117 00:06:06,550 --> 00:06:10,430 in the data, I can use information from that second dataset 118 00:06:10,430 --> 00:06:14,690 in conjunction with the sample points of the first data set 119 00:06:14,690 --> 00:06:18,433 to produce my raster surface. 120 00:06:22,310 --> 00:06:23,890 I click next and I have options 121 00:06:23,890 --> 00:06:26,130 for doing data transformations here 122 00:06:26,130 --> 00:06:28,883 or specifying my trend removal order. 123 00:06:30,100 --> 00:06:33,190 I'll just leave those for none right now 124 00:06:33,190 --> 00:06:34,603 and click next once again. 125 00:06:36,250 --> 00:06:39,123 Now we see the first look at the semivariogram. 126 00:06:40,370 --> 00:06:44,230 Note that we have the point location 127 00:06:44,230 --> 00:06:48,000 with the search neighborhood identified in that lower image. 128 00:06:48,000 --> 00:06:50,740 The semivariogram is set up then 129 00:06:50,740 --> 00:06:54,023 with all of the data points in the dataset. 130 00:06:54,910 --> 00:06:58,300 I can click that optimize model icon 131 00:06:58,300 --> 00:07:00,460 over on the right hand side 132 00:07:00,460 --> 00:07:04,863 and RTIs will compute some of those advanced metrics for me. 133 00:07:06,780 --> 00:07:09,890 Alternatively, I could calculate 134 00:07:09,890 --> 00:07:14,370 those advanced metrics individually or use functions 135 00:07:14,370 --> 00:07:17,103 within RGIS to calculate those one by one. 136 00:07:18,730 --> 00:07:21,410 For now, I'll just click the optimized mode to take care 137 00:07:21,410 --> 00:07:24,020 of all factors within the model 138 00:07:24,020 --> 00:07:26,883 and now we see the revised semivariogram. 139 00:07:30,180 --> 00:07:31,550 If I click next, 140 00:07:31,550 --> 00:07:34,510 I can get more details about the search neighborhood 141 00:07:34,510 --> 00:07:39,330 for individual locations within the output surface area. 142 00:07:39,330 --> 00:07:41,210 I can click anywhere in this map 143 00:07:41,210 --> 00:07:44,840 RGIS will show me which neighbors have been selected 144 00:07:44,840 --> 00:07:46,950 and the relative weights of each 145 00:07:46,950 --> 00:07:49,593 of those neighbors in the output calculation. 146 00:07:53,010 --> 00:07:56,240 Once again, we see cross validation here 147 00:07:56,240 --> 00:07:58,560 with a pretty mediocre model. 148 00:07:58,560 --> 00:08:00,240 I can look at the table over 149 00:08:00,240 --> 00:08:03,600 on the right hand side to see the predicted value, 150 00:08:03,600 --> 00:08:06,840 the measured value and that error calculation 151 00:08:06,840 --> 00:08:08,790 a little bit further over to the right. 152 00:08:11,750 --> 00:08:15,430 Lastly, I can produce my output that we see over here 153 00:08:15,430 --> 00:08:19,070 on the right hand side and review my method report. 154 00:08:19,070 --> 00:08:20,620 That's all the details that went 155 00:08:20,620 --> 00:08:24,530 into setting up my Kriging operation. 156 00:08:24,530 --> 00:08:26,720 You can refer back to that at any time. 157 00:08:26,720 --> 00:08:28,640 It's sort of like a metadata document 158 00:08:28,640 --> 00:08:31,240 for the data layer that you create. 159 00:08:31,240 --> 00:08:32,410 Just as a reminder, 160 00:08:32,410 --> 00:08:34,820 when you're using the geostatistical wizard, 161 00:08:34,820 --> 00:08:38,000 it produces a geostatistical analyst layer. 162 00:08:38,000 --> 00:08:40,830 If you wanna use that layer in further calculations 163 00:08:40,830 --> 00:08:44,540 say for, Raster calculator or Aecon operation 164 00:08:44,540 --> 00:08:46,520 or even zonal statistics, 165 00:08:46,520 --> 00:08:50,520 you'll need to export that to a raster file format, 166 00:08:50,520 --> 00:08:52,820 right click on the geostatistical analyst layer 167 00:08:52,820 --> 00:08:55,460 of interest, choose export 168 00:08:55,460 --> 00:08:58,540 and I think you know the rest of the way from there. 169 00:08:58,540 --> 00:09:01,510 Well, that does it for this week's lecture materials. 170 00:09:01,510 --> 00:09:04,190 Hope you're not too overwhelmed by what you've just heard. 171 00:09:04,190 --> 00:09:07,610 This is a technical topic within a very technical subject. 172 00:09:07,610 --> 00:09:08,990 So don't beat yourself up too much 173 00:09:08,990 --> 00:09:11,400 if some of this seemed a bit elusive. 174 00:09:11,400 --> 00:09:13,480 I'm certainly happy to discuss this in more detail 175 00:09:13,480 --> 00:09:15,420 with those of you that are interested. 176 00:09:15,420 --> 00:09:16,340 For the rest of you, 177 00:09:16,340 --> 00:09:18,160 it's enough to know these techniques exist 178 00:09:18,160 --> 00:09:21,100 and how to find them if you ever need to employ them. 179 00:09:21,100 --> 00:09:22,580 However, if you're working 180 00:09:22,580 --> 00:09:24,800 in a discipline that employs field sampling methods 181 00:09:24,800 --> 00:09:27,260 for data collection, you should plan to spend some more time 182 00:09:27,260 --> 00:09:30,740 with background materials to dig deeper into the content. 183 00:09:30,740 --> 00:09:33,250 Enjoy the lab assignment and like always 184 00:09:33,250 --> 00:09:34,600 let me know how I can help.