library(tidyverse)
library(readxl)
library(ggthemes)
library(sf)
library(units)
library(USAboundaries)
library(rnaturalearth)
library(gghighlight)
library(ggrepel)
library(knitr)
library(rmapshaper)
library(mapview)
library(spatstat)




Points making up the US polygon
Original Simplified
Number of points in polygon 51976 161


I used the ms_simplify() function to reduce the number of points in the polygon outlining the US from 51976 points to only 161 points. Before simplifying, the computations took ~30 - 45 seconds and sometimes caused RStudio to crash After simplifying, the computations take ~4 seconds.


US county tesselations
Tesselation Number of features Mean area Standard deviation Total area
triangulation 6196 1253.691 1578.6475 7766614
voroni 3108 2525.425 2886.6610 7843970
square grid 3108 2553.313 560.6325 7843779
hexagonal grid 2271 3497.054 803.3588 7843892
county 3108 2521.745 3404.3252 7837583






Tesselation traits:


The triangulation tesselation has the most polygon features so when a point-in-polygon analysis is done the results may unclear compared to other tesselations as the points will be dispersed throughout a greater number of distinct polygons. On the other hand, the Voroni and Hexogonal tesselations have fewer polygons and thus may result in a more accurate representation of the number of points within a region of polygons.




Tesselated surfaces and point-in-polygon analysis:


I will be choosing to use the Voroni tessalation going forward as it allows you to see the regions that have the most dams most clearly. The Voroni tesselation also has the benefit of fewer point calculations than the Triangulation tesselation which requires more point calculations, thus Triangulation puts more demand on the computer and takes longer to run. In terms of MAUP, the Triangulation tesselation creates extra areas which are unneccessary and arbitray and do not add to the analysis at hand. I also like the result of the Hexogonal tesselation as it does a good job of showing the number of dams in an area, but not as clearly as the Voroni tesselation






# Inventory of dams in the US
dams = read_excel('../data/NID2019_U.xlsx')

Analysis


Most of the water supply dams are located in dry regions of the country like Southern California, southern/central Texas, and Utah; this makes sense as these regions experience water shortages for extended periods of time, thus having water supplies within dams can make up for the periods of extended water shortages.

Irrigation dams are found in the southwestern, central, and northwestern regions of the US. The most irrigation dams are located in the middle of the country which makes a lot of sense as large amounts of farming occurs in this region, thus the need for irrigation.

Hydroelectric dams are located in central California, the midwest Great Lakes region, and in the northeast along lake Ontario. The presence of Hydroelectric dams in each of these regions makes sense as they are all located on river systems. In central California the hydroelectric dams are located along the Sacramento River, in the midwest the hydroelectric dams are located on the Mississippi River, and in the northeast the dams are located along the region of Lake Ontario which flows out to the ocean. Hydroelectric dams require moving water to operate, hence there locations along river systems. The largest amount of hydroelectric dams are found in one county along Lake Ontario, in which ~60 hydroelectric dams are located in the area.

Most of the recreational dams are located at the end of the Mississippi river around several major metropolitan areas and population centers. It makes sense that this area has many recreational dams as the population centers will use them for recreation.

For all of these specified dams, the Voroni tessellation allows one to see areas were dams of a specific purpose are highly concentrated. For example, in the northeast, the tile along Lake Ontario represents a location were a very high concentration of hydroelectric dams are, if it were a triangulated tessellation, the concentration within that county may be less evident as the dams are located in differing tiles.