The EcoCommons currently provides 19 different algorithms across 4 different categories to run your species distribution model:
These models only use occurrence data, and are based on the characterization of the environmental conditions of locations associated with species presence.
|Defines a multi-dimensional environmental space bounded by the minimum and maximum values of environmental variables for all occurrences as the potential range where a species can occur.|
Machine learning models
These models typically use one part of the dataset to ‘learn’ and describe the dataset (training) and the other part to assess the accuracy of the model.
|Artificial Neural Network||A 'black box' model that predicts species occurrence probabilities as a weighted combination of features, which are calculated in a hidden layer form linear combinations of the predictor variables.|
|Boosted Regression Tree / General Boosting Model||Predicts species occurrence probabilities based on a combination of decision trees and boosting. It uses a stagewise procedure to iteratively fit random subsets of the data that are weighted in such a way that new trees take into account the error of previously built trees.|
|Classification Tree||Predicts species occurrence by repeatedly splitting the dataset into mutually exclusive groups based on a threshold value of one of the environmental variables.|
|Maxent||Predicts species occurrences by finding the distribution that is most spread out, or closest to uniform, while taking onto account the limits of the enviornmental variables of known locations.|
|Random Forest||Grows many decision trees based on random subsets of the data and averages the predictions of these trees to estimate the importance of each environmental variable.|
Statistical regression models
These models produce estimates of the effect of different environmental variables on the distribution of a species. These models use all the data available to estimate the parameters of the environmental variables, and construct a function that best describes the effect of these predictors on species occurrence. The suitability of a particular model is often defined by specific model assumptions.
|Flexible Discriminant Analysis||A classification model based on a mixture of linear regression models, which uses optimal scoring to transform the response variable so that the data are in a better form for linear separation, and multiple adaptive regression splines to generate the discriminant surface.|
|Generalized Linear Model||A regression model for data with non-normal distribution, fitted with maximum likelihood estimation.|
|Generalized Additive Model||A multiple regression model that uses smoothed functions of the environmental variables to model non-linear relationships between the response and the predictors.|
|Multivariate Adaptive Regression Splines||A regression model that builds multiple linear regression models across the range of predictor values by partitioning the data and run a linear regression model on each different partition. This allows to model complex relationships between the response and predictor variables.|
These models only use the geographic location of known occurrences of a species to predict the likelihood of presence in other locations, and do not rely on the values of environmental variables.
|Circles||Predicts that a species is present at sites within a certain radius around observed occurrences, and absent beyond that radius.|
|Convex Hull||Predicts that a species is present at sites inside the minimum spatial convex hull around observed occurrences, and absent outside that hull.|
|Geographical Distance||Predicts species occurrences based on the assumption that the closer to a known presence, the more likely it is to find the species.|
|Inverse-Distance Weighted Model||Predicts species occurrence probabilities for unknown locations as the average of values at the nearby known locations weighted by their inverse distance from the unknown location.|
|Voronoi Hull||Predicts that a species is present inside voronoi hulls around observed occurrences, which consist of all points whose distance to the known location is less than equal to its distance to any other known location, and absent outside those hulls.|