Most ecologists use statistical methods as their main analytical tools when analyzing data to identify relationships between a response and a set of predictors; thus, they treat all analyses as hypothesis tests or exercises in parameter estimation. However, little or no prior knowledge about a system can lead to creation of a statistical model or models that do not accurately describe major sources of variation in the response variable. We suggest that under such circumstances data mining is more appropriate for analysis. In this paper we 1) present the distinctions between data-mining (usually exploratory) analyses and parametric statistical (confirmatory) analyses, 2) illustrate 3 strengths of data-mining tools for generating hypotheses from data, and 3) suggest useful ways in which data mining and statistical analyses can be integrated into a thorough analysis of data to facilitate rapid creation of accurate models and to guide further research.
How to translate text using browser tools
1 September 2007
Data-Mining Discovery of Pattern and Process in Ecological Systems
WESLEY M. HOCHACHKA,
RICH CARUANA,
DANIEL FINK,
ART MUNSON,
MIREK RIEDEWALD,
DARIA SOROKINA,
STEVE KELLING
ACCESS THE FULL ARTICLE
It is not available for individual sale.
This article is only available to subscribers.
It is not available for individual sale.
It is not available for individual sale.
Journal of Wildlife Management
Vol. 71 • No. 7
September 2007
Vol. 71 • No. 7
September 2007
bagging
Data mining
decision trees
Exploratory data analysis
hypothesis generation
machine learning
prediction