Abstract
Spatial predictions, like other supervised learning tasks, require some criterion for a predictor's quality. Typical data-splitting schemes, such as holdouts and $k$ -fold cross-validation, ignore the fact that the training data are usually not available where predictions are being made. The common data-splitting schemes are thus biased estimates of a predictor's performance, which in turn may lead to choosing suboptimal predictors. In this contribution, we borrow ideas from the domain adaptation machine-learning literature, to suggest the importance-weighted source risk (IWSR). IWSR is a principled approach for weighting the prediction risk, which allows the practitioner to explicitly state the target locations for prediction. IWSR essentially consists of down-weighting training locations and up-weighting target locations. We show that, unlike the usual (unweighted) empirical risk, IWSR is an unbiased estimator of the prediction error. Equipped with this risk estimator, we use it to learn a model in the empirical risk minimization framework and to evaluate the existing predictors. We show the superiority of this weighted risk, using both simulated data and an empirical control: air-temperature prediction in France.
Original language | English |
---|---|
Article number | 9166521 |
Pages (from-to) | 5197-5205 |
Number of pages | 9 |
Journal | IEEE Transactions on Geoscience and Remote Sensing |
Volume | 59 |
Issue number | 6 |
DOIs | |
State | Published - Jun 2021 |
Bibliographical note
Publisher Copyright:© 1980-2012 IEEE.
Funding
Manuscript received March 4, 2020; revised May 20, 2020; accepted July 22, 2020. Date of publication August 13, 2020; date of current version May 21, 2021. The author Itai Kloog was supported by the Ministry of Science, Technology and Space, Israel-PRC 2018–2020 Grant. The authors Jonathan D. Rosenblatt and Ron Sarafian were supported by Grants 924/16 and 900/16 from the Israel Science Foundation. (Corresponding author: Ron Sarafian.) Ron Sarafian and Jonathan D. Rosenblatt are with the Department of Industrial Engineering, Ben Gurion University of the Negev, Be’er Sheva 84105, Israel (e-mail: [email protected]; [email protected]).
Funders | Funder number |
---|---|
Ministry of Science, Technology and Space | 900/16, 924/16 |
Israel Science Foundation |
Keywords
- Geospatial analysis
- machine learning algorithms
- remote sensing