Abstract
Many practical scenarios make it necessary to evaluate top-k queries over data items with partially unknown values. This paper considers a setting where the values are taken from a numerical domain, and where some partial order constraints are given over known and unknown values: under these constraints, we assume that all possible worlds are equally likely. Our work is the first to propose a principled scheme to derive the value distributions and expected values of unknown items in this setting, with the goal of computing estimated top-k results by interpolating the unknown values from the known ones. We study the complexity of this general task, and show tight complexity bounds, proving that the problem is intractable, but can be tractably approximated. We then consider the case of tree-shaped partial orders, where we show a constructive PTIME solution. We also compare our problem setting to other top-k definitions on uncertain data.
Original language | English |
---|---|
Title of host publication | 20th International Conference on Database Theory, ICDT 2017 |
Editors | Giorgio Orsi, Michael Benedikt |
Publisher | Schloss Dagstuhl- Leibniz-Zentrum fur Informatik GmbH, Dagstuhl Publishing |
ISBN (Electronic) | 9783959770248 |
DOIs | |
State | Published - 1 Mar 2017 |
Event | 20th International Conference on Database Theory, ICDT 2017 - Venice, Italy Duration: 21 Mar 2017 → 24 Mar 2017 |
Publication series
Name | Leibniz International Proceedings in Informatics, LIPIcs |
---|---|
Volume | 68 |
ISSN (Print) | 1868-8969 |
Conference
Conference | 20th International Conference on Database Theory, ICDT 2017 |
---|---|
Country/Territory | Italy |
City | Venice |
Period | 21/03/17 → 24/03/17 |
Bibliographical note
Publisher Copyright:© Antoine Amarilli, Yael Amsterdamer, Tova Milo, and Pierre Senellart; licensed under Creative Commons License CC-BY 20th International Conference on Database Theory (ICDT 2017).
Funding
This work is partially supported by the European Research Council under the FP7, ERC grant MoDaS, agreement 291071, by a grant from the Blavatnik Interdisciplinary Cyber Research Center, by the Israel Science Foundation (grant No. 1157/16), and by the Télécom ParisTech Research Chair on Big Data and Market Insights.
Funders | Funder number |
---|---|
Blavatnik Interdisciplinary Cyber Research Center | |
Seventh Framework Programme | 291071 |
European Commission | |
Israel Science Foundation | 1157/16 |
Keywords
- Crowdsourcing
- Interpolation
- Partial order
- Uncertainty
- Unknown values