Date & Time:
November 26, 2019 1:30 pm – 2:30 pm
Location:
Crerar 298, 5730 S. Ellis Ave., Chicago, IL,
11/26/2019 01:30 PM 11/26/2019 02:30 PM America/Chicago MS Presentation: Xi Liang Crerar 298, 5730 S. Ellis Ave., Chicago, IL,

Fast and Reliable Missing Data Contingency Analysis
with Predicate-Constraints

Today, data analysts largely rely on intuition to determine whether
missing or withheld rows of a dataset significantly affect their
analyses. We propose a framework that can produce automatic
contingency analysis, i.e., the range of values an aggregate SQL query
could take, under formal constraints describing the variation and
frequency of missing data tuples. We describe how to process SUM,
COUNT, AVG, MIN, and MAX queries in these conditions resulting in hard
error bounds with testable constraints. We propose an optimization
algorithm based on an integer program that reconciles a set of such
constraints, even if they are overlapping, conflicting, or
unsatisfiable, into such bounds. We also present a novel formulation
of the Fractional Edge Cover problem to account for cases where
constraints span multiple tables. Our experiments on 4 datasets
against several statistical imputation and inference baselines show
that statistical techniques can have a deceptively high error rate
that is often unpredictable. In contrast, our framework offers hard
bounds that are guaranteed to hold if the constraints are not
violated. In spite of these hard bounds, we show competitive accuracy
to statistical baselines.

Xi Liang

M.S. Candidate, University of Chicago

Xi's advisor is Prof. Sanjay Krishnan

Related News & Events

Students posing at competition

UChicago Undergrad Team Places Second Overall In Regionals For World’s Largest Programming Competition

Mar 17, 2023
Haifeng Xu

New CS and DSI Faculty Haifeng Xu Brings Strategic Intelligence to NeurIPS 2022

Nov 28, 2022

UChicago CS Research Finds New Angle on Database Query Processing with Geometry

Nov 08, 2022

Asst. Prof. Aloni Cohen Receives Award For Revealing Flaws in Deidentifying Data

Sep 09, 2022

UChicago Hosts NSF Workshop on Frontiers of Quantum Advantage

Aug 15, 2022

New 2022-23 Faculty Add Expertise in Linguistics, Visualization, Economics, and Data Science Education

Aug 11, 2022

UChicago Co-Leads $10 Million NSF Institute on Foundations of Data Science

Aug 09, 2022

Bill Fefferman Comments on New Standards for Quantum-Proof Cryptography

Jul 07, 2022

UChicago London Colloquium Features Data Science, Quantum Research

Jul 01, 2022

Faculty Bill Fefferman and Chenhao Tan Receive Google Research Scholar Awards

Jun 21, 2022

First-Year PhD Student Co-Authors Outstanding Paper Award Winner at TQC 2022

Apr 28, 2022

Quanta Magazine Features Prof. Bill Fefferman’s Work on Quantum Algorithms

Jan 20, 2022
arrow-down-largearrow-left-largearrow-right-large-greyarrow-right-large-yellowarrow-right-largearrow-right-smallbutton-arrowclosedocumentfacebookfacet-arrow-down-whitefacet-arrow-downPage 1CheckedCheckedicon-apple-t5backgroundLayer 1icon-google-t5icon-office365-t5icon-outlook-t5backgroundLayer 1icon-outlookcom-t5backgroundLayer 1icon-yahoo-t5backgroundLayer 1internal-yellowinternalintranetlinkedinlinkoutpauseplaypresentationsearch-bluesearchshareslider-arrow-nextslider-arrow-prevtwittervideoyoutube