Welcome to the new version of CaltechAUTHORS. Login is currently restricted to library staff. If you notice any issues, please email coda@library.caltech.edu
Published February 28, 2022 | Submitted
Report Open

Automatic Synthesis of Diverse Weak Supervision Sources for Behavior Analysis

Abstract

Obtaining annotations for large training sets is expensive, especially in behavior analysis settings where domain knowledge is required for accurate annotations. Weak supervision has been studied to reduce annotation costs by using weak labels from task-level labeling functions to augment ground truth labels. However, domain experts are still needed to hand-craft labeling functions for every studied task. To reduce expert effort, we present AutoSWAP: a framework for automatically synthesizing data-efficient task-level labeling functions. The key to our approach is to efficiently represent expert knowledge in a reusable domain specific language and domain-level labeling functions, with which we use state-of-the-art program synthesis techniques and a small labeled dataset to generate labeling functions. Additionally, we propose a novel structural diversity cost that allows for direct synthesis of diverse sets of labeling functions with minimal overhead, further improving labeling function data efficiency. We evaluate AutoSWAP in three behavior analysis domains and demonstrate that AutoSWAP outperforms existing approaches using only a fraction of the data. Our results suggest that AutoSWAP is an effective way to automatically generate labeling functions that can significantly reduce expert effort for behavior analysis.

Additional Information

We thank Adith Swaminathan of Microsoft Research and Pietro Perona of Caltech for their invaluable feedback and helpful discussions regarding this work. We also thank Microsoft Research for the compute resources for our experiments. This work is partially supported by NSF Award #1918839 (YY) and NSERC Award #PGSD3-532647-2019 (JJS).

Attached Files

Submitted - 2111.15186.pdf

Files

2111.15186.pdf
Files (7.3 MB)
Name Size Download all
md5:77a70e96227661b1cd402e0c0d196271
7.3 MB Preview Download

Additional details

Created:
August 20, 2023
Modified:
October 23, 2023