Barrier Certificates for Assured Machine Teaching
Abstract
Machine teaching can be viewed as optimal control for learning. Given a learner's model, machine teaching aims to determine the optimal training data to steer the learner towards a target hypothesis. In this paper, we are interested in providing assurances for machine teaching algorithms using control theory. In particular, we study a well-established learner's model in the machine teaching literature that is captured by the local preference over a version space. We interpret the problem of teaching a preference-based learner as solving a partially observable Markov decision process (POMDP). We then show that the POMDP formulation can be cast as a special hybrid system, i.e., a discrete-time switched system. Subsequently, we use barrier certificates to verify set-theoric properties of this special hybrid system. We show how the computation of the barrier certificate can be decomposed and numerically implemented as the solution to a sum-of-squares (SOS) program. For illustration, we show how the proposed framework based on control theory can be used to verify the teaching performance of two well-known machine teaching methods.
Additional Information
© 2019 AACC. This work was supported by AFOSR FA9550-19-1-0005, DARPA D19AP00004, NSF 1646522 and NSF 1652113.Attached Files
Submitted - 1810.00093.pdf
Files
Name | Size | Download all |
---|---|---|
md5:d82e0c2aaa7d1283e45c4e048438ac6b
|
931.9 kB | Preview Download |
Additional details
- Eprint ID
- 92662
- DOI
- 10.48550/arXiv.1810.00093
- Resolver ID
- CaltechAUTHORS:20190205-102249789
- Air Force Office of Scientific Research (AFOSR)
- FA9550-19-1-0005
- Defense Advanced Research Projects Agency (DARPA)
- D19AP00004
- NSF
- CNS-1646522
- NSF
- CNS-1652113
- Created
-
2019-02-05Created from EPrint's datestamp field
- Updated
-
2023-06-02Created from EPrint's last_modified field