Learning to unknot
Abstract
We introduce natural language processing into the study of knot theory, as made natural by the braid word representation of knots. We study the UNKNOT problem of determining whether or not a given knot is the unknot. After describing an algorithm to randomly generate N-crossing braids and their knot closures and discussing the induced prior on the distribution of knots, we apply binary classification to the UNKNOT decision problem. We find that the Reformer and shared-QK Transformer network architectures outperform fully-connected networks, though all perform at ≳95% accuracy. Perhaps surprisingly, we find that accuracy increases with the length of the braid word, and that the networks learn a direct correlation between the confidence of their predictions and the degree of the Jones polynomial. Finally, we utilize reinforcement learning (RL) to find sequences of Markov moves and braid relations that simplify knots and can identify unknots by explicitly giving the sequence of unknotting actions. Trust region policy optimization (TRPO) performs consistently well, reducing ≳80% of the unknots with up to 96 crossings we tested to the empty braid word, and thoroughly outperformed other RL algorithms and random walkers. Studying these actions, we find that braid relations are more useful in simplifying to the unknot than one of the Markov moves.
Additional Information
© 2021 The Author(s). Published by IOP Publishing Ltd. Original content from this work may be used under the terms of the Creative Commons Attribution 4.0 license. Any further distribution of this work must maintain attribution to the author(s) and the title of the work, journal citation and DOI. Received 9 November 2020; Accepted 23 February 2021; Published 21 April 2021. We thank Peter Battaglia, Kyle Cranmer, Michael Freedman, Mark Hughes, Ciprian Manolescu, Alex Radovic, Danilo Rezende, and Adam Sikora for useful discussions. The work of S.G. is supported by the U.S. Department of Energy, Office of Science, Office of High Energy Physics, under Award No. DE-SC0011632, and by the National Science Foundation under Grant No. NSF DMS 1664227. J.H. is supported by NSF CAREER grant PHY-1848089. The work of P.S. is supported by the TEAM programme of the Foundation for Polish Science co-financed by the European Union under the European Regional Development Fund (POIR.04.04.00-00-5C55/17-00). Data availability statement: The data that support the findings of this study are available upon reasonable request from the authors.Attached Files
Published - Gukov_2021_Mach._Learn.__Sci._Technol._2_025035.pdf
Submitted - 2010.16263.pdf
Files
Name | Size | Download all |
---|---|---|
md5:6962c2d7cb0cb0865a26dc628de0960b
|
1.3 MB | Preview Download |
md5:d2c23bddecb927dc2aac5f01dfe4f0c9
|
1.2 MB | Preview Download |
Additional details
- Eprint ID
- 106623
- Resolver ID
- CaltechAUTHORS:20201111-131628016
- Department of Energy (DOE)
- DE-SC0011632
- NSF
- DMS-1664227
- NSF
- PHY-1848089
- Foundation for Polish Science
- European Regional Development Fund
- POIR.04.04.00-00-5C55/17-00
- Created
-
2020-11-11Created from EPrint's datestamp field
- Updated
-
2021-06-28Created from EPrint's last_modified field
- Caltech groups
- Walter Burke Institute for Theoretical Physics
- Other Numbering System Name
- CALT-TH
- Other Numbering System Identifier
- 2020-046