Welcome to the new version of CaltechAUTHORS. Login is currently restricted to library staff. If you notice any issues, please email coda@library.caltech.edu
Published December 2016 | Submitted
Journal Article Open

Semantic Spaces

Abstract

Any natural language can be considered as a tool for producing large databases (consisting of texts, written, or discursive). This tool for its description in turn requires other large databases (dictionaries, grammars etc.). Nowadays, the notion of database is associated with computer processing and computer memory. However, a natural language resides also in human brains and functions in human communication, from interpersonal to intergenerational one. We discuss in this survey/research paper mathematical, in particular geometric, constructions, which help to bridge these two worlds. In particular, in this paper we consider the Vector Space Model of semantics based on frequency matrices, as used in Natural Language Processing. We investigate underlying geometries, formulated in terms of Grassmannians, projective spaces, and flag varieties. We formulate the relation between vector space models and semantic spaces based on semic axes in terms of projectability of subvarieties in Grassmannians and projective spaces. We interpret Latent Semantics as a geometric flow on Grassmannians. We also discuss how to formulate Gärdenfors' notion of "meeting of minds" in our geometric setting.

Additional Information

© 2016 Springer International Publishing. Received: 27 May 2016; Accepted: 4 October 2016; Published online: 27 October 2016.

Attached Files

Submitted - 1605.04238v1.pdf

Files

1605.04238v1.pdf
Files (290.2 kB)
Name Size Download all
md5:75075027440cbd4726031bc258c949c2
290.2 kB Preview Download

Additional details

Created:
August 22, 2023
Modified:
March 5, 2024