Model for representing a text on the basis of the words occurring in it and their number, regardless of the exact position of their occurrence in the text. Information is necessarily lost in the process, but for many tasks this representation is already sufficient.
Source: Glossar. In: Biemann, C., Heyer, G., & Quasthoff, U. (2022). Wissensrohstoff Text: Eine Einführung in das Text Mining. Springer Vieweg. Online: https://www.wortschatz.uni-leipzig.de/public/documents/2022-glossar-wissensrohstoff-text.pdf