Byte-Pair Encoding - Byte-Pair Encoding is a subword tokenization algorithm that compresses text by iteratively merging the most frequent adjacent symbol pairs to form a flexible vocabulary.