تگ: Outlier Detection in Language Data and Compression