Now that we are using <longToken> in the normalized tokens, we need to remove those tags before we calculate Levenshtein distances in this file: https://github.com/FrankensteinVariorum/fv-postCollation/blob/master/postColl-workspace/edit-distance/extractCollationData.xsl