Find a method to verify that the semantic markup is the same across the versions (tc, tcn, tl). If it is not, identify and fix the discrepancies.
This would include all tags, like <m>, <tl>, <pa>, etc., and the ordering of nested tags.
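One possible way to check the ordering of nested tags, sketched in Python with the standard library. This is only an illustration, not an agreed method: `tag_paths` is a hypothetical helper and the two sample fragments are made up. It lists every element's path in document order, so two versions with the same tags but different nesting produce different lists. Text content is ignored on purpose, since the versions differ in wording.

```python
import xml.etree.ElementTree as ET


def tag_paths(xml_text):
    """Return the tag path of every element, in document order (e.g. 'ab/m/pa')."""
    root = ET.fromstring(xml_text)
    paths = []

    def walk(el, prefix):
        # Build the path from the root down to this element.
        path = f"{prefix}/{el.tag}" if prefix else el.tag
        paths.append(path)
        for child in el:
            walk(child, path)

    walk(root, "")
    return paths


# Hypothetical fragments: same tags, same counts, different nesting.
a = "<ab><m><pa>rose</pa></m></ab>"
b = "<ab><pa><m>rose</m></pa></ab>"

same_nesting = tag_paths(a) == tag_paths(b)
print(tag_paths(a))
print(tag_paths(b))
print("same nesting:", same_nesting)
```

A count of each element alone would report these two fragments as identical, so a path comparison like this would be needed in addition to any counts. If the files use a namespace, ElementTree reports tags as `{namespace}name`, which does not affect the comparison as long as all versions use the same namespace.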
See also #103
Update, as in #103: @tcatapano to develop an XPath query to get the count of each element for tc, tcn, and tl, then review the counts across all versions and identify discrepancies to be fixed.
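A rough sketch of the counting step, in Python rather than XPath, in case it helps with the review. In XPath 1.0 a single element could be counted with an expression like `count(//m)`; the code below instead walks the tree once and tallies every tag. The function names and the sample fragments are hypothetical, not taken from the actual files.

```python
import xml.etree.ElementTree as ET
from collections import Counter


def count_elements(xml_text):
    """Return a Counter mapping each tag name to its number of occurrences."""
    root = ET.fromstring(xml_text)
    return Counter(el.tag for el in root.iter())


def find_discrepancies(counts_by_version):
    """Return {tag: {version: count}} for tags whose counts differ across versions."""
    all_tags = set()
    for counts in counts_by_version.values():
        all_tags.update(counts)
    diffs = {}
    for tag in sorted(all_tags):
        # A Counter returns 0 for a tag that never occurs in that version.
        per_version = {v: c[tag] for v, c in counts_by_version.items()}
        if len(set(per_version.values())) > 1:
            diffs[tag] = per_version
    return diffs


# Hypothetical sample fragments standing in for the tc, tcn, and tl files.
samples = {
    "tc": "<div><ab><m>sel</m> <pa>rose</pa></ab></div>",
    "tcn": "<div><ab><m>sel</m> <pa>rose</pa></ab></div>",
    "tl": "<div><ab><m>salt</m> rose</ab></div>",
}

counts = {version: count_elements(text) for version, text in samples.items()}
discrepancies = find_discrepancies(counts)
for tag, per_version in discrepancies.items():
    print(tag, per_version)
```

For real files, `ET.parse(path).getroot()` could replace `ET.fromstring`. Matching counts would not prove the markup is identical, since tags can be nested differently, but differing counts point directly at the elements to review.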