#541: qc of comments csv

opened by tcatapano

see also #345

check that ids in comments table correspond to comment elements in ms-xml, and vice versa


tcatapano commented:

@njr2128, @thuchacz using this XPath expression to search files under ms-xml:

//comment[ not(. = doc('../../metadata/comment_ids_srt.xml')//id )] (that is, comment elements whose text does not equal the text of an id element in the file metadata/commentidssrt.xml -- which I generated from metadata/DCE_comment-tracking-Tracking.csv)

I get these ids used in the xml files which are not in DCE_comment-tracking-Tracking.csv

c156v1 c157r1 c002v06 c009r03 c013v02 c043r01 c053r02 c055r04 c055r05 c108r02 c118r011 c123r02 c148v01 c160r01 c162r08 p002r2 p009r1 p009r_2

You can view the HTML report here:

https://htmlpreview.github.io/?https://github.com/cu-mkp/m-k-manuscript-data/blob/master/qc/commentidsnotintable.html

(also in repo here: https://github.com/cu-mkp/m-k-manuscript-data/blob/master/qc/commentidsnotintable.html)