CuratorCR first compares the file's name with the reference Title. If the two are the same or similar, you will receive a "Matched by PDF file name" score.
If the file's name has nothing in common with the reference Title, CuratorCR checks the first 1000 characters of the file's content for matches with the Author, Abstract, and Title fields. If matches are found, you will receive a "Matched by Title/Author/Abstract" score.
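The two checks described above can be sketched roughly as follows. This is only an illustration of the order of the checks, not CuratorCR's actual code: the function names, the 0.8 similarity threshold, the "No match" label, and the use of plain substring search on the content are all assumptions made for the example.

```python
from difflib import SequenceMatcher
from pathlib import Path

SIMILARITY_THRESHOLD = 0.8  # hypothetical cut-off, not a CuratorCR setting
CONTENT_WINDOW = 1000       # first 1000 characters of the file's content


def similarity(a: str, b: str) -> float:
    """Return a 0..1 similarity ratio between two strings, ignoring case."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def match_file(file_path: str, content: str, reference: dict) -> str:
    """Classify a file against a reference using the two-stage check."""
    # Stage 1: compare the file name (without its extension) to the Title.
    stem = Path(file_path).stem.replace("_", " ").replace("-", " ")
    if similarity(stem, reference["title"]) >= SIMILARITY_THRESHOLD:
        return "Matched by PDF file name"

    # Stage 2: look for the Title, Author, or Abstract text in the
    # opening characters of the content (simplified to substring search).
    window = content[:CONTENT_WINDOW].lower()
    for field in ("title", "author", "abstract"):
        value = reference.get(field, "").lower()
        if value and value in window:
            return "Matched by Title/Author/Abstract"

    return "No match"  # hypothetical label for the fall-through case


reference = {
    "title": "Deep Learning for Protein Folding",
    "author": "Smith",
    "abstract": "We study neural networks for structure prediction.",
}
by_name = match_file("Deep_Learning_for_Protein_Folding.pdf", "", reference)
by_content = match_file("scan0001.pdf", "Deep Learning for Protein Folding\nJ. Smith", reference)
```

Here `by_name` is decided by the file name alone, while `by_content` falls through to the content check because "scan0001" has nothing in common with the Title.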
Note: PDF files contain hidden formatting code that the reader cannot see but that computer systems rely on. Even if the text in the file looks exactly the same as the reference in CuratorCR, it is not necessarily an exact match. The matching score can therefore be low, but you can compare the reference and the PDF at the last stage of file matching, before attaching the file.
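As a rough illustration of why text that looks identical may not compare as equal, the sketch below uses a title containing a ligature, a non-breaking space, and a zero-width space. These particular characters and the `normalize_text` helper are assumptions chosen for the example; they are not a description of what CuratorCR does internally.

```python
import unicodedata


def normalize_text(text: str) -> str:
    """Fold look-alike characters so visually equal strings compare equal."""
    # NFKC folds compatibility characters: the "fi" ligature becomes the
    # two letters "fi", and a non-breaking space becomes a regular space.
    folded = unicodedata.normalize("NFKC", text)
    # Drop invisible format characters (Unicode category Cf), such as
    # zero-width spaces and soft hyphens.
    visible = "".join(ch for ch in folded if unicodedata.category(ch) != "Cf")
    # Collapse any runs of whitespace into single spaces.
    return " ".join(visible.split())


reference_title = "Efficient file matching"
# Looks the same on screen, but contains U+FB01 (ligature),
# U+00A0 (non-breaking space), and U+200B (zero-width space).
extracted_title = "Ef\ufb01cient\u00a0file match\u200bing"

raw_equal = extracted_title == reference_title
normalized_equal = normalize_text(extracted_title) == reference_title
```

In this example `raw_equal` is false and `normalized_equal` is true: the two titles differ only in characters a reader would not notice.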