Understanding your Similarity Report
How is the Similarity Score calculated?
To calculate the Similarity Score, iThenticate scans your submitted document’s text, and checks it against each of the repositories you’ve chosen. The system takes the number of matching words found within the document and divides it by the document’s total word count to produce the Similarity Score percentage for the report.
If you apply exclusion options to the document, the system removes all matches for the exclusion option logic and recalculates the Similarity Score percentage.
Learn more about exclusion settings when setting up a new folder, editing filters and exclusions in existing folders, filters and exclusions within the Similarity Report, and URL filters for account administrators.
How to interpret the Similarity Report
iThenticate does not check for plagiarism - it checks for similarity. Where a section of the submission’s content is similar or identical to one or more sources, it will be flagged for review. This doesn’t automatically mean plagiarism, however - just similarity.
It’s perfectly natural for a submission to match against some sources in the database. A high degree of overlap may indicate a well-researched document with many references to existing work, and as long as these sources are quoted and referenced correctly, this is perfectly acceptable. A high degree of overlap may also be present where an author has already shared their work on a preprint repository. If the author(s) are the same, this is not a problem.
It’s important that you don’t set a Similarity Score over which you automatically reject manuscripts - where there’s a high degree of overlap it’s important that your editors and reviewers decide if the match is acceptable or not, as part of their general review process.
Similarity Reports and preprints
It is entirely possible (and acceptable) for an author to submit an article to a journal even though they’ve previously made the article available as a preprint. In this case, we expect a high degree of similarity between the preprint and author’s submitted manuscript.
Therefore, if you find a high degree of similarity between a manuscript you’re checking in iThenticate and a preprint by the same author(s), this is likely to be because the manuscript is a match with its own preprint. However, if the manuscript and preprint do not have the same author(s), this may indicate a problem, and you should investigate further.
Some preprints can be found in iThenticate’s Crossref Posted Content repository, so take this into account if you are checking against this repository.
Even if you have excluded the Crossref Posted Content repository in your settings, it is still possible for preprints to appear as matches to a submission, because iThenticate also crawls preprint repositories on the web.
Since Crossref Posted Content is not the only source of preprints in iThenticate, excluding this repository from your reports will not guarantee that all preprints will be excluded from your results. Check any exact match results to see if they are indeed a match with a preprint repository.
Either way, we recommend including preprints in your results to ensure you are checking that preprints haven’t been plagiarised.