What is it about?
The paper introduces easy way to compare documents. There are number of factors identified that allow to answer the question how much one document was built by copying content from the other one. The method uses hashtags for coding of words. In the effect a vector of values is produced that can be used for identification algorithm to assess the plagiarism ratio. The method can be customized according to the field and specificity of texts. It also keep anonymity of sources if needed.
Why is it important?
There is lots of different methods to assess plagiarism, however not all of them take in the account the specificity of field of document type. It was the reason to introduce this method, that can be suited according to needs.
Read the Original
This page is a summary of: Features for Text Comparison, Springer Science + Business Media, DOI: 10.1007/978-3-540-68168-7_52.
You can read the full text:
The following have contributed to this page