• WELCOME TO PAPERSEEK
PaperSeek is a specialized web search engine that searches the World Wide Web for sites and documents that plagiarize a document you submit: pages that are identical to it, similar to it, or contain only passages from it. This makes it easy to search the whole web for cheating in homework, theses, and publications, and for copyright infringement. That is only one of the reasons why it is frequently used by teachers, professors, and publishers. Our algorithms are designed to be robust and can find the original document with high reliability even if the input document has been modified.
• Currently, our index consists of more than 6,000,000 crawled web sites and text, PDF, and PostScript files, primarily from scientific sources, online encyclopedias, and news sites. Our web-based service is free, and its clean, simple interface makes it easy for anyone to use.
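PaperSeek's actual algorithms are not described on this page. As a rough illustration of how modification-tolerant matching of this kind can work in general, the sketch below uses word shingling with Jaccard and containment scores, a common textbook technique for near-duplicate detection. The function names and the shingle size k=3 are assumptions chosen for the example, not part of PaperSeek.

```python
import re


def shingles(text, k=3):
    """Return the set of k-word shingles (overlapping word windows) in text.

    k=3 is an illustrative choice, not a PaperSeek setting.
    """
    words = re.findall(r"\w+", text.lower())
    if len(words) < k:
        # Texts shorter than k words yield one shingle, or none if empty.
        return {tuple(words)} if words else set()
    return {tuple(words[i:i + k]) for i in range(len(words) - k + 1)}


def jaccard(a, b):
    """Similarity of two shingle sets: shared shingles over all shingles."""
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def containment(query, candidate):
    """Fraction of the query's shingles that also occur in the candidate.

    Useful when the query is only an excerpt of a longer document.
    """
    if not query:
        return 0.0
    return len(query & candidate) / len(query)


if __name__ == "__main__":
    original = "the quick brown fox jumps over the lazy dog near the river bank"
    modified = "the quick brown fox leaps over the lazy dog near the river bank"
    excerpt = "over the lazy dog near the river"

    print(jaccard(shingles(original), shingles(modified)))
    print(containment(shingles(excerpt), shingles(original)))
```

Changing a single word only disturbs the few shingles that overlap it, so a lightly edited copy still scores high, while the containment score flags documents that reuse only a passage. A production system would typically hash or sample the shingles rather than compare full sets.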
• BUSINESS SOLUTIONS
PaperSeek has in-depth expertise in information retrieval, text processing, machine learning, and related research areas. For businesses that want to benefit from our reliable technology, we offer a range of products and custom solutions to problems in these areas. You will find more details under Business Solutions.
• BACKGROUNDS
PaperSeek runs on Linux systems equipped with standard hardware components. All software components are written by our own developers and are designed to be fault-tolerant and to scale well with the number of crawled pages. We have designed our own crawler architecture and, because of our special requirements, our own distributed file system, which can efficiently manage files of up to several exabytes. All components are written in C++ for efficiency.