About PaperSeek


Document Search Engine

       
About Us

Business Solutions

Press Center

Contact



back to search

choose language
 
      
WELCOME TO PAPERSEEK

PaperSeek is a specialized web search engine which allows you to search the world wide web for web sites and documents which are plagiarisms are equal, similar or just contain pieces of a document you are querying. Therefore, PaperSeek makes it very easy to search the whole web for cheatings in homeworks, thesises and publications and infringement of copyrights. That is only one the reason why it is frequently used by teachers, professors or publishers. Our algorithms are designed to be very robust and are able to find the original document with high reliability even if some modifications have been done to the input document.

Currently, our index consists of more than 6,000,000 crawled web sites, text, pdf and postscript files primarily from scientific sources, online encyclopedias and news sites. Our web based service is free and due to its clean and simple interfaces it can be easily used by everyone.
     
BUSINESS SOLUTIONS

PaperSeek has in-depth knowledge in the research areas Information Retrieval, text processing, machine learning and more and also offers different products and individual solutions for problems from these areas for businesses who want to profit from our reliable technology. Under Business Solutions you will find more details.

BACKGROUNDS

PaperSeek is running on Linux systems equipped with standard hardware components. All software components are written by our own software developers and are designed to be fault-tolerant and to scale very well with the number of crawled pages. We have designed our own crawler architecture and due to our special requirements also our own distributed file system which is able to manage files with sizes up to several Exabytes efficiently. All components are written in C++ for efficiency.
    
 Latest News

 
January 14, 2005: Domain registration and formation of PaperSeek with an initial index of over 6,000,000 web sites and documents

 Our service is primarily used by

   Publishers to detect plagiarism

   Professors to detect cheating in thesises

   Teachers to detect cheating in homeworks

 Our knowledge and experiences

PaperSeek has in-depth knowledge and experiences in the following research areas:

Information Retrieval Machine Learning
Distributed Filesystems Clustering
Text Processing Graph Algorithms

 User comments

"Finally we are able to check dozens of thesises a day and often we find small and sometimes even large pieces in them that have been just copied from other papers."
-- Professor

 Copyright © 2005 PaperSeek - All Rights Reserved