Academic Bots including Lexicography & Plagiarism
Now that ChatGPS is busy writing homework, detecting plagiarism is harder than reading Finnegans Wake. These largely academic bots crawl web content, and are looking at word utilisation, contemporary expressions of new words and phrases, as well as plagiarised content.
Vendor
Bot Service
Recommendation
Description
Universita Delgi Studi Di Milano
BUbiNG
Recommended
Not recommended
BUbiNG bot scrapes the internet for contacts and news articles that could be of interest to the University of Milan and its students.
UNIVERSITÄT LEIPZIG
Corpora Collection
Recommended
Not recommended
The Leipzig Corpora Collection (LCC) is a project of the Natural Language Processing Group of the University of Leipzig. The LCC offers access to monolingual dictionaries in more than 200 languages. The crawler that visited your website is collecting data for this project. The crawled data is used for language documentation and language statistics which are freely available on their website.The crawling is restricted to text. Audio and video material is excluded from the crawling. If such items are crawled they are never stored.
Turnitin
Turnitin
Recommended
Not recommended
Turnitin helps Educational establishments spot plagiarism, and to help educators to provide personalized feedback to students on how to improve their critical thinking and support them in ensuring originality and authorship. This bot collects content from the Internet o support their platform. In particular, it gathers data to allow comparison of student papers against content found on the Internet. It should be a well behaved bot, and if you have content which could be plagiarized you should consider whitelisting this bot.
RWTH Aachen University
ResearchScan
Recommended
Not recommended
Visits by this bot are for the purposes of an Internet-wide research study being conducted by computer scientists at RWTH Aachen University. The research involves making benign connection attempts to every public IP address so they can analyze global patterns and trends in protocol deployment and security.As part of this study, every public IP address receives a handful of packets per day on a selection of common ports. These consist of regular UDP probes and TCP connection attempts followed by RFC-compliant protocol handshakes with responsive hosts. They only receive data that is publicly visible.
Grammarly
Grammar Bot
Recommended
Not recommended
Grammar bot is sent by the team at Grammarly to check web sites for grammatical mistakes.
Fisher Yu
Princeton Bot Image crawler
Recommended
Not recommended
Image big data collection for Princeton academic research purposes
Docoloc
Docoloc
Recommended
Not recommended
A Plagiarism detection service based in Germany. Paid service with private sessions for checking for plagiarized text. Docoloc is partnered with a number of German, Austrian and Swiss Universities and Educational Institutions so if you have content you wish to protect or support academic research in any of these countries you may wish to allow this bot.
Digimarc
Digimarc
Recommended
Not recommended
Digimarc crawls looking for pirated copy contained in e-books and digital documents. The platform has links with enforcement services to aid in combating content theft
Chegg
EasyBib
Recommended
Not recommended
Suite of academic tools to automate citations, also includes grammar and plagiarism tools for academic work
Checksem
Checksem
Recommended
Not recommended
Checksem/Nutch-1.10 is used by the French intelligence research company Check Sim to scrape research content from the Internet for their Knowledge Engineering research.
Camtology
Camont Spider
Recommended
Not recommended
Camtology uses the power of Grid computing to build search intelligence services for science, based on context dependant searches with publications. This bot scrapes images and text, from academic sources from the web and is aiming to build the largest database of N-grams to help validate its statistical model.
Allclasses
Allclasses
Recommended
Not recommended
Allclasses is a search engine for educational classes. It allows users to search, sort, compare, book and share millions of classes covering academics, professional and personal skills.