Welcome to the Peregrine data-mining project

The data-mining project contains Peregrine and several supporting modules, e.g. for ontologies and datasets. Peregrine is an indexing engine or tagger: a piece of software that can be used to recognize concepts in human readable text, based on a database (thesaurus) of known terms. Multi-word terms are correctly recognized. If terms can represent multiple concepts, Peregrine will attempt to disambiguate them.

Peregrine was originally developed by Martijn Schuemie at the department of Medical Informatics of the Erasmus University Medical Center (EMC) in Rotterdam and has been improved and made into open source in collaboration with NBIC's  BioAssist Engineering Team. Peregrine project has a dual licensing model. Its source code is released under the AGPL license for the open source community. For people who are interested in a closed source and commercial use of Peregrine, please contact  Jan Kors or  ErasmusMC's Technology Transfer Office.

Key applications of the Peregrine system are currently:

If you will be using Peregrine yourself, drop us a line and we will add your application to this list. Also, become a member of the data-mining-users list mentioned below, so that we can keep you up to date with things we're planning to do and answer any questions you may have.

Learning about Peregrine

Please become a member of the data-mining-users mailing list (mentioned below), and if you have any question that is not addressed here feel free to post it to that list.

Getting started guide

Developer documentation

Accessing Peregrine from your browser

We provide a public Peregrine web service at  http://peregrine.nbiceng.net. This service uses Peregrine and the  Intext-semantic package to recognize concepts in the text you can supply. A English language bio-medical ontology is pre-loaded.

Mailing lists

The Peregrine project provides the following mailing lists:

Source access

You can get the source code by running the following Subversion (svn) command:

  svn co https://trac.nbic.nl/svn/data-mining data-mining

Write access is only available to registered developers. You can become a developer by  registering yourself on the NBIC trac system if you haven't already done so, and requesting write access on the  data-mining-admins mailing list.

Other NBIC software projects

Many other NBIC software projects can be accessed from the  project index.