Glossary

Abbreviation / Term: Abbreviation stands for: Explanation:
BDLSS Bodleian Digital Library Systems and Services Bodleian Libraries homepage.
CATCH programme Continuous Access To Cultural Heritage (NWO programme) CATCH develops generic methods and techniques cutting across the areas of the humanities and computer science, aiming to facilitate an interaction with cultural heritage institutions. Innovation, multidisciplinary collaboration and transferability are essential. Institute homepage.
CEN Catalogus Epistularum Neerlandicarum The Catalogus Epistularum Neerlandicarum contains descriptions of letters from 1600 to present from many Dutch libraries. Catalogue homepage.
CKCC Circulation of Knowledge and Learned Practices in the Seventeenth-Century Dutch Republic A Dutch consortium of universities, research institutes, and cultural heritage institutions  analyzing the scientific information system of letters.
CLARIN-EU Common Language Resources and Technology Infrastructure The CLARIN project is a large-scale pan-European collaborative effort to create, coordinate and make language resources and technology available and readily useable for the whole European Humanities (and Social Sciences) community. Clarin-EU homepage.
CofK Cultures of Knowledge: An Intellectual Geography of the Seventeenth-Century Republic of Letters Collaboration between the Humanities Division and Bodleian Libraries of Oxford University (U.K.) CofK homepage.
DANS Data Archiving and Networked Services DANS is an institute of KNAW and NWO. DANS homepage, KNAW homepage, NWO homepage.
DBNL Digitale Bibliotheek voor de Nederlandse Letteren (Digital Library of Dutch Literature ) DBNL was founded in 1999 and is a website about the Dutch literature, language and cultural history. DBNL homepage.
EAD2000 Encoded Archival Description A very common XML-based standard for encoding archival and library finding aids.
ePistolarium A virtual research environment, developped by the CKCC consortium, that provides for browsing and analysing the letters in the corpus. It allows the user full text searches in combination with a selection based on the metadata of the letters (date, sender, recipient, location of sending, location of receipt).
LDA Latent Dirichlet Allocation A topic modeling method.
LSA Latent Semantic Analysis A topic modeling method.
Lucene Software that provides Java-based indexing and search technology, as well as spellchecking, hit highlighting and advanced analysis/tokenization capabilities. Lucene homepage.
MRofL Mapping the Republic of Letters ‘Mapping of the Republic of Letters’ (MRofL) is based at Stanford University (USA). MRofL has been focusing primarily on visualizing complexity and uncertainty in spatial, temporal and biographical information. MRofL homepage.
NER Named Entity Recognition A subtask of information extraction that seeks to locate and classify atomic elements in text into predefined categories such as the names of persons, organizations, locations, expressions of times, quantities, monetary values, percentages, etc. Source: Wikipedia.
RI Random Indexing A topic modeling method.
Semantic Vectors An open source software package used by the CKCC project for topic modeling. Semantic Vectors homepage.
TEI Text Encoding Initiative A consortium which collectively develops and maintains a standard for the representation of texts in digital form. TEI homepage.
Topic Modeling A type of statistical model for discovering the abstract “topics” that occur in a collection of documents. Models used are for example LDA, LSA and RI (see there).
VARD2 Variant Detector A tool for dealing with spelling variation in historical corpora. VARD homepage.
WMatrix A software tool for corpus analysis and comparison. Keyness analysis on the basis of frequency lists. WMatrix homepage.