Abbreviation / Term: | Abbreviation stands for: | Explanation: |
BDLSS | Bodleian Digital Library Systems and Services | Bodleian Libraries homepage. |
CATCH programme | Continuous Access To Cultural Heritage (NWO programme) | CATCH develops generic methods and techniques cutting across the areas of the humanities and computer science, aiming to facilitate an interaction with cultural heritage institutions. Innovation, multidisciplinary collaboration and transferability are essential. Institute homepage. |
CEN | Catalogus Epistularum Neerlandicarum | The Catalogus Epistularum Neerlandicarum contains descriptions of letters from 1600 to present from many Dutch libraries. Catalogue homepage. |
CKCC | Circulation of Knowledge and Learned Practices in the Seventeenth-Century Dutch Republic | A Dutch consortium of universities, research institutes, and cultural heritage institutions analyzing the scientific information system of letters. |
CLARIN-EU | Common Language Resources and Technology Infrastructure | The CLARIN project is a large-scale pan-European collaborative effort to create, coordinate and make language resources and technology available and readily useable for the whole European Humanities (and Social Sciences) community. Clarin-EU homepage. |
CofK | Cultures of Knowledge: An Intellectual Geography of the Seventeenth-Century Republic of Letters | Collaboration between the Humanities Division and Bodleian Libraries of Oxford University (U.K.) CofK homepage. |
DANS | Data Archiving and Networked Services | DANS is an institute of KNAW and NWO. DANS homepage, KNAW homepage, NWO homepage. |
DBNL | Digitale Bibliotheek voor de Nederlandse Letteren (Digital Library of Dutch Literature ) | DBNL was founded in 1999 and is a website about the Dutch literature, language and cultural history. DBNL homepage. |
EAD2000 | Encoded Archival Description | A very common XML-based standard for encoding archival and library finding aids. |
ePistolarium | A virtual research environment, developped by the CKCC consortium, that provides for browsing and analysing the letters in the corpus. It allows the user full text searches in combination with a selection based on the metadata of the letters (date, sender, recipient, location of sending, location of receipt). | |
LDA | Latent Dirichlet Allocation | A topic modeling method. |
LSA | Latent Semantic Analysis | A topic modeling method. |
Lucene | Software that provides Java-based indexing and search technology, as well as spellchecking, hit highlighting and advanced analysis/tokenization capabilities. Lucene homepage. | |
MRofL | Mapping the Republic of Letters | ‘Mapping of the Republic of Letters’ (MRofL) is based at Stanford University (USA). MRofL has been focusing primarily on visualizing complexity and uncertainty in spatial, temporal and biographical information. MRofL homepage. |
NER | Named Entity Recognition | A subtask of information extraction that seeks to locate and classify atomic elements in text into predefined categories such as the names of persons, organizations, locations, expressions of times, quantities, monetary values, percentages, etc. Source: Wikipedia. |
RI | Random Indexing | A topic modeling method. |
Semantic Vectors | An open source software package used by the CKCC project for topic modeling. Semantic Vectors homepage. | |
TEI | Text Encoding Initiative | A consortium which collectively develops and maintains a standard for the representation of texts in digital form. TEI homepage. |
Topic Modeling | A type of statistical model for discovering the abstract “topics” that occur in a collection of documents. Models used are for example LDA, LSA and RI (see there). | |
VARD2 | Variant Detector | A tool for dealing with spelling variation in historical corpora. VARD homepage. |
WMatrix | A software tool for corpus analysis and comparison. Keyness analysis on the basis of frequency lists. WMatrix homepage. |