PCorpus-class {tm}R Documentation

Permanent Corpus

Description

A class representing a corpus with underlying permanent storage.

Objects from the Class

Objects can be created by calls of the form new("PCorpus",...) or by calling the function PCorpus.

Slots

.Data:
Object of class list holding keys (the document IDs) in order to access the underlying database entries.
CMetaData:
Object of class MetaDataNode containing the corpus specific meta data in form of tag-value pairs and information about children in form of a binary tree. This information is useful for reconstructing meta data after e.g. merging corpora.
DMetaData:
Object of class data.frame containing the document specific meta data for the collection. This data frame typically encompasses clustering or classification results which basically are metadata for documents but form an own entity (e.g., with its name, the value range, etc.).
DBControl:
Object of class list with two named components: dbName holds the path to the permanent database storage, and dbType stores the database type.

Extends

Class list, directly.

Methods

CMetaData
signature(object = "PCorpus"): Returns the corpus specific metadata in form of a tag-value paired list.
DMetaData
signature(object = "PCorpus"): Returns the document specific metadata in form of a data frame.
DBControl
signature(object = "PCorpus"): Returns the database configuration settings.

Author(s)

Ingo Feinerer

See Also

MetaDataNode PCorpus VCorpus


[Package tm version 0.4 Index]