PCorpus {tm} | R Documentation |
Constructs a permanent corpus.
PCorpus(object, readerControl = list(reader = object@DefaultReader, language = "eng"), dbControl = list(dbName = "", dbType = "DB1"), ...)
object |
A Source object. |
readerControl |
A list with the named components reader
representing a reading function capable of handling the file format
found in object , and language giving the text's language
(preferably in ISO 639-2 format). |
dbControl |
A list with the named components dbName
giving the filename holding the sourced out objects (i.e., the
database), and dbType holding a valid database type as
supported by package filehash. Under activated database
support the tm package tries to keep as few as possible
resources in memory under usage of the database. |
... |
Optional arguments for the reader . |
An S4 object of class PCorpus
which extends the class
list
containing a permanent corpus.
Ingo Feinerer
txt <- system.file("texts", "txt", package = "tm") ## Not run: PCorpus(DirSource(txt), dbControl = list(dbName = "myDB.db", dbType = "DB1"))