ModErn Text Analysis
META Enumerates Textual Applications
Classes | Namespaces | Functions
corpus.h File Reference
#include <memory>
#include <mutex>
#include <stdexcept>
#include "cpptoml.h"
#include "meta/config.h"
#include "meta/corpus/document.h"
#include "meta/corpus/metadata_parser.h"
#include "meta/meta.h"
#include "meta/parallel/thread_pool.h"
#include "meta/util/optional.h"
#include "meta/util/progress.h"

Go to the source code of this file.


class  meta::corpus::corpus
 Provides interface to with multiple corpus input formats. More...
class  meta::corpus::corpus_exception
 Basic exception for corpus interactions. More...


 The ModErn Text Analysis toolkit is a suite of natural language processing, classification, information retrieval, data mining, and other applications of text processing.
 Various ways to convert corpus formats into META-readable documents.


template<class LocalStorage , class ConsumeFunction >
void meta::corpus::parallel_consume (corpus &docs, parallel::thread_pool &pool, LocalStorage &&ls_fn, ConsumeFunction &&consume_fn)
 Consumes each document in a corpus using a pool of threads. More...

Detailed Description

Sean Massung

All files in META are dual-licensed under the MIT and NCSA licenses. For more details, consult the file and LICENSE.ncsa in the root of the project.