ModErn Text Analysis
META Enumerates Textual Applications
analyzer.h File Reference
#include <memory>
#include <stdexcept>
#include "meta/analyzers/featurizer.h"
#include "meta/config.h"


Classes

class  meta::analyzers::analyzer
 A class that provides a framework to produce token counts from documents.
 
class  meta::analyzers::analyzer_exception
 Basic exception for analyzer interactions.
 

Namespaces

 meta
 The ModErn Text Analysis toolkit is a suite of natural language processing, classification, information retrieval, data mining, and other applications of text processing.
 
 meta::corpus
 Various ways to convert corpus formats into META-readable documents.
 
 meta::analyzers
 Contains various ways to segment text and deal with preprocessed files (POS tags, parse trees, etc.).
 

Functions

std::unique_ptr< analyzer > meta::analyzers::load (const cpptoml::table &config)
 
std::unique_ptr< token_stream > meta::analyzers::default_filter_chain (const cpptoml::table &config)
 
std::unique_ptr< token_stream > meta::analyzers::default_unigram_chain (const cpptoml::table &config)
 
std::unique_ptr< token_stream > meta::analyzers::load_filters (const cpptoml::table &global, const cpptoml::table &config)
 
std::unique_ptr< token_stream > meta::analyzers::load_filter (std::unique_ptr< token_stream > src, const cpptoml::table &config)
 
std::string meta::analyzers::get_content (const corpus::document &doc)
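
The signatures above suggest a chain-of-ownership pattern: each filter takes its source as a std::unique_ptr< token_stream > and returns a new std::unique_ptr< token_stream > that wraps it. The following is a minimal, self-contained sketch of that pattern only. It does not use or reproduce META's implementation: every name ending in _sketch is hypothetical, and the interface (set_content, has_next, next) is an assumption made for illustration.

```cpp
#include <cassert>
#include <cctype>
#include <map>
#include <memory>
#include <sstream>
#include <string>
#include <utility>

// Hypothetical stand-in for a token stream; NOT META's actual interface.
class token_stream_sketch
{
  public:
    virtual ~token_stream_sketch() = default;
    virtual void set_content(const std::string& content) = 0;
    virtual bool has_next() const = 0;
    virtual std::string next() = 0;
};

// Hypothetical source of the chain: splits the content on whitespace.
class whitespace_tokenizer_sketch : public token_stream_sketch
{
  public:
    void set_content(const std::string& content) override
    {
        stream_.str(content);
        stream_.clear(); // reset EOF/fail flags left over from earlier content
        advance();
    }

    bool has_next() const override { return has_token_; }

    std::string next() override
    {
        std::string current = std::move(pending_);
        advance();
        return current;
    }

  private:
    // Reads one token ahead so has_next() can stay const.
    void advance() { has_token_ = static_cast<bool>(stream_ >> pending_); }

    std::istringstream stream_;
    std::string pending_;
    bool has_token_ = false;
};

// Hypothetical filter: owns its source and lowercases every token from it.
class lowercase_filter_sketch : public token_stream_sketch
{
  public:
    explicit lowercase_filter_sketch(std::unique_ptr<token_stream_sketch> src)
        : src_{std::move(src)}
    {
        assert(src_ != nullptr);
    }

    void set_content(const std::string& content) override
    {
        src_->set_content(content);
    }

    bool has_next() const override { return src_->has_next(); }

    std::string next() override
    {
        std::string token = src_->next();
        for (char& c : token)
            c = static_cast<char>(std::tolower(static_cast<unsigned char>(c)));
        return token;
    }

  private:
    std::unique_ptr<token_stream_sketch> src_;
};

// Builds a two-stage chain and counts the tokens it produces for one text.
std::map<std::string, int> count_tokens_sketch(const std::string& text)
{
    std::unique_ptr<token_stream_sketch> chain
        = std::make_unique<whitespace_tokenizer_sketch>();
    // Wrapping transfers ownership of the previous stage into the new one.
    chain = std::make_unique<lowercase_filter_sketch>(std::move(chain));

    chain->set_content(text);
    std::map<std::string, int> counts;
    while (chain->has_next())
        ++counts[chain->next()];
    return counts;
}
```

Calling count_tokens_sketch on a string returns a map from each lowercased, whitespace-delimited token to its count. Passing the source by std::unique_ptr makes ownership explicit: once a stream is wrapped, only the outermost stage of the chain can be used, which is one plausible reason for the shape of the load_filter signature.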
 

Detailed Description

Author
Sean Massung
Chase Geigle

All files in META are dual-licensed under the MIT and NCSA licenses. For more details, consult the files LICENSE.mit and LICENSE.ncsa in the root of the project.