Package org.episteme.social.linguistics
package org.episteme.social.linguistics
Provides mathematical and computational models for natural language
processing, linguistic analysis, and glottochronology. This package
structures human language into constituent units (phonemes, morphemes, words)
and provides analytical tools for sentiment, readability, and evolutionary
divergence.
Key components include:
Language- Structural definition of a languageSentimentAnalyzer- Polarity and emotional detectionReadabilityIndex- Text complexity evaluationLanguageEvolution- Divergence time estimationsDialectMapper- Spatial linguistic variation
-
ClassDescriptionRepresents the interaction of participants communicating.Measures linguistic distance between dialects.Analytical tool for dialect mapping and isogloss analysis.Data point from a linguistic survey at a specific geographical location.Represents a line on a map marking the geographical boundary of a linguistic feature.Represents a specific linguistic variable (phonological, lexical, etc.).Graph structure for tracking word derivations and linguistic ancestry across languages.Interface for loading grammars from formal definition files (PEG, BNF, etc.).Represents the grammar rules for parsing a language.Checks for common grammatical errors.Represents a grapheme - the minimal unit of written language.The IPACodes class defines useful constants to use the International Phonetic Alphabet.Represents a human language as a systematic linguistic entity.Utility class for detecting the language of text samples using statistical methods.Analytical engine for modeling language evolution, divergence, and glottochronology.Standard registry for common world languages.Represents a lexeme—an abstract unit of the lexicon in a language that encompasses all inflected forms of a single word.Represents a lexicon - a dictionary of lexemes with definitions.Universal data model for linguistic analysis.Phonetic and textual distance metrics for linguistic analysis.Constants and enumerations for linguistics.Levels of linguistic analysis.Manner of articulation.Place of articulation.Kinesics (non-verbal communication).Major language families.Grice's Maxims of Pragmatics.Sentence types.Austin's Speech Acts.Writing systems.Universal data model for linguistic analysis.Utility class providing mathematical models used in linguistics, such as Zipf's law for word frequency distribution.Unified entry point for the Episteme Linguistics system.Conducts a full statistical profile of a corpus.Analyzes the readability and complexity of a text.Represents an individual participant in a linguistic interaction.Constants and Selma'o (Lexeme) definitions for the Lojban language.Poetry metric analysis (prosody).Represents a morpheme—the smallest irreducible unit of meaning or grammatical function within a language.Functional and structural classification of morphemes.Morphological analysis for word decomposition.Named Entity Recognizer (NER) for text samples.Universal analyzer for Natural Languages.The Parser interface defines methods that all parsers for languages should implement.Represents a phoneme - the minimal unit of sound in a language.Type of phoneme.Engine for converting text to phonetic representations (IPA).Phonetic transcription to International Phonetic Alphabet (IPA).Represents a phrase—a group of words that function together as a single syntactic unit within a sentence but do not necessarily contain a subject and a predicate (clause). * @version 2.0Functional categories of phrases based on their head word.Standard Part Of Speech (POS) tags for natural and formal languages.Utility tool for calculating various text readability indices.Encapsulates the evaluation of a readability calculation.The Rule class defines a rule used by a grammar to parse a language.The SemanticNetwork class provides a representation for the semantic information that can be extracted from a text.Represents a sentence, a sequence of phrases or words that expresses a complete thought or proposition.Communicative intent (illocutionary force) of the sentence.Complexity categories for sentence structure.Analytical tool for sentiment analysis and emotional polarity detection in text.Qualitative classification of text sentiment.Result of a sentiment analysis operation.Analyzes emotional tone and sentiment of text using keyword valences.Models syntactic complexity of text.Parser for formal grammars and syntax trees.Represents a document or textual passage, composed of a sequence of sentences.Automatic text summarization by sentence extraction.The Translation class provides biological and linguistic translation support.Represents a single act of verbal communication between locutors.Represents a linguistic word, defined as a sequence of one or more morphemes that form a minimal meaningful unit of language. * @version 2.0Standard parts of speech (POS) categories.Stores morphological and semantic analysis of a word.Validates Zipf's Law in a text corpus.