Class LinguisticAnalysis

java.lang.Object
org.episteme.social.linguistics.analysis.LinguisticAnalysis

public final class LinguisticAnalysis extends Object
Provides modern NLP (Natural Language Processing) analysis capabilities. Includes tokenization, n-gram extraction, and readability metrics.
Since:
1.0
Author:
Silvere Martin-Michiellot, Gemini AI (Google DeepMind)
  • Method Details

    • tokenize

      public static List<String> tokenize(String text)
      Splits text into individual word tokens.
    • extractNGrams

      public static List<String> extractNGrams(List<String> tokens, int n)
      Extracts N-grams from a list of tokens.
    • fleschReadingEase

      public static double fleschReadingEase(String text)
      Calculates the Flesch Reading Ease score. Score = 206.835 - 1.015 * (total_words / total_sentences) - 84.6 * (total_syllables / total_words)
    • countSyllables

      public static int countSyllables(String word)
      Simple heuristic for counting syllables in English.