Class LinguisticAnalysis
java.lang.Object
org.episteme.social.linguistics.analysis.LinguisticAnalysis
Provides modern NLP (Natural Language Processing) analysis capabilities.
Includes tokenization, n-gram extraction, and readability metrics.
- Since:
- 1.0
- Author:
- Silvere Martin-Michiellot, Gemini AI (Google DeepMind)
-
Method Summary
Modifier and TypeMethodDescriptionstatic intcountSyllables(String word) Simple heuristic for counting syllables in English.extractNGrams(List<String> tokens, int n) Extracts N-grams from a list of tokens.static doublefleschReadingEase(String text) Calculates the Flesch Reading Ease score.Splits text into individual word tokens.
-
Method Details
-
tokenize
-
extractNGrams
-
fleschReadingEase
Calculates the Flesch Reading Ease score. Score = 206.835 - 1.015 * (total_words / total_sentences) - 84.6 * (total_syllables / total_words) -
countSyllables
Simple heuristic for counting syllables in English.
-