TANIT morphological analyzer
Help
Type of upload
Upload a zip file
Upload one or more text file(s)
Select a zip file
Note: specifying the correct encoding is important; otherwise the document may not be read correctly!
Specify the encoding of the files and select a zip file:
(Default: UTF-8)
Choose files...
Select text file(s) to upload
Note: specifying the correct encoding is important; otherwise the document may not be read correctly!
Specify encoding and select file:
(Default: UTF-8)
Choose files...
Add more
Delete last
Apply stopword list on all actions
Enable stopwords
TANIT uses a standard stopword list by default. To use a custom stopword list, upload it here.
Note: specifying the correct encoding is important; otherwise the file may not be read correctly!
Note: if no file is uploaded but the checkbox stays enabled, the default stopword list is used.
Specify encoding and select stopword file:
(Default: UTF-8)
Choose files...
Select operations to be performed on the document
Preprocessing
Convert to lower case
Replace numbers with NUM
Replace URLs with URL
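The three preprocessing options above could be sketched roughly as follows. This is only an illustration, not TANIT's actual implementation: the function name `preprocess`, the regular expressions, and the order of the steps are all assumptions. Lowercasing is applied first here so that the `NUM` and `URL` placeholders stay in upper case, and URLs are replaced before numbers so that digits inside a URL are not turned into `NUM`.

```python
import re

# Hypothetical patterns; the tool's real definitions of "URL" and "number" are not documented here.
URL_RE = re.compile(r"(?:https?://|www\.)\S+")
NUM_RE = re.compile(r"\d+(?:[.,]\d+)*")


def preprocess(text, lowercase=True, replace_numbers=True, replace_urls=True):
    """Apply the selected preprocessing steps to a string (illustrative sketch)."""
    if lowercase:
        # Lowercase first so the placeholders inserted below stay upper case.
        text = text.lower()
    if replace_urls:
        # Replace URLs before numbers so digits inside a URL are not touched.
        text = URL_RE.sub("URL", text)
    if replace_numbers:
        text = NUM_RE.sub("NUM", text)
    return text


sample = "Visit https://example.com/page2 in 2024, costs 3.5 EUR"
print(preprocess(sample))
```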
Basic linguistic distributions
Distribution of parts of speech
Distribution of lemmas
Lexical richness metrics
TTR
Guiraud's R
Herdan's C
CTTR
Dugast's U
Summer's index
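For orientation, the metrics listed above are commonly defined in terms of the number of tokens N and the number of distinct types V. The sketch below uses these textbook definitions with natural logarithms; whether TANIT counts word forms or lemmas, and which logarithm base it uses, is not stated here, so treat the function name `lexical_richness` and these choices as assumptions. Dugast's U and Summer's index depend on the logarithm base, while the other metrics do not.

```python
import math


def lexical_richness(tokens):
    """Compute common lexical richness metrics from a list of tokens (illustrative sketch)."""
    n = len(tokens)        # N: number of tokens
    v = len(set(tokens))   # V: number of distinct types
    if v < 2:
        raise ValueError("need at least two distinct tokens")
    log_n, log_v = math.log(n), math.log(v)
    return {
        "TTR": v / n,                               # type-token ratio: V / N
        "Guiraud's R": v / math.sqrt(n),            # V / sqrt(N)
        "Herdan's C": log_v / log_n,                # log V / log N
        "CTTR": v / math.sqrt(2 * n),               # corrected TTR: V / sqrt(2N)
        # Dugast's U is undefined when every token is unique (V == N).
        "Dugast's U": log_n ** 2 / (log_n - log_v) if v < n else math.inf,
        "Summer's index": math.log(log_v) / math.log(log_n),  # log log V / log log N
    }


metrics = lexical_richness("a b a c a b d a".split())
for name, value in metrics.items():
    print(f"{name}: {value:.4f}")
```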
Topic modeling
Learn LDA on the given document(s)
Use a pre-trained LDA model
Number of topics:
Number of words per topic:
Number of iterations:
Size of "documents":
Alpha:
Beta:
Thinning:
Burn-in:
List of available models:
hun_press
hun_literature
hun_official
hun_personal
Submit