Albanian National Corpus

Full Name
Albanian National Corpus
Composer
Maria Morozova, Marina Domosiletskaya, Alexander Rusakov, Ekaterina Bernatskaya, Anastasia Sidko, Anna Konovalenko.
Language
Albanian
Register
Spoken and Written
Genre
Drama
Essay
Poetry
Prose
Style
Formal and Informal
Period
2000-2100 AD
1900-2000 AD
Number of words
10,000,000 - 100,000,000
Number of words (details)
20 million tokens
Annotation
Lemmatisation
Parsing
POS tagging
Tokenization
Annotation remarks

The corpus also allows users to identify case and various other grammatical features.

Format
Online
Data collection
Spontaneous
Availability
Open access