What languages does Cludo support?

The natural language processing at Cludo consists of multiple steps:

  • Tokenization – Splitting a sentence into individual words
  • Elision – Removing elided prefixes, for example in French: l’amour → amour, m’appelle → appelle
  • Stop words – Removing filler words such as a, an, it, is, that, this, me, you, your, etc., as they don’t add any context to the content
  • Stemming – Converting words into their root form, e.g. pilots → pilot, grew → grow, living → live (supporting derivations)
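
The four steps above can be sketched as a small pipeline. This is only an illustration of the general technique, not Cludo's implementation: the stop-word list, the elision prefixes, the suffix rules, and the function names (tokenize, strip_elision, stem, analyze) are all assumptions made for this example, and the toy stemmer does not handle irregular forms such as grew → grow.

```python
import re

# Hypothetical word lists for illustration only, not Cludo's real data.
STOP_WORDS = {"a", "an", "it", "is", "that", "this", "me", "you", "your"}
ELISION_PREFIXES = {"l", "m"}   # French prefixes contracted before a vowel
SUFFIXES = ("ing", "s")         # toy suffix-stripping rules


def tokenize(text):
    """Split text into lowercase words, keeping apostrophes inside a word."""
    return re.findall(r"[^\W\d_]+(?:['’][^\W\d_]+)*", text.lower())


def strip_elision(token):
    """Drop an elided prefix, e.g. l’amour -> amour."""
    prefix, sep, rest = token.replace("’", "'").partition("'")
    if sep and rest and prefix in ELISION_PREFIXES:
        return rest
    return token


def stem(token):
    """Strip the first matching suffix, keeping at least three characters."""
    for suffix in SUFFIXES:
        if token.endswith(suffix) and len(token) - len(suffix) >= 3:
            return token[: -len(suffix)]
    return token


def analyze(text, use_elision=False):
    """Run tokenization, optional elision, stop-word removal, and stemming."""
    tokens = tokenize(text)
    if use_elision:
        tokens = [strip_elision(t) for t in tokens]
    tokens = [t for t in tokens if t not in STOP_WORDS]
    return [stem(t) for t in tokens]


print(analyze("Is this your pilots handbook"))
print(analyze("l’amour", use_elision=True))
```

Elision is switched on per call because it only applies to some languages; a real analyzer would typically select each step based on the language of the content.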

Supported Languages

Language               ISO code   Tokenization   Elision   Stop words   Stemming
Arabic                 ar
Armenian               hy
Basque                 eu
Brazilian              pt-br
Bulgarian              bg
Catalan                ca
Chinese (Simplified)   zh
Czech                  cs
Danish                 da
Dutch                  nl
English                en
Estonian               et
Finnish                fi
French                 fr
Galician               gl
German                 de
Greek                  el
Hindi                  hi
Hungarian              hu
Icelandic              is
Indonesian             id
Irish                  ga
Italian                it
Japanese               jp
Korean                 ko
Latvian                lv
Lithuanian             lt
Norwegian (bokmål)     no
Norwegian (nynorsk)    nn
Persian                fa
Polish                 pl
Portuguese             pt
Romanian               ro
Russian                ru
Serbian                sr
Sorani (Kurdish)       ku
Spanish                es
Swahili                sw
Swedish                sv
Thai                   th
Turkish                tr
Ukrainian              uk
Vietnamese             vi
