What languages does Cludo support?

The natural language processing at Cludo consists of multiple steps:

  • Tokenization – Splitting a sentence into individual words
  • Elision – Removing elisions; For example, in French: l’amour → amour, m’appelle → appelle
  • Stop words – Remove fill words such as a, an, it, is, that, this, me, you, your, etc., as they don’t provide any context to the content
  • Stemming – Convert words into their root form, e.g. pilots→pilot, grew→grow, living→live (supporting derivations)

Supported Languages

LanguageISO codeTokenizationElisionStop wordsStemming
Arabicar✅✅✅✅
Armenianhy✅✅✅✅
Basqueeu✅✅✅✅
Brazilianpt-br✅✅✅✅
Bulgarianbg✅✅✅✅
Catalanca✅✅✅✅
Chinese (Simplified)zh✅✅✅✅
Czechcs✅✅✅✅
Danishda✅✅✅✅
Dutchnl✅✅✅✅
Englishen✅✅✅✅
Estonianet✅❌❌❌
Finnishfi✅✅✅✅
Frenchfr✅✅✅✅
Galiciangl✅✅✅✅
Germande✅✅✅✅
Greekel✅✅✅✅
Hindihi✅✅✅✅
Hungarianhu✅✅✅✅
Icelandicis✅❌❌❌
Indonesianid✅✅✅✅
Irishga✅✅✅✅
Italianit✅✅✅✅
Japanesejp✅✅✅✅
Koreanko✅❌❌❌
Latvianlv✅✅✅✅
Lithuanianlt✅✅✅✅
Norwegian (bokmål)no✅✅✅✅
Norwegian (nynorsk)nn✅✅✅✅
Persianfa✅✅✅❌
Polishpl✅✅✅✅
Portuguesept✅✅✅✅
Romanianro✅✅✅✅
Russianru✅✅✅✅
Serbiansr✅❌❌❌
Sorani (Kurdish)ku✅✅✅✅
Spanishes✅✅✅✅
Swahilisw✅❌❌❌
Swedishsv✅✅✅✅
Thaith✅✅✅❌
Turkishtr✅✅✅✅
Ukrianianuk✅✅✅✅
Vietnamesevi✅❌❌❌

Tags: