Comparison of available intentizer types

When creating a new NLU you can select different types of Intentizer models. Please note that the Complex model requires a special feature enabled on company level. Contact your system admin to enable this feature.

FeatureSimpleComplexLLMSemantic Search
SummaryLightweight, universal model, multilingualHeavy model, mainly for bankingClassification based on LLMIntent detection using modern embedding algorithms
DomainUniversalFinancesUniversalUniversal
Supported languagesMultilingual: 16 languages (Arabic, Simplified Chinese, Traditional Chinese, English, French, German, Italian, Japanese, Korean, Dutch, Polish, Portuguese, Thai, Turkish, Russian, Spanish)Mainly PolishMultilingual, as defined by the LLM model card. For the default ChatGPT 4o-mini these include: English, Spanish, French, German, Italian, Portuguese, Polish, Dutch, Russian, Ukrainian, Albanian, Basque, Bosnian, Bulgarian, Catalan, Corsican, Croatian, Czech, Danish, Estonian, Finnish, Galician, Greek, Hungarian, Icelandic, Latvian, Lithuanian, Luxembourgish, Macedonian, Maltese, Norwegian, Occitan, Romanian, Serbian, Slovak, Slovenian, Swedish, Turkish, Welsh, YiddishMultilingual, with rough support for 50+ languages. In practice it works best for major European languages and common global languages such as English, Polish, German, French, Spanish, Italian, Portuguese, Dutch, Russian, Turkish, Arabic, Chinese, Japanese, and Korean.
Support for multi-intentions
Classification quality (1)⭐⭐⭐⭐⭐⭐⭐⭐⭐⭐⭐⭐⭐⭐⭐⭐⭐⭐⭐⭐
Classification performance (2)⭐⭐⭐⭐⭐⭐⭐⭐⭐⭐⭐⭐⭐⭐⭐⭐
Model training time (3)⭐⭐⭐⭐⭐⭐⭐⭐⭐⭐⭐⭐⭐⭐⭐⭐⭐
Size of the resulting model (4)⭐⭐⭐⭐⭐⭐⭐⭐⭐⭐⭐⭐⭐⭐⭐⭐⭐⭐

Preprocessing

ModelPreprocessing
SimpleRemoves non-alphanumeric characters and replaces them with a single space; normalizes repeated spaces to one space; trims spaces from the beginning and end; converts text to lowercase; converts Unicode characters to ASCII equivalents
ComplexConverts all letters in the text to lowercase
LLMNone
Semantic SearchNone

Simple model preprocessing examples

  • What will the weather be like tomorrow?what will the weather be like tomorrow
  • By when do I get a response to my complaint?????by when do i get a response to my complaint
  • sign me up for a doctor tomorrow😁sign me up for a doctor tomorrow

Complex model preprocessing example

  • What will the weather be like tomorrow?what will the weather be like tomorrow ?

Semantic Search preprocessing examples

  • What will the weather be like tomorrow?what will the weather be like tomorrow
  • By when do I get a response to my complaint?????by when do i get a response to my complaint
  • sign me up for a doctor tomorrow😁sign me up for a doctor tomorrow

  1. f1-score measure, simple: 0.89 (precision: 0.92, sensitivity: 0.85), complex: 0.94 (precision: 0.94, sensitivity: 0.93), test set of 2807 samples for 91 classes
  2. simple: 58.82 phrases/s, complex: 47.62 phrases/s, test collection of 2807 samples
  3. simple: 107 s, complex: 25 min, training set of 9825 samples divided into 91 classes of intentions
  4. simple: 350MB (feature extractor) + 1MB / model, complex: 3.5GB / model

How to calculate NLU Complex models limit?

  1. Determine available RedisAI storage in GB (RAS).
  2. Take floor of RAS / 3.5 GB.
  3. Result is the maximum number of NLU Complex models that can be stored in RedisAI.
  4. Remember to leave some storage for Simple models.