Comparison of available intentizer types
When creating a new NLU you can select different types of intentizer models. Please note that complex model requires special feature enabled on company level. Contact your system admin to enable this feature.
| Simple | Complex | LLM | |
|---|---|---|---|
| Summary | Lightweight, universal model, multilingual | Heavy model, mainly for banking | Classification based on LLM |
| Domain | Universal | Finances | Universal |
| Supported languages | Multilingual - 16 languages (Arabic, Simplified Chinese, Traditional Chinese, English, French, German, Italian, Japanese, Korean, Dutch, Polish, Portuguese, Thai, Turkish, Russian, Spanish) | mainly Polish | Multilingual - as defined by LLM model card. For default ChatGPT 4o-mini those include: English, Spanish, French, German, Italian, Portuguese, Polish, Dutch, Russian, Ukrainian, Albanian, Basque, Bosnian, Bulgarian, Catalan, Corsican, Croatian, Czech, Danish, Estonian, Finnish, Galician, Greek, Hungarian, Icelandic, Latvian, Lithuanian, Luxembourgish, Macedonian, Maltese, Norwegian, Occitan, Romanian, Serbian, Slovak, Slovenian, Swedish, Turkish, Welsh, Yiddish |
| Support for multi-intentions | ❌ | ✅ | ✅ |
| Classification quality (1) | ⭐⭐⭐⭐ | ⭐⭐⭐⭐⭐ | ⭐⭐⭐⭐⭐⭐ |
| Classification performance (2) | ⭐⭐⭐⭐⭐ | ⭐⭐⭐⭐⭐ | ⭐⭐ |
| Model training time(3) | ⭐⭐⭐⭐⭐ | ⭐⭐⭐ | ⭐⭐⭐⭐⭐ |
| Size of the resulting model (4) | ⭐⭐⭐⭐⭐ | ⭐⭐⭐ | ⭐⭐⭐⭐⭐⭐ |
| Preprocessing | This mechanism performs a series of operations on the input text, aimed at clearing it of unwanted characters and normalizing it to a uniform form. The following is a description of each operation:
As a result, the text after preprocessing will contain text normalized to a uniform form, devoid of punctuation marks, multiple spaces and unwanted Unicode characters. Examples:
| This mechanism performs a series of operations on the input text, aimed at clearing it of unwanted characters and normalizing it to a uniform form. The following is a description of each operation:
Examples:
| None |
- f1-score measure, simple - 0.89 (precision - 0.92, sensitivity - 0.85), complex - 0.94 (precision - 0.94, sensitivity - 0.93), test set of 2807 samples for 91 classes
- simple - 58.82 phrases/s, complex - 47.62 phrases/s, test collection of 2807 samples
- simple - 107 s, complex - 25 min, training set of 9825 samples divided into 91 classes of intentions
- simple - 350MB (feature extractor) + 1MB / model, complex - 3,5GB / model
How to calculate NLU Complex models limit?
- Determine available RedisAI's storage in GB (RAS).
- Take floor of RAS / 3.5 GB.
- Result is the maximum number of NLU Complex models that can be stored in RedisAI.
- Remember to leave some storage for Simple models.
Updated 8 days ago
