How to Implement Multiple Languages
You follow the same process to create a dataset and model that supports a specific language, but you pass in different parameters.
There are two new API parameters that enable multilanguage support:
language
algorithm
You use the language
parameter when you create a dataset. The text in the dataset source file (.csv, .tsv, or .json) should be in one of the supported languages. Then, when you make the API call to create the dataset, pass in the language
parameter that contains the value of one of the supported languages. If you don’t pass the parameter, the dataset language defaults to English (en_US).
Multilanguage support is currently available only for Einstein Intent datasets and models.
Multilanguage support is currently available only for Einstein Intent datasets and models." }
After you create the dataset, you use the algorithm
parameter when you train a dataset to create a model. This optional parameter specifies the algorithm used to train the dataset. Valid values are:
intent
—Uses the original training algorithm. The training process defaults to this algorithm if thealgorithm
parameter isn’t supplied.multilingual-intent
—Uses the training algorithm that supports multiple languages. A model created using this algorithm returns one of the model labels, even if the text sent for prediction doesn’t fall into any of the labels.multilingual-intent-ood
—Uses the training algorithm that supports multiple languages and handles out-of-domain text. A model created using this algorithm returns an emptyprobabilities
array when text sent for prediction doesn’t fall into one of the labels.
Even if your dataset language is English, we recommend that you use either the multilingual-intent
or multilingual-intent-ood
algorithm for better results.