Product descriptions are written in a variety of languages, technical terms, abbreviations, synonyms, and writing styles, which significantly reduces the efficiency of data handling.
The challenge is to reduce the variance of the data and increase its consistency, which will enable efficient data handling.
The Formalizer's role is to transform raw product descriptions, in any language or structure, into more uniform, consistent descriptions.
The Formalizer will be used as the initial step of product data handling, before any of the other tools are applied.
The Formalizer technology includes, among others, tokenization, spelling correction, fuzzy matching, and lexical matching.
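As an illustration only, the techniques listed above could be combined roughly as in the following Python sketch. It is not the Formalizer's actual implementation: the vocabulary, the lexicon, the similarity cutoff, and the function names (tokenize, normalize_token, formalize) are all hypothetical, and the standard-library difflib module stands in for whatever fuzzy matcher the real tool uses.

```python
import difflib
import re

# Hypothetical canonical vocabulary; a real system would use a domain lexicon.
VOCABULARY = ["stainless", "steel", "bolt", "hexagonal", "millimeter"]

# Hypothetical lexical-match table: abbreviations and synonyms -> canonical terms.
LEXICON = {
    "ss": "stainless steel",
    "inox": "stainless steel",
    "hex": "hexagonal",
    "mm": "millimeter",
}


def tokenize(description):
    """Lowercase the text and split it into alphabetic and numeric tokens."""
    return re.findall(r"[a-z]+|\d+(?:\.\d+)?", description.lower())


def normalize_token(token):
    """Map one token to its canonical form."""
    # Lexical match: exact lookup of known abbreviations and synonyms.
    if token in LEXICON:
        return LEXICON[token]
    # Numbers and already-canonical words pass through unchanged.
    if not token.isalpha() or token in VOCABULARY:
        return token
    # Fuzzy match as a simple spelling corrector (cutoff 0.8 is an assumption).
    matches = difflib.get_close_matches(token, VOCABULARY, n=1, cutoff=0.8)
    return matches[0] if matches else token


def formalize(description):
    """Turn a raw description into a uniform, space-separated description."""
    return " ".join(normalize_token(t) for t in tokenize(description))


if __name__ == "__main__":
    for raw in ("Stainles Steel Hexagonl Bolt, 10mm", "SS hex bolt 10 MM"):
        print(formalize(raw))
```

In this sketch both sample inputs formalize to the same string, "stainless steel hexagonal bolt 10 millimeter", which is the kind of low-variance output described here. The lexical match runs before the fuzzy match so that short abbreviations such as "ss" are expanded exactly rather than being "corrected" toward an unrelated word.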
The formalization deliverables are product descriptions with low variance and high consistency.