text preprocessing
[/tɛkst ˌpriːˈprɒsɛsɪŋ/]
noun
pré-processamento de texto
1. The process of cleaning, normalizing, and preparing raw text data for analysis, machine learning, or natural language processing by removing irrelevant information and standardizing format
Text preprocessing includes tokenization, removing stopwords, and converting text to lowercase before training a machine learning model.
O pré-processamento de texto inclui tokenização, remoção de palavras-chave e conversão do texto para minúsculas antes de treinar um modelo de aprendizado de máquina.
2. A computational step in natural language processing that transforms unstructured text into a standardized, machine-readable format
Effective text preprocessing can significantly improve the accuracy of sentiment analysis algorithms.
Um pré-processamento de texto eficaz pode melhorar significativamente a precisão dos algoritmos de análise de sentimento.
Text preprocessing is a fundamental concept in computer science and artificial intelligence that emerged prominently with the rise of machine learning and NLP in the 2010s. The term is used identically in both Brazil and the USA within technical and academic communities, reflecting the globalized nature of technology terminology.
Look up more words on Fala2Me
The free English-Portuguese dictionary with real Brazilian accents, NYC slang, conjugator and more
Open Fala2Me →