Match each text preprocessing technique with its explanation.
Text Classification
Preprocessing techniques
Match the items from left and right columns
One-Hot encoding
Count-encoding
TF-IDF vectorization
Word2Vec or FastText vectors
Pretrained vectors from a language model
Trained vectors from scratch
Generates word embeddings based on contextual information
Develops custom embeddings trained on a specific corpus
Represents token presence in the text using a binary vector with vocabulary size
Utilizes embeddings trained using language modeling techniques
Indicates token frequency in the text using a vector
Creates a vector with weighted token frequencies within the text and corpus
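The three vector-based descriptions above (binary presence, raw frequency, and frequency weighted across the corpus) can be compared side by side in a minimal sketch. The toy corpus, the function names, and the unsmoothed weighting `tf * log(N / df)` are assumptions chosen for illustration; libraries typically apply smoothed or normalized variants, so their numbers will differ.

```python
import math
from collections import Counter

# Toy corpus, invented for illustration only.
corpus = [
    "the cat sat on the mat",
    "the dog sat",
    "cats and dogs",
]

# Whitespace tokenization and a sorted vocabulary shared by all vectors.
docs = [doc.split() for doc in corpus]
vocab = sorted({tok for doc in docs for tok in doc})


def presence_vector(tokens):
    """Binary vector of vocabulary size: 1 if the term occurs, else 0."""
    present = set(tokens)
    return [1 if term in present else 0 for term in vocab]


def count_vector(tokens):
    """Vector of raw term counts for the given text."""
    counts = Counter(tokens)
    return [counts[term] for term in vocab]


def idf(term):
    """Unsmoothed inverse document frequency: log(N / df)."""
    df = sum(1 for doc in docs if term in doc)
    return math.log(len(docs) / df)


def weighted_vector(tokens):
    """Term frequency within the text, scaled by rarity across the corpus."""
    counts = Counter(tokens)
    return [counts[term] / len(tokens) * idf(term) for term in vocab]


print(vocab)
print(presence_vector(docs[0]))  # [0, 1, 0, 0, 0, 1, 1, 1, 1]
print(count_vector(docs[0]))     # [0, 1, 0, 0, 0, 1, 1, 1, 2]
print(weighted_vector(docs[0]))
```

In the first text, "the" appears twice, so only the count vector and the weighted vector distinguish it from the other words; the weighted vector additionally lowers its score because it also appears in another text of the corpus.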