Text Classification — Text Categorization
Text classification, also called text categorization, is a supervised natural-language-processing task that automatically assigns documents to predefined categories. A classifier learns the mapping from labelled examples, which supports applications such as spam detection and topic classification. The support-vector-machine approach to text categorization was established by Joachims (1998), and the wider family of methods is consolidated in the text-mining literature by Aggarwal and Zhai (2012).
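As a rough illustration of learning categories from labelled examples, the sketch below trains a tiny spam/ham classifier in plain Python. It is a toy under stated assumptions: it uses multinomial Naive Bayes with Laplace smoothing, not the support vector machine of Joachims (1998), and the training sentences, the labels, and the names `tokenize`, `train`, and `predict` are hypothetical, chosen only for this example.

```python
import math
from collections import Counter, defaultdict

# Hypothetical labelled examples; a real corpus would be far larger.
TRAINING_DATA = [
    ("win a free prize now", "spam"),
    ("free money claim your prize", "spam"),
    ("meeting agenda for monday", "ham"),
    ("please review the project report", "ham"),
]


def tokenize(text):
    """Lowercase and split on whitespace (a deliberately crude tokenizer)."""
    return text.lower().split()


def train(labelled_docs):
    """Count documents per label and words per label (bag-of-words)."""
    label_counts = Counter()
    word_counts = defaultdict(Counter)
    vocab = set()
    for text, label in labelled_docs:
        label_counts[label] += 1
        for token in tokenize(text):
            word_counts[label][token] += 1
            vocab.add(token)
    return label_counts, word_counts, vocab


def predict(model, text):
    """Return the label with the highest log-posterior for the given text."""
    label_counts, word_counts, vocab = model
    total_docs = sum(label_counts.values())
    best_label, best_score = None, -math.inf
    for label in sorted(label_counts):
        # Log prior: fraction of training documents carrying this label.
        score = math.log(label_counts[label] / total_docs)
        total_words = sum(word_counts[label].values())
        for token in tokenize(text):
            if token not in vocab:
                continue  # ignore words never seen during training
            # Laplace (add-one) smoothed log likelihood of the token.
            count = word_counts[label][token] + 1
            score += math.log(count / (total_words + len(vocab)))
        if score > best_score:
            best_label, best_score = label, score
    return best_label


model = train(TRAINING_DATA)
print(predict(model, "claim your free prize"))
print(predict(model, "project meeting on monday"))
```

The same train-then-predict shape carries over to other supervised classifiers; only the model fitted inside `train` and the scoring rule inside `predict` would change.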
Sources
- Joachims, T. (1998). Text Categorization with Support Vector Machines: Learning with Many Relevant Features. ECML 1998. Lecture Notes in Computer Science, vol. 1398. Springer. DOI: 10.1007/BFb0026683
- Aggarwal, C. C. & Zhai, C. (2012). Mining Text Data. Springer. ISBN: 978-1-4614-3222-7
Related methods
Referenced by
- Argument Mining
- Aspect-Based Sentiment Analysis
- Authorship Attribution
- Automated Essay Scoring
- Automatic Text Evaluation
- Clinical Text Mining
- Content Analysis
- Contrastive Learning for NLP
- Cross-lingual Text Analysis
- Dialogue Act Classification
- Discourse Parsing
- Doc2Vec
- Domain Adaptation
- Emotion Detection
- Event Detection
- Explainable LDA Topic Model
- Fake News Detection
- Few-Shot Text Classification
- Gender Bias Detection
- Hallucination Detection
- Hate Speech Detection
- Implicit Sentiment Analysis
- Intent Detection
- Language Identification
- Linguistic Acceptability Assessment
- Machine Reading Comprehension
- Multi-Document Summarization
- N-gram Language Model
- Named Entity Recognition
- Opinion Mining
- Paraphrase Detection
- Prompt Engineering
- Propaganda Detection
- Question Answering
- Readability Analysis
- Relation Extraction
- Self-supervised Sentiment Analysis
- Semantic Parsing
- Sentiment Analysis
- Slot Filling
- Social Media NLP
- Speculation Detection
- Stance Detection
- Subjectivity Detection
- Text Coherence Scoring
- Text Deduplication
- Text Infilling
- Text Regression
- Textual Entailment
- TF-IDF
- Timeline Extraction
- Word2Vec
- Zero-Shot Classification