Browse SoTA > Natural Language Processing > Text Classification

Text Classification

348 papers with code · Natural Language Processing

Text classification is the task of assigning a sentence or document an appropriate category. The categories depend on the chosen dataset and can range from topics.

( Image credit: Text Classification Algorithms: A Survey )

Benchmarks

TREND DATASET BEST METHOD PAPER TITLE PAPER CODE COMPARE

Greatest papers with code

Adversarial Training Methods for Semi-Supervised Text Classification

25 May 2016tensorflow/models

Adversarial training provides a means of regularizing supervised learning algorithms while virtual adversarial training is able to extend supervised learning algorithms to the semi-supervised setting.

SEMI SUPERVISED TEXT CLASSIFICATION SEMI-SUPERVISED TEXT CLASSIFICATION SENTIMENT ANALYSIS WORD EMBEDDINGS ADVERSARIAL TRAINING

Semi-supervised Sequence Learning

NeurIPS 2015 tensorflow/models

In our experiments, we find that long short term memory recurrent networks after being pretrained with the two approaches are more stable and generalize better.

LANGUAGE MODELLING TEXT CLASSIFICATION

Funnel-Transformer: Filtering out Sequential Redundancy for Efficient Language Processing

NeurIPS 2020 huggingface/transformers

With the success of language pretraining, it is highly desirable to develop more efficient architectures of good scalability that can exploit the abundant unlabeled data at a lower cost.

READING COMPREHENSION TEXT CLASSIFICATION

FlauBERT: Unsupervised Language Model Pre-training for French

LREC 2020 huggingface/transformers

Language models have become a key step to achieve state-of-the art results in many different Natural Language Processing (NLP) tasks.

LANGUAGE MODELLING NATURAL LANGUAGE INFERENCE TEXT CLASSIFICATION WORD SENSE DISAMBIGUATION

XLNet: Generalized Autoregressive Pretraining for Language Understanding

NeurIPS 2019 huggingface/transformers

With the capability of modeling bidirectional contexts, denoising autoencoding based pretraining like BERT achieves better performance than pretraining approaches based on autoregressive language modeling.

DOCUMENT RANKING LANGUAGE MODELLING NATURAL LANGUAGE INFERENCE QUESTION ANSWERING READING COMPREHENSION SEMANTIC TEXTUAL SIMILARITY SENTIMENT ANALYSIS TEXT CLASSIFICATION

FastText.zip: Compressing text classification models

12 Dec 2016facebookresearch/fastText

We consider the problem of producing compact architectures for text classification, such that the full model fits in a limited amount of memory.

QUANTIZATION TEXT CLASSIFICATION WORD EMBEDDINGS

FLAIR: An Easy-to-Use Framework for State-of-the-Art NLP

NAACL 2019 zalandoresearch/flair

We present FLAIR, an NLP framework designed to facilitate training and distribution of state-of-the-art sequence labeling, text classification and language models.

TEXT CLASSIFICATION