Сравнение методов
Просматривайте выбранные методы рядом; строки с различиями подсвечены.
| Адаптация к домену при автоматическом реферировании текстов× | Адаптация BERT-модели к домену на основе классификации× | |
|---|---|---|
| Область | Глубокое обучение | Глубокое обучение |
| Семейство | Machine learning | Machine learning |
| Год появления≠ | 2019–2021 | 2019–2020 |
| Автор метода≠ | Multiple contributors; domain adaptation methods consolidated via transformer-era NLP (c. 2019–2021) | Gururangan et al. (2020); earlier domain-specific instances include Lee et al. (2020) — BioBERT |
| Тип≠ | Domain adaptation of sequence-to-sequence neural summarization | Domain-adaptive pre-training followed by supervised fine-tuning |
| Основополагающий источник≠ | Fabbri, A. R., KryŜiński, W., McCann, B., Xiong, C., Socher, R., & Radev, D. (2021). SummEval: Re-evaluating Summarization Evaluation. Transactions of the Association for Computational Linguistics, 9, 391–409. DOI ↗ | Gururangan, S., Marasovic, A., Swayamdipta, S., Lo, K., Beltagy, I., Downey, D., & Smith, N. A. (2020). Don't Stop Pretraining: Adapt Language Models to Domains and Tasks. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics (ACL 2020), 8342–8360. DOI ↗ |
| Другие названия | domain-adapted summarization, domain-specific summarization, cross-domain summarization, DA-summarization | DAPT BERT classification, domain-adaptive pre-training, domain-specific BERT fine-tuning, BERT DAPT |
| Связанные | 6 | 6 |
| Сводка≠ | Domain-adaptive text summarization fine-tunes or adapts a pre-trained sequence-to-sequence language model on a target domain corpus so that summaries conform to domain-specific vocabulary, style, and factual constraints. It bridges the gap between general-purpose summarization models trained on news or web data and specialized domains such as biomedical literature, legal documents, scientific papers, or financial reports. | Domain-adaptive BERT-based classification extends the standard fine-tuning pipeline by first continuing BERT's masked-language-model pre-training on a large corpus of in-domain unlabeled text, then fine-tuning the adapted model on labeled examples for the target classification task. This two-stage approach closes the vocabulary and distributional gap between BERT's general pre-training corpus and specialized domains such as biomedicine, law, finance, or social-media text. |
| ScholarGateНабор данных ↗ |
|
|