Method comparison
View the selected methods side by side; rows that differ are flagged with ≠.
| | Domain-adaptive text summarization | Domain-adaptive BERT-based classification |
|---|---|---|
| Field | Deep learning | Deep learning |
| Method family | Machine learning | Machine learning |
| Year of origin ≠ | 2019–2021 | 2019–2020 |
| Proposed by ≠ | Multiple contributors; domain adaptation methods consolidated via transformer-era NLP (c. 2019–2021) | Gururangan et al. (2020); earlier domain-specific instances include BioBERT (Lee et al., 2020) |
| Type ≠ | Domain adaptation of sequence-to-sequence neural summarization | Domain-adaptive pre-training followed by supervised fine-tuning |
| Seminal work ≠ | Fabbri, A. R., Kryściński, W., McCann, B., Xiong, C., Socher, R., & Radev, D. (2021). SummEval: Re-evaluating Summarization Evaluation. Transactions of the Association for Computational Linguistics, 9, 391–409. | Gururangan, S., Marasović, A., Swayamdipta, S., Lo, K., Beltagy, I., Downey, D., & Smith, N. A. (2020). Don't Stop Pretraining: Adapt Language Models to Domains and Tasks. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics (ACL 2020), 8342–8360. |
| Aliases | domain-adapted summarization, domain-specific summarization, cross-domain summarization, DA-summarization | DAPT BERT classification, domain-adaptive pre-training, domain-specific BERT fine-tuning, BERT DAPT |
| Related | 6 | 6 |
| Summary ≠ | Domain-adaptive text summarization fine-tunes or adapts a pre-trained sequence-to-sequence language model on a target domain corpus so that summaries conform to domain-specific vocabulary, style, and factual constraints. It bridges the gap between general-purpose summarization models trained on news or web data and specialized domains such as biomedical literature, legal documents, scientific papers, or financial reports. | Domain-adaptive BERT-based classification extends the standard fine-tuning pipeline by first continuing BERT's masked-language-model pre-training on a large corpus of in-domain unlabeled text, then fine-tuning the adapted model on labeled examples for the target classification task. This two-stage approach closes the vocabulary and distributional gap between BERT's general pre-training corpus and specialized domains such as biomedicine, law, finance, or social-media text. |
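
The first stage named in the classification summary, continued masked-language-model pre-training on unlabeled in-domain text, relies on turning raw sentences into prediction targets. The sketch below illustrates one common way to do that, assuming BERT's published 15% selection rate and 80/10/10 replacement split. The function name `mask_tokens`, the whitespace tokenization, the per-position sampling, and the example sentence are illustrative assumptions, not part of either method's reference implementation.

```python
import random

MASK = "[MASK]"


def mask_tokens(tokens, vocab, rng, mask_prob=0.15):
    """Build one masked-language-model training example (illustrative helper).

    Returns (inputs, labels). labels[i] holds the original token at each
    selected position and None elsewhere, so a loss would be computed only
    on the selected positions.
    """
    inputs, labels = list(tokens), [None] * len(tokens)
    for i, tok in enumerate(tokens):
        if rng.random() >= mask_prob:
            continue  # position not selected for prediction
        labels[i] = tok
        roll = rng.random()
        if roll < 0.8:
            inputs[i] = MASK               # 80%: replace with the mask token
        elif roll < 0.9:
            inputs[i] = rng.choice(vocab)  # 10%: replace with a random token
        # remaining 10%: leave the original token in place
    return inputs, labels


# Hypothetical in-domain (biomedical) sentence, whitespace-tokenized for brevity.
sentence = "the patient was treated with metformin for type 2 diabetes".split()
vocab = sorted(set(sentence))
inputs, labels = mask_tokens(sentence, vocab, random.Random(0))
print(inputs)
print(labels)
```

In a real pipeline the model would be trained to recover `labels` from `inputs` over the whole in-domain corpus, and the resulting weights would then be fine-tuned on the labeled classification data.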