Methoden vergelijken
Bekijk de geselecteerde methoden naast elkaar; rijen die verschillen zijn gemarkeerd.
| Domein-adaptieve tekstsamenvatting× | Domein-adaptieve BERT-gebaseerde Classificatie× | |
|---|---|---|
| Vakgebied | Deep learning | Deep learning |
| Familie | Machine learning | Machine learning |
| Jaar van ontstaan≠ | 2019–2021 | 2019–2020 |
| Grondlegger≠ | Multiple contributors; domain adaptation methods consolidated via transformer-era NLP (c. 2019–2021) | Gururangan et al. (2020); earlier domain-specific instances include Lee et al. (2020) — BioBERT |
| Type≠ | Domain adaptation of sequence-to-sequence neural summarization | Domain-adaptive pre-training followed by supervised fine-tuning |
| Oorspronkelijke bron≠ | Fabbri, A. R., KryŜiński, W., McCann, B., Xiong, C., Socher, R., & Radev, D. (2021). SummEval: Re-evaluating Summarization Evaluation. Transactions of the Association for Computational Linguistics, 9, 391–409. DOI ↗ | Gururangan, S., Marasovic, A., Swayamdipta, S., Lo, K., Beltagy, I., Downey, D., & Smith, N. A. (2020). Don't Stop Pretraining: Adapt Language Models to Domains and Tasks. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics (ACL 2020), 8342–8360. DOI ↗ |
| Aliassen | domain-adapted summarization, domain-specific summarization, cross-domain summarization, DA-summarization | DAPT BERT classification, domain-adaptive pre-training, domain-specific BERT fine-tuning, BERT DAPT |
| Verwant | 6 | 6 |
| Samenvatting≠ | Domain-adaptive text summarization fine-tunes or adapts a pre-trained sequence-to-sequence language model on a target domain corpus so that summaries conform to domain-specific vocabulary, style, and factual constraints. It bridges the gap between general-purpose summarization models trained on news or web data and specialized domains such as biomedical literature, legal documents, scientific papers, or financial reports. | Domain-adaptive BERT-based classification extends the standard fine-tuning pipeline by first continuing BERT's masked-language-model pre-training on a large corpus of in-domain unlabeled text, then fine-tuning the adapted model on labeled examples for the target classification task. This two-stage approach closes the vocabulary and distributional gap between BERT's general pre-training corpus and specialized domains such as biomedicine, law, finance, or social-media text. |
| ScholarGateGegevensset ↗ |
|
|