Machine learning

T5（Text-to-Text Transfer Transformer）

T5 是由 Google Brain 的 Raffel 等人在 2020 年推出、发表于《机器学习研究期刊》（Journal of Machine Learning Research, Vol. 21, No. 140）的一个统一序列到序列深度学习框架。它将所有自然语言处理（NLP）任务——分类、翻译、摘要、问答等——重构为文本到文本问题：输入和输出始终是字符序列，使得单个编码器-解码器 Transformer 能够预训练一次，然后以一致的接口跨任务进行微调。T5 引入了跨度损坏预训练（span-corruption pre-training）和 C4 语料库，其最大的变体（110亿参数）在发布时横跨广泛的 NLP 基准测试取得了最先进的结果。

在 MethodMind 中打开即将推出视频即将推出Download slides

阅读完整方法

仅限会员

使用免费账户登录即可阅读本节。

Method map

The neighbourhood of related methods — select a node to explore.

T5（Text-to-Text Transfer Transformer）

注意力机制迁移学习

来源

Raffel, C., Shazeer, N., Roberts, A., Lee, K., Narang, S., Matena, M., Zhou, Y., Li, W., & Liu, P. J. (2020). Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer. Journal of Machine Learning Research, 21(140), 1–67. link ↗
Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A. N., Kaiser, Ł., & Polosukhin, I. (2017). Attention is All You Need. Advances in Neural Information Processing Systems, 30. link ↗
Devlin, J., Chang, M.-W., Lee, K., & Toutanova, K. (2019). BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. Proceedings of NAACL-HLT 2019, 4171–4186. link ↗

如何引用本页

ScholarGate. (2026, June 3). T5: Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer. ScholarGate. https://scholargate.app/zh/deep-learning/t5

Which method?

Set this method beside its closest kin and read them side by side — the library lays the books on the table; the choice is yours.

注意力机制深度学习↔ compare
迁移学习机器学习↔ compare

Compare side by side →

发现本页有问题？报告或提出修改建议 →

阅读完整方法

Method map

来源

如何引用本页

相关方法

Which method?