So sánh phương pháp

Xem các phương pháp đã chọn cạnh nhau; những hàng khác biệt được làm nổi bật.

	Phân loại dựa trên RoBERTa Đa phương thức ×	Nhúng câu (Sentence Embeddings)×
Lĩnh vực	Học sâu	Học sâu
Họ	Machine learning	Machine learning
Năm ra đời≠	2019–2020	2015–2019
Người khởi xướng≠	Liu et al. (RoBERTa); multimodal extension by community	Kiros et al. (Skip-Thought, 2015); Reimers & Gurevych (Sentence-BERT, 2019)
Loại≠	Multimodal text + auxiliary feature classification	Representation learning / embedding
Công trình gốc≠	Liu, Y., Ott, M., Goyal, N., Du, J., Joshi, M., Chen, D., Levy, O., Lewis, M., Zettlemoyer, L., & Stoyanov, V. (2019). RoBERTa: A Robustly Optimized BERT Pretraining Approach. arXiv preprint arXiv:1907.11692. link ↗	Reimers, N., & Gurevych, I. (2019). Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks. Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing (EMNLP), 3980–3990. DOI ↗
Tên gọi khác	Multimodal RoBERTa, RoBERTa multimodal classifier, cross-modal RoBERTa classification, MM-RoBERTa	sentence vectors, sentence representations, SBERT, semantic sentence encoding
Liên quan≠	6	4
Tóm tắt≠	Multimodal RoBERTa-based Classification combines the RoBERTa transformer encoder — a robustly optimised variant of BERT — with auxiliary modalities such as images, structured metadata, or tabular features. The fused representation is passed to a classification head, allowing the model to leverage both rich language understanding and non-textual signals simultaneously.	Sentence Embeddings convert a sentence or short text into a single fixed-length dense vector that captures its semantic meaning. These vectors allow downstream tasks — semantic similarity, clustering, retrieval, and classification — to operate on numerical representations instead of raw text, making them one of the most versatile building blocks in modern NLP pipelines.
ScholarGateBộ dữ liệu ↗	v1 2 Nguồn tài liệu PUBLISHED	v1 2 Nguồn tài liệu PUBLISHED

Đến trang tìm kiếm → Tải xuống bản trình chiếu