Machine learningDeep learning / NLP / CV
Multimodal Object Detection
Multimodal object detection extends single-modality object detectors by jointly processing signals from multiple sensor types — such as RGB cameras, depth sensors, LiDAR, radar, or text descriptions — to localize and classify objects with higher accuracy and robustness than any single modality alone. Fusion of complementary information is the core design principle.
MethodMind'de açSoonVideoSoon
Tam yöntemi oku
Members only
Sign inSign in with a free account to read this section.
Sources
- Liu, Y., Zhang, F., Li, Y., & Lv, H. (2022). Multimodal Object Detection via Bayesian Fusion. IEEE Transactions on Image Processing, 31, 5953–5965. DOI: 10.1109/TIP.2022.3204252 ↗
- Object detection. Wikipedia. link ↗