Longformer / BigBird
Transfoma za mawimbi marefu kama vile Longformer (Beltagy, Peters & Cohan, 2020) na BigBird (Zaheer et al., 2020) huchukua nafasi ya umakini wa O(n²) wa Transfoma sanifu na ruwaza za umakini zilizotawanyika ambazo huongezeka kwa mstari, O(n), kulingana na urefu wa mfuatano. Hii huwezesha modeli moja kushughulikia maelfu ya tokeni — hati kamili, maandishi ya kisheria, au mfuatano wa vinasaba — ambazo hazingefaa Transfoma ya kawaida.
Soma mbinu kamili
Ingia kwa akaunti ya bure ili kusoma sehemu hii.
Method map
The neighbourhood of related methods — select a node to explore.
Vyanzo
Jinsi ya kunukuu ukurasa huu
ScholarGate. (2026, June 1). Long-Sequence Transformers with Sparse Attention (Longformer / BigBird). ScholarGate. https://scholargate.app/sw/deep-learning/longformer-bigbird
Which method?
Set this method beside its closest kin and read them side by side — the library lays the books on the table; the choice is yours.
- Mtandao wa Makini wa GrafuUjifunzaji wa Kina↔ compare
- Mchanganyiko wa WataalamuUjifunzaji wa Kina↔ compare
- Msitu NasibuUjifunzaji wa Mashine↔ compare
- XGBoostUjifunzaji wa Mashine↔ compare
Imerejelewa na
Umeona tatizo kwenye ukurasa huu? Ripoti au pendekeza marekebisho →