Issues
is:issue state:open
[Proposal] Add Cohere2 / Command-A interleaved-attention adapter (Cohere2ForCausalLM)
Labels: complexity-moderate (moderately complicated issues for people who have intermediate experience with the code), good first issue (good for newcomers), help wanted (extra attention is needed), new-architecture (this card involves adding a new architecture), TransformerBridge (bug specific to the new TransformerBridge system)
Status: Open. #1474 in TransformerLensOrg/TransformerLens

[Proposal] Add BD3LM block-diffusion adapter (BD3LM)
Labels: complexity-high (very complicated changes for people to address who are quite familiar with the code), help wanted, new-architecture, TransformerBridge
Status: Open. #1473 in TransformerLensOrg/TransformerLens

[Proposal] Add RecurrentGemma (Griffin) adapter (RecurrentGemmaForCausalLM)
Labels: complexity-high, help wanted, new-architecture, TransformerBridge
Status: Open. #1472 in TransformerLensOrg/TransformerLens

[Proposal] Add HRM-Text two-timescale recurrent adapter (HrmTextForCausalLM)
Labels: complexity-high, help wanted, new-architecture, TransformerBridge
Status: Open. #1471 in TransformerLensOrg/TransformerLens

[Proposal] Add Ouro LoopLM looped-depth adapter (OuroForCausalLM)
Labels: complexity-high, help wanted, new-architecture, TransformerBridge
Status: Open. #1470 in TransformerLensOrg/TransformerLens

[Proposal] Add Raven/Huginn depth-recurrent adapter (RavenForCausalLM)
Labels: complexity-high, help wanted, new-architecture, TransformerBridge
Status: Open. #1469 in TransformerLensOrg/TransformerLens

[Proposal] Add Jamba attention+Mamba interleave adapter (JambaForCausalLM)
Labels: complexity-moderate, help wanted, new-architecture, TransformerBridge
Status: Open. #1468 in TransformerLensOrg/TransformerLens

[Proposal] Add DeepSeek-V4 hybrid-attention MoE adapter (DeepseekV4ForCausalLM)
Labels: complexity-high, new-architecture, TransformerBridge
Status: Open. #1466 in TransformerLensOrg/TransformerLens

[Proposal] Add LLaDA masked-diffusion adapter (LLaDAModelLM)
Labels: complexity-high, help wanted, new-architecture, TransformerBridge
Status: Open. #1465 in TransformerLensOrg/TransformerLens

[Proposal] Add Zamba2 shared-attention hybrid adapter (Zamba2ForCausalLM)
Labels: complexity-moderate, help wanted, new-architecture, TransformerBridge
Status: Open. #1463 in TransformerLensOrg/TransformerLens

[Proposal] Add RWKV-7 "Goose" adapter (RWKV7ForCausalLM)
Labels: complexity-high, help wanted, new-architecture, TransformerBridge
Status: Open. #1462 in TransformerLensOrg/TransformerLens

[Proposal] Add LFM2 dense short-convolution adapter (Lfm2ForCausalLM)
Labels: complexity-moderate, help wanted, new-architecture, TransformerBridge
Status: Open. #1461 in TransformerLensOrg/TransformerLens