Added optional input embeddings to bypass NeoBERT.encoder

by Lolalb - opened Mar 14, 2025

base: refs/heads/main

←

from: refs/pr/8

Discussion Files changed

+19

-7

Added optional input embeddings to bypass NeoBERT.encodere7504ed9

Lolalb

Chandar Research Lab org Mar 14, 2025

No description provided.

Lolalb changed pull request status to merged Mar 17, 2025

saucam

Mar 18, 2025

@Lolalb In your latest change, you have added inputs_embed param to the class, but we also need to update NeoBERTForSequenceClassification as in its forward method we call base model's forward method which is now getting boolean value for the inputs_embed field !

def forward(
        self,
        input_ids: Optional[torch.Tensor] = None,
        position_ids: torch.Tensor = None,
        max_seqlen: int = None,
        cu_seqlens: torch.Tensor = None,
        attention_mask: torch.Tensor = None,
        output_hidden_states: bool = False,
        output_attentions: bool = False,
        labels: Optional[torch.Tensor] = None,
        return_dict: Optional[bool] = None,
    ):

        output = self.model.forward(
            input_ids,
            position_ids,
            max_seqlen,
            cu_seqlens,
            attention_mask,     
            output_hidden_states,             <--- missing  param here !
            output_attentions,
        )

Because of this classification task always fails with

if (input_ids is None) ^ (inputs_embeds is not None):
    raise ValueError("You must specify exactly one of input_ids or inputs_embeds")

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment