Load a SpeechBrain-fine-tuned HuggingFace model checkpoint with the HuggingFace interface #2375
This can be used as a discussion. I thought a bit more about this issue. I am not sure that this feature is 100% aligned with the SpeechBrain methodology, as the Pretrainer is what SpeechBrain expects to be used for loading a pre-trained model. The HuggingFace interface is an outlier only because HuggingFace provides its own loading mechanism AND its models are massively used. Bypassing the Pretrainer for a ckpt originating from a 100% SpeechBrain fine-tuning goes against the logic of using the Pretrainer to load a model. But it's also counter-intuitive to not simply give the path of the new fine-tuned model to the HuggingFace interface... @mravanelli @Adel-Moumen @Gastron what do you think?
🚀 The feature
Currently, to achieve point (2), a `Pretrainer` object is necessary; it cannot be done directly with the `HuggingFaceWav2Vec2` object. It would be nice if the `HuggingFaceWav2Vec2` object could be used to load the fine-tuned model as well, as sketched below.
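For illustration, a minimal sketch of what the requested usage might look like, assuming the 0.5.x module layout and a made-up checkpoint path; this is the desired behaviour, not something the interface currently supports:

```python
from speechbrain.lobes.models.huggingface_wav2vec import HuggingFaceWav2Vec2

# Desired (currently unsupported) behaviour: point the HuggingFace interface
# directly at the checkpoint produced by a SpeechBrain fine-tuning run.
wav2vec2 = HuggingFaceWav2Vec2(
    source="results/wav2vec2_finetuned/save/CKPT+latest/wav2vec2.ckpt",  # hypothetical path
    save_path="pretrained_models/",
)
```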
Solution outline
To achieve this, a `Pretrainer` object is necessary (example below using a wav2vec2 model):
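A minimal sketch of how such a `Pretrainer` could be declared, written in Python rather than the usual YAML hparams file and assuming the 0.5.x API; the base model name and checkpoint paths are assumptions:

```python
from speechbrain.lobes.models.huggingface_wav2vec import HuggingFaceWav2Vec2
from speechbrain.utils.parameter_transfer import Pretrainer

# Build the wav2vec2 lobe as usual (this loads the original HuggingFace weights).
wav2vec2 = HuggingFaceWav2Vec2(
    source="facebook/wav2vec2-large-lv60",  # assumed base model
    save_path="pretrained_models/",
)

# Map the loadable module to the SpeechBrain fine-tuned checkpoint file.
pretrainer = Pretrainer(
    collect_in="pretrained_checkpoints/",
    loadables={"wav2vec2": wav2vec2},
    paths={"wav2vec2": "results/wav2vec2_finetuned/save/CKPT+latest/wav2vec2.ckpt"},  # assumed path
)
```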
Along with its usage in the Python recipe file once the brain (`asr_brain` here) has been instantiated:
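A minimal sketch of what that usage could look like, continuing the assumed `pretrainer` from the sketch above and the usual `run_opts` dictionary passed to the recipe (again assuming the 0.5.x API):

```python
from speechbrain.utils.distributed import run_on_main

# asr_brain has been instantiated above; now collect the fine-tuned checkpoint
# files on the main process and load them into the mapped modules.
run_on_main(pretrainer.collect_files)
pretrainer.load_collected(device=run_opts["device"])
```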
Additional context
No response