[WIP] Support Mllama by TimoImhof · Pull Request #777 · adapter-hub/adapters

TimoImhof · 2025-01-06T21:02:57Z

Adresses #773 and adds support for the multimodal LLama 3.2 models. (WIP)

…dapters into dev/test-refactoring

- reorder directory structure to separate testing models and adapter methods (needs further refactoring for separating tests that are extecuted for each model vs tests that are just run once) - remove all model tests except albert for now (will be readded once final design is agreed on) - refactor albert adapter test class to group test mixins in categories which are then displayed accordingly by test viewer - adjust imports to reflect new directory structure

- refactor generate_test into base test class and adapt parallel generate test accordingly - refactor lm head selection into own method - create utils file for utility methods and adapt imports accordingly - remove is_speech_model flags - replace redundant attributes and methods - enforce proper method naming conventions

This reverts commit b390d61.

- Re-add bart adapter method tests - Remove duplicate added tests - In conversion test refactor model specific if else statements into model test class

- deberta - debertav2 - distilbert - electra - encoder-decoder - llama - mbart - mistral - mt5 - plbart - roberta

- introduce parameter to cut down on makefile commands - draw out tests that are not executed on every model into a seperate directory - move the adapter method test implemenations into the method tests directory - rename files for more clarity

…to dev/mllama

- Create Mllama testbase blueprint - fix multiple prefixtuningpools problem by updating Mllama mixins - add blueprint for static model conversion from MllamaForConditionalGeneration - Remove wrong labels argument in MllamaModel

- make style - make quality - Update Mllama test config and get_input_samples()

- Remove ConditionalGenerationMixin for now - Remove decoderlayermxin from mixin_mapping - Add DecoderLayerWithAdapters for bottleneck support - Fix input shape typo in test config - make style & make quality - Next: Find out why output is the same when using bottleneck adapters and when not

…to dev/mllama

- Add post_embedding_forward hook logic and hook registration to the text model - Without it invertible adapters are not invoked although added to the model

- With MllamaCrossAttentionDecoderLayerWithAdapters the normal crossattentionlayer is replaced during adapters.init() and adds the adapters logic - Also redundant mixins are removed form the model_mixin_mapping

TimoImhof and others added 30 commits September 16, 2024 10:10

Make generation tests generic

93bee1a

Merge remote-tracking branch 'origin/main' into dev/test-refactoring

f51cfdb

Draft Refactoring AdapterTestBase

7e65e82

Merge branch 'adapter-hub:main' into dev/test-refactoring

793cbe5

Replace import class names

65c3fb7

Merge branch 'dev/test-refactoring' of https://github.com/TimoImhof/a…

afdcfdd

…dapters into dev/test-refactoring

remove redundant imports

630b722

Add pytest markers and respective pytest commands

0d3577f

Add draft of README

1300856

Fix make quality

83d3b32

Add gpt2 tests

5e8e1b8

Fix config union and head tests

53eb0b9

Fix paths and imports

1dbd412

remove accidently added prompt tuning from gpt2 and make style

cf4f6a7

Revert PromptTuning changes

b390d61

Revert "Revert PromptTuning changes"

2193aee

This reverts commit b390d61.

Re-add missing adapter model tests

f555484

Refactoring:

8dccda2

- Re-add bart adapter method tests - Remove duplicate added tests - In conversion test refactor model specific if else statements into model test class

Introduce generic test creator function

c665948

Re-add beit adapter method tests

fb425b6

Refactor & Re-add bertgeneration and bert

225439c

Re-add clip tests

09f9cdc

Re-add:

7934350

- deberta - debertav2 - distilbert - electra - encoder-decoder - llama - mbart - mistral - mt5 - plbart - roberta

Add more models

5f55935

Re-add whisper

147c8af

Changes:

b2979ce

- introduce parameter to cut down on makefile commands - draw out tests that are not executed on every model into a seperate directory - move the adapter method test implemenations into the method tests directory - rename files for more clarity

Add debug statements and only execute failing test

ffd21a9

Add verbose information

0dba87c

Merge branch 'dev/mllama' of https://github.com/TimoImhof/adapters in…

a122a75

…to dev/mllama

calpt linked an issue Jan 9, 2025 that may be closed by this pull request

Support for Llama-3.2-11B-Vision-Instruct #773

Open

3 tasks

TimoImhof and others added 28 commits January 10, 2025 10:37

Fix import structure

d67692d

Reuse mixin implementations

958b2c6

Create MllamaModel class and adjust mixins accordingly

7507e1e

Re-implement MllamaAdapterModel

d2a28d8

Fix typos

6d39941

Draft adapter attention classes

4c153b4

Progress:

a52154f

- Create Mllama testbase blueprint - fix multiple prefixtuningpools problem by updating Mllama mixins - add blueprint for static model conversion from MllamaForConditionalGeneration - Remove wrong labels argument in MllamaModel

save links for useful resources

2ec0b35

Integrate CLIP into refactored test structure

88f6230

Merge branch 'dev/test-refactoring' into dev/mllama

39878e6

Progress:

a75846a

- make style - make quality - Update Mllama test config and get_input_samples()

Add mllama model tests

7b970c9

Adapt VisionEncoder forward pre hook

40871e5

Merge branch 'main' into dev/mllama

e177ecb

revert merging errors

8a17571

Fix test model config and base model

5abb4ab

Merge branch 'adapter-hub:main' into dev/mllama

5fc7e4b

update test config

08b5ef6

Merge remote-tracking branch 'origin/main' into dev/mllama

1f58977

Update forward context (make style & quality)

3f56276

Merge branch 'dev/mllama' of https://github.com/TimoImhof/adapters in…

90994eb

…to dev/mllama

Adapt adapter head logic from llama

f7ff891

Fix typos & make style

b180267

Fix invertible adapter forward pass:

b633433

- Add post_embedding_forward hook logic and hook registration to the text model - Without it invertible adapters are not invoked although added to the model

Add MllamaCrossAttentionDecoderLayerWithAdapters:

157e142

- With MllamaCrossAttentionDecoderLayerWithAdapters the normal crossattentionlayer is replaced during adapters.init() and adds the adapters logic - Also redundant mixins are removed form the model_mixin_mapping

Merge branch 'main' into dev/mllama

ce27b58

Use _default_init_adapter_methods in model mixin

93bad84

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[WIP] Support Mllama#777

[WIP] Support Mllama#777
TimoImhof wants to merge 81 commits intoadapter-hub:mainfrom
TimoImhof:dev/mllama

TimoImhof commented Jan 6, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

TimoImhof commented Jan 6, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant