Skip to content

Comments

Support newer Transformers versions.#58

Open
w1ida wants to merge 1 commit intoapple:mainfrom
w1ida:remove-deprecated-imports-from-transformers
Open

Support newer Transformers versions.#58
w1ida wants to merge 1 commit intoapple:mainfrom
w1ida:remove-deprecated-imports-from-transformers

Conversation

@w1ida
Copy link

@w1ida w1ida commented Dec 9, 2025

Removed model forward docstring decorators and related imports from the patched Llama, Mistral, Qwen2, Phi-3, and Gemma2 shims to avoid relying on utilities missing in newer Transformers versions.

@w1ida
Copy link
Author

w1ida commented Dec 16, 2025

Hi @philkr @federicobucchi @rwebb @shahmishal
Just wanted to check if there’s any update on this PR — the changes should be ready to merge if there’s nothing else needed.

Thanks! 🙏

@NanoCode012
Copy link

@w1ida , noticed that the devs are busy, so we've been maintaining a separate fork here which works with newer transformers and also includes a lot of recent model arch too https://github.com/axolotl-ai-cloud/ml-cross-entropy

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants