feat: support frozen embeddings #2
base: main
Conversation
Add embedding configuration options to TrainingConfig and implement loading of pretrained embeddings in Trainer class. Enhance model initialization to support freezing embeddings and ensure dimensional consistency when loading embeddings.
…nedModel instead of AutoModel for improved compatibility with GPT-2 architecture. Ensure consistent handling of input embeddings during initialization.
Pull Request Overview
This PR adds support for loading and optionally freezing pretrained embeddings from Hugging Face models. The implementation allows models to initialize with pretrained embeddings (like GPT-2) and freeze them during training to preserve learned representations.
- Adds embedding configuration options to TrainingConfig (freeze_embeddings, load_pretrained_embeddings, pretrained_model_name); a configuration sketch follows this list
- Implements pretrained embedding loading with dimensional consistency checks in the Trainer class
- Reorders initialization sequence to load data first, ensuring correct vocab_size before model creation
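As a rough sketch of the new configuration surface: the field names below come from the PR description, while the defaults and the surrounding fields are assumptions, not the actual contents of secure_transformer/config.py.

```python
from dataclasses import dataclass


@dataclass
class TrainingConfig:
    # ...existing training fields (learning rate, batch size, etc.)...

    # New embedding-related options added by this PR; defaults are assumed here.
    load_pretrained_embeddings: bool = False  # copy embeddings from a Hugging Face checkpoint
    freeze_embeddings: bool = False           # set requires_grad=False on the embedding table
    pretrained_model_name: str = "gpt2"       # which Hugging Face model to pull embeddings from
```

train_launcher.py then maps the matching keys from the YAML config dict onto these fields.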
Reviewed Changes
Copilot reviewed 4 out of 4 changed files in this pull request and generated 3 comments.
| File | Description |
|---|---|
| train_launcher.py | Maps new embedding configuration parameters from config dict to TrainingConfig |
| secure_transformer/train.py | Implements embedding loading/freezing logic and reorders initialization sequence (sketched below the table) |
| secure_transformer/config.py | Adds new embedding-related configuration fields with defaults |
| configs/gpt2_frozen_config.yaml | Provides example configuration for using frozen GPT-2 embeddings |
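For orientation, here is a hedged sketch of how the loading and dimensional-consistency check in the Trainer might look. This is not the PR's actual code: the helper name, the model's `token_embedding` attribute, and the use of `GPT2LMHeadModel` are assumptions (the commits only say a GPT-2-specific pretrained model class replaced AutoModel).

```python
import torch
from transformers import GPT2LMHeadModel


def load_pretrained_embeddings(model, config) -> None:
    """Copy embeddings from a pretrained Hugging Face model and optionally freeze them."""
    pretrained = GPT2LMHeadModel.from_pretrained(config.pretrained_model_name)
    source = pretrained.get_input_embeddings().weight  # (vocab_size, d_model)
    target = model.token_embedding.weight               # assumed attribute name

    # Dimensional consistency check: this is why the Trainer now loads data first,
    # so vocab_size is known before the model (and its embedding table) is built.
    if source.shape != target.shape:
        raise ValueError(
            f"Embedding shape mismatch: pretrained {tuple(source.shape)} "
            f"vs model {tuple(target.shape)}"
        )

    with torch.no_grad():
        target.copy_(source)

    if config.freeze_embeddings:
        model.token_embedding.weight.requires_grad = False
```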
Comments suppressed due to low confidence (1)
secure_transformer/train.py:76
- [nitpick] The variable name 'trainable_params' could be more descriptive. Consider renaming to 'trainable_parameters' for consistency with the logging message that mentions 'trainable params'.
trainable_params = [p for p in self.model.parameters() if p.requires_grad]
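For context, this line presumably filters out frozen embedding weights before counting and optimizing them. A hedged sketch of the surrounding logic follows; the log format, optimizer choice, and learning_rate field are assumptions.

```python
trainable_params = [p for p in self.model.parameters() if p.requires_grad]
num_trainable = sum(p.numel() for p in trainable_params)
num_total = sum(p.numel() for p in self.model.parameters())
print(f"trainable params: {num_trainable:,} / {num_total:,} total")

# Only requires_grad=True tensors reach the optimizer, so frozen embeddings stay fixed.
optimizer = torch.optim.AdamW(trainable_params, lr=self.config.learning_rate)
```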
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>