Conversation

Copilot AI commented Nov 28, 2025

Adds a mini-transformer example demonstrating how to build and train transformer models in the framework with JAX sharding support.

Changes

  • State management: Moved the step counter from s["trainer"]["step"] to s["step"], promoting it to the experiment level
  • New layer primitives:
    • SkipConnection - residual connections with a configurable combiner function
    • Repeated - sequential repetition of a layer, with independent parameters per instance
    • Unembedding - projects from the hidden dimension to vocabulary logits
    • RoPE - rotary position embeddings (a reference sketch of the rotation follows the usage example below)
  • Enhanced layer attributes: Added param_dtype, param_sharding, and out_sharding for mixed precision and distributed training (see the sketch after the usage example)
  • FrozenDict: Added __eq__ and __len__ methods for proper dict-like behavior (a minimal sketch follows this list)
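
A minimal sketch of the two added methods, assuming FrozenDict wraps an internal dict (the _data attribute is hypothetical; the actual julax implementation may differ):

class FrozenDict:
    """Immutable mapping; only the two new dunder methods are sketched."""

    def __init__(self, data):
        self._data = dict(data)  # hypothetical internal storage

    def __eq__(self, other):
        # Compare by contents, against both FrozenDict and plain dict
        if isinstance(other, FrozenDict):
            return self._data == other._data
        if isinstance(other, dict):
            return self._data == other
        return NotImplemented

    def __len__(self):
        # Number of top-level entries, matching dict semantics
        return len(self._data)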

Example usage

from julax.layers import (
    Attention,
    Chain,
    Embedding,
    LayerNorm,
    Repeated,
    SkipConnection,
    Unembedding,
)

vocab_size = 32_000  # example vocabulary size

# One pre-norm block: LayerNorm then attention, wrapped in a residual connection
transformer_block = SkipConnection(
    layer=Chain(layers=[
        LayerNorm(dim=512),
        Attention(...),
    ])
)

model = Chain(layers=[
    Embedding(in_dim=vocab_size, out_dim=512),
    Repeated(n=6, layer=transformer_block),  # six blocks, independent parameters each
    Unembedding(in_dim=512, out_dim=vocab_size),
])
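
As a hedged illustration of the new layer attributes, here is a sketch of a Linear layer configured for mixed precision and sharding. The keyword names come from the changes list above, but their placement on Linear and the use of jax.sharding.PartitionSpec values are assumptions:

import jax.numpy as jnp
from jax.sharding import PartitionSpec as P
from julax.layers import Linear

mlp_in = Linear(
    in_dim=512,
    out_dim=2048,
    param_dtype=jnp.bfloat16,         # assumed: store parameters in bfloat16
    param_sharding=P(None, "model"),  # assumed: shard the weight over a "model" mesh axis
    out_sharding=P("data", None),     # assumed: shard activations over a "data" mesh axis
)

The RoPE primitive does not appear in the snippet above, so here is a standalone reference sketch of the underlying rotation in plain jax.numpy. The rotate-half pairing and the (seq_len, num_heads, head_dim) layout are assumptions, not necessarily julax's RoPE API:

def rope(x, base=10000.0):
    # x: (seq_len, num_heads, head_dim); head_dim must be even
    seq_len, _, head_dim = x.shape
    half = head_dim // 2
    freqs = base ** (-jnp.arange(half) / half)               # theta_i = base^(-2i/d)
    angles = jnp.arange(seq_len)[:, None] * freqs[None, :]   # (seq_len, half)
    cos = jnp.cos(angles)[:, None, :]                        # broadcast over heads
    sin = jnp.sin(angles)[:, None, :]
    x1, x2 = x[..., :half], x[..., half:]
    # Rotate each (x1_i, x2_i) pair by its position-dependent angle
    return jnp.concatenate([x1 * cos - x2 * sin, x1 * sin + x2 * cos], axis=-1)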

Copilot AI changed the title from "[WIP] Add a mini-transformer example" to "Add a mini-transformer example" on Nov 28, 2025
Copilot AI requested a review from findmyway November 28, 2025 14:09
@findmyway closed this on Nov 28, 2025