This repo is adapted from a branch of OLMo.
Pretraining
Clone this OLMo branch and create an environment with its dependencies via cd OLMo; pip install -e . (if you want to use newer OLMo features, clone from the main branch instead). Then run pip install git+https://github.com/Muennighoff/megablocks.git@olmoe to install the MegaBlocks fork.

Set up a config file. configs/OLMoE-1B-7B-0924.yml was used for the pretraining of OLMoE-1B-7B-0924; configs from various ablations can be found in configs/ablations.
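For reference, the environment setup condensed into one sequence. This is only a sketch; the clone URL is a placeholder since the exact branch link is not given here.

# Minimal setup sketch (replace <OLMO_BRANCH_URL> with the branch you are cloning)
git clone <OLMO_BRANCH_URL> OLMo
cd OLMo
pip install -e .
pip install git+https://github.com/Muennighoff/megablocks.git@olmoe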
Tokenize the pretraining data via the command below and adapt the paths in your training config to point to the tokenized output.

dolma tokens \
--documents ${PATH_TO_DOWNLOADED_DATA} \
--destination ${PATH_WHERE_TO_SAVE_TOKENIZED_DATA} \
--tokenizer.name_or_path 'allenai/gpt-neox-olmo-dolma-v1_5' \
--max_size '2_147_483_648' \
--seed 0 \
--tokenizer.eos_token_id 50279 \
--tokenizer.pad_token_id 1 \
--processes ${NUMBER_OF_CPU_CORES_TO_USE}
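Once tokenization finishes, point the training config at the output. The commands below are only a sketch: the exact file layout under the destination directory and the exact config key names depend on the dolma and OLMo versions in use.

# Inspect the tokenized output (layout may vary by dolma version)
ls ${PATH_WHERE_TO_SAVE_TOKENIZED_DATA}
# Find the data path entries in the training config and repoint them at these files
grep -n "paths" configs/OLMoE-1B-7B-0924.yml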
Run training via

torchrun --nproc_per_node=4 train.py configs/config_mulitlingual.yml
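The command above launches a single-node run with 4 processes. If you are training across several nodes, a launch along the following lines should work; this is a sketch using standard torchrun rendezvous flags, with the node count, rank, job id, and master address as placeholders.

# Hypothetical multi-node launch; run on every node with the appropriate --node_rank
torchrun \
  --nnodes=<NUM_NODES> \
  --nproc_per_node=4 \
  --node_rank=<THIS_NODE_RANK> \
  --rdzv_id=<JOB_ID> \
  --rdzv_backend=c10d \
  --rdzv_endpoint=<MASTER_HOST>:29500 \
  train.py configs/config_mulitlingual.yml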
Run the analysis using analysis.ipynb.
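To open the notebook, assuming Jupyter is available (it is not part of the dependencies listed above, so install it first if needed):

pip install jupyter   # only if Jupyter is not already installed
jupyter notebook analysis.ipynb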