Emergent Symbol-like Number Variables in Artificial Neural Networks

This repo contains the code used to generate the analyses in the paper Emergent Symbol-like Number Variables in Artificial Neural Networks (and Model Alignment Search).

🔍 Overview

Many neural analyses focus on static representations, correlational analyses, and sufficient behavioral representations to interpret neural networks (NNs). Many of these analyses, however, can disregard how the activity causally affects the NN's behavior, and they can disregard the necessity of the activity for behavior. Methods like Distributed Alignment Search are designed to causally intervene on the activations so as to causally relate NN activity to behavior through causal interventions while also isolating necessary activation subspaces for the behavior.

🚀 Installation / 📦 Dependencies

You can install the requirements for this repo via pip:

pip install -r requirements.txt

🧠 Key Features

⚙️ Drop-in analysis tools for trained PyTorch models
🔬 Methods to perform DAS using generalized Alignment Functions
🔌 Compatible with custom sequence-based pytorch models and Huggingface models

🧪 Example Usage

You will first need a model to analyze. You can create new models trained on the numeric equivalence tasks by first changing the make_models/make_model_training_file.py script and then running the following:

$ python make_models/make_model_training_file.py
$ bash make_models/run_scripts/gru.py

Once you have a working model, you can run a DAS or MAS experiment on that model by arguing a configuration yaml file to the main script:

$ python main.py configs/general_das_config.yaml

Look in the configs directory for example configuration files.

You can also override configuration settings using comand line arguments:

$ python main.py configs/general_das_config.yaml model_names=models/multiobject_gru/multiobject_gru_0_seed12345

To recreate the experiments used in Emergent Symbol-like Number Variables in Artificial Neural Networks, you can use the scripts located in scripts/das_scripts/ after editing the appropriate path variables in the respective scripts:

$ bash scripts/das_scripts/dispatch_exps.sh

🧑‍🔬 Citation

If you use this repo in your research, please cite:

Satchel Grant, Noah D. Goodman, James L. McClelland (2025). Emergent Symbol-like Number Variables in Artificial Neural Networks. Transactions on Machine Learning Research

BibTex

@article{grant2025alignmentfunctions,
    title={Emergent Symbol-like Number Variables in Artificial Neural Networks}, 
    author={Satchel Grant and Noah D. Goodman and James L. McClelland},
    journal={Transactions on Machine Learning Research},
    year={2025},
    url={https://arxiv.org/abs/2501.06141}, 
}

🙌 Contributing

Contributions, suggestions, and issues are welcome! Open a pull request or file an issue.

📄 License

MIT License. See LICENSE for details.

Name		Name	Last commit message	Last commit date
Latest commit History 368 Commits
analysis/alignment_fns		analysis/alignment_fns
configs		configs
dl_utils		dl_utils
language		language
make_models		make_models
scripts		scripts
tests		tests
.gitignore		.gitignore
README.md		README.md
causal_models.py		causal_models.py
constants.py		constants.py
datas.py		datas.py
distr.py		distr.py
fca.py		fca.py
filters.py		filters.py
hooks.py		hooks.py
intrv_datas.py		intrv_datas.py
intrv_modules.py		intrv_modules.py
intrv_training.py		intrv_training.py
main.py		main.py
seq_models.py		seq_models.py
similarity.py		similarity.py
tasks.py		tasks.py
train.py		train.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Emergent Symbol-like Number Variables in Artificial Neural Networks

🔍 Overview

🚀 Installation / 📦 Dependencies

🧠 Key Features

🧪 Example Usage

🧑‍🔬 Citation

🙌 Contributing

📄 License

About

Uh oh!

Releases

Packages

Languages

grantsrb/mas

Folders and files

Latest commit

History

Repository files navigation

Emergent Symbol-like Number Variables in Artificial Neural Networks

🔍 Overview

🚀 Installation / 📦 Dependencies

🧠 Key Features

🧪 Example Usage

🧑‍🔬 Citation

🙌 Contributing

📄 License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages