This is the PyTorch implementation of our ICASSP 2026 paper "AROMMA: Unifying Olfactory Embeddings for Single Molecules and Mixtures". AROMMA (Aggregated Representations of Olfaction via Molecule and Mixture Alignment) is a novel framework that learns a unified embedding space for both single molecules and two-molecule mixtures by leveraging a chemical foundation model (SPMM).
To address label sparsity in the mixture dataset (BP), AROMMA employs a training strategy (sketched below) that combines:
- Knowledge distillation from a molecule-level teacher model (POM), and
- Class-distribution-aware pseudo-labeling.
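The exact objective is defined in the training code; the snippet below is only a minimal sketch of how a distillation term and a masked pseudo-label term can be combined for multi-label odor descriptors. The function name, tensor shapes, and the weighting factor `lam` are illustrative assumptions, not the paper's formulation.

```python
# Illustrative sketch: combine teacher distillation with masked pseudo-label supervision.
import torch
import torch.nn.functional as F

def training_loss(student_logits, teacher_probs, pseudo_labels, pseudo_mask, lam=1.0):
    """student_logits: (B, C) logits from the unified encoder
    teacher_probs:  (B, C) soft targets from the molecule-level teacher (POM)
    pseudo_labels:  (B, C) binary pseudo-labels for sparsely labeled mixtures
    pseudo_mask:    (B, C) 1 where a pseudo-label is trusted, 0 elsewhere
    """
    # Distillation: match the teacher's per-descriptor probabilities.
    distill = F.binary_cross_entropy_with_logits(student_logits, teacher_probs)

    # Pseudo-label term: only confident (masked-in) entries contribute.
    per_entry = F.binary_cross_entropy_with_logits(
        student_logits, pseudo_labels, reduction="none"
    )
    pseudo = (per_entry * pseudo_mask).sum() / pseudo_mask.sum().clamp(min=1)

    return distill + lam * pseudo
```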
Create and activate the conda environment:
```bash
conda env create -f environment.yml
conda activate aromma_env
```
The trained model checkpoints are available on Hugging Face:
| Data | Model | Checkpoints |
|---|---|---|
| data/mixture | AROMMA | aromma_best_fold.pt |
| data/mixture_p78 | AROMMA-P78 | aromma_p78_best_fold.pt |
| data/mixture_p152 | AROMMA-P152 | aromma_p152_best_fold.pt |
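A checkpoint can be loaded with standard PyTorch utilities. The snippet below is a rough sketch: the Hugging Face repo id is a placeholder, and the model must be constructed as in this repository's training code before loading the state dict.

```python
# Illustrative only: repo id and checkpoint layout are assumptions.
import torch
from huggingface_hub import hf_hub_download

ckpt_path = hf_hub_download(
    repo_id="<hf-username>/AROMMA",   # placeholder repo id
    filename="aromma_best_fold.pt",
)
state_dict = torch.load(ckpt_path, map_location="cpu")

# model = build_aromma_model(...)     # construct the model as in train.py
# model.load_state_dict(state_dict)
# model.eval()
```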
Before training, download the pre-trained checkpoints following `models/pom/README.md` and `models/spmm/README.md`.
The directory structure should be:
```
models
├── pom
│   ├── gnn_embedder.pt
│   └── nn_predictor.pt
└── spmm
    ├── checkpoint_SPMM.ckpt
    ├── config_bert.json
    └── vocab_bpe_300.txt
```
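As an optional sanity check (not part of the repository), the snippet below verifies that the pre-trained files listed above are in place before training:

```python
# Check that the required pre-trained teacher / foundation-model files exist.
from pathlib import Path

required = [
    "models/pom/gnn_embedder.pt",
    "models/pom/nn_predictor.pt",
    "models/spmm/checkpoint_SPMM.ckpt",
    "models/spmm/config_bert.json",
    "models/spmm/vocab_bpe_300.txt",
]

missing = [p for p in required if not Path(p).exists()]
if missing:
    raise FileNotFoundError(f"Missing pre-trained files: {missing}")
print("All pre-trained checkpoints found.")
```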
Training Procedure:
1. Train the base model:
```bash
python train.py --phase aromma
```
2. Generate pseudo-labels by running `pseudo_labeling.ipynb`.
3. Train on the pseudo-labeled data:
```bash
python train.py --phase aromma_p78
```
or
```bash
python train.py --phase aromma_p152
```
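The class-distribution-aware pseudo-labeling itself is implemented in `pseudo_labeling.ipynb`. The snippet below is only a minimal sketch of the general idea, assuming per-class thresholds are chosen so that the pseudo-label rate roughly matches the positive rate observed in the labeled data; the function and variable names are illustrative, not the notebook's actual procedure.

```python
# Sketch of class-distribution-aware pseudo-labeling (illustrative assumption).
import torch

def pseudo_label(probs, labeled_pos_rate):
    """probs: (N, C) predicted probabilities on unlabeled mixtures.
    labeled_pos_rate: (C,) fraction of positives per class in the labeled set.
    """
    N, C = probs.shape
    pseudo = torch.zeros_like(probs)
    for c in range(C):
        # Keep roughly the same positive rate per class as in the labeled data.
        k = max(int(labeled_pos_rate[c] * N), 1)
        topk = probs[:, c].topk(k).indices
        pseudo[topk, c] = 1.0
    return pseudo
```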