
Infinite Recommendation Networks (∞-AE)

This repository is a fork of the original GitHub repository and modifies parts of the code. For details on the modifications, see REPRO.md.

This repository contains the implementation of ∞-AE from the paper "Infinite Recommendation Networks: A Data-Centric Approach" [arXiv], where we leverage the Neural Tangent Kernel (NTK) of an infinitely wide autoencoder for implicit-feedback recommendation. Notably, ∞-AE:

  • Is easy to implement (<50 lines of relevant code)
  • Has a closed-form solution (see the sketch after this list)
  • Has only a single hyper-parameter, $\lambda$
  • Despite its simplicity, outperforms complicated SoTA models on the datasets evaluated in the paper
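
Concretely, the closed-form solution is kernelized ridge regression against the NTK of the autoencoder. Below is a minimal sketch of that idea, assuming the neural-tangents library; the architecture, function names, and shapes are illustrative, not the repository's exact code:

# Minimal sketch of the ∞-AE closed form (illustrative, not the repo's code).
import jax.numpy as jnp
from neural_tangents import stax

# Autoencoder architecture; the hidden widths are placeholders, since the
# NTK below corresponds to the infinite-width limit of this network.
_, _, kernel_fn = stax.serial(stax.Dense(512), stax.Relu(), stax.Dense(512))

def fit_predict(X, lam):
    """Closed-form solve: X is the binarized user-item matrix, lam is λ."""
    K = kernel_fn(X, X, "ntk")   # (num_users, num_users) NTK Gram matrix
    alpha = jnp.linalg.solve(K + lam * jnp.eye(K.shape[0]), X)
    return K @ alpha             # reconstructed user-item scores

Training thus reduces to a single linear solve, which is why $\lambda$ is the only hyper-parameter to tune.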

The paper also proposes Distill-CF: using ∞-AE for data distillation to create terse, high-fidelity synthetic data summaries for model training. We provide Distill-CF's code in a separate GitHub repository.

If you find any module of this repository helpful for your own research, please consider citing the paper below. Thanks!

@inproceedings{inf_ae_distill_cf,
  title={Infinite Recommendation Networks: A Data-Centric Approach},
  author={Sachdeva, Noveen and Dhaliwal, Mehak Preet and Wu, Carole-Jean and McAuley, Julian},
  booktitle={Advances in Neural Information Processing Systems},
  series={NeurIPS '22},
  year={2022}
}

Code Author: Noveen Sachdeva (nosachde@ucsd.edu)


Setup

Environment Setup

pip install -r requirements.txt

Data Setup

This repository already includes the pre-processed data for the ML-1M, Amazon Magazine, and Douban datasets as described in the paper. The pre-processing code is in preprocess.py.
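
For orientation, implicit-feedback pre-processing usually amounts to binarizing explicit ratings and packing them into a sparse user-item matrix. The snippet below sketches that idea; it is illustrative only and does not mirror preprocess.py (the rating threshold of 4.0 is an assumption):

# Illustrative implicit-feedback pre-processing; preprocess.py may differ.
import numpy as np
from scipy.sparse import csr_matrix

def to_implicit_matrix(ratings, num_users, num_items, threshold=4.0):
    """Binarize (user, item, rating) triples into a sparse interaction matrix."""
    kept = [(u, i) for u, i, r in ratings if r >= threshold]  # keep positives only
    rows = np.array([u for u, _ in kept])
    cols = np.array([i for _, i in kept])
    data = np.ones(len(kept), dtype=np.float32)
    return csr_matrix((data, (rows, cols)), shape=(num_users, num_items))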


How to train ∞-AE?

  • Edit the hyper_params.py file, which lists all of ∞-AE's configuration parameters (an illustrative sketch of the config follows below).
  • Then run the following command to train and evaluate ∞-AE:
CUDA_VISIBLE_DEVICES=0 python main.py
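
As an illustration, the config might look like the following; the key names here are assumptions, so check hyper_params.py for the actual fields:

# Hypothetical hyper_params.py contents; the real file's keys may differ.
hyper_params = {
    "dataset": "ml-1m",  # one of the bundled datasets: ml-1m, magazine, douban
    "lamda": 1.0,        # regularization strength λ, ∞-AE's single hyper-parameter
    "seed": 42,          # random seed (assumed field)
}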

Results sneak peek

Below are the nDCG@10 results for the datasets used in the paper:

Dataset          PopRec   MF      NeuMF   MVAE    LightGCN    EASE    ∞-AE
Amazon Magazine  8.42     13.1    13.6    12.18   22.57       22.84   23.06
MovieLens-1M     13.84    25.65   24.44   22.14   28.85       29.88   32.82
Douban           11.63    13.21   13.33   16.17   16.68       19.48   24.94
Netflix          12.34    12.04   11.48   20.85   Timed out   26.83   30.59*

Note: ∞-AE's results on the Netflix dataset (marked with a *) are obtained by training on only 5% of the users; all other methods are trained on the full dataset.
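
For context, such a subsample can be drawn by sampling user rows uniformly at random; the snippet below illustrates the idea and is not the repository's exact sampling code:

# Illustrative 5% user subsample (assumed approach, not the repo's code).
import numpy as np

def subsample_users(interactions, frac=0.05, seed=0):
    """Keep a random `frac` of the user rows of a user-item matrix."""
    rng = np.random.default_rng(seed)
    num_users = interactions.shape[0]
    keep = rng.choice(num_users, size=int(frac * num_users), replace=False)
    return interactions[keep]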


MIT License
