Implementation of the Multi-Armed Bandit problem where each arm returns a continuous numerical reward. Covers Epsilon-Greedy, UCB1, and Thompson Sampling with detailed explanations.
This repository explores the Multi-Armed Bandit problem with continuous numerical rewards instead of the traditional Bernoulli rewards (which take only the values 0 or 1). It provides a comprehensive overview of the fundamental concepts, alongside practical implementations of three popular algorithms:
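For a concrete picture of the setting, here is a minimal sketch of a continuous-reward environment. The class name `GaussianBandit` and the choice of Gaussian reward noise are illustrative assumptions for this README, not necessarily how the notebook models rewards:

```python
import numpy as np

class GaussianBandit:
    """K-armed bandit where each arm returns a continuous (Gaussian) reward."""

    def __init__(self, means, stds, seed=None):
        self.means = np.asarray(means, dtype=float)  # true mean reward per arm
        self.stds = np.asarray(stds, dtype=float)    # reward noise per arm
        self.rng = np.random.default_rng(seed)

    @property
    def n_arms(self):
        return len(self.means)

    def pull(self, arm):
        # The reward is a real number, not a 0/1 Bernoulli outcome.
        return self.rng.normal(self.means[arm], self.stds[arm])
```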
✅ Epsilon-Greedy – Balances exploration and exploitation by playing a random arm with a small probability ε and the best-known arm otherwise.
✅ UCB1 (Upper Confidence Bound) – Optimizes decision-making by adding a confidence bonus to each arm's estimated mean reward and playing the most optimistic arm.
✅ Thompson Sampling – A Bayesian approach that samples from each arm's posterior reward distribution and plays the arm with the best sample.
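As a rough illustration of how the three strategies differ, the sketch below implements simple versions of each for continuous rewards, reusing the `GaussianBandit` sketch above. The parameter names (`epsilon`, `c`, `prior_var`, `noise_var`) and the Gaussian posterior used for Thompson Sampling are assumptions for this example; the notebook's exact implementations may differ:

```python
import numpy as np

def run(bandit, select_arm, horizon=1000, seed=0):
    """Generic loop: play `horizon` rounds, tracking per-arm counts and mean rewards."""
    rng = np.random.default_rng(seed)
    k = bandit.n_arms
    counts = np.zeros(k)   # pulls per arm
    means = np.zeros(k)    # running average reward per arm
    total = 0.0
    for t in range(1, horizon + 1):
        arm = select_arm(t, counts, means, rng)
        r = bandit.pull(arm)
        counts[arm] += 1
        means[arm] += (r - means[arm]) / counts[arm]  # incremental mean update
        total += r
    return total

def epsilon_greedy(epsilon=0.1):
    # Explore a random arm with probability epsilon, otherwise exploit the best mean.
    def select(t, counts, means, rng):
        if rng.random() < epsilon:
            return int(rng.integers(len(means)))
        return int(np.argmax(means))
    return select

def ucb1(c=2.0):
    # Play each arm once, then pick the arm maximizing mean + confidence bonus.
    def select(t, counts, means, rng):
        if np.any(counts == 0):
            return int(np.argmin(counts))
        bonus = np.sqrt(c * np.log(t) / counts)
        return int(np.argmax(means + bonus))
    return select

def thompson_gaussian(prior_var=1.0, noise_var=1.0):
    # Sample a plausible mean for each arm from its Gaussian posterior and play the best sample.
    def select(t, counts, means, rng):
        post_var = 1.0 / (1.0 / prior_var + counts / noise_var)
        post_mean = post_var * (counts / noise_var) * means
        return int(np.argmax(rng.normal(post_mean, np.sqrt(post_var))))
    return select
```

For example, `run(GaussianBandit([0.2, 0.5, 0.9], [1.0, 1.0, 1.0]), ucb1())` plays 1,000 rounds of UCB1 against three Gaussian arms and returns the total reward collected; swapping in `epsilon_greedy()` or `thompson_gaussian()` compares the strategies on the same environment.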
The notebook includes detailed explanations, code implementations, and visualizations to help you understand how these algorithms work in real-world scenarios.
📌 Ideal for: Data scientists, AI researchers, and anyone interested in reinforcement learning.
Feel free to explore, experiment, and contribute! 🚀