Johannes Ackermann (JohannesAck)


PhD student at the University of Tokyo working on Reinforcement Learning and broader Machine Learning

50 followers · 24 following

Pinned

OffPolicyCorrectedRewardModeling Public

Implementation for our COLM paper "Off-Policy Corrected Reward Modeling for RLHF"

Python · 7 stars

pfnet-research/multi-stage-blended-diffusion Public

Python · 30 stars · 5 forks

OfflineRLStructuredNonstationarity Public

Implementation for RLC paper "Offline Reinforcement Learning from Datasets with Structured Non-Stationarity".

Python · 8 stars

tf2multiagentrl Public

Clean implementation of Multi-Agent Reinforcement Learning methods (MADDPG, MATD3, MASAC, MAD4PG) in TensorFlow 2.x

Python · 166 stars · 34 forks