Hi, may I ask how you decide the number of training samples per client? and what is the exact number used, for example, in arithmetric reasoning experiment using MetaMathQA?
I couldn't find numbers about local data size in the paper but I think they are important details in the federated setup.