Open
Description
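When aggregating with the "ours" method, the residual is not SVD-decomposed. In that case, isn't the amount of data transmitted the same as transmitting the full parameters? The aggregation code you provided is as follows: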
residue = M - lora_B_avg @ lora_A_avg
global_dict[name + ".lora_A.default.weight"] = lora_A_avg
global_dict[name + ".lora_B.default.weight"] = lora_B_avg
global_dict[name + ".base_layer.weight"] += torch.transpose(residue * scaling_factor, 1, 0)
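To make the concern concrete, here is a rough size comparison for a single layer. It is only a sketch: the shapes (`d_out`, `d_in`, `rank`) are hypothetical values I picked for illustration, and the helper names are mine, not from the repository. It also assumes that `residue` (or the updated `base_layer.weight`) has to be sent to the clients, which the snippet above does not confirm.

```python
def lora_factor_count(d_out: int, d_in: int, rank: int) -> int:
    """Scalars in lora_A (rank x d_in) plus lora_B (d_out x rank)."""
    return rank * d_in + d_out * rank


def dense_count(d_out: int, d_in: int) -> int:
    """Scalars in a dense d_out x d_in matrix, e.g. the residue or the base weight."""
    return d_out * d_in


# Hypothetical layer shape and LoRA rank, chosen only for illustration.
d_out, d_in, rank = 4096, 4096, 16

factors = lora_factor_count(d_out, d_in, rank)  # low-rank payload
residue = dense_count(d_out, d_in)              # residue has the same shape as M
full_weight = dense_count(d_out, d_in)          # full-parameter payload

print(f"LoRA factors: {factors}")
print(f"Dense residue: {residue}")
print(f"Residue / full weight: {residue / full_weight:.1f}")
print(f"Residue / LoRA factors: {residue / factors:.1f}")
```

Under these assumptions the residue alone is as large as the full weight matrix, which is the basis of my question. If the residue is only applied on the server and never transmitted, the comparison does not apply.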