as mem_freq = 63, and seq_length 3535 can't divide 63, view[input_ids.shape[0], -1, mem_freq] would raise error?