-
Notifications
You must be signed in to change notification settings - Fork 11
Open
Labels
Description
Some ideas:
- We do not need to store both
YandYorigin the flash data object. - We should probably store
tauas a vector whenvar_type = by_columnorby_row. This could be tricky, but it's probably worth it since flash fit objects are frequently copied. - It shouldn't be too difficult to allow
Yto be adgCMatrix, and likewise forS.
If we do the above, then the only large dense matrices will be the matrices of residuals and squared residuals. (Or rather, R2, Rk, and R2k for the greedy step.) So, optimistically, we might be able to shoot for a memory requirement of 5x the size of the original data (measured as a dense matrix) when Y is sparse and 6-8x otherwise.