feat: Validation Split & MLflow Tracking #62
Conversation
Manual Action Required! This pull request is not yet linked to an issue. Please update the description to include 'Closes: #issue_number' in the appropriate section. This is required to pass validation and initiate metadata synchronization.

Thank you for opening this PR! Our automated system is currently verifying the PR requirements.

Validation Successful! This pull request has been verified and linked to issue #22. The system is now synchronizing metadata from the referenced issue. Kindly await maintainer review of your changes.
Hey, CI is failing on `test_model_save_load_preserves_preprocessing`.
Thanks @Satyamgupta2365 for the review. Will keep this in mind; the work is currently in progress.
Hi @Satyamgupta2365, I've finished the PR from my side and it is ready for review. Thanks.
@SK8-infi Thank you for the effort! While validation tracking is a great addition, this PR introduces critical architectural issues that must be addressed:
- Optimizer State Reset: Moving the training loop to Python causes a new Adam/SGD instance to be created every epoch. This resets Adam's moment estimates and time-step to zero, breaking its adaptive logic.
- Performance Overhead: Calling the Rust backend once per epoch significantly increases FFI marshaling and context-switching overhead.
- Regressions: The code lacks the `batch_size` parameter recently merged in PR #63, which will cause build and runtime failures.
Requested Changes:
- Revert the Python Loop: Move the validation logic into the Rust `train` method so the optimizer remains persistent.
- Sync with Main: Update signatures to include the mandatory `batch_size` parameter.
- Preserve `forward()`: Keep this new method in `lib.rs` as it is a valuable utility.
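The optimizer-state concern above can be made concrete with a small sketch. This is not the project's code: it uses a toy single-parameter Adam (illustrative only) to show that a freshly constructed instance restarts its moment estimates `m`, `v` and time step `t` at zero, which is exactly what happens if the per-epoch Python loop re-creates the optimizer each epoch.

```python
# Minimal toy Adam to illustrate why re-creating the optimizer each
# epoch breaks its adaptive logic. Illustrative only; the real project
# keeps the optimizer inside the Rust `train` method.

class Adam:
    def __init__(self, lr=0.001, beta1=0.9, beta2=0.999, eps=1e-8):
        self.lr, self.beta1, self.beta2, self.eps = lr, beta1, beta2, eps
        self.m = 0.0  # first-moment (mean of gradients) estimate
        self.v = 0.0  # second-moment (uncentered variance) estimate
        self.t = 0    # time step used for bias correction

    def step(self, param, grad):
        self.t += 1
        self.m = self.beta1 * self.m + (1 - self.beta1) * grad
        self.v = self.beta2 * self.v + (1 - self.beta2) * grad ** 2
        m_hat = self.m / (1 - self.beta1 ** self.t)
        v_hat = self.v / (1 - self.beta2 ** self.t)
        return param - self.lr * m_hat / (v_hat ** 0.5 + self.eps)

# Persistent optimizer: state accumulates across epochs.
opt = Adam()
p = 1.0
for _epoch in range(3):
    p = opt.step(p, grad=0.5)
assert opt.t == 3  # time step advanced across epochs

# Anti-pattern: a new instance per epoch resets m, v, and t every time.
p = 1.0
for _epoch in range(3):
    opt = Adam()               # fresh instance -> state wiped
    p = opt.step(p, grad=0.5)
assert opt.t == 1  # every epoch looks like the very first step
```

With the per-epoch construction, bias correction and the moment estimates never warm up, so the update rule degenerates toward a fixed-size first step on every epoch.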
@SK8-infi Please update us on the status. You have 24 hours to address the issues or we’ll close this PR and reassign the task to keep development on track.
… into feat/val-split-track
Had to make some changes to the tests, as the new return signature is `(train_losses, val_losses)`.
@SK8-infi Build tests are failing.
Seems to be an issue that happened while resolving conflicts. Looking into it.
@SK8-infi There are currently no merge conflicts. However, all build tests must pass prior to review. Please address the issues.
Okay. I'll ensure it's completed by EOD.
@SK8-infi, any updates on the PR?
Hi
Aamod007 left a comment:
@SK8-infi Critical Fixes (MUST DO):
- Fix parameter order in Python API call
- Fix hidden layer type mismatch
- Add integration tests with real Rust extension
Recommended Improvements (SHOULD DO):
- Add data shuffling before split
- Add random seed parameter for reproducibility
- Add stratification option for classification
Future Enhancements (NICE TO HAVE):
- K-fold cross-validation support
- Early stopping based on validation loss
- Validation metrics beyond loss
- Learning rate scheduling
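The shuffling, seeding, and stratification suggestions above could look something like the sketch below. This is a hedged illustration, not this repo's API: the function name `train_val_split` and its parameters are hypothetical, and a real implementation would operate on the project's actual data structures.

```python
import random

def train_val_split(X, y, val_fraction=0.2, seed=None, stratify=False):
    """Shuffled train/validation index split (illustrative sketch).

    seed     -> makes the split reproducible
    stratify -> keeps each class's proportion roughly equal in both splits
    """
    rng = random.Random(seed)
    if stratify:
        # Group sample indices by label, then take a fraction of each group
        # so every class is represented in the validation set.
        by_label = {}
        for i, label in enumerate(y):
            by_label.setdefault(label, []).append(i)
        val_idx = []
        for idxs in by_label.values():
            rng.shuffle(idxs)
            val_idx.extend(idxs[: max(1, int(len(idxs) * val_fraction))])
    else:
        idxs = list(range(len(X)))
        rng.shuffle(idxs)
        val_idx = idxs[: int(len(X) * val_fraction)]
    val_set = set(val_idx)
    train_idx = [i for i in range(len(X)) if i not in val_set]
    return train_idx, sorted(val_idx)
```

Passing a fixed `seed` makes CI comparisons deterministic, which also addresses the reproducibility point above.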
@SK8-infi, please update us on the status. You have 24 hours to address the issues or we’ll close this PR and reassign the task to keep development on track.
Hi @debug-soham
Fixes
Closes: #22
Type of Change
Description
Implemented validation split and MLflow tracking for monitoring model generalization performance.
Changes:
- `validation_split: float = 0.2` parameter added to the `Model.train()` method
- `_calculate_validation_loss()` method supporting both classification (cross-entropy) and regression (MSE)
- `forward()` method to expose raw model outputs for validation loss calculation
- `save_model()` updated to log both `loss` and `val_loss` metrics to MLflow with epoch steps

Result: The MLflow dashboard now displays overlapping train/validation loss curves for monitoring overfitting and generalization.
How Has This Been Tested?
- `test_validation_split.py` covering classification, regression, and edge cases
- `forward()` method works correctly with the Python API

Screenshots / Logs
Contribution Context