PyTorch module #2

bsircelj · 2021-09-27T12:57:19Z

Two major updates.

Selecting "torch" as a model will build predefined wrapper class for PyTorch linear ANN model which behaves as other sklearn models with fit and predict functions.
Time offset between now includes other possibilities than hours.

klemenkenda

Great pull request making the code much more consistent. Some minor changes are requested.

README.md

klemenkenda · 2021-10-06T08:07:24Z

README.md

+| **retrain_period**| integer | None | A number of received samples after which the model will be re-trained. This is an optional parameter. If it is not specified no re-training will be done.|
+| **samples_for_retrain**| integer | None | A number of samples that will be used for re-training. If retrain_period is not specified this parameter will be ignored. This is an optional parameter. If it is not specified (and retrain_period is) the re-train will be done on all samples received since the component was started.|
+| **time_offset**| string | H | [String alias](https://pandas.pydata.org/pandas-docs/stable/user_guide/timeseries.html#offset-aliases) to define the data time offsets. The aliases used in training and topic names are lowercase for backwards compatibility.|
+| **learning_rate**| float| 4E-5 | Learning rate for the torch model.|


Separate PyTorch specific configuration parameters.

klemenkenda · 2021-10-06T08:08:28Z

README.md

+| **evaluation_split_point**| float | 0.8 | Define training and testing splitting point in the dataset, for model evaluation during learning phase (fit takes twice as long time).|
+| **retrain_period**| integer | None | A number of received samples after which the model will be re-trained. This is an optional parameter. If it is not specified no re-training will be done.|
+| **samples_for_retrain**| integer | None | A number of samples that will be used for re-training. If retrain_period is not specified this parameter will be ignored. This is an optional parameter. If it is not specified (and retrain_period is) the re-train will be done on all samples received since the component was started.|
+| **time_offset**| string | H | [String alias](https://pandas.pydata.org/pandas-docs/stable/user_guide/timeseries.html#offset-aliases) to define the data time offsets. The aliases used in training and topic names are lowercase for backwards compatibility.|


This is not very clear to me since it is new, does it affect ScikitLearn models?

Is this just the unit for prediction_horizons? If so - move these descriptions closer together.

Yes. It is as you said a unit for prediction_horizons. I will also write about time_offset in prediction_horizons and move them together.

The sk-learn models can now also use different time offsets but if nothing is specified it defaults to hours for backwards compatibility.

klemenkenda · 2021-10-06T08:13:20Z

README.md

 ## Requirements

-* Python 3.6+
+* Python 3.9+


3.9 is the version which I have installed and the version on which the code was tested. The sk-learn code should work on both versions as I didn't change that but I can't guarantee for any of new pip packages used. Do I find compatible versions of packages for 3.6 python and update this also along with requirements.txt?

klemenkenda · 2021-10-06T08:16:47Z

src/lib/predictive_model.py


-    def __init__(self, algorithm, sensor, prediction_horizon, evaluation_periode, error_metrics, split_point,
-                 retrain_period = None, samples_for_retrain = None, retrain_file_location = None):
+    def rmse(self, true, pred):


There is a file called regression_metrics.py. We should include this in there?

src/lib/predictive_model.py

klemenkenda · 2021-10-06T08:22:33Z

src/lib/predictive_model.py

-        #print "Loaded model from", filename
+        # print "Loaded model from", filename
+
+    class TorchNetwork:


Should we put this in a separate file?

klemenkenda · 2021-10-06T08:25:02Z

src/main.py

-    port= 3001
-    path= "/ping?id=5&secret=b9347c25aba4d3ba6e8f61d05fd1c011"
+    port = 3001
+    path = "/ping?id=5&secret=b9347c25aba4d3ba6e8f61d05fd1c011"


Can we put secret in some config file?

I don't know the purpose of the watchdog in this module but I moved all the hardcoded parameters to the config file.

klemenkenda · 2021-10-06T08:28:40Z

src/main.py

-                timestamp = rec['timestamp']
-                ftr_vector = rec['ftr_vector']
-                measurement = ftr_vector[0] # first feature is the target measurement
+            # try:


try-catch was there for a reason ... probably because of potential crash due to misformatted messages. Please add it back.

Yeah, that was an oversight. I wanted full error stack while debugging so I just put the try block in comments and forgot to uncomment after will fix it.

bsircelj added 2 commits September 23, 2021 16:27

Torch ann first version

03c378e

Updated model saving

e81edc9

aljazkosmerlj requested a review from klemenkenda September 28, 2021 08:25

klemenkenda requested changes Oct 6, 2021

View reviewed changes

gal9 approved these changes Oct 6, 2021

View reviewed changes

bsircelj added 3 commits October 12, 2021 12:06

GMM imputer layer added.

b3a2f9e

Update README.md

47f712d

Small import fix

2360a5b

PyTorch module #2

Are you sure you want to change the base?

PyTorch module #2

Uh oh!

Conversation

bsircelj commented Sep 27, 2021

Uh oh!

klemenkenda left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants