Skip to content

Limitation of Streams in the past [General discussion] #23

@perellonieto

Description

@perellonieto

What are the implications of using Streams of data with a timestamp in the Future (or in the present moment)?

At this moment, if you ask for a Stream to be computed for the current time it will raise the following Exception

File "some_path/site-packages/hyperstream/stream/stream_instance.py", line 39, in __new__
    raise ValueError("Timestamp {} should not be in the future!".format(timestamp))
ValueError: Timestamp 2017-08-22 14:05:00.308326+00:00 should not be in the future!

I can think of two cases where it could be interesting to allow Streams in the future:

  1. Asking a classifier tool to train from now till 1 hour in the future.
    • It would be nice to tell to a classifier, given the data that is given from a specific source stream, keep training until the specified time.
    • E.g. Some real time from stock exchanges that gets to a real time stream and that keeps yielding data. The model could take this data every time that it is available and train.
  2. Some dataset where the timestamps are in the future.
    • I am not sure how plausible this scenario is.
    • But I imagine that if someone wants to use a stream that outputs data from some particular future time.
  3. Asking a tool for predictions in the future
    • A model that makes predictions for the future, given that it has been already trained with past data.

Metadata

Metadata

Assignees

No one assigned

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions