Skip to content

[FEATURE] Featrue Generation Pipeline #5

@GongJr0

Description

@GongJr0

Feature Details

A pipeline module to handle the generation of numeric features, process embeddings, and format outputs into a PyTorch legible format.

Affected Modules

Exposed API regarding feature generation will be implemented in the FeatureGen class. Current modules will not recieve exposed updates.

Implementation Checklist

  • Lag formatting for $AR(n)$ features
  • Categorical embeddings for ticker, period, sector, etc. information
  • Concatenation layer for per-ticker data
  • Stateful feature storage and iterative update logic for incremental learning
  • Validation utilities for shape/mask consistency
  • Collate function for batching variable-length lag sets

Limitations

  • The implementation will focus on pre-planned features and abstractions will only be made to facilitate efficient development of the current architecture.
  • The pipeline will offer limited human-readable inspection/validation points. Format will strictly focus on torch readability.

Sub-issues

Metadata

Metadata

Assignees

Labels

featureImplementation tracking for approved features

Projects

Status

In progress

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions